Introduction

Temperature and precipitation are the most widely used climatic parameter that unveils the impact of climate change over a region. Alteration in local water availability for irrigation purposes, occurrences of extreme events like droughts and floods, change in temperature patterns and severe heat wave occurrences are some of the common climate change impacts on society1. To tackle the above problems and have better infrastructure planning for the future, it is important to predict the impacts of climate change in terms of temperature and/ or precipitation. Global Climate Models (GCMs) are used for projecting future climatic data that can be used for hydro-climatological studies. Several studies worldwide consider climatic variables like maximum and minimum temperature, precipitation, surface mean temperature, and sea surface temperature for simulating GCMs in combination with the observed data2,3,4,5,6,7,8,9,10,11.

The factors like complex topography of a region, monsoon dynamics with its onset, strength, and duration are influenced by the atmosphere, land, and ocean's complex interaction12,13. The natural climate variabilities like El Nino-Southern Oscillation (ENSO) and Indian Ocean Dipole (IOD) often biases GCMs’ maximum and minimum temperatures in different seasons14,15. The correlation between surface temperature and precipitation involving 17 CMIP5 GCMs, observed that models performed better in the cold season than in the warm season, and better over the land than over the oceans of the Indian subcontinent16. Low-frequency air eddies may alter global and regional climate over decadal periods17.

There are various other uncertainties associated with the GCM simulations such as inappropriate parameterization of aerosols, initial and boundary conditions, greenhouse gas emission, systematic model errors, and socio-economic factors making it challenging to use at local and regional scales. The additional uncertainties like random internal climate variability, scenarios of indefinite change due to anthropogenic activities, and physical responses by model equation arising during downscaling to regional scale2,18. Also, sometimes a limited number of GCMs are used for climate impact studies due to time and resource constraints. Hence, a detailed evaluation is important for selecting a suitable GCM or ensemble of GCMs (from a pool of available GCMs) before employing them for climate change impact studies.

CMIP6 models are upgraded versions of the models that took part in prior phases of CMIP, in terms of increased spatial resolution, physical parameterizations (such as cloud microphysics), better carbon–nitrogen cycle parameterizations, and better aerosol representation19. Nonetheless, there is still a need to analyze the accuracy and dependability of GCMs for modeling historical data in order to determine how well they simulate previous climatic conditions and to make efforts to lessen the uncertainty associated with future climate projections. Ranking of GCMs and suggesting the best ensemble GCMs for certain areas are one of the ways to reduce the uncertainty related to the selection of GCMs. Several studies were conducted for climatic variables including maximum and minimum temperatures, average temperatures, and precipitation. Some of the studies include, a relative ranking of 36 CMIP6 GCMs across Telangana state2, 36 CMIP5 GCMs over India11, 11 CMIP6 based HighRes MIP over India5, and 26 CMIP6 GCMs over the Krishna river basin, India3. For the evaluation of GCMs’ ability to simulate historical observations, several simple, effective, and meaningful performance indicators are used2,10,20. Some of the performance indicators considered for ranking GCMs are correlation coefficient (CC), normalized root mean square error (NRMSE), and skill score (SS)11. Different weights were assigned by using entropy method and the ranking of GCMS were carried out by compromise programming method11.

Skill score-based indicator is based on the overlapping of the probability density functions (PDFs) between GCMs simulated and observed data for precipitation, maximum and minimum temperature9. Several studies were carried out for different climate variables using skill score indicator2,6,7,9,10. In addition, other performance indicators used were Nash Sutcliffe efficiency2,4, normalized root mean square error/deviation, absolute mean bias error/deviation, correlation coefficient3,5,10,11. Another commonly used and an important method of ranking of GCMs is the multicriteria decision-making (MCDM). It is a dynamic method for multicriteria ranking of alternatives by selecting the best one from several alternatives. In most of the MCDM methods, quantitative weights are assigned for different indicators to assess the relative importance of different indicators21,22,23,24,25. Different MCDM methods result in different outcomes in ranking GCMs26 and hence, the compromise solution approach is crucial in the MCDM method. The basis of the compromise solution is a concept of feasible solution which is closest to the ideal one27. VlseKriterijuska Optimizacija I Komoromisno Resenje (VIKOR)28 is a compromise solution method adopted by various researchers in material selection29,30, performance evaluation24,31,32,33, sustainable and renewable energy34,35,36,37 and water resources planning38,39.

From the literatures, it is evident that there are only a few studies exploring the utility of MCDM for ranking of GCMs. Also, it was observed that the ranking of GCMs related to the latest version of coupled model inter comparison project i.e., CMIP6, are limited for India. To the best of authors’ knowledge, no study of ranking CMIP6 GCMs using compromise solution method namely, VIKOR has been done yet. Hence in this study, the ranking of GCMs from CMIP6 for projected changes in temperature over India was carried out with 30 GCMs using VIKOR method. Both the maximum and minimum temperatures were used to perform critical assessment of GCMs from CMIP6. Understanding these temperatures can help to predict the climate extremes, weather patterns and make informed decisions. Maximum temperature is important for studies related to extreme weather events such as urban heat islands (UHIs) and hot spell40. At the same time, the minimum temperature is needed for determining nocturnal UHI and agriculture related studies41,42. Five performance indicators, namely absolute normalized mean bias error (ANMBE), correlation coefficient (CC), normalized root mean square error (NRMSE), Nash–Sutcliffe efficiency (NSE), and Perkins’s skill score (PSS), are considered in this study. The entropy method was adopted to assign weights to various indicators, and best-ranked GCMs were computed using VIKOR method.

Study area and data collection

The Indian subcontinent lying in the northern hemisphere, with longitude and latitude ranging from 67.5° to 97.5° E and 7.5° to 37.5° N, respectively covering 335 numbers of one-degree spatial resolution grids, was considered as the study area. The model-simulated data from World Climate Research Programme (WCRP) was utilized to acquire GCMs under CMIP6 (https://esgf-node.llnl.gov/search/cmip6/) as a part of IPCC’s sixth assessment report1. Outputs from 30 GCMs for maximum and minimum temperature (designated as TMAX and TMIN) with daily temporal resolution were used as historical simulated data. The details of 30 GCMs under CMIP6 used in this study are tabulated in Supplementary Table S1. The gridded daily TMAX and TMIN data (https://www.imdpune.gov.in/lrfindex.php) for 50 years (1965–2014) at 1˚ spatial resolution are collected from the Indian Meteorological Department (IMD). These data are used as the historical observed data to evaluate the performance of climate models. The base period 1965–2014 was selected considering CMIP6 historical simulation data sets' availability until 2014. Also, CMIP6 is an updated version of CMIP5 to produce relatively higher resolution data with an increased number of distinct climate models and eight future scenarios representing shared-socioeconomic pathways. All the gridded GCMs data available at different spatial resolutions were brought down to a common grid resolution of 1° × 1° using bilinear interpolation techniques3,43,44.

It was observed from the previous studies that model selection should be done rationally for climate change impact studies4,10,11,20,45. Each CMIP6 model differs from each other, and within each model, different ensemble members result in different GCM outputs. Considering these facts, 30 GCMs were selected for the analysis such that all the models belong to the same modeling structure (i.e., Atmospheric General Circulation Model) and with the same ensemble realizations r1i1p1f1 (indicating realization index, initialization index, physics index, and forcing index as 1).

Methodology

Selection and evaluation of performance indicators

Performance indicators are statistical metrics used for testing the capability of GCMs in simulating the observed data (IMD gridded daily temperature data). No universally agreed criteria exists for selecting performance indicators for GCM assessment20. Hence, performance indicators are chosen from different categories and distributed into two scenarios. Scenario 1 (S1) has absolute normalized mean bias error (ANMBE), correlation coefficient (CC), normalized root mean square error (NRMSE), and Perkins’s skill score (PSS), whereas scenario 2 (S2) has an additional indicator Nash–Sutcliffe efficiency (NSE) apart from S1. These indicators were selected to ensure that at least one belongs to each category of error, correlation, and skill score. Mathematical representation of above discussed indicators is shown in supplementary equations S1 to S5.

Normalization and weight computation of performance indicators

Normalization methods were adopted to measure various non-proportional performance indicators on a common scale21,22,31,46,47,48. In this study, the normalization is carried out using the Max–Min method to obtain the decision matrix. After normalization, the equity contribution for each indicator is calculated using the Sum method. Using the contributing values, the weights are computed using the Entropy method. The lower entropy value of the indicator corresponds to its more valuable information, i.e., larger entropy-based weight. Finally, the weighted normalized decision matrix is calculated which will subsequently be used as an input in MCDM method. Mathematical representation of above discussed normalization steps is shown in supplementary equations S6 to S10.

Multicriteria Decision-Making using VIKOR

VIKOR (VlseKriterijuska Optimizacija I Komoromisno Resenje), primarily developed by Opricovnic in 1979, is a well-known multicriteria decision-making (MCDM) method49. VIKOR, a compromise ranking method, yields the feasible solution nearest to the ideal, hence helping the decision makers to conclude final solutions39,47. The methodology for ranking GCMs to obtain compromise solution using this method is described in Fig. 1. The computation of utility measure (Si), regret measure (Ri) and the index values (Qi) are carried out using the following equations:

$$S_{i} = \mathop \sum \limits_{j = 1}^{J} \frac{{\left( {P_{ij} } \right)_{max} - P_{ij} }}{{\left( {P_{ij} } \right)_{max} - \left( {P_{ij} } \right)_{min} }}$$
(1)
$$R_{i} = \mathop {max }\limits_{j} \left\{ {\frac{{\left( {P_{ij} } \right)_{max} - P_{ij} }}{{\left( {P_{ij} } \right)_{max} - \left( {P_{ij} } \right)_{min} }}} \right\}$$
(2)
$$Q_{i} = \vartheta \left\{ {\frac{{S_{i} - \mathop {min}\limits_{i} \left( {S_{i} } \right)}}{{\mathop {max}\limits_{i} \left( {S_{i} } \right) - \mathop {min}\limits_{i} \left( {S_{i} } \right)}}} \right\} + \left( {1 - \vartheta } \right)\left\{ {\frac{{R_{i} - \mathop {min}\limits_{i} \left( {R_{i} } \right)}}{{\mathop {max}\limits_{i} \left( {R_{i} } \right) - \mathop {min}\limits_{i} \left( {R_{i} } \right)}}} \right\}$$
(3)

Here, \(\vartheta\) is a balancing factor between the utility measure (overall benefit) and the regret measure (maximum individual deviation). The value of \(\vartheta\) ranges between 0 and 1, with “Voting by majority rule” (\(\vartheta\) > 0.5) or “by consensus” (for \(\vartheta\) = 0.5) or “with a veto” (for \(\vartheta\) < 0.5)29,30,47,48.

Figure 1
figure 1

Methodology for ranking GCMs to obtain compromise solution using VIKOR method.

Group decision-making method

The study area consists of 335 grids, each with a distinctive rank. In order to create a combined rank for the study area, a group decision-making process25 was adopted. The steps involved in this method are as follows:

At each grid, rankings were first separated into two halves and organized in descending order. GCMs with ranking 1 to X make up the first half of the sample (X = I/2, where I is the total number of GCMs). The GCM \(i\) strength is stated as follows:

$$S_{i} = \mathop \sum \limits_{k = 1}^{a} \mathop \sum \limits_{z}^{x} \left( {x - z + 1} \right) q_{iz}^{k} \forall i, \;\;k \forall z = 1, \ldots ,x$$
(4)

where, \(q_{iz}^{k}\) = 1 if GCM \(i\) is in rank z for the grid point k and zero in all other case. \(i\) corresponds to the GCM in the first half portion and z ranges from 1st to xth rank, and k represents the grid points.

The weakness of GCM \(i\) is given as:

$$W_{i} = \mathop \sum \limits_{k = 1}^{a} \mathop \sum \limits_{z = y}^{I} \left( {z - y + 1} \right) q_{iz}^{k} \forall i,\;\; k \forall z = y, \ldots ,I$$
(5)

where, \(q_{iz}^{k}\) = 1 if GCM \(i\) is in rank z for the grid point k and zero in all other case. \(i\) corresponds to the GCM in the second half portion and z ranges from 1st to yth rank up to last ranking in the portion, and k represents the grid points.

Net strength is calculated as:

$$N_{i} = S_{i} - W_{i}$$
(6)

The GCM with the highest net strength was regarded as the most appropriate or the best, and others were ordered in accordance with their values.

Ethical approval

Ethics violation has not been done in the study.

Results and discussions

Due to the complex atmospheric processes, model structure, and parameterization variability in representing land surface processes (vegetation dynamics, soil moisture, and land–atmosphere interactions), the temperature data may be over or underestimated from different CMIP models. Variable numerical schemes, grid configurations, spatial grid resolutions in capturing small-scale features, and representation of climate forcing using datasets and methods (for aerosols, land-use changes, greenhouse gas concentrations, ocean circulations, ice and snow albedo, and aerosols) contribute to differences among GCMs, even for same realization, initialization, physics, and forcings20,50,51,52. Therefore, there is a need to appraise the uncertainties associated with climate change data before incorporating it in hydro-climatological studies. Hence, the analysis was carried out for the entire Indian sub-continent consisting of 335 grids. But for demonstrating the performance evaluation of various GCMs for TMAX and TMIN, a grid with a longitude 94.5° E and a latitude 26.5° N located in North-East India was selected. The detailed description of the behavior of different GCMs on different performance indicators are explained in the following sub section.

Performance evaluation of GCMs based on correlation Coefficient for grid (94.5° E, 26.5° N)

Figure 2a and b depict the Taylor Diagram53 showing the correlation between the observed and the simulated temperature data from 30 GCMs for TMAX and TMIN, respectively. It can be observed from Fig. 2a (for TMAX data) that the GCMs, NESM3, INM-CM4-8, E3SM-1-0, ACCESS-ESM1-5, ACCESS-CM2, and INM-CM5-0 are having the highest CC values of 0.7919, 0.7843, 0.7798, 0.7766, 0.7742, and 0.7632, respectively. While KACE-1-0-G and KIOST-ESM are the worst correlated to the observed data with a CC value of − 0.0548 and 0.2774, respectively. Most of the GCM for TMAX (25 in number) had CC values between 0.6 and 0.8, exhibiting moderate matching with the observed data. Similarly, from Fig. 2b, it can be observed that the CC values of 25 models falls between 0.90 and 0.95 for TMIN. Hence, most of the models performed well in simulating the observed minimum temperature for the demonstration grid compared to TMAX. The KACE-1-0-G was the only GCM with small negative correlation for TMIN and TMAX, and hence not shown in Fig. 2. It is not prudent to ascertain the best GCMs for TMAX and TMIN only based on CC. Therefore, the following section further evaluates the GCMs based on other performance indicators before ascertaining the best GCMs for TMAX and TMIN.

Figure 2
figure 2

Taylor Diagram for (a) TMAX and (b) TMIN of 30 CMIP6 GCMs and the IMD gridded data.

Analysis of performance indicators, entropy and VIKOR method at a grid (94.5° E, 26.5° N)

For demonstrating entropy and the VIKOR method, minimum temperature at a grid with a longitude 94.5° E and a latitude 26.5° N located in North-East India was selected. The analysis used the performance indicators under two scenarios, S1: ANMBE-CC-NRMSE-PSS and S2: ANMBE-CC-NRMSE-NSE-PSS and the results are listed in Table 1. From Table 1, it can be observed that the GCM, NESM3 is having the maximum similarity, PSS (97.02%) with the observed PDF and is the most preferred GCM. Other performance indicators also suggest that the same GCM performs better with the values of ANMBE (0.0218), NRMSE (0.1118), and NSE (0.8697). Similarly, INM-CM4-8 showed the least similarity (51.79%) with the observed PDF and was least preferred. Also, BCC-ESM1 was the least preferred in the case of indicators ANMBE (0.4965), NRMSE (0.5183), and NSE (− 1.8007). NorESM2-MM was the best correlated to observed data with a value of 0.9379, while KACE-1-0-G was the worst correlated to the observed data with a value of − 0.1273. The above analysis reveals that the indicators behave differently with distinct GCMs.

Table 1 Performance indicator (PI) values, utility measure (S), regret measure (R), and index values (Q) for minimum temperature, for each GCM under both scenarios S1 and S2, for the grid (94.5° E, 26.5° N) in North-East India.

Further, all the indicators were normalized using the Max–Min method and then the equity contribution for each indicator was calculated using the Sum method. Indicator values are made consistent with the requirements of the entropy approach by the normalization procedures, which also guarantee that large range indicators do not overpower the small range indicators. For S1, among the four indicators, PSS has the highest importance (33.73%) which shows that its effect on GCM ranking is the highest, followed by NRMSE (32.90%), ANMBE (28.44%) and CC (4.93%). There is no significant difference in the contribution of PSS, NRMSE, and ANMBE and their total contribution amount to 95.07%, making them an equally important indicator for GCM ranking for S1 at grid (94.5° E, 26.5° N). For S2, among the five indicators, NSE has the highest importance (35.93%), followed by PSS (21.61%), NRMSE (21.08%), ANMBE (18.22%), and CC (3.16%). The entropy method makes it easy to rank 30 GCMs by providing differential weights opportunities instead of equal weights. Weights computed by entropy methods were used to obtain a normalized weighted decision matrix, subsequently used as inputs to the VIKOR method. The compromise solution is obtained by computing utility measure (Si), regret measure (Ri), and index values (Qi) using Eqs. (13). In this study, the balancing factor \(\vartheta\) is taken as 0.5. GCM, GFDL-CM4 is identified as the compromise solution for both scenarios S1 and S2 by satisfying the conditions C1 and C2. From Table 1, it can be observed that GFDL-CM4 ranks the best by the measure Q (minimum value 0.1656 for S1 and 0.2385 for S2) and by measure R (minimum value of 0.5054 for S1 and 0.6137 for S2). The compromise solution was accepted as obtained by the minimum individual regret (minimum R value) of the “opponent”.

Analysis of performance indicators, entropy and VIKOR method for India

The procedure described in previous section was repeated for the entire India comprising of 335 grids for minimum and maximum temperature using MATLAB and Python in-house developed code. For each grid, all the 30 GCMs were considered and 4 (or 5) indicators for scenario S1 (or S2) were used in achieving compromise solutions. It is noted that weights vary with indicators and with grids. The range of indicators depict a significant variation in the performance of various GCMs. A GCM may perform well in accordance with an indicator, and at the same time, the same GCM performs poorly in accordance with another indicator. One can refer the Supplementary Figs. S1–S10 for individual indicator values corresponding to all the 335 grids and all the GCMs over India, for TMAX and TMIN. For scenario S1, NRMSE is the most crucial indicator with a mean weightage of 41.18% and 45.88% for maximum and minimum temperatures, respectively. For scenario S2, NSE dominates NRMSE, with mean weightage of 35.30% and 42.95% for maximum and minimum temperature, respectively (Supplementary Fig. S11). Therefore, instead of assigning equal weights for indicators, differential weight opportunities were adopted for indicators using the entropy method.

The distribution of weights in various weight ranges (in %) obtained by the entropy method for the entire India, for scenarios S1 and S2 are listed in Table 2. It was observed that the number of grids with a weight less than 10% is the highest for CC (i.e., 323 for TMAX (S1), 318 for TMIN (S1), 333 for TMAX (S2), and 329 for TMIN (S2)), indicating that it is the least prominent indicator for ranking GCMs. Similarly, a greater number of grids in higher weight ranges indicate that NRMSE and NSE are the most prominent indicators for S1 and S2, respectively.

Table 2 Distribution of weights to performance indicators in various ranges under scenarios S1 and S2, over 335 grids of India.

The compromise solutions were computed for entire India, and their solutions were accepted as obtained by the maximum group utility (minimum S value) of the “majority” and the minimum individual regret (minimum R value) of the “opponent”. From Figs. 3 and 4, it can be noticed that 301 grids (89.85%) for maximum temperature and 318 grids (94.92%) for minimum temperature, yield at least one same best-ranked GCMs for scenarios S1 and S2. By analyzing maximum and minimum temperatures from Figs. 3 and 4, it can also be observed that only 36 grids (10.75%) for S1 and 27 grids (8.06%) for S2, produced at least one same best-ranked GCMs. A uniform ranking pattern was seen in both scenarios as indicated by high similarity in compromise solutions (89.85% for maximum temperature and 94.92% for minimum temperature). Moreover, a nonuniform ranking pattern existed between maximum and minimum temperature under both scenarios (i.e., similarity under S1 was 10.75%, and S2 was 8.06%).

Figure 3
figure 3

Spatial distributions of compromise solutions of maximum temperature for scenario S1 (a-c) and S2 (d-f)(Note: each figure is a complete compromise solution) (Maps created using ArcGIS Desktop 10.6.1, url: https://www.arcgis.com/index.html).

Figure 4
figure 4

Spatial distributions of compromise solutions of minimum temperature for scenario S1 (a-c) and S2 (d-f) (Note: each figure is a complete compromise solution) (Maps created using ArcGIS Desktop 10.6.1, url: https://www.arcgis.com/index.html).

Ensemble of GCMs identified for the Indian subcontinent

It is evident that different grids have different best-ranked GCMs, and an effort was made to rank GCMs for the entire India. According to the group decision-making method discussed under methodology, the top five ranked GCMs for maximum temperature are KIOST-ESM, MRI-ESM2-0, NESM3, MIROC6, and CanESM5 for S1 and MRI-ESM2-0, NESM3, KIOST-ESM, E3SM-1-0, and MIROC6 for S2. The respective net strengths for the mentioned GCMs are 3275, 3121, 2791, 2162, and 1548 for S1 whereas 3883, 3331, 3329, 2583, and 2164 for S2 as shown in Fig. 5. The higher ranked GCMs are not mentioned as their net strength difference is more as compared to the top five mentioned GCMs. In the same way for minimum temperature, E3SM-1-0, NESM3, INM-CM5-0, CMCC-ESM2, CanESM5, and GFDL-CM4 are ranked first, second, fourth, sixth, eighth, and ninth for S1 and first, third, fourth, eighth, sixth, and ninth for S2 (listed in Table 3). The average ranking perspective (average of all ranks corresponding to each GCM over 335 grids) was also evaluated54. On the other hand, from average ranking method, it can be observed that MRI-ESM2-0, KIOST-ESM, NESM3, MIROC6 and CanESM5 are ranked first, second, third, fourth, and seventh for S1 and first, third, second, fifth and seventh for S2 (Table 3) for maximum temperature. Similarly, for minimum temperature, E3SM-1-0, NESM3, CMCC-ESM2, INM-CM5-0, and CanESM5 are ranked first, second, fourth, fifth, and eighth for S1 and first, third, sixth, fifth, and fourth for S2. For maximum temperature, MPI-ESM-1-2-HR, EC-Earth3, EC-Earth3-Veg, INM-CM4-8, NorESM2-MM, and EC-Earth3, EC-Earth3-Veg, KACE-1-0-G, NorESM2-LM, NorESM2-MM, are ranked among last five, under both scenarios S1 and S2, using group decision making and average ranking perspective, respectively. For minimum temperature, AWI-CM-1-1-MR, NorCPM1, EC-Earth3-Veg, MPI-ESM1-2-HR, ACCESS-ESM1-5 and EC-Earth3, EC-Earth3-Veg, KACE-1-0-G, ACCESS-ESM1-5, MPI-ESM-1-2-HAM, IITM-ESM, are ranked among last eights, under both scenarios S1 and S2, using group decision making and average ranking perspective, respectively.

Figure 5
figure 5

Net strength of GCMs under scenarios S1 and S2, for maximum and minimum temperatures.

Table 3 Combined ranks of GCMs for maximum and minimum temperatures, under both scenarios S1 and S2, using group decision-making and average perspective methods.

After the extensive analysis of compromise solutions for 335 grids individually, group decision-making, and average raking perspective as a group for both scenarios S1 and S2, it can be suggested that no single GCM is suitable for India as a whole. An ensemble of GCMs can be employed for climate change impact studies for temperature data. Further, the ensemble of GCMs has been recommended by considering India's average cumulative percentage coverage as listed in Table 4, group decision-making, and average ranking perspective. It was observed that KIOST-ESM covers an average 50.85% with compromise solution 1 (CS1) at 173 grids, CS2 at 165, and CS3 at 173 grids for scenario S1 for maximum temperature. Also, KIOST-ESM covers an average of 57.71% with CS1 at 196 grids, CS2 at 188, and CS3 at 196 grids for scenario S2 for maximum temperature. Similarly, E3SM-1-0 covers an average 50.15% for scenario S1 and 53.53% for S2 minimum temperature. Notably, percentage coverage has increased from scenarios S1 to S2 by including NSE as an additional indicator. The KIOST-ESM, MRI-ESM2-0, MIROC6, NESM3, and CanESM5, collectively cover 80.30% (S1) and 85.57% (S2) of India and can be used as an ensemble for maximum temperature. Furthermore, E3SM-1-0, NESM3, CanESM5, GFDL-CM4, INM-CM5-0, and CMCC-ESM2 collectively cover 87.26% (S1) and 87.16% (S2) of India and can be used as an ensemble for minimum temperature.

Table 4 Number of grids over which respective GCMs are compromise solutions (compromise solutions less than seven grids are not tabulated).

ACCESS-CM2, INM-CM4-8, INM-CM5-0, KACE-1-0-G, and MPI-ESM1-2-HR have never been compromise solutions for any of the grids, while AWI-CM-1-1-MR, CMCC-ESM2, EC-Earth3, EC-Earth3-Veg, GISS-E2-1-G, IITM-ESM, MPI-ESM-1-2-HAM, and NorESM2-MM were compromise solutions for three or fewer grids. Also, these GCMs were ranked among the last fifteens using group decision and average perspective methods and hence are the less prominent GCMs for maximum temperature. Moreover, KACE-1-0-G was not found to be suitable for any of the grids and ACCESS-CM2, ACCESS-ESM1-5, EC-Earth3, EC-Earth3-Veg, GFDL-ESM4, GISS-E2-1-G, IITM-ESM, MPI-ESM-1-2-HAM, MPI-ESM1-2-HR, MPI-ESM1-2-LR, and NorCPM1 to three or fewer grids, as compromise solutions for minimum temperature. Also, these GCMs were ranked among the last 15 using group decision and average perspective methods and hence are the less prominent GCMs for minimum temperature.

Ensembles of CNRM-CM5, FGOALS-s2, and MIROC5 for the maximum temperature and MIROC4h, NorESM1-M, MIROC5, and CESM1-CAM5 for the minimum temperature were already proposed in the literature11 using compromise programming and a group decision approach. It is found out that the GCMs ensemble for maximum temperature included MIROC5 and MIROC6, respectively, from this study as well as from the literature11. Both the GCMs are from the same modeling institution, the Atmosphere and Ocean Research Institute, University of Tokyo, Japan. Both studies produce distinct GCMs ensemble suggestions for maximum and minimum temperature, which might be a result of different chosen performance indicators, decision-making approaches, spatial resolutions, and model selections.

Conclusions

This study deals with identifying the best ensemble of GCMs for the Indian subcontinent for studying the futuristic climate change impact. The identification was based on five performance indicators under two scenarios, S1 (ANMBE-CC-NRMSE-PSS) and S2 (ANMBE-CC-NRMSE-NSE-PSS), for ranking 30 CMIP6 GCMs. Grid wise performance was evaluated using these indicators at 335 grids for maximum and minimum temperature. The entropy method was operated to assign weights to the indicators, after normalizing their values using the max–min and the sum methods. Based on indicators and their assigned weights, a multicriteria decision-making method VIKOR was used to rank GCMs and obtain compromise solutions at all grids. Group decision-making, average ranking perspective and cumulative percentage coverage of India, collectively, were used to suggest an ensemble of GCMs. It is understood that seasonal changes and precipitation influences surface temperature. However, this study has not accounted seasonal influences for identifying the best GCMs. A detailed study is needed to understand the model biases associated with seasonal changes and precipitation. The conclusions based on the analysis from this study are summarized as follows:

  1. 1.

    For scenario S1, NRMSE is the most crucial indicator, with a mean weightage of 41.18% and 45.88% for maximum and minimum temperature, respectively. For scenario S2, NSE dominates NRMSE, with mean weightage of 35.30% and 42.95% for maximum and minimum temperature, respectively.

  2. 2.

    Number of grids with weight less than 10% is the highest for CC, indicating it as the least prominent indicator. More number of grids in higher weight ranges indicate NRMSE and NSE as the most prominent indicators for S1 and S2, respectively.

  3. 3.

    Weights vary with indicators and grids, and a particular GCM may perform well considering an indicator, while the same GCM performs poorly considering other indicators. So, it necessitates considering multiple criteria for GCM assessment.

  4. 4.

    A uniform ranking pattern was seen in both scenarios as there was 89.85% similarity in compromise solutions of maximum temperature for S1 and S2, whereas it was 94.92% for minimum temperature. A nonuniform ranking pattern was observed for maximum and minimum temperature under both scenarios (i.e., similarity under S1 was 10.75%, and S2 was 8.06%).

  5. 5.

    No single GCM was suitable for the Indian region as a whole, and hence an ensemble of best GCMs was suggested. Ensemble of KIOST-ESM, MRI-ESM2-0, MIROC6, NESM3, and CanESM5 for maximum temperature and E3SM-1-0, NESM3, CanESM5, GFDL-CM4, INM-CM5-0, and CMCC-ESM2 for minimum temperature was recommended based on this study.