Background & Summary

Climate change tends to cause sea level rise and more frequent climate extremes, such as heatwaves and heavy precipitation, resulting in serious damage to human society and ecosystems1,2. To avoid irreversible climate change, the Paris Agreement in 2015 aimed to limit the increase in global mean surface temperature (GMST) below 2 °C and preferably keep the rise below 1.5 °C above the preindustrial level3,4. Since then, climate change and its impact under 1.5 and 2 °C warming have received increasing attention5,6,7,8.

The general circulation model (GCM) is a primary tool to investigate climate change under 1.5 and 2 °C warming scenarios. Note that in this paper, GCM refers to the atmosphere-ocean coupled general circulation model or earth system model for simplicity. Many studies used GCM simulations derived from the Coupled Model Inter-comparison Project (CMIP) under Representative Concentration Pathways or Shared Socioeconomic Pathways to examine the climate change in the transient 1.5 and 2 °C warmer GMST9,10,11. As the increase in GMST to 1.5 and 2 °C presents transiently in these simulations, CMIP GCMs cannot deal with the climate change under stabilized 1.5 and 2 °C global warming9,12,13,14, limiting the knowledge of the stabilized 1.5 and 2 °C warmer worlds. To fill the gap, a set of ensemble simulations using the Community Earth System Model (CESM) were produced under three low-warming scenarios in line with the Paris targets, including the stabilization pathways at 1.5 and 2 °C global warming12, which are widely used in many studies15,16,17. Recently, four types of ensemble experiments on 1.5 and 2 °C global warming scenarios were carried out using the Nanjing University of Information Science and Technology Earth System Model (NESM), i.e., 1.5 °C overshoot, stabilized 1.5 °C, stabilized 2 °C, and transient 2 °C18. Compared with the CESM low-warming simulations, NESM3 additionally carried out a simulation under a transient 2 °C global warming scenario, which can provide a more comprehensive understanding of the 1.5 and 2 °C warmer worlds.

However, with a coarse horizontal resolution of 1.25° × 1.25°, the NESM3 cannot provide fine-scale climate change information under 1.5 and 2 °C warmer worlds for many climate-related studies, such as vulnerability, impact, and adaptation assessment and climate extreme projections. Dynamical downscaling is one of the widely-used approaches to generating spatially refined regional climate simulations. The traditional dynamical downscaling approach uses the GCM simulations as the initial and lateral boundary conditions to drive the regional climate model (RCM)19,20,21. As the bias in the initial and lateral boundary conditions can propagate into the RCM, the accuracy of downscaling simulations is largely dependent on the quality of GCMs’ output22,23,24,25,26. Like other GCMs, NESM3 also suffers from systematic bias in terms of various driving variables during the historical period27. Therefore, it is necessary to correct the NESM3 bias to provide high-quality large-scale driving fields for dynamical downscaling projections under the 1.5 and 2 °C global warming scenarios.

Over the past decade, many GCM bias correction methods have been developed, e.g., mean bias correction28, mean and variance bias corrections29,30, quantile-quantile correction31,32, and multi-model ensemble bias correction33. Recently, Xu et al.34 proposed a novel mean-variance-trend (MVT) method, which integrates the advantages of some previous GCM bias correction methods. Our evaluation suggested that the MVT method enables to improve dynamical downscaling simulations of multiple variables in terms of their climatological mean, interannual-to-interdecadal variances, annual cycle, and day-to-day variability35. Therefore, with the aid of the MVT method, we corrected the climatological mean and inter-annual variance biases in NESM3 based on the European Centre for Medium-Range Weather Forecasts Reanalysis 5 (ERA5) dataset. The bias corrections were applied to historical simulation during 1979–2014 and four types of future simulations on 1.5/2 °C global warming scenarios. This bias-corrected dataset provides high-quality large-scale driving fields for dynamical downscaling under 1.5 and 2 °C global warming scenarios, which is expected to generate more reliable simulations at the regional scale relative to the dynamical downscaling using raw GCM data.

Methods

Data acquisition

We used the six-hourly data of the first ensemble run produced by the NESM3 during the historical and future periods. The historical simulation during 1979–2014 is available in the CMIP Phase 6 (CMIP6) historical experiments36,37,38. The future simulations include four types of the 1.5 and 2 °C global warming scenarios, i.e., the stabilized 1.5 °C scenario during 2070–2100, the 1.5 °C overshoot scenario during 2070–2100, the stabilized 2 °C scenario during 2070–2100, and the transient 2 °C scenario during 2031–206118,39. For the stabilized 1.5 and 2 °C scenario, the GMST warming is designed to be stabilized at the level of 1.5 and 2 °C ( ± 0.1 °C) above preindustrial levels for at least three decades by 2100, respectively. The 1.5 °C overshoot scenario is designed such that the GMST warming slightly overshoots before returning to the 1.5 °C by 2100. For the transient 2 °C scenario, the GMST is designed to continuously increase and reach 2 °C by 2050. These four types of future simulations can help to comprehensively assess the climate impacts of 1.5 and 2 °C global warming and understand the driving mechanism of climate change in different warming scenarios. Compared with the CESM low-warming simulations, the additional scenario of the transient 2 °C can also be used to assess the difference and similarity of climate responses under transient and stabilized global warming. Besides, the NESM3 is configured with a horizontal resolution of 1.9° by 1.9° in the atmosphere and 1° by 1° in the ocean37. The used variables include the five upper air variables (i.e., air temperature, geopotential height, specific humidity, zonal wind, and meridional wind) and six surface variables (i.e., surface pressure, sea level pressure, sea surface temperature, soil moisture, soil temperature, and 2 m temperature). Therein, upper air variables are at 17 different pressure levels, including 1000, 925, 850, 700, 600, 500, 400, 300, 250, 200, 150, 100, 70, 50, 30, 20, and 10hPa. Soil moisture and temperature consist of 4 depth levels, including 0.05, 0.25, 0.7, and 1.5 m.

We also used the high-resolution reanalysis dataset—ERA5 during 1979–201440,41. Assimilating vast amounts of observations by advanced modeling and data assimilation system, ERA5 shows improvements compared with the ERA-Interim in many aspects, e.g., cloud, precipitation, wind fields, and humidity42,43,44. Both the NESM3 and ERA5 are re-gridded to the horizontal resolution of 1.25° by 1.25° using the bilinear interpolation with the aid of climate data operator (CDO).

GCM bias-correction method

Similar to the MVT method proposed by Xu et al.34, we corrected the NESM3 biases of the inter-annual variance and climate mean. As the future simulations under 1.5 and 2 °C global warming scenarios are not available with a limited number of CMIP6 models, we did not adjust the long-term non-linear trend of NESM3 using the trend derived from the multi-model ensemble mean. Instead, we preserved the long-term non-linear trend of NESM3.

The bias corrections were applied to the six-hourly data. For each six-hour period and day of the year, the multi-year time series derived from the NESM3 simulation and the ERA reanalysis can be broken down into a long-term non-linear trend plus an inter-annual perturbation term, respectively:

$${\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}={{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}_{LT}+{{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}^{{\rm{{\prime} }}}$$
(1)
$${\rm{E}}{\rm{R}}{\rm{A}}={{\rm{E}}{\rm{R}}{\rm{A}}}_{LT}+{{\rm{E}}{\rm{R}}{\rm{A}}}^{{\rm{{\prime} }}}$$
(2)

The non-linear trend was calculated using the monthly mean data by the ensemble empirical mode decomposition (EEMD) method45. EEMD is an intuitive, direct, and adaptive method based on empirical mode decomposition, which can extract coexisting oscillations of different frequencies in the original data to a few separate components45. Here, the non-linear trend is a monotonic curve that filters all high-frequency fluctuation.

We first corrected the variance bias of NESM3 by multiplying a scaling factor derived from the inter-annual perturbation term:

$${{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}_{v}={{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}_{LT}+{{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}^{{\rm{{\prime} }}}\times {r}_{s}^{hist}$$
(3)
$${r}_{s}^{hist}=\frac{{\sigma }_{{\rm{ERA}}}}{{\sigma }_{{\rm{NESM}}}^{hist}}$$
(4)

where \({r}_{s}^{hist}\) is the ratio of the standard deviation of the detrended ERA5 reanalysis to that of the detrended NESM3 simulation during the historical period (1979–2014), and NESMv is the NESM3 simulation with bias-corrected variance. We calculated the standard deviation using all 36 years of data. Then, we removed the years with anomalies greater than three times the original standard deviation and recalculated the standard deviation to remove the unrealistic standard deviation induced by extreme weather systems, e.g. tropical cyclones. Note that the bias correction of the inter-annual variance for each six-hour of the year is conducted on the detrended NESM3, which cannot completely reduce the bias of the inter-annual variability of NESM3 because the inter-annual variance of the long-term trend is not considered here.

Next, we removed the climatological mean bias of NESM3 for each six-hour of the year. Since the climatological mean value is included in the long-term non-linear trend at the monthly scale, we replaced the climatological mean values of the non-linear trend \(\overline{({{\rm{NESM}}}_{LT}}\)) with the climatological mean values for each six-hour of the year (\(\overline{{\rm{NESM}}}\)) in the NESM3. Subsequently, the bias-corrected data can be constructed as follows:

$${{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}_{mv}={{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}_{LT}-\bar{{{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}_{LT}}+\bar{{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}-\bar{({{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}^{hist}}-\bar{{\rm{E}}{\rm{R}}{\rm{A}}5})+{{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}^{{\rm{{\prime} }}}\times {r}_{s}^{hist}$$
(5)

where the overbar indicates the climatological mean, and \(\bar{({{\rm{N}}{\rm{E}}{\rm{S}}{\rm{M}}}^{hist}}-\bar{{\rm{E}}{\rm{R}}{\rm{A}}5})\) is the climatological mean bias of NESM3 for each six-hour of the year during the historical period. Thus, the bias-corrected six-hourly NESM3 data (NESMmv) preserved the long-term non-linear trend and removed the biases of climate mean and inter-annual variance. Under the assumption that the historical biases remain stationary in future scenarios, the bias corrections were conducted to the NESM3 historical and future simulations according to the NESM3 biases during historical period. Besides, considering the significant difference in the inter-annual variance between the sea water and sea ice surface, we did not correct the variance bias of the sea surface temperature in the NESM3 for the region where there is sea ice (water) in the NESM3 while sea water (ice) in the ERA5 during historical period.

Evaluation method

For the time series, we evaluated the bias-corrected NESM3 performance by the mean and standard deviation. Besides, the frequency distribution of the time series was also taken into account, and we used the Kolmogorov–Smirnov statistic to evaluate the goodness-of-fit46. A larger value for the Kolmogorov–Smirnov statistic represents more difference between the two distributions46.

To give a more comprehensive evaluation, we focused on the spatial fields of the seasonal mean and variability as well as the extreme values simulated by the bias-corrected NESM3. Here, the 2nd percentile and 98th percentile of daily simulations were used as the extreme values, following the framework of VALUE47. Four centered statistics were used to measure model performance in terms of the scalar fields, i.e., mean error (ME), standard deviation (SD), correlation coefficient (CORR), and root mean square error (RMSE). Similarly, we also used the vector mean error (VME), the centered root mean square length (cRMSL), the centered vector similarity coefficient (cVSC), and the root mean square vector difference (cRMSVD)48,49,50 for the vector fields, such as wind fields. Note that these four statistics take the wind speed and wind direction into consideration simultaneously48,49,50. ME (VME) and RMSE (RMSVE) measure the mean bias and the overall bias for a scalar (vector) variable, respectively. SD (cRMSL) measures the amplitude of an anomalous scalar (vector) field, and CORR (cVSC) measures the pattern similarity between two anomalous scalar (vector) fields.

Data Records

The dataset presented in this paper was open access in the China Science Data Bank51. The NetCDF data were compressed to save space. Users should use two attributes (i.e., scale_factor and add_offset) to unpack the variables. We also provide a FORTRAN code to convert these compressed NetCDF files to WRF intermediate files. The dataset includes three surface variables and five upper air variables for five sets of bias-corrected NESM3 simulations at six-hourly interval, i.e., the historical data from 1979 to 2014, the transient 2 °C scenario from 2031 to 2061, as well as the stabilized 1.5 °C scenario, the 1.5 °C overshoot scenario, and the stabilized 2 °C scenario from 2070 to 2100 (Table 1). The upper air variables consist of 17 pressure levels (1000, 925, 850, 700, 600, 500, 400, 300, 250, 200, 150, 100, 70, 50, 30, 20, and 10hPa). In addition, the soil moisture, the soil temperature, and the 2 m temperature are also provided as the initial conditions (ICs) for the dynamical downscaling. Considering that the impact of the ICs is neglectable for long-term RCM simulations, the three variables are at the monthly scale. Due to the missing data for the soil moisture and the soil temperature of NESM3 during the historical period, we used the ERA5 data as the historical soil conditions and provided the future soil conditions from the NESM3 with no bias correction. Note that the monthly 2 m temperature is bias-corrected in terms of the climate mean during both historical and future periods, which is used as the surface temperature of the land when generating the ICs for RCM.

Table 1 Variables in the bias-corrected NESM3 dataset. The tick represents the NESM3 simulation of a variable with bias corrections. ‘ERA5’ represents the simulation of a variable derived from the ERA5, while ‘Raw’ indicates the original NESM3 simulation. The variable with the star symbol is provided at a monthly time scale, and the others are at six-hourly interval.

The horizontal resolution of the dataset is 1.25° by 1.25°. All the bias-corrected data were stored in a self-describing NetCDF format. The NetCDF files are named “atm_experiment_yyyy_mm.nc”, where “experiment” donates different experiments, i.e., hist = historical, 1.5_overshoot = 1.5 °C overshoot scenario, 1.5_stable = stabilized 1.5 °C scenario, 2.0_stable = stabilized 2 °C scenario, and 2.0_transient = transient 2 °C scenario. “yyyy” and “mm” donate the year and the month of the data, respectively. Similarly, the monthly dataset of the soil moisture, the soil temperature, and the 2 m temperature are stored in the NetCDF files named “lnd_experiment_yyyy_mm.nc”. Each NetCDF file includes all the six-hourly or monthly data for one month of a year. The complete dataset is about 1.57TB in size. All the data are freely available at the China Science Data Bank51.

Technical Validation

As a state-of-the-art reanalysis dataset, the ERA5 reanalysis is regarded as the reference data to correct the NESM3 historical and future simulations. To validate the bias-corrected data, we compared the performance of the original and the bias-corrected NESM3 during the historical period against the ERA5. The difference between the bias-corrected NESM3 and the ERA5 represents the remaining bias that was not corrected. Compared with the NESM3, a smaller bias in the bias-corrected NESM3 indicates that the MVT bias correction is more effective, and vice versa.

Time series

To explicitly illustrate the changes induced by the GCM bias correction, we used the sea level pressure (PSL) in the Himalayas (30°N, 82.5°E) as an example. The time series of the original and the bias-corrected NESM3 are compared with that of the ERA5 data during the historical period (Fig. 1a). The NESM3 significantly underestimates the climatological mean of PSL by 17.12hPa against the ERA5 (0.05 significance level of student-t test), while the climatological mean of PSL in the bias-corrected NESM3 is the same as the ERA5. Regarding the inter-annual standard deviation, there is a significant overestimation in the NESM3 relative to the ERA5 (0.1 significance level of F-test), characterized by 3.07hPa in the NESM3 against 2.28hPa in the ERA5. With the standard deviation of 2.5hPa, the bias of the inter-annual variability is largely removed in the bias-corrected NESM3. Due to no bias correction for the inter-annual variance of the long-term trend, the bias of the inter-annual variability in the raw NESM3 is not completely reduced in the bias-corrected NESM3. Similar to the historical period, the bias-corrected NESM3 shows a greater climatological mean and smaller inter-annual variability than the raw NESM3 in the four scenarios during the future periods (Fig. 1b–e). Note that the bias correction method affects the climatological mean and inter-annual variability of the time series but does not alter the phase in the time series.

Fig. 1
figure 1

Time series of sea level pressure (PSL) in the Himalalys (30°N, 82.5°E) at 0900 UTC on July 15 in (a) historical and (b,c,d,e) future simulations. During historical period (1979–2014), the NESM3 data and the bias-corrected NESM3 data (NESM3.bc) are compared against the ERA5 data. The 36-year averaged value (Mean) and temporal standard deviation (Stddev) for each dataset are shown in the upper-left of the panel (a). During future periods, the time series for the NESM3 and NESM3.bc are shown in the (b) 1.5 °C overshoot scenario (2070–2100), (c) stabilized 1.5 °C scenario (2070–2100), (d) stabilized 2 °C scenario (2070–2100), and (e) transient 2 °C scenario (2031–2061).

Besides, the frequency distribution of 850hPa air temperature (T850) over the western North Pacific (45°N, 150°E) was also evaluated in the original and the bias-corrected NESM3 (Fig. 2). During the historical period, there is an obvious overestimation of the high-temperature events between 287–295 K in the NESM3 (Fig. 2a). With the Kolmogorov–Smirnov statistic of 0.36, the NSEM3 shows significantly different frequency distribution against that for the ERA5 (0.05 significance level of Kolmogorov–Smirnov test). By contrast, the bias-corrected NESM3 shows no significant difference in frequency distribution with that of the ERA5 (0.05 significance level of Kolmogorov–Smirnov test), characterized by the Kolmogorov–Smirnov statistic of 0.05 (Fig. 2a). Generally, the frequency distribution in the bias-corrected NESM3 has higher goodness-of-fit than that for the original one. It indicates that the bias correction method can indirectly correct the frequency characteristics of the driving variable, which favors a more accurate simulation of climate extreme events. During future periods, the frequency distribution also shifts to the low temperature in the bias-corrected NESM3 relative to that for the NESM3 under each global warming scenario (Fig. 2b–e).

Fig. 2
figure 2

Frequency distribution of 850hPa air temperature (T850) in the western North Pacific (45°N, 150°E) at 0300 UTC in July in (a) historical and (b,c,d,e) future simulations. During historical period (1979–2014), the NESM3 data and the bias-corrected NESM3 (NESM3.bc) data are compared against the ERA5 data, and the curve of each data contains a total of 1116 days (36 years × 31 days). During future periods, the frequency distribution of the NESM3 and NESM3.bc are shown in the (b) 1.5 °C overshoot scenario (2070–2100), (c) stabilized 1.5 °C scenario (2070–2100), (d) stabilized 2 °C scenario (2070–2100), and (e) transient 2 °C scenario (2031–2061). A total of 961 days (31 years × 31 days) are included in the curve of each data in the future scenario.

Seasonal mean

The performances of the original and the bias-corrected NESM3 in simulating the seasonal mean of seven driving variables in terms of the climatological mean during the historical period (1979–2014) were evaluated using ME (VME), SD (cRMSL), and CORR (cVSC) against the ERA5 data (Fig. 3). The seven driving variables can be regarded as a proxy for the CMIP6 models’ quality as driving fields for dynamical downscaling27. Results show that the NESM3 suffers from obvious bias in some variables, like other CMIP6 models27. There is a cold bias in the NESM3 simulation of global mean sea surface temperature (SST) throughout the year, especially in June–July–August (JJA) (Fig. 3a). In terms of the wind fields, the NESM3 shows inferior performance in the upper level to that of the lower level, especially during December–January–February (DJF) and March–April–May (MAM), characterized by overestimation of the mean and amplitude (Fig. 3a and b). Regarding the air temperature, the NESM3 also shows poor performance in the upper level during all seasons, which is similar to other CMIP6 models34. Notably, the NESM3 suffers from a cold bias in the global mean of 200hPa air temperature (T200) throughout the year (Fig. 3a). Simultaneously, the NESM3 shows the overestimated amplitude in the anomalous fields of T200 during DJF, MAM, and September–October–November (SON), along with a poor pattern similarity with the ERA5 data (Fig. 3b and c). Besides, the NESM3 shows a wet bias in the global mean of the low-level specific humidity and obviously overestimates its amplitude throughout the year (Fig. 3a and b). Compared with the original NESM3, the bias-corrected NESM3 removes all biases of the seasonal mean fields in terms of spatial mean, amplitude, and pattern similarity.

Fig. 3
figure 3

Statistical metrics to measure the performance of the NESM3 and the bias-corrected NESM3 (NESM3.bc) in simulating the climatological mean (1979–2014) of multiple driving variables in terms of the seasonal mean. Four seasons include December–January–February (DJF), March–April–May (MAM), June–July–August (JJA), and September–October–November (SON). Driving variables include sea surface temperature (SST, unit: °C), 850hPa vector winds (UV850, unit: m/s), 200hPa vector winds (UV200, unit: m/s), 500hPa air temperature (T500, unit: °C), 200hPa air temperature (T200, unit: °C), 850hPa specific humidity (Q850, unit: g/kg), and 700hPa specific humidity (Q700, unit: g/kg). The ERA5 data is used as the reference. (a) ME is the mean error of the GCM data against the ERA5 data, (b) SD is the ratio of the spatial standard deviation of the GCM data to the ERA5 data, (c) CORR is the spatial correlation coefficient between the GCM data and ERA5 data. Note that UV850 and UV200 are treated as vector variables in the evaluation, and then ME, SD, and CORR for the vector wind are equivalent to VME, cRMSL, and cVSC, respectively. The left-upper (right-lower) triangle in each square represents the original (bias-corrected) NESM3 performance. The color filling in each triangle donates the value of the metrics, and lighter color indicates better performance.

In addition to the seasonal mean, the climatological mean bias on the monthly and six-hour time scales should be completely removed in the bias-corrected NESM3 against the original NESM3, according to the process of the bias corrections. Thus, we compared the overall bias of the SST and 850hPa wind fields (UV850) on the time scales of the month and six-hour in two simulations (Fig. 4). For the climatological mean of monthly SST, the NESM3 suffers from the overall bias greater than 1.35 °C, and the relatively large RMSE appears in February and March (Fig. 4a left). In terms of UV850, the NESM3 shows the RMSVE of 1.96–2.86 m/s, with relatively poor performance in February and March (Fig. 4b left). Compared with the NESM3, the bias-corrected NESM3 reduces almost all the bias of the SST and UV850. As for the climatological mean fields on six-hour time scale, the NESM3 generally shows the RMSE of approximately 1.4 °C in SST and RMSVE of 1.56–1.6 m/s in UV850 (Fig. 4 right). Compared with the NESM3, the RMSE of SST is greatly reduced to 0.08 °C, and the RMSVE of UV850 is completely removed in the bias-corrected NESM3. Note that we only corrected SST of the NESM3 in the region where there is ice or water in both NESM3 and ERA5 and kept that in the region where there is ice (water) in the NESM3 while water (ice) in the ERA5, considering the different inter-annual variance between the ice and water surface.

Fig. 4
figure 4

Overall bias of climatological mean fields on the monthly (left) and six-hour (right) time scales for sea surface temperature (SST) and 850hPa wind fields (UV850) in the NESM3 and the bias-corrected NESM3 (NESM3.bc). The overall bias is measured by the root mean square error (RMSE) for SST while the root mean square vector difference (RMSVE) for UV850.

Seasonal variability

We also evaluated the performance of the original and the bias-corrected NESM3 in simulating the climatological mean of the seasonal variability for the seven driving variables during the historical period from three aspects (Fig. 5). Here, we used the standard deviation of the daily data across the season in one year to measure the seasonal variability. Seasonal variability of a variable represents the amplitude of the anomalous values throughout the season, which is critical for the downscaling simulation of extreme weather. The NESM3 performance in simulating seasonal variability varies with different variables. The NSEM3 overestimates the spatial mean of SST variability throughout the year (Fig. 5a). Simultaneously, the NESM3 shows a relatively large amplitude during DJF, MAM, and JJA as well as the rather poor pattern similarity during DJF and MAM against the ERA5 (Fig. 5b and c). In terms of the wind fields, there is a relatively large overestimation of the spatial mean in the upper level throughout the year (Fig. 5a). As for the air temperature, the NESM3 overestimates the spatial mean of the 500hPa air temperature (T500) and underestimates the amplitude of the T200 in the anomalous fields throughout the year (Fig. 5a and b). In addition, the NESM3 also shows the overestimated amplitude in the low-level specific humidity throughout the year except for the 850hPa specific humidity (Q850) during DJF (Fig. 5b). For the Q850 during DJF, the NESM3 shares the relatively poor pattern similarity with the ERA5 than that of the other seasons (Fig. 5c). Taking three aspects of model performance into account, the bias-corrected NESM3 preforms greater than the original NESM3 in simulating driving variables. The bias corrections reduce the spatial mean bias of more than 17% in all variables except for UV850 during JJA as well as Q850 during MAM, JJA, and SON (Fig. 5a). In terms of the anomalous fields, the bias corrections improve the NESM3 simulation of amplitude and pattern similarity across all variables, with the ratio of the amplitude against the ERA5 between the 0.98 and 1.06 as well as the pattern similarity greater than 0.97 (Fig. 5b and c).

Fig. 5
figure 5

Same as Fig. 3, but for the NESM3 and the bias-corrected NESM3 (NESM3.bc) in simulating the climatological mean (1979–2014) of seasonal variability for multiple driving variables. The seasonal variability is measured by the standard deviation of the daily data across the season of one year.

Extreme values

The extreme value of the large-scale driving variable is also vital to the dynamical downscaling. Considering the SST as an important forcing in the RCM, we calculated the 98th percentile of daily SST during July from 1979–2014 for the original and the bias-corrected NESM3 as well as the ERA5 (Fig. 6a,c, and e). The original NESM3 generally shows a warm bias of more than 2 °C in the North Pacific and the west coast of Africa and a cold bias of less than −2 °C in the intertropical Pacific convergence zone, Labrador Basin, and Norwegian Sea (Fig. 6c). In the contrast, the SST extreme bias in the bias-corrected NESM3 is between -1 °C and 1 °C in almost all grid cells (Fig. 6e). In general, the overall bias of SST extreme is reduced by 68% in the bias-corrected NESM3.

Fig. 6
figure 6

Spatial fields of the 98th percentile of the daily sea surface temperature (SST, unit: °C) and length of 850hPa wind anomaly (LWA850, unit: m/s) during July derived from 1979–2014. (a, b) the ERA5 data, (c, d) difference between the NESM3 and the ERA5, (e, f) difference between the bias-corrected NESM3 (NESM3.bc) and the ERA5.

In addition to the SST, the wind fields also play a vital role in transferring information into the RCM and can further affect downscaled precipitation by water vapor transport26. To take the direction and speed of the vector wind into consideration simultaneously, we defined the length of the vector wind anomaly (LWA) as follows:

$${{\rm{LWA}}}_{{\rm{i}}}=\sqrt{{({u}_{i}-\bar{u})}^{2}+{({v}_{i}-\bar{v})}^{2}}$$
(6)

where i indicates the time, and u (v) represents the zonal (meridional) wind. \(\bar{u}\) (\(\bar{v}\)) is the climatological mean value of u (v) during the historical period. LWA can measure the deviation of the vector wind relative to its climate mean. We compared the 98th percentile of the daily length of the 850hPa wind anomaly (LWA850) between the original and the bias-corrected NESM3 (Fig. 6b,d, and f). The original NESM3 shows a obvious overestimation in the northern part of the equatorial Pacific, with a bias greater than 4 m/s (Fig. 6d). Large bias of the LWA850’s extreme (>4 m/s and <−4 m/s) also appears in the eastern part of the Central Indian Ocean and the high altitudes areas, e.g., the Tibetan Plateau and the Rocky Mountains (Fig. 6d). In contrast, the bias is greatly improved in the bias-corrected NESM3, and the overall bias of the LWA850’s extreme is reduced by 55% in the bias-corrected NESM3 (Fig. 6f).

Further, we comprehensively evaluated the 2nd percentile and 98th percentile of seven driving variables using daily data during July from 1979–2014 simulated by the original and the bias-corrected NESM3 using ME, SD, and CORR against the ERA5 data (Fig. 7). Among seven variables, there are relatively large amplitude biases in the anomalous field of the 98th percentile of 700hPa specific humidity (Q700) as well as the 2nd percentile of the length of the 200hPa wind anomaly (LWA200), Q700, and Q850 simulated by the NESM3 (Fig. 7a and c). Specifically, the NESM3 overestimates these variables, with a ratio of amplitude greater than 1.1 to the ERA5 data. Simultaneously, the NESM3 shows relatively poor pattern similarity in the 98th of LWA850 and T200 as well as the 2nd percentile of LWA850, LWA200, Q700, and Q850 than the other variables, characterized by CORR less than 0.95 (Fig. 7a and c). For both amplitude and pattern similarity of the anomalous fields, the bias-corrected NESM3 shows improvement in simulating the extremes of seven driving variables compared with the original NESM3, with CORR greater than 0.97 and the ratio of the amplitude between 0.94–1.06 against the ERA5 (Fig. 7b and d). In terms of the spatial mean, the bias-corrected NESM3 improves the simulation in all extreme values of seven variables against the original NESM3 by 65% on average, except for the 98th percentile of LWA850 (Fig. 7b and d). For the LWA850, the positive and negative biases of the spatial field in the original NESM3 cancel with each other (Fig. 6d), leading to a smaller spatial mean bias than that of the bias-corrected NESM3. Taking the three aspects of model performance into account, the bias corrections greatly enhance the extreme values of seven driving variables during July from 1979–2014 simulated by the NESM3.

Fig. 7
figure 7

Taylor diagram for the 98th percentile and 2nd percentile of various variables during July from 1979–2014 simulated by (a, c) NESM3 and (b, d) the bias-corrected NESM3 (NESM3.bc). Various variables include sea surface temperature (SST, unit: °C), length of 850hPa wind anomaly (LWA850, unit: m/s), length of 200hPa wind anomaly (LWA200, unit: m/s), 500hPa air temperature (T500, unit: °C), 200hPa air temperature (T200, unit: °C), 850hPa specific humidity (Q850, unit: g/kg), and 700hPa specific humidity (Q700, unit: g/kg). The azimuthal position gives correlation coefficient (CORR), the radial distance from the origin indicates standard deviation (SD), and the distance between the model and the reference points provides root mean square error of the anomalous fields. Note that the SD of the model is normalized by that derived from the ERA5. The value in parentheses behind the variable name is the spatial mean bias of a variable.

Usage Notes

The dataset presented in this paper was saved in NetCDF format. Many different types of software are available to manipulate or display the NetCDF data, e.g., FORTRAN, CDO, NCO, NCL, Python, Panoply, Ncview, and GrADS. More software can be found at https://www.unidata.ucar.edu/software/netcdf/software.html. The data provided here can be used to generate the initial conditions, underlying conditions, and lateral boundary conditions of the RCM.