Deep learning downscaled high-resolution daily near surface meteorological datasets over East Asia

Lin, Hai; Tang, Jianping; Wang, Shuyu; Wang, Shuguang; Dong, Guangtao

doi:10.1038/s41597-023-02805-9

Download PDF

Data Descriptor
Open access
Published: 12 December 2023

Deep learning downscaled high-resolution daily near surface meteorological datasets over East Asia

Hai Lin^1,2,
Jianping Tang ORCID: orcid.org/0000-0002-1098-2656^1,2,3,
Shuyu Wang²,
Shuguang Wang^1,2 &
…
Guangtao Dong³

Scientific Data volume 10, Article number: 890 (2023) Cite this article

2611 Accesses
1 Citations
Metrics details

Subjects

Abstract

U-Net, a deep-learning convolutional neural network, is used to downscale coarse meteorological data. Based on 19 models from the Coupled Model Intercomparison Project Phase 6 and the Multi-Source Weather (MSWX) dataset, bias correction and UNet downscaling approaches are used to develop high resolution dataset over the East Asian region, referred to as Climate Change for East Asia with Bias corrected UNet Dataset (CLIMEA-BCUD). CLIMEA-BCUD provides nine meteorological variables including 2-m air temperature, 2-m daily maximum air temperature, 2-m daily minimum air temperature, precipitation, 10-m wind speed, 2-m relative humidity, 2-m specific humidity, downward shortwave radiation and downward longwave radiation with 0.1° horizontal resolution at daily intervals over the historical period of 1950–2014 and three future scenarios (SSP1-2.6, SSP2-4.5 and SSP5-8.5) of 2015–2100. Validation against MSWX indicates that CLIMEA-BCUD shows reasonable performance in terms of climatology, and it is capable of simulating seasonal cycles and future changes well. It is suggested that CLIMEA-BCUD can promote the application of deep learning in climate research in the areas of climate change, hydrology, etc.

Exploring dominant processes for multi-month predictability of western Pacific precipitation using deep learning

Article Open access 30 September 2023

ECMWF short-term prediction accuracy improvement by deep learning

Article Open access 12 May 2022

Anthropogenic fingerprints in daily precipitation revealed by deep learning

Article Open access 30 August 2023

Background & Summary

Climate change exerts tremendous influence on water resources^1,2, agriculture³, and renewable energy⁴, particularly in densely populated regions like East Asia. In the context of global warming, water availability and agriculture are affected by increasing extreme events including floods⁵. Extreme temperature and precipitation not only pose risks to people’s safety but also inflict damage on agriculture crops throughout East Asia. On the other hand, climate change significantly affects the reliability and performance of the energy system, notably solar energy and wind energy⁶. Consequently, there is a growing emphasis on the assessment of climate change in East Asia within the broader context of global warming.

Global Climate Model (GCM) is a crucial tool for understanding climate change, as it can produce long-term and gridded climate information. However, due to the coarse resolution, GCMs are unable to represent the physical processes at fine resolution⁷. Moreover, due to limited knowledge of the earth system and simplified parameterisation, significant biases exist in GCM outputs in comparison to observations^8,9,10. One way to remedy this is bias correction (BC) and downscaling, which are often considered as an essential step for the assessment of climate change. Bias correction is a statistic approach to reducing the discrepancy between GCM simulations and observations¹¹. Downscaling technology aims to obtain data at finer resolution to characterize local-scale features. Usually, downscaling can be classified by dynamical downscaling (DD) and statistic downscaling (SD). Based on BC, SD approach can reduce the bias in GCM¹², especially in CMIP6¹³. There are many downscaled datasets such as NEX-DCP30¹⁴ (0.5°), MACAv2-METDATA¹⁵ (2.5°), MACAv2-LIVNEH¹⁶ (3.75°), NEX-GDDP-CMIP6¹⁷ (0.25°) and Bias-corrected CMIP6 global dataset for dynamical downscaling of the Earth’s historical and future climate¹⁸ (1.25°). However, few downscaling datasets cover large-scale regions with a high resolution of 0.1° from CMIP6 under global warming.

SD establishes statistical relationships between large-scale GCM outputs at coarse resolution and local-scale observations at fine resolution during the training period, and applies the relationships to obtain fine information during the projected period. It is computationally inexpensive and easy to implement¹⁹. There are different SD approaches, for example, regression, weather classifications, and weather generators. Regression approaches are very popular, such as multi linear regression (MLR)²⁰, generalized linear model (GLM)²¹, and machine learning (ML) method including support vector machine (SVM)²², random forests (RF)²³, and artificial neural networks (ANN)^24,25. Many studies have compared the performance among different regression approaches^26,27,28,29.

Deep learning (DL) has been proved to be good at capturing complex and abstract features from numerous data³⁰. Many studies have applied the DL based super-resolution (SR) approaches for downscaling^31,32,33. Among the DL approaches, UNet shows superior performance in the field of SR and has been used in statistical downscaling. Sha et al.^34,35 developed new UNet archives, named UNet-AE and Nest-UNet for temperature and precipitation downscaling respectively, and found that the UNet-based models show better performance than spatial disaggregation. Adewoyin et al.³⁶ applied Temporal Recurrent UNet (TRU-NET) to downscale precipitation, and showed TRU-NET had better performance than a DL model prevalent in precipitation downscaling and dynamical downscaling method.

In this study, we develop a new bias correction and downscaling approach, named BC-UNet, to construct a Climate Change for East Asia with Bias Corrected UNet Dataset (CLIMEA-BCUD) based on CMIP6. The BC-UNet downscaling approach firstly applied Quantile Delta Mapping (QDM) to correct CMIP6 models biases based on the MSWX³⁷ dataset at 1.0° × 1.0° spacing resolution³⁸, then the UNet is trained for downscaling the biased corrected GCM dataset. The BC-UNet archive is applied to the historical simulations (1950–2014) and three future (2015–2100) scenarios of SSP1-2.6, SSP2-4.5 and SSP5-8.5. There are nine near-surface meteorological variables including 2-m air temperature (tas), 2-m daily maximum air temperature (tasmax), 2-m daily minimum air temperature (tasmin), precipitation (pr), 10-m wind speed (sfcWind), downward longwave radiation (rlds), downward shortwave radiation (rsds), 2-m relative humidity (hurs) and 2-m specific humidity (huss) (Table 1). CLIMEA-BCUD provides high-resolution large-scale DL downscaling in East Asia, which we suggest will be helpful for assessing climate change under global warming.

Table 1 Variables included in the CLIMEA-BCUD.

Full size table

Methods

Data acquisition

The MSWX gridded high-resolution bias-corrected meteorological dataset is used as observations. Based on ERA5, MSWX produces 10 widely used near-surface meteorological variables with 0.1° horizontal resolution and 3-hour temporal resolution. The study area covers the whole of East Asia from 4.95°N to 60.05°N and 64.75°E to 150.25°E (Fig. 1). In order to construct the bias correction and a UNet downscaling model, the high-resolution MSWX datasets are averaged to coarse resolution at 1.0° × 1.0° as MSWX_LR using the area average method.

For climate change downscaling, we use the CMIP6 data, which provides the latest GCM simulations including voluminous global gridded model data over the historical period of 1950–2014 and four Shared Socioeconomic Pathways (SSPs) scenarios with 2015–2100 period. There are 19 GCM outputs for historical simulations and three representative future scenarios (SSP1-2.6, SSP2-4.5, and SSP5-8.5) (Table 2). As shown in Table 2, the original CMIP6 GCMs outputs have coarse spacing resolution. All CMIP6 data can be downloaded at https://esgf-node.llnl.gov/projects/cmip6/.

Table 2 CMIP6 modes included in downscaled archive.

Full size table

BC-UNet

The framework to construct the CLIMEA-BCUD, called BC-UNet is demonstrated in Fig. 2. BC-UNet takes GCM simulation datasets and observation as input. It has two main steps: (1) bias correction and (2) UNet downscaling. The details of the two steps are as below.

In the first step, the bias correction method using QDM is applied, which can reduce the bias between observations and GCMs outputs and preserves the change of model projection in quantile^39,40. When applying bias correction, the GCMs outputs are interpolated to 1° × 1° coarse horizontal resolution to match the MSWX_LR with bi-linear interpolation algorithm. Then QDM is used to correct the biases between GCMs and MSWX_LR at coarse resolution, and to calculate the bias corrected GCM results (GCM_BC).

In the second step, UNet with 3 layers neural network, known for its exceptional performance in super-resolution and downscaling tasks, is used for climate downscaling⁴¹. Every convolution and downsampling operation lead to a feature map, which captures the spatial features. The UNet with 3 layers represent 3 downsampling and 3 upsampling. A convolution operation of each layer will generate a feature map, and the number of convolution channels represents the number of feature maps extracted by this layer. The downsampling component of UNet captures crucial spatial features, while the upsampling counterpart generates high-resolution data, effectively facilitating the downscaling process. The convolution channel numbers to capture the spatial features in UNet are {64, 96, 128, 160} for precipitation and {56, 112, 224, 448} for the other variables (Fig. 3).

As the goal of training stage, the loss function⁴² plays an important role in directing the neural network parameter update. The neural network minimizes the loss function value by continuously updating its parameters during the training stage. Training of the UNet model is completed when the loss function converges to the minimum. This study proposes a new loss function based on the mean absolute error (MAE). The loss function effectively augments the UNet’s capacity to regenerate extreme precipitation events and mitigating the bias of variable underestimation. The loss function is as follows:

$$Loss=\frac{1}{n}\mathop{\sum }\limits_{i=0}^{n}\left|{y}_{p,i}-{y}_{o,i}\right|+\frac{w}{m}\mathop{\sum }\limits_{j=0}^{m}\left|{y}_{p,j}-{y}_{o,j}\right|$$

Where i is the grid point which are less than mean, and j indicates grid point which are greater than mean. Weight w is 5 to decrease the underestimation of downscaling model.

In order to effectively capture the fine features of the MSWX dataset in different seasons, four UNets are trained for each variable, with each UNet being responsible for a different season (MAM, JJA, SON and DJF for spring, summer, autumn and winter respectively). To achieve this, inputs for each season are constructed from data (0.1° × 0.1°) which is downscaled from MSWX_LR by a factor of 10 using bi-linear interpolation algorithm and static elevation (z; coarse to 0.1° spacing resolution) from Global 30 Arc-Second Elevation⁴³ (GTOPO30), and original MSWX serves as label for each season. This study feed the univariate image and terrain data to the UNet, and the outcome is a single image. The UNet uses max-pooling for downsampling, deconvolution for upsampling, and long-hop connections to concatenate feature maps of the same resolution. All inputs and labels are spatially normalized before being fed into the UNet. Adaptive moment estimation (Adam) are used as the optimizer in optimization process during the training stage. The GCM_BC are normalized and fed into the trained UNet to generate the downscaling results. Finally, the downscaling results are denormalized to generate CLIMEA-BCUD.

Data Records

CLIMEA-BCUD contains nine meteorological variables (Table 1) of about 19 downscaled CMIP6 outputs. It has the spatial coverage of 4.95°N–60.05°N and 64.75°E–150.25°E at 0.1° × 0.1° horizontal resolution. The time period of historical climate ranges from January 1, 1950 to December 31, 2014. The future period of three climate change scenarios (SSP1-2.6, SSP2-4.5, SSP5-8.5) is from January 1, 2015 to December 31, 2100 at daily intervals. All data are archived in the NetCDF format in CLIMEA-BCUD, named as “/{variables}/{scenarios}/{year}.nc”, where {variables} is the name of the variables, {scenarios} refers to the historical and three future scenarios (SSP1-2.6, SSP2-4.5, SSP5-8.5), and {year} is the year, respectively. The size of multi-model ensemble mean data is about 2.0 TB. Due to the large size of the dataset, the Science Data Bank (https://www.scidb.cn/en) is chosen for the dissemination of the multi-model ensemble mean CLIMEA-BCUD (https://doi.org/10.57760/sciencedb.07718)⁴⁴.

Technical Validation

In order to comprehensively assess the accuracy of the CLIMEA-BCUD, the spatial distribution of climate mean, the variation of annual mean and root mean square error (RMSE) are calculated against the MSWX dataset from 1979 to 2014 (Table 3). The RMSEs between the raw GCM and MSWX are listed to assess the accuracy of the CLIMEA-BCUD. Notably, INM-CM5-0, MPI-ESM1-2-HR, and MPI-ESM1-2-LR in CLIMEA-BCUD exhibit better skills with relatively low RMSEs for surface air temperature. Tasmax in CLIMEA-BCUD shows the best performance with the RMSEs below 0.58 °C and MBs between −0.52 °C and −0.27 °C, which is better than the raw GCM with the RMSEs above 2.31 °C and MBs between −1.27 °C and 1.13 °C. Tasmin in CLIMEA-BCUD shows a lower RMSE (0.78 °C) than the raw GCM whose lowest RMSE is 2.32 °C. For precipitation, most CMIP6 models in CLIMEA-BCUD are able to reproduce the distribution of mean precipitation with the RMSEs below 0.37 mm/day, showing better performance than the raw GCM with the RMSEs of around 1.00 mm/day. The surface wind speed in CLIMEA-BCUD has RMSEs ranging from 0.13 m/s to 0.15 m/s and surface relative humidity has a degree of RMSEs between 0.97% and 1.60%. Compared with CLIMEA-BCUD, the surface wind speed in raw GCM has RMSEs ranging from 0.92 m/s to 1.41 m/s and surface relative humidity has a degree of RMSEs between 6.50% and 11.87%. CLIMEA-BCUD also has RMSEs larger than 3.0 W/m² for surface downward radiative fluxes, especially for surface downward longwave radiation.

Table 3 RMSEs between MSWX and GCM and CLIMEA-BCUD.

Full size table

Figure 4 illustrates the distribution of multi-model ensemble mean bias between the raw GCM and MSWX, CLIMEA-BCUD and MSWX. Evidently, tas in CLIMEA-BCUD is comparable to that in MSWX over regions with flat terrain, showing much better performance than the raw GCM which has much larger bias. Even in the high-altitude regions such as the Qinghai-Tibet Plateau, the multi-model ensemble mean of CLIMEA-BCUD is able to capture the key features including the variation of annual mean tas and spatial patterns of the tas climate mean. Compared with CLIMEA-BCUD, tas in the raw GCM is significantly underestimated over the Qinghai-Tibet Plateau. For precipitation, the bias of multi-model ensemble mean ranges from −0.6 mm/day to 0.6 mm/day over most regions in East Asia. While precipitation in the raw GCM is significantly overestimated over East Asia by around 1.0 mm/day. A relatively large bias for CLIMEA-BCUD above 1.0 mm/day can be found over the south eastern side of the Qinghai-Tibet Plateau, and the west coast of Africa, which is smaller than the raw GCM with a bias above 1.4 mm/day. Compared with the raw GCM, CLIMEA-BCUD for all variables can effectively and generally reproduce the spatial distribution of climatological average from 1979 to 2014 with higher SCCs and lower MBs and variation of annual mean with much lower RMSEs.

Seasonality

Figure 5 illustrates the seasonal cycle of all variables from the raw GCM output. Figure 6 shows that CLIMEA-BCUD can well reproduce the seasonal cycle of surface air temperature with a correlation of 1.0, but shows large uncertainties and warm biases in summer. Compared with raw GCM, the multi-model ensemble mean of CLIMEA-BCUD can well reproduce the seasonal cycle of surface air temperature with a correlation of 1.0 and lower uncertainties; yet significant cold biases are found in spring and winter. Because of the normalization, surface air temperature in CLIMEA-BCUD maintains the advantage of QDM outputs, which can represent time series with higher CC and lower uncertainties than the raw GCM. For the seasonal cycle of precipitation, the multi-model ensemble mean of CLIMEA-BCUD exhibits a good correlation (0.99) and a low RMSE of 0.2 mm/day, but has relatively large uncertainties, particularly in summer when precipitation shows strong spatio-temporal variability.

Surface wind speed in the multi-model ensemble mean of the raw GCM shows good correspondence with MSWX with a high correlation of 0.98 and RMSE of 0.33 m/s, but it clearly overestimates wind speed and exhibits a large uncertainty. While surface wind speed in the multi-model ensemble mean of CLIMEA-BCUD shows good coherence with MSWX with a high correlation of 0.98 and a lower RMSE of 0.15 m/s and significantly reduces the uncertainty, it clearly overestimates wind speed in winter and underestimates it in summer. The surface relative humidity displays a lower degree of seasonal variation than that of MSWX, leading to a rather low correlation of 0.63, which is still higher than the raw GCM (correlation 0.43). Multi-model ensemble mean of CLIMEA-BCUD can well generate downward longwave radiation, downward shortwave radiation, and surface specific humidity. In general, the multi-model ensemble mean of CLIMEA-BCUD, compared with the raw GCM, reduces the uncertainties and achieves higher correlation and lower RMSE.

Extreme events

Regarding the precipitation events, 4 distinctive classes of precipitation events are categorized: light rain (1 ≤ pr < 10 mm/day), moderate rain (10 ≤ pr < 25 mm/day), heavy rain (25 ≤ pr < 50 mm/day) and rainstorm (pr ≥ 50 mm/day) according to the China Meteorological Administration⁴⁵ (CMA). By counting the frequency of precipitation events at each grid and comparing it with the raw GCM, the performance of the CLIMEA-BCUD in generating the precipitation events can be assessed (Fig. 7). For the light rain events, CLIMEA-BCUD is capable of capturing the overall pattern of MSWX, and shows more detail than the raw GCM. QDM can preserve daily precipitation extreme events well, which are also preserved by CLIMEA-BCUD. CLIMEA-BCUD has a higher frequency between 60% and 70% than MSWX which is below 60% over the eastern Pacific. For moderate rain events, the over shift of rain belt for the raw GCM is found in the eastern Pacific and the Qinghai-Tibet Plateau. Moreover, the raw GCM overestimates the frequency over southern China. CLIMEA-BCUD performs better in producing the distribution of frequency, with two main rain belts over the Pacific. But it slightly underestimates the frequency over land areas, especially over southeastern China. For the heavy rain events, GCMs overestimate the frequency over the southeastern Pacific and southern China. CLIMEA-BCUD can capture the spatial distribution of frequency with slight underestimation over most regions in East Asia and perform more details than the raw GCM. For the rainstorm events, the raw GCM cannot regenerate the distribution over East Asia. CLIMEA-BCUD can reproduce the distribution over oceanic areas. Notably, CLIMEA-BCUD narrows down areas with rainstorm events frequency between 1% and 2%, especially over the Kyushu region of Japan. In general, CLIMEA-BCUD can capture different rank precipitation events well, especially moderate rain, but there are some obvious biases in the eastern Pacific.

Projected changes

Based on the evaluation of downscaled daily precipitation and surface air temperature, projections in surface air temperature and precipitation at the end of the 21st century (2070–2100) from CLIMEA-BCUD for all the scenarios (SSP1-2.6, SSP2-4.5, and SSP5-8.5) can be estimated. Figure 8 (the raw GCM) and Figure 9 (CLIMEA-BCUD) shows the changes in multi-model ensemble mean surface air temperature and precipitation at the end 21^st century for all the scenarios (SSP1-2.6, SSP2-4.5, and SSP5-8.5). It is found that the surface air temperature will rise in East Asia, with a greater warming range in the northern part of China especially under the SSP5-8.5 scenario, which shows a similar distribution with the raw GCM. The ensemble mean median change in tas from CLIMEA-BCUD is projected to increase by 1.57 °C in SSP1-2.6, 2.53 °C in SSP2-4.5, and 4.52 °C in SSP5-8.5, which is similar to the raw GCM with 1.63 °C in SSP1-2.6, 2.59 °C in SSP2-4.5 and 4.58 °C in SSP5-8.5. The projection of ensemble mean tasmax and tasmin from CLIMEA-BCUD is similar to that of tas, with the temperature increasing from south to north across East Asia, indicating that the CLIMEA-BCUD preserves the climatic trend from the raw GCM. In terms of precipitation, the projected change in CLIMEA-BCUD generally shows an increase over most areas in East Asia, and the ensemble mean median change is projected to increase by 0.19 mm/day in SSP1-2.6, 0.22 mm/day in SSP2-4.5 and 0.34 mm/day in SSP5-8.5. While the projected change in the raw GCM has the same changes and the ensemble mean median change is projected to increase by 0.20 mm/day in SSP1-2.6, 0.24 mm/day in SSP2-4.5 and 0.37 mm/day in SSP5-8.5. A significant increase of precipitation in the raw GCM is found over the Indian Ocean and the western Pacific Ocean. For CLIMEA-BCUD, precipitation will significantly increase in eastern China, and slightly decrease in the northwestern regions. It will also increase in India, especially under the SSP5-8.5 scenario. The increase of precipitation over the ocean is more notable, mainly in the Indian Ocean and the western Pacific Ocean.

Usage Notes

In this study, we describe the CLIMEA-BCUD dataset for East Asia, which provides daily time series of nine meteorological variables at 0.1 spacing resolution based on 19 CMIP6 GCMs. CLIMEA-BCUD is provided for both the historical period (1950–2014) and the future period (2015–2100), and it incorporates three different emission scenarios for the future: SSP1-2.6, SSP2-4.5 and SSP5-8.5. By delivering such high-resolution information, CLIMEA-BCUD can be very useful for various hydroclimatic research. Furthermore, CLIMEA-BCUD may also prove useful for users not only in the hydrometeorological field but also in others, such as climate change, agriculture, energy, etc. Given East Asia’s continental proportions and its role in global climate, the high resolution (0.1°) of gridded data is critical for developing regional and global assessments and aiding decision- and policy-making. CLIMEA-BCUD is presented in netCDF format (.nc), and it is freely available at the Science Data Bank (https://doi.org/10.57760/sciencedb.07718)⁴⁴. While CLIMEA-BCUD has a wonderful performance in producing the overall patterns of climate mean, seasonal cycle, frequency, and future changes, some limitations must be acknowledged. Firstly, data users should be aware of underestimation when using CLIMEA-BCUD due to its underestimation in representing observations. Secondly, despite displaying good performance in reproducing seasonal variability and extreme events, the bias-corrected products may contain inherent uncertainties, and obscure some fundamental deficiencies presented by the climate models.

Numerous studies have extensively researched methods to enhance model performance in the field of super-resolution, and these advancements are expected to be applicable to downscaling tasks as well. Among them, image enhancement techniques including adaptive gamma correction with weighting distribution⁴⁶ (AGCWD), adaptive gamma correction with color preserving framework⁴⁷ (AGCCPF), range limited Bi-histogram equalization^48,49 (RLBHE), and region adaptive contrast limited adapted histogram equalization⁵⁰ (RACLAHE) are common and powerful tools for improving the performance of DL model. It is valuable to explore its effectiveness in the context of climate downscaling. Furthermore, several studies have explored improved models based on UNet such as UNet++⁵¹, UNet3+⁵², ResUNet⁵³ and USE-NET⁵⁴, which have demonstrated significant potential in various applications. Additionally, models that combine technologies such as generative adversarial network⁵⁵ (GAN) and Transformer⁵⁶ have also shown great potential for further improvement.

Code availability

QDM approach in this study is carried out using the R-packages of the Multivariate Bias Correction of Climate Model Outputs (MBC) project and it is available through the following Github link: https://github.com/cran/MBC. The UNet downscaling approach is carried out using the python-packages of the tensorflow2 and it is available through the following Github link: https://github.com/tensorflow/tensorflow.

All code used in this study can be available through the following Github link: https://github.com/LinHai-debug/CLIMEA-BCUD-code.

References

Goyal, M. K. & Surampalli, R. Y. Impact of Climate Change on Water Resources in India. Journal of Environmental Engineering. 144, https://doi.org/10.1061/(ASCE)EE.1943-7870.0001394 (2018).
Luo, M. et al. Identifying climate change impacts on water resources in Xinjiang, China. Sci Total Environ. 676, 613–626, https://doi.org/10.1016/j.scitotenv.2019.04.297 (2019).
Article ADS PubMed CAS Google Scholar
Arora, N. K. Impact of climate change on agriculture production and its sustainable solutions. Environmental Sustainability. 2, 95–96, https://doi.org/10.1007/s42398-019-00078-w (2019).
Article Google Scholar
Gernaat, D. E. H. J. et al. Climate change impacts on renewable energy supply. Nat. Clim. Chang. 11, 119–125, https://doi.org/10.1038/s41558-020-00949-9 (2021).
Article ADS Google Scholar
Xia, Y. et al. Influences of extreme events on water and carbon cycles of cropland ecosystems: A comprehensive exploration combining site and global modeling. Water Resources Research. 57, e2021WR029884, https://doi.org/10.1029/2021WR029884 (2021).
Article ADS Google Scholar
Cronin, J., Anandarajah, G. & Dessens, O. Climate change impacts on the energy system: a review of trends and gaps. Climatic Change. 151, 79–93, https://doi.org/10.1007/s10584-018-2265-4 (2018).
Article ADS PubMed PubMed Central Google Scholar
Watterson, I. G., Bathols, J. & Heady, C. What Influences the Skill of Climate Models over the Continents? Bulletin of the American Meteorological Society. 95, 689–700, https://doi.org/10.1175/BAMS-D-12-00136.1 (2014).
Article ADS Google Scholar
Maraun, D. Bias Correcting Climate Change Simulations - a Critical Review. Curr Clim Change Rep. 2, 211–220, https://doi.org/10.1007/s40641-016-0050-x (2016).
Article Google Scholar
Sachindra, D. A., Huang, F., Barton, A. & Perera, B. J. C. Statistical downscaling of general circulation model outputs to precipitation—part 1: calibration and validation. Int. J. Climatol. 34, 3264–3281, https://doi.org/10.1002/joc.3914 (2014).
Article Google Scholar
Zelinka, M. D. et al. Causes of higher climate sensitivity in CMIP6 models. Geophysical Research Letters. 47, e2019GL085782, https://doi.org/10.1029/2019GL085782 (2020).
Article ADS Google Scholar
Miao, C. et al. A nonstationary bias-correction technique to remove bias in GCM simulations. J. Geophys. Res. Atmos. 121, 5718–5735, https://doi.org/10.1002/2015JD024159 (2016).
Article Google Scholar
Yang, Y. et al. An intercomparison of multiple statistical downscaling methods for daily precipitation and temperature over China: present climate evaluations. Clim Dyn. 53, 4629–4649, https://doi.org/10.1007/s00382-019-04809-x (2019).
Article Google Scholar
AlMutairi, B. S., Grossmann, I. & Small, M. J. Climate model projections for future seasonal rainfall cycle statistics in Northwest Costa Rica. Int J Climatol. 39, 2933–2946, https://doi.org/10.1002/joc.5993 (2019).
Article Google Scholar
NASA Earth Exchange (NEX) Downscaled Climate Projections (NEX-DCP30) https://cds.nccs.nasa.gov/nex/ (2012).
University of Idaho, Multivariate Adaptive Constructed Analogs Applied to Global Climate Models https://climate.northwestknowledge.net/MACA/ (2011).
glisaclimate https://www.worldclim.org/ (2020).
Thrasher, B. et al. NASA Global Daily Downscaled Projections, CMIP6. Sci Data. 9, 262, https://doi.org/10.1038/s41597-022-01393-4 (2022).
Article PubMed PubMed Central Google Scholar
Xu, Z. et al. Bias-corrected CMIP6 global dataset for dynamical downscaling of the historical and future climate (1979–2100). Sci Data. 8, 293, https://doi.org/10.1038/s41597-021-01079-3 (2021).
Article PubMed PubMed Central CAS Google Scholar
Le, R. R. et al. Comparison of statistical and dynamical downscaling results from the WRF model. Environmental Modelling & Software. 100, 67–73, https://doi.org/10.1016/j.envsoft.2017.11.002 (2018).
Article Google Scholar
Sachindra, D. A., Huang, F., Barton, A. F. & Perera, B. J. C. Multi-model ensemble approach for statistically downscaling general circulation model outputs to precipitation. Q.J.R. Meteorol. Soc. 140, 1161–1178, https://doi.org/10.1002/qj.2205 (2014).
Article ADS Google Scholar
Beecham, S., Rashid, M. & Chowdhury, R. K. Statistical downscaling of multi-site daily rainfall in a South Australian catchment using a Generalized Linear Model. Int. J. Climatol. 34, 3654–3670, https://doi.org/10.1002/joc.3933 (2014).
Article Google Scholar
Pour, S. H., Shahid, S., Chung, E. S. & Wang, X. J. Model output statistics downscaling using support vector machine for the projection of spatial and temporal changes in rainfall of Bangladesh. Atmos. Res. 213, 149–162, https://doi.org/10.1016/j.atmosres.2018.06.006 (2018).
Article Google Scholar
Legasa, M. N., Manzanas, R., Calviño, A. & Gutiérrez, J. M. A posteriori random forests for stochastic downscaling of precipitation by predicting probability distributions. Water Resources Research. 58, e2021WR030272, https://doi.org/10.1029/2021WR030272 (2022).
Article ADS Google Scholar
Hosseini Baghanam, A., Norouzi, E. & Nourani, V. Wavelet-based predictor screening for statistical downscaling of precipitation and temperature using the artificial neural network method. Hydrology Research. 53, 385–406, https://doi.org/10.2166/nh.2022.094 (2022).
Article Google Scholar
Laddimath, R. S. & Patil, N. S. Artificial Neural Network Technique for Statistical Downscaling of Global Climate Model. MAPAN. 34, 121–127, https://doi.org/10.1007/s12647-018-00299-0 (2019).
Article Google Scholar
Campozano, L., Tenelanda, D., Sanchez, E., Samaniego, E. & Feyen, J. Comparison of statistical downscaling methods for monthly total precipitation: case study for the Paute River Basin in Southern Ecuador. Adv Meteorol. https://doi.org/10.1155/2016/6526341 (2016).
Duan, K. & Mei, Y. A comparison study of three statistical downscaling methods and their model-averaging ensemble for precipitation downscaling in China. Theor Appl Climatol. 116, 707–719, https://doi.org/10.1007/s00704-013-1069-8 (2014).
Article ADS Google Scholar
Ghorbanpour, A. K., Hessels, T., Moghim, S. & Afshar, A. Comparison and assessment of spatial downscaling methods for enhancing the accuracy of satellite-based precipitation over Lake Urmia Basin. Journal of Hydrology. 126055, https://doi.org/10.1016/j.jhydrol.2021.126055 (2021).
Vandal, T., Kodra, E. & Ganguly, A. R. Intercomparison of machine learning methods for statistical downscaling: the case of daily and extreme precipitation. Theor Appl Climatol. 137, 557–570, https://doi.org/10.1007/s00704-018-2613-3 (2019).
Article ADS Google Scholar
Najafabadi, M. M. et al. Deep learning applications and challenges in big data analytics. Journal of Big Data. 2, 1, https://doi.org/10.1186/s40537-014-0007-7 (2015).
Article Google Scholar
Sun, L. & Lan, Y. Statistical downscaling of daily temperature and precipitation over China using deep learning neural models: Localization and comparison with other methods. Int J Climatol. 41, 1128–1147, https://doi.org/10.1002/joc.6769 (2021).
Article Google Scholar
Tran Anh, D., Van, S. P., Dang, T. D. & Hoang, L. P. Downscaling rainfall using deep learning long short-term memory and feedforward neural network. Int J Climatol. 39, 4170–4188, https://doi.org/10.1002/joc.6066 (2019).
Article Google Scholar
Wang, F., Tian, D., Lowe, L., Kalin, L. & Lehrter, J. Deep learning for daily precipitation and temperature downscaling. Water Resources Research. 57, e2020WR029308, https://doi.org/10.1029/2020WR029308 (2021).
Article ADS Google Scholar
Sha, Y., Gagne II, D. J., West, G. & Stull, R. Deep-Learning-Based Gridded Downscaling of Surface Meteorological Variables in Complex Terrain. Part I: Daily Maximum and Minimum 2-m Temperature. Journal of Applied Meteorology and Climatology. 59, 2057–2073, https://doi.org/10.1175/JAMC-D-20-0057.1 (2020).
Article ADS Google Scholar
Sha, Y., Gagne II, D. J., West, G. & Stull, R. Deep-Learning-Based Gridded Downscaling of Surface Meteorological Variables in Complex Terrain. Part II: Daily Precipitation. Journal of Applied Meteorology and Climatology. 59, 2075–2092, https://doi.org/10.1175/JAMC-D-20-0058.1 (2020).
Article ADS Google Scholar
Adewoyin, R. A., Dueben, P., Watson, P., He, Y. L. & Dutta, R. TRU-NET: a deep learning approach to high resolution prediction of rainfall. Mach Learn. 110, 2035–2062, https://doi.org/10.1007/s10994-021-06022-6 (2021).
Article MathSciNet MATH Google Scholar
MSWX gridded high-resolution bias-corrected meteorological dataset https://www.gloh2o.org/mswx/ (2022).
Beck, H. E. et al. MSWX: Global 3-Hourly 0.1° Bias-Corrected Meteorological Data Including Near-Real-Time Updates and Forecast Ensembles. Bulletin of the American Meteorological Society. 103, E710–E732, https://doi.org/10.1175/BAMS-D-21-0145.1 (2022).
Article Google Scholar
Cannon, A. J., Sobie, S. R. & Murdock, T. Q. Bias Correction of GCM Precipitation by Quantile Mapping: How Well Do Methods Preserve Changes in Quantiles and Extremes? Journal of Climate. 28, 6938–6959, https://doi.org/10.1175/JCLI-D-14-00754.1 (2015).
Article ADS Google Scholar
Tong, Y. et al. Bias correction of temperature and precipitation over China for RCM simulations using the QM and QDM methods. Clim Dyn. 57, 1425–1443, https://doi.org/10.1007/s00382-020-05447-4 (2021).
Article Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W., Frangi, A. (eds) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science(), vol 9351. Springer, Cham. https://doi.org/10.1007/978-3-319-24574-4_28 (2015).
Zhao, H., Gallo, O., Frosio, I. & Kautz, J. Loss Functions for Neural Networks for Image Processing. arXiv https://doi.org/10.48550/arXiv.1511.08861 (2016).
Article Google Scholar
United States Geological Survey https://www.usgs.gov/centers/eros/science/usgs-eros-archive-digital-elevation-global-30-arc-second-elevation-gtopo30 (1996).
Lin, H. et al. Deep learning downscaled CMIP6 high-resolution (0.1°) daily near surface meteorological datasets over East Asia (ensemble mean). Sciencedb. https://doi.org/10.57760/sciencedb.07718 (2023).
China Meteorological Administration http://data.cma.cn/ (2023).
Huang, C., Cheng, F. & Chiu, Y. Efficient Contrast Enhancement Using Adaptive Gamma Correction With Weighting Distribution. IEEE Transactions on Image Processing. 22, 1032–1041, https://ieeexplore.ieee.org/document/6336819 (2013).
Article ADS MathSciNet PubMed MATH Google Scholar
Gupta, B. & Tiwari, M. Minimum mean brightness error contrast enhancement of color images using adaptive gamma correction with color preserving framework. Optik. 127, 1671-1676, https://www.sciencedirect.com/science/article/abs/pii/S0030402615014230 (2016).
Zuo, C., Chen, Q. & Sui, X. Range Limited Bi-Histogram Equalization for image contrast enhancement. Optik. 124, 425–431, https://www.sciencedirect.com/science/article/abs/pii/S0030402612001118 (2013).
Article ADS Google Scholar
Agarwal, M. & Mahajan, R. Medical Image Contrast Enhancement using Range Limited Weighted Histogram Equalization. Procedia Computer Science. 125, 149–156, https://www.sciencedirect.com/science/article/pii/S1877050917327850 (2018).
Article Google Scholar
Zaridis, D. et al. Region-adaptive magnetic resonance image enhancement for improving CNN-based segmentation of the prostate and prostatic zones. Scientific Reports. 13, 714 https://www.nature.com/articles/s41598-023-27671-8 (2023).
Article ADS PubMed PubMed Central CAS Google Scholar
Zhou, Z., Siddiquee, M., Tajbakhsh, N., & Liang, J. UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation. arxiv, https://arxiv.org/abs/1912.05074 (2020).
Huang, H., et al UNet 3+: A Full-Scale Connected UNet for Medical Image Segmentation. arxiv, https://arxiv.org/abs/2004.08790 (2020).
Zhang, Z., Liu, Q. & Wang, Y. Road Extraction by Deep Residual U-Net. IEEE Geoscience and Remote Sensing Letters. 15, 749–753, https://ieeexplore.ieee.org/document/8309343 (2017).
Article ADS Google Scholar
Rundo, L. et al. USE-Net: Incorporating Squeeze-and-Excitation blocks into U-Net for prostate zonal segmentation of multi-institutional MRI datasets. Neurocomputing. 365, 31-43, https://www.sciencedirect.com/science/article/abs/pii/S0925231219309245 (2019).
Creswell, A. et al. Generative Adversarial Networks: An Overview. IEEE Signal Processing Magazine. 35, 53–65, https://arxiv.org/abs/1710.07035 (2018).
Article Google Scholar
Vaswani A, et al. Attention is all you need. Advances in neural information processing systems. 30, https://arxiv.org/abs/1706.03762 (2017).
Law, R. M. et al. The carbon cycle in the Australian Community Climate and Earth System Simulator (ACCESS-ESM1) - Part 1: Model description and pre-industrial simulation. Geosci. Model Dev. 10, 2567–2590, https://doi.org/10.5194/GMD-10-2567-2017 (2017).
Article ADS CAS Google Scholar
Ziehn, T. et al. The Australian Earth System Model: ACCESS-ESM1.5. J. South. Hemisph. Earth Syst. Sci. 70, 193–214, https://doi.org/10.1071/ES19035 (2020).
Article Google Scholar
Wu, T. et al. BCC-CSM2-HR: a high-resolution version of the Beijing Climate Center Climate System Model. Geosci. Model Dev. 14, 2977–3006, https://doi.org/10.5194/gmd-14-2977-2021 (2021).
Article ADS Google Scholar
Swart, N. C. et al. The Canadian Earth System Model version 5 (CanESM5.0.3). Geosci. Model Dev. 12, 4823–4873, https://doi.org/10.5194/gmd-12-4823-2019 (2019).
Article ADS CAS Google Scholar
Danabasoglu, G. et al. The Community Earth System Model Version 2 (CESM2). Journal of Advances in Modeling Earth Systems. 12, e2019MS001916, https://doi.org/10.1029/2019MS001916 (2020).
Article ADS Google Scholar
Cherchi, A. et al. Global Mean Climate and Main Patterns of Variability in the CMCC-CM2 Coupled Model. J. Adv. Model. Earth Syst. 11, 185–209, https://doi.org/10.1029/2018MS001369 (2019).
Article ADS Google Scholar
Voldoire, A. et al. Evaluation of CMIP6 DECK experiments with CNRM-CM6-1. Journal of Advances in Modeling Earth Systems. 11, 2177–2213, https://doi.org/10.1029/2019MS001683 (2019).
Article ADS Google Scholar
Séférian, R. et al. Evaluation of CNRM Earth-System model, CNRM-ESM2-1: role of Earth system processes in present-day and future climate. Journal of Advances in Modeling Earth Systems. 11, 4182–4227, https://doi.org/10.1029/2019MS001791 (2019).
Article ADS Google Scholar
Döscher, R. et al. The EC-Earth3 Earth system model for the Coupled Model Intercomparison Project 6. Geosci. Model Dev. 15, 2973–3020, https://doi.org/10.5194/GMD-15-2973-2022 (2022).
Article ADS Google Scholar
Li, L. et al. The flexible global ocean-atmosphere-land system model grid-point version 3 (fgoals-g3): description and evaluation. Journal of Advances in Modeling Earth Systems. 12, e2019MS002012, https://doi.org/10.1029/2019MS002012 (2020).
Article ADS Google Scholar
Dunne, J. P. et al. The GFDL Earth System Model Version 4.1 (GFDL-ESM 4.1): Overall Coupled Model Description and Simulation Characteristics. J. Adv. Model. Earth Syst. 12, e2019MS002015, https://doi.org/10.1029/2019MS002015 (2020).
Article ADS Google Scholar
Volodin, E. et al. (2019). INM INM-CM5-0 model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation. https://doi.org/10.22033/ESGF/CMIP6.5081 (2019).
Volodin, E. M. et al. Simulation of the present-day climate with the climate model INMCM5. Clim. Dyn. 49, 3715–3734, https://doi.org/10.1007/S00382-017-3539-7/FIGURES/18 (2017).
Article Google Scholar
Boucher, O. et al. Presentation and Evaluation of the IPSL-CM6A-LR Climate Model. J. Adv. Model. Earth Syst. 12, e2019MS002010, https://doi.org/10.1029/2019MS002010 (2020).
Article ADS Google Scholar
Tatebe, H. et al. Description and basic evaluation of simulated mean state, internal variability, and climate sensitivity in MIROC6. Geosci. Model Dev. 12, 2727–2765, https://doi.org/10.5194/GMD-12-2727-2019 (2019).
Article ADS CAS Google Scholar
Hajima, T. et al. Development of the MIROC-ES2L Earth system model and the evaluation of biogeochemical processes and feedbacks. Geosci. Model Dev. 13, 2197–2244, https://doi.org/10.5194/gmd-13-2197-2020 (2020).
Article ADS Google Scholar
Gutjahr, O. et al. Max Planck Institute Earth System Model (MPI-ESM1.2) for the High-Resolution Model Intercomparison Project (HighResMIP). Geosci. Model Dev. 12, 3241–3281, https://doi.org/10.5194/GMD-12-3241-2019 (2019).
Article ADS CAS Google Scholar
Müller, W. A. et al. A Higher-resolution Version of the Max Planck Institute Earth System Model (MPI-ESM1.2-HR). J. Adv. Model. Earth Syst. 10, 1383–1413, https://doi.org/10.1029/2017MS001217 (2018).
Article ADS Google Scholar
Yukimoto, S. et al. The Meteorological Research Institute Earth System Model Version 2.0, MRI-ESM2.0: Description and Basic Evaluation of the Physical Component. J. Meteorol. Soc. Japan. Ser. II 97, 2019–051, https://doi.org/10.2151/JMSJ.2019-051 (2019).
Article Google Scholar
Seland, Ø. et al. Overview of the Norwegian Earth System Model (NorESM2) and key climate response of CMIP6 DECK, historical, and scenario simulations. Geosci. Model Dev. 13, 6165–6200, https://doi.org/10.5194/gmd-13-6165-2020 (2020).
Article ADS CAS Google Scholar

Download references

Acknowledgements

The Second Tibetan Plateau Scientific Expedition and Research Program (STEP, Grant No. 2019QZKK0206) and National Key Research and Development Program of China (2018YFA0606003) and jointly fund this work.

Author information

Authors and Affiliations

Key Laboratory of Mesoscale Severe Weather/Ministry of Education, Nanjing University, Nanjing, 210023, China
Hai Lin, Jianping Tang & Shuguang Wang
School of Atmospheric Sciences, Nanjing University, Nanjing, 210023, China
Hai Lin, Jianping Tang, Shuyu Wang & Shuguang Wang
Key Laboratory of Citie’s Mitigation and Adaptation to Climate Change in Shanghai, China Meteorological Administration, Shanghai, 200030, China
Jianping Tang & Guangtao Dong

Authors

Hai Lin
View author publications
You can also search for this author in PubMed Google Scholar
Jianping Tang
View author publications
You can also search for this author in PubMed Google Scholar
Shuyu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shuguang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Guangtao Dong
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.T. provided the funding, downloaded the data for experiments, and revised the text. H.L. finished the bias correction, trained the UNet and applied it to downscaling, make the BCUD datasets, plot the figures and tables, and wrote the manuscript text. G.D., S.W. and S.W. revised the text. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jianping Tang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lin, H., Tang, J., Wang, S. et al. Deep learning downscaled high-resolution daily near surface meteorological datasets over East Asia. Sci Data 10, 890 (2023). https://doi.org/10.1038/s41597-023-02805-9

Download citation

Received: 25 April 2023
Accepted: 30 November 2023
Published: 12 December 2023
DOI: https://doi.org/10.1038/s41597-023-02805-9