Global high-resolution total water storage anomalies from self-supervised data assimilation using deep learning algorithms

Gou, Junyang; Soja, Benedikt

doi:10.1038/s44221-024-00194-w

Download PDF

Article
Open access
Published: 12 February 2024

Global high-resolution total water storage anomalies from self-supervised data assimilation using deep learning algorithms

Nature Water volume 2, pages 139–150 (2024)Cite this article

5451 Accesses
2 Citations
36 Altmetric
Metrics details

Subjects

Abstract

Total water storage anomalies (TWSAs) describe the variations of the terrestrial water cycle, which is essential for understanding our climate system. This study proposes a self-supervised data assimilation model with a new loss function to provide global TWSAs with a spatial resolution of 0.5°. The model combines hydrological simulations as well as measurements from the Gravity Recovery and Climate Experiment (GRACE) and its follow-on (GRACE-FO) satellite missions. The efficiency of the high-resolution information is proved by closing the water balance equation in small basins while preserving large-scale accuracy inherited from the GRACE(-FO) measurements. The product contributes to monitoring natural hazards locally and shows potential for better understanding the impacts of natural and anthropogenic activities on the water cycle. We anticipate our approach to be generally applicable to other TWSA data sources and the resulting products to be valuable for the geoscience community and society.

Global prediction of extreme floods in ungauged watersheds

Article Open access 20 March 2024

Global peak water limit of future groundwater withdrawals

Article 22 April 2024

Accurate medium-range global weather forecasting with 3D neural networks

Article Open access 05 July 2023

Main

Monitoring variations in the global water cycle is crucial for understanding the Earth’s climate system¹. The long-term trends in ice-sheet melting² and freshwater availability³ can be investigated by analysing the water cycle over decades, while the short-term variations of the water cycle contain fruitful information for monitoring natural hazards like flood⁴ and drought⁵ events. To quantify variations in the global water cycle, total water storage (TWS), defined as the storage in all forms of water, has been implemented as an essential climate variable⁶. For decades, TWS has mainly been modelled by simulations from global hydrological models, including global hydrology and water resource models and land surface models⁷. The hydrological models can provide spatial variance and short-term temporal variations but suffer from providing reliable long-term trends, which indicate the climate and human-induced changes in water storage⁸. Since 2002, the Gravity Recovery and Climate Experiment (GRACE) and its follow-on (GRACE-FO) missions (hereafter GRACE) have provided us with a unique opportunity to monitor the changes in global TWS anomalies (TWSAs) by measuring gravity field variations^9,10,11. The satellite-measured TWSAs have unprecedented accuracy with global coverage due to the physical measurement principle and provide valuable information about the Earth’s climate system from a macro perspective^1,12,13,14.

Although GRACE products have been widely used, their coarse spatial resolution of about 3° is among the key factors limiting the applications in related fields, such as understanding water storage changes in small catchments¹⁴. The problem of low spatial resolution mainly comes from two origins as follows. First, the design of the orbit and the accuracy of the instruments inherently limit the possible spatial resolution^15,16. Second, postprocessing approaches are needed for obtaining meaningful signals^17,18 but they also attenuate the actual geophysical signals, especially the high-frequency signals^19,20,21. As a result, reconstructing these high-frequency geophysical signals is vital. To improve the resolution, incorporating additional information with higher spatial resolution is necessary. In the specific task of downscaling GRACE TWSAs, the most promising high-resolution information is contained in hydrological models and measurements. The hydrological models can directly simulate TWSAs, whereas other hydrological parameters such as precipitation provide valuable auxiliary information by considering the water balance^14,22. Many studies have proven the feasibility of assimilating data to downscale GRACE products over specific regions^23,24,25,26 but few of them applied the methods successfully on a global scale. The downscaling algorithms with global generalizability so far can provide GRACE TWSAs of 0.5° using partial least squares regression²⁷ or an ensemble Kalman filter assimilation pipeline²⁸ but still have some deficiencies, such as insufficient intrabasin variability preservation or interbasin mass conservation.

In recent years, deep learning has progressed rapidly and shown remarkable potential in modelling the Earth system^29,30,31. Many studies investigated the potential of applying deep learning or classical machine learning approaches to downscale GRACE measurements in a supervised learning context^32,33. The main challenge is the need for high-resolution ground truth of TWSAs, which are inaccessible. Therefore, the studies usually generate the training pairs by downsampling the high-resolution hydrological simulations into the same resolution as GRACE. Under the assumption that the relationship between predictors and targets holds in different resolution domains, they apply the trained model on the original high-resolution hydrological simulations to obtain the downscaled GRACE predictions with the same resolution^34,35,36. Another way to deal with the lack of ground truth is generating GRACE-like TWSAs by applying a Gaussian smoother on the hydrological simulations and training the model based on synthetic data pairs³⁷. In this context, the necessary assumption is that the captured relationship also holds for the real GRACE measurements. However, all the aforementioned studies only applied their methods locally to continentally, indicating the difficulty of applying proposed deep learning methods globally due to their inherent challenges about generalization³⁸. This study contributes to the downscaling problem and solves the two mentioned inadequacies of the existing deep learning algorithms, namely relieving assumptions between different domains and providing global generalizability. First, we developed a loss function based on the average deviations between the outputs and the GRACE TWSAs over a certain area and the similarity between the outputs and WaterGAP Hydrology Model (WGHM)³⁹ simulations (Fig. 1; Methods). Since the GRACE and WGHM TWSAs are part of the inputs, we do not need extra labels or certain assumptions for generating synthetic training pairs. As a result, the network parameters can be optimized in a self-supervised manner without explicit assumptions bridging the high–low spatial resolution domains or simulation–measurement domains. Second, our downscaled TWSAs inherit the global generalizability from both GRACE and WGHM TWSAs and cover all the land areas, including coastal areas and small islands, except for Greenland and Antarctica due to the deficiency of hydrological models^39,40. Our model is based on the principle of convolutional neural networks⁴¹, allowing us to consider spatial correlations between individual cells. Our analysis shows the impressive performance of the proposed algorithm in providing high-resolution TWSAs, which represent the high-resolution structures while keeping accurate mass conservation on the basin scale. Therefore, the water balance equation can be better closed in the basins smaller than the GRACE-effective resolution. Ultimately, we also discuss the potential usages of the obtained high-resolution TWSA product for analysing the impacts of climate change and anthropogenic activities on a local scale, as well as for natural hazard monitoring.

**Fig. 1: The structure of the designed deep learning model in this study with an enlarged structure of the used residual blocks.**

Global downscaled TWSA product with uncertainties

We determined a global high-resolution TWSA product from April 2002 to December 2019, covering all global land areas except for Greenland and Antarctica. To provide uncertainty information, we combined probabilistic deep learning principles and Monte Carlo simulations to estimate the uncertainties using deep ensembles⁴². An example of the highly resolved TWSA product is shown in Fig. 2 with six major river basins enlarged. On the global scale, the seasonal changes in water storage are consistent with the ones observed by the GRACE measurements (Supplementary Videos 1 and 2). The regional maps demonstrate the visibility of the main river systems with refined details, which are inherited from the WGHM simulations. Moreover, the results are visually smoother than the WGHM TWSAs, indicating effective noise reduction. The reduction of the outliers is not only beneficial to reducing the amount of abnormal pixels but also helpful for gaining accurate values of the other neighbouring pixels. Once the outliers are reduced, the magnitudes of the neighbouring pixels with actual signals are calibrated by considering the constraint of agreements with GRACE TWSAs over an area larger than the GRACE-effective resolution¹⁶. Overall, the downscaled TWSAs have a global median uncertainty of 7.3 mm. The regions where water storage changes rapidly are usually accompanied by relatively large uncertainties due to their higher TWSA values, such as the mainstreams shown in Fig. 2.

**Fig. 2: The downscaled TWSAs (top) and their uncertainties (bottom) in August 2008.**

High-resolution details with large-scale mass conservation

Since accurate high-resolution TWSA measurements on a global scale are inaccessible, we cannot directly evaluate the quality of the downscaled product. Therefore, we rely on the GRACE measurements and WGHM simulations in different contexts. First, we compared the downscaled TWSAs with the WGHM simulations to study whether the high-resolution structures are sufficiently reconstructed. To achieve this goal, we considered each pixel over the whole time span as a time series and computed the pixel-wise Pearson correlations between downscaled and WGHM TWSAs. The impact of the inaccurate magnitudes in the WGHM TWSAs is reduced since the Pearson correlation is invariant under changes in scale. As shown in Fig. 3a, the overall correlation is high with a median value of 0.80, meaning an improvement of 51% compared to GRACE TWSAs (0.53). The relatively low correlations are mainly found in arid regions such as the north of Africa, the Middle East and the middle of Asia. These low correlations are understandable due to weak hydrological signals in these arid regions since GRACE and WGHM are not sufficiently sensitive to accurately measure or simulate them. Furthermore, comparisons with independent satellite altimetry measurements show that the high-resolution information of the downscaled product is beneficial (Supplementary Table 1 and Supplementary Fig. 5).

**Fig. 3: Evaluation of the downscaled TWSAs from 2002 to the end of 2019.**

To evaluate the basin-wise quality of the downscaled TWSAs, we rely on the GRACE measurements since the GRACE TWSAs are considered to be accurate over the their effective resolution^16,43. We first averaged GRACE and downscaled TWSAs of each individual basin to generate basin-wise time series and compute the root mean square errors (RMSEs) between these two types of time series. The results are shown in Fig. 3b for 288 basins globally. The RMSEs are lower than 30 mm in most of the land areas, resulting in a global average RMSE of 21.9 mm weighted by basin areas. This value demonstrates the quality of the downscaled TWSAs since the typical GRACE uncertainties are 20–30 mm (ref. ¹⁵). Compared to WGHM simulations (weighted RMSE of 49.2 mm), our method provides an improvement of around 56%. Further analysis reveals that certain basins exhibit relatively high RMSEs, such as the basins in the glaciated areas of Alaska. Insufficiently modelling of glaciers and ice sheets is a known issue of the hydrological models^8,39. Since we trained one neural network for the whole globe, the network cannot handle the substantial differences between the hydrological simulations and GRACE measurements in these specific regions because the issue is inessential in other regions, constituting the major part of the samples. Therefore, we consider this issue as a trade-off between generalizability and performance in specific areas.

Long-term trends and seasonal variations

To understand the performance of the proposed deep learning model on the basin scale in more detail, we plot the time series of the average TWSAs over the six selected major basins (Fig. 4). In the Amazon basin, dominant seasonal signals are relatively homogeneous. Both downscaled and WGHM TWSAs agree well with GRACE measurements, although the WGHM simulations tend to underestimate the amplitudes. For the basins where average TWSAs are less stationary and with more inhomogeneous variations (Mississippi, Congo, Lena and Nile), the downscaled TWSAs agree better with the GRACE observations than the WGHM simulations. These phenomena may be related to the known limitation of hydrological simulations, as they cannot fully capture the trends in water changes⁸. For example, the downscaled TWSAs successfully model the steady decrease in the Lena basin after 2007 and the increase in the Nile basin since 2010, whereas the WGHM time series do not represent these trends properly. However, both downscaled and WGHM TWSAs underestimate the increasing trend in the Yangtze River basin after 2010. A potential reason is the active anthropogenic impacts in this basin, such as construction of dams⁴⁴. Although the WGHM model considers human intervention, it may perform unsatisfactorily in specific regions and result in a relatively big disparity from GRACE measurements³⁹. As a result, the typical relationship captured by our model based on global data may not be the best solution in these regions, which is again a trade-off between generalizability and performance in specific areas. Conversely, the good agreements between the downscaled TWSAs and the GRACE measurements in the Mississippi and Congo basins demonstrate the performance of our model.

**Fig. 4: The basin-wise average TWSA time series (mm) of six selected major river basins for the whole studied time span from 2002 to the end of 2019.**

To quantify the performance of our method in retaining long-term trend, annual and semi-annual signals, we estimated these signals from the three TWSA types over 160 basins larger than 200,000 km² and show the results in Fig. 5a. Here, we set the spatial threshold of 200,000 km² to obtain more reliable GRACE estimations as reference^16,39,43. The correlation between WGHM- and GRACE-derived trends is only 0.47, showing the major limitation of the hydrological models in capturing long-term trends. Our method substantially improves this situation and reaches a correlation of 0.94. This improvement reveals the effectiveness of the proposed algorithm for data assimilation. The network has learned to rely on the GRACE measurements to calibrate the average magnitudes over an area larger than the GRACE-effective resolution. Therefore, the trends contained in the GRACE measurements have been successfully inherited to the downscaled TWSAs. The largest trend differences are found in the glaciated basins of Alaska and their neighbouring basins (Fig. 5b), which related to the problem of insufficient modelling of glaciers and the resulting leakage errors again. The performance of WGHM on annual and semi-annual signal estimations is clearly better than estimating trends with correlations of 0.83 for both. Nevertheless, our method still outperforms it and reaches correlations of 0.97 and 0.95, respectively. The results prove the realistic temporal changes in our downscaled TWSAs on the basin scale, even though we treated every month separately and did not explicitly feed any temporal information to the model. Moreover, the phases of the annual signals of our downscaled product agree well with those measured by GRACE (Fig. 5c), with the only major differences occurring in the north of Africa where the hydrological signals are weak and the phases are therefore ambiguously defined. It is a remarkable improvement compared to WGHM simulations, which suffer from phase shifts of 2 or 3 months for many of the basins globally (Supplementary Figs. 3 and 4).

**Fig. 5: Temporal decomposition of the basin-wise signals for the 160 basins larger than 200,000 km².**

Closing water balance equation beyond the GRACE resolution

The downscaled TWSAs are beneficial for better closing the water balance equation in regions smaller than the GRACE-effective resolution. Figure 6 depicts the agreements between water changes inferred from the downscaled TWSAs and those computed from ERA5-Land water budget components⁴⁵ in level-4 basins⁴⁶. The downscaled TWSAs show a reasonable ability to close the water balance equation globally with positive Nash–Sutcliffe efficiency (NSE)⁴⁷ of 83% studied area, whereas GRACE and WGHM have positive NSE in 77% and 75%. Most of the negative values come from the arid regions where the hydrological signals are weak. The improvements of downscaled TWSAs compared to the original GRACE product strongly correlate with the basin sizes (Fig. 6b). The average improvement in NSE given by the downscaled TWSAs is 0.13 for basins larger than GRACE-effective resolution (200,000 km²)³⁹, 0.21 for basins between the effective resolution and limiting resolution (63,000 km²)¹⁶ and 1.21 for basins smaller than 63,000 km². Conversely, the benefits compared to WGHM TWSAs do not strongly correlate with basin sizes but probably come from the more accurate values obtained from data assimilation.

**Fig. 6: Agreements between total water storage changes inferred from downscaled TWSAs and those computed from water budget components.**

Discussion

The downscaled TWSAs provide special insight for studying the climate and anthropogenic impacts locally, which enables the development of targeted strategies for sustainable management of water resources. Figure 7a shows the comparison of the derived trends from the three sources of TWSAs. The high-resolution signals in the downscaled trends are inherited from WGHM simulations and their values are calibrated by considering the agreement with GRACE measurements on the effective resolution. For example, the downscaled trends clearly show the three hot spots of groundwater depletion in the United States (the High Plains aquifer, the Mississippi embayment and the Central Valley of California)^48,49. Among them, the notable negative trends in the High Plains aquifer are not observable in GRACE TWSAs since the positive trends caused by progress from dry to wet periods³ on the neighbouring pixels average them out. GRACE TWSAs indicate negligible to positive trends mainly caused by increasing precipitation in central and southern India^3,50, which do not fully represent the remarkable water storage declines in the regions with high population density and groundwater irrigation⁵¹. Caveat: we note that the downscaled product generally depends on the quality of used hydrological simulations. Prominent trends in the simulations will impact downscaled trends to a certain extent. We should interpret these signals with care since relatively larger uncertainties are expected as a result of imperfections in the hydrological simulations.

**Fig. 7: Comparison of three high-level products derived from three different sources of TWSAs.**

The downscaled TWSAs enable multiple downstream applications, including flood and drought monitoring on a local scale. To study the potential benefits of our downscaled TWSAs, we computed two well-known TWSA-based indices: the flooding potential index (FPI)⁴ and the drought severity index (DSI)⁵. We report the maximum FPI of the year 2008 in Fig. 7b to show the most notable signals in a single plot, whereas Fig. 7c depicts the monthly DSI of August 2008. FPI is sensitive to the value range of TWSAs since it relies on the relative relationship between the storage deficit and accumulated precipitation. The outliers present in WGHM simulations can ruin the relationship and cause unrealistic flood potential, resulting in an FPI map with far more high-risk regions than GRACE-derived FPI. Our proposed method clearly ameliorates this issue by suppressing the outliers, which allows us to obtain realistic high-resolution FPI with a reasonable visual agreement with GRACE-derived FPI, such as in the Congo basin and along the eastern coastline of North America. Similarly, the abnormal values in the WGHM simulations may cause opposite categories in DSI, resulting in noisy patterns (north of Africa) or abnormally underestimated severity (Australia). Again, the DSI derived from the downscaled TWSAs agree better with the GRACE-derived ones and open the window to monitor extreme environmental events with higher spatial resolution. However, the environmental monitoring indices derived from downscaled TWSAs inherited the same limitations that GRACE TWSAs have⁵², since the downscaled TWSAs only provide higher spatial resolution but do not provide longer observations.

Our current approach still has some limitations, which reveal the potential for further improving high-resolution TWSAs. First, more effort should be put into modelling the glaciers by considering additional measurements or specific models. Second, human intervention modelling is sophisticated and may need specific modifications. Including population, farming area and water usage statistics in the deep learning model may provide better results. Last, deep learning models have the potential to consider constraints based on the interactions between different forms of water, such as the interaction between terrestrial water and ocean or free water and glaciers. Nevertheless, with the preliminary study on the potential use of the downscaled TWSAs for monitoring water change and natural hazards, we demonstrate the significance of the proposed method. In practice, timeliness is a key factor. The training process of the proposed algorithm can be finished in around 3 days for the global model using consumer-level platforms (NVIDIA RTX3080TI), which is efficient considering the typical delay of GRACE monthly products. Therefore, the major limitations for rapidly delivering high-resolution TWSAs are the processing time demands of GRACE measurements and hydrological simulations. For applications that need higher temporal frequencies, such as daily to weekly solutions, we can benefit from the principle of online machine learning and the proposed model can be updated within 1 hour. The operational delivery of the downscaled product should be beneficial for the geoscience community and society, especially in the fields of hydrology, climate science, sustainable water management and hazard prediction.

Methods

GRACE mascon solution

The analysis centres of GRACE provide a variety of products regularly. One of the most user-friendly products is the mass concentration (mascon) solution, where the mass variations are directly estimated by explicitly relating the intersatellite range–rate measurements to the mascon formulation^53,54,55. Compared to the spherical harmonic solutions, the mascon solutions suffer less from leakage errors and can better separate the land and ocean signals⁵⁶. Therefore, the mascon solutions typically have a finer resolution for small regions⁵⁷. In this study, we used the mascon solutions provided by NASA Jet Propulsion Laboratory (JPL)⁵⁸. JPL has applied many data processing steps, including the replacement of C_2,0 coefficients with the solutions from satellite laser ranging⁵⁹, applying a Glacier Isostatic Adjustment (GIA) model⁶⁰, removal of the impacts of ocean, atmosphere and land ice masses. In the end, the remaining monthly gravity changes can provide a precise measure of mass redistribution in the Earth’s water cycle⁵⁷. Besides, the mean values from 2004.0 to 2009.999 are removed from the products to produce the TWSA in the form of EWH. The product used in this study is without the land-grid-scaling gain factors⁶¹ so that the data can provide us with information that is entirely independent of hydrological models.

Hydrological models and basin boundaries

WaterGAP is a global hydrological model that thoroughly describes water storage, usage and resources in all land areas except for Antarctica. WaterGAP v.2.2d, including WGHM, was published in 2021³⁹. WGHM comprehensively models daily water flows and water storage since it includes various forms of water, such as groundwater, rivers and snow. As one of the standard WGHM outputs, monthly TWS is provided at a high spatial resolution of 0.5° × 0.5°. This TWS product is the sum of the water storage in the canopy, snow, soil, groundwater, wetland, lake, reservoir and river storage³⁹. Owing to the modelling approach and high spatial resolution, we can observe the principal rivers from the simulated TWS. The direct comparison between the obtained TWSAs and the GRACE TWSAs is possible⁶². However, we must remove the average values from 2004.0 to 2009.999 to generate the TWSAs with the same temporal baseline. Since the WGHM-modelled TWS does not include assimilation of GRACE measurements, the obtained WGHM TWSAs and GRACE TWSAs are entirely independent. Although the global structures of the WGHM TWSAs are noticeably more finely resolved than GRACE products, the WGHM TWSAs also have two non-negligible limitations. First, the WGHM simulations are much noisier due to errors in the simulation procedure. Second, the values of the WGHM TWSAs are less accurate compared to the GRACE TWSAs since they are not based on real observations. WGHM clearly underestimates the mean annual TWSA amplitudes in more than half of 147 investigated river basins by more than 10%, which may relate to the fact that WaterGAP does not simulate glaciers³⁹.

In addition, we included hydrological information from the Global Land Data Assimilation System (GLDAS), which aims to assimilate satellite- and ground-based observational products to provide fields of land surface states and fluxes⁴⁰. Within this platform, many land surface models (LSM) are integrated. In this study, we used the data from the GLDAS Noah Land Surface Model L4 monthly 0.25° × 0.25° v.2.1 (refs. ^40,63). GLDAS v.2.1 is forced with a combination of model and observation data from 2000 to the present without assimilating GRACE measurements. Therefore, the products provided by GLDAS v.2.1 are additional data sources independent of the GRACE TWSAs. Considering the water balance equation, which describes the relationship between the changes of TWSA (TWSC) and precipitation (P), evapotranspiration (ET) and runoff (R):

$${{{\rm{TWSC}}}}=P-{\it{ET}}-R,$$

(1)

we exclusively focused on the three mentioned parameters. The data are downsampled into the resolution of 0.5° × 0.5° by computing the average of four neighbouring pixels to obtain the same resolution as the WGHM TWSAs.

The hydrological boundaries are obtained from HydroBasins⁴⁶, which represent a series of vectorized polygon layers that depict sub-basin boundaries at a global scale. All continents, except Antarctica, are included. The HydroBASINS product follows the Pfafstetter concept⁶⁴ and provides levels 1 to 12 globally. In this study, we focused on the HydroBasins level 1 (nine continents), 3 (292 sub-basins) and 4 (1,342 sub-basins) products.

Feature selection and preprocessing

One of the essential prerequisites for the success of the deep learning model is to determine a set of meaningful features that can represent the changes in TWSAs to a sufficient degree. First, the TWSAs from GRACE and WGHM are the most important features because the GRACE TWSAs have relatively accurate values over a larger region and those from WGHM provide information about high-resolution structures. Furthermore, we included precipitation, evapotranspiration and runoff as features inspired by equation (1). We should note that the GLDAS provides runoff split into three components: storm surface runoff, baseflow-groundwater runoff and snow melt. Therefore, we have five additional features in total. In the end, since multiple studies pointed out the correlation between the changes of TWS and geocoordinates¹ and the positive contribution of geocoordinates to global deep learning models⁶⁵, we also considered latitudes and longitudes as additional features. We normalized the features on the basis of their 0.01th percentiles and 99.99th percentiles to reduce the impacts of outliers.

The final step before we fed the data into our model is splitting the global area into small patches of the same size. We first found all the pixels on land by relying on the basin boundaries. Then, we considered them as the central pixels and generated a 16° × 16° patch around each central pixel, which means each patch has a size of 32 × 32 with a resolution of 0.5°. The patch size is a multiple of the coarse resolution of GRACE TWSAs (3° × 3°). A 16° × 16° patch contains more than 25 effective GRACE pixels, making the patch-wise average of GRACE TWSAs meaningful. Some patches near the coastlines unavoidably contain areas over oceans where the hydrological models do not provide any values. We filled these with the average values of valid pixels in the same patch. The patches were considered as images in our deep learning model and the nine features were put into nine channels.

Self-supervised data assimilation model

The lack of high-resolution ground truth impedes the application of supervised learning approaches to provide high-resolution TWSAs. All the mentioned deep learning-based studies artificially generated input–output pairs, which capacitates a supervised regression but always under the constraints of certain assumptions. To avoid strict assumptions about the input–output pairs and exploit the information from the available data sources as much as possible, we proposed a data assimilation model with a specifically designed loss function that allows self-supervised optimization. Our model is based on the principle of the convolutional neural networks⁴¹, which is a specific type of neural networks that uses convolution in place of general matrix multiplication in some layers. The convolutional operators, usually known as kernels, can extract high-level feature maps by considering the relative positional relationship between pixels or low-level features. In our specific case, the model can extract information about hydrological phenomena, such as water storage changes in waterbodies, from the values of individual pixels. We also applied the concept of residual learning⁶⁶. In this context, the network explicitly approximates the residual function ${{{\mathcal{F}}}}(x)={{{\mathcal{H}}}}(x)-x$, which is the difference between the original target function ${{{\mathcal{H}}}}(x)$ and the input x. The fitting of the residual function should not be more difficult than fitting the original target function itself due to the existence of the skip connection. As a result, a deeper model should have a training error no greater than its shallower counterpart⁶⁶. Batch normalization⁶⁷ is also included to reduce the sensitivity to the initialization and thereby improve the optimizing process.

Our model aims to assimilate the satellite observations and hydrological simulation by balancing the accurate values from GRACE observations over an area larger than their effective resolution and the high-resolution structures from the WGHM simulations. Therefore, the loss function is designed in the way that the outputs of our model are compared with both inputs, GRACE TWSAs and WGHM TWSAs. The first goal of our optimizing process is to let the values of the outputs be as close as the GRACE TWSAs over each patch. Since the GRACE measurements of individual 0.5° pixels are not representative, we computed the absolute error (AE) between the averaged GRACE TWSAs and the averaged predicted TWSAs over each patch:

$${{{{\rm{AE}}}}}_{{{{\rm{G}}}}}\left({{{{P}}}}_{{{{\rm{G}}}}},\hat{{{{P}}}}\right)=\left\vert \frac{1}{N}\mathop{\sum }\limits_{n=1}^{N}{p}_{{{{\rm{G}}}},n}-\frac{1}{N}\mathop{\sum }\limits_{n=1}^{N}{\hat{p}}_{n}\right\vert ,$$

(2)

where ${P}_{\rm{G}}$ and $\hat{{{{P}}}}$ denote the GRACE patches and predicted patches, respectively. Each patch includes N pixels with values denoted by p_G and $\hat{p}$ for GRACE and predicted patches. The second goal of our optimizing process is to learn the high-resolution structures from the WGHM TWSAs. For this purpose, we introduced the Pearson correlation coefficient (R) between the outputs and WGHM TWSAs to describe the similarity since it proves superior to other similarity metrics like structure similarity index (SSIM) as argued in another study⁶⁸ and confirmed in our tests. We introduced a second metric to enhance the measurements of structural similarity, namely the mean absolute error (MAE) between WGHM TWSAs and predicted TWSAs. Equations (3) and (4) show the corresponding definitions with p_W denoting the pixels of the WGHM patch ${P}_{\rm{W}}$ and ${\hat{p}}$ denoting the pixels of the output patch ${\hat{P}}$, respectively:

$$\begin{array}{l}{\rm{R}}({{{\it{P}}}}_{{{{\rm{W}}}}},\hat{{{\it{P}}}})=\frac{{\sum }_{m}{\sum }_{n}\left({{{{P}}}}_{{{{\rm{W}}}},mn}-{\overline{{{{P}}}}}_{{{{\rm{W}}}}}\right)\left({\hat{{{{P}}}}}_{mn}-\overline{\hat{{{{P}}}}}\,\right)}{\sqrt{\left({\sum }_{m}{\sum }_{n}{\left({{{{P}}}}_{{{{\rm{W}}}},mn}-{\overline{{{{P}}}}}_{{{{\rm{W}}}}}\right)}^{2}\right)\left({\sum }_{m}{\sum }_{n}{\left({\hat{{{{P}}}}}_{mn}-\overline{\hat{{{{P}}}}}\,\right)}^{2}\right)}}\end{array},$$

(3)

$${{{{\rm{MAE}}}}}_{{{{\rm{W}}}}}\left({{{{P}}}}_{{{{\rm{W}}}}},\hat{{{{P}}}}\right)=\frac{1}{N}\mathop{\sum }\limits_{n=1}^{N}\left\vert {p}_{{{{\rm{W}}}},n}-{\hat{p}}_{n}\right\vert .$$

(4)

Finally, we combined the proposed terms to achieve the two goals within the same optimizing process, leading to the following formulation of the loss function:

$${{{\mathcal{L}}}}\left({{{{P}}}}_{{{{\rm{G}}}}},{{{{P}}}}_{{{{\rm{W}}}}},\hat{{{{P}}}}\right)=\frac{1}{B}\mathop{\sum }\limits_{b = 1}^{B}\left\{{{{{\rm{AE}}}}}_{{{{\rm{G}}}}}({{{{P}}}}_{{{{\rm{G}}}}},\hat{{{{P}}}})+\left[1-{{{\rm{R}}}}\left({{{{P}}}}_{{{{\rm{W}}}}},\hat{{{{P}}}}\right)\right]\times {{{{\rm{MAE}}}}}_{{{{\rm{W}}}}}\left({{{{P}}}}_{{{{\rm{W}}}}},\hat{{{{P}}}}\right)\right\},$$

(5)

where B is the batch size. The reason for using both AE and MAE rather than the L2 metrics is that the L1 metrics are usually more robust against outliers. During our experiments, we observed that the GRACE-term and WGHM-term converge with similar magnitudes, indicating the ability of our model to balance the two terms without over-relying on any of them. Thus, there is no need to add more hyperparameters to weigh these two terms explicitly.

The detailed realization of the global model is shown in Fig. 1, including details of the applied residual blocks. The encoder contains three two-dimensional (2D) convolutional layers followed by a ReLU activation function $\left({{{\rm{ReLU}}}}({{{x}}})=\max (0,{{{x}}})\right)$ and three residual blocks. The encoding process is realized by the fact that each convolutional layer has a stride of 2 to reduce the size of the outputs and increase the receptive field. The convolutional layers have increasing numbers of kernels (16, 32 and 64) with a size of 3 to increase the latent information, namely the feature dimension. In the decoder, the feature maps have to be upsampled first. Here, we use 2D-bilinear upsampling layers to pre-upsample the feature maps and feed them into the 2D convolutional layers, followed by ReLU and residual blocks. Unlike the encoder, the convolutional layers have stride 1 to keep the size of outputs the same. They have 64, 32 and 16 kernels to reproduce the spatial information from the latent information. At the end of the architecture, another convolutional layer with kernel size 1 without activation function is designed so that it can project the final feature maps to the actual TWSA values. The resulting output size is the same as the input size (32 × 32). Once the outputs are generated, they are compared to the original GRACE and WGHM TWSAs to compute the loss using equation (5). To understand the benefits of our proposed model structure, we can analyse the two parts of the optimizing process separately. If we only consider the parts of the loss function related to the WGHM terms (bottom part in Fig. 1), our model is similar to an autoencoder^69,70, which aims to reconstruct the WGHM TWSAs while reducing noise. Therefore, the number of outliers is remarkably reduced. Then, optimizing the part of the loss function related to the GRACE terms is like a regression problem (top part in Fig. 1), aiming to calibrate the values of the reconstructed high-resolution TWSAs on the patch scale. Our proposed loss function combines these two principles so that they are optimized jointly.

To tune the model structure and hyperparameters, we started with experiments over four river basins (Amazon, Congo, Mississippi and Lena). At this stage, we randomly split the data of 158 months until the end of 2016 into training (70%, 110 months), validation (15%, 24 months) and test sets (15%, 24 months). By monitoring the training process, we observed that the models usually converged between training epochs 120 and 150. The models did not suffer from overfitting issues and had similar performance on data from 2017 to the end of 2019. Therefore, we decided to train our models using all 180 months of data to obtain the same quality for the whole time interval (April 2002 to December 2019). We used TensorFlow v.2.6.0 (ref. ⁷¹) to implement all the networks as well as the training, validation and test processes. The optimizer is Adam⁷² with default settings and a batch size of 512. The training process is efficient and can be finished in around 3 days for the global model using consumer-level platforms (NVIDIA RTX 3080TI).

Estimating uncertainties based on deep ensemble

To quantify the uncertainties of downscaled TWSAs, we followed the principle of deep ensemble learning⁴² and trained five independent deep learning models from scratch with different random initial states. As a relatively small number of independent models is sufficient for modelling the predictive uncertainty⁷³, we computed our ensemble results (${\mu }_{* }$) and uncertainties (${\sigma }_{* }$) as:

$${\mu }_{* }=\frac{1}{M}\mathop{\sum }\limits_{m=1}^{M}{\mu }_{{\theta }_{m}},$$

(6)

$${\sigma }_{* }=\sqrt{\frac{1}{M}\mathop{\sum }\limits_{m=1}^{M}\left({\mu }_{{\theta }_{m}}^{2}+{\sigma }_{{\theta }_{m}}^{2}\right)-{\mu }_{* }^{2}},$$

(7)

where ${\mu }_{{\theta }_{m}}$ and ${\sigma }_{{\theta }_{m}}$ are the predicted TWSAs and associated uncertainties of model m and M is the total number of models, namely five in this study. However, due to the specific design of the loss function, ${\sigma }_{{\theta }_{m}}$ cannot be directly estimated by deep learning models. To overcome this issue, we used Monte Carlo simulations. We sampled 20 sets of GRACE inputs randomly on the basis of their uncertainties for each model m to estimate ${\sigma }_{{\theta }_{m}}$. Ultimately, the ensemble uncertainty σ_* was estimated from five independent deep learning models, with 100 Monte Carlo simulation runs in total. We note that the uncertainties of WGHM simulations and GLDAS inputs are unavailable and not considered. Therefore, the uncertainties reported in this study may be underestimated.

Closure of water balance equation

The closure of the water balance equation is realized by comparing the left side of equation (1) computed from the derivatives of TWSA products and the right side of equation (1) computed from water budget components. We chose the precipitation, evapotranspiration and runoff products from ERA5-Land⁴⁵ to provide independent external evaluation since they are not considered in the deep learning model. Moreover, they are proven to agree well with GRACE measurements²². To obtain homogeneous time steps for water changes, we interpolated the GRACE and our downscaled TWSAs to the middle of each month using PCHIP⁷⁴ interpolation. The GRACE and GRACE-FO eras were dealt with separately to avoid a biased interpolation caused by the gap of 1 year. The TWSCs were obtained by centred finite differences of TWSAs:

$${{{\rm{TWSC}}}}(t)=\frac{{{{\rm{TWSA}}}}({{{t}}}+1)-{{{\rm{TWSA}}}}({{{t}}}-1)}{2{{\Delta }}t},$$

(8)

where Δt indicates 1 month. The P, ET and R time series were further smoothed to reduce potential high-frequency artefacts introduced by the finite differences⁷⁵:

$$\widetilde{X}(t)=\frac{1}{4} X(t-1)+\frac{1}{2}X(t)+\frac{1}{4}X(t+1),$$

(9)

where X denotes P, ET or R. To this end, we received two types of TWSCs: TWSC_GRACE estimated from GRACE measurements or other TWSA product and TWSC_budget estimated from P, ET and R. The agreement of these two TWSCs was quantified by computing the NSE⁴⁷:

$${{{\rm{NSE}}}}=1-\frac{\frac{1}{T}\mathop{\sum }\nolimits_{t = 1}^{T}{\left({{{{\rm{TWSC}}}}}_{{{{\rm{budget}}}}}(t)-{{{{\rm{TWSC}}}}}_{{{{\rm{GRACE}}}}}(t)\right)}^{2}}{\frac{1}{T}\mathop{\sum }\nolimits_{t = 1}^{T}{\left({{{{\rm{TWSC}}}}}_{{{{\rm{GRACE}}}}}(t)-{\overline{{{{\rm{TWSC}}}}}}_{{{{\rm{GRACE}}}}}(t)\right)}^{2}}.$$

(10)

TWSA-derived environmental monitoring indices

To demonstrate the applicability of the highly resolved TWSAs, we relied on the concepts of environmental monitoring indices introduced in previous studies, including FPI⁴ and DSI⁵. The motivation for FPI is the different capacities of storing water in each cell. We obtained the water deficit S_Def(t) = S_Max − S(t − 1) by computing the difference between the water storage of the previous month $\left(S(t-1)\right)$ and historic maxima S_Max for each individual cell. Under the assumption that the high flooding risks are caused by extreme precipitation, we computed the flood potential index F(t) = P_Mon − S_Def(t), where P_Mon(t) = P(t) × dt denotes the monthly precipitation (ERA5L in this study). In the end, the FPI was normalized to the range 0 to 1: ${{{\rm{FPI}}}}=F(t)/\max \left[F(t)\right]$. During this approach, the gaps within the TWSA records may cause incorrect water deficit due to the integral over time for the monthly precipitation. DSI is a standardized metric and also considers the different characteristics of each cell. First, we computed the average values (${\overline{{{{\rm{TWSA}}}}}}_{m}$) and standard deviations (σ_m) of each month m over all the years with valid TWSA records. Then, we computed standardized anomalies for the month m and year y as $({{{{\rm{TWSA}}}}}_{m,y}-{\overline{{{{\rm{TWSA}}}}}}_{m})/{\sigma }_{m}$. In the end, the standardized anomalies were place in 11 categories on the basis of the thresholds set by ranking percentiles (30%, 20%, 10%, 5% and 2%, two-sided)⁷⁶.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The downscaled TWSA product generated in this study is publicly available at https://doi.org/10.3929/ethz-b-000648738 (ref. ⁷⁷). The raw data used in this study are publicly available. The JPL GRACE mason product is available at https://grace.jpl.nasa.gov/data/get-data/. The WGHM TWSA simulations are available at https://doi.org/10.1594/PANGAEA.948461 (ref. ⁷⁸). The GLDAS Noah products are available at https://disc.gsfc.nasa.gov/. The ERA5-Land products are available at https://cds.climate.copernicus.eu. The HydroBasin data are available at https://www.hydrosheds.org/products/hydrobasins. Source data are provided with this paper.

Code availability

The core codes for the deep learning model used including the trained model and weights are available at https://gitlab.ethz.ch/spacegeodesy_public/grace_seda. Further codes for analysing and producing other results are available from the corresponding authors upon request.

References

Tapley, B. D. et al. Contributions of GRACE to understanding climate change. Nat. Clim. Change 9, 358–369 (2019).
Article ADS Google Scholar
Kim, B.-H. et al. Antarctic ice mass change (2003–2016) jointly estimated by satellite gravimetry and altimetry. J. Geophys. Res. 127, e2021JB023297 (2022).
Article ADS Google Scholar
Rodell, M. et al. Emerging trends in global freshwater availability. Nature 557, 651–659 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Reager, J. T. & Famiglietti, J. S. Global terrestrial water storage capacity and flood potential using GRACE. Geophys. Res. Lett. https://doi.org/10.1029/2009GL040826 (2009).
Zhao, M., Velicogna, I. & Kimball, J. S. Satellite observations of regional drought severity in the continental United States using GRACE-based terrestrial water storage changes. J. Climate 30, 6297–6308 (2017).
Article ADS Google Scholar
Zemp, M. et al. The GCOS 2022 Implementation Plan (GCOS-244) (WMO, 2022).
Bierkens, M. F. Global hydrology 2015: state, trends and directions. Water Resour. Res. 51, 4923–4947 (2015).
Article ADS Google Scholar
Scanlon, B. R. et al. Global models underestimate large decadal declining and rising water storage trends relative to GRACE satellite data. Proc. Natl Acad. Sci. USA 115, E1080–E1089 (2018).
Article CAS PubMed PubMed Central Google Scholar
Tapley, B. D., Bettadpur, S., Watkins, M. & Reigber, C. The gravity recovery and climate experiment: mission overview and early results. Geophys. Res. Lett. https://doi.org/10.1029/2004GL019920 (2004).
Wahr, J., Swenson, S., Zlotnicki, V. & Velicogna, I. Time-variable gravity from GRACE: first results. Geophys. Res. Lett. https://doi.org/10.1029/2004GL019779 (2004).
Tapley, B. D., Bettadpur, S., Ries, J. C., Thompson, P. F. & Watkins, M. M. GRACE measurements of mass variability in the Earth system. Science 305, 503–505 (2004).
Article ADS CAS PubMed Google Scholar
Rodell, M. & Reager, J. T. Water cycle science enabled by the GRACE and GRACE-FO satellite missions. Nat. Water 1, 47–59 (2023).
Article Google Scholar
Reager, J. et al. A decade of sea level rise slowed by climate-driven hydrology. Science 351, 699–703 (2016).
Article ADS CAS PubMed Google Scholar
Chen, J. et al. Applications and challenges of GRACE and GRACE follow-on satellite gravimetry. Surv. Geophys. https://doi.org/10.1007/s10712-021-09685-x (2022).
Wahr, J., Swenson, S. & Velicogna, I. Accuracy of GRACE mass estimates. Geophys. Res. Lett. https://doi.org/10.1029/2005GL025305 (2006).
Vishwakarma, B. D., Devaraju, B. & Sneeuw, N. What is the spatial resolution of GRACE satellite products for hydrology? Remote Sens. 10, 852 (2018).
Article ADS Google Scholar
Wahr, J., Molenaar, M. & Bryan, F. Time variability of the Earth’s gravity field: hydrological and oceanic effects and their possible detection using GRACE. J. Geophys. Res. 103, 30205–30229 (1998).
Article ADS Google Scholar
Swenson, S. & Wahr, J. Post-processing removal of correlated errors in GRACE data. Geophys. Res. Lett. https://doi.org/10.1029/2005GL025285 (2006).
Klees, R., Zapreeva, E., Winsemius, H. & Savenije, H. The bias in GRACE estimates of continental water storage variations. Hydrol. Earth Syst. Sci. 11, 1227–1241 (2007).
Article ADS Google Scholar
Velicogna, I. & Wahr, J. Time-variable gravity observations of ice sheet mass balance: precision and limitations of the GRACE satellite data. Geophys. Res. Lett. 40, 3055–3063 (2013).
Article ADS Google Scholar
Vishwakarma, B. D., Devaraju, B. & Sneeuw, N. Minimizing the effects of filtering on catchment scale GRACE solutions. Water Resour. Res. 52, 5868–5890 (2016).
Article ADS Google Scholar
Lehmann, F., Vishwakarma, B. D. & Bamber, J. How well are we able to close the water budget at the global scale? Hydrol. Earth Syst. Sci. 26, 35–54 (2022).
Article ADS Google Scholar
Houborg, R., Rodell, M., Li, B., Reichle, R. & Zaitchik, B. F. Drought indicators based on model-assimilated Gravity Recovery and Climate Experiment (GRACE) terrestrial water storage observations. Water Resour. Res. https://doi.org/10.1029/2011WR011291 (2012).
Eicker, A., Schumacher, M., Kusche, J., Döll, P. & Schmied, H. M. Calibration/data assimilation approach for integrating GRACE data into the WaterGAP Global Hydrology Model (WGHM) using an ensemble Kalman filter: first results. Surv. Geophys. 35, 1285–1309 (2014).
Article ADS Google Scholar
Gemitzi, A., Koutsias, N. & Lakshmi, V. A spatial downscaling methodology for GRACE Total water storage anomalies using GPM IMERG precipitation estimates. Remote Sens. 13, 5149 (2021).
Article ADS Google Scholar
Tourian, M. J. et al. A copula-supported Bayesian framework for spatial downscaling of GRACE-derived terrestrial water storage flux. Remote Sens. Environ. 295, 113685 (2023).
Article Google Scholar
Vishwakarma, B. D., Zhang, J. & Sneeuw, N. Downscaling GRACE total water storage change using partial least squares regression. Sci. Data 8, 95 (2021).
Gerdener, H., Kusche, J., Schulze, K., Döll, P. & Klos, A. The global land water storage data set release 2 (GLWS2.0) derived via assimilating GRACE and GRACE-FO data into a global hydrological model. J. Geodesy 97, 73 (2023).
Article ADS Google Scholar
Reichstein, M. et al. Deep learning and process understanding for data-driven Earth system science. Nature 566, 195–204 (2019).
Article ADS CAS PubMed Google Scholar
Irrgang, C. et al. Towards neural Earth system modelling by integrating artificial intelligence in Earth system science. Nat. Mach. Intell. 3, 667–674 (2021).
Article Google Scholar
Yu, S. & Ma, J. Deep learning for geophysics: current and future trends. Rev. Geophys. 59, e2021RG000742 (2021).
Article ADS MathSciNet Google Scholar
Sabzehee, F., Amiri-Simkooei, A., Iran-Pour, S., Vishwakarma, B. & Kerachian, R. Enhancing spatial resolution of GRACE-derived groundwater storage anomalies in Urmia catchment using machine learning downscaling methods. J. Environ. Manag. 330, 117180 (2023).
Article CAS Google Scholar
Seyoum, W. M., Kwon, D. & Milewski, A. M. Downscaling GRACE TWSA data into high-resolution groundwater level anomaly using machine learning-based models in a glacial aquifer system. Remote Sens. 11, 824 (2019).
Article ADS Google Scholar
He, H. et al. Deep learning approaches to spatial downscaling of GRACE terrestrial water storage products using EALCO model over Canada. Can. J. Remote Sens. 47, 657–675 (2021).
Article ADS Google Scholar
Ali, S. et al. Constructing high-resolution groundwater drought at spatio-temporal scale using GRACE satellite data based on machine learning in the Indus Basin. J. Hydrol. 612, 128295 (2022).
Article Google Scholar
Yin, W. et al. Improving the spatial resolution of GRACE-based groundwater storage estimates using a machine learning algorithm and hydrological model. Hydrogeol. J. 30, 947–963 (2022).
Article ADS Google Scholar
Irrgang, C., Saynisch-Wagner, J., Dill, R., Boergens, E. & Thomas, M. Self-validating deep learning for recovering terrestrial water storage from gravity and altimetry measurements. Geophys. Res. Lett. 47, e2020GL089258 (2020).
Article ADS Google Scholar
Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (MIT Press, 2016).
Müller Schmied, H. et al. The global water resources and use model WaterGAP v2.2d: model description and evaluation. Geosci. Model Dev. 14, 1037–1079 (2021).
Article ADS Google Scholar
Rodell, M. et al. The global land data assimilation system. Bull. Am. Meteorol. Soc. 85, 381–394 (2004).
Article ADS Google Scholar
LeCun, Y., Bottou, L., Bengio, Y. & Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998).
Article Google Scholar
Lakshminarayanan, B., Pritzel, A. & Blundell, C. Simple and scalable predictive uncertainty estimation using deep ensembles. in Advances in Neural Information Processing Systems 30 (eds Guyon, I. et al.) 6402–6413 (NIPS, 2017).
Boergens, E. et al. Uncertainties of GRACE-based terrestrial water storage anomalies for arbitrary averaging regions. J. Geophys. Res. 127, e2021JB022081 (2022).
Article ADS Google Scholar
Zhang, A. T. & Gu, V. X. Global Dam Tracker: a database of more than 35,000 dams with location, catchment and attribute information. Sci. Data 10, 111 (2023).
Article PubMed PubMed Central Google Scholar
Hersbach, H. et al. The ERA5 global reanalysis. Q. J. R. Meteorol. Soc. 146, 1999–2049 (2020).
Article ADS Google Scholar
Lehner, B. & Grill, G. Global river hydrography and network routing: baseline data and new approaches to study the world’s large river systems. Hydrol. Proc. 27, 2171–2186 (2013).
Article Google Scholar
Nash, J. E. & Sutcliffe, J. V. River flow forecasting through conceptual models part I—a discussion of principles. J. Hydrol. 10, 282–290 (1970).
Article ADS Google Scholar
Konikow, L. F. Long-term groundwater depletion in the United States. Groundwater 53, 2–9 (2015).
Article CAS Google Scholar
Scanlon, B. R. et al. Groundwater depletion and sustainability of irrigation in the US High Plains and Central Valley. Proc. Natl Acad. Sci. USA 109, 9320–9325 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Asoka, A., Gleeson, T., Wada, Y. & Mishra, V. Relative contribution of monsoon precipitation and pumping to changes in groundwater storage in India. Nat. Geosci. 10, 109–117 (2017).
Article ADS CAS Google Scholar
Dangar, S., Asoka, A. & Mishra, V. Causes and implications of groundwater depletion in India: a review. J. Hydrol. 596, 126103 (2021).
Article Google Scholar
Vishwakarma, B. D. Monitoring droughts from GRACE. Front. Environ. Sci. 8, 584690 (2020).
Article Google Scholar
Watkins, M. M., Wiese, D. N., Yuan, D.-N., Boening, C. & Landerer, F. W. Improved methods for observing Earth’s time variable mass distribution with GRACE using spherical cap mascons. J. Geophys. Res. 120, 2648–2671 (2015).
Article ADS Google Scholar
Save, H., Bettadpur, S. & Tapley, B. D. High-resolution CSR GRACE RL05 mascons. J. Geophys. Res. 121, 7547–7569 (2016).
Article ADS Google Scholar
Luthcke, S. B. et al. Antarctica, Greenland and Gulf of Alaska land-ice evolution from an iterated GRACE global mascon solution. J. Glaciol. 59, 613–631 (2013).
Article ADS Google Scholar
Scanlon, B. R. et al. Global evaluation of new GRACE mascon products for hydrologic applications. Water Resour. Res. 52, 9412–9429 (2016).
Article ADS Google Scholar
Landerer, F. W. & Cooley, S. S.Gravity Recovery and Climate Experiment Follow-on (GRACE-FO) Level-3 Data Product User Handbook (Jet Propulsion Laboratory, California Institute of Technology, 2021).
Wiese, D. N., Dah-Ning, Boening, C., Landerer, F. W. & Watkins, M. M. JPL GRACE Mascon Ocean, Ice and Hydrology Equivalent Water Height RL06 CRI Filtered Version 02 (JPL, accessed 27 February 2022); https://doi.org/10.5067/TEMSC-3JC62
Cheng, M., Ries, J. C. & Tapley, B. D. Variations of the Earth’s figure axis from satellite laser ranging and GRACE. J. Geophys. Res. https://doi.org/10.1029/2010JB000850 (2011).
Richard Peltier, W., Argus, D. F. & Drummond, R. Comment on “An assessment of the ICE-6G_C (VM5a) glacial isostatic adjustment model" by Purcell et al. J. Geophys. Res. 123, 2019–2028 (2018).
Article ADS Google Scholar
Wiese, D. N., Landerer, F. W. & Watkins, M. M. Quantifying and reducing leakage errors in the JPL RL05M GRACE mascon solution. Water Resour. Res. 52, 7490–7502 (2016).
Article ADS Google Scholar
Angermann, D., Pail, R., Seitz, F. & Hugentobler, U. Mission Earth (Springer, 2022).
Beaudoing, H. & Rodell, M. GLDAS Noah Land Surface Model L4 monthly 0.25 × 0.25 degree V2.1 (GES DISC, accessed 17 March 2022); https://doi.org/10.5067/SXAVCZFAQLNO
Verdin, K. & Verdin, J. A topological system for delineation and codification of the Earth’s river basins. J. Hydrol. 218, 1–12 (1999).
Article ADS Google Scholar
Lang, N., Jetz, W., Schindler, K. & Wegner, J. D. A high-resolution canopy height model of the Earth. Nat. Ecol. Evol. https://doi.org/10.1038/s41559-023-02206-6 (2023).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. in Proc. IEEE Conf. Computer Vision and Pattern Recognition 770–778 (IEEE, 2016).
Ioffe, S. & Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In Proc. 32nd Int. Conf. Machine Learning Vol. 37 (eds Bach, F. & Blei, D.) 448–456 (PMLR, 2015).
Starovoytov, V., Eldarova, E. & Iskakov, K. T. Comparative analysis of the SSIM index and the pearson coefficient as a criterion for image similarity. Eur. J. Math. Comput. Appl. 8, 76–90 (2020).
Google Scholar
Bourlard, H. & Kamp, Y. Auto-association by multilayer perceptrons and singular value decomposition. Biol. Cybernet. 59, 291–294 (1988).
Article MathSciNet CAS Google Scholar
Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006).
Article ADS MathSciNet CAS PubMed Google Scholar
Abadi, M. et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems Software (tensorflow.org, 2015).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at https://arxiv.org/abs/1412.6980 (2014).
Ovadia, Y. et al. Can you trust your model’s uncertainty? Evaluating predictive uncertainty under dataset shift. in Advances in Neural Information Processing Systems 32, 13991–14002 (2019).
Google Scholar
Fritsch, F. N. & Carlson, R. E. Monotone piecewise cubic interpolation. SIAM J. Numeric. Anal. 17, 238–246 (1980).
Article ADS MathSciNet Google Scholar
Landerer, F. W., Dickey, J. O. & Güntner, A. Terrestrial water budget of the Eurasian pan-Arctic from GRACE satellite measurements during 2003–2009. J. Geophys. Res. https://doi.org/10.1029/2010JD014584 (2010).
Svoboda, M. et al. The drought monitor. Bull. Am. Meteorolog. Soc. 83, 1181–1190 (2002).
Article ADS Google Scholar
Gou, J. & Soja, B. GRACE-SeDA: A Global Total Water Storage Anomaly Product with a Spatial Resolution of 0.5 Degrees From Self-Supervised Data Assimilation (ETH Research Collection, 2023); https://doi.org/10.3929/ethz-b-000648738
Müller Schmied, H. et al. The global water resources and use model WaterGAP v2.2d—alternative model output driven with gswp3-w5e5. PANGAEA https://doi.org/10.1594/PANGAEA.948461 (2022).

Download references

Acknowledgements

We thank the Jet Propulsion Laboratory, Goddard Space Flight Center, the WaterGap research group and the HydroSHEDS research group, who provide the valuable data publicly available. Special thanks to H. Müller Schmied, who kindly provided us with support in handling the WGHM data with clear explanations.

Funding

Open access funding provided by Swiss Federal Institute of Technology Zurich.

Author information

Authors and Affiliations

Institute of Geodesy and Photogrammetry, ETH Zurich, Zurich, Switzerland
Junyang Gou & Benedikt Soja

Authors

Junyang Gou
View author publications
You can also search for this author in PubMed Google Scholar
Benedikt Soja
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.G. implemented the experiments, did the analysis and prepared the original paper. B.S. proposed the topic, supervised the study and revised the paper. Both of the authors worked on the theoretical considerations, discussed the results and contributed to the final paper.

Corresponding author

Correspondence to Junyang Gou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Water thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–6, Table 1 and Discussion.

Reporting Summary

Supplementary Video 1

The animation of downscaled TWSAs.

Supplementary Video 2

The animation of estimated uncertainties of downscaled TWSAs.

Source data

Source Data Fig. 2

Downscaled TWSAs and their uncertainties in 08.2008.

Source Data Fig. 3

Pixel-wise correlation and basin-wise RMSE.

Source Data Fig. 4

Basin-wise average TWSAs of the six selected basins.

Source Data Fig. 5

Temporal decomposition, basin-wise trend differences and phase shifts.

Source Data Fig. 6

Basin-wise NSE computed from downscaled TWSAs and the differences compared to GRACE and WGHM.

Source Data Fig. 7

Pixel-wise trends, FPI and DSI estimated from the three products.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gou, J., Soja, B. Global high-resolution total water storage anomalies from self-supervised data assimilation using deep learning algorithms. Nat Water 2, 139–150 (2024). https://doi.org/10.1038/s44221-024-00194-w

Download citation

Received: 13 February 2023
Accepted: 05 January 2024
Published: 12 February 2024
Issue Date: February 2024
DOI: https://doi.org/10.1038/s44221-024-00194-w

This article is cited by

Defining thresholds to protect groundwater-dependent vegetation
- Dylan J. Irvine
- Richard A. Crabbe
Nature Water (2024)
Learning to downscale satellite gravimetry data through artificial intelligence
- Alexander Sun
Nature Water (2024)