Daily GRACE satellite data evaluate short-term hydro-meteorological fluxes from global atmospheric reanalyses

Changes in terrestrial water storage as observed by the satellite gravity mission GRACE (Gravity Recovery and Climate Experiment) represent a new and completely independent way to constrain the net flux imbalance in atmospheric reanalyses. In this study, daily GRACE gravity field changes are used for the first time to investigate high-frequency hydro-meteorological fluxes over the continents. Band-pass filtered water fluxes are derived from GRACE water storage time series by first applying a numerical differentiation filter and subsequently a high-pass filter to isolate fluxes at periods between 5 and 30 days, corresponding to typical time-scales of weather system persistence at moderate latitudes. By comparison with the latest atmospheric reanalysis ERA5 of the European Centre for Medium-Range Weather Forecasts (ECMWF), we show that daily GRACE gravity field models contain realistic high-frequency water flux information. Furthermore, GRACE-derived water fluxes can clearly identify improvements realized within ERA5 over its direct predecessor ERA-Interim, particularly in equatorial and temperate climate zones. The documented improvements are in good agreement with rain gauge validation, but GRACE also identifies three distinct regions (Sahel Zone, Okavango Catchment, Kimberley Plateau) with a slight degradation of net-fluxes in ERA5 with respect to ERA-Interim, thereby highlighting the potential added value of non-standard daily GRACE gravity series for hydro-meteorological monitoring purposes.

Global numerical reanalyses of the atmosphere 1-3 , oceans 4 , land surface 5 and other components of the Earth system are essential tools for climate monitoring and research. Atmospheric reanalyses aim at merging several observational records, while applying physically consistent modeling in an analysis scheme that does not change over time. Recent reanalysis efforts have focused in particular on a better representation of atmospheric water fluxes and components of the terrestrial water cycle, quantities that are specifically relevant for society in terms of water availability for hydroelectricity and human consumption. Validation is typically performed for the individual fluxes utilizing globally distributed observations of precipitation 6 , evapotranspiration 7 , and lateral runoff 8 . The numerical modeling of clouds and thus atmospheric water fluxes is challenging 9 but also highly relevant for, e.g., agricultural applications, and has seen rapid progress during the most recent decade with the advent of new satellite observing techniques and their associated data assimilation methodologies 10 .
Satellite gravimetry as realized with the satellite missions GRACE 11 (2002-2017) and GRACE-FO 12 (since 2018) has brought fundamentally new insight into mass transport and mass redistribution processes of the Earth system 13 . Gravimetry is the only remote sensing concept that provides quantitative estimates of water mass changes at or beyond the Earth's surface and has thus contributed unique and highly accurate estimates of ice mass loss from continental ice sheets 14 and mountain glaciers 15 ; seasonal terrestrial water storage changes 16 and groundwater depletion 17,18 ; as well as the contribution of net-inflow of water into the ocean basins to global mean sea-level rise 19 . Time variations in terrestrial water storage as observed by the GRACE mission are closely related to atmospheric net-fluxes of precipitation, evapotranspiration and lateral runoff via the water balance equation 20 (see Methods section). From monthly GRACE gravity fields, net-fluxes accumulated over 30 days have been compared to flux estimates from different global and regional reanalyses [21][22][23][24] which allowed for the identification of long-term flux biases and even trends 25 .
During recent years, mathematical methods were developed to parameterize global gravity field variations with a temporal sampling of 10 days 26 , 1 week 27 or even 24 hours [28][29][30][31][32][33] . Daily sampled gravity fields were successfully applied to study high-frequency wind-driven sea-level changes 34 , short-term transport variations of the Antarctic Circumpolar Current 35 , and the characteristics of major flood events in the Ganges-Brahmaputra catchment 36 .
In view of further progress in daily gravity field modeling from GRACE 37,38 (see Methods section), we demonstrate in this paper that band-pass filtered atmospheric net-fluxes from satellite gravimetry contain realistic high-frequency signals over the continents and can be used to document quality differences between different atmospheric reanalyses. To this end we first discuss global signals in band-pass filtered GRACE flux data and the latest reanalysis ERA5 39 of the European Centre for Medium-Range Weather Forecasts (ECMWF) in Section 2, followed by a detailed comparison of the time series for an exemplary grid cell (Section 3). The potential of GRACE to identify improvements realized in ERA5 over its direct predecessor ERA-Interim 40 is shown in Section 4, with a particular focus on periods between 5 and 30 days that are exclusively accessible from a new non-standard GRACE gravity series with daily sampling. A comparison with the more conventional approach of evaluating reanalysis time series against rain gauge observations shows good consistency with the GRACE results but also highlights the potential added value of satellite gravimetry for hydro-meteorological monitoring purposes (Section 5).

Global Patterns of Net-Flux Estimates
Although we eventually aim at using GRACE for the evaluation of quality differences between two generations of reanalyses, we first compare GRACE with the latest ECMWF reanalysis ERA5 to investigate whether they see comparable high-frequency signals. To this end, band-pass filtered time series of water fluxes were derived from GRACE gravity field models (see Methods section and Supplementary Information S1), and their root mean square (RMS) signal variability is displayed together with equally filtered atmospheric net-fluxes from ERA5 in Fig. 1.
We note entirely disparate patterns over the oceans (Fig. 1a,b) related to the fact that short-term fluctuations in sea-level coherent with a mass change are dominated by surface winds instead of atmospheric water fluxes. Time variable gravity observations over the oceans thus provide information about the wind-driven 41 and regionally also the thermohaline circulation 42 , but are not sensitive to atmospheric water fluxes: precipitation and evaporation mainly affect the density of the near-surface layer of the oceans and any horizontal pressure gradients induced are almost fully compensated with depth in line with the thermal wind equations.
However, over the continents away from the coasts (Fig. 1c,d), we indeed find rather similar features in the fluxes from both ERA5 and GRACE. Estimates from GRACE in arid regions are substantially higher than the values from the reanalysis, reflecting the current level of GRACE observation and analysis noise. For the Sahara desert region (see box outline in Fig. 1), where no substantial day-to-day flux signals can be expected, we find maximum temporal RMS values up to 1.8 mm/day and an area-weighted mean of 1.3 mm/day. Comparative estimates for GRACE releases made available in 2014 43 and 2016 44 (both with a mean RMS of 1.7 mm/day in the Sahara desert region) underline the recent progress in understanding GRACE sensor characteristics. This particularly includes a better processing of sensor data from accelerometers and star cameras 45,46 as well as improved tidal 47 and non-tidal de-aliasing models 48 , which are commonly regarded as the main error sources in satellite gravimetry from GRACE and GRACE-FO 12 .
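The noise estimate above combines two simple statistics: a temporal RMS per grid cell and an area-weighted mean over a region. The following is a minimal sketch of that calculation, not the authors' processing code. It assumes a regular latitude-longitude grid, for which cell area is approximately proportional to the cosine of the cell-centre latitude; the function names and the toy values are hypothetical.

```python
import math

def temporal_rms(series):
    """Root mean square of one grid cell's flux time series (mm/day)."""
    return math.sqrt(sum(x * x for x in series) / len(series))

def area_weighted_mean(values, lats_deg):
    """Mean of per-cell values, weighted by cos(latitude) of each cell centre.

    ASSUMPTION: regular lat-lon grid, so cell area scales with cos(latitude).
    """
    weights = [math.cos(math.radians(lat)) for lat in lats_deg]
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)

# Hypothetical toy data: three cells at different latitudes, four days each.
cells = {
    20.5: [1.0, -1.0, 1.0, -1.0],
    25.5: [2.0, -2.0, 2.0, -2.0],
    30.5: [0.5, -0.5, 0.5, -0.5],
}
lats = list(cells)
rms_per_cell = [temporal_rms(cells[lat]) for lat in lats]
regional_noise = area_weighted_mean(rms_per_cell, lats)
print(rms_per_cell, regional_noise)
```

In this toy case the maximum RMS and the area-weighted mean play the roles of the 1.8 mm/day and 1.3 mm/day values quoted for the Sahara box.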
We further note that many coastal regions are affected by the limited spatial resolution of GRACE leading to the leakage of signals over the shoreline. The signal RMS in the GRACE time series is considerably larger than in the reanalysis especially in various places along Alaska's west coast, Hudson Bay, North Sea, Baltic Sea, Black Sea, Persian Gulf, and the Southeast Asian Seas. Similar features can be observed in the Gulf of Carpentaria northwards of Australia. At the same time, GRACE signal variability is dampened with respect to the reanalysis along various coasts in the tropics where no ocean signal is found in the gravity data. We assess the impact of both leakage-in of ocean mass variability onto the continents as well as signal loss of terrestrial water storage variations (see Methods and Supplementary Information S3) and mask all pixels affected by ocean dynamics. The coastal distance of such pixels varies depending on local signal strength and coastal geometry. As a rule of thumb, regions within 200-400 km of the coastline (in extreme cases up to 750 km) are disregarded in the subsequent analysis.

Atmospheric Net-Fluxes for Aruanã, Brazil
For the comparison of flux time series from GRACE and reanalyses (here: ERA5) we initially focus on one particular 1° grid cell around the city of Aruanã in the Amazon catchment that is characterized by a humid tropical climate (Fig. 2). Examples for different climatic conditions are given in the Supplementary Information (S4).
We note a generally good agreement between the two data sets, with peaks coinciding especially well in times of large flux variations (November to March) representing the season with highest precipitation rates in the region. The correlation coefficient for the two time series is ρ = 0.61 for the 2009/10 time span illustrated here, but shows substantial fluctuations over the year: from April to August, when precipitation is low, the correlation drops to ρ = 0.17, whereas it reaches values of up to ρ = 0.75 during the wet season from November to March. We also calculate relative explained variances (VAR; see Supplementary Information S2) and note that a substantial part of the GRACE signal can be explained by ERA5 (VAR = 0.33) from November to March, while from April to mid-August the explained variance is negative (VAR = −0.17), which means that subtracting the reanalysis does not decrease the signal variability picked up by GRACE. During this time span the signal standard deviation (2.0 mm/day) is only slightly above the GRACE noise floor, indicating that GRACE errors still dominate the signal at times of small fluxes. Investigating the signal characteristics of the band-pass filter applied to both GRACE and ERA5 (see Supplementary Information S1) reveals that periods down to 5 days are detectable in the flux time series. An upper bound of 30 days has been deliberately chosen to demonstrate the added value of a daily sampled time series over the conventional monthly GRACE products analyzed previously 25 .
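The three validation metrics used in this study can be illustrated with a short sketch. The exact definition of VAR is given in Supplementary Information S2 and is not reproduced in the text; the form used below, VAR = 1 − var(GRACE − reanalysis) / var(GRACE), is an assumption that is consistent with the statement that a negative value means subtracting the reanalysis does not reduce the GRACE signal variability. All names and numbers are hypothetical.

```python
import math

def mean(xs):
    return sum(xs) / len(xs)

def variance(xs):
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

def correlation(a, b):
    """Pearson correlation coefficient of two equally long series."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b)) / len(a)
    return cov / math.sqrt(variance(a) * variance(b))

def rmsd(a, b):
    """Root mean square difference between two series."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))

def explained_variance(obs, model):
    """ASSUMED definition: 1 - var(obs - model) / var(obs)."""
    residual = [o - m for o, m in zip(obs, model)]
    return 1.0 - variance(residual) / variance(obs)

# Hypothetical band-pass filtered fluxes in mm/day.
grace = [1.0, -2.0, 3.0, -1.0, 0.5, -1.5]
model_close = [0.8 * g for g in grace]    # same shape, slightly damped
model_opposed = [-g for g in grace]       # anti-correlated

var_close = explained_variance(grace, model_close)
var_opposed = explained_variance(grace, model_opposed)
print(correlation(grace, model_close), rmsd(grace, model_close), var_close, var_opposed)
```

Under the assumed definition, a model that tracks the observations yields a VAR close to 1, while a model that adds variability to the residual yields a negative VAR.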

Evaluation of Global Reanalyses
We now investigate the potential of GRACE to detect quality differences between subsequently published reanalyses by examining ERA5 together with its predecessor ERA-Interim using data spanning the years 2003-2015. [...] and Arabian desert) with very small flux signals. In general, we find higher correlations of GRACE with ERA5 than with ERA-Interim; the difference between the two is shown in Fig. 3g. While negative correlation differences occur especially in regions with very small correlation (e.g. Sahara and Arabian Peninsula), GRACE clearly identifies improved correlations for ERA5 in most of the continental areas, with particularly large differences in almost the entire North and South American continents, Asia and the central part of Africa.
A more regionally diverse picture is obtained from analyzing the RMSD between GRACE and the reanalyses. Here, ERA-Interim in particular shows large discrepancies to GRACE in South America in the region around the Paraná river basin and in Colombia, in the west of North America and in Southern China. While these areas still show rather large RMSD values in ERA5 as well, the often negative RMSD differences (Fig. 3h) indicate improvements in the latest reanalysis compared to its predecessor. Equatorial Africa also exhibits a strong reduction in RMSD.
Increasing RMSD values are only observed on the transition between equatorial and arid climate zones in Africa (Sahel and Namibia), in the Okavango catchment, and at the Kimberley Plateau in north-western Australia.
With respect to the relative explained variance (VAR), we note that the reanalyses only explain a very small part of the GRACE signal variance. In ERA5 some positive areas are visible in Asia and South America with maximum values around 0.45, while for ERA-Interim the explained variance is negative almost everywhere with a maximum value of 0.22. Despite the overall small values, the change of explained variance when moving from ERA-Interim to ERA5 (Fig. 3i) reveals a general improvement of the most recent reanalysis with a quite similar pattern as the one found for the RMSD. In general VAR is the most challenging metric when focusing on high-frequency (high-pass filtered) water flux signals. The very different numbers obtained for individual seasons in the exemplary time series of Aruanã in Fig. 2 (positive VAR in the rainy season and negative VAR in the drier season), however, suggest that the results are strongly influenced by time spans of low GRACE signal variability and thus an unfavorable signal-to-noise ratio (SNR). For a more detailed investigation of this influence, we recomputed the VAR grids for ERA5 excluding time spans with a GRACE signal variability below a certain threshold for each grid cell. We find that the explained variance increases substantially when raising the threshold. From a minimum signal RMS of 1.8 mm/day upwards, which, according to the Sahara test in Section 2 can be regarded as the upper bound of the noise range of GRACE, the explained variances become largely positive with a maximum value of 0.59, see Fig. 4. A more detailed analysis including different choices of thresholds is given in the Supplementary Information S5.
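The thresholding experiment described above can be sketched as a selection step that precedes the metric computation. The details of how time spans are delimited are given in Supplementary Information S5 and are not reproduced in the text; the fixed 30-day windows below are therefore an assumption made purely for illustration, as are the function names and the toy series.

```python
import math

def rms(xs):
    return math.sqrt(sum(x * x for x in xs) / len(xs))

def keep_high_signal(grace, model, threshold=1.8, window=30):
    """Keep only windows in which the GRACE flux RMS reaches the threshold.

    ASSUMPTION: time spans are consecutive, non-overlapping windows of
    `window` days; the study's actual segmentation may differ.
    """
    kept_grace, kept_model = [], []
    for start in range(0, len(grace), window):
        segment = grace[start:start + window]
        if rms(segment) >= threshold:
            kept_grace.extend(segment)
            kept_model.extend(model[start:start + window])
    return kept_grace, kept_model

# Hypothetical series: a high-signal month followed by a low-signal month.
grace = [3.0 if d % 2 else -3.0 for d in range(30)] + \
        [0.5 if d % 2 else -0.5 for d in range(30)]
model = [0.9 * g for g in grace]

grace_sel, model_sel = keep_high_signal(grace, model)
print(len(grace), len(grace_sel))
```

A metric such as the explained variance would then be recomputed from the retained samples only, so that low-signal periods dominated by noise no longer enter the statistic.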
We also group the statistics calculated for all individual 1° pixels (for the full time series) in terms of area-weighted cumulative distribution functions for each of the three validation metrics (Fig. 5). We note that all global percentiles for correlation, RMSD, and VAR indicate an improvement in ERA5 with respect to ERA-Interim (Fig. 5a). Additionally, the percentiles are shown for individual climate zones 49 in Fig. 5b with the strongest differences found for all three metrics in the equatorial climate zone.
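An area-weighted percentile differs from an ordinary one in that each pixel contributes according to its surface area rather than counting once. The sketch below illustrates the idea under the assumption of cosine-of-latitude weights on a regular grid; it is not the authors' implementation, and the function name and sample values are hypothetical.

```python
import math

def area_weighted_percentile(values, lats_deg, q):
    """Smallest value below or at which a fraction q (0..1) of the area lies.

    ASSUMPTION: pixel area is proportional to cos(latitude).
    """
    weights = [math.cos(math.radians(lat)) for lat in lats_deg]
    pairs = sorted(zip(values, weights))   # ascending by metric value
    total = sum(weights)
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight
        if cumulative >= q * total:
            return value
    return pairs[-1][0]

# Hypothetical correlations for three pixels.
rho = [0.1, 0.5, 0.9]
median_weighted = area_weighted_percentile(rho, [0.0, 85.0, 85.0], 0.5)
median_equal = area_weighted_percentile(rho, [0.0, 0.0, 0.0], 0.5)
print(median_weighted, median_equal)
```

In the first call the equatorial pixel covers most of the area, so it determines the median; with equal weights the middle value is returned instead.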
Here the median correlation, for example, has increased by 50% for ERA5 (ρ = 0.45) compared to its predecessor (ρ = 0.30), while the median RMSD has decreased by 23% (2.0 mm/day vs. 2.6 mm/day). Considerable improvements are also detected in the temperate climate zone, with an increase of 26% in median correlation and a decrease of 18% in the mean RMSD. In arid climates, where variations in high-frequency atmospheric water fluxes are much smaller, correlations are generally lower and the absolute differences between the two reanalyses are less pronounced. However, even in the arid climate zone the relative improvement is clearly detectable, with a median correlation increase of 35% from ERA-Interim to ERA5.
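The percentages quoted for the equatorial zone are relative changes with respect to the ERA-Interim value and can be checked directly from the medians given in the text. The helper name below is hypothetical.

```python
def relative_change_percent(old, new):
    """Change from old to new, expressed in percent of the old value."""
    return 100.0 * (new - old) / old

# Median values for the equatorial climate zone as stated in the text.
corr_change = relative_change_percent(0.30, 0.45)   # ERA-Interim -> ERA5
rmsd_change = relative_change_percent(2.6, 2.0)     # mm/day
print(round(corr_change), round(rmsd_change))
```

The correlation rises by about 50% and the RMSD falls by about 23%, in line with the stated figures.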

Added Value with Respect to Precipitation Observations
We also calculate RMSD changes (ERA5 vs. ERA-Interim) for precipitation rates alone against globally gridded rain gauge observations from GPCC 6 (see Methods), shown in Fig. 6a. Precipitation in reanalyses is not only sensitive to changes in the observing system but also to model physics and the resulting general atmospheric circulation. Therefore, validation against in situ measurements is commonly regarded as a critical evaluation measure and is a widely applied diagnostic tool for assessing the quality of atmospheric reanalyses 50 . The overall negative values indicate a smaller RMSD and thus a better agreement of the ERA5 precipitation rates with the rain gauge data when compared to ERA-Interim.
While comparing the regions of particularly large improvements to those detected in the net flux imbalance observed by GRACE (Fig. 6b, same as Fig. 3i), we note a generally good consistency for the validation based on gravity and rain gauges. Strong improvements are found by both data sets in large parts of South America, especially in the south-east around the region of the Paraná River basin. Very high agreement between the two figures is also found in Asia along the Himalaya mountain range and in Southern China. North America sees its largest improvements in the southeastern part of the United States. Hence we conclude that in regions where an improvement in reanalysis P can be detected by rain gauge data hinting at precipitation-dominated water fluxes, GRACE is also able to identify a reduction in RMSD with similar magnitudes and geographical distribution.
However, since GRACE observes the net-flux imbalance, it also contains information about the other fluxes E and R and is, therefore, not redundant to GPCC. As a remote sensing data set, satellite gravimetry has globally homogeneous coverage that does not depend on the number of hydro-meteorological stations operated (and made publicly available) by a particular country. GRACE thereby documents even more pronounced improvements than GPCC in equatorial Africa, in northern South America at the Colombian Highlands, and in the eastern parts of China. Interestingly, we also note three distinct regions of increasing misfit in the Sahel Zone, the Okavango Catchment, and the Kimberley Plateau, where GRACE-based flux estimates are more closely aligned with ERA-Interim than with ERA5. We speculate that GRACE might point to a deficit in the high-frequency atmospheric fluxes of ERA5 relative to ERA-Interim that is not detectable from validation against in situ rain gauge data alone, and we advise further studies to investigate possible reasons for those discrepancies.

Discussion and outlook
With a record of about 15.5 years of observations secured by the GRACE mission and with GRACE-FO operating nominally since its launch in May 2018, satellite gravimetry has matured into an operational observing system of global mass change and mass re-distribution. The project team consisting of scientists based in the U.S. and Germany is progressing well towards the low-latency provision of hydrospheric mass change estimates for various applications in the physical Earth sciences. Based on a series of non-standard daily gravity fields, we have demonstrated the potential of GRACE to provide information about atmospheric net-fluxes of water even at short periods between 5 and 30 days. Our approach of relating band-pass filtered water fluxes from GRACE via the terrestrial water balance equation to reanalysis output is generally applicable to all continental regions away from the coasts. We find good correspondence between GRACE flux time series and the most recent state-of-the-art reanalysis ERA5 from ECMWF. The GRACE noise floor is estimated to have an upper bound of around 1.8 mm/day, meaning that in arid climates or during time spans with correspondingly small variations in atmospheric water fluxes, the signal-to-noise ratio limits the reliability of the evaluation.
In regions with short-term flux variations larger than this threshold, our analysis reveals the potential of GRACE to discriminate between atmospheric reanalyses in terms of the quality of their atmospheric net-fluxes of water. In the equatorial climate zone, an increase of 50% in median correlation and a decrease of 23% in RMSD was detected in ERA5 with respect to its predecessor ERA-Interim. The assessment of the predictive skill of reanalyses in terms of the explained variance of the GRACE signal is challenging, since in regions and time spans with low signal variability, the GRACE time series is dominated by noise. Even though a clear improvement from ERA-Interim to ERA5 is detected, an evaluation of reanalyses remains difficult because of largely negative values for the explained variance. First experiments show that limiting the evaluation spatially and temporally to regions and time spans with a favorable GRACE signal-to-noise ratio substantially improves the fit. We currently recommend using GRACE in regions and at times with a minimum signal RMS of 1.8 mm/day, but note that this threshold might change depending on the particular application. GRACE results largely confirm the improvement in precipitation modeling achieved in ERA5, as already known from the comparison against rain gauge observations. In addition, GRACE also identifies degradations of atmospheric net-fluxes of water in ERA5 as compared to ERA-Interim in three distinct regions that are not detectable from the rain gauge comparison. The reasons are currently unclear, but should be evaluated further in order to provide potentially valuable feedback to the meteorological reanalysis community.
Only the recent progress in GRACE data processing has enabled the use of daily GRACE time series for evaluating high-frequency atmospheric fluxes. The accuracy of the previous daily GRACE time series ITSG-Grace2016 would not have been sufficient to carry out such an assessment, as demonstrated in the Supplementary Information (S9). It can be assumed that the potential of GRACE data analysis has not yet been fully exploited and that future improvements in gravity field determination can also be expected from the GRACE Follow-On laser ranging instrument measurements. Observations are currently limited to flux variations at periods of 5 days and longer, a limit that might be lowered further once a constellation of two GRACE-like missions operating simultaneously in differently inclined orbits is realized in line with multi-disciplinary user requirements 51 . It thus would be sensible to work towards the assimilation of GRACE-based fluxes into numerical weather prediction models in order to fully exploit those continuously recorded satellite observations in the field of meteorology.

Methods
Daily GRACE gravity fields. In our study we apply the daily solutions of ITSG-Grace2018, which is the latest release of time-variable gravity field models computed at Graz University of Technology 37 . As in the standard processing of monthly GRACE gravity field models, mass variations changing faster than one month are removed prior to the data processing by subtracting the output of geophysical background models from the observations (de-aliasing). These include mass changes caused by ocean, solid Earth and pole tides, as well as non-tidal atmospheric and ocean mass variations 48 . The de-aliasing process thereby removes the gravitational effects of atmospheric mass changes from the daily gravity field models, isolating water storage changes at or below the surface which are caused by vertical and lateral water fluxes in line with the terrestrial water balance equation (Eq. 1). For a more detailed discussion of a possible influence of the de-aliasing reductions on the results of this study, see the Supplementary Information (S6).
Compared to the standard monthly solutions, the limited satellite ground track coverage during one day does not allow for a stable global gravity field inversion so that additional information has to be introduced. The time series is therefore processed by a Kalman smoother approach similar to the one described by 29 , which introduces in its process model statistical information about the expected time evolution of the gravity field signal. The ITSG-Grace2018 daily solutions apply an auto-regressive (AR) model of order 3 to express the spatio-temporal correlations between epochs, allowing for a better description of the gravity field's temporal evolution compared to the AR model of order 1 as in previous releases. The process model was derived from the output of the updated Earth System Model of the European Space Agency 52 (ESA ESM) and primarily includes information about hydrological signal variability, but also residual errors in the background models for atmosphere and ocean dynamics. Since only stochastic information is introduced, no bias towards the ESA ESM is to be expected.
The daily gravity field models are provided as coefficients of a spherical harmonic expansion up to degree n = 40 representing water storage anomalies (i.e. deviations from a long-term mean gravity field model) with a spatial resolution of approximately 500 km. No additional spatial filtering is required since spatially correlated noise is effectively suppressed by the Kalman smoother. Since we only focus on the sub-monthly variations, the post-processing steps generally applied to monthly GRACE data (geocenter correction, replacement of coefficient c20, removal of glacial isostatic adjustment, for example applied in 24 ) are omitted. These corrections primarily affect monthly and longer time-scales and would be removed during the high-pass filtering process (see below). We use the time span 2003-2015 for our study, which represents all full years of GRACE sensor data unaffected by the waning battery capacities towards the end of the mission. Even though the Kalman smoother output provides a continuous daily time series without data gaps, all days without GRACE observations were excluded from the analysis. During these time spans the daily solutions are only informed by the process model and thus tend towards a mean trend and annual signal, which vanishes after high-pass filtering.
Atmospheric reanalyses. We use daily gridded precipitation, evapotranspiration, and runoff output fields obtained from two subsequent generations of global atmospheric reanalyses produced by the European Centre for Medium-Range Weather Forecasts (ECMWF). The first and older one is the Interim Re-Analysis (ERA-Interim) 40,53 . A more recent reanalysis ERA5 is currently being produced by ECMWF within the Copernicus Climate Change Service (C3S) 39 .
Important changes relative to ERA-Interim include a higher spatial resolution (31 km compared to 80 km), a strongly improved representation of the troposphere, a better global balance of precipitation and evaporation, improved precipitation over land in the deep tropics, and a new land surface scheme leading to an enhanced consistency of soil moisture and land surface fluxes. Previous studies 54 found a consistent improvement of land hydrology variables when driving a land surface model by ERA5 vs. ERA-Interim atmospheric forcing. The reanalysis fields were converted from gridded data to a spherical harmonics representation up to degree n = 40 in order to obtain a spatial resolution consistent with the GRACE data.
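The correspondence between a maximum spherical harmonic degree of n = 40 and a spatial resolution of roughly 500 km follows from the common rule of thumb that the smallest resolvable half-wavelength is about πR/n, i.e. roughly 20,000 km divided by n. The short sketch below evaluates this rule; the mean Earth radius of 6371 km and the function name are assumptions introduced here for illustration.

```python
import math

EARTH_RADIUS_KM = 6371.0  # ASSUMED mean Earth radius

def half_wavelength_km(max_degree, radius_km=EARTH_RADIUS_KM):
    """Rule-of-thumb spatial resolution of an expansion up to max_degree."""
    return math.pi * radius_km / max_degree

resolution_km = half_wavelength_km(40)
print(round(resolution_km))
```

Truncating the reanalysis fields at the same degree therefore limits them to the same spatial scales as the daily GRACE solutions.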
Daily gridded precipitation data. Daily observational records of precipitation as provided by the Global Precipitation Climatology Centre (GPCC) were applied for additional comparison. We use the most recent version of the GPCC Full Data Daily V.2018 6 , which is based on more than 35,000 gauging stations world-wide and covers the period 1982-2016. The data is available at 1° resolution.
Band-pass filtered atmospheric fluxes from observed water storage variations. The daily gravity field solutions computed from the GRACE data can be converted to water storage anomalies 55 and linked to the net flux imbalance of the hydrological fluxes precipitation (P), evapotranspiration (E), and river discharge (R) via the terrestrial water balance equation:
$$\frac{dS}{dt} = P - E - R \qquad (1)$$
As a first step, the daily water storage time series S is converted to water fluxes (or storage changes) dS/dt by applying a numerical differentiation filter. Here a simple forward differencing between two subsequent days is not meaningful, as the daily solutions provided by the Kalman smoother are not independent but temporally correlated. The information content of GRACE on daily time scales is closely tied to the ground track patterns of the satellites 29 . Due to the polar orbit configurations, regions around the North and South Pole are effectively sampled every 90 min, while towards the equator the ground track separation and thus revisit time increases. In a typical, i.e. non-repeat, orbit cycle the globe is fully sampled approximately every 4-5 days.
To account for these sampling characteristics, we use a somewhat broader differentiation filter that takes into account the two preceding and the two following days (a 5-point central differentiation filter). We further apply a third-order Butterworth high-pass filter (forwards and backwards to avoid phase shifts) with a cut-off period of 30 days. Together with the low-pass effect of the 5-point differentiation filter, the result is a band-pass filtered flux time series that effectively reduces the analysis to fluxes at periods between 5 and 30 days, which corresponds to the typical time-scale of mid-latitude cyclone persistence 56 . To allow a meaningful comparison, an equivalent band-pass filter is also applied to the daily reanalysis data on the right-hand side of Eq. (1); for a detailed discussion of the computation steps and the effect of the different filters see Section S1 of the Supplementary Information.
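The differentiation step can be sketched as a centred 5-point filter applied to a daily storage series. The weights used below are the standard five-point central-difference stencil (1, −8, 0, 8, −1)/12; this choice is an assumption made for illustration and is not necessarily the set of weights used in the study. The subsequent Butterworth high-pass filter is not reproduced here, and all names are hypothetical.

```python
import math

# ASSUMPTION: standard five-point central-difference weights for days
# t-2, t-1, t, t+1, t+2; the study's actual weights may differ.
WEIGHTS = (1 / 12, -8 / 12, 0.0, 8 / 12, -1 / 12)

def differentiate(storage, weights=WEIGHTS, dt_days=1.0):
    """Convert a daily storage series (mm) into fluxes dS/dt (mm/day).

    The first and last two days are dropped because the filter needs
    two neighbours on each side.
    """
    half = len(weights) // 2
    fluxes = []
    for i in range(half, len(storage) - half):
        window = storage[i - half:i + half + 1]
        fluxes.append(sum(w * s for w, s in zip(weights, window)) / dt_days)
    return fluxes

# Hypothetical storage signal: 10 mm amplitude, 20-day period.
omega = 2.0 * math.pi / 20.0
storage = [10.0 * math.sin(omega * t) for t in range(60)]
fluxes = differentiate(storage)
analytic = [10.0 * omega * math.cos(omega * t) for t in range(2, 58)]
max_error = max(abs(f - a) for f, a in zip(fluxes, analytic))
print(len(fluxes), max_error)
```

For this smooth test signal the filtered fluxes agree closely with the analytic derivative, which is the behaviour one would expect from any reasonable central differentiation filter at periods well above the sampling interval.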
Equation (1) is strictly valid only on the scale of river basins and not on grid-scale, as surface water dynamics in rivers, lakes, and wetlands are not captured in the grid-wise net flux imbalance of the reanalyses. To investigate whether these lateral water transports influence the results of our study, we carried out an additional experiment using a global land surface scheme and discharge model 57 , see Supplementary Information S8. Two different runs of this model were forced by the two atmospheric reanalyses (ERA5 and ERA-Interim) to simulate daily water storage values. Afterwards, exactly the same analysis steps were applied as to the observed water storage variations from GRACE (i.e. expansion into spherical harmonics up to degree and order 40; 5-point differentiation filter to derive fluxes; high-pass filtering). These simulated fluxes are still dominated by ERA5 or ERA-Interim precipitation, but differ from the original reanalysis fluxes P − E − R since they also include all apparent fluxes induced by the simulated surface dynamics. Nevertheless, a comparison with GRACE shows very similar results compared to directly using the reanalysis fluxes as presented above. These simulations thereby confirm that the simplified water balance equation employed has only marginal effects on our results, meaning that surface water variations caused by lateral transport in rivers and lakes do not contribute significantly to the rapid variations in terrestrial water storage observed by GRACE when data averaged over 500 km and at sub-monthly temporal scales is considered. We moreover note that a part of the improvement in the ERA5 net-fluxes over its predecessor indeed appears to be caused by evapotranspiration, which is, in contrast to precipitation, updated in the energy and water balance calculations of the land surface scheme.
Spatial leakage at the coasts. The low spatial resolution of the daily GRACE data (spherical harmonic degree n = 40) does not allow for a strict separation between land and ocean areas along the coastline but results in spatial leakage effects. In addition, the Kalman filter approach used for the computation of the gravity field solutions can introduce spurious ocean signal over land. Two different effects need to be considered: Firstly, in areas in which the oceans exhibit larger high-frequency signals than the land areas, the ocean signal leaks onto the continents, causing unrealistic signals in continental grid cells. This results in particularly large RMSD values between GRACE and reanalyses at the coast. Secondly, in regions with very small oceanic variability such as the equatorial Atlantic and the Indian Ocean, the leakage leads to a dampening of the GRACE signal on land. We thus mask out all coastal regions that are dominated by coastal leakage as outlined in more detail in the Supplementary Information S5.