Introduction

Freshwater on land is a vital resource for human society and natural environment1. However, its quantity, spatial distribution, and variability generally remain highly unknown in most parts of the world, preventing the development of sustainable strategies to manage water resources2. This is of particular relevance for the Congo Basin, a vast and remote tropical watershed located in the heart of the African continent. Covering 3.7 × 106 km2, the Congo Basin stands as the second largest river basin in the world, hosting 120 million inhabitants whose activities are generally highly dependent on water resource availability3. The basin is also the second largest freshwater outlet to the world oceans4, only surpassed by the Amazon, making it of global importance for the world’s climate and water cycle5. Furthermore, the region is largely covered by dense tropical forest (e.g. the Cuvette Centrale, the world’s largest tropical peatland, made of swamp forests) that harbors vast natural resources and biodiversity6, but also acts as a carbon sink that stores about 80 Gigaton of carbon (equivalent to about 2.5 years of current global anthropogenic emissions)7. Despite its importance, knowledge about the availability and distribution of water resources in the region is still inadequate. To the best of our knowledge, the only source for water resource estimates in the Congo Basin is Food and Agriculture Organization’s AQUASTAT (FAO-AQUASTAT) renewable water resource assessment, which is based on parameters such as precipitation, evapotranspiration, runoff, and soil moisture collected through surveys and other data collection methods3,8. More surprisingly, the Congo Basin has not attracted as much attention among the scientific communities as done for other large tropical river basins4 and remains relatively understudied, currently leaving an insufficient knowledge of its hydrology characteristics9 and future changes. Recent studies project that the total runoff in the Congo Basin is expected to increase by 5% over the next two decades and 7% by mid-century10. Towards the end of the 21st century (2070–2099 period), almost all projections suggest that the temperature over the basin will increase between 2 and 4C, while precipitation changes still remain highly variable across model projections in terms of sign and magnitude of change (− 9% to 27%)11. Under such a scenario of temperature increase, the mean annual runoff of the basin will increase by 7% to 20%.12. The basin is thus facing great threats under the combined effects of current climate change and increased anthropogenic pressure13, such as groundwater stress, damming and deforestation6,14,15. These environmental alterations, along with inadequate infrastructure and low adaptive capacity, make the population particularly vulnerable to hydro-climatic variability and any future changes in the water cycle16.

While the Congo Basin was a relatively well-gauged basin prior to 1960, today only a few active hydrological gauges exist across the basin4,9. Therefore, understanding the major factors controlling the basin’s freshwater variability at proper space and time scales remains a challenge. The lack of data has become less pronounced in the last two decades due to the availability of spaceborne observations on surface water characteristics through satellite altimetry missions e.g TOPEX/Poseidon, Jason-series, and remote sensing missions like MODIS and Landsat17,18. Additionally, the Gravity Recovery and Climate Experiment (GRACE) and GRACE Follow-On satellite gravimetry missions provide unique estimates of continental water storage variability19,20. Over the Congo River, GRACE observations contributed to quantifying the dynamic of water storage change21,22 and to reveal groundwater stress and pressure on water resources23. However, GRACE satellites are blind to the Total Drainable Water Storage (TDWS), i.e., the total amount of drainable freshwater stored in soils and at the surface on a long-term average. In fact, TDWS is a known unknown in hydrology24, despite its major importance for water resources. The only basin for which an estimate of TDWS is currently available is the Amazon Basin amounting to 1766 ± 47 km3 in ref. 25. The absolute drainable water storage of the Mississippi Basin26 was further estimated to range between 2900 ± 400 and 3600 ± 400 km3.

Within this complex context, the present study targets two main objectives: 1) to quantify, for the first time, the contemporary TDWS over the entire Congo Basin and the geographical distribution of water availability among its major sub-basins Kasaï, Middle- Congo, Ubangui, Sangha, and Lualaba (Fig. 1 and Table S1), 2) estimate the hydraulic time constant representing the resistance of a basin to discharge its water storage for the Congo Basin and its sub-basins. We further assess the plausibility of our results with respect to other external estimates and show that our estimates are consistent with previous investigations done over the Amazon basin. Finally, we discuss the implications of our findings regarding Congo’s water resource availability.

Fig. 1: The Congo Basin and its sub-basins.
figure 1

The Congo Basin and its sub-basins: Kasaï, Middle-Congo, Ubangui, Sangha, Lualaba-North, Lualaba-South and Lualaba-Lukuga covering a total area of 3.7 × 106 km2. Red dots represent the discharge gauging stations and yellow dots show the location of selected satellite altimetric virtual stations. Wetlands as identified by73 are shown in light blue color and lakes are depicted in dark blue color. Five wetlands around lakes Mai-Ndombe, Bangwelu, Mweru, Mweru Wantipa, and Upemba are considered for quantifying water storage in wetlands.

Results

Water storage-discharge relationship

Assuming that the lakes and wetlands are storages loosely coupled from the drainage system, we obtain Drainable Water Storage Anomaly (DWSA) for each sub-basin by subtracting the Lake Water Storage Anomaly (LWSA) and the Wetland Water Storage Anomaly (WWSA) from the Total Water Storage Anomaly (TWSA) (See Data and Methods). In essence, wetlands and natural lakes can store and release water independently of river flow and have a high water-holding capacity allowing them to store water for extended periods of time, even during periods of low or no river discharge27. In fact, DWSA would be a proxy of the variations of groundwater storage and soil moisture storage, which are primary contributors to the river system. As shown in Fig. 1, there are no major lakes or wetlands in the Ubangui, Middle-Congo, and Sangha sub-basins, which means that no LWSA or WWSA is subtracted from TWSA. The Cuvette Centrale region, composed of forests and wetlands, extends into these three sub-basins. Recent findings by17 indicate a strong temporally correlated influence of the middle Congo floodplain on the Congo Basin discharge at the Brazzaville/Kinshasa station, for which a substantial part of the variability is explained by variations in surface water extent in the Cuvette Centrale region. Such an influence suggests that the wetlands in the Cuvette Centrale are mostly connected to the river system28, which means that their storage should not be removed from the TWSA for obtaining DWSA. Therefore, TWSA of these three basins can be equally considered as DWSA. Note that Lake Tumba over the Middle-Congo is neglected because its maximum volume variation is about 0.5 km3, which falls below the GRACE noise level. In the Lualaba-North sub-basin, with no major wetland and lake, TWSA is assumed to be fully relatable with its discharge at the outlet. Across the Lualaba-Lukuga region, encompassing Lakes Tanganyika and Kivu, LWSA has a major role in TWSA, while no large wetland decelerates discharge in the Lukuga River basin. In the Kasaï and Lualaba South sub-basins, both LWSA and WWSA from five lakes, and the surrounding wetlands of Mai-Ndombe, Bangwelu, Mweru, Mweru Wantipa, and Upemba are subtracted from TWSA.

Figure 2 shows the time series of TWSA, WWSA, and LWSA for seven sub-basins and the entire Congo Basin in form of equivalent water height in millimeter, which are obtained by dividing storages by the area of the corresponding basin. The time series represent distinct seasonal variations, with peak values reached in October and November in the northern sub-basins (Ubangui, Sangha) and in Middle-Congo that straddles the both northern and southern parts of the basin. The time series display peak values between February and April in the southern sub-basins (Kasaï, Lualaba-North, Lualaba-South and Lualaba-Lukuga). The TWSA time series across the entire Congo typically shows a double annual peak, one originating from the contribution of the southern basins and one from the northern ones (Fig. 2). Over the entire Congo, LWSA account for approximately 10% of the TWSA amplitude (100 mm), with most of its variations (about 80%) coming from Lake Tanganyika. The WWSA, which accounts for about 20% of TWSA, exhibits strong seasonal variations linked to the variations in the southern parts of the basin, which is to be expected since the wetlands are primarily located in Kasaï and Lualaba-South sub-basins.

Fig. 2: TWSA, WWSA, and LWSA over the Congo Basin and its sub-basins.
figure 2

Time series of Total Water Storage Anomaly (TWSA), Wetland Water Storage Anomaly (WWSA) and Lake Water Storage Anomaly (LWSA) for the 7 sub-basins and the entire Congo Basin, in millimeter of equivalent water height. Over the Kasaï and Lualaba-South, WWSA represents storage anomaly of wetlands, contributing to Congo’s WWSA (Fig. 1). LWSA time series for the various sub-basins represent water storage anomaly of lakes listed in Table 3.

The Kasaï WWSA, with an amplitude of about 10 mm, represents the equivalent water height of about 10 km3 wetland water storage, located in the surroundings of the Lake Mai-Ndombe (Fig. S2), which is part of the Tumba-Ngiri-Maindombe, one of the world’s largest wetlands, and a site of major significance as recognized by the Ramsar Convention (https://rsis.ramsar.org/ris/1784). LWSA in the Kasaï is dominated by Lake Mai-Ndombe itself, which fluctuates with an amplitude of about 3 km3 (Figure S1), corresponding to an LWSA with an amplitude of about 3 mm. Over the Lualaba-Lukuga, LWSA, which accounts for about 50% of the TWSA, comes from Lake Kivu with a storage variation of about 2 km3 and Lake Tanganyika with a variation of about 20 km3 (Fig. S1), imposing thus considerable control on the discharge of the Lukuga River29,30. Over the Lualaba-South, Bangwelu Wetland with about 20 km3, Mweru Wetland with 4 km3, Mweu Wantipa Wetland with about 1 km3 and Upemba Wetland with about 2 km3 of amplitude (Fig. S2) are the main contributors of WWSA with 100 mm amplitude, while lakes generate LWSA with an amplitude of about 10 mm.

Following the DWSA-discharge relationships of mean monthly values (mean of all Jan., of all Feb., …) shown as scatter plots in Fig. 3 and as time series in Fig. 4 (left), we observe that the Congo Basin, unlike the Amazon25, generally exhibits a clockwise hysteresis (shown in Figure with light blue arrows). The clockwise behavior indicates that the storage time series lags behind discharge (Fig. 4 bottom right). Note that here we deal with storage level and not the storage change (a derivative of the storage) as a flux. While the catchment shows a clockwise relationship, February–May presents a counteracting behavior. This is also seen in the southern sub-basins of Kasaï, Lualaba-North and Lualaba-South and originates from excess precipitation during the rainy season in the headwaters of these sub-basins (Fig. 4). Such a precipitation pattern in the south also causes most lakes to reach their maximum storage capacity. The WWSA and LWSA time series of the Congo Basin in Fig. 2 (bottom) show their peak values between March and May. These peaks in the LWSA time series correlate with one of the two peaks of each year in the TWSA time series.

Fig. 3: Mean monthly river discharge against mean monthly DWSA.
figure 3

Mean monthly river discharge Q against mean monthly drainable water storage anomaly (DWSA = TWSA − WWSA − LWSA) (thin gray curve with colored disks) and time-shifted drainable water storage anomaly (thicker black curve). Mean monthly discharge values are obtained from discharge time series available over different time periods (see in S2). DWSA mean monthly values were obtained from time series from 2002 to 2015 for Kasaï, Lualaba South, and Congo, and 2002 to 2017 for the other sub-basins. The arrows represent the annual behavior of the runoff-storage relationship.

Fig. 4: Mean monthly DWSA, precipitation and river discharge for Congo and its sub-basins.
figure 4

Column 1 and 2) Mean monthly river discharge and mean monthly Drainable Water Storage Anomaly (DWSA = TWSA − WWSA − LWSA) for Congo and its sub-basins. Note that mean monthly refers to the mean of all Jan., of all Feb. etc. Mean monthly discharge values were obtained from discharge time series available over different time periods (see S2). DWSA mean monthly values were obtained from time series from 2002 to 2015 for Kasaï, Lualaba South, and Congo, and 2002 to 2017 for the other sub-basins. Columns 3 and 4) Mean monthly precipitation over Congo and its sub-basins from Global Precipitation Climatology Centre (GPCC) dataset67 averaged over 1990–2019 together with mean monthly river discharge time series.

For the entire Congo Basin, from December to June, when both storage and discharge decrease, Fig. 3 shows a slight exponential-like behaviour (ignoring March–April). This is explained by the fact that generally, as a basin drains, the water leaves the basin at a slower rate i.e. storage loss is smaller than discharged water. Such a pattern is also seen in sub-basins Ubangui, Sangha, Kasaï, and Lualaba-North. On the other hand, within the draining phase, Middle Congo and Lualaba-South appear to drain at a faster rate since the discharge-storage relationship is rather linear. Such behavior may be explained by the fact that the soils are particularly shallow and sandy31 and drain quickly as precipitation decreases (Fig. S7). The (sub-)sub-basins of Lualaba show different relationships. The presence of lakes and wetlands, with massive storage capacities like the Bangwelu swamp, the Upemba depression, and Lake Tanganyika, greatly influences the flow regime of the downstream sub-basins32. In the Lualaba-North, the discharge reaches its maximum in November, while in the Lualaba-South and Lualaba-Lukuga, the maximum discharge is reached in April–May17. This is mainly driven by the difference in precipitation patterns over these three sub-basins (Figs. S7 and 4). In the Lualaba-North, heavy rainfall occurs in October/November, while in the southern parts of the Lualaba, heavy rainfall events typically occur in December–March (Fig. 4). On the other hand, the Lualaba-Lukuga exhibits very different characteristics than all other sub-basins, with the discharge-storage relationship that proceeds counterclockwise with their minimum values reached in November (Fig. 3 and Fig. 4). Looking at Fig. 3 (bottom left), we observe that in its dry period from May to November Lualaba-Lukuga loses its storage very fast, as the discharge-storage relationship represents an exponential behaviour i.e. storage loss is much larger than discharged water. This behavior is attributed to the enormous loss of water by evaporation from the surface of Lake Tanganyika, which accounts for 82% of the total annual water loss of the lake29.

In Fig. 3, the black lines represent the relationship between discharge and the lag-corrected drainable water storage anomaly \(\Delta {S}_{\phi }^{{{{{{{{\rm{d}}}}}}}}}\). As expected after removing the lag between DWSA and discharge, the relationship becomes a near-linear relationship (Fig. 3). This linear relationship allows the estimation of the active water storage in the basin, or the corresponding storage point at which storage-induced discharge approaches zero. This level, in turn, operationally defines the TDWS of the watershed.

Total drainable water storage

We determine the TDWS of the Congo Basin to be only 476 ± 10 km3 (Fig. 3). This water amount corresponds to 133 ± 3 mm of equivalent drainable water height, i.e., the level at which the discharge in Fig. 3 reaches zero. It means that when the Congo reaches its minimum drainable storage in August ca. − 40 mm (137 km3), the basin could still be drained by about 339 km3 (476 − 137 km3). Conversely, when both the water storage and the discharge reach their maximum in December, the basin holds a water volume of 613 km3. The estimated TDWS volume is equivalent to that of Lake Erie (480 km3), one of the great lakes of North America. In the same analogy, the Congo Basin is equivalent to a lake with a mean water volume of 476 ± 10 km3 that fluctuates annually with an amplitude of 137 km3.

The TDWS over the entire Congo agrees, at a difference of 5%, with the weighted average value of 503 ± 10 km3 obtained from the sub-basin analysis. Table 1 summarizes the results and shows that the Kasaï sub-basin has the highest capacity, storing 220 ± 4 km3 or about 43% of the total drainable water of the Congo Basin. Sangha, Ubangui, and Middle Congo contain in total 173 ± 8 km3, equivalent to 63% of the Kasaï sub-basin. The Middle Congo, with its Cuvette Centrale, stores 90 ± 6 km3 of TDWS, only 40% of the Kasaï estimates. Such a difference is expected, as both Kasaï and Middle Congo receive similar annual precipitation (Table S1), but the latter is a shallow watershed with much larger discharge (498 ± 20 mm/yr, see Table S1) and evapotranspiration. The Lualaba sub-basin accounts for approximately 21% of the basin TDWS with 109 ± 4 km3 (sum of its sub-basins North, South and Lukuga), but most of the storage (61%) is concentrated in the Lualaba-South (67 ± 2 km3). Over the Lualaba-Lukuga the TDWS is 20 ± 1 km3, for which the storage of Lake Tanganyika was subtracted from TWSA.

Table 1 Estimated hydraulic time constant τ and Total Drainable Water Storage (TDWS) S0 over Congo and its sub-basins.

To quantify DWSA, we have subtracted the storage anomaly of decoupled surface water bodies, e.g., lakes (LWSA) and wetlands (WWSA), from the GRACE total water storage anomaly. In reality, these open surface water bodies are not completely decoupled from the drainage system. Lake Tanganyika for instance partially drains into the Lukuga River at Kalemie and further feeds the Lualaba basin33. This discharge contributes to 18% of the total annual water loss of the lake, while its water balance is mainly governed by evaporation, corresponding to 82% of its total annual water loss29. Similarly, Lake Bangwelu and its surrounding wetlands ultimately discharge into the Luapula River, Lake Mai-Ndombe and its wetlands contribute to the Fimi River, and Lake Mweru with its surrounding wetlands are drained by the Luvua River. Given these contributions into the river system, it can be argued that the storage anomaly of these water bodies should not be fully excluded from the total water storage anomaly. To examine the effect of removing the WWSA and the LWSA from the TWSA and its impact on the results, we estimate the same quantities by analyzing the relationship between river discharge and TWSA (Fig. S5). Along with the estimates from the DWSA-discharge relationship, Table 1 lists the estimates from the TWSA-discharge relationship. In this case, \({S}_{0}^{{{{{{{{\rm{t}}}}}}}}}\) and τt differ from \({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) and τd for the entire Congo and for sub-basins of Lualaba-South, Lualaba-Lukuga, and Kasaï, as the lakes and wetlands are distributed among these three sub-basins (see Fig. 1). Since all storage compartments are included in TWSA, the estimate of 139 ± 6 mm for \({S}_{0}^{{{{{{{{\rm{t}}}}}}}}}\), which corresponds to 502 ± 22 km3, can be considered an upper bound for the estimate of TDWS. On the other hand, since the wetlands and lakes in the Congo Basin have marginal outflows and contribute to the drainage system, the assumption that they are fully decoupled from the drainage system is not perfectly true. Therefore, the estimated value of 476 ± 10 km3 for \({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) can be considered a lower bound for TDWS. To this end, it is safer to express that the TDWS of the Congo Basin lies between 476 ± 10 km3 and 502 ± 22 km3.

In Kasaï, the upper bound of the TDWS is 228 ± 18 km3 for which storage of wetland and Lake Mai-Ndombe are included in the estimation. In Lualaba-South inclusion of storage from wetlands and lakes Bangwelu, Upemba, and Mweru and the wetland of Mweru Wantipa results in an estimate of 107 ± 5 km3 for the upper bound of TDWS (Table 1). Over the Lualaba-Lukuga, the TDWS with and without the storage of Lake Tanganyika varies between 20 ± 1 km3 and 40 ± 1 km3. In fact, if Lake Tanganyika would have been fully involved in the river system, its storage anomaly would result in an increased TDWS by 20 ± 1 km3 for the Lualaba-Lukuga sub-basin. Our results show that about 65% of Congo’s total drainable water storage is stored in the southern part of the basin in the two sub-basins of Kasaï and Lualaba. The remaining 35% is nearly evenly split between the middle Congo and the northern sub-basins Ubangui and Sangha.

Hydraulic time constant and basin resistance

The parameter τd in Table 1, the resistance of a basin to discharge its water storage, is related to the basin’s slope, length, topography, presence of surface water bodies and soil types25. We can make an analogy with Ohm’s law by considering the discharge as a current, the storage anomaly as a voltage, and τd as a resistance. The higher τd is, the lower the current (discharge) flows and vice versa. Similarly, we can find analogies for a factor such as the resistivity of the material that directly affects the resistance of a conductor. In the case of a watershed, such resistivity is governed by slope, topography, vegetation pattern, and soil type. For the entire Congo Basin, we obtain a time constant of 4.3 ± 0.1 months (Table 1). The Middle Congo, with a relatively flat topography and without any large surface water body, has a time constant of 2.6 ± 0.2 months. Similarly, Lualaba-North shows a small time constant of 1.7 ± 0.3 months. Such relatively small time constants may be related to the fact that these two sub-basins are intermediate basins receiving inflow from upstream sub-basins. Moreover, based on the soil type map by31, it can be seen that distinct dominant soil types are prevalent in these two sub-basins (see https://data.apps.fao.org/map/catalog/srv/eng/catalog.search#/metadata/446ed430-8383-11db-b9b2-000d939bc5d8), which may be a reason for this relatively rapid τd. On the other hand, in the Kasaï, Lualaba-South, and Lualaba-Lukuga sub-basins, which are characterized by the presence of extensive open surface water area, we estimate larger time constants, as expected. It should be noted that by excluding WWSA and LWSA from the TWSA, we only exclude the magnitude of the wetland and lake water storage, not its impact on discharge dynamics. In the Lualaba-Lukuga, Lakes Tanganyika and Kivu act like strong resistors that regulate the river flow and the contribution of the lakes to the discharge leads to an extraordinarily large τd of 105.8 ± 3 months. In case these two lakes are considered as fully coupled lakes to the drainage system, the estimated time constant τt would be 174.2 ± 4.7 months. The difference of 69 ± 5 months is due to the time constant resulting from the storage of primarily Lake Tanganyika and, to a small extent, of Lake Kivu. Over the Lualaba-South with the permanent wetlands and lakes of Bangwelu, Mweru, Mweru Wantipa and Upemba, the time constant τd is 7.7 ± 0.2 months. Such a value turns into a relatively larger time constant τt of 13.9 ± 0.9 months when the storage of these open surface waters is considered as fully coupled compartments in the drainage system. These results imply that water stored in the permanent wetlands and lakes in Lualaba-South impose an extra resistance time of about 6.2 ± 0.9 months with about 5 months due to the wetlands and rest due to the lakes. Furthermore, Lake Mai-Ndombe and its surrounding wetland explain the large τd of 9.3 ± 0.2 months and τt of 10.1 ± 0.8 months in the Kasaï.

Discussion

Internal plausibility assessment

Our results provide a robust regional estimate of the contemporary distribution of the total amount of freshwater available across the Congo Basin. Due to the nature of the components we aim to quantify and due to the lack of large-scale observations for all storage compartments in the Congo Basin, a direct validation against independent measurements is not feasible. Nevertheless, our findings are supported by the following pieces of evidence. Firstly, our independent analysis for the sub-basins yielded an accumulated result that is, within the range of uncertainties, very similar to the full-basin analysis. This confirms the internal consistency of our method across different scales. Secondly, we found larger hydraulic time constants for sub-basins that host larger open surface water bodies, suggesting that our results are consistent with the physical understanding of a hydrological system. Thirdly, Fig. 5 shows the GRACE-derived lowest recorded mean equivalent water height anomaly in each 0. 5 × 0. 5 grid cell, varying between − 5 and − 140 mm. These values represent the driest state of each grid cell, regardless of the months they were recorded. Multiplied by the area of each grid cell and summed over the entire Congo Basin, we obtain an estimate of a storage level −316 km3. This represents the potential all-time lowest recorded Congo’s water storage level with respect to its long-time average (during the GRACE and GRACE-FO era). The fact that this value 316 km3 is smaller than the estimated TDWS 476 ± 10 km3 adds further credibility to our result. The pattern in Fig. 5 is also consistent with the estimated TDWS values (Table 1), with Kasaï and Lualaba-South showing larger anomalies as compared to Lualaba-North, Lualaba-Lukuga, and Sangha.

Fig. 5: Lowest mean monthly equivalent water height anomaly recorded by GRACE.
figure 5

Lowest mean monthly equivalent water height anomaly throughout 2002–2017 recorded by GRACE for each grid cell of the Congo Basin, in which Kasaï and Lualaba-South show lower values compared to other sub-basins.

Congo versus Amazon

The only other basin, for which a TDWS estimate is available is the Amazon Basin25. Since Congo and Amazon are both catchments dominated to a great extent by similar tropical climate34, rainforest vegetation pattern, and geomorphology, we provide here a mutual assessment of the results. Comparing the estimates over these two basins, we obtain a ratio of about 25% between Congo’s TDWS 476 ± 10 km3 to Amazon’s TDWS 1766 ± 47 km3 (Table 2). This is roughly equivalent to the ratio estimated between their mean annual discharge to the ocean (Congo 40000 m3/s, Amazon 209,000 m3/s), and also the ratio between the amplitude of their water storage anomaly (Congo 40 mm, Amazon 170 mm) as well as the ratio between their recharge (P − ET) (Congo 300 mm/yr, Amazon 1100 mm/yr)35. In addition, the hydraulic time constant of 4.3 ± 0.2 months for the Congo basin (Table 1) is similar to the Amazon basin 4.4 ± 0.12 months25. In the absence of ground truth data to directly validate our results, these comparisons reinforce the plausibility of our estimates.

Table 2 Magnitude of the different hydrological parameters in the Congo and Amoazon basins.

Implications for water resource availability

We compare our results with the FAO Aquastat database which provides an estimate of renewable groundwater resources of 421 km3/yr for the Democratic Republic of the Congo’s3. In principle, this quantity is different from the TDWS we estimate. However, since we have excluded LWSA and WWSA from TWSA and since we performed our calculations using monthly mean values averaged over the years, our TDWS value can be seen primarily as an estimate of total drainable water for storage with relatively long retention times, such as groundwater36. Assuming that the renewable groundwater resources are stationary over the years, this hypothesis allows us to compare its value with our TDWS estimate. The agreement between the two numbers (421 and 476 km3) brings considerable credibility to our results and enables a cross-check between two sources obtained with very different methodologies.

Another insight that one can gain by comparing our TDWS estimates with the FAO AQUASTAT values is the time required for the Congo Basin to reach a hypothetical zero discharge in the event of an unimaginable scenario where precipitation halts or is drastically reduced. The variability of the rainy season, precipitation patterns37,38,39,40,41 and atmospheric processes42 over the Congo basin are still poorly understood, which leads to large uncertainties in quantifying current and future rainfall over the region43,44. Given that45 estimated the surface water storage annual change to be of 80 km3/yr for the entire Congo Basin, we can assume that, at the surface, only a few km3 remain in the rivers at low flow. This estimate can be roughly taken as an estimate for total renewable surface water. Adding this value to FAO Aquastat’s renewable groundwater resources 421 km3/yr, and assuming that the DRC largely represents the Congo Basin, it gives a total water storage of about 500 km3/yr. Comparing this number to our estimated TDWS, this suggests that, assuming that the discharge rate stays constant in the unimaginable no-rain scenario the basin will be drained after about one year, at which point discharge will become zero.

Limitations

Since we deal with an assessment of freshwater availability and variability in a poorly gauged basin based on large-scale remote sensing data, inevitably, there are limitations and uncertainties in our estimates and the interpretations of our results. The main uncertainties of our results are due to:

  • cyclostationarity assumption of hydrological process

  • estimation of discharge using legacy data for certain sub-basins

  • inherent uncertainty in spaceborne measurements.

They are described and discussed in detail below:

  1. 1.

    Our results are based on the mean monthly values of both discharge and DWSA. However, due to poor data availability, these monthly averages are calculated over different time periods for different sub-basins and data sets. This might lead to uncertainty in our estimates due to the assumption of cyclostationarity. Such an assumption might not be the best choice as the Congo Basin experiences long-term variability and changes in its hydrological cycle46, due for instance to anthropogenic activities such as deforestation47. While deforestation rates remain relatively low in the basin, they reach high levels in some parts, particularly in savanna and gallery forests and especially near urban centers3. In addition, the Congo Basin has undergone extreme events in the last years, which may cast doubt on the cyclostationnarity assumption.

  2. 2.

    Due to the poor data availability, we estimated river discharge time series of Kasaï, Lualaba-North, Lualaba-South and Lualaba-Lukuga (Table S2) by developing rating curves between satellite altimetry data after 2002 and legacy discharge data before 1991. Again, here, we made an assumption of stationarity in the long-term behaviour of discharge for these sub-basins. However, any deviation from stationarity would lead to uncertainty in the estimated discharge from altimetric data. Nevertheless, we argue that, in terms of the overall dynamics and magnitude, our discharge estimates (Fig. S4) are consistent with precipitation (Fig. 4) and water storage anomalies (Fig. 4). Another source of uncertainty in this regard could arise from a different dynamic between altimetry-derived water level and legacy discharge. For example, if the selected virtual station is far away from the discharge gauge, resulting in a water transit time of more than one month, the good performance of the discharge estimation on the monthly time scale cannot be guaranteed48. To this end, the virtual stations listed in Table S2 are selected in such a way that longer than one month transit time between the gauge and the virtual station is avoided.

  3. 3.

    Due to the coarse spatial resolution of GRACE observations, in addition to its inherent uncertainty, the results over smaller sub-basins are prone to signal leakage error from neighboring areas. Although we dealt with this error, some uncertainties due to leakage might persist. Finally, it should be noted that LWSA and WWSA estimates also have their own uncertainties, mainly due to the inherent uncertainties of the observations that are used for their calculations, such as uncertainties in satellite imagery and altimetry observations.

Final word and way forward

Our study quantifies the Total Drainable Water Storage of the Congo Basin. It provides a new and robust regional estimate of the contemporary distribution of the water across the basin, improving our basic understanding of its spatial distribution throughout its sub-basins. While our results do not reflect the total amount of water available in the form of fossil groundwater or deeper groundwater storage, they do provide an indication of the fraction of water that is coupled with the drainage system and can be accessed from modern and young groundwater layers49. Unlike most groundwater stored in deep layers, for instance, in marine sediments that are brackish or saline, TDWS represents freshwater available in the soil layers. Therefore, our estimates provide a new basis for future research and applications addressing sustainable water supply management and agricultural water use needs in the Congo region. In particular, it provides new insight for addressing the challenge to develop strategies for transboundary water management and better managing acute water resource problems in the basin, especially in rural area50. Our TDWS results are also important for assessing the basin’s vulnerability to land use practices, pressure on resources, and climate change. Such assessments are necessary to enable long-term water resource planning and infrastructure investments and to sustain water demand in a region with rapidly changing demography. Many challenges related to water security, especially in rural areas are currently emerging in most parts of the Congo Basin, and groundwater remains a major means of water supply to support rural growth. In this regard, our results will give a basis for developing strategies that aim at optimizing the use of freshwater resources.

Materials and methods

Data

Total water storage anomaly from GRACE

To obtain TWSA from GRACE for the time period of 2002–2017, we use the spherical harmonic coefficients from the Institute of Geodesy at Graz University of Technology (ITSG)-Grace2018, which outperforms other available solutions in terms of the noise of mid-to-high degrees spherical harmonics51,52. Given that the GRACE estimates of the lowest degree zonal harmonic coefficient are not accurate, we replace the GRACE coefficients C2,0 and C3,0 with the coefficients obtained from the Satellite Laser Ranging (SLR) data53. Furthermore, since GRACE does not provide an estimate for degree 1, we add the coefficients for degree 1 according to the estimate of ref. 54. Then, the long-term average of each of the spherical harmonic coefficients between 2004 and 2009 is used as a reference to obtain the spherical harmonic coefficient anomaly. Due to imperfect tidal models, the GRACE solutions are contaminated by a primary and a secondary tidal error, resulting in errors in the estimated TWSA over the Congo of up to one 8 mm (see Fig. 4.40 in ref. 55). Therefore, we eliminated the primary and secondary tidal alias errors of the main tidal components S1, S2, P1, K1, K2, M2, O2, O1, and Q1 from the monthly GRACE solutions using a least-squares Fourier analysis55. The coefficients are then filtered using a Gaussian filter with radius 350 km and the destriping filter proposed by ref. 56. The spherical harmonic coefficient anomalies are then turned into the spatial domain to obtain TWSA57. When Glacial Isostatic Adjustment (GIA) is considered as predicted by the ICE6G-D model58, a contemporary geoid rate of about −0.1 mm/year is expected over the Congo. Therefore, to avoid the GIA-driven trend in the TWSA time series, we corrected GIA using the model by ref. 59. Finally, the basin-wise leakage error is mitigated by the so-called data-driven method developed by ref. 60.

Lake water storage anomaly

We estimate the monthly lake volume anomaly for Lake Mweru, Lake Upemba, Lake Bangwelu, Lake Tanganyika, Lake Kivu and Lake Mai-Ndombe, which are located in the Lualaba-South, the Lualaba-Lukuga, and the the Kasaï (see Fig. 1 and Table 3) by combining time series of surface water extent and water levels of the lakes provided by the HydroSat database61 and accessible at http://hydrosat.gis.uni-stuttgart.de and the Hydroweb, accessible at https://hydroweb.theia-land.fr.

Table 3 Lakes in the Congo Basin.

Note that Lake Tumba in the Middle Congo is not included in the analysis as Sentinel-3B is the only altimetry data available there as of mid-2018. However, according to Hydroweb, the amplitude of volume variation reaches its maximum to 0.5 km3, which is within GRACE noise and can be neglected for DWSA estimation.

The lake water volume variation ΔV is then estimated by assuming that between two successive pair of measurements (water level H and lake area A), the lake morphology is regular and has a pyramidal shape,62.

$$\Delta V(t)=\frac{1}{3}\left[A(t)+A(t-1)+\sqrt{A(t)A(t-1)}\right]\left[H(t)-H(t-1)\right]$$
(1)

the lake water volume anomaly is then obtained by numerical integration of obtained ΔV. To ensure consistency with respect to the reference period for determining anomaly values, similar to GRACE TWSA, data from 2004 to 2009 are used as a reference. Figure S1 shows the time series of lake water volume anomaly of these lakes. The obtained lake volume estimates are then divided by the area of the corresponding sub-basin to obtain the Lake Water Storage Anomaly (LWSA). Figure 2 shows the time series of LWSA for Kasaï, Lualaba-South, Lualaba-North, Lualaba-Lukuga, and for the entire Congo Basin.

Wetland water storage anomaly

The Wetland Water Storage Anomaly (WWSA) for the Congo River Basin is obtained following the methodology developed by ref. 63,64 based on the hypsometric curve approach. It is based on the combination of Surface Water Extent (SWE) estimates from the Global Inundation Extent from Multi-Satellite (GIEMS-2, ref. 65) with topographic data from the Forest And Buildings removed Copernicus 30m Digital Elevation Model (hereafter FABDEM,66, accessible at https://data.bris.ac.uk/data/dataset/25wfy0f9ukoge2gs7a5mqpq2j7). GIEMS-2 provides monthly estimates of SWE in wetlands with a spatial resolution of 0.25 × 0.25 at the equator (on an equal-area grid, i.e. each pixel covers 773 km2) over 1992–2015. The methodology to estimate WWSA is a four-step process, as described in detail in ref. 64: 1) Cumulative distribution function of elevation values within each pixel of GIEMS-2 from the corresponding subset of FABDEM is established to obtain the hypsometric curves. In other words, for each pixel of GIEMS-2, the DEM elevation values are sorted in ascending order to represent an area-elevation relationship; (2) The obtained FABDEM hypsometric curve are then corrected following63 to ovoid overestimation of WWSA. This is based on the FABDEM-derived maximum elevation amplitude calculated from the corresponding average minimum and maximum of SWE for each pixel of GIEMS-2. For each FABDEM hypsometric curve that tends to overestimate the elevation amplitude in comparison to the corresponding maximum satellite-derived water level amplitude from radar altimetry17, a linear correction is applied to smooth the slope of the hypsometric curve; (3) Each corrected hypsometric curve representing the area - elevation relationship is then converted into the area - surface water storage relationship following a similar formula to (1):

$$V(\alpha )=\frac{1}{3}\mathop{\sum }\limits_{i=1}^{\alpha }({H}_{i}-{H}_{i-1})\cdot (\delta {A}_{i}+\delta {A}_{i-1}+\sqrt{\delta {A}_{i}\cdot \delta {A}_{i-1}})$$
(2)

where V is the Surface Water Storage (SWS) in km3 for a percentage of surface water extent α. Note that the incrementation is on a step of 1%. δA is the 773 km2 area of GIEMS-2 pixel, and H represents the elevation (in km) for a percentage of surface water extent α given by the hypsometric curve; 4) To estimate the surface water storage for each pixel of GIEMS-2, the area-surface water storage relationship is then merged with the monthly variations of SWE. At the intersection of the hypsometric curve and the SWE for each month corresponds the surface water storage for that month. Note that for each pixel, the SWS is computed in reference to the minimum surface storage obtained over the period of observations. For each wetland, we then isolate the SWS variations and remove the long-term mean of 2004–2009 (to be consistent with TWSA and LWSA) to obtain the Surface Water Storage Anomaly (SWSA). Figure S2 shows SWSA time series over the selected wetlands Mai-Ndombe (18550 km2), Bangwelu (30150 km2), Mweru (10820 km2), Mweru Wantipa (4640 km2) and Upemba (12370 km2) for the period 1992–2015 in km3. With the exception of the Mai-Ndombe wetland, which is in the Kasaï sub-basin, the rest are in Lualaba-South, covering an area of 57980 km2. For Lualaba-South, we sum the SWSA time series from Bangwelu, Mweru, Mweru Wantipa, and Upemba, remove the long-term mean of 2004–2009, and then divide it by the area of Lualaba South to obtain Wetland Water Storage Anomaly (WWSA) in terms of equivalent water height (Fig. 2). Over Kasaï sub-basin, WWSA is determined by dividing the SWSA of Mai-Ndombe wetland by the area of Kasaï sub-basin (Fig. 2).

River discharge

Table S2 lists the used discharge data of the outlet of each basin. For the Kasaï, the Lualaba-North, the Lualaba-South and the Lualaba-Lukuga discharge data are available only up to 1959 (shaded gray). For these sub-basins, we estimated discharge Q based on water level from satellite altimetry H using the quantile mapping approach proposed by ref. 48. There, we obtain the so-called rating curve function Q = F(H) by turning both data into their quantile functions. The quantile functions of discharge and altimetric water level have the same x-axis, which is the cumulative probability. Therefore, by connecting their y-axis directly, we can obtain F(. ) (See Fig. 6 in ref. 48). Since this approach does not explicitly include the time coordinate, the requirement for synchronous datasets becomes obsolete. This means that pre-satellite river discharge data sets can be salvaged and converted into usable data for the satellite altimetry time frame.

The location of the selected altimetric virtual stations are depicted as yellow dots in Fig. 1. Figure S3 shows the obtained rating curve and Fig. S4 represents estimated altimetric discharge for these 4 basins.

Precipitation

We obtain precipitation data over Congo and its sub-basins from Global Precipitation Climatology Centre (GPCC) dataset67 from 1990–2019. GPCC is acquired from more than 85,000 stations worldwide, and several studies have already used GPCC as a reference to compare precipitation products globally and regionally68,69. Figure 4 shows monthly mean time series (climatology) of precipitation and Fig. S7 shows their distribution over the Congo River Basin and its subbasins.

Methodology

Our analysis is guided by the findings of refs. 25,70,71 which demonstrated that the relationship between water storage anomaly and discharge at the outlet of a basin can be described as a linear, time-independent system. Such a linear relationship allows the estimation of the active water storage in the basin, or the corresponding storage point at which storage-induced discharge approaches zero. This level, in turn, operationally defines the TDWS of a watershed. Here, in addition to the entire watershed, we also perform our analysis over Congo’s 5 major sub-basins. Due to its complexity and very different characteristics as present in the precipitation pattern (Fig. S7) and evident in the discharge-storage relationship (Fig. 3), the analysis over the Lualaba sub-basin is further carried out individually and this basin is treated with respect to its (sub)sub-basins defined as Lualaba-North, Lualaba-South and Lualaba-Lukuga. Table S1 lists the sub-basins’ main characteristics with their mean annual precipitation, discharge, evapotranspiration and dominant climate. For each basin (or sub-basin), we apply a similar methodology to quantify TDWS, which is illustrated in Fig. 6. We analyze the relationship of mean monthly river discharge at the outlet (altimetric and in situ) with the mean monthly water storage anomaly. Locations of the gauge records are displayed in Fig. 1. See SI for more details on the data.

Fig. 6: Flowchart of estimating TDWS.
figure 6

Flowchart of estimating the upper bound and lower bound of Total Drainable Water Storage (TDWS). Data inputs are depicted as rectangles with white backgrounds. Rounded rectangles with light gray backgrounds show estimation steps and rectangles with dark gray backgrounds represent the output of the algorithm.

Drainable water storage anomaly

The obtained lake and wetland volume estimates (Fig. S1 and Fig. S2) are divided by the area of the corresponding sub-basin (Table 3) to obtain the Lake Water Storage Anomaly (LWSA) and Wetland Water Storage Anomaly (WWSA). For the quantification of Drainable Water Storage Anomaly (DWSA) we subtract the time series of LWSA and WWSA from the time series of GRACE-based Total Water Storage Anomaly (TWSA). Note that there are two approaches for subtracting LWSA and WWSA from the TWSA: 1) subtracting them from the corresponding GRACE grid and then aggregating over the basin, or 2) subtracting at the level of aggregated time series of a basin. The first method would face the problem of mismatch in the resolution of GRACE and satellite imagery. Therefore, the time series of LWSA and WWSA over a basin are subtracted from the aggregated TWSA time series of that basin.

Estimation of total drainable water storage and basin resistance

Following the method proposed by ref. 25,70 we aim to obtain a linear relationship between the basin’s river discharge at the outlet, Q, and its Drainable Water Storage Anomaly (DWSA) ΔSd. We perform all our analysis based on the relationship of mean monthly values of discharge with the mean monthly values of DWSA. We select mean monthly values because different datasets with different available time periods are involved to obtain the input data. Moreover, since water storage anomaly is obtained from GRACE and it is insensitive to fast hydrological changes, the equation [(3)] only holds if we filter out fast discharge variations due to individual rainfall events. For this reason25, considered a baseflow for the calculation of TDWS. In this study, the use of mean monthly values instead of monthly values ensures that fast discharge fluctuations are filtered out. Assuming that water stored in soils and near the surface will drive runoff into local streams, we can write:

$$Q(t)=\frac{1}{{\tau }^{{{{{{{{\rm{d}}}}}}}}}}\left({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}+\Delta {S}^{{{{{{{{\rm{d}}}}}}}}}(t+\Delta t)\right)=\frac{1}{{\tau }^{{{{{{{{\rm{d}}}}}}}}}}\left({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}+\Delta {S}_{\phi }^{{{{{{{{\rm{d}}}}}}}}}(t)\right),$$
(3)

where τd is a constant factor, \(\Delta {S}_{\phi }^{{{{{{{{\rm{d}}}}}}}}}\) is the lag-removed ΔSd and \({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) is the x-axis intercept satisfying Q = 0 representing the Total Drainable Water Storage (TDWS). Note that the same equation as above can be set up for TWSA, where τd and \({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) become τt and \({S}_{0}^{{{{{{{{\rm{t}}}}}}}}}\). Since the discharge is a flux quantity and storage is not, τd is naturally given the unit of time, which can be automatically interpreted as a hydraulic time constant.

In order to estimate \({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) and τd, we first need to obtain ΔSϕ and remove the time lag between ΔSd and Q. To this end, we decompose the time series of Q and ΔSd into harmonics with frequencies \({\omega }_{i}=\frac{2\pi \,\times\, i}{365}{{{{{{{\rm{rad}}}}}}}}/{{{{{{{\rm{day}}}}}}}}\) with i = 1, 2, …, 6

$$\Delta {S}^{{{{{{{{\rm{d}}}}}}}}}(t)={c}_{0}^{{{{{{{{\rm{s}}}}}}}}}+{c}_{1}^{{{{{{{{\rm{s}}}}}}}}}t+\mathop{\sum }\limits_{i=1}^{6}{a}_{i}^{{{{{{{{\rm{s}}}}}}}}}\cos ({\omega }_{i}t+{\phi }_{i}^{{{{{{{{\rm{s}}}}}}}}})$$
(4)
$$Q(t)={c}_{0}^{{{{{{{{\rm{q}}}}}}}}}+c{a}_{1}^{{{{{{{{\rm{q}}}}}}}}}t+\mathop{\sum }\limits_{i=1}^{6}{a}_{i}^{{{{{{{{\rm{q}}}}}}}}}\cos ({\omega }_{i}t+{\phi }_{i}^{{{{{{{{\rm{q}}}}}}}}})$$
(5)

Least-squares spectral analysis72 is then used to estimate unknown parameters \({c}_{0}^{{{{{{{{\rm{s}}}}}}}}}\), \({c}_{1}^{{{{{{{{\rm{s}}}}}}}}}\), \({a}_{i}^{{{{{{{{\rm{s}}}}}}}}}\), \({\phi }_{i}^{{{{{{{{\rm{s}}}}}}}}}\), \({c}_{0}^{{{{{{{{\rm{q}}}}}}}}}\), \({c}_{1}^{{{{{{{{\rm{q}}}}}}}}}\), \({a}_{i}^{{{{{{{{\rm{q}}}}}}}}}\) and \({\phi }_{i}^{{{{{{{{\rm{q}}}}}}}}}\). Once the unknown parameters are obtained the lag-removed drainable water storage anomaly can be obtained by using the phase of discharge \({\phi }_{i}^{{{{{{{{\rm{q}}}}}}}}}\) for ΔSd, thus forcing all spectral components to be in phase.

$$\Delta {S}_{\phi }^{{{{{{{{\rm{d}}}}}}}}}(t)={c}_{0}^{{{{{{{{\rm{s}}}}}}}}}+{c}_{1}^{{{{{{{{\rm{s}}}}}}}}}t+\mathop{\sum }\limits_{i=1}^{6}{a}_{i}^{{{{{{{{\rm{s}}}}}}}}}\cos ({\omega }_{i}t+{\phi }_{i}^{{{{{{{{\rm{q}}}}}}}}})$$
(6)

Unlike the standard cross-correlation analysis for time lag estimation, which gives a time lag dictated by the dominant frequency, the above method guarantees synchronization of both time series for all hydrologic dynamics. τd and \({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) are then estimated by implementing a Gauss-Helmert Model (GHM) using Q and \(\Delta {S}_{\phi }^{{{{{{{{\rm{d}}}}}}}}}\) as observations.

In the GHM, we add two terms eQ and eΔS, to equation (3), representing the error in discharge and DWSA, respectively. For ease of reading of GHM implementation, we use τ below, which could be τd or τt, and S0, which could be \({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) or \({S}_{0}^{{{{{{{{\rm{t}}}}}}}}}\).

$$Q-{e}_{{{{{{{{\rm{Q}}}}}}}}}-\frac{1}{\tau }({S}_{0}+\Delta {S}_{\phi }-{e}_{\Delta {{{{{{{\rm{S}}}}}}}}})=f(\tau ,{S}_{0},{e}_{\Delta {{{{{{{\rm{S}}}}}}}}},{e}_{{{{{{{{\rm{Q}}}}}}}}})=0.$$
(7)

we now have an implicit functional relationship f(.) with unknown parameters τ, S0, eΔS, eQ. We can linearize it by splitting up the quantities

$$\tau = {\tau }^{0}+\delta \tau ,\,\,{S}_{0}={S}_{0}^{0}+\delta {S}_{0}\\ {e}_{{{{{{{{\rm{Q}}}}}}}}} ={e}_{{{{{{{{\rm{Q}}}}}}}}}^{0}+\delta {e}_{{{{{{{{\rm{Q}}}}}}}}},\,\,{e}_{\Delta {{{{{{{\rm{S}}}}}}}}}={e}_{\Delta {{{{{{{\rm{S}}}}}}}}}^{0}+\delta {e}_{\Delta {{{{{{{\rm{S}}}}}}}}}$$
(8)

and choose a Taylor point related to both parameters and uncertainties: τ0, So0, \({e}_{{{{{{{{\rm{R}}}}}}}}}^{0}\) and \({e}_{\Delta {{{{{{{\rm{S}}}}}}}}}^{0}\). Linearization using a Taylor series and the Taylor points yields

$$f={f}^{0}+{\left.\frac{\partial f}{\partial \tau }\right\vert }_{0}\delta \tau +{\left.\frac{\partial f}{\partial {S}_{0}}\right\vert }_{0}\delta {S}_{0}+{\left.\frac{\partial f}{\partial {e}_{{{{{{{{\rm{Q}}}}}}}}}}\right\vert }_{0}\left({e}_{{{{{{{{\rm{Q}}}}}}}}}-{e}_{{{{{{{{\rm{Q}}}}}}}}}^{0}\right)+{\left.\frac{\partial f}{\partial {e}_{\Delta {{{{{{{\rm{S}}}}}}}}}}\right\vert }_{0}\left({e}_{\Delta {{{{{{{\rm{S}}}}}}}}}-{e}_{\Delta {{{{{{{\rm{S}}}}}}}}}^{0}\right)$$
(9)

After reshaping we have

$$f = \underbrace{f^0-\left.\frac{\partial f}{\partial e_{{{{{\mathrm{Q}}}}}}}\right|_0 {e_{{{{{\mathrm{Q}}}}}}^0}-\left.\frac{\partial f}{\partial {e_{{{\Delta}}{{{{{\mathrm{S}}}}}}}}}\right|_0 {e_{{{\Delta}}{{{{{\mathrm{S}}}}}}}^0}}_{{{w} \atop {m \times 1}}}+ \underbrace{\left[ \frac{\partial f}{\partial \tau}\left|{\,\!}_{0} \quad \frac{\partial f}{\partial S_0}\right|{\,\!}_0\right]}_{{{A}\atop{m \times 2}}} \underbrace{\left[ \begin{array}{c}\delta \tau \\ \delta S_0 \end{array}\right]}_{{{\delta x}\atop{2 \times 1}}} \\ + \underbrace{\left[ \begin{array}{ccccccc}\left.\frac{\partial f^1}{\partial e_{{{{{\mathrm{Q}}}}}}}\right|{\,\!}_0 & \left.\frac{\partial f^1}{\partial {e_{{{\Delta}}{{{{{\mathrm{S}}}}}}}}}\right|{\,\!}_{0}&0&0&\ldots &0&0\\ 0&0& \left.\frac{\partial f^2}{\partial e_{{{{{\mathrm{Q}}}}}}}\right|{\,\!}_{0} & \left.\frac{\partial f^2}{\partial {e_{{{\Delta}}{{{{{\mathrm{S}}}}}}}}}\right|{\!\,}_{0}&\ldots &0&0\\ \ldots &\ldots &\ldots &\ldots &\ldots &\ldots &\ldots \\ 0&0&0&0&\ldots & \left.\frac{\partial f^m}{\partial e_{{{{{\mathrm{Q}}}}}}}\right|{\,\!}_{0} & \left.\frac{\partial f^m}{\partial {e_{{{\Delta}}{{{{{\mathrm{S}}}}}}}}}\right|{\!\,}_{0} \end{array}\right]}_{{{B^{\mathsf{T}}}\atop{m \times 2m}}} \underbrace{\left[\begin{array}{c} {e_{{{{{\mathrm{Q}}}}}}}^1 \\ {e_{{{\Delta}}{{{{{\mathrm{S}}}}}}}}^1 \\ \ldots \\ {e_{{{{{\mathrm{Q}}}}}}}^m\\ {e_{{{\Delta}}{{{{{\mathrm{S}}}}}}}}^m \end{array}\right]}_{{{{\delta v}}\atop {2m \times 1}}}=0$$
(10)

Then, we have

$${w+A\delta x+B}^{{\mathsf{T}}}\delta v=0$$
(11)

The uppercase numbers 1,2,..,m in the matrix BT and vector δv are an index into observations of discharge and drainable water storage anomaly. For solving the formulated problem (11) and performing the adjustment we minimize

$${\delta v}^{{\mathsf{T}}}P\delta v$$
(12)

where

$$P=\left[\begin{array}{cccccc}1/{\sigma }_{{\Delta {{{{{{{\rm{S}}}}}}}}}_{1}}^{2}&0&0&\ldots &0&0\\ 0&1/{\sigma }_{{{{{{{{{\rm{Q}}}}}}}}}_{1}}^{2}&0&\ldots &0&0\\ .&.&.&.&.&.\\ 0&0&0&\ldots &1/{\sigma }_{{\Delta {{{{{{{\rm{S}}}}}}}}}_{m}}^{2}&0\\ 0&0&0&\ldots &0&1/{\sigma }_{{{{{{{{{\rm{Q}}}}}}}}}_{m}}^{2}\end{array}\right]$$
(13)

The corresponding Lagrangian reads

$$L(v,\delta x,\lambda )={\delta v}^{{\mathsf{T}}}P\delta v-2{\lambda }^{{\mathsf{T}}}({B}^{{\mathsf{T}}}\delta v+A\delta x+w).$$
(14)

Setting the gradients of the Lagrangian to zero yields the equations

$$\left[\begin{array}{ccc}P&0&B\\ 0&0&{A}^{{\mathsf{T}}}\\ {B}^{{\mathsf{T}}}&A&0\end{array}\right]\left[\begin{array}{c}\delta v\\ \delta x\\ \lambda \end{array}\right]=\left[\begin{array}{c}0\\ 0\\ -w\end{array}\right]$$
(15)

by solving (15) we obtain δx and δv, using which we update τ, S0, eQ and eΔS. We then iterate the whole procedure till we obtain small updates, i.e 10−13 for the norm of δx and δv. After the final iteration, we obtain the estimated hydraulic time constant \({\hat{\tau }}^{{{{{{{{\rm{d}}}}}}}}}\) and TDWS \({\hat{S}}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) (Fig. 3), for which uncertainties of both water level and discharge are taken into consideration. Table 1 shows the estimated \({\hat{S}}_{0}^{{{{{{{{\rm{d}}}}}}}}}\), \({\hat{S}}_{0}^{{{{{{{{\rm{t}}}}}}}}}\), \({\hat{\tau }}^{{{{{{{{\rm{d}}}}}}}}}\) and \({\hat{\tau }}^{{{{{{{{\rm{t}}}}}}}}}\) for Congo and its sub-basins. Since the lakes and wetlands of the Congo Basin are distributed among the Lualaba-South, Lualaba-Lukuga, and Kasaï sub-basins, the estimates of \({S}_{0}^{t}\) and \({S}_{0}^{{{{{{{{\rm{d}}}}}}}}}\) differ only for these three sub-basins. Figures S5 and S6 show the discharge-storage relationship for Congo River Basin and its sub-basins for both cases of the TWSA and the DWSA, respectively.