Introduction

The purpose of constructing a large number of reservoirs around the world is to control river floods, store water for irrigation, and generate hydroelectric power. However, this inevitably has an impact on the hydrological processes of river ecosystems1. The construction and operation of reservoirs in river basins have a significant impact on river flow, sediment transport, channel morphology, and riverine ecosystems2,3,4,5,6,7. Some scholars, after evaluating the impact of dam construction on downstream water conditions, have pointed out that the duration and magnitude of river flow and peak flow change after the construction of dams8,9,10,11,12,13,14. After the construction of the Garrison Dam on the upper Missouri River in the United States, the peak flow decreased from an extreme value of 10,279 m3/s upstream of the dam to 4390 m3/s12. In the Mekong River, reservoirs cause an increase in average monthly flow by 25–160% during the non-flood season (December–May) and a decrease by 3–53% during the flood season (June–October)13. Some studies14,15,16,17,18 evaluating the impact of dam construction on hydrological conditions and sediment transport have found that the construction of dams has led to significant changes in sediment flux in many of the world’s major rivers, such as the Mississippi River19 in the United States, the Nile River20 in Egypt, the Yangtze River21,22, and the Yellow River23,24,25 in China. Sediment trapping by dams in the global hydrologic north has contributed to global sediment flux declines to 49% of pre-dam conditions26. In addition, in the early twenty-first century, approximately 25–30% of the sediment flux in coastal environments was intercepted by existing dams27,28. Dams and reservoirs have greatly altered the water discharge and sediment conditions in their downstream rivers.

To adapt to the changes in water and sediment caused by dam construction, downstream river sections may experience phenomena such as riverbank erosion and water level adjustments29,30,31. The stability of river ecosystems, production and domestic water consumption in coastal areas, the ecological environment, and the safety of flood control are all directly impacted by changes in water level32. Due to the river channel’s low water level, it is challenging to support life and economic activity near the coast; on the other hand, a high water level will inundate the floodplain and raise the risk of flooding33.The change of water level is affected by many factors, such as the scouring and sedimentation of the river bed, the constriction of the river channel, the change in roughness, and the backwater caused by the dense vegetation. These are all important factors in the change of water level. However, aside from vegetation, dam development is generally to blame for the changes in the aforementioned variables. After the construction of the Aswan Dam on the Nile River in Egypt34, the lower river channel was cut down by an average of 0.45 m, and the water level of the same flow showed a downward trend. Similarly, the Tagliamento, Piave and Brenta rivers in Italy were cut down 4–6 m after the dam was put into use35. Although the flood flow of the Mississippi River in the United States in 2011 was lower than it was in 1927 and 1973, the flood level rose as a result of backwater brought on by dense vegetation in the floodplain’s upper reaches36. After the construction of a reservoir in the Missouri River in the United States, the low water level fell by more than 2.5 m37, while the downstream flood level around Kansas City rose by nearly 1 m. In China, the Danjiangkou Dam on the Han River and the Three Gorges Dam on the Yangtze River33,38,39,40 both lowered the water level at the same river discharge at the downstream hydrological stations. However, after the completion of the dam, the changes in water levels across different rivers were not uniform33. The middle sections of the Yangtze River’s high water level have decreased since the Three Gorges Dam began operating, however the flood water level under the same flow has not showed a declining trend41 and some studies even indicate that the water level has increased42. In short, affected by many factors, the change of water level itself is an extremely complex hydrological process.

The hydraulic law indicates that under appropriate conditions, there exists a unique and stable relationship between water level and discharge, which can be described using a rating curve43. However, due to complex variations in water levels, there is significant uncertainty in the rating curve44. Due to the difficulty in measuring discharge accurately, scholars often use rating curves for discharge prediction43,45,46,47. This curve only requires water level data to indirectly estimate discharge48. Some scholars also use rating curves for tasks such as water depth parameter calibration49, calculating annual peak water levels50, and predicting tidal river water levels51. The Yellow River was once the river with the highest sediment load in the world. However, since the construction and operation of the Xiaolangdi Reservoir, the sediment concentration in the water has significantly decreased due to the interception of sediment by the reservoir52. Furthermore, the Lower Yellow River has transitioned from short-duration high-discharge conditions to long-duration moderate-low discharge conditions. Under the long-term influence of low-sediment water, the downstream river channel has shifted from a state of continuous sedimentation to a state of continuous erosion53. This change has led to great uncertainty in the water level-discharge relationship in the lower reaches of the Yellow River. Currently, scholars have made significant progress in the study of the lower reaches of the Yellow River54,55,56,57,58. However, most of the research has focused on land use changes in the lower Yellow River59,60, variations in sediment element content and heavy metal concentrations in the water61,62, and the coordinated development of multiple systems such as the economic environment63. Yet, the lower Yellow River, being a flood-prone area64,65,66, remains under-researched. Few scholars have investigated and predicted the water level-discharge relationship in the lower reaches of the Yellow River. Therefore, studying the rating curve of key hydrological stations in the lower Yellow River is of great practical significance for flood control in the floodplain. This study proposes a water level prediction method for the HYK statio’s rating curve, which can accurately predict the water level-discharge relationship. This method enables early flood warning during the flood season, thereby ensuring the safety of the lives and property of downstream residents. Insight from the study can be used to predict water levels to better inform decision making about water resources management, and flood emergency response in the lower Yellow River. This study focuses on the variation patterns and influencing factors of the water level-discharge relationship at the HYK hydrological station in the lower Yellow River. The objectives are: A) to use Ensemble Empirical Mode Decomposition (EEMD) to decompose the annual average water level and annual average discharge at the HYK hydrological station, analyze the trends in water level and discharge; B) analyze the corresponding relationship between water level and discharge during different periods, study the impact of different reservoir operations on the rating curve in the lower Yellow River; C) study the influencing factors of changes in the rating curve, and establish a water level prediction method based on the rating curve.

Study area

The Yellow River has a length of 5464 km and a basin size of 752,000 km2. Lower Yellow River reaches, with a river length of 786 km and a downstream area of 23,000 km2, are located below Taohuayu. According to the natural form of the river and the change of river potential, the lower reaches of the Yellow River can be divided into wandering river section (Tiexie–Gaocun), transitional river section (Gaocun–Taochengpu) and curved river section (Taochengpu–Lijin), with the estuary section below Lijin. The division of the Yellow River is shown in Fig. 1a. The hydrological station in the initial downstream reach is the HYK hydrological station. Figure 1b shows the riverbed elevation at different distances within 10,000 m downstream from the HYK cross-section as the starting point, as measured data.

Fig. 1
figure 1

Overview of the study area.

The HYK hydrographic station is a crucial control and observation point along the Yellow River’s lower reaches. A total of 730,000 km2—roughly 97 percent of the Yellow River basin—makes up its control basin. Strong oscillations in this section’s main channel make the morphology and water–sand conditions very sensitive to changes in the water and sand entering the system from upstream. For the purpose of examining the development of the lower Yellow River channel, HYK Station’s examination of water and sand variation serves as both a prerequisite and a starting point. Additionally, the analysis of the rating curve at this section can aid in flood prediction and early warning for the downstream section of the river as well as the upstream reservoir to transfer water and sand. The water potential of this station also partially reflects the actual situation of water and sand entering the downstream.

Daily average water level and daily average river discharge at the HYK section from 1949 to 2020 were obtained from the Yellow River Conservancy Commission. The measured river discharge of flood events from 2006 to 2020 and their corresponding water levels were obtained from the Yellow River Hydrological Yearbook (2006–2020), the data are considered accurate and reliable.

Method and material

Ensemble empirical mode decomposition (EEMD)

For the multi-timescale analysis of time series, Huang67 proposed Empirical Mode Decomposition (EMD) that can separate the fluctuations of different timescales step by step, namely the Intrinsic Mode Function (IMF). This is a new method for handling nonlinear and non-stationary data series. It can linearize and smooth the data, preserving their characteristics during decomposition. The result includes trend components and fluctuation components of various scales, with the fluctuation components defined as IMF across different time scales. Each IMF needs to satisfy the following two conditions: (1) The number of extreme values in the whole signal data sequence must be equal to the number of over zeroes or differ by at most one; (2) At any moment, the average value of the envelope of local maxima (upper envelope) and the envelope of local minima (lower envelope) must be zero. Model mixing means that a single IMF mixes different frequency components, or different IMFs contain the same frequency components, and is a major drawback of empirical model decomposition that is difficult to avoid. To solve this problem, white noise is added to the signal to be decomposed, facilitating the separation of the different scales of the input data68,69. The decomposition steps of EEMD are as follows:

  1. (1)

    The original signal data sequence xi(t) is supplemented with Gaussian white noise wi(t) to obtain the signal to be processed after noise addition. The new sequence can be calculated as Eq. (1).

    $${\text{x}}_{\text{i}}\left({\text{t}}\right)\text{ = x(t)+}{\text{w}}_{\text{i}}\text{(t)}$$
    (1)
  2. (2)

    Decomposition of the new data sequence into IMFs using the method of EMD to decompose the signal to be processed.

  3. (3)

    Repeat the above steps with different white noises.

  4. (4)

    Calculate the average of the corresponding IMFs obtained at each stage as the final result. Refer to Eq. (2) for the calculation.

    $${\text{c}}_{\text{j}}\text{=}\frac{1}{{\text{N}}}\sum_{\text{i=1}}^{\text{N}}{{\text{c}}}_{\text{j,I}}$$
    (2)

    where N is the ensemble times of adding noise, cj reparents the j-th IMF component; cj,I is the i-th IMF when adding the ith noise.

Methods for variance assessment

Ma Bingyan suggested three techniques to determine the variance of various indicators, including the angle method, the area method, and the distance method, in order to assess the sensitivity of the parameters in the Storm Water Management Model (SWMM)70. Due to the influence of rating curve, this study adopts the area method for calculating the variance. Equation (3) is the area method, using curves ca and cb to fit rating curve. Then using area (A) enclosed by the two curves ca and cb represent the difference between the scattered sequences. The larger A is, the greater difference between the two curves. S for degree of variation, the larger S represents a greater degree of variation.

$${\text{S}} = \frac{{\text{A}}}{{{\text{Q}}_{0} - {\text{Q}}_{1} }} = \frac{{\int_{{{\text{Q}}_{0} }}^{{{\text{Q}}_{1} }} {\left( {{\text{C}}_{{\text{a}}} - {\text{C}}_{{\text{b}}} } \right)dQ} }}{{{\text{Q}}_{0} - {\text{Q}}_{1} }}$$
(3)

where S, represent degree of variation by the area methods; A represents the area of the inner envelope between the two curves; Q0 and Q1 are the minimum and maximum discharge of floods in different fields.

Water level forecast method

This study uses the rating curve to predict the water level. Depending on specific factors, such as whether the rating curve is a single value function, the rating curve can be classified as either stable or unstable. The function was fitted by correlation analysis in order to objectively represent the rating curve, and the degree of variation (S) was proposed to highlight the stability of the rating curve following analysis. First, based on the cross-sectional data, water level data, and river discharge data from the HYK station, the width-depth ratio (B), water surface slope (J), dispersion (W), and roughness (n) were calculated. The linear change in the relationship between the daily average water level and discharge at different stages for the HYK section was analyzed to determine the critical value of W under the 95% prediction band, was determined to be 21. If the W value is below 21, it indicates that the water level-discharge relationship points are stable, making the area method suitable for prediction. Conversely, if the W value exceeds 21, the relationship points are scattered, rendering the area method unsuitable for prediction. When W meets the criteria, the area method can be used to determine the envelope range (A) and the variation (S) of the curve. The relationships between B, J, and S are compared, and value of S for different magnitudes for the predicted year is assessed. The water level prediction equation’s parameters (see Eq. 4) are determined through linear regression analysis. Finally, the relationship between the forecasted year water levels and river flows is examined. The prediction technical route is shown in Fig. 2.

$$Z = a*\ln \left( Q \right) + b$$
(4)
Fig. 2
figure 2

Water level prediction technology roadmap.

In which, Q represents discharge, and Z represents the water level corresponding to that discharge, a, b are parameters.

Results

EEMD decomposition of water level and river discharge

Five eigenmodal functions (c1–c5 components) and one residual trend term (Res component) were obtained from the EEMD decomposition of a total of 72-years time series of annual mean discharge and annual mean water level from 1949 to 2020 at the HYK section. By using empirical mode decomposition and residual trend terms, periodic variations and trends in mean annual flow and mean annual water level can be obtained. These results are depicted in Fig. 3.

  1. (1)

    The process of annual average discharge and water level in the HYK section is non-linear and non-stationary, as can be observed from the above figure. The time series' annual mean discharge and annual mean water level are both the results of the combined action of several fluctuation components. These components can be divided into five oscillatory components with various fluctuation periods and one trend component, which reflects the multi-timescale characteristics of the annual mean water level and flow variation at the HYK section.

  2. (2)

    The quasi-periods corresponding to the c1 component of the annual mean discharge and the annual mean water level are about 3.13a and 3.27a, the quasi-periods of both the c2 component are 8a, the quasi-periods of both the c3 component are 14.4a, the quasi-periods of the c4 component are about 40a and 42.35a, and the quasi-periods of the c5 component are about 65.45a and 60a.

  3. (3)

    The Res component displays the general trend for both the average annual river discharge and water level. The annual average discharge of the HYK section significantly decreased from 1949 to 2000 as a result of the construction of the major reservoirs; after 2000, however, as a result of the operation of the Xiaolangdi reservoir, the discharge of the HYK section was directly impacted by the discharge of Xiaolangdi and essentially stabilized. The average yearly water level was essentially steady from the 1940s until the 1980s. Due to the management of the Yellow River by the management agencies, the lower Yellow River’s siltation improved after the 1980s and the average annual water level declined quickly.

Fig. 3
figure 3

Annual average discharge and annual average water level through EEMD decomposition.

Influence of reservoirs construction on the rating curve

On the Yellow River’s main course, three reservoirs have been constructed since New China’s foundation. In chronological order, they are Sanmenxia Reservoir (SMXR), Longyangxia Reservoir (LYXR), and Xiaolangdi Reservoir (XLDR). The research period (1950–2020) was divided into four stages based on the time when the reservoir was put into use, namely the natural state (1950–1959), the operation stage of SMXR (1960–1985), and the operation stage of LYXR (1986–1999) and the operation stage of XLDR (2000–2020).

In this study, two representative years from various reservoir operation stages were selected to analysis the relationship between the daily average water level (Z) and discharge in order to reflect the change in the rating curve at the HYK section (Fig. 4). The rating curve is a typical rope type in its natural form (1957 and 1958), and the highest daily average flow in the chosen years is greater than 10,000 m3/s. The rope-shaped curve in the natural state has transformed to a comparatively dispersed single curve after the operation of the SMXR (1979 and 1984). The linear shape has been greatly constrained. Additionally, the maximum daily average flow has fallen by more than 50% as a result of human activity. Although LYXR is a large-scale reservoir on the Yellow River, it can regulate the water and sediment conditions in the downstream reaches of the reservoir, but because it is located in the upper reaches of Qinghai Province, it is more than 1500 km away from HYK Water Section, and there is SMXR between them. Therefore, the influence of LYXR on HYK Section (1989 and 1990) is weakened. From e (2006) and f (2016) of Fig. 4, compared with the previous stage, the rating curve of HYK section has not changed greatly. Xiaolangdi Dam is located 129.7 km upstream of HYK section. It is the closest of the three dams to this section, so it has the strongest constricting effect. It can be seen from the g and h in Fig. 4 that the rating curve of HYK section in 2006 and 2016 are stable single curves.

Fig. 4
figure 4

The linear change of the relationship between the daily average water level and discharge of HYK section at different stages.

Classification of rating curve after the completion of Xiaolangdi Reservoir

The results of fitting the rating curve of several floods that occurred at the HYK station during the course of the previous 14 years are displayed in Fig. 5.

Fig. 5
figure 5

Flood events in different years from 2006 to 2019.

With the exception of a few years, the majority of in-year flood events follow a single curve and are generally stable. Considering the inconsistent changes in the occurrence of floods of different magnitudes, the flood discharge is divided into two grades of 1000–2000 and 2000–4000 m3/s according to the size of flood flow in recent years (The flood events in 2016 and 2017 at the HYK section did not fall between 2000 and 4000 m3/s, so they did not participate in the second level of calculation). Taking 2010, which is in the center of the line cluster, as the benchmark, the variance between the fitted function curves of different years and the fitted function curves of 2010 is calculated. The curves of different years are classified based on the variance. The calculation results are shown in Table 1.

Table 1 The variance between different years and the base year calculated by the area method.

Based on the calculation results in the table, it is easy to see that there has been a change in the rating curve in recent years, indicating a decrease in water level at the same flow rate. In addition to this the curves for higher flow rates are decreasing more sharply. Scatter plots were plotted for fitting (Fig. 6), and the fitted straight lines had a strong downward trend, and all single-value correlations reached a degree of 0.95 or higher.

Fig. 6
figure 6

Trends in the variance of fitted curves at different flood magnitudes.

Influencing factors of rating curve

In general, the rating curve is influenced by the section conditions (roughness, hydraulic radius, overflow area, etc.) and water and sand conditions (sand content, incoming sand coefficient, etc.). In order to predict the range of water level change under the same discharge by using the rating curve through the section condition and sand content of pre-flood, the influencing factors of rating curve are analyzed here from two perspectives of line and fitted curve.

Factors influencing the dispersion of the rating curve

The rating curve is used to reflect the correspondence between discharge and water level. During the course of the study, it was found discovered that the ratio of discharge to water level somewhat predicts the dispersion of the scattered points; more specifically, the larger the ratio of discharge to water level, the higher the dispersion, at which time the rating curve resembles the type of rope set. Therefore, W (ratio of discharge to water level) is used as an index reflecting the degree of scattering dispersion to analyze its closely related section conditions as well as water and sand conditions.

Figure 7 shows the correspondence between W and pre-flood section conditions (roughness, overland flow area, hydraulic radius, and water surface slope). As shown in the figure, the cross-sectional overflow area and the water surface slope from the HYK cross-section to the Tiexi cross-section have no correlation with the rating curve at the HYK. Relatively speaking, both roughness and hydraulic radius have a large effect on W, and the value of W decreases with the increase of roughness and hydraulic radius. This means that the rating curve at the HYK section is more stable as the roughness and hydraulic radius increase.

Fig. 7
figure 7

The correspondence between W and pre-flood section conditions.

Figure 8 shows the correspondence between W and the water and sand conditions (sediment concentration (H) and sand transmission rate (T)) during the flood season. The figure shows that the scattering of water level-discharge points has a high degree of correlation with the sand content and transmission rate. When the sand content is higher than 5 kg/m3, the tendency of dispersion increasing with the sand content decreases; the dispersion increases with the increase of sand transport rate.

Fig. 8
figure 8

The correspondence between W and the water and sand conditions.

Changes in cross-sectional conditions as well as water and sand conditions can cause changes in the degree of scattering of water level-discharge points. But these two types of factors have the opposite effect, the increase of roughness and hydraulic radius will lead to a more stable rating curve, while the increase of sand content and sand transport rate will lead to a more rope set type of rating curve.

Factors influencing the variation between water level and discharge

As above, the area method was used to divide the lower discharge (1000–2000 m3/s) and higher discharge (2000–4000 m3/s) in the rating curve from 2006 to 2020 into two categories each. Here, the factors affecting the rating curve at lower discharge and higher discharge are analyzed in terms of section conditions and water and sand conditions respectively.

Figures 9 and 10 shows the degree of variation of the fitted curve of water level and discharge for each year from 2006 to 2020 at the HYK section for the lower discharge and the higher discharge and the degree of fitting between various factors for the lower discharge and higher flow rate. The figure shows that there is a high degree of correlation between both width-to-depth ratio and water surface slope and the degree of variance, while the influence factors in other section conditions and discharge conditions have a low effect on the degree of variance. As the aspect ratio decreases, the variance turns negative and the absolute value gradually increases. In contrast, the variance shifts to negative values and increases in absolute value as the water surface slope decreases. The R2 values for the high flow in the width-depth ratio and water surface drop plots are 0.91 and 0.88, respectively. For the low flow, the R2 values in the width-depth ratio and water surface drop plots are 0.89 and 0.93, respectively. Compared with the fitting degree of low flow, the fitting degree of the high flow and width-to-depth ratio is higher, and the fitting degree of water surface gradient is relatively lower.

Fig. 9
figure 9

Factors influencing the degree of variation at low flow rates.

Fig. 10
figure 10

Factors influencing the degree of variation at high flow rates.

Discussion

Predictive feasibility analysis

The previous analysis identified the factors influencing the scattering of the water level-discharge relationship and correspondence at HYK hydrographic station and analyzed their interactions through curve fitting. Changes in rating curve are influenced by factors such as water and sand and cross-sectional morphology. The above analysis shows that it is possible and reasonable to predict the water level corresponding to the real-time flow at the HYK section by the rating curve. The data of 2020 is used as an example to verify the feasibility of the method prediction. On the basis of the forecast, the forecast values are compared with the actual data in 2020 to analyze the accuracy of the forecast range.

Analysis of prediction results

  1. (1)

    Prediction of the fitted curve equation. According to the known information, the pre-flood width-to-depth ratio B of the HYK section to the upstream Tiexie section in 2020 is 309.58. Equations (5) and (6) depicts the fitted curves of B versus the variance S at low and high flow rates.

    $$S=-0.0009{\times B}^{2}+2.4791\times B-1144\; {R}^{2}=0.7094$$
    (5)
    $$S=-0.0024{\times B}^{2}+5.8126\times B-2276.3\; {R}^{2}=0.7728$$
    (6)

    The predicted variance under low flow in 2020 is − 462.78, and the variance under high flow is − 706.846. The above equation establishes a system of equations to predict the rating curve of HYK in 2018, which is presented as Eq. (7).

    $$Z = 0.8264\ln Q + 83.46$$
    (7)

    In which, Q represents discharge, and Z represents the water level corresponding to that discharge.

  2. (2)

    Prediction of the scattered degree of rating curve. Scattering degree can be calculated according to Eq. (8).

    $$W = 11786n^{2} - 1028.1*n + 31.285$$
    (8)

The roughness of the section of the hydrographic station in HYK before the flood in 2020 is 0.01227. The predicted W value of the HYK section in 2020 is 18.305, which is medium scattering. Plotting the predicted curve against the measured values in the same coordinate system (Fig. 11), it can be found that the predicted results are satisfactory. The difference between the upper and lower envelopes and the predicted result is 0.2%, i.e. the accuracy of the predicted result is within 0.2%. Segmental analysis of different flow levels: when the flow rate is between 1000 and 2800 m3/s, the prediction curve is located in the center of the point group, and the prediction results are more accurate; when the flow rate is between 2800 and 4000 m3/s, the prediction results are generally slightly smaller than the measured values. In general, the difference between predicted water level and actual water level is less than 0.4 m71,72. It can be regarded that the predicted results are less different from the measured values.

Fig. 11
figure 11

Comparison of water level prediction results and measured results.

Conclusions

The rating curve was used to predict water level in the lower reaches of the Yellow River. In this study, a complete and reliable water level prediction method based on the rating curve is proposed, taking the HYK hydrological station as an example. The main conclusions of this paper are as follows.

  1. (1)

    Based on EEMD, the time series of annual mean river discharge and annual mean water level for a total of 72 years, from 1949 to 2020, were decomposed. Mean annual flow in the HYK reach declined significantly before 2000 and stabilized largely after 2000. The mean annual water level was basically stable from the 1940s to the 1980s, and after the 1980s, siltation in the lower reaches of the Yellow River improved, and the water level in the HYK declined rapidly.

  2. (2)

    The construction of the dam resulted in a more stable rating curve at the downstream section. After Sanmenxia Dam, the first dam in the lower reaches of the Yellow River, was put into operation, the daily rating curve at the HYK section changed from a typical rope-jacket type curve to a relatively stable rating curve. After the Long Yangxia Dam and Xiaolangdi Dam were put into operation, the daily average rating curve at the HYK section changed to a more single stable curve in recent years.

  3. (3)

    The influencing factors of the rating curve were analyzed in terms of both the scattering degree and correspondence of the rating curve. Both section conditions as well as water and sand conditions are important influencing factors for the scattering degree of water level discharge points. The increase of roughness and hydraulic radius will lead to a more stable rating curve. In contrast, the increase of sand content and sand transport rate will lead to a reverse pattern.

  4. (4)

    This paper verifies the feasibility of the prediction method by using the actual measurement data in 2020 as an example. The predicted 2020 rating curve from the pre-flood section width-to-depth ratio of the HYK is Z = 0.8264lnx + 83.46. The predicted 2020 rating curve using the correlation analysis method is a moderate dispersion, and the predicted 2020 curve and its outer envelope are plotted. The prediction results are relatively consistent with the measured sequence with high accuracy.