LXGB: a machine learning algorithm for estimating the discharge coefficient of pseudo-cosine labyrinth weir

Emami, Somayeh; Emami, Hojjat; Parsa, Javad

doi:10.1038/s41598-023-39272-6

Download PDF

Article
Open access
Published: 29 July 2023

LXGB: a machine learning algorithm for estimating the discharge coefficient of pseudo-cosine labyrinth weir

Somayeh Emami¹,
Hojjat Emami² &
Javad Parsa³

Scientific Reports volume 13, Article number: 12304 (2023) Cite this article

1567 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

One of the practical and financial solutions to increase the efficiency of weirs is to modify the geometry of the plan and increase the length of the weir to a specific width. This increases the discharge coefficient (C_d) of the weir. In this study, a new weir referred to pseudo-cosine labyrinth weir (PCLW) was introduced. A hybrid machine learning LXGB algorithm was introduced to estimate the C_d of the PCLW. The LXGB is a combination of the linear population size reduction history-based adaptive differential evolution (LSHADE) and extreme gradient boosting (XGB) algorithm. Seven different input scenarios were presented to estimate the discharge coefficient of the PCLW weir. To train and test the proposed method, 132 data series, including geometric and hydraulic parameters from PCLW1 and PCLW2 models were used. The root mean square error (RMSE), relative root mean square error (RRMSE), and Nash–Sutcliffe model efficiency coefficient (NSE) indices were used to evaluate the proposed approach. The results showed that the input variables were the ratio of the radius to the weir height (R/W), the ratio of the length of the weir to the weir height (L/W), and the ratio of the hydraulic head to the weir height (H/W), with the average values of RMSE = 0.009, RRMSE = 0.010, and NSE = 0.977 provided better results in estimating the C_dof PCLW1 and PCLW2 models. The improvement compared to SAELM, ANFIS-FFA, GEP, and ANN in terms of R² is 2.06%, 3.09%, 1.03%, and 5.15%. In general, intelligent hybrid approaches can be introduced as the most suitable method for estimating the C_d of PCLW weirs.

An ensemble-based machine learning solution for imbalanced multiclass dataset during lithology log generation

Article Open access 07 December 2023

Modeling of wave run-up by applying integrated models of group method of data handling

Article Open access 18 May 2022

Utilizing machine learning for flow zone indicators prediction and hydraulic flow unit classification

Article Open access 20 February 2024

Introduction

One of the main concerns of hydraulic engineers is the optimal management of limited water resources, in Iran. The ever-increasing growth of national investment in water projects leads to the optimization of water control and management projects in order to save national capital^1,2,3. In recent years, hydraulic engineers have tried to measure the discharge with proper accuracy by building and installing measuring structures in the channels. One of the common structures in many dams and water transfer channels are labyrinth weirs, which are used for draining, measuring, and controlling the water level^{4, 5}. These types of weirs are among the most practical surface structures, which have recently attracted the attention of various researchers. The pseudo-cosine labyrinth weirs (PCLW) with a long crown have a suitable performance for regulating the water level compared to other weirs. Numerous parameters are effective in determining the C_d in labyrinth weir with different plans. These parameters are related to several factors, including the upstream total hydraulic head (H_u), downstream hydraulic head (H_d), weir height (W), radius (R), number of cycles (N), shape of the weir crest (CR), collision of nape (Na), the approach flow conditions (AF), etc.⁴. Nowadays, several issues, including the increase in costs, time-consuming, and the occurrence of human error, have led to the use of 3D and computer models^{6, 7}. Since manual calculations may involve human error, it is necessary to use novel intelligent methods such as meta-heuristic algorithms, artificial neural networks, fuzzy logic, etc. Several studies have been carried out by researchers in the investigation of the C_d of labyrinth weirs^{8,9,10,11,12,13,14,15}. Considering some structural limitations (such as structure dimensions and weir angle) and using classical calculation methods such as linear and non-linear regression methods, the researchers have determined the C_d of weirs.

Azamathulla and Wu¹⁶ used the support vector machine (SVM) to accurately estimate the longitudinal dispersion coefficients in natural rivers. With a test on real-world datasets, the SVM algorithm is proven to generate encouraging results. In another work, Azamathulla et al.¹⁷ proposed SVM to estimate the C_d in side weirs. The experimental results proved the superiority of the SVM compared with counterpart adaptive neuro-fuzzy inference systems (ANFIS) and artificial neural networks (ANNs). Bilhan et al.¹⁸ estimate the C_dof labyrinth weirs using support vector regression (SVR) and an outlier robust extreme learning machine. The results showed that machine learning methods estimated the C_d values more accurately. Safarrazavizadeh et al.¹⁹ performed a laboratory investigation of the flow on labyrinth weirs with a semicircular and sinusoidal plan. Observations showed that the discharge coefficient in labyrinth weirs with a semicircular and sinusoidal plan, unlike linear weirs, has an upward trend in low water loads (H_T/P < 0.35) and decreases after reaching its maximum value. Bonakdari et al.²⁰ investigated the effectiveness of the gene expression programming (GEP) method for estimating C_d. Results show that the GEP method provides better results in predicting C_d. Shafiei et al.²¹ used the ANFIS-firefly algorithm (ANFIS-FFA) method to estimate the C_d of triangular labyrinth weirs. Results showed that the ANFIS-FFA model is more accurate in predicting the C_d of triangular labyrinth weirs. Emami et al.⁸ estimated the C_dof W-planform labyrinth weirs using the improved self-adaptive differential evolutionary algorithm and support vector regression (ISaDE-SVR) method. ISaDE-SVR is highly effective in estimating the C_d of W-planform weirs. Norouzi et al.²² simulated C_d using a self-adaptive robust learning machine (SAELM) model. The results showed that the SAELM model estimated the C_d with high accuracy. Wang et al.²³, investigated the application of genetic algorithm (GA), particle swarm optimization (PSO), and traditional BP neural network in predicting the C_d of triangular labyrinth weir. The results showed that GA-BPNN and PSO-BPNN methods have high efficiency in predicting C_d. Chen et al.²⁴ used SVM, random forest (RF), linear regression, SVM, k-nearest neighbor (KNN), and decision tree (DT) in predicting the C_d of streamlined weirs. Ahmad et al.²⁵ used the ANN model to predict the C_d of an arced labyrinth side weir. The results indicated that C_d calculated by ANN is more accurate. Emami et al.²⁶ used the Walnut algorithm and SVR method to predict the C_d of triangular labyrinth weirs. Safari et al.²⁷ evaluated ANN, GEP, and regression models to estimate the C_d of the broad-crested weir. The results showed that ANN estimates the C_d better than GEP models and regression models.

In the previous studies, according to the many geometrical models that have been investigated by different researchers, the C_d of PCLW has not been investigated. Therefore, in the present study, by using the intelligent model of the differential evolution (LSHADE) and extreme gradient boosting (XGB) approach, the C_d of the PCLW was estimated. The proposed approach was investigated with different combinations of features to identify the high-performance combination of features.

The contributions of this paper are as follows:

(a)
Introducing the LXGB algorithm, which integrates the LSHADE with XGB to tune the XGB parameters and further enhance its estimation performance.
(b)
Using the LXGB algorithm to estimate the C_d of PCLW. The proposed algorithm models the
(c)
Evaluating the proposed model with a real-world dataset and compared with state-of-the art algorithms. The experimental results show the superiority of the proposed method compared with counterparts in terms of performance measures.

The remaining sections of this study are organized as follows. Section "Material and methods" illustrates the experimental materials and the presented hybrid approach. Section "Results and Discussion" presents the results and discussions. Section "Conclusion" summarizes the paper and supplies recommendations for coming work.

Material and methods

Dimensional analysis

The 1-dimensional equation of the flow on the PCLW is as follows²⁸:

$$Q = \frac{2}{3}\,\mathop C\nolimits_{d} \,L\,\sqrt {2g} \,\,\mathop {\mathop H\nolimits_{T} }\nolimits^{1.5}$$

(1)

where Q is the discharge, g shows the acceleration of gravity, L is the length of the weir, and H_T is the hydraulic height (h + V₂/2 g). The C_d of labyrinth weirs in free flow conditions depends on geometric and hydraulic parameters as follows:

$$C_{d} = \,f_{1} (B,\,L,\,H_{T} ,\,H_{d} ,\,V,W,\,R,\,S,\,t,\,\alpha ,\,N,\,g,\,\rho ,\,\mu ,\,\sigma ,\,CS,\,JS,\,SW)$$

(2)

where B is the channel width, H_d is the total hydraulic height (downstream of the weir), V shows the flow velocity, W indicates the height of the weir, R is the radius of weir curvature, S is the length of the straight part between the curves of the weir, t is the thickness of the weir, α represents the angle of the straight section between the weir curves with the direction of the channel, N indicates the number of cycles, ρ indicates the fluid density, μ the dynamic viscosity, σ shows the surface tension, CS means the shape of the weir crest, JS denotes the shape of the flowing blade, and SW represents the approaching flow and the sidewall effect.

Equation (2) can be written as follows:

$$C_{d} = \,f_{2} \,({\text{Re}} ,\,We,\,Fr,\,\frac{{H_{T} }}{W},\,\frac{{H_{d} }}{W},\,\frac{L}{W},\,\frac{B}{W},\,\frac{R}{W},\,\frac{S}{W},\frac{t}{W},\,\alpha ,\,N,\,CS,\,JS,\,SW)$$

(3)

where Re is the Reynolds number, We mean the Weber number, and Fr is the Froude number. Henderson²⁹ concluded that if Re < 2000, the effect of viscosity can be neglected. Novak et al.³⁰ concluded that if the water height on the weir is more than 3 to 4 cm, the effect of surface tension is ignored. Due to the turbulent flow and minimum water height of 5 cm on the weir, the impacts of the Re and We numbers were removed. The shape of the edge of all used weirs was selected as a sharp-crested, and the effect of CS was ignored. Due to the installation of weirs perpendicular to the main flow and the absence of local contraction at their installation location, the conditions of the approaching SW flow were considered the same for all experiments.

Equation (3). is simplified as the following equation:

$$C_{d} = \,f_{3} \,(\frac{{H_{T} }}{W},\,\frac{L}{W},\,\frac{B}{W},\,\frac{R}{W},\,\frac{S}{W},\,\frac{t}{W},\,\alpha ,\,N)$$

(4)

Experimental models

The simulation of the flow around the PCLW was carried out in a channel with a width, length, and height of 0.49 m to 1.115 m, 3.2 m, and 0.5 m, respectively. In Fig. 1, the PCLW models and their geometric features are shown.

The geometric features and the range of experimental parameters of the PLCW are presented in Table 1.

Table 1 The geometric features of PLCW used in simulations.

Full size table

Extreme gradient boosting (XGB)

XGB^31,32,33 is a robust supervised learning solution to regression, classification, and ranking problems in a fast and accurate way. XGB is a more generalized form of gradient-boosting decision trees. It utilizes parallel processing, resolves missing values efficiently, prevents overfitting, and performs well on datasets of different sizes.

For a given dataset with n examples and m features $D \, = \, \{ f(x_{i} , \, y_{i} )\} \, (\left| D \right| = \, n, \, x_{i} \in R^{m} , \, y_{i} \in R)$, XGB consists of an ensemble of K classification and regression trees (CARTs). The final prediction is formulated as follows³¹:

$$\hat{y}_{i} = \sum\limits_{k = 1}^{K} {f_{k} (x_{i} ),} \, f_{k} \in F$$

(5)

$\hat{y}_{i}$ is the final predictive value, F is the list of CARTs, and $f_{k} (x_{i} )$ is the function of input in the k-th decision tree. In the XGB, the objective function consists of two components: regularization and training error, which are defined as follows³¹:

$$X_{obj} = \sum\limits_{i = 1}^{n} {l(y_{i} ,\hat{y}_{i} )} { + }\sum\limits_{k = 1}^{K} {\Omega (f_{k} )}$$

(6)

where $\sum\limits_{i = 1}^{n} {l(y_{i} ,\hat{y}_{i} )}$ calculates the difference between the predicted value and the observed value of the loss function. $\sum\limits_{k = 1}^{K} {\Omega (f_{k} )}$ calculates the regularization component, which is:

$$\Omega (f_{k} ) = \gamma T + \frac{1}{2}\lambda \left\| w \right\|^{2}$$

(7)

where $\gamma$ is the leaf penalty coefficient, T is the total number of a leaf node, $\lambda$ guarantees that the scores of a leaf node are not too large, and w is the scores of a leaf node. XGB employs the gradient boosting strategy, appends one new tree at each iteration, and modifies the preceding test results by fitting the residuals of the previous prediction:

$$y_{i}^{(K)} = \sum\limits_{i = 1}^{K} {f_{k} (x_{i} )} = \hat{y}_{i}^{K - 1} + f_{K} (x_{i} )$$

(8)

Integrating Eq. (1) and (2), the objective function for the t-th tree can be written as³¹:

$$L^{(K)} = \sum\limits_{i = 1}^{n} {l(y_{i} ,\hat{y}_{i}^{(K - 1)} + f_{K} (X_{i} ))} + \Omega (f_{k} )$$

(9)

Taking the Taylor expansion of the loss function up to the second order, Eq. (9) can be approximated as follows:

$$L^{(K)} = \sum\limits_{i = 1}^{n} {[l(y_{i} ,\hat{y}_{i}^{(K - 1)} + f_{K} (X_{i} )) + \frac{1}{2}h_{i} f_{K}^{2} (X_{i} )] + \Omega (f_{k} )}$$

(10)

$$\begin{aligned} X_{{obj}} = & \sum\limits_{{i = 1}}^{n} {\left[ {g_{i} f_{K} (x_{i} ) + \frac{1}{2}h_{i} f_{K}^{2} (x_{i} )} \right] + } \Omega (f_{k} ) \\ = & \sum\limits_{{i = 1}}^{n} {\left[ {g_{i} w_{q} (x_{i} ) + \frac{1}{2}h_{i} w_{q}^{2} (x_{i} )} \right] + } \Omega (f_{k} ) + \lambda T + \frac{1}{2}\lambda \sum\limits_{{j = 1}}^{T} {w_{j}^{2} } \\ = & \sum\limits_{{j = 1}}^{n} {\left[ {\left( {\sum\nolimits_{{i \in I_{j} }} {g_{i} } } \right)w_{j} + \frac{1}{2}\left( {\sum\nolimits_{{i \in I_{j} }} {h_{i} } + \lambda } \right)w_{j}^{2} } \right] + } \lambda T \\ \end{aligned}$$

(11)

where $g_{i} = \partial \hat{y}^{K - 1} l(y_{i} ,\hat{y}^{K - 1} )$ and $h_{i} = \partial^{2} \hat{y}^{K - 1} l(y_{i} ,\hat{y}^{K - 1} )$ are the first and second-order gradient statistics of the loss function.

The optimal weight $w_{j}$ of leaf j, and the objective function of a tree can be written as follows:

$$w_{i} = - \frac{{G_{i} }}{{H_{i} + \lambda }}$$

(12)

where $G_{i} = \sum\nolimits_{{i \in I_{j} }} {g_{i} }$ and $H_{i} = \sum\nolimits_{{i \in I_{j} }} {h_{i} } + \lambda$.

$$w_{j} = - \frac{1}{2}\sum\limits_{j = 1}^{T} {\frac{{G_{j} }}{{H_{j} + \lambda }}} + \gamma T$$

(13)

the weak fitting model will be intensified as follows:

$$f_{k} (x_{i} ) = f_{k - 1} (x_{i} ) + \sum\limits_{j = 1}^{T} {w_{j} .\eta }$$

(14)

where $\eta$ is the learning rate. XGB appends new trees at each iteration by continuously dividing features. Appending a new tree to the model is learning a new function $f_{k} (X,\theta_{k} )$ to fit the residual of previous prediction. Once K trees are learned, the strong fitting model $F(x_{i} )$ used to predict:

$$F(x_{i} ) = f_{0} (x_{i} ) + \sum\limits_{k = 1}^{K} {\sum\limits_{j = 1}^{T} {w_{j} .\eta } }$$

(15)

where, F(x_i) is the strong-fitting model.

Figure 2 shows the working principle of XGB.

Since the hyper-parameters of XGB are often set empirically, optimal tuning of parameters is essential for designing robust XGB. In this paper, we used the LSHADE algorithm to tune the XGB parameters including the number of decision trees (K), learning rate ($\eta$), maximum depth (md), minimum child weight (mcw), gamma value ($\gamma$), sub-sample (ss). Table 2 lists the XGB parameters and their range used in the implementation.

Table 2 The parameters of the XGB algorithm that need to be tuned.

Full size table

LSHADE

Success-history-based parameter adaptation for differential evolution (SHADE)³⁴ is an adaptive evolutionary optimization strategy. LSHADE³⁵ enhances SHADE with a linear population size reduction technique, which gradually reduces the size of the population using a linear function. LSHADE starts its optimization process with a randomly generated population of real parameter vectors. The algorithm repeats a process of trail vector generation and selection until some termination conditions are satisfied.

LSHADE-XGB (LXGB)

The incentive mechanism of LXGB is to improve the classification performance of XGB by integrating the LSHADE optimization algorithm with XGB. Figure 3 shows the working principle of the LXGB algorithm.

Assessment metrics

RMSE, RRMSE and NSE metrics were used to evaluate the performance of LXGB approach (Eqs. 16–18).

$$RMSE = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\mathop {(\mathop Y\nolimits_{i} - \mathop X\nolimits_{i} )}\nolimits^{2} } }$$

(16)

$$NSE = 1 - \frac{{\sum\limits_{i = 1}^{n} {\mathop {(\mathop Y\nolimits_{i} - \mathop X\nolimits_{i} )}\nolimits^{2} } }}{{\sum\limits_{i = 1}^{n} {\mathop {(\mathop Y\nolimits_{i} - \overline{X} )}\nolimits^{2} } }}$$

(17)

$$RRMSE = \frac{RMSE}{{\sum\limits_{i = 1}^{n} {\mathop Y\nolimits_{i} } }}$$

(18)

RMSE: Root mean square error; NSE: Nash–Sutcliffe model efficiency coefficient; RRMSE: Relative root mean square error.

Where X_i is the predicted values, Y_i is the observed values, and $\overline{X}$ is the average of X.

Results and discussion

The C_d of PCLW1 and PCLW2 weirs was estimated using the hybrid LXGB approach. At first, all available data were normalized to remove or correct outliers³⁶.

$$X_{n} = \frac{{X - X_{\min } }}{{X_{\max } - X_{\min } }}$$

(19)

where X_min is the minimum data, X represents the raw data, X_max is the maximum data, and X_n is the normalized data.

The ratio of the weir length to the weir height (L/W), the ratio of the channel width to the weir height (B/W), the ratio of the weir thickness to weir height (t/W), the number of cycles (N), the radius to the weir height (R/W), the ratio of the straight section between the weir curves length to the weir height (S/W), the ratio of the, the ratio of the hydraulic head to the weir height (H/W), were considered as input parameters of the LXGB approach. 132 datasets, including geometric and hydraulic parameters, were selected. The data were randomly divided into two parts: 80% (106 data) for training the model and 20% (26 data) for testing it.

Seven models with different variables were examined to introduce the most influential input parameters in estimating the C_d of PCLW1 and PCLW2 weirs. Tables 3 and 4 and Figs. 4 and 5 present various input variables.

Table 3 Combinations of the variables of weir in the PCLW1 plan.

Full size table

Table 4 Combinations of the variables of weir in the PCLW2 plan.

Full size table

In Tables 5 and 6, the evaluation criteria for different input variables to estimate the C_d are presented. A part of the modeling process by the LXGB approach is presented in Fig. 6.

Table 5 Performance of LXGB on the PCLW1 plan with different combination models.

Full size table

Table 6 Performance of LXGB on the PCLW2 plan with different combination models.

Full size table

The results show the accuracy of the presented LXGB approach in estimating the C_d of PCLW1 and PCLW2 models of PCLW. Mahmoud et al.³⁷ concluded that the ANFIS-PSO and MLP-FA (multi-layer perceptron and firefly optimization algorithm) methods are the most accurate in estimating the C_d of triangular labyrinth weirs, respectively. In a similar study, Majediasl and Fuladipanah³⁸ concluded that the SVM model produces the most exact results in predicting the C_d of labyrinth weir with RMSE = 0.0118. Shafiei et al.²¹ reported that the ANFIS-FFA model is quite accurate in estimating the C_d of the labyrinth weir. Karami et al.¹⁰ showed that the ELM method with RMSE = 0.006 has acceptable efficiency in estimating the C_d of the labyrinth weir. In a similar study, the effectiveness of the least-squares support vector machine-bat algorithm (LSSVM-BA) method was used to investigate the discharge of a curved labyrinth weir³⁹. The results of the studies showed that the SVM-based model gave accurate results in estimating the C_d of the arched labyrinth weir with values of RMSE = 0.013 and R² = 0.970⁴⁰. Multi-layer perceptron neural network (MLPNN) managed to estimate the discharge over the triangular arced labyrinth weirs of RMSE = 0.00385 and R² = 0.999⁴¹.

The results of the estimated and observed C_d of the PCLW1 and PCLW2 models of pseudo-cosine labyrinth weirs were compared in Figs. 7 and 8. According to the results, the K₆ model with the input variables of (R/W), (L/W), and (H/W), had the optimal values of statistical indicators. The C_d of PCLW1 and PCLW2 weirs increases with the increase of the weir height. In a similar study, it was concluded that with the increase in the weir height, the C_d of the triangular duckbill labyrinth weir increases, which is in agreement with the results of the present study⁷. The increase in the effective length of the labyrinths at a specified width, due to the radius increases of PCLW1 and PCLW2 weirs causes an increase in the Cd. The studies showed that increasing the radius causes a reduction in eddy flows, turbulence, and a sudden increase in water height during the weir^{39, 40, 42}. The results of the investigations showed that with the increase of R/W, the C_d increases in the arched labyrinth weir, which is consistent with the results of the present study⁴¹. Also, the K₂ model (H/W, L/W, R/W, N) is in the second rank, which shows that length, weir height, radius, and the number of cycles have a more significant impact on C_d of PCLW1 and PCLW2 weirs. By increasing the number of labyrinth weir cycles, discharge and Cd increase, which is consistent with the results of the present study^{40, 43}. Figure 9 shows the importance of the influential input parameters in estimating the C_d of PCLW.

Emami et al.⁴⁴ predicted the C_d of a curved plan labyrinth weirs using the WOA-ANFIS method, and the input parameters H/W and θ (weir arc angle) were introduced as the most effective parameters in estimating the C_d. Majediasal and Fuladipanah³⁸, investigated the support vector machine (SVM) method for C_d of sharp-crested triangular labyrinth weirs and concluded that the input combination, including geometric parameters (θ, h/w, L/B), has the best results. Mohammadi et al.⁴⁵ reported that the parameters H_t/P, W/P (the ratio of the weir width to the height), R/W, W/LC (the ratio of the weir width to the effective length) as input variables have the most accuracy and efficiency in estimating the C_d of U-shaped labyrinth weirs. Haghiabi et al.⁴⁶ indicated the C_d of triangular labyrinth weirs using the ANFIS system and concluded that the ANFIS has a proper implementation in C_d estimation. Studies showed that the H/W parameter is the most influential parameter on the C_d of a labyrinth and arced labyrinth weirs⁴⁷.

Table 7 compares the performances of the XGB and LXGB on the test dataset. The results show the superiority of the LXGB compared with the XGB algorithm in terms of performance measures. This issue proves that combining the LSHADE with XGB improves the estimation performance.

Table 7 Performance evaluation of the XGB and LXGB algorithms on test dataset.

Full size table

In Table 8, the values of the evaluation criteria for estimating the C_d of labyrinth weirs with different plans have been compared with the results of other studies. The results for LXGB are generated with the PCLW1 plan. The comparisons show the appropriate accuracy of the LXGB approach in estimating the C_d of labyrinth weirs with R² = 0.97 and RMSE = 0.014.

Table 8 Performance evaluation of the LXGB approach and similar methods.

Full size table

Conclusion

This study introduces a novel design for labyrinth weirs called pseudo-cosine labyrinth weirs (PCLW). The LXGB was used to estimate the C_d of the PCLW weir. Seven models with different combinations of appropriate input parameters were introduced. A proper model was defined by analyzing the estimation results. The superior model estimates C_d by considering input parameters H/W, R/W, and L/W. LXGB was achieved in estimating the C_d of PCLW overflows by obtaining values of R² = 0.971, RMSE = 0.014, and NSE = 0.97. The results demonstrated that the proposed LXGB algorithm generated more significant results than previous studies in estimating the C_d of labyrinth weirs. Such a cost-effective prediction model may have significant practical application, as it can be an economical alternative to the expensive laboratory solution, which is costly and time-consuming. The proposed model is useful to correct the design of water transfer systems.

Data availability

The datasets generated and/or analyzed during the current study are not publicly available but are available from the corresponding author on reasonable request.

References

El Bedawy, R. Water resources management: Alarming crisis for Egypt. J. Mgmt. Sustain. 4, 108 (2014).
Article Google Scholar
Zomorodian, M. et al. The state-of-the-art system dynamics application in integrated water resources modeling. J. Environ. Manage. 227, 294–304 (2018).
Article PubMed Google Scholar
Verma, S., Verma, M. K., Prasad, A. D., Mehta, D. J., & Islam, M. N. Modeling of uncertainty in the estimation of hydrograph components in conjunction with the SUFI-2 optimization algorithm by using multiple objective functions. Model. Earth Syst. Environ. 1–19 (2023).
Singh, D., & Kumar, M. Hydraulic design and analysis of piano key weirs: A review. Arab. J. Sci. Eng.. 1–15 (2021).
Achour, B., Amara, L. & Mehta, D. New theoretical considerations on the gradually varied flow in a triangular channel. LARHYSS J. 50, 7–29 (2022).
Google Scholar
Miller, D. S. (ed.) Discharge characteristics: IAHR hydraulic structures design manuals 8th edn. (Routledge, 2017).
Google Scholar
Emami, S., Arvanaghi, H. & Parsa, J. Numerical investigation of geometric parameters effect of the labyrinth weir on the discharge coefficient. J. Rehabil. Civil Eng. 6(1), 1–9 (2018).
Google Scholar
Emami, S., Parsa, J., Emami, H. & Abbaspour, A. An ISaDE algorithm combined with support vector regression for estimating discharge coefficient of W-planform weirs. Water Supply. 21(7), 3459–3476 (2021).
Article Google Scholar
Roushangar, K., Alami, M. T., Majedi Asl, M. & Shiri, J. Modeling discharge coefficient of normal and inverted orientation labyrinth weirs using machine learning techniques. ISH J. Hydraul. Eng. 23(3), 331–340 (2017).
Article Google Scholar
Karami, H., Karimi, S., Bonakdari, H. & Shamshirband, S. Predicting discharge coefficient of triangular labyrinth weir using extreme learning machine, artificial neural network and genetic programming. Neural Comput. Appl. 29, 983–989 (2018).
Article Google Scholar
Bilhan, O., Emiroglu, M. E. & Kisi, O. Use of artificial neural networks for prediction of discharge coefficient of triangular labyrinth side weir in curved channels. Adv. Eng. Softw. 42(4), 208–214 (2011).
Article MATH Google Scholar
Parsaie, A., & Haghiabi, A. H. Prediction of side weir discharge coefficient by genetic programming technique. Jordan J. Civil Eng. 11(1), (2017).
Emiroglu, M. E., Bilhan, O. & Kisi, O. Neural networks for estimation of discharge capacity of triangular labyrinth side-weir located on a straight channel. Expert Syst. Appl. 38(1), 867–874 (2011).
Article MATH Google Scholar
Honar, T., Tarazkar, M. H. & Tarazkar, M. R. Estimating of side weir discharge coefficient by using Neuro-Fuzzy (ANFIS). J. Water Soil Conserv. 17(2), 169–176 (2011).
Google Scholar
Aghdarimoghaddam, A. & Nodoshan, J. Geometry optimization of triangle labyrinth spillway using anfis models and genetic algorithms. J. Model. Eng. 5(19), 57–68 (2009).
Google Scholar
Azamathulla, H. M. D. & Wu, F. Support vector machine approach for longitudinal dispersion coefficients in natural streams. Appl. Soft Comput. 11(2), 2902–2905 (2011).
Article Google Scholar
Azamathulla, H. M., Haghiabi, A. H. & Parsaie, A. Prediction of side weir discharge coefficient by support vector machine technique. Water Sci. Technol. Water Supply. 16(4), 1002–1016 (2016).
Article Google Scholar
Bilhan, O., Aydin, M. C., Emiroglu, M. E. & Miller, C. J. Experimental and CFD analysis of circular labyrinth weirs. J. Irrig. Drain. Eng. 144(6), 04018007 (2018).
Article Google Scholar
Safarrazavizadeh, M., Esmaeilivaraki, M. & Biabani, R. Experimental study on flow over sinusoidal and semicircular labyrinth weirs. ISH J. Hydraul. Eng. 27(1), 304–313 (2021).
Article Google Scholar
Bonakdari, H., Ebtehaj, I., Gharabaghi, B., Sharifi, A., & Mosavi, A. Prediction of discharge capacity of labyrinth weir with gene expression programming. In Intelligent Systems and Applications: Proceedings of the 2020 Intelligent Systems Conference (IntelliSys). 1, 202–217 (2021).
Shafiei, S., Najarchi, M. & Shabanlou, S. A novel approach using CFD and neuro-fuzzy-firefly algorithm in predicting labyrinth weir discharge coefficient. J. Braz. Soc. Mech. Sci. Eng. 42, 1–19 (2020).
Article Google Scholar
Norouzi, P., Rajabi, A. & Shabanlou, S. Estimation of labyrinth weir discharge coefficient using self-adaptive extreme learning machine. J. Water Soil Sci. 32(1), 39–52 (2022).
Google Scholar
Wang, F., Zheng, S., Ren, Y., Liu, W. & Wu, C. Application of hybrid neural network in discharge coefficient prediction of triangular labyrinth weir. Flow Meas. Instrum. 83, 102108 (2022).
Article Google Scholar
Chen, W. et al. Accurate discharge coefficient prediction of streamlined weirs by coupling linear regression and deep convolutional gated recurrent unit. Eng. Appl. Comput. Fluid Mech. 16(1), 965–976 (2022).
Google Scholar
Ahmad, F., Hussain, A. & Ansari, M. A. Development of ANN model for the prediction of discharge coefficient of an arced labyrinth side weir. Model. Earth Syst. Environ. 9, 1–8 (2022).
Google Scholar
Emami, H., Emami, S. & Parsa, J. A Walnut optimization algorithm applied to discharge coefficient prediction on labyrinth weirs. Soft. Comput. 26(22), 12197–12215 (2022).
Article Google Scholar
Safari, S. et al. Evaluation of ANN, GEP, and regression models to estimate the discharge coefficient for the rectangular broad-crested weir. Polish J. Environ. Stud. 31(5), 4817 (2022).
Article Google Scholar
Mohammadi, M. & Yasi, M. Investigation of labyrinth weir with arc plan. J. Agric. Sci. Technol. 11(41), 1–12 (2007).
Google Scholar
Henderson, F. M. Open channel flow (Macmillan Publishing, 1996).
Google Scholar
Novak, P., Guinot, V., Jeffrey, A. & Reeve, D.E. Hydraulic modelling- an introduction. Spon Press, an Imprint of Taylor & Francis, London and New York. 599 (2010).
Guo, R. et al. Degradation state recognition of piston pump based on ICEEMDAN and XGBoost. Appl. Sci. 10(18), 6593 (2020).
Article CAS Google Scholar
Thongsuwan, S., Jaiyen, S., Padcharoen, A. & Agarwal, P. ConvXGB: A new deep learning model for classification problems based on CNN and XGBoost. Nucl. Eng. Technol. 53(2), 522–531 (2021).
Article CAS Google Scholar
Gu, Y., Zhang, D. & Bao, Z. A new data-driven predictor, PSO-XGBoost, used for permeability of tight sandstone reservoirs: A case study of member of chang 4+ 5, western Jiyuan Oilfield, Ordos Basin. J. Petrol. Sci. Eng. 199, 108350 (2021).
Article CAS Google Scholar
Tanabe, R., & Fukunaga, A. Success-history based parameter adaptation for differential evolution. IEEE congress on evolutionary computation. 71–78 (2013).
Tanabe, R., & Fukunaga, A. S. Improving the search performance of SHADE using linear population size reduction. IEEE congress on evolutionary computation (CEC). 1658–1665 2014.
Larose, D. T. & Larose, C. D. Discovering knowledge in data: an introduction to data mining (Wiley, 2014).
MATH Google Scholar
Mahmoud, A., Yuan, X., Kheimi, M. & Yuan, Y. Interpolation accuracy of hybrid soft computing techniques in estimating discharge capacity of triangular labyrinth weir. IEEE Access 9, 6769–6785 (2021).
Article Google Scholar
Majediasl, M. & Fuladipanah, M. Application of the evolutionary methods in determining the discharge coefficient of triangular labyrinth weirs. JWSS-Isfahan Univ. Technol. 22(4), 279–290 (2019).
Google Scholar
Hu, Z. et al. Using soft computing and machine learning algorithms to predict the discharge coefficient of curved labyrinth overflows. Eng. Appl. Comput. Fluid Mech. 15(1), 1002–1015 (2021).
Google Scholar
Roushangar, K., Alami, M. T., Shiri, J. & Asl, M. M. Determining discharge coefficient of labyrinth and arced labyrinth weirs using support vector machine. Hydrol. Res. 49(3), 924–938 (2018).
Article Google Scholar
Zounemat-Kermani, M., Kermani, S. G., Kiyaninejad, M. & Kisi, O. Evaluating the application of data-driven intelligent methods to estimate discharge over triangular arced labyrinth weir. Flow Meas. Instrum. 68, 101573 (2019).
Article Google Scholar
Achour, B., Amara, L. & Mehta, D. Control of the hydraulic jump by a thin-crested sill in a rectangular channel new experimental consideration. LARHYSS J. 50, 31–48 (2022).
Google Scholar
Zadghorban, M., Masoudian, M., Esmaeilivaraki, M. & Gharagezlou, M. Investigation of cyliderical weir roughness on hydraulic characteristics. Iran. J. Watershed Manag. Sci. Eng. 12(40), 69–80 (2018).
Google Scholar
Emami, S., Parsa, J. & Emami, H. Estimation of discharge coefficient of curved plan-form labyrinth weirs using a Hybrid WOA-ANFIS method. Iran. J. Irrigat. Drainage 14(5), 1664–1676 (2020).
Google Scholar
Mohammadi, F., Hassanzadeh, Y. & Roushangar, K. Determining the discharge coefficient of one-cycle sharp-crested u-shape weirs using kernel-based SVM approach. Iran. J. Irrigat. Drainage 14(5), 1722–1736 (2020).
Google Scholar
Haghiabi, A. H., Parsaie, A. & Ememgholizadeh, S. Prediction of discharge coefficient of triangular labyrinth weirs using adaptive neuro fuzzy inference system. Alex. Eng. J. 57(3), 1773–1782 (2018).
Article Google Scholar
Heydari, M., Dosti, M., & Safari, H. Optimizing the flow coefficient of trapezoidal zigzag overflows using the intelligent algorithm of gradual cooling. 10^th International Seminar on River Engineering, Shahid Chamran University, Ahvaz, Iran (2014).

Download references

Author information

Authors and Affiliations

Department of Water Engineering, University of Tabriz, Tabriz, 5971982284, Iran
Somayeh Emami
Department of Computer Engineering, University of Bonab, Bonab, Iran
Hojjat Emami
Department of Water Engineering, University of Tabriz, Tabriz, Iran
Javad Parsa

Authors

Somayeh Emami
View author publications
You can also search for this author in PubMed Google Scholar
Hojjat Emami
View author publications
You can also search for this author in PubMed Google Scholar
Javad Parsa
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.E., H.E., and J.P. designed the study, analyzed and discussed the data and results. S.E. and H.E. performed the experiments and simulations. S.E. and H.E. prepared the hybrid proposed model/materials/equipment's/irrigation system. All authors reviewed the manuscript.

Corresponding author

Correspondence to Somayeh Emami.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Emami, S., Emami, H. & Parsa, J. LXGB: a machine learning algorithm for estimating the discharge coefficient of pseudo-cosine labyrinth weir. Sci Rep 13, 12304 (2023). https://doi.org/10.1038/s41598-023-39272-6

Download citation

Received: 16 May 2023
Accepted: 22 July 2023
Published: 29 July 2023
DOI: https://doi.org/10.1038/s41598-023-39272-6

This article is cited by

Enhanced renewable power and load forecasting using RF-XGBoost stacked ensemble
- Rita Banik
- Ankur Biswas
Electrical Engineering (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.