Introduction

Precisely estimating hydrological parameters in the ungauged basin has drawn the attention of hydrologists and water resources engineering1. In meteorology, the assessment of runoff is extremely important2, especially in areas where there is no measuring station that cannot be calibrated. Therefore, the regionalization method is optional for transferring model parameters from the gauged basin to the ungauged basin3,4. The popular regionalization methods are physical similarity, spatial proximity, and regression5. Previously, many studies have been conducted and compared the performance of the regionalization methods to predict total streamflow or direct runoff6 in the ungauged catchment with various hydrological models (WASMOD7, VIC8, SWAT9, GR4J10, HMETS10, MOHYSE10, and HEC-HMS11) for different regions. Some of them showed that distance-based (spatial proximity, physical similarity) outperformed regression methods7,8. The combined watershed classification of inverse distance weighted (IDW) and physical similarity methods was investigated to predict streamflow in the ungauged catchment by Kanishka and Eldho9. Swain and Patra3 pointed that the spatial proximity between the gauged catchment and the ungauged catchment gave better results than the physical similarity for predicting the continuous streamflow. Arsenault et al.10 studied the efficacy of three regionalization methods: multiple linear regression (MLR), spatial proximity, and physical similarity, to predict current flow in ungauged catchments of Mexico. They showed that transferring a set of parameters from a nearby reservoir is the most efficient method for estimating the runoff in ungauged basins. Tegegne and Kim12 proposed a catchment runoff-response similarity (CRRS) method to identify crucial properties for supporting the hydrological similarity. This approach was conducted with South Korea's Geum River Basin (GRB) and Ethiopia's Lake Tana Basin (LTB). The results showed that CRRS performed better than the others. Koçyiğit et al.13 found that when the geometric dimensions of the sub-basin changed, the hydrological parameters of those sub-basins also changed.

Recently, machine learning has been popular and widely applied in hydrology and water resources engineering. Hussain and Khan14 indicated that random forest (RF) was more effective than multilayer perceptron (MLP) and support vector regression (SVR) for monthly flow forecasting in Hunza River, Pakistan. Schoppa et al.15 showed that random forest could simulate both small and medium floods equivalent to the HYDROMAD model. Wang and Wang16 found that multiple linear regression (MLR) and M5P model tree (M5P) gave more precisely in predicting the daily water level in Lake Erie than Gaussian process (GP), multilayer perceptron (MLP), random forest (RF), and k-nearest neighbor (KNN). For predicting the flow in ungauged basins, Araza et al.17 applied the regionalizing RF models in the 21 mountainous regions of Luzon, Philippines. However, very few researches have been conducted for applying machine learning for estimating hydrological model parameters. Recently, Saadi et al.18 explored RF algorithms' ability to express the relationship between hourly hydrological model (GR4H) parameters and climate/landscape catchment in the 870 catchments in the United States and 1355 catchments in France.

The GR2M model (Rural Genius model) has lately been used to simulate a watershed's hydrological features due to climate variability on hydrologic regimes19 and evaluate the effects of climate change on runoff20. It was utilized to screen hydrologic data to study hydrological response across the Lower Mekong Basin21. Boulariah et al.22 found that the GR2M model gave better performance than the ABCD. The runoff simulation under various climate conditions in the Wimmera catchment was studied using four monthly rainfall-runoff models: abcd, Budyko, GR2M, and WASMOD by Topalović et al.23. The effects of climate scenarios on monthly river runoff in the Cheliff, Tafna, and Macta in North-West24 were conducted by the GR2M model. Rintis and Setyoasri25 found that the GR2M model gave a comparable performance to Mock and NRECA methods. The GR2M model and Artificial Neural Network were utilized to reconstruct monthly river flow for Irish catchments26. The regionalized GR2M model’s parameters were developed to predict monthly runoff in the ungauged basins for northern Algeria27.

Thailand's southern region, located in the tropical climate, has been experiencing water-related disasters (e.g., flooding and drought). The comprehension of spatiotemporal hydrological characteristics, especially runoff, can reduce or alleviate extensive damage to human lives and properties and sustain economic growth28,29,30,31. As mentioned earlier, accuracy in estimating monthly runoff variation in the ungauged basin is vital for water resources planning and management. The regionalization approach has been widely accepted for this purpose. However, in Thailand, especially in the southern region, there was no research work using machine learning to estimate the regionalized GR2M parameters. To fulfill this gap in the literatures, our research aimed to investigate and compare the performance of two main regionalization approaches, i.e., regression-based and distance-based methods, to determine the GR2M model parameters for monthly runoff estimation at an ungauged basin in southern Thailand. It is the first attempt to discover the most practical approach for obtaining the regionalized GR2M parameters under the data scarcity context in the south region, Thailand. The following is an outline of how this article is structured: (1) the rationale for conducting this research work and its related literature review; (2) the study area explanation; (3) The GR2M description; (4) the framework and its detailed information of this research methodology; (5) our results, findings, and discussion; (6) this research conclusion and its contributions.

Study area

Our research work focused on three of five major river basins in the southern basin of Thailand (see Fig. 1): the Peninsula-East Coast (26,024 km2), Peninsula-West Coast (18,841 km2), and Thale Sap Songkhla (8484 km2) due to the available hydrological information. This peninsula area is located in between the Andaman Sea and the South China Sea. In the northern and central regions, there is a long western mountain range, and in the midst of the ridge's southern region is the Nakhon Si Thammarat ridge. The monsoon winds from the northeast and southwest are primarily responsible for its climatological characteristics. A minor coastal plain exists in the Peninsula-East Coast watershed, with short rivers of fewer than 150 km draining into the Gulf of Thailand. This watershed had nine runoff stations used for data analysis. The Peninsula-West Coast Watershed features short rivers that run into the Andaman Sea in the west and southwest. The runoff information was collected and analyzed from nineteen runoff stations. Thale Sap Songkhla watershed mainly locates in Songkhla, Phatthalung, and the lower part of Nakhon Si Thammarat. There were nine runoff stations available in the Thale Sap Songkhla watershed.

Figure 1
figure 1

The station locations of rainfall, runoff, and weather stations of the study area. (This map was created by QGIS version 3.12, which can be accessed on https://qgis.org/en/site/).

GR2M model

The GR2M is a conceptual monthly rainfall-runoff mathematical model developed by Demagref in the late 1980s. Later on, several versions have been continuously being improved its efficiency by Kabouya32, Makhlouf33, Mouelhi34 until Mouelhi et al.35. This study used the version of 2006b. Literature reviews showed its performance, applicability, and simplicity compared to other models36 due to requiring two parameters: the ability to keep moisture in the soil (X1) and the water exchange coefficient (X2). Monthly rainfall, runoff, and evapotranspiration are the only three meteorological and hydrological data necessary35,37. Total runoff hydrograph, soil moisture content, groundwater flow, etc., are the model results. The water balance concept with two reservoirs was utilized for the GR2M model, as presented in Fig. 2. In the upper reservoir, the basin’s soil moisture (S) depends on production store: X1 (mm). And the lower reservoir is river flow (R), which is regulated by the exchange coefficient water: X2. and a maximum capacity of 60 mm. Starting with precipitation penetrated the soil, soil moisture is at the level: S1 (mm). When the soil has reached saturation, the rainfall excess occurs P1 (mm). Some soil moisture can decrease during that process due to evapotranspiration E, resulting in the soil moisture remains at level: S2 (mm). Some soil water infiltrates into the soil as subsurface water: P2 (mm) and conglomerate with rainfall excess to be surface runoff: P3 (mm). The surface runoff flows into the river combining with the rest water from the previous month: R (mm). The river runoff can change depending on the direction of water flowing into or out from the basin. Finally, the total runoff hydrograph is obtained.

Figure 2
figure 2

Source: Adapted from Bachir et al.38, Rwasoka et al.39.

The GR2M model working process.

Research methodology

In this study, the research methodology (Fig. 3) consisted of four main steps, i.e., (1) the GR2M model’s calibration and verification; (2) analysis of basin’s hydrological data and physical attributes; (3) regionalization methods for estimating the GR2M model’s parameters and their performance comparison; and (4) performance evaluation of the regional GR2M model parameters in estimating monthly runoff. The detailed information for each step can be explained as the following.

Figure 3
figure 3

Framework for research methodology.

The GR2M model’s calibration and verification

The model’s calibration and verification were conducted to make the GR2M model reliable for estimating monthly runoff for 37 different runoff stations in the Southern Basins, Thailand. Before calibrating and verifying the GR2M model, it requires a warm-up period to determine the suitable initial values of X1 and X2 so that the model can imitate the existing hydrological characteristics of the considering basin. For doing this, the initial R value raining from 10 to 60 mm was sought. And the appropriate warm-up periods of 4 to 7 months were discovered, depending on the runoff station characteristics. In this study, the fitted values of X1 and X2 parameters for each runoff station were automatically determined with Microsoft Excel Solver's help by setting root mean square error (RMSE) as an objective function and the constraints of X1 and X2 parameters. The available monthly rainfall, evapotranspiration, and runoff data for each runoff station ranged from 41 to 80 months, resulting in the calibration and verification periods were 22 to 48 months and 10 to 39 months, respectively.

Analysis of basin’s hydrological data, and physical attributes

We collected the monthly runoff (37 stations), rainfall (38 stations) and air temperature (13 stations) information from the Royal Irrigation Department (RID) and the Thai Meteorological Department (TMD). Figure 2 depicts the locations of rainfall, runoff, and weather. Areal rainfall and air temperature information for each runoff gauged station was analyzed by using Thiessen polygon. Table 1 shows the summary statistical values of hydrological data and physical characteristics of runoff gauged station used in this analysis. We used Thornthwaite40 equation to calculate monthly evapotranspiration. Figure 4 shows the physical characteristics information of the 37 runoff gauged stations, including basin area (A), river length (L), and river length from the basin’s centroid to the basin outlet (Lc) of runoff gauged station, were determined with the help of QGIS, a free and open-source geographic information system software. We examined the time matching to choose the appropriate times for calibrating and verifying the model. Hence, all periods for running the GR2M model were in the range of 41 to 80 months. And its calibration and verification periods were in the range of 22 to 49 months and 10 to 39 months, respectively.

Table 1 Summary statistical values of hydrological data and physical characteristics of runoff gauged station used in this analysis.
Figure 4
figure 4

Physical characteristics information of the 37 runoff gauged stations.

Regionalization methods for estimating the GR2M model’s parameters

Estimating the GR2M model’s parameters in the ungauged basin was analyzed by using the regionalization concept. It is a method that transfers model parameters from donor catchments to the target station or ungauged catchments41. In this study, two regionalization methods, i.e., regression-based methods, and distance-based methods, were investigated and compared their performance.

Regression-based methods

Regression analysis was used to determine the relationship between the fitted GR2M model parameters and three basin's physical characteristics (A, L, and Lc), and thirteen hydrological data, including monthly average areal rainfall for 12 months, and annual average areal rainfall. Each fitted GR2M model parameter (i.e., X1 and X2) was a dependent variable, while the basin's physical characteristics and hydrological data were independent variables. We conducted three scenarios for selecting the most suitable group of independent variables, that is, (1) using only the basin’s physical characteristics, (2) using only hydrological data, and (3) combining those mentioned variables in the both scenarios 1 and 2. Three regression-based methods were selected herein, Multiple Linear Regression (MLR), Random forest (RF), and M5 Model Tree (M5). The last two methods are based on a data-driven model.

Multiple linear regression analysis (MLR)

$${\text{y}} = {\text{a}}_{1} {\text{x}}_{1} + {\text{a}}_{2} {\text{x}}_{2} + \cdots + {\text{a}}_{{{\text{n}} - 1}} {\text{x}}_{{{\text{n}} - 1}} + {\text{a}}_{{\text{n}}} {\text{x}}_{{\text{n}}} + {\text{b}}$$
(1)

where y is the dependent variable, xi is the independent variable; ai is regression coefficient, b is constant of regression equation, and n is number of the independent variable. We utilized regression function in Microsoft excel to develop regionalized GR2M model parameter equations.

Random forest (RF)

Random forest (RF) popular modification of decision trees and one of the ensemble techniques, was first introduced by Breiman in early 200142. It can use for data classification and regression. The advantage of RF is that it can find a series of complex relationships between predictors and responses without any relationships between them by including decision trees42.RF creates several trees based on decision trees method, where every tree is produced by arbitrarily selecting training data set, called bagging process, and attributes (or features) from the input vector. By the voting method from the predictive output of every tree created, the model prediction is finally obtained. In regression, the tree predictor proceeds on numerical values as arbitrary to class labels used by the random forest classifier43. The most frequently used variable selection measures in tree induction are the Information Gain Ratio criterion44 and Gini index45. Unlike the M5 model tree, full-grown RF trees are not pruned. One of the key advantages of random forest regression over the M5 model tree is that it is more flexible. The speculation error always converges as the number of trees grows, even if the tree isn’t pruned, and overfitting isn’t a concern because of the Strong Law of Large Numbers43. We used WEKA, free and open-source software, and all default RF parameters as recommended by WEKA in our study.

M5 model tree (M5)

Quinlan 199244 irst developed the M5 model tree by employing a divide-and-conquer strategy to establish the relationship between independent and dependent variables. It can be applied to both qualitative (categorical) and quantitative variables. Building M5 involves three stages. The first stage involves the development of a decision tree by dividing the data set into subsets (or leaves). Second, to prevent an overfitted structure or weak generalizer, the overgrown tree is pruned, and linear regression functions are used to replace the pruned sub-trees. the overgrown tree is pruned and the pruned sub-trees are replaced by linear regression functions. The pruning method requires the merger of some of the lower sub-trees into one node. Finally, the smoothing process is used to compensate for the strong discontinuities that would undoubtedly exist between neighboring linear models on the trimmed trees' leaves, especially for some models with a small number of training samples. For regression-based methods, the suitable group of independent variables for determining two GR2M parameters was investigated. Thus, there were three scenarios, that is, (1) scenario-1: using only the basin’s physical characteristics, (2) scenario-2: using only hydrological information, and (3) scenario-3: combining those mentioned variables in scenarios 1 and 2.

Distance-based methods

Distance-based methods are a method for determination hydrological model parameters in the ungauged basin by transferring their values from donor catchments to the target station or ungauged catchments. Two approaches are popular recommended: Inverse Distance Weighted (IDW), and Inverse Similarity Weighted (ISW). The IDW value depends on the proximity of the distance, whereas ISW value depends on the similarity of the physical characteristics7. By applying IDW and ISW concepts, Spatial Proximity Approach25 and Physical Similarity Approach (PSA) were utilized respectively herein and can be concisely explained as follows:

Spatial proximity approach (SPA)

SPA is the method to select donor stations with a proximity distance to a target station5. The distance between a gauged station (or donor station) and ungauged stations (or a target station) can be determined by:

$${\text{D}}_{{{\text{ug}}}} = \sqrt {\left( {{\text{x}}_{{\text{u}}} - {\text{x}}_{{\text{g}}} } \right)^{2} + \left( {{\text{y}}_{{\text{u}}} - {\text{y}}_{{\text{g}}} } \right)^{2} }$$
(2)

where \({\text{x}}_{{\text{g}}} ,\;{\text{x}}_{{\text{u}}}\) are the latitude (UTM), \({\text{y}}_{{\text{g}}} ,\;{\text{y}}_{{\text{u}}}\) are the longitude (UTM); which g is donor station, and u is the target station, and \({\text{D}}_{{{\text{ug}}}}\) is the distance between g and u stations.

The inverse distance weighted can be calculated as:

$${\text{W}}_{{{\text{g}}\_{\text{i}}}} = \frac{{\left( {1/{\text{D}}_{{{\text{ug}}\_{\text{i}}}} } \right)}}{{\sum_{{{\text{i}} = 1}}^{{\text{n}}} \left( {1/{\text{D}}_{{{\text{ug}}\_{\text{i}}}} } \right)}}$$
(3)

\({\text{W}}_{{{\text{g}}\_{\text{i}}}}\) is the inverse distance weighted, and n is the total number of donor stations.

A parameter of the target station can be obtained by:

$${\text{P}}_{{{\text{ug}}}} = \mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{n}}} {\text{W}}_{{{\text{g}}\_{\text{i}}}} {\text{p}}_{{{\text{g}}\_{\text{i}}}}$$
(4)

where Pug is the parameter of target station, and pg_i is the parameter of donor station.

Physical similarity approach (PSA)

PSA is the method based on the concept that catchments with similar physical characteristics would have similar hydrological behavior46.

$${\text{SI}}_{{{\text{ug}}}} = \sum_{{{\text{i}} = 1}}^{{\text{k}}} \frac{{\left| {{\text{CD}}_{{{\text{g}},{\text{i}}}} - {\text{CD}}_{{{\text{u}},{\text{i}}}} } \right|}}{{\Delta {\text{CD}}_{{{\text{gi}}}} }}$$
(5)

where \({\text{SI}}_{{{\text{ug}}}}\) is the similarity index, \({\text{CD}}_{{{\text{g}},{\text{i}}}} ,{\text{CD}}_{{{\text{u}},{\text{i}}}}\) are the catchment descriptor of donor catchments to the target station; \(\Delta {\text{CD}}_{{{\text{gi}}}}\) is the rage of ith catchment descriptor, k is the total number of catchment descriptor.

$${\text{W}}_{{{\text{g}}\_{\text{i}}}} = \frac{{\left( {1/{\text{SI}}_{{{\text{ug}}\_{\text{i}}}} } \right)}}{{\sum_{{{\text{i}} = 1}}^{{\text{n}}} \left( {1/{\text{SI}}_{{{\text{ug}}\_{\text{i}}}} } \right)}}$$
(6)

\({\text{W}}_{{{\text{g}}\_{\text{i}}}}\) (ISW) is the inverse similarity weighted, and n is the total number of donor stations.

$${\text{P}}_{{{\text{ug}}}} = \mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{n}}} {\text{W}}_{{{\text{g}}\_{\text{i}}}} {\text{p}}_{{{\text{g}}\_{\text{i}}}}$$
(7)

where Pug is the parameter of target station, and pg_i is the parameter of donor station.

Model performance in estimating the regional GR2M model parameters was compared using four statistical indices, including Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Pearson Correlation Coefficient (r), and Combined Accuracy (CA)47.

Evaluation of the regional GR2M model parameters applied in the GR2M model

Three performance criteria, including Nash–Sutcliffe Efficiency (NSE), Correlation Coefficient (r), and Overall Index (OI), and A Taylor diagram were used for evaluating the applicability of the GR2M Model. The details for each performance criteria can be delineated as the following:

Nash–Sutcliffe Efficiency (NSE) is a prominent index for determining model correctness or model performance, as shown in the following equation:

$${\text{NSE}} = 1 - \frac{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{n}}} \left( {{\text{Q}}_{{{\text{cal}}}} - {\text{Q}}_{{{\text{obs}}}} } \right)^{2} }}{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{n}}} \left( {{\text{Q}}_{{{\text{obs}}}} - \overline{{\text{Q}}}_{{{\text{obs}}}} } \right)^{2} }}$$
(8)

The NSE ranges from −  α to 1. If the NSE is near to 1, the observed and calculated runoff are likely to be identical, or it is considered the most efficient or accurate48.

The correlation coefficient (r) shows agreement between two variables. The following equation can be used to compute the correlation coefficient between X and Y.

$${\text{r}} = \frac{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{n}}} \left( {{\text{Q}}_{{{\text{obs}}}} - \overline{{\text{Q}}}_{{{\text{obs}}}} } \right)\left( {{\text{Q}}_{{{\text{cal}}}} - \overline{{\text{Q}}}_{{{\text{cal}}}} } \right)}}{{\sqrt {\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{n}}} \left( {{\text{Q}}_{{{\text{obs}}}} - \overline{{\text{Q}}}_{{{\text{obs}}}} } \right)^{2} } \cdot \sqrt {\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{n}}} \left( {{\text{Q}}_{{{\text{cal}}}} - \overline{{\text{Q}}}_{{{\text{cal}}}} } \right)^{2} } }}$$
(9)

The r-value ranges from − 1 to 1. The plus sign (+) indicates the direct relation between observed and predicted values or vice versa49.

The overall index (OI) is a model performance criterion that gives the value between −  to 1. The model’s performance is prominent if OI approaches to 150.

$${\text{OI}} = \frac{1}{2}\left[ {2 - \frac{{{\text{RMSE}}}}{{{\text{Q}}_{{{\text{obs}},{\text{max}}}} - {\text{Q}}_{{{\text{obs}},{\text{min}}}} }} - \frac{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{n}}} \left( {{\text{Q}}_{{{\text{obs}}}} - {\text{Q}}_{{{\text{cal}}}} } \right)^{2} }}{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{{\text{n}}} \left( {{\text{Q}}_{{{\text{obs}}}} - \overline{{\text{Q}}}_{{{\text{obs}}}} } \right)^{2} }}} \right]$$
(10)

where \({\text{Q}}_{{{\text{obs}}}}\) is the observed runoff, \({\text{Q}}_{{{\text{cal}}}}\) is the calculated runoff, \({\overline{\text{Q}}}_{{{\text{obs}}}}\) is the average observed runoff, \({\overline{\text{Q}}}_{{{\text{cal}}}}\) is the average calculated runoff, \({\text{Q}}_{{{\text{obs}},{\text{max}}}}\) is the maximum observed runoff, \({\text{Q}}_{{{\text{obs}},{\text{min}}}}\) is the minimum observed runoff, and n is the number of runoff data.

A Taylor diagram was used herein to comparatively elaborate and evaluate the efficacy among the developed models. This diagram can simultaneously show three statistic parameters, i.e., correlation, root mean square error, and standard deviation.

Results and discussion

This section presents our finding as follows: (1) calibrating and validating the GR2M Model and its fitted values, (2) the suitable group of independent variables for determining two GR2M parameters using regression-based methods, (3) model performance comparison in estimating the regional GR2M model parameters, and (4) evaluation of the regionalized parameters applied in the GR2M model. The details information is delineated as follows.

Calibrating and validating the GR2M Model and its fitted values

The calibrating and validating results of the GR2M model is depicted as box plot in Fig. 5. The calibrated NSE, r, and OI values were 0.657, 0.825, and 0.757, respectively, and the verified NSE, r, and OI values were 0.449, 0.743 and 0.599, respectively. It was a satisfactory model prediction as suggested by Lian et al.51. The obtained r value of more than 0.70 showed a strong positive linear relationship between the calculated and observed runoff data52. The OI value of more than 0.60 showing the model had a relatively high accurate prediction. Figure 6 shows two examples of rainfall and runoff time series at the X.64 and X.70 stations, which were obtained from the GR2M model. The monthly rainfall time-series data is shown as a bar chart in blue. The observed and calculated runoff time-series data are shown with the line graphs in orange and green, respectively. And the solid and dot lines indicate the calibrated and validated periods, respectively. The agreement between observed and calculated runoff time-series data with a bit of underestimating the calculated runoff is found.

Figure 5
figure 5

The results of model’s calibration and validation.

Figure 6
figure 6

The rainfall and runoff time series at runoff stations X.64 and X.70.

The statistical values of the fitted GR2M model parameters (X1 and X2) for 37 runoff stations are displayed in Table 2. The production store capacity (X1) value varies between the allowable minimum (2.00 mm) and maximum (10.00 mm) with the average and standard deviation values of 5.71 mm and 2.49 mm, respectively. The skewness and kurtosis X1values of − 0.52, and − 1.03, respectively, indicated that the production store capacity (X1) in the southern river basin, Thailand, has left skew platykurtic, and non-symmetric distributions. The groundwater exchange rate (X2) value varies between 0.54 and 1.00. Most X2 values are 1.0, which is the maximum value, resulting in its average X2 value of 0.93 with a meagre standard deviation value of 0.12. The skewness and kurtosis values of X2 were − 2.01, and 3.69, respectively, indicated that the groundwater exchange rate (X2) in the southern river basin, Thailand, has left skew, leptokurtic, and non-symmetric distributions. It can observe that the positive obtained groundwater exchange rate (X2) value. Thus, it shows no groundwater runs out of the basin.

Table 2 The statistical values of the fitted GR2M model parameters.

The suitable group of independent variables for determining two GR2M parameters using regression-based methods

The results of investigating three scenarios for selecting the most suitable group of independent variables, that is, (1) scenario-1: using only the basin's physical characteristics, (2) scenario-2: using only hydrological information, and (3) scenario-3: combining those mentioned variables in scenarios 1 and 2. The results as shown in Table 3 indicates that scenario-1 received the worse performance for all cases due to giving the highest CA values than other cases. The italic number in Table 3 shows the scenario giving the best performance. For developing the regionalized X1 and X2 equations with MLR, we found that the most suitable group of independent variables was scenario-3. Also, the scenario-3 used for developing the regionalized X1 equation in which RF was the best independent variables. In scenario-2, the regionalized X2 equation developed by RF gave the best performance. By using the scenario-2, the regionalized X1 equation developed by M5 was the best. In scenario-3, the regionalized X2 equation developed by M5 gave the best performance. The explicit equations for estimating X1 and X2 values using MLR and M5 were shown in Eqs. (11) to (14). It should be noticed that the equation for X2 obtained from M5 method excluded variables of the basin's physical characteristics, although scenario-3 was selected the best one. RF is a machine learning algorithm and it has no an explicit equation like MLR and M5.

Table 3 The suitable group of independent variables for determining two GR2M parameters.

MLR

$$\begin{aligned} {\text{X}}_{1} & = 3.90 \times 10^{ - 5} {\text{A}} - 1.48 \times 10^{ - 6} {\text{L}} + 2.10 \times 10^{ - 5} {\text{L}}_{{\text{c}}} - 0.109{\text{RF}}_{2} - 0.079{\text{RF}}_{3} - 0.052{\text{RF}}_{4} \\ & \quad - \;0.031{\text{RF}}_{5} - 0.073{\text{RF}}_{6} - 0.049{\text{RF}}_{7} - 0.060{\text{RF}}_{8} - 0.056{\text{RF}}_{9} - 0.039{\text{RF}}_{10} \\ & \quad - \;0.067{\text{RF}}_{11} - 0.055{\text{RF}}_{12} + 0.053{\text{RF}}_{{\text{y}}} + 7.671 \\ \end{aligned}$$
(11)
$$\begin{aligned} {\text{X}}_{2} & = 3.57 \times 10^{ - 5} {\text{A}} - 1.54 \times 10^{ - 6} {\text{L}} + 2.59 \times 10^{ - 7} {\text{L}}_{{\text{c}}} - 6.92 \times 10^{ - 4} {\text{RF}}_{2} - 2.07 \times 10^{ - 3} {\text{RF}}_{3} \\ & \quad - \;5.38 \times 10^{ - 4} {\text{RF}}_{4} - 1.56 \times 10^{ - 4} {\text{RF}}_{5} - 5.29 \times 10^{ - 4} {\text{RF}}_{6} - 1.19 \times 10^{ - 3} {\text{RF}}_{7} \\ & \quad - \;1.24 \times 10^{ - 3} {\text{RF}}_{8} - 1.23 \times 10^{ - 3} {\text{RF}}_{9} - 5.90 \times 10^{ - 4} {\text{RF}}_{10} - 1.68 \times 10^{ - 3} {\text{RF}}_{11} \\ & \quad - \;9.11 \times 10^{ - 4} {\text{RF}}_{12} + 7.74 \times 10^{ - 4} {\text{RF}}_{{\text{y}}} + 1.270 \\ \end{aligned}$$
(12)

M5

$$\begin{aligned} {\text{X}}_{1} & = - 0.022{\text{RF}}_{2} - 0.0012{\text{RF}}_{3} - 0.0062{\text{RF}}_{6} - 0.0041{\text{RF}}_{9} + 0.0019{\text{RF}}_{{\text{y}}} \\ & \quad + \;6.5015:{\text{if}}\;{\text{RF}}_{6} \le 152.345,\;{\text{and}}\;{\text{RF}}_{{\text{y}}} \le 1701.835 \\ {\text{X}}_{1} & = - 0.0238{\text{RF}}_{2} - 0.0013{\text{RF}}_{3} - 0.0062{\text{RF}}_{6} - 0.0016{\text{RF}}_{7} - 0.0041{\text{RF}}_{9} \\ & \quad + 0.0019{\text{RF}}_{{\text{y}}} + 6.9793:\;{\text{if }}\;{\text{RF}}_{6} \le 152.345,\;{\text{and }}\;{\text{RF}}_{{\text{y}}} > 1701.835 \\ {\text{X}}_{1} & = - 0.0549{\text{RF}}_{2} - 0.0178{\text{RF}}_{3} - 0.0048{\text{RF}}_{6} - 0.0018{\text{RF}}_{9} - 0.0065{\text{RF}}_{11} + 0.0015{\text{RF}}_{{\text{y}}} \\ & \quad + \;7.1315:{\text{if}}\;{\text{RF}}_{6} > 152.345,{\text{RF}}_{3} \le 125.635,\;{\text{and}}\;{\text{RF}}_{2} \le 36.805 \\ {\text{X}}_{1} & = - 0.057{\text{RF}}_{2} - 0.0178{\text{RF}}_{3} - 0.0048{\text{RF}}_{6} - 0.0017{\text{RF}}_{9} - 0.0071{\text{RF}}_{11} + 0.0015{\text{RF}}_{{\text{y}}} \\ & \quad + \;6.6992:{\text{if}}\;{\text{RF}}_{6} > 152.345,{\text{RF}}_{3} \le 125.635,\;{\text{and}}\;{\text{RF}}_{2} > 36.805 \\ {\text{X}}_{1} & = - 0.0368{\text{RF}}_{2} - 0.0213{\text{RF}}_{3} - 0.0048{\text{RF}}_{6} - 0.0032{\text{RF}}_{9} + 0.0015{\text{RF}}_{{\text{y}}} \\ & \quad + \;7.2107:{\text{if}}\;{\text{RF}}_{6} > 152.345,\;{\text{and}}\;{\text{RF}}_{3} > 125.635 \\ \end{aligned}$$
(13)
$$\begin{aligned} {\text{X}}_{2} & = - 0.0003{\text{RF}}_{8} - 0.0002{\text{RF}}_{11} + 1.0979:{\text{if RF}}_{8} \le 105.745 \\ {\text{X}}_{2} & = 0.0006{\text{RF}}_{6} - 0.0002{\text{RF}}_{8} - 0.0007{\text{RF}}_{11} + 0.0003{\text{RF}}_{12} - 0.0002{\text{RF}}_{{\text{y}}} \\ & \quad + \;1.2801:{\text{if RF}}_{8} > 105.745 \\ \end{aligned}$$
(14)

where A = basin area, L = river length, Lc = river length from the basin’s centroid to the basin outlet, RF1, RF2, RF3, …, and RF12 = average monthly rainfall in January, February, March, …, and December, respectively, and RFy = average annual rainfall.

Model performance comparison in estimating the regional GR2M model parameters

In this section, the application of five methods (i.e., MLR, RF, M5, SPA, and PSA) were applied for developing the regionalized GR2M parameters, which are presented and discussed. The first three methods are based on the regression-based method and the rest two methods are distance-based method. Comparison of the fitted X1 and X2 parameters in the GR2M model and the two parameters obtained from those five methods was conducted. The results of applying those five methods to estimate X1 and X2 values are summarized in Table 4 for all 37 runoff stations. It indicated RF gave the best performance for estimating X1 due to providing the lowest CA value, following by MLR, M5, SPA, and PSA, respectively. Likewise, RF gave the best performance for estimating X2, followed by M5, MLR, SPA, and PSA.

Table 4 Statistical indices for estimating X1 and X2 values.

Evaluation of the regionalized parameters applied in the GR2M model

This section aims to present the performance evaluation of the regionalized GR2M parameters developed in the previous section. Those parameters, areal monthly rainfall and evapotranspiration were used as input parameters for the GR2M model. With the same input data sets of areal monthly rainfall, evapotranspiration and different X1 and X2 values obtained from five different methods, we got five monthly runoff time-series as the GR2M model’s output data. Those monthly runoff time-series were compared to that of the calibrated and validated GR2M model. Figure 7 shows the box plot graph, which was obtained from evaluating the GR2M model's effectiveness by using the regionalized GR2M parameters. Also, Table 5 presents the comparison of efficiency criteria obtained from applying X1 and X2 values in the GR2M model with five regionalized methods. As usual, the calibrated model’s performance for all methods was better than those of the validated ones. Figure 7 and Table 5 show that the average values of NSE, r, and OI obtained from the calibration stage gave better values than those obtained from the validation stage. In addition, RF gave the best results when considering NSE and OI values for both calibration and validation stages. Table 6 shows the number of runoff stations that were categorized into four groups with the same interval for each statistical index. By this way, we can see and compare the five methods' effectiveness in our experiment easily. Considering NSE, r, and OI values of equal or more than 0.70 simultaneously, we found that RF gave the best performance in monthly runoff estimation due to providing the highest total number of runoff stations of 60 (i.e., NSE, r, and OI values are equal or more than 0.70 simultaneously in those 60 stations), followed by SPA (53 stations), M5 (49 stations), MLR (46 stations), and PSA (42 stations). Figure 8 presents the scatter plot of three examples of X.44, X.67A, and X.234 runoff station. The graph shows the relationship between the observed and the simulated runoff obtained from GR2M model, MLR, RF, M5, SPA, and PSA in both calibration and validation stages. The perfect line is depicted as the 45-degree diagonal solid line.

Figure 7
figure 7

Comparative results of applying the estimated X1 and X2 values with five methods in the GR2M model.

Table 5 The performance comparison of applying the estimated X1 and X2 values in the GR2M model with 6 methods.
Table 6 The performance criteria of the analysis method for estimating parameters.
Figure 8
figure 8

The relationship between the observed runoff and simulated runoff obtained from six methods in X.44, X.67, and X.234.

Figure 9 presents a Taylor diagram that compares among five regionalized GR2M model and the calibrated and validated GR2M model. As shown in Fig. 9, all models gave a standard deviation value less than that of the observed runoff time series, except for PSA. RF provided the results closest to GR2M's results, followed by SPA, M5, PSA, and MLP. However, in case of lack of basin's physical characteristics and hydrological data, it would recommend using SPA since it only needs information on the distance between a gauged station (or donor station) and ungauged stations (or a target station).

Figure 9
figure 9

Taylor diagram for the regionalized GR2M model for training and testing stages.

Conclusion

The performance investigation of the regionalized GR2M model parameters for estimating monthly runoff in the ungauged basin was conducted in this research work. We selected 37 runoff gauged stations located in the southern basin, Thailand, as the study case. The regression-based and distance-based methods were applied for this purpose. Using regression-based methods to determine two GR2M parameters, the hydrological data was more suitable group of independent variables than the basin’s physical characteristics. We also found that RF gave the best performance for estimating X1 and X2 values due to providing the lowest error, followed by M5, MLR, SPA, and PSA. However, by simultaneously considering NSE, r, and OI values, RF provided the best performance in estimating monthly runoff time series by giving NSE, r, and OI values of equal or more than 0.70, followed by SPA, M5, MLR, and PSA. Furthermore, by using a Taylor diagram, we found that RF provided the results closest to GR2M's results, followed by SPA, M5, PSA, and MLP. However, in case of lack of basin's physical characteristics and hydrological information, it would recommend using SPA since it only needs information on the distance between a gauged station (or donor station) and ungauged stations (or a target station). Estimating monthly runoff time series in the ungauged basin via the regionalization methods could be drastically useful for water resources planning and management.