Comparing three types of data-driven models for monthly evapotranspiration prediction under heterogeneous climatic conditions

Aghelpour, Pouya; Varshavian, Vahid; Khodamorad Pour, Mehraneh; Hamedi, Zahra

doi:10.1038/s41598-022-22272-3

Download PDF

Article
Open access
Published: 17 October 2022

Comparing three types of data-driven models for monthly evapotranspiration prediction under heterogeneous climatic conditions

Scientific Reports volume 12, Article number: 17363 (2022) Cite this article

1340 Accesses
4 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Evapotranspiration is one of the most important hydro-climatological components which directly affects agricultural productions. Therefore, its forecasting is critical for water managers and irrigation planners. In this study, adaptive neuro-fuzzy inference system (ANFIS) model has been hybridized by differential evolution (DE) optimization algorithm as a novel approach to forecast monthly reference evapotranspiration (ET0). Furthermore, this model has been compared with the classic stochastic time series model. For this, the ET0 rates were calculated on a monthly scale during 1995–2018, based on FAO-56 Penman–Monteith equation and meteorological data including minimum air temperature, maximum air temperature, mean air temperature, minimum relative humidity, maximum relative humidity & sunshine duration. The investigation was performed on 6 stations in different climates of Iran, including Bandar Anzali & Ramsar (per-humid), Gharakhil (sub-humid), Shiraz (semi-arid), Ahwaz (arid), and Yazd (extra-arid). The models’ performances were evaluated by the criteria percent bias (PB), root mean squared error (RMSE), normalized RMSE (NRMSE), and Nash-Sutcliff (NS) coefficient. Surveys confirm the high capability of the hybrid ANFIS-DE model in monthly ET0 forecasting; so that the DE algorithm was able to improve the accuracy of ANFIS, by 16% on average. Seasonal autoregressive integrated moving average (SARIMA) was the most suitable pattern among the time series stochastic models and superior to its competitors, ANFIS and ANFIS-DE. Consequently, the SARIMA was suggested more appropriate for monthly ET0 forecasting in all the climates, due to its simplicity and parsimony. Comparison between the different climates confirmed that the climate type significantly affects the forecasting accuracies: it’s revealed that all the models work better in extra-arid, arid and semi-arid climates, than the humid and per-humid areas.

Higher temporal evapotranspiration estimation with improved SEBS model from geostationary meteorological satellite data

Article Open access 18 October 2019

Temperature prediction of solar greenhouse based on NARX regression neural network

Article Open access 28 January 2023

Estimating the monthly pan evaporation with limited climatic data in dryland based on the extended long short-term memory model enhanced with meta-heuristic algorithms

Article Open access 12 April 2023

Introduction

The process of water parting the surface of moist soil is called evaporation, whereas this phenomenon from leaves’ pores is called transpiration. Since recognizing these two phenomena on farms is not easy, they are considered one single integrated variable referred to as “evapotranspiration.” Since evapotranspiration is defined on the surface of an agricultural land, it also includes the water deposited by rain, irrigation, or dew drops on leaves. On the other hand, evapotranspiration is regarded as the water requirement of plants; thus, its measurement is essential in all agricultural and irrigation projects. The amount of evapotranspiration is measured by a lysimeter. Due to the sensitivity of the lysimeter, a technician expert is needed on-site to calibrate the lysimeter constantly. Consequently, if the recorded cases of lysimeter are not cared for carefully, they may have errors. As a remedy, the International Commission on Irrigation and Drainage (ICID) and World Meteorological Organization (WMO) have recognized that the FAO-56 Penman–Monteith equation (FAO-56 PM) should be an acceptable substitute for scarce lysimeter data (Allen et al.¹).

In recent years, despite some well-known mathematical models such as Penman–Monteith, Thornthwaite, Hargreaves-Samani, Blaney-Criddle, etc., the black-box artificial intelligence (AI) models have shown acceptable accuracy in estimating evapotranspiration. For example, Mohammadi and Mehdizadeh² and Ahmadi et al.³, surveying the arid and semi-arid regions of Iran, found that the AI models can estimate evapotranspiration with reasonable accuracy and the least available meteorological variables in the complete absence of meteorological variables, which are required to use the Penman method. They also contended that integrating AI models with bio-inspired optimization algorithms can significantly increase the accuracy of evapotranspiration estimation. In Australia, AIs could accurately estimate evapotranspiration with only temperature and wind speed as available variables (Falamarzi et al.⁴) that can be considered a suitable alternative for the FAO-56 PM model when the meteorological variables are missing. Also, in cases such as Kumar et al.⁵, lysimeter measured evapotranspiration values were used for the validation of the estimated evapotranspiration from neural networks, and their comparison with the outputs of the FAO-56 PM model showed that AIs could be a better estimator for evapotranspiration.

Reference evapotranspiration (ET0) is one of the main components of the hydrological cycle associated with agricultural systems. Accurate estimation and prediction of ET0 are critical in water resources management, irrigation planning, and determining plants’ water needs. Forecasting the ET0 rates by providing information on the future status of evapotranspiration at different time scales can help make appropriate decisions, plan, and apply management methods of water resources. The information for the next day(s), for short-term decisions and planning will be provided on a daily scale prediction. On a monthly scale prediction of ET0, obtaining a longer-term perspective of ET0 changes in the future is possible, which will be especially useful for crops with a long-term growth period (several months). Also, evaluating the agricultural drought status, which is done by famous indicators such as standardized precipitation-evapotranspiration index (SPEI) and Palmer drought severity index (PDSI), directly requires the monthly scale ET0 rate of the region. Data-driven models like stochastic and artificial intelligence methods are efficient approaches that have shown good performance in modeling and predicting hydrometeorological variables in recent years (Essam et al.⁶; Dehghanisanij et al.⁷; Elbeltagi et al.⁸; Azad et al.⁹; Zhang et al.¹⁰; Zarei et al.¹¹; Graf and Aghelpour¹²; Chen et al.¹³). In ET0 cases, Karbasi¹⁴ have used AIs for ET0 forecasting in 1, 2, 3, 7, 10, 14, 18, 24, and 30 days lead times. Karbasi¹⁴ concluded that the accuracy of the predictions was desirable and showed that when the forecast horizon increases, the forecasting accuracy decreases. A comparison between stochastic and artificial intelligence methods in Spain revealed that both model types predicted weekly evapotranspiration effectively (Landeras et al.¹⁵). Lucas et al.¹⁶ compared the seasonal autoregressive integrated moving average (SARIMA) stochastic model with the convolutional neural network (CNN) model to predict daily evapotranspiration in Brazil. They concluded that the CNN model can provide a more accurate prediction of evapotranspiration than the SARIMA model. In contrast, in the Tamil Nadu of India, a comparison was made between artificial intelligence and stochastic methods, and then more appropriate stochastic models were introduced for predicting ET0 (Kishore and Pushpalatha¹⁷).

Predicting evapotranspiration, especially in areas like Iran, which are facing limited water resources, is doubly crucial for determining the cultivation pattern and proper management of water and soil resources. In Iran, these two types of numerical models, i.e., stochastics and AIs, have been used to predict ET0. Ashrafzadeh et al.¹⁸ used the SARIMA, group method of data handling (GMDH), and support vector machine (SVM) models to predict ET0 in humid areas of the Caspian Sea’s southern margin (Guilan province). They evaluated the accuracy of the models and indicated that the mentioned models can predict the ET0 value for the next two years, with the same suitable accuracy as the train-test period. In the same region in Iran (Mazandaran province), Aghelpour and Norooz-Valashedi¹⁹ compared these two model types for the daily prediction of ET0 rates. They applied the models’ autoregressive (AR), moving average (MA), autoregressive moving average (ARMA) and autoregressive integrated moving average (ARIMA) as stochastic models, and compared them with three AIs including SVM, generalized regression neural network (GRNN), and adaptive neuro-fuzzy inference system (ANFIS). The results have shown the high capability of both model types in predicting daily ET0 rates for this humid region. Also another study has developed by Aghelpour et al.²⁰ for the estimation (not prediction) of rice evapotranspiration in this region. They have found that the AIs like GMDH, GRNN, multilayer perceptron (MLP), and radial basis function neural network (GRNN) are capable of providing a high accuracy estimation for the daily evapotranspiration rates of rice crop, which is the most important agricultural crop of this region.

The combination of bio-inspired optimization algorithms has significantly improved the AI performance of AIs in most cases (Ahmadianfar et al.²¹; Mehdizadeh et al.²²; Ahmadi et al.³; Babanezhad et al.²³; Mohammadi et al.²⁴; Aghelpour and Varshavian²⁵; Deo et al.²⁶). These algorithms that use complex evolutionary methods can optimally enhance the parameters of AIs and significantly increase the accuracy of the estimations and predictions. In the AIs, the parameter optimization process is commonly done by the linear least square or gradient decent algorithms, which may suffer from the local optimum problem. To dominate this problem, bio-inspired optimizers, which use nature-inspired search procedures rather than derivatives to find optimal solutions, are suggested in some studies to train AIs. Since there are many natural sources of inspiration, a host of bio-inspired optimizers can be found in the literature. However, just a few of these algorithms have been used in ET0 prediction cases. For example, Mohammadi and Mehdizadeh² have shown that in daily evapotranspiration modeling, a bio-inspired algorithm like the whale optimization algorithm can improve the accuracy of AIs in modeling reference evapotranspiration rates. Genetic and firefly are two other well-known bio-inspired algorithms that have significantly increased the AIs’ accuracy in evapotranspiration modeling cases (Roy et al.²⁷; Tao et al.²⁸; Eslamian et al.²⁹; Aghajanloo et al.³⁰; Yin et al.³¹; Gocić et al.³²). Differential evolution (DE) is another bio-inspired optimization algorithm that has been less used in this term. For example, it was well evaluated to improve the AIs’ accuracy in some cases, such as solar radiation estimation (Babatunde et al.³³; Halabi et al.³⁴), pan evaporation modeling (Wu et al.³⁵), dust source modeling (Rahmati et al.³⁶), or drought prediction (Aghelpour et al.³⁷), but has been rarely evaluated in evapotranspiration modeling cases.

The ANFIS model is one of the most efficient AI methods that has been used in both simple and hybridized forms for hydrological and meteorological modeling. ANFIS model showed its acceptable performances in solar radiation estimation (Üstün et al.³⁸; Halabi et al.³⁴; Khosravi et al.³⁹), pan evaporation estimation (Adnan et al.⁴⁰; Guven and Kisi⁴¹), drought forecasting (Aghelpour et al.⁴²; Aghelpour et al.⁴³; Aghelpour et al.⁴⁴; Kisi et al.⁴⁵), river flow modeling (Mohammadi et al.⁴⁶; Aghelpour et al.⁴⁷), rainfall forecasting (Mekanik et al.⁴⁸; Yaseen et al.⁴⁹), and wind speed forecasting (Maroufpoor et al.⁵⁰). However, they are less used in evapotranspiration prediction for the future (most of the studied cases have used the ANFIS model for ET0 “estimation,” not “prediction” for the future). The present study intends to use the ANFIS model to predict the reference evapotranspiration and compare it with the classical SARIMA stochastic model. Moreover, as a novelty, the DE algorithm is combined with the ANFIS model as ANFIS-DE in this study to optimize and improve the ANFIS’s prediction accuracy. This research studies stations from different climates (from extra-arid to per-humid). Moreover, investigating the effect of the climate type on the accuracy of the models predicting ET0 for the first time is another novelty aspect of the current research.

Materials and methods

Data and areas under investigation

Iran is located in the Middle East, on the dry belt of the earth. Consequently, it is facing limited water resources in human life’s different sectors, such as agriculture. According to De-Martonne climatic zoning, Iran has 28 different climatic classes (Rahimi et al.⁵¹). The majority of regions in Iran have arid (central desert, southwest, and southwest of the country) and semi-arid climates (the Zagros Mountains in the west and northwest of the country as well as northeastern regions), and only small areas of Iran have humid climates (the Southern shore of the Caspian Sea in the north). The evapotranspiration rate, which is affected by different meteorological factors, varies in different climatic zones. For example, in arid regions like Ahwaz, the range of ET0 is between 40 and 350 mm per month, while in humid climates like Ramsar, the ET0 varies between 20 and 158 mm per month. This paper aims to investigate the effect of the climate type on the accuracy of models predicting evapotranspiration. For this, six synoptic stations from different climates of Iran are considered, which are illustrated as Fig. 1 (R packages “sf” [Pebesm⁵²] and “ggplot2” [Wickham⁵³] were used to draw this figure).

Three stations were selected from the humid and sub-humid areas of northern Iran (on the southern margin of the Caspian Sea). The other three stations were from arid and semi-arid areas in central and southwestern parts of Iran. Most of the agricultural lands in the humid northern areas are under rice cultivation, and the horticultural lands in this area are often under citrus cultivation. In the arid and semi-arid regions of the southern parts of Iran, the main crops include wheat and maize, and the important horticultural crops are grapes and pistachios. A summary of the information about this study’s climatic zones, stations, and common products is shown in Table 1.

Table 1 The studied stations’ location, climate (according to extended De-Martonne classification) and the main agricultural/horticultural products of their regions.

Full size table

The data used in this paper include monthly meteorological data that belong to the period of 1995 to 2018. These data include minimum air temperature (Tmin), maximum air temperature (Tmax), mean air temperature (Tmean), minimum relative humidity (RHmin), maximum relative humidity (RHmax), and sunshine duration (SSD), which are prepared on a monthly scale of the Iranian Meteorological Organization (IRIMO). The temperature and humidity variables (Tmin, Tmax, Tmean, RHmax and RHmin) are measured in Stevenson screen box at 1.35 m height from the land surface, and the SSD is measured by sunshine recorder at 1.5 m height. The quality control of these datasets have been checked. There were a few numbers of missings and outliers that were modified using averaging method. Using these data and the FAO-56 PM model, the amount of monthly evapotranspiration was calculated in the six mentioned stations. The reference evapotranspiration based on the FAO-56 PM method is calculated by Eq. (1):

$$ET0 = { }\frac{{0.408\Delta \left( {R_{n} - G} \right) + \gamma \frac{900}{{(T_{a} + 273)}}u_{2} \left( {e_{s} - e_{a} } \right)}}{{\Delta + \gamma \left( {1 + 0.34u_{2} } \right)}}$$

(1)

where $ET0,{ }\Delta ,{ }R_{n} ,{ }G,{ }\gamma ,{ }T_{a} ,{ }u_{2} ,\;{\text{and}}\;e_{s} - { }e_{a} { }$ represent refrence evapotranspirartion $\left( {\frac{{{\text{mm}}}}{{{\text{month}}}}} \right)$, the slope of the vapor pressure curve $\left( {\frac{{{\text{kP}}_{{\text{a}}} }}{{^\circ {\text{C}}}}} \right)$, net surface radiation $\left( {\frac{{{\text{MJ}}}}{{{\text{m}}^{2} {\text{day}}}}} \right)$, soil heat flux ($\left( {\frac{{{\text{MJ}}}}{{{\text{m}}^{2} {\text{day}}}}} \right)$, psychrometric constant $\left( {0.0677\frac{{{\text{ kP}}_{{\text{a}}} }}{{^\circ {\text{C}}}}} \right)$, monthly mean air temperature ($^\circ {\text{C}}$), monthly average wind speed $\left( {\frac{{\text{m}}}{{\text{s}}}} \right)$ at 2 m, and the difference between saturation and actual vapor pressure ($kP_{a}$), respectively (Allen et al.¹). According to Allen et al.¹, $\Delta ,{ }\gamma$ were computed as a function of atmospheric pressure obtained from the local altitude (m), and the maximum and minimum relative humidity as well as maximum and minimum temperature values were used to compute $e_{a}$ and $e_{s}$. Net radiation is the difference between net incoming shortwave solar radiation and outgoing longwave terrestrial radiation. Due to the lack of measurement of the actual solar radiation in most synoptic stations like the one in this research, solar radiation can be estimated from the Angstrom formula based on the actual sunshine duration. Moreover, the net output longwave radiation was estimated according to the modified Stefan-Boltzmann law by considering the effect of cloudiness and atmospheric humidity (downward longwave from the sky). Interested readers can refer to Allen et al. ¹. The “evapotranspiration” package in R software was used to estimate the evapotranspiration rates. For modeling, the period under study was divided into two parts of training and testing that include 75% (the first 18 years of 1995–2012) and 25% (the remaining six years of 2013–2018), respectively. In the training phase, the model is extracted, and the extracted model is applied for predicting ET0 during the testing phase. Then the models’ predictions will be validated by the actual (calculated) ET0. The characteristics of the meteorological data and the calculated evapotranspiration data are shown in Table 2.

Table 2 Specifications of the meteorological data used and the calculated ET0 on the monthly scale.

Full size table

Time series model

A time series is a set of recorded observations of a variable such as ${\text{X}}_{{\text{i}}}$ Overtime in the form of ${\text{X}}_{1}$, ${\text{X}}_{2}$, ${\text{X}}_{3}$, …, ${\text{X}}_{{\text{N}}} ,$ among which the time interval is equal (Gautam and Sinha⁵⁴). Time series models are stochastic models that work based on regression coefficients and use the time lags of the target variable as the model’s input variable. These models include autoregressive (AR), integrated (I), and moving average (MA) components. They are shown in an integrated state known as autoregressive integral moving average (ARIMA). The seasonal ARIMA (SARIMA) model is a model that can be used for numerical simulation of the stochastic behavior of periodic time series. In other words, SARIMA is a linear parametric stochastic model that can be used to model and predict variables which have seasonal autocorrelations. The cross form of this model is shown as SARIMA(p, d, q) × (P, D, Q)_ω, in which ω is the periodicity, p, d, and q are the non-seasonal degrees of autoregressive, differencing, and moving average, respectively, and P, D, and Q are the seasonal degrees of autoregressive, differencing, and moving average, respectively. The general form of this model is shown below: (Salas⁵⁵):

$$\Phi_{P} \left( {B^{\omega } } \right)\phi_{p} \left( B \right)\nabla_{\omega }^{D} \nabla^{d} X_{t} = \theta_{q} \left( B \right)\Theta_{Q} \left( {B^{\omega } } \right)\varepsilon_{t}$$

(2)

In this formula ${X}_{t}$ is a stochastic variable as the target, and ${\varepsilon }_{t}$ is a normal random variable with mean μ and variance ${\sigma }_{\varepsilon }^{2}$, as a residual. The parameters of B including Φ, ϕ, ${\nabla }_{\omega }^{D}$, ${\nabla }^{d}$, Θ, θ, represent the backward operators associated with seasonal autoregressive, non-seasonal autoregressive, seasonal differencing and non-seasonal differencing, seasonal moving average, and non-seasonal moving average, respectively. Their equations are described in Eqs. 3–8 (Salas⁵⁵).

$$\Phi_{P} \left( {B^{\omega } } \right) = \left( {1 - \Phi_{1} B^{\omega \times 1} - \ldots - \Phi_{P} B^{\omega \times P} } \right)$$

(3)

$$\phi_{p} \left( B \right) = \left( {1 - \phi_{1} B^{1} - \ldots - \phi_{p} B^{p} } \right)$$

(4)

$$\nabla_{\omega }^{D} = \left( {1 - B^{\omega } } \right)^{D}$$

(5)

$$\nabla^{d} = \left( {1 - B} \right)^{d}$$

(6)

$$\Theta_{Q} \left( {B^{\omega } } \right) = \left( {1 - \Theta_{1} B^{\omega \times 1} - \ldots - \Theta_{Q} B^{\omega \times Q} } \right)$$

(7)

$$\theta_{q} \left( B \right) = \left( {1 - \theta_{1} B^{1} - \ldots - \theta_{q} B^{q} } \right)$$

(8)

We used the Minitab software and the SARIMA model to simulate and predict evapotranspiration time series in this research.

Adaptive neuro-fuzzy inference system (ANFIS)

ANFIS model can make relationships between input and output data using fuzzy rules to learn from a neural network to generate input structure for a system. ANFIS model designs and creates nonlinear maps to define relationships between input and output spaces by employing the artificial neural network and fuzzy logic, which is known as a neuro-fuzzy system. fuzzy systems include three different parts: fuzzification, inference engine, and defuzzification. By utilizing fuzzy inference systems, fuzzy rules are achieved. A fuzzy inference system consists of two different inferences, namely Mamdani (Mamdani and Assilian⁵⁶) and Sugeno (Takagi and Sugeno⁵⁷). They both work great when combined with an optimization algorithm and adaptive techniques (Khosravi et al.³⁹). In this paper, we use Sugeno inference. Figure 2 shows the structure of the ANFIS model.

These two equations are the base rules of Sugeno inference:

$${\text{Rule 1}}:\;{\text{if}}\; x\; is \;A_{1} \;{\text{and}}\; y \;is\; B_{1} , \;{\text{then}}\; f_{1} = p_{1} x + q_{1} y + r_{1}$$

(9)

$${\text{Rule 1}}:\;{\text{if}}\; x\; is \;A_{2} \;{\text{and}} \;y \;{\text{is}} \;B_{2} , \;{\text{then}}\; f_{2} = p_{2} x + q_{2} y + r_{2}$$

(10)

ANFIS model contains different layers. Layer one, in this model, is the fuzzification layer. Each node receives a signal and then transfers it to the next layer. The following equation describes the cells’ outputs ($O_{1}^{i}$) (Khosravi et al.³⁹; Haznedar and Kalinli⁵⁸):

$$O_{1}^{i} = \mu_{{A_{i} }} \left( x \right);\quad i = 1, 2$$

(11)

${\mu }_{{A}_{i}}$ is related to membership function (MF). ${A}_{i}$ is linguistic variable and is related to node function. The following equation shows the standard formula for ${\mu }_{{A}_{i}}$

$$\mu_{{A_{i} }} \left( x \right) = \exp \left\{ { - \left[ {\left( {\frac{{x - c_{i} }}{{a_{i} }}} \right)^{2} } \right]^{{b_{i} }} } \right\}$$

(12)

In this equation, x is the input, and ${a}_{i}$, ${b}_{i}$, ${c}_{i}$ are premise parameters. Layer 2 is called the rule layer which is obtained by membership degrees. All the output nodes establish the firing strength of a fuzzy rule.

$$O_{2}^{i} = w_{i} = \mu_{{A_{i} }} \left( x \right){ } \cdot \mu_{{B_{i} }} \left( y \right);\quad i = 1, 2$$

(13)

Layer 3 is the normalization layer. In this layer, all the nodes are fixed and tagged with N. The rule’s firing strength to the sum of all rules’ firing strengths is the ratio calculated by the $i^{th}$ node in the normalization layer.

$$O_{3}^{i} = \overline{{w_{i} }} = \frac{{w_{i} }}{{w_{1} + w_{2} }};\quad i = 1, 2$$

(14)

The defuzzification layer is layer 4 of the ANFIS model. Each rule uses the value of the previous layer to compute the output value.

$$O_{4}^{i} = \overline{{w_{i} }} f_{i} = \overline{{w_{i} }} \left( {p_{i} x + q_{i} y + r_{i} } \right);\quad i = 1, 2$$

(15)

In this equation, $\overline{{w_{i} }}$ comes from the previous layer, namely layer 3. $\overline{{w_{i} }}$ is a normalized firing strength and $p_{i}$, $q_{i}$, and $r_{i}$ are the consequent parameters. Layer 5 is called the sum layer. By summing the output values of the rules that come from the previous layer, the final output of the ANFIS model is calculated.

$$O_{5}^{i} = overall \; output = \mathop \sum \limits_{i} \overline{{w_{i} }} f_{i} = \frac{{\mathop \sum \nolimits_{i} w_{i} f_{i} }}{{\mathop \sum \nolimits_{i} w_{i} }}\quad i = 1, 2$$

(16)

To implement the ANFIS model, we used MATLAB software in this study.

To summarize, the ANFIS model contains two sets of parameters: premise parameters and consequence parameters. Premise parameters are input parameters of MFs, and they aim to specify the shape and the location of the input MFs (parameters of input MFs). Consequence parameters are the output parameters of MFs (parameters of output MFs) (Jang ⁵⁹). Classical ANFIS uses the least square (LS) methods to estimate these parameters. However, in the current research, we have developed a novel ANFIS-DE model, which uses the meta-heuristic DE algorithm to estimate ANFIS’s sets of parameters.

Differential evolution (DE) optimization algorithm

Although differential evolution (DE) uses basic optimized operations such as mutation, crossover, and selection, it is an impressive and powerful optimization algorithm. One of the privileges of this algorithm is that it has parallel search methods and uses NP, and also has D-dimensional vectors of parameters (Omidi and Mazaheri⁶⁰). The advantage of these vectors is that they do not change during the minimization procedure. DE performs a population process for each generation G. First, one population vector is randomly initialized, including the parameters, and this probability distribution is uniformed. When the preliminary solution is achieved, the DE algorithm calculates the difference between the weights of two population vectors and assigns it to the third vector in order to produce new parameter vectors, which is known as the mutation operation (Halabi et al.³⁴):

$$v_{i,G + 1} = x_{i,G} + F\left( {x_{r2,G} - x_{r3,G} } \right)$$

(17)

According to $v_{i,G + 1}$, these mutant vectors, $x_{i}$, $G$ and $i = 1,2,3, \ldots ,NP$ are created, while $r1$, $r2$, and $r3$ are randomly integers, and NP is selected from this distribution: integers $\in \left[ {1,2,3, \ldots ,NP} \right].$ Moreover, $I$ and $F$ are real values, and they are different $\in \left[ {1,2,3, \ldots ,NP} \right]$.

During the mixing process, which is also called crossover operation, parameters of the mutated vector are mixed with other vector parameters to create the trial vector. The following equations describe this mixing process:

$$u_{i,G + 1} = \left( {u_{1i,G + 1} ,u_{2i,G + 1} , \ldots ,u_{di,G + 1} } \right)$$

(18)

$$u_{ji,G + 1} = \left\{ {\begin{array}{*{20}c} {v_{ji,G + 1} ; if \, randb\left( j \right) \le \; CR \; or \; j = rnbr\left( i \right)} \\ {x_{ji,G + 1} ; if \; randb\left( j \right) > \; CR \; or \; j \ne rnbr\left( i \right)} \\ \end{array} } \right.$$

(19)

In this equation, $u_{i,G + 1}$ is the trailer, and $x_{i,G}$ is the target vector, where $u_{i,G + 1}$ and $x_{i,G}$ are the trailer and target vectors, respectively. $randb\left( j \right)$ is the Jth uniform random evaluation $\in \left[ {0.1} \right]$, $rnbr\left( i \right)$ is a random value index $\in \left[ {1,2,3, \ldots ,d} \right],$ and $CR$ is a crossover constant determined by users. The selection operation is the last step. The trial vector costs a lower cost function than the target vector. Therefore, the selection operation uses the trial vector as a target value for the next generation. $NP$ competitions are assumed like one generation procedure as each population vector has to serve once as the target vector. Complementary descriptions about the DE optimization algorithm can be found in Storn and Price⁶¹ and Halabi et al.³⁴. The DE algorithm flowchart is illustrated in Fig. 3.

In this paper, the DE algorithm is implemented by coding in MATLAB software’s environment. The trial and error method is used to choose the best operators of DE to optimize the ANFIS model. They are illustrated in Table 3.

Table 3 The operators of differential evolution algorithm.

Full size table

Evaluating the accuracy of the predictions

This study uses six criteria to evaluate the performance of the models: root mean square error (RMSE), normalized RMSE (NRMSE), mean absolute error (MAE), percent bias (PB), Pearson correlation coefficient (R), coefficient of determination (R²), and Nash- Sutcliff coefficient (NS). In general, these criteria are used to compare the accuracy of different models with one another. Furthermore, they are used to compare the accuracy of models in different climates. To calculate them, we need two series of predicted and observed evapotranspiration data. Their equations are as follows.

$$RMSE = \sqrt {\frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {ETO_{i} - ETP_{i} } \right)^{2} } ;\quad 0 < RMSE < + \infty$$

(20)

$$NRMSE = \frac{{\sqrt {\frac{1}{n}\mathop \sum \nolimits_{i = 1}^{n} \left( {ETO_{i} - ETP_{i} } \right)^{2} } }}{{ETO_{max} - ETO_{min} }};\quad 0 < NRMSE < + \infty$$

(21)

$$MAE = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left| {ETO_{i} - ETP_{i} } \right|;\quad 0 < MAE < + \infty$$

(22)

$$PB = \mathop \sum \limits_{i = 1}^{n} \left( {\frac{{ETO_{i} - ETP_{i} }}{{ETO_{i} }}} \right);\quad - \infty < PB < + \infty$$

(23)

$$R = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {ETO_{i} - \overline{ETO} } \right)\left( {ETP_{i} - \overline{ETP} } \right)}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} \left( {ETO_{i} - \overline{ETO} } \right)^{2} } *\sqrt {\mathop \sum \nolimits_{i = 1}^{n} \left( {ETP_{i} - \overline{ETP} } \right)^{2} } }};\quad - 1 < R < 1$$

(24)

$$R^{2} = \left[ {\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {ETO_{i} - \overline{ETO} } \right)\left( {ETP_{i} - \overline{ETP} } \right)}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} \left( {ETO_{i} - \overline{ETO} } \right)^{2} } *\sqrt {\mathop \sum \nolimits_{i = 1}^{n} \left( {ETP_{i} - \overline{ETP} } \right)^{2} } }}} \right]^{2} ;\quad 0 < R^{2} < 1$$

(25)

$$NS = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {ETO_{i} - ETP_{i} } \right)^{2} }}{{\mathop \sum \nolimits_{i = 1}^{n} \left( {ETO_{i} - \overline{ETO} } \right)^{2} }};\quad - \infty < NS < 1$$

(26)

$ETO_{i }$ shows the amount of the observed evapotranspiration (FAO-56 PM calculated ET0) of the ith month, $ETP_{i}$ is the amount of evapotranspiration predicted in the ith month, $\overline{ETO}$ shows the mean of observed evapotranspiration, $\overline{ETP}$ represents the average of the predictive evapotranspiration, $ETO_{max}$ is the maximum of the observed evapotranspiration, and finally $ETO_{min}$ is the minimum of the observed evapotranspiration. According to the defined range for these criteria, the closer the RMSE, NRMSE, MAE and PB are to zero, and the closer NS, R, and R² are to one, the better the model performance is. Another point about NRMSE is that it has 4 intervals while evaluating the models’ quality: (1) NRMSE > 0.3 poor performance, (2) 0.2 < NRMSE < 0.3 average performance (3) 0.1 < NRMSE < 0.2 good performance and (4) 0 < NRMSE < 0.1 excellent performance (Bahrami-Pichaghchi and Aghelpour⁶²). From another point of view, the used criteria are divided into four categories: (I) Accuracy: these criteria can show the errors of the models in ET0 prediction, including RMSE, MAE; (II) precision: these criteria can show the quality of the models in ET0 prediction, including NRMSE and NS; (III) under or overestimation: this criterion can talk about the models’ under/overestimation in ET0 prediction, including PB; (IV) correlation: these criteria show the correlation intensity between the models’ predictions and their observed values, including R and R². It should be noted that all these criteria will be applicable for comparing several models in a specific station.

The general process of modeling and predicting the evapotranspiration time series in this paper is shown as a flowchart in Fig. 4.

Results

Modeling and evaluating the predictions

In this study, the ET0 rates were first calculated by FAO-56 PM method, and the meteorological variables, are represented in Table 1. Then the models were applied for ET0 prediction. It’s worth mentioning that if the inputs are the meteorological variables, the modeling problem is applicable for an “estimation” case and is not usable for a “prediction” (for the future). For a time series prediction problem, the model inputs must have time lag(s) and the output’s time lag must be equal to zero. Since the time series stochastic models are only able to consider the main variable's time lags as input, the same inputs (time lags of ET0) are considered for the ANFIS and ANFIS-DE models too, for a fair comparison. Therefore, autocorrelation function (ACF) diagrams for different stations were considered (Fig. 5) that show the extent and significance of the correlation of the variable with its previous steps’ amounts.

As Fig. 5 indicates, the ET0 data in all six stations have a significant seasonal trend. The ET0 time series are periodic and have a 12 months periodicity. To moderate this seasonal trend, we considered several degrees of seasonal differentiations with a lag of 12 months (equal to the periodicity). Investigations showed that order “one” seasonal differentiation has the best consistency with ET0 data. As a result, the SARIMA model is modified as the SARIMA pattern SARIMA (p,0,q)(P,1,Q)₁₂. Moreover, when the time lag increases, the significance threshold of correlation (dashed line) increases; with more than three return periods (36 months), it reaches a point that is practically logical not to use them as inputs. Therefore, a maximum lag of 36 months is considered as input for all models. In the SARIMA model, this includes seasonal autoregressive and moving average degrees (P & Q), which are equal to 1, 2, and 3. These degrees and also the non-seasonal degrees of autoregressive and moving average (p & q) were all tested, and their best performance was selected for each station and reported in Table 4. Simple and hybrid ANFIS models (ANFIS & ANFIS-DE) were implemented based on the fuzzy cluster means (FCM) clustering method. Lags of 1, 6, 12, 18, 24, 30, and 36 months were also considered as inputs to these AI models.

Table 4 Evaluating the models’ predictions by evaluation criteria.

Full size table

In Table 4, the predictions of all three models were evaluated by the mentioned evaluation metrics. Since the test section actually shows the validity of the models, the test section is also discussed in the interpretations of this section. At first, it can be seen that in all stations, the R coefficients are very high, which indicates the optimal performance of the models in predicting monthly ET0 (the minimum value of R is equal to 0.949, which belongs to the simple ANFIS model in Ramsar station). Additionally, the amount of PB in all cases is very small (close to zero), which confirms the lack of significant under/overestimation and, consequently, the excellent performance of the models. According to Table 4, the SARIMA linear model has superior performance in all stations than the other two models, and the weakest performance among the models belongs to the simple ANFIS model. In combination with the ANFIS model (ANFIS-DE), the DE algorithm was able to increase the prediction accuracy for ANFIS by an average of 15.8%. The lowest prediction error belongs to the SARIMA model at Shiraz station with RMSE = 7.918 $\frac{{{\text{mm}}}}{{{\text{month}}}}$. The highest prediction error is reported in Ahwaz station with RMSE = 16.906 $\frac{{{\text{mm}}}}{{{\text{month}}}}$ , which belongs to the simple ANFIS model.

Comparing the models

Scatter plots are used for graphical illustration of the correlation between the predicted and actual values of monthly ET0 (Fig. 6).

In Fig. 6, the horizontal axis of the graphs represents the observed ET0 data, and the vertical axis represents the predictions presented by the models. This figure shows that, at all stations, the slope of the fitted regression line between the observed-predicted data samples is very small, associated with the X = Y line. The points are well concentrated around their regression line, and this concentration is more on the diagrams related to the SARIMA model than the other two models. On the other hand, the R² coefficient shows that the SARIMA linear model offers a better prediction than the other two nonlinear and complex models, i.e., ANFIS and ANFIS-DE. Also, ANFIS-DE predictions show better correlations compared to simple ANFIS. The diagrams in Fig. 6 show that the weakest performance belongs to the predictions of ANFIS in Ramsar (R² = 0.901), and the best performance belongs to the predictions of SARIMA at Yazd station (R² = 0.984). The Taylor diagram is also represented for each station to compare the models (Fig. 7).

This diagram (Fig. 7) can simultaneously check the correlation and the error and also compare the standard deviations of the outputs of several models and their observed values. In these diagrams, point O is an indicator of observed data, and points A, B, and C are the indicators of the SARIMA, ANFIS, and ANFIS-DE models, respectively. At all stations, point A is located closest to point O, confirming the superiority of the SARIMA model. After that, ANFIS-DE (point C) and ANFIS (point B) models are located in the second and third places, respectively. The best position of points A, B, and C belongs to Shiraz station, where these points are placed between two circles RMSE = 5 $\frac{{{\text{mm}}}}{{{\text{month}}}}{ }$ and RMSE = 10 $\frac{{{\text{mm}}}}{{{\text{month}}}}$, and around the radius R = 0.99. At Yazd station, a situation similar to Shiraz station is observed. The weakest points’ position can belong to Bandar Anzali station, where points A, B, and C are farthest from point O, between circles of RMSE = 10 $\frac{{{\text{mm}}}}{{{\text{month}}}}{ }$ and RMSE = 15 $\frac{{{\text{mm}}}}{{{\text{month}}}}$, and between two radii of R = 0.99 and R = 0.95. Furthermore, a comparison of the standard deviations between outputs and the observations reveals that the points of the models, especially point A, are in a favorable position relative to the quadrant close to point O. This shows that the models, especially SARIMA, can favorably estimate the standard deviation of actual ET0 values.

Comparing ET0 prediction accuracy in different climates

In general, the comparison between the stations in Fig. 7 indicates that the humid stations are in weaker ranges of error and correlation than the arid stations. Also, according to Fig. 6, the R² value resulting from the SARIMA model in humid and sub-humid climates is in the range of 0.95–0.96, while it is in the range of 0.97–0.98 in arid and semi-arid regions. Therefore, it is evident that ET0 is predicted slightly better in arid areas. However, due to the different range of ET0 data in different climates (Table 2), it is better to consider the normalized RMSE (NRMSE) criterion at stations for evaluation (Fig. 8).

In Fig. 8, the NRMSE and NS criteria for the test period were plotted together as a combo-graph. This diagram is drawn separately for all models at all stations. At first, we can observe that all models have an NS value greater than 0.9, which confirms the models’ favorable prediction of ET0. Moreover, the NRMSE value in all stations is less than 0.1. According to the quality classes defined for NRMSE, the predictions for all climates in this study are considered very reasonable. The visible trend of NS and NRMSE is similar across stations. Both criteria indicate a better prediction of ET0 in arid and semi-arid climates. In other words, if the NS level increases at a station, the NRMSE level will decrease at the same station (which is well illustrated in the combo-graph). Therefore, we can state that both criteria achieved similar results in comparing the accuracy of ET0 prediction among the climates. For example, in the ANFIS-DE model for humid and sub-humid stations, the NRMSE is between 0.07 and 0.09 and the NS is between 0.93 and 0.95, while for arid and semi-arid stations, NRMSE is between 0.04 and 0.06 and NS is between 0.97 and 0.98. In the combo-graph belonging to the SARIMA model, the NRMSE value for humid and sub-humid areas is between 0.06 and 0.08, and the NS value is between 0.94 and 0.96, while for arid and semi-arid areas, the NRMSE is between 0.04 and 0.05, and the NS is between 0.98 and 0.99. The comparison of the models is similar to the previous diagrams and tables, which reported that the SARIMA model is more appropriate. The predictions provided by the models can also be seen graphically in time-series plots (Fig. 9) to observe the overlaps.

Discussion

For ET0 modeling, the simple and hybridized AIs have been examined in several studies (as mentioned in the introduction section). Mohammadi and Mehdizadeh², Roy et al.²⁷, Tao et al.²⁸, Eslamian et al.²⁹, Aghajanloo et al.³⁰, Yin et al.³¹, and Gocić et al.³² are such these studies that have shown the combination of AIs with bio-inspired algorithms, can significantly improve the accuracy of simple AIs in ET0 modeling; which is similar to the current study’s results. However, in these mentioned studies the modeling was only applicable in the “estimation” of ET0 and is not examined for future “prediction” of ET0 rates; which distinguishes the mentioned studies from the current study. The desirability of the prediction accuracy of time series models in the current study is similar to the research of Gautam and Sinha⁵⁴, Landeras et al.¹⁵, Psilovikos and Elhag⁶³, Mossad and Alazba⁶⁴, and Bouznad et al.⁶⁵, that have been conducted in different climatic regions. The superiority of time series models over AIs in ET0 forecasting in Iran has also been reported in Ashrafzadeh et al.¹⁸ and Aghelpour and Norooz-Valashedi¹⁹. However, their studies only addressed the humid northern climate of Iran. Additionally, Ashrafzadeh et al.¹⁸ and Aghelpour and Norooz-Valashedi¹⁹ used non-hybridized artificial intelligence models, while the current research showed that the novel hybrid ANFIS-DE model can significantly increase the accuracy of the simple ANFIS model. In Brazil, however, AIs provided a relatively more accurate prediction of ET0 than time series models did (Lucas et al.¹⁶), which contradicts the results of the current study. This contradiction could be due to the differences between the climatic conditions of the studies’ regions.

Comparing the climates of the present study showed that the geographical location and the physical systems involved can be factors influencing the accuracy of ET0 prediction. For example, the humid regions of northern Iran are affected by Caspian atmospheric systems and various western systems, such as the Black Sea and the Mediterranean Sea, whereas the western and southwestern regions of Iran (like Shiraz and Ahwaz) are only weakly affected by the Saudi Arabia’s high-pressure and Sudan’s low-pressure systems. Susceptibility to a large number of systems can disrupt the order of time series, reduce autocorrelation, and consequently lead to poor prediction. This difference in the order of the ET0 series in different climates is depicted in the diagrams of Fig. 9. On the other hand, these three stations of Shiraz, Ahwaz, and Yazd, are located near the subtropical high-pressure belt (SHPB) (latitude 30 degrees), which can stabilize the weather regime in these areas, and thus make the ET0 series more regular. By moving away from the SHPB and approaching the latitudes of the humid northern regions, the effects of the irregularity of the annual regime become more obvious. This irregularity can decrease the autocorrelation of ET0 series (it is almost distinguishable in ACF plots of Fig. 5), and since the predictions are directly affected by ET0 time lags and autocorrelation within them, it can eventually cause a relative increase in the prediction errors in these humid areas.

Conclusion

Studies have shown that the water requirement of plants can be predicted with outstanding accuracy by using the time lags of the evapotranspiration variable. The currently used data-driven approaches could provide acceptable predictions of ET0, regardless of the various atmospheric and physical factors that affect it. This result is similar in all currently studied climates. Despite the significant improvement of the ANFIS model combined with the differential evolution optimization algorithm (about 16%), it still fails to compete with the SARIMA linear model. According to Ashrafzadeh et al.^18,66, the reason is that the linear autocorrelation is stronger than nonlinear autocorrelation in the ET0 time series. Finally, the present study proposes time series models for a better prediction of ET0 for two reasons: (1) higher accuracy and (2) the simplicity of use. Another important conclusion of this paper is that the climate type of a region significantly affects the accuracy of the models predicting ET0. ET0 was predicted more accurately in the arid and semi-arid climates of southern Iran than the humid and sub-humid regions of its north. Due to the high accuracy and promising results of the present study, using these data-driven models to predict plants’ water needs in other geographical areas is recommended. As a practical aspect of the current results, to predict the actual water requirement of a specific crop, the predicted ET0 rate can be obtained by multiplying the crop’s coefficient (FAO coefficients or the local reported coefficient). Moreover, utilizing the current models, especially SARIMA and the hybrid ANFIS-DE, has research value for long-term and multi-ahead years prediction of monthly ET0. The use and comparison of stochastic, artificial intelligence, and metaheuristic models in predicting ET0 on a daily scale can be an interesting topic of study, which we suggest to future researchers in this field. It’s worth mentioning that due to the limitations of the SARIMA model (which cannot consider other options as input to the model except the time lags of the evaporation variable itself), the machine learning models were applied by the inputs of ET0 time lags, to make a logicalcomparison. Therefore, it is suggested that future studies investigate the impacts of other hydro-meteorological factors’ time lags, such as droughts, heat waves, solar radiation, temperaturte, humidity, wind speed, etc. for ET0 prediction, which can only be applied by machine learning algorithms.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Allen, R. G., Pereira, L. S., Raes, D. & Smith, M. Crop evapotranspiration-Guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56. Fao, Rome 300, D05109 (1998).
Google Scholar
Mohammadi, B. & Mehdizadeh, S. Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric. Water Manag. 237, 106145 (2020).
Article Google Scholar
Ahmadi, F. et al. Application of an artificial intelligence technique enhanced with intelligent water drops for monthly reference evapotranspiration estimation. Agric. Water Manag. 244, 106622 (2021).
Article Google Scholar
Falamarzi, Y., Palizdan, N., Huang, Y. F. & Lee, T. S. Estimating evapotranspiration from temperature and wind speed data using artificial and wavelet neural networks (WNNs). Agric. Water Manag. 140, 26–36 (2014).
Article Google Scholar
Kumar, M., Raghuwanshi, N. S., Singh, R., Wallender, W. W. & Pruitt, W. O. Estimating evapotranspiration using artificial neural network. J. Irrig. Drain. Eng. 128, 224–233 (2002).
Article Google Scholar
Essam, Y. et al. Predicting streamflow in Peninsular Malaysia using support vector machine and deep learning algorithms. Sci. Rep. 12, 1–26 (2022).
Google Scholar
Dehghanisanij, H., Emami, H., Emami, S. & Rezaverdinejad, V. A hybrid machine learning approach for estimating the water-use efficiency and yield in agriculture. Sci. Rep. 12, 1–16 (2022).
Article Google Scholar
Elbeltagi, A. et al. Modelling daily reference evapotranspiration based on stacking hybridization of ANN with meta-heuristic algorithms under diverse agro-climatic conditions. Stoch. Environ. Res. Risk Assess. 36, 1–24 (2022).
Article Google Scholar
Azad, A. S. et al. Water level prediction through hybrid SARIMA and ANN models based on time series analysis: Red hills reservoir case study. Sustainability 14, 1843 (2022).
Article Google Scholar
Zhang, W., Lin, Z. & Liu, X. Short-term offshore wind power forecasting-A hybrid model based on Discrete Wavelet Transform (DWT), Seasonal Autoregressive Integrated Moving Average (SARIMA), and deep-learning-based Long Short-Term Memory (LSTM). Renew. Energy 185, 611–628 (2022).
Article Google Scholar
Zarei, M. et al. Machine-learning algorithms for forecast-informed reservoir operation (FIRO) to reduce flood damages. Sci. Rep. 11, 1–21 (2021).
Article Google Scholar
Graf, R. & Aghelpour, P. Daily river water temperature prediction: A comparison between neural network and stochastic techniques. Atmosphere (Basel). 12, 1154 (2021).
Article ADS Google Scholar
Chen, C., He, W., Zhou, H., Xue, Y. & Zhu, M. A comparative study among machine learning and numerical models for simulating groundwater dynamics in the Heihe River Basin, northwestern China. Sci. Rep. 10, 1–13 (2020).
Google Scholar
Karbasi, M. Forecasting of multi-step ahead reference evapotranspiration using wavelet-Gaussian process regression model. Water Resour. Manag. 32, 1035–1052 (2018).
Article Google Scholar
Landeras, G., Ortiz-Barredo, A. & López, J. J. Forecasting weekly evapotranspiration with ARIMA and artificial neural network models. J. Irrig. Drain. Eng. 135, 323–334 (2009).
Article Google Scholar
e Lucas, Pd. O., Alves, M. A., e Silva, PCd. L. & Guimarães, F. G. Reference evapotranspiration time series forecasting with ensemble of convolutional neural networks. Comput. Electron. Agric. 177, 105700 (2020).
Article Google Scholar
Kishore, V. & Pushpalatha, M. forecasting evapotranspiration for irrigation scheduling using neural networks and ARIMA. Int. J. Appl. Eng. Res. 12, 10841–10847 (2017).
Google Scholar
Ashrafzadeh, A., Kişi, O., Aghelpour, P., Biazar, S. M. & Masouleh, M. A. Comparative study of time series models, support vector machines, and GMDH in forecasting long-term evapotranspiration rates in Northern Iran. J. Irrig. Drain. Eng. 146, 04020010 (2020).
Article Google Scholar
Aghelpour, P. & Norooz-Valashedi, R. Predicting daily reference evapotranspiration rates in a humid region, comparison of seven various data-based predictor models. Stoch. Environ. Res. Risk Assess. 1–23. https://doi.org/10.1007/s00477-022-02249-4 (2022).
Article Google Scholar
Aghelpour, P., Bahrami-Pichaghchi, H. & Karimpour, F. Estimating daily rice crop evapotranspiration in limited climatic data and utilizing the soft computing algorithms MLP, RBF, GRNN, and GMDH. Complexity 2022, (2022).
Ahmadianfar, I., Shirvani-Hosseini, S., He, J., Samadi-Koucheksaraee, A. & Yaseen, Z. M. An improved adaptive neuro fuzzy inference system model using conjoined metaheuristic algorithms for electrical conductivity prediction. Sci. Rep. 12, 1–34 (2022).
Article Google Scholar
Mehdizadeh, S., Mohammadi, B. & Ahmadi, F. establishing coupled models for estimating daily dew point temperature using nature-inspired optimization algorithms. Hydrology 9, 9 (2022).
Article Google Scholar
Babanezhad, M. et al. Investigation on performance of particle swarm optimization (PSO) algorithm based fuzzy inference system (PSOFIS) in a combination of CFD modeling for prediction of fluid flow. Sci. Rep. 11, 1–14 (2021).
Article Google Scholar
Mohammadi, B., Guan, Y., Moazenzadeh, R. & Safari, M. J. S. Implementation of hybrid particle swarm optimization-differential evolution algorithms coupled with multi-layer perceptron for suspended sediment load estimation. CATENA 198, 105024 (2021).
Article Google Scholar
Aghelpour, P. & Varshavian, V. Forecasting different types of droughts simultaneously using multivariate standardized precipitation index (MSPI), MLP neural network, and imperialistic competitive algorithm (ICA). Complexity 2021, (2021).
Deo, R. C. et al. Multi-layer perceptron hybrid model integrated with the firefly optimizer algorithm for windspeed prediction of target site using a limited set of neighboring reference station data. Renew. energy 116, 309–323 (2018).
Article Google Scholar
Roy, D. K., Lal, A., Sarker, K. K., Saha, K. K. & Datta, B. Optimization algorithms as training approaches for prediction of reference evapotranspiration using adaptive neuro fuzzy inference system. Agric. Water Manag. 255, 107003 (2021).
Article Google Scholar
Tao, H. et al. Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: Regional case study in Burkina Faso. Agric. water Manag. 208, 140–151 (2018).
Article Google Scholar
Eslamian, S. S., Gohari, S. A., Zareian, M. J. & Firoozfar, A. Estimating Penman-Monteith reference evapotranspiration using artificial neural networks and genetic algorithm: A case study. Arab. J. Sci. Eng. 37, 935–944 (2012).
Article Google Scholar
Aghajanloo, M.-B., Sabziparvar, A.-A. & Hosseinzadeh Talaee, P. Artificial neural network–genetic algorithm for estimation of crop evapotranspiration in a semi-arid region of Iran. Neural Comput. Appl. 23, 1387–1393 (2013).
Article Google Scholar
Yin, Z. et al. Integrating genetic algorithm and support vector machine for modeling daily reference evapotranspiration in a semi-arid mountain area. Hydrol. Res. 48, 1177–1191 (2017).
Article Google Scholar
Gocić, M. et al. Soft computing approaches for forecasting reference evapotranspiration. Comput. Electron. Agric. 113, 164–173 (2015).
Article Google Scholar
Babatunde, O. M., Munda, J. L. & Hamam, Y. Exploring the potentials of artificial neural network trained with differential evolution for estimating global solar radiation. Energies 13, 2488 (2020).
Article CAS Google Scholar
Halabi, L. M., Mekhilef, S. & Hossain, M. Performance evaluation of hybrid adaptive neuro-fuzzy inference system models for predicting monthly global solar radiation. Appl. Energy 213, 247–261 (2018).
Article Google Scholar
Wu, L. et al. Hybrid extreme learning machine with meta-heuristic algorithms for monthly pan evaporation prediction. Comput. Electron. Agric. 168, 105115 (2020).
Article Google Scholar
Rahmati, O. et al. Hybridized neural fuzzy ensembles for dust source modeling and prediction. Atmos. Environ. 224, 117320 (2020).
Article CAS Google Scholar
Aghelpour, P., Mohammadi, B., Biazar, S. M., Kisi, O. & Sourmirinezhad, Z. A theoretical approach for forecasting different types of drought simultaneously, using entropy theory and machine-learning methods. ISPRS Int. J. Geo-Inform. 9, 701 (2020).
Article ADS Google Scholar
Üstün, İ., Üneş, F., Mert, İ. & Karakuş, C. A comparative study of estimating solar radiation using machine learning approaches: DL, SMGRT, and ANFIS. Energy Sources, Part A Recover. Util. Environ. Eff. 1–24. https://doi.org/10.1080/15567036.2020.1781301 (2020).
Article Google Scholar
Khosravi, A., Nunes, R. O., Assad, M. E. H. & Machado, L. Comparison of artificial intelligence methods in estimation of daily global solar radiation. J. Clean. Prod. 194, 342–358 (2018).
Article Google Scholar
Adnan, R. M., Malik, A., Kumar, A., Parmar, K. S. & Kisi, O. Pan evaporation modeling by three different neuro-fuzzy intelligent systems using climatic inputs. Arab. J. Geosci. 12, 1–14 (2019).
Article Google Scholar
Guven, A. & Kisi, O. Monthly pan evaporation modeling using linear genetic programming. J. Hydrol. 503, 178–185 (2013).
Article ADS Google Scholar
Aghelpour, P., Bahrami-Pichaghchi, H. & Varshavian, V. Hydrological drought forecasting using multi-scalar streamflow drought index, stochastic models and machine learning approaches, in northern Iran. Stoch. Environ. Res. Risk Assess. 35(8), 1–21 (2021).
Article Google Scholar
Aghelpour, P., Kisi, O. & Varshavian, V. Multivariate drought forecasting in short- and long-term horizons using MSPI and data-driven approaches. J. Hydrol. Eng. 26, 04021006 (2021).
Article Google Scholar
Aghelpour, P., Bahrami-Pichaghchi, H. & Kisi, O. Comparison of three different bio-inspired algorithms to improve ability of neuro fuzzy approach in prediction of agricultural drought, based on three different indexes. Comput. Electron. Agric. 170, 105279 (2020).
Article Google Scholar
Kisi, O., Gorgij, A. D., Zounemat-Kermani, M., Mahdavi-Meymand, A. & Kim, S. Drought forecasting using novel heuristic methods in a semi-arid environment. J. Hydrol. 578, 124053 (2019).
Article Google Scholar
Mohammadi, B. et al. Adaptive neuro-fuzzy inference system coupled with shuffled frog leaping algorithm for predicting river streamflow time series. Hydrol. Sci. J. 65, 1738–1751 (2020).
Article Google Scholar
Aghelpour, P. et al. Evaluating the impact of large-scale climatic indices as inputs for forecasting monthly river flow in Mazandaran Province, Iran. Pure Appl. Geophys. 179(4), 1309–1331. https://doi.org/10.1007/s00024-022-02970-9 (2022).
Article ADS Google Scholar
Mekanik, F., Imteaz, M. A. & Talei, A. Seasonal rainfall forecasting by adaptive network-based fuzzy inference system (ANFIS) using large scale climate signals. Clim. Dyn. 46, 3097–3111 (2016).
Article Google Scholar
Yaseen, Z. M. et al. Rainfall pattern forecasting using novel hybrid intelligent model based ANFIS-FFA. Water Resour. Manag. 32, 105–122 (2018).
Article Google Scholar
Maroufpoor, S., Sanikhani, H., Kisi, O., Deo, R. C. & Yaseen, Z. M. Long-term modelling of wind speeds using six different heuristic artificial intelligence approaches. Int. J. Climatol. 39, 3543–3557 (2019).
Article Google Scholar
Rahimi, J., Ebrahimpour, M. & Khalili, A. Spatial changes of extended De Martonne climatic zones affected by climate change in Iran. Theor. Appl. Climatol. 112, 409–418 (2013).
Article ADS Google Scholar
Pebesma, E. J. Simple features for R: Standardized support for spatial vector data. R J. 10, 439 (2018).
Article Google Scholar
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. ISBN 978-3-319-24277-4. (Springer-Verlag, 2016). https://ggplot2.tidynerse.org.
Gautam, R. & Sinha, A. K. Time series analysis of reference crop evapotranspiration for Bokaro District, Jharkhand, India. J. Water L. Dev. 51–56 (2016).
Salas, J. D. Applied Modeling of Hydrologic Time Series (Water Resources Publication, 1980).
Google Scholar
Mamdani, E. H. & Assilian, S. An experiment in linguistic synthesis with a fuzzy logic controller. Int. J. Man. Mach. Stud. 7, 1–13 (1975).
Article MATH Google Scholar
Takagi, T. & Sugeno, M. Fuzzy identification of systems and its applications to modeling and control. IEEE Trans. Syst. Man. Cybern. SMC-15(1), 116–132 (1985).
Article MATH Google Scholar
Haznedar, B. & Kalinli, A. Training ANFIS using genetic algorithm for dynamic systems identification. Int. J. Intell. Syst. Appl. Eng. 4, 44–47 (2016).
Article Google Scholar
Jang, J.-S. ANFIS: Adaptive-network-based fuzzy inference system. IEEE Trans. Syst. Man. Cybern. 23, 665–685 (1993).
Article Google Scholar
Omidi, J. & Mazaheri, K. Differential evolution algorithm for performance optimization of the micro plasma actuator as a microelectromechanical system. Sci. Rep. 10, 1–18 (2020).
Article Google Scholar
Storn, R. & Price, K. Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 11, 341–359 (1997).
Article MathSciNet MATH Google Scholar
Bahrami-Pichaghchi, H. & Aghelpour, P. An estimation and multi-step ahead prediction study of monthly snow cover area, based on efficient atmospheric-oceanic dynamics. Clim. Dyn. 1–23. https://doi.org/10.1007/s00382-022-06341-x (2022).
Article Google Scholar
Psilovikos, A. & Elhag, M. Forecasting of remotely sensed daily evapotranspiration data over Nile Delta region, Egypt. Water Resour. Manag. 27, 4115–4130 (2013).
Article Google Scholar
Mossad, A. & Alazba, A. A. Simulation of temporal variation for reference evapotranspiration under arid climate. Arab. J. Geosci. 9, 1–9 (2016).
Article Google Scholar
Bouznad, I.-E. et al. Trend analysis and spatiotemporal prediction of precipitation, temperature, and evapotranspiration values using the ARIMA models: case of the Algerian Highlands. Arab. J. Geosci. 13, 1–17 (2020).
Article Google Scholar
Ashrafzadeh, A., Kişi, O., Aghelpour, P., Mostafa Biazar, S. & Askarizad Masouleh, M. Closure to “comparative study of time series models, support vector machines, and gmdh in forecasting long-term evapotranspiration rates in northern Iran” by Afshin Ashrafzadeh, Ozgur Kişi, Pouya Aghelpour, Seyed Mostafa Biazar, and Mohammadreza Askarizad. J. Irrig. Drain. Eng. 147, 7021006 (2021).
Article Google Scholar

Download references

Acknowledgements

The work was supported by the Bu-Ali Sina University Deputy of Research and Technology (Grant no. 1400-1066). The authors also acknowledge the Iranian Meteorological Organization that prepared the meteorological datasets for this study.

Author information

Authors and Affiliations

Department of Water Engineering, Faculty of Agriculture, Bu-Ali Sina University, Hamedan, Iran
Pouya Aghelpour, Vahid Varshavian & Mehraneh Khodamorad Pour
Computer Science Department, University of Birmingham, Birmingham, UK
Zahra Hamedi

Authors

Pouya Aghelpour
View author publications
You can also search for this author in PubMed Google Scholar
Vahid Varshavian
View author publications
You can also search for this author in PubMed Google Scholar
Mehraneh Khodamorad Pour
View author publications
You can also search for this author in PubMed Google Scholar
Zahra Hamedi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: P.A.; Methodology: Z.H., P.A.; Software: P.A., M.K.P.; Formal analysis and investigation: P.A., V.V.; Writing—original draft preparation: P.A., Z.H.; Writing—review and editing: V.V., M.K.P.; Resources: V.V.; Supervision: P.A., Visualization: P.A.

Corresponding author

Correspondence to Mehraneh Khodamorad Pour.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Aghelpour, P., Varshavian, V., Khodamorad Pour, M. et al. Comparing three types of data-driven models for monthly evapotranspiration prediction under heterogeneous climatic conditions. Sci Rep 12, 17363 (2022). https://doi.org/10.1038/s41598-022-22272-3

Download citation

Received: 21 May 2022
Accepted: 12 October 2022
Published: 17 October 2022
DOI: https://doi.org/10.1038/s41598-022-22272-3

This article is cited by

One to twelve-month-ahead forecasting of MODIS-derived Qinghai Lake area, using neuro-fuzzy system hybridized by firefly optimization
- Pouya Aghelpour
- Hadigheh Bahrami-Pichaghchi
- Reza Norooz-Valashedi
Environmental Science and Pollution Research (2024)
Coupling ANFIS with ant colony optimization (ACO) algorithm for 1-, 2-, and 3-days ahead forecasting of daily streamflow, a case study in Poland
- Pouya Aghelpour
- Renata Graf
- Edmund Tomaszewski
Environmental Science and Pollution Research (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.