Soil temperature forecasting using a hybrid artificial neural network in Florida subtropical grazinglands agro-ecosystems

Biazar, Seyed Mostafa; Shehadeh, Hisham A.; Ghorbani, Mohammad Ali; Golmohammadi, Golmar; Saha, Amartya

doi:10.1038/s41598-023-48025-4

Download PDF

Article
Open access
Published: 17 January 2024

Soil temperature forecasting using a hybrid artificial neural network in Florida subtropical grazinglands agro-ecosystems

Seyed Mostafa Biazar¹,
Hisham A. Shehadeh²,
Mohammad Ali Ghorbani³,
Golmar Golmohammadi¹ &
…
Amartya Saha⁴

Scientific Reports volume 14, Article number: 1535 (2024) Cite this article

1012 Accesses
1 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Soil temperature is a key meteorological parameter that plays an important role in determining rates of physical, chemical and biological reactions in the soil. Ground temperature can vary substantially under different land cover types and climatic conditions. Proper prediction of soil temperature is thus essential for the accurate simulation of land surface processes. In this study, two intelligent neural models—artificial neural networks (ANNs) and Sperm Swarm Optimization (SSO) were used for estimating of soil temperatures at four depths (5, 10, 20, 50 cm) using seven-year meteorological data acquired from Archbold Biological Station in South Florida. The results of this study in subtropical grazinglands of Florida showed that the integrated artificial neural network and SSO models (MLP-SSO) were more accurate tools than the original structure of artificial neural network methods for soil temperature forecasting. In conclusion, this study recommends the hybrid MLP-SSO model as a suitable tool for soil temperature prediction at different soil depths.

Predicting soil cone index and assessing suitability for wind and solar farm development in using machine learning techniques

Article Open access 05 February 2024

A hybrid machine learning approach for estimating the water-use efficiency and yield in agriculture

Article Open access 25 April 2022

Coupling of machine learning and remote sensing for soil salinity mapping in coastal area of Bangladesh

Article Open access 10 October 2023

Introduction

Soil temperature (ST) is a critical determinant that strongly impacts many physical, chemical, and biological processes in soil. Many factors influence soil temperature, such as meteorology, topography, soil water content, soil texture and vegetation cover/type¹). Ground soil temperature can differ substantially under various current weather conditions and weather regions and land cover types. Soil temperature plays a very important role in plant growth, crop yield and agricultural processes and as such can be a more important factor than surface air temperature in agricultural production^2,3,4,5. Therefore, forecasting soil temperature could be of importance for water resources decision makers as it has implications for irrigation requirements and scheduling.

A wide variety of techniques are used to simulate soil temperature. Numerous recent investigations have delved into short to medium-length ST forecasts, focusing on two distinct approaches^6,7. The initial type emphasizes employing statistical methods, such as numerical weather forecasting techniques, which presume that future ST data series will exhibit statistical alterations akin to past occurrences^8,9. Models for extended forecasts often require substantial data, which typically might not be available^10,11,12. Conversely, the second type involves utilizing artificial intelligence (AI) models^13,14,15, Moreover, several research initiatives have characterized ST as a nonlinear physical phenomenon^16,17,18,19. In recent decades, many studies have focused on soil temperature modelling and forecasting^{20,21,22,23,24}. For instance, the spatial and temporal patterns of soil temperature were predicted based on topography, surface cover and air temperature using the empirical relationship between air and soil temperature²⁵. Reference²⁶ modelled soil temperature for a range of forest species composition, ages and management systems across southern Australia, and sensitivity analysis indicated that one of the most important inputs was air temperature. The support vector machine (SVM) approach has been applied to predict diverse parameters including but not limited to soil moisture prediction, forecasting of river water quality, pan evaporation, stream flow prediction, global solar radiation, daily dew point temperature estimation, and interior environment variables in greenhouses^{27,28,29,30,31}.

Other studies have determined that artificial neural network (ANN) models produce more accurate results compared to multivariate linear regression models in forecasting daily soil temperature^17,32. Monthly soil temperature was predicted based on various atmospheric variables using linear and nonlinear regression models and an artificial neural network; it was found that neural networks were more precise methods compared to linear and nonlinear regressions to predict soil temperature^19,33,34,35. Another study showed that the developed ANNs were a useful modelling approach for the spatiotemporal prediction of monthly soil temperature¹⁸. The application of the multilayer perceptron (MLP) and the adaptive neuro-fuzzy inference system (ANFIS) was examined to predict daily soil temperature in Illinois, and it was concluded that the MLP showed more accurate results than the ANFIS³⁶.

In a study by Ref.¹, soil temperature at multiple depths was predicted using a hybrid artificial neural network model and firefly optimizer algorithm and it was found that the hybrid MLP-FFA hybrid model produced more accurate results compared to the MLP model. Reference³⁷ proposed a hybrid optimization method, namely Hybrid Genetic Algorithm and Sperm Swarm Optimization (HGASSO). The idea of this method was to amalgamate the Genetic Algorithm (GA) operations, such as mutation, selection and crossover operations with the local search of Sperm Swarm Optimization (SSO). This method is tested on solving different well-known multi-model benchmark functions. The results of the HGASSO prove its accuracy over both standard SSO and GA, which outperformed them in the terms of quality of results and speed of convergence. In a different study, Ref.³⁸ proposed a hybrid method, namely “PSO-BP” that combines PSO variant with “back-Propagation (BP)”. The terms of solution quality and convergence speed were tested on various classical models. Depending on the experimental results, the researchers had presented that the hybrid variant is better than both BP and PSO in the aforementioned metrics. Reference³⁹ discussed a hybrid approach, namely “HPSO-DE” that combines PSO with DE. The proposed HPSO-DE was evaluated on various test bed models in the field of optimization. The experimental results proved that “HPSO-DE” was more accurate in generating a better set of solutions.

The objective of the present research is to develop artificial intelligence models which would predict soil temperatures at different depths of the soil using meteorological data in the subtropical ranchlands of South Florida.

The study carries out a comprehensive comparative analysis between the proposed machine learning models (the Classical Multi-layer perceptron and the integrated multi-layer perceptron with Sperm Swarm Optimization algorithm). In addition, different combinations of climate variables have been examined as inputs for the model including but not limited to air temperature, solar radiation, wind speed and relative humidity. For the purpose of this study, a hybrid artificial neural network model was coupled with Sperm Swarm Optimization (SSO) for modeling daily soil temperature at a depth of 5, 10, 20, and 50 cm. The results of this study suggested that the combination of the SSO and hybrid artificial neural network models was a more accurate tool than the original structure of these artificial neural network methods for soil temperature forecasting purposes. To the authors’ knowledge, this is the first attempt to utilize an integrated artificial neural network model with Sperm Swarm Optimization (SSO) as a predictor for soil temperature.

Material and methods

Data acquisition

The 4200 ha Buck Island Ranch (BIR), a division of Archbold Biological Station lies in Highlands County, Florida about 22 km southeast of Lake Placid within the headwaters of the Everglades in southcentral Florida (27° 09ʹ N, 81° 12ʹ W) (Fig. 1). It is a commercial free range cow-calf operation with improved (drained, fertilized, planted exotic grasses) and semi-native grasslands, seasonal wetlands and oak-palm forests. The climate is subtropical with average rainfall of 1360 mm and minimum and maximum temperatures of 15.9 and 29.0 °C (average of 30 years). Evapotranspiration is typically almost as high as rainfall^40,41. The BIR Soils are sandy with an organic layer horizon on top. The Soils were dominated by Alfisols and Spodosols. In this region the seasonally inundated wetland–savanna mosaic has been drained by an extensive-ditch canal network constructed in the mid-twentieth century⁴².

The weather station at BIR measures rainfall (Texas Electronics TE25 tipping bucket Raingage), solar radiation (Kipp and Zonen pyranometers and pyrgeometers for short and longwave radiation), air temperature, relative humidity (Rotronic Hygroclip2 Temperature/RH Probe) and windspeed/direction (RM Young wind monitor). Soil temperature and moisture probes (Stevens Hydraprobe II) are placed at 5, 10, 20 and 50 cm depths. Data is recorded at 15-min intervals and stored in a datalogger (CR1000, Campbell Scientific).

For over three decades, Archbold’s Agroecology Program has been established at Buck Island Ranch. At this location, researchers collaborate with ranchers to comprehend the environmental effects of free-range cow-calf ranching and enhance its ecological sustainability. In 2012, Buck Island Ranch was chosen as one of 18 locations for the US Department of Agriculture’s Long-Term Agroecosystem Research (LTAR) network. LTAR employs synchronized research across different sites and shares data with the aim of enhancing the food system of America and increasing agricultural productivity. This initiative also seeks to ameliorate environmental quality amidst challenges such as climate change. Agroecological research at BIR is also relevant to subtropical grasslands and wetlands globally, with a lot of visitors and collaborations.

Data collection and monitoring activities at Buck Island Ranch (BIR) were conducted over an extended period spanning from the 10th of December 2016 to the 27th of January 2023 (Table 1). This extensive time frame allowed us to gather a comprehensive set of data, reflecting both short-term variations and longer-term trends (Fig. 2).

Table 1 Basic statistics of meteorological variables for the period of 12/10/2016–27/01/2023.

Full size table

Methodology

Multi-layer perceptron neural networks (MLP)

The Multi-Layer perceptron (MLP), a form of the ANN model, will be adopted as the primary modeling tool to forecast soil temperature at multiple depths using a limited predictor dataset. In general, MLPs are extensively utilized for approximation, prediction, recognition and pattern classification. ANNMLP models can handle complex problems that are not linearly separable. Basically, the MLP model is a feed forward neural network with one or more layers among input and output layers⁴³. The term feed forward signifies that the data feature extraction process moves in one direction from the input to output layer. The back propagation learning algorithm is used to train MLP^{44,45,46,47,48}. Multilayer feed-forward Perceptron back propagation learning algorithm (MLP-BP), as one of the popular MLP architectures, involves input, hidden and output layers. Moreover, specific weights are linked among neurons of input and hidden layers and from neurons of hidden and output layers by suitable activation functions. Additionally, the activation functions between input and hidden layers and between hidden and output layers are sigmoid and linear functions, respectively. These activation functions limit the input data to fluctuate between 0 and 1. So, by assuming that input data = d = (Tmean, Patm, SR and M), the mathematical description is as follows:

$${d}_{j}={f}_{1}({b}_{j}+\sum_{i}^{I}{W}_{j,i}{d}_{i})$$

(1)

$${d}_{j}={f}_{2}({b}_{k}+\sum_{i}^{I}{W}_{kj}{d}_{j})$$

(2)

Where d is the array of input parameters including meteorological parameters; ${f}_{1}$ and ${f}_{2}$ are actuation functions, ${b}_{j}$ and ${b}_{k}$ are bias values of ${f}_{1}$ and ${f}_{2}$ and ${W}_{j,i}, {W}_{kj}$ are weight parameter.

In this study, backpropagation algorithm, which employs the extensively implements Levenberg–Marquardt, and the proposed SSO optimization algorithm were utilized for minimizing the error functions of MLP (Mean Squared Error (MSE) = $\frac{1}{n}\sum_{i=1}^{n}{({y}_{predicted}-{y}_{actual})}^{2}$, where n is the number of observation, ${y}_{predicted}$ predicted values and ${y}_{actual}$ observed values).

In this study we used 20 percent of data for the test period and 80% of data for training period same as^49,50.

Standard “sperm swarm optimization (SSO)”

Sperm swarm optimization (SSO) is a newly developed metaheuristic algorithm that draws inspiration from the collective behavior exhibited by a group of sperm cells during the fertilization process of an ovum⁵¹.

The algorithm utilizes a collection of potential solutions, represented as "sperm," that traverse the entire search space in order to explore and acquire the optimal solution. Simultaneously, each candidate solution evaluates the best-performing sperm discovered thus far. In other words, a sperm takes into account it’s previously identified best position (sperm best solution) as well as the overall best position of the entire swarm (global best solution).

Within the SSO algorithm, each sperm enhances its position towards the optimal solution by taking into account its current location, velocity, the distance to its best solution (xbest_i), and the distance to the global best solution obtained thus far (xgbest_i). Mathematically, in SSO, the update of the sperm's position is governed by the following equation:

$${x}_{i}\left(t+1\right)={x}_{i}\left(t\right)+{v}_{i}(t)$$

(3)

Where ${x}_{i}\left(t\right)$ represents the current position of the ith sperm in the search space at time t, ${v}_{i}\left(t\right)$ denotes the velocity of the ith sperm at time t, which governs its movement, ${x}_{i}\left(t+1\right)$ is the updated position of the ith sperm at the subsequent time t + 1.

The equation provided represents the update of the current velocity ${v}_{i}(t)$ of the ith sperm in the algorithm. The velocity is comprised of three components: the initial velocity, the personal best solution (xbest_i) of the sperm, and the global best solution (xgbest_i), as depicted in Eq. (4).

$${v}_{i}(t) =\mathrm{ Initial}\_\mathrm{Velocity }+\mathrm{ Current}\_\mathrm{Best }+\mathrm{ Global}\_\mathrm{Best}$$

(4)

Where ${v}_{i}(t)$ is the velocity of the ith sperm at time tt, which directs its movement across the search space, $\mathrm{Initial}\_\mathrm{Velocity}$ denotes the inherent or starting velocity of the ith sperm, which can be thought of as the sperm's intrinsic momentum prior to any interactions or learning, $\mathrm{Current}\_\mathrm{Best}$ represents the influence of the best position that the ith sperm has discovered up to time t. This component pulls the sperm toward the most promising areas it has personally encountered, $\mathrm{Global}\_\mathrm{Best}$ reflects the influence of the best position found by any sperm in the swarm up to time t. This component guides the sperm towards the best solutions found by the entire collective.

The initial velocity of each sperm after being ejaculated into the search space (referred to as the cervix area) is represented in the first part of Eq. (4). This velocity is influenced by the pH value and can be mathematically expressed as follows:

$$Initial\_velocity=D.{log}_{10}.(pH\_{Rand}_{1}).{v}_{i}$$

(5)

The equation involves the damping factor D, which is a random number ranging from 0 to 1. Additionally, pH_Rand₁ represents a random number within the range of 7–14, symbolizing the pH value of the visited location. The second term in Eq. (4) represents the best position achieved by the sperm thus far, influenced by both pH and temperature. This term can be expressed in the following manner:

$$Current\_Best={log}_{10}(pH\_{Rand}_{2}) .{log}_{10}(Temp\_{Rand}_{1}).{(xgbest}_{i}-{x}_{i}(t))$$

(6)

The equation continues with the term involving $pH\_{Rand}_{2}$, which is a randomly generated number between 7 and 14. Additionally, Temp_Rand1 represents another random number within the range of 35.1–38.5, signifying the temperature value of the visited location.

The final term in Eq. (4) represents the best position among all the sperm, which is the one that is closest to the target. This position is determined and evaluated using the following expression:

$$Global\_Best={log}_{10}(pH\_{Rand}_{3}).{log}_{10}.(Temp\_{Rand}_{2}).{log}_{10}.({xgbest}_{i}-{x}_{i}\left(t\right))$$

(7)

In this equation, pH_Rand₃ represents a randomly generated number ranging from 7 to 14, and Temp_Rand₂ represents another random number within the range of 35.1–38.5. By substituting Eqs. (5)–(7) into Eq. (4), the velocity of the ith sperm in iteration t can be defined as follows:

$${v}_{i}= D.{log}_{10}.(pH\_{Rand}_{1}).{v}_{i}+{log}_{10}(pH\_{Rand}_{2}) .{log}_{10}(Temp\_{Rand}_{1}).{(xgbest}_{i}-{x}_{i}(t))+{log}_{10}(pH\_{Rand}_{3}).{log}_{10}.(Temp\_{Rand}_{2}).{log}_{10}.({xgbest}_{i}-{x}_{i}\left(t\right))$$

(8)

In Eq. (8) integrates various components that represent the sperm’s movement in the search space influenced by environmental factors, namely pH and temperature.$D.{log}_{10}.(pH\_{Rand}_{1}).{v}_{i}$, This component represents the inherent momentum or initial velocity of the ith sperm, influenced by the pH of its immediate environment. The term ${log}_{10}.(pH\_{Rand}_{1})$ transforms a randomly selected pH value (ranging between 7 and 14) to introduce variability from the cervix area’s pH, with D being a damping factor that captures the natural variations in movement. Essentially, this captures the initial impetus a sperm has due to its immediate pH surroundings and ${log}_{10}(pH\_{Rand}_{2}) .{log}_{10}(Temp\_{Rand}_{1}).{(xgbest}_{i}-{x}_{i}(t))$, this term reflects the influence of both pH and temperature on the sperm’s current optimal position. It represents how these environmental factors affect the ability of the sperm to reach better positions. Then, ${log}_{10}(pH\_{Rand}_{3}).{log}_{10}.(Temp\_{Rand}_{2}).{log}_{10}.({xgbest}_{i}-{x}_{i}\left(t\right))$, This term determines the influence of pH and temperature on the global best position among all sperms, representing the overall optimal environmental conditions for the group.

In the Eq. (5) $Initial\_velocity$, describes the inherent momentum of a sperm immediately after its introduction into the cervix area (or the search space). Here, the pH value serves as an environmental factor influencing this initial movement. Specifically, the term ${log}_{10}.\left(p{H}_{{Rand}_{1}}\right)$ represents the logarithmic transformation of a random pH value between 7 and 14, modeling the variability of pH within the cervix area. Hence, the initial velocity is a function of both the randomly chosen pH value and the damping factor D, which represents natural variations. In Eq. (6) $Current\_Best$ signifies the optimal position a sperm has achieved, influenced by the interaction of pH and temperature. The multiplicative terms ${log}_{10}(pH\_{Rand}_{2})$ and ${log}_{10}(Temp\_{Rand}_{1})$ introduce variability from the randomly selected pH (between 7 and 14) and temperature values (between 35.1 and 38.5 °C), respectively. This equation captures the fact that a sperm’s performance (or ability to find better positions) is affected by both the pH and temperature of its current location. At the end in Eq. (7) $Global\_Best$ determines the superior position among all sperms, or the one nearest to the desired solution. Again, both pH and temperature values play vital roles. With terms like ${log}_{10}(pH\_{Rand}_{3})$ and ${log}_{10}.(Temp\_{Rand}_{2})$, we integrate the random effects of both pH and temperature on the collective performance of the sperm group^52,53,54.

Within the SSO algorithm, as described in Eq. (8), the velocity of the sperm is influenced by two factors: the pH value and temperature of the visited zone. Temperature plays a crucial role as it allows the sperm to have awareness of the best solution, which corresponds to the location of the egg⁵⁵.

Forecasting development

In this study, daily meteorological variables were obtained from 15-min measurements at the BIR weather station from 10th December 2016 to 27th January 2023. The selected meteorological variables are Air Temperature (°C) (T_mean), Wind Speed (m/s) (Ws), Relative Humidity (RH), Air Pressure (mb), Rainfall (mm) (R), Solar Radiation (W/m²) (S_r), Soil temperature (°C) (ST₅) at 5 Soil temperature (°C) (ST₁₀) at 10 cm Soil temperature (°C) (ST₂₀) at 20 cm depth, and Soil temperature (°C) (ST₅₀) at 50 cm depth. The statistical properties are displayed in Table 1. It is interesting to note that, the authors reviewed various papers and selected the most impactful variables from the literature, emphasizing commonly available ones^7,56,57. Soil temperature mirrors air temperature at the surface, but deeper layers are more stable and lag behind in seasonal shifts observed at the top¹. Wind speed impacts surface soil temperature through evaporation and moisture content. Its effect diminishes with depth and varies seasonally, influenced by factors like vegetation and solar radiation⁵⁸. Higher relative humidity retains soil moisture, cooling surface soil. Low humidity can warm surface soil faster. Deep layers remain largely unaffected³⁵. High air pressure can boost surface soil temperatures via clear skies and increased sun exposure, while deeper soil layers remain mostly unaffected¹. Rain cools the topsoil directly, while deeper soil layers show little immediate temperature shifts from precipitation²⁶. Solar radiation heats the soil surface directly. As depth increases, the influence of solar radiation on soil temperature diminishes⁵⁹.

To increase model accuracy and to avoid selecting irrelevant input variables, Gamma Test (GT) was applied as a popular input variables selection (IVS) method.

The GT (Gamma Test) is employed to analyze the connection between inputs and outputs within numerical datasets. This approach differs significantly from previous non-linear analysis methods. In this method, a data sample is represented by a certain format^60,61.

$$(\left({x}_{1},\dots ,{x}_{T}\right),y)$$

(9)

In this context, the input vector X is restricted to a closed bounded set C ϵ ${R}^{T}$, while the output is represented by the scalar y. For simplicity, the explanation focuses on the case of a single scalar output y. However, it is important to note that the same algorithm can be applied to scenarios where y is a vector without significant additional complexity or time overhead. The purpose of the GT is to provide an estimation of the noise variation, represented as Var(r), based on the data. The main assumption in this approach is that the system's underlying relationship follows a specific form.

$$y=f\left({x}_{1},\dots ,{x}_{T}\right)+r$$

(10)

In the given context, the variable r signifies an indeterminable component that can arise from either real noise or an insufficient functional determination within the input/output relationship. This component represents the unexplained or uncertain aspect of the system. Despite the unknown nature of the underlying function f, the GT is capable of directly estimating Var(r) using the available data. This estimation, referred to as the Gamma statistic (symbolized by γ), can be computed directly from the data with a time complexity of O (T log T). To compute γ, two specific quantities are derived through the following calculations:

$${\delta }_{t}(k)=\frac{1}{T}\sum_{i=1}^{T}\left|{x}_{N}\left[i,k\right]-{x}_{i}\right|$$

(11)

In the provided context, ${x}_{N}\left[i,k\right]$ refers to the index of the k-th nearest neighbor to xi, and |. | represents the Euclidean distance. The GT relies on the values of N [i, k], which represent the indices of the kth nearest neighbors (${x}_{N}\left[i,k\right]$) for each vector ${x}_{i}$ (with i ranging from 1 to T) typically with a value of p equal to 10. As a result, ${\delta }_{t}(k)$ represents the mean square distance to the kth nearest neighbor. The corresponding Gamma function of the output values is then determined.

$${\gamma }_{T}\left(k\right)=\frac{1}{2T}\sum_{i=1}^{T}{(yN\left[I,K\right]-{y}_{i})}^{2}$$

(12)

Using the GT, the mean-squared distances of the kth nearest neighbors (${\delta }_{t}(k)$) and the corresponding $\gamma {(p)}^{2}$ values up to a maximum value kMax are calculated. Subsequently, the regression line is computed, and the vertical intercept ($\Gamma$) is obtained as the Gamma value. Additionally, the slope A of the regression line is provided as an indication of the model's complexity (f). In theory, Γ represents the limit of $\gamma$ as the distances ($\delta$) approach zero, which corresponds to Var(r).

The GT is utilized as a preliminary step before modeling to estimate the variance of the output that cannot be explained by any smooth model based solely on the inputs, despite the unknown nature of the model itself. The GT helps capture the unexplained variability in the output. The estimation of error variance establishes a goal for the mean squared error that any smooth non-linear model should reach when applied to unseen data^33,50.

Based on Table 2, the bolded variables were applied to predict ST in the cited depths (5, 10, 20 and 50 cm).

Table 2 GT values of all variables with soil temperature values at different depths (5–50 cm).

Full size table

For ST prediction (in 5 and 10 cm) the whole meteorological variables were selected by GT (T_mean, Ws, RH, P, R and S_r). For two other depths, the ST values were estimated based on RH, R, and S_r. It is interesting to note that GT recognized Air pressure (P) variable, as input variable, in addition to what was mentioned before to predict ST₂₀. In this study, before training, the data was normalized by the approach proposed by¹. Moreover 80% and 20% of data were used for training and testing, respectively same as suggested by^5,15.

Performance evaluation

Understanding and predicting soil temperature is paramount due to its significant impact on various environmental, agricultural, and hydrological processes. Accurate models are thus vital for several practical applications, from agricultural decision-making to climate studies. Assessing model performance specifically for soil temperature prediction ensures that these models are both reliable and robust, providing stakeholders with trustworthy information for their respective uses. Several statistical indices including mean absolute percentage error (MAPE), root mean square error (RMSE), mean absolute error (MAE) and mean bias error (MBE) were used to evaluate the performance of the models (MLP and MLP-SSO), Correlation Coefficient (CC). These metrics collectively offer a comprehensive assessment of model performance, capturing both magnitude and direction of prediction errors, as well as potential biases. Their combination ensures a holistic evaluation, making certain that the model is reliable across various dimensions of accuracy^1,15.

Furthermore, MAPE, Provides a relative measure of prediction accuracy, essential for understanding deviations in percentage terms, and its implication on a broader scale. RMSE, Ensures our model's precision, highlighting even occasional large errors which could be crucial for applications demanding high accuracy. MAE, Offers an average of the model's accuracy, ensuring its consistency across predictions. MBE, Monitors potential systematic biases, preventing consistent overpredictions or underpredictions which can skew decision-making. CC, Assesses the linear relationship between predicted and observed soil temperatures, ensuring the model effectively tracks variations^48,49.

$$\mathrm{MAPE}=\frac{1}{n}\sum_{i=1}^{n}\left|\frac{{P}_{i}-{O}_{i}}{{O}_{i}}\right|\times 100$$

(9)

Where ${P}_{i}$ represents predicted values, while ${O}_{i}$ represents observed values.

The MAPE is the most common metric used to forecast error since the variable’s units are scaled to percentage units. The lower the value for MAPE the better, MAPE provides an understanding of prediction accuracy as a percentage. It measures the average absolute percent difference between observed and predicted values relative to the observed values. A reduction in MAPE indicates a higher prediction accuracy. For instance, a MAPE of 5% means that, on average, the model's predictions deviate from the actual observations by 5%. A decrease in this value means the model is becoming more precise in its predictions in percentage terms, which can be especially useful for relative comparisons and understanding the scale of prediction errors in proportion to actual values^62,63

$$\mathrm{RMSE}=\sqrt{\frac{1}{N}\sum_{i=1}^{N}{({P}_{i}-{O}_{i})}^{2}}$$

(10)

Where ${P}_{i}$ represents predicted values, while ${O}_{i}$ represents observed values.

RMSE is frequently used measures of the differences between observed and predicted values. The unit of RMSE is the same as observed/predicted unit. The lower the value for RMSE the better Measures the square root of the average squared differences between predicted and observed values. RMSE gives more weight to larger errors than MAE, making it sensitive to occasional large errors. A reduction in RMSE suggests that the model is making fewer large errors, which is especially crucial when outliers or extreme values can have significant implications^9,64.

$$\mathrm{MAE}=\frac{1}{N}\sum_{i=1}^{N}\left|({P}_{i}-{O}_{i})\right|$$

(11)

where ${P}_{i}$ represents predicted values, while ${O}_{i}$ represents observed values.

MAE in statistics is a measurement used to investigate how predictions are close to eventual outcomes. The unit of MAE is the same unit as the data being measured, Represents the average absolute differences between the observed and predicted values. A reduction in MAE indicates that the model's predictions are, on average, closer to the actual observations. For instance, a decrease in MAE by 2% means that the model's predictions are now, on average, 2% closer to the true soil temperature values, which can have tangible benefits in applications where precision matters⁶⁵.

$$\mathrm{MBE}=\frac{\sum_{i=1}^{n}({O}_{i}-{P}_{i})}{N}$$

(12)

Where ${P}_{i}$ represents predicted values, while ${O}_{i}$ represents observed values.

MBE captures the average bias in the prediction. MBE is essentially applied to estimate the average bias in the model and to decide if any stages needed to be taken to modify the model bias. Indicates the average bias in the model predictions. A positive MBE suggests the model tends to overpredict, while a negative MBE indicates underprediction. Reducing the absolute value of MBE ensures that the model is not systematically biased in its predictions^65,66.

The correlation coefficient (CC) quantifies the strength and direction of a relationship between two variables, A measure of the linear relationship between the observed and predicted soil temperatures. A coefficient value closer to 1 indicates a strong positive linear relationship, meaning that as observed temperatures increase, the model's predictions also tend to increase in a consistent manner. A high correlation suggests the model can effectively track changes in soil temperature¹².

$$\mathrm{CC}=\frac{\sum ({x}_{i}-\overline{x })({y}_{i}-\overline{y })}{\sum {({x}_{i}-\overline{x })}^{2}\sum {({y}_{i}-\overline{y })}^{2}}$$

(13)

Where ${y}_{i}$ is the predicted values and ${x}_{i}$ is the observed values. $\overline{x }$ is the mean of observed values and $\overline{y }$ is the mean of predicted values.

In addition to statistical indices (Eqs. 9–13), a graphical method of Taylor diagram is used to illustrate the degree of correspondence between the observed and predicted behavior in terms of three statistics: the Pearson correlation coefficient (In the Taylor diagram, the angular position [azimuthal angle (angle from x-axis shows correlation. Increasing angle means decreasing correlation; on x-axis, it's perfect.)] represents the correlation coefficient. The Taylor diagram is instrumental in assessing model performance as it concisely visualizes key statistical measures—correlation, standard deviation, and RMSE—in one graphic. On the diagram, the azimuthal angle depicts correlation, the radial distance from the origin shows the normalized standard deviation, and the distance from a model point to a reference point indicates RMSE. This holistic representation provides quick insights into both the magnitude and pattern of model errors, aiding in the comparative evaluation of models or model configurations against reference data⁶⁷. Models that reproduce the spatial pattern of the reference data will lie closer to the horizontal rightmost axis, indicating higher correlation), the root-mean-square error (RMSE) (In the diagram, the distance from a model point to the reference point (usually set at (1,0) for normalized plots) represents the RMSE. Points closer to the reference point have smaller RMSE values, denoting better agreement with observations), and the normalized standard deviation [the radial distance (distance from center shows model's deviation. Perfect match is on radius 1. Inside: underestimation, outside: overestimation) from the origin in the Taylor diagram represents the normalized standard deviation. A value equal to the reference standard deviation implies that the model has accurately captured the observed variability, while values above or below indicate overestimation or underestimation, respectively] in a single diagram. The Taylor diagram is a graphical tool developed by Karl Taylor in the late twentieth century, designed to provide a comprehensive visual summary of how closely a model’s pattern matches observations. Instead of multiple plots to compare various metrics, the Taylor diagram condenses this information into a single plot, making the assessment of multiple models more straightforward. Taylor diagram can be displayed as a series of points on a polar plot. The azimuth angle implies the Pearson Correlation ® value between the estimated and observed data. The radial distance from the origin, meanwhile, signifies the ratio of the normalized standard deviation (SD) of the simulation to that of the observation. The centered RMSE in the simulated field is proportional to the distance from the point on the x-axis^58,67.

Results and discussion

This section delves into a comprehensive comparison between the hybrid MLP-SSO model and its classical counterpart, the MLP model, in predicting soil temperatures at varying depths. Through statistical indicators and graphical presentations, we aim to highlight their respective efficiencies.

The performance of both hybrid MLP-SSO model and classical MLP model are presented using the statistical indices and visual assessment of predicted and observed soil temperature data at different depths. Table 3 presents the comparison of the performances of the MLP and MLP-SSO for the model development (training) and model validation (Testing) datasets. Both models were evaluated at the depths of 5, 10, and 20, 50 cm depths with statistical criteria (RMSE, MAE, MAPE, and MBE).

Table 3 Performance criteria of the MLP-SSO and MLP models for training and testing stages at the Buck Island Ranch station.

Full size table

The statistical analysis results which are presented in Table 3 show that during the testing period, the RMSE values for different depths for MLP model are estimated in the range of 1.01–1.417 ($^\circ{\rm C}$), while the RMSE values for the MLP-SSO model was figured out to be between 0.973 and 1.367 ($^\circ{\rm C}$).

At the 5 cm depth, the MLP-SSO results outperformed the classical MLP model over the testing period. The MLP-SSO model assigned an RMSE of 1.332 (°C), MAE of 0.993 (°C), MAPE of 2.364% and MBE of − 0.084 (°C). In other words, the integration of SSO algorithm with MLP model led to reduction in the RMSE and MAE, 1.10% and 2.35% respectively. Based on this metrics, the new model improved the accuracy. Araghi et al. 2017 had same results.

At the depth of 10 cm the MLP-SSO model produced superior results in comparison with MLP. The results showed the RMSE of 1.367 (°C), MAE of 1.035 (°C), MAPE % of 2.502 and MBE of − 0.142 (°C) for MLP-SSO. Moreover, the MLP-SSO generated lower values of the RMSE and MAE rather than the classical MLP model. The MLP-SSO model reduced them by 10.1% and 2.9%, respectively, same as⁶⁸ outputs.

Figure 3 displays a comparative time series of projected versus actual soil temperature (ST) values at varying depths, as determined by both the MLP and MLP-SSO models. This illustration provides a straightforward visual juxtaposition of predictions from the two models in relation to the true data over time.

The same trend can be seen for at the 20 cm depth. At this depth, the results of MLP-SSO is more precise than the classical MLP model. The MLP-SSO yielded an RMSE value of 0.973 (°C), MAE of 0.758 (°C), MAPE of 1.840% and MBE of -0.028 (°C). Based on aforementioned values, the RMSE and MAE were reduced by about 2.31% and 4.14%, respectively. Samadianfard et al. 2018, reached out to the same results.

However, dissimilar, the trend found for the other depth. The MLP-SSO model showed different results in modeling the ST at 50 cm depth. The MLP-SSO model reduced RMSE from 1.233 in the classical MLP model to 1.232 in the hybrid MLP-SSO model, but it can be seen an increasing trend in the other criteria.

Based on the statistical analysis conducted in this study, it can be concluded that the SSO-based model had a remarkable effect in reducing the predicting errors for ST at the 5, 10 and 20 cm depths below the soil surface. While the MLP-SSO model was unable to enhance the accuracy of ST at 50 cm depth. This conclusion concurs with¹. Even¹⁷ results proved that when depth increased the model accuracy decreased.

Figure 3 presents a time series comparison of predicted and observed soil temperature (ST) values at different depths using both MLP and MLP-SSO models. This Figure offers a direct visual comparison of the two model predictions over time against actual observations. Figure 4 provides scatterplots to visually contrast the predicted ST values from the models against observed ST values. The scatterplots highlight the accuracy and fit of each model, with the tighter clustering of points indicating a better model fit. This graphically showcases the superiority of MLP-SSO over the traditional MLP models, especially when assessing the performance across various soil depths. Figure 5 displays the Taylor diagram, which is instrumental in evaluating the performance of the two models at multiple depths. The diagram employs reference points to indicate centered RMSE differences, and the distance from these points signifies model accuracy. Models closer to the reference point with a correlation coefficient of 1, possessing a similar range of variations as the observations, are considered superior. In this diagram, it's evident that the MLP-SSO model, represented by circles, consistently outperforms the classical MLP model, denoted by squares, across all soil depths in terms of prediction accuracy.

It can be seen in Fig. 3 the time series of predicted and observed ST values at different depths with the MLP and MLP-SSO are demonstrated. In the Fig. 4 the Scatterplots of predicted and observed ST values are illustrated. The superiority of MLP-SSO over the MLP models is proved with these graphs. Comparing of the model predicted for different depths, as displayed in Fig. 4. Obviously illustrates that MLP-SSO model able to estimate the soil temperature values better than classical MLP models.

It can be seen in Fig. 5 the Taylor diagram for both models utilized at multiple depths. In this diagram, the distance from, reference point (i.e. a hollow point) is an amount of the centered RMSE difference. Accordingly, a premier model is normally demonstrated by the reference point with a correlation coefficient of 1 with nearly the same domain of variations compared with the observations. It is obvious from Fig. 5 that the MLP-SSO (i.e., Circle) was able to obtain high accuracy predicts of soil temperature rather than the classical MLP model (i.e., Square) applied at all soil depths.

The enhanced accuracy in soil temperature prediction achieved through our MLP-SSO model can substantially benefit agricultural practices, especially in precision farming where optimal planting and irrigation schedules are determined by soil temperature data. Such accurate predictions can also optimize water resource management and provide invaluable insights for climate change research, especially in modeling carbon and nitrogen cycling in ecosystems, thus refining greenhouse gas emission forecasts from soils. Nevertheless, while our model excelled in Florida's subtropical grazinglands, its performance might differ in areas with distinct climate or soil characteristics. Additionally, its effectiveness is tied to the quality and consistency of the input meteorological data. Thus, while promising, users should account for local conditions and ensure robust input data when leveraging the MLP-SSO model for practical applications.

In our exploration of predicting soil temperatures across various depths, the study underscored the pivotal role of soil temperature as a determining factor for numerous soil-based reactions. Evaluating the predictive accuracy of two intelligent neural models in the subtropical grazinglands of Florida, it was evident that the combined prowess of artificial neural networks (ANNs) and Sperm Swarm Optimization (SSO) resulted in the MLP-SSO model. This hybrid model notably surpassed the traditional artificial neural network methods in forecasting soil temperature, offering a significant improvement in predictive capability. Utilizing a comprehensive seven-year meteorological dataset from Archbold Biological Station, the performance metrics clearly showcased the MLP-SSO model's superior precision. In essence, for those looking to predict soil temperature across various depths, especially in regions with similar environmental dynamics to South Florida, the hybrid MLP-SSO model emerges as a highly recommended tool, outclassing the classical MLP models in accuracy and reliability.

Future research endeavors could delve deeper into refining and expanding the MLP-SSO model by integrating it with other optimization techniques or newer neural network architectures. This could further enhance its prediction accuracy for soil temperatures across diverse geographical landscapes and climatic conditions. Additionally, the influence of different land cover types on soil temperature prediction warrants comprehensive investigation. While the current model has been tested extensively with data from subtropical grazinglands of Florida, its adaptability and efficiency in other climatic zones remain an area worth exploring. Another promising avenue would be the inclusion of more environmental variables into the model, potentially offering a holistic understanding of their collective impact on soil temperature variations. Lastly, assessing the real-time applicability of the MLP-SSO model in agricultural, ecological, or urban planning scenarios could provide actionable insights for stakeholders and drive innovations in the field of soil temperature prediction.

Conclusion

Accurate soil temperature predictions are pivotal for strategic decision-making in agriculture and water resource management. Our study introduced and validated a novel hybrid model, MLP-SSO, for forecasting soil temperatures at varied depths at Buck Island Ranch, South Florida. When compared with the conventional MLP model, the MLP-SSO demonstrated superior predictive accuracy and efficiency, especially when calibrated using readily accessible meteorological variables from 2016 to 2023.

The significant edge of the MLP-SSO underscores its potential as a premier tool in anticipating irrigation needs, especially with agriculture's burgeoning water demands. Beyond irrigation, the model holds promise for applications in understanding forest/grassland productivity dynamics and aiding fire management strategies. Future iterations of the MLP-SSO could explore incorporating additional climatic or soil health/type variables to enhance prediction finesse. This research holds tangible value for stakeholders, from water resource managers and farmers to environmental researchers, enabling them to harness data-driven insights for optimal resource management and sustainable agricultural practices. In the realm of policy, the precision of the MLP-SSO model could inform frameworks focused on sustainable water utilization and soil health management in agriculture. In essence, the MLP-SSO model emerges not just as an academic advancement but as a keystone for future-ready, sustainable agriculture.

In conclusion, the results of current study recommend that the hybrid MLP-SSO model could be a suitable tool for soil temperature prediction at different soil depths. Being calibrated with easily available weather data, this tool can be utilized to forecast and anticipate irrigation demand by water resource managers, given the large and increasing demand of water from agriculture amidst scenarios of decreasing water availability. Studies exploring the connection of soil temperature with forest/grassland productivity, fire management and land use change can also benefit from this tool. Soil temperature forecasting over wide areas with sparse meteorological stations can also inform evapotranspiration (ET) forecasts, given the direct link between soil temperature and ET. Given the magnitude of ET in subtropical and tropical watershed water balances, the relartion with land use change and the current uncertainty in estimating ET⁶⁹, this tool can constrain this uncertainty to some extent, and thereby improve watershed water balance computations.

Data availability

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Samadianfard, S., Ghorbani, M. A. & Mohammadi, B. Forecasting soil temperature at multiple-depth with a hybrid artificial neural network model coupled-hybrid firefly optimizer algorithm. Inf. Proc. Agric. 5(4), 465–476 (2018).
Google Scholar
Hillel. Environmental Soil Physics (Academic Press, London 771, 1998).
Pessarakli, M., & Szabolcs, I. Soil salinity and sodicity as particular plant/crop stress factors. In Pessarakli M. (Ed.) Handbook of Plant and Crop Stress (2002).
Yildirim, A. N. et al. Physiological and biochemical responses of almond rootstocks to drought stress. Turk. J. Agric. For. 45(4), 522–532 (2021).
Article CAS Google Scholar
Biazar, S. M. & Ferdosi, F. B. An investigation on spatial and temporal trends in frost indices in Northern Iran. Theoret. Appl. Climatol. 141(3–4), 907–920 (2020).
Article ADS Google Scholar
Sanikhani, H., Deo, R. C., Yaseen, Z. M., Eray, O. & Kisi, O. Non-tuned data intelligent model for soil temperature estimation: A new approach. Geoderma 330, 52–64 (2018).
Article ADS Google Scholar
Feng, Y., Cui, N., Hao, W., Gao, L. & Gong, D. Estimation of soil temperature from meteorological data using different machine learning models. Geoderma 338, 67–77 (2019).
Article ADS Google Scholar
Chatfield, C. Time-series forecasting. Significance 2(3), 131–133 (2005).
Article MathSciNet Google Scholar
Biazar, S. M., Fard, A. F., Singh, V. P., Dinpashoh, Y. & Majnooni-Heris, A. Estimation of evaporation from saline water. Environ. Monit. Assess. 192, 1–17 (2020).
Article Google Scholar
Aljoumani, B., Sànchez-Espigares, J. A., Canameras, N., Josa, R. & Monserrat, J. Time series outlier and intervention analysis: Irrigation management influences on soil water content in silty loam soil. Agric. Water Manag. 111, 105–114 (2012).
Article Google Scholar
Kumar, M., Kumar, A., Mahanti, N. C., Mallik, C. & Shukla, R. K. Surface flux modelling using ARIMA technique in humid subtropical monsoon area. J. Atmos. Solar Terr. Phys. 71(12), 1293–1298 (2009).
Article ADS Google Scholar
Biazar, S. M., Fard, A. F., Singh, V. P., Dinpashoh, Y. & Majnooni-Heris, A. Estimation of evaporation from saline-water with more efficient input variables. Pure Appl. Geophys. 177, 5599–5619 (2020).
Article ADS Google Scholar
Yaseen, Z. M., Sulaiman, S. O., Deo, R. C. & Chau, K. W. An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction. J. Hydrol. 569, 387–408 (2019).
Article ADS Google Scholar
Yaseen, Z. M. et al. Prediction of evaporation in arid and semi-arid regions: A comparative study using different machine learning models. Eng. Appl. Comput. Fluid Mech. 14(1), 70–89 (2020).
Google Scholar
Ashrafzadeh, A., Malik, A., Jothiprakash, V., Ghorbani, M. A. & Biazar, S. M. Estimation of daily pan evaporation using neural networks and meta-heuristic approaches. ISH J. Hydraul. Eng. 26(4), 421–429 (2020).
Article Google Scholar
Zeynoddin, M. et al. A reliable linear stochastic daily soil temperature forecast model. Soil Tillage Res. 189, 73–87 (2019).
Article Google Scholar
Tabari, H., Hosseinzadeh Talaee, P. & Willems, P. Short-term forecasting of soil temperature using artificial neural network. Meteorol. Appl. 22(3), 576–585 (2015).
Article ADS Google Scholar
Wu, W. et al. Spatiotemporal modeling of monthly soil temperature using artificial neural networks. Theoret. Appl. Climatol. 113, 481–494 (2013).
Article ADS Google Scholar
Biazar, S. M., Rahmani, V., Isazadeh, M., Kisi, O. & Dinpashoh, Y. New input selection procedure for machine learning methods in estimating daily global solar radiation. Arab. J. Geosci. 13, 1–17 (2020).
Article Google Scholar
Araghi, A., Mousavi-Baygi, M., Adamowski, J., Martinez, C. & van der Ploeg, M. Forecasting soil temperature based on surface air temperature using a wavelet artificial neural network. Meteorol. Appl. 24(4), 603–611 (2017).
Article ADS Google Scholar
Ghorbani, M. A. et al. Application of firefly algorithm-based support vector machines for prediction of field capacity and permanent wilting point. Soil Tillage Res. 172, 32–38 (2017).
Article Google Scholar
Raheli, B., Aalami, M. T., El-Shafie, M., Ghorbani, M. A. & Deo, R. C. Uncertainty assessment of the multilayer perceptron (MLP) neural network model with implementation of the novel hybrid MLP-FFA method for prediction of biochemical oxygen demand and dissolved oxygen: A case study of Langat River. Environ. Earth Sci. 76, 503 (2017).
Article ADS Google Scholar
Zare Abyaneh, H., Bayat Varkeshi, M. & Golmohammadi, G. Soil temperature estimation using an artificial neural network and co-active neuro-fuzzy inference system in two different climates. Arab. J. Geosci. 9(5), 1–10 (2016).
Article Google Scholar
Ashrafzadeh, A., Kişi, O., Aghelpour, P., Biazar, S. M. & Masouleh, M. A. Comparative study of time series models, support vector machines, and GMDH in forecasting long-term evapotranspiration rates in northern Iran. J. Irrig. Drain. Eng. 146(6), 04020010 (2020).
Article Google Scholar
Kang, S., Kim, S., Oh, S. & Lee, D. Predicting spatial and temporal patterns of soil temperature based on topography, surface cover and air temperature. Forest Ecol. Manage 136, 173–218 (2000).
Article Google Scholar
Paul, K. I. et al. Soil temperature under forests: A simple model for predicting soil temperature under a range of forest types. Agric. For. Meteorol. 121, 167–182 (2004).
Article ADS Google Scholar
Biazar, S. M., Dinpashoh, Y. & Singh, V. P. Sensitivity analysis of the reference crop evapotranspiration in a humid region. Environ. Sci. Pollut. Res. 26, 32517–32544 (2019).
Article Google Scholar
Isazadeh, M., Biazar, S. M. & Ashrafzadeh, A. Support vector machines and feed-forward neural networks for spatial modeling of groundwater qualitative parameters. Environ. Earth Sci. 76, 1–14 (2017).
Article Google Scholar
Khatibi, R., Ghorbani, M. A. & Akhoni, P. F. Stream flow predictions using nature-inspired Firefly Algorithms and a Multiple Model strategy—Directions of innovation towards next generation practices. Adv. Eng. Inform. 34, 80–89 (2017).
Article Google Scholar
Shamshirband, S. et al. A hybrid SVM-FFA method for prediction of monthly mean global solar radiation. Theor. Appl. Climatol. 125, 53–65 (2016).
Article ADS Google Scholar
Taki, M., Mehdizadeh, S. A., Rohani, A., Rahnama, M. & Rahmati- Joneidabad, M. Applied machine learning in greenhouse simulation, new application and analysis. Info. Proc. Agric. 5(2), 253–268 (2018).
Google Scholar
Tabari, H., Sabziparvar, A. A. & Ahmadi, M. Comparison of artificial neural network and multivariate linear regression methods for estimation of daily soil temperature in an arid region. Meteorol. Atmos. Phys. 110, 135–142 (2011).
Article ADS Google Scholar
Jahangir, M. S., Biazar, S. M., Hah, D., Quilty, J. & Isazadeh, M. Investigating the impact of input variable selection on daily solar radiation prediction accuracy using data-driven models: A case study in northern Iran. Stoch. Environ. Res. Risk Assess. 36(1), 225–249 (2022).
Article Google Scholar
Alaboz, P., Dengiz, O. & Demir, S. Barley yield estimation performed by ANN integrated with the soil quality index modified by biogas waste application. Zemdirbyste-Agric. 108(3), 1 (2021).
Google Scholar
Bilgili, M. Prediction of soil temperature using regression and artificial neural network models. Meteorol. Atmos. Phys. 110, 59–70 (2010).
Article ADS Google Scholar
Kim, S. & Singh, V. P. Modeling daily soil temperature using data-driven models and spatial distribution. Theor. Appl. Climatol. 118, 465–479 (2014).
Article ADS Google Scholar
Shehadeh, H. A., Mustafa, H. M., & M. Tubishat. A Hybrid Genetic Algorithm and Sperm Swarm Optimization (HGASSO) for Multimodal Functions. International (2022).
Zhang, J. R., Zhang, J., Lok, T. M. & Lyu, M. R. A hybrid particle swarm optimization–back-propagation algorithm for feedforward neural network training. Appl. Math. Comput. 185, 1026–1037 (2007).
Google Scholar
Yu, X., Cao, J., Shan, H., Zhu, L. & Guo, J. An adaptive hybrid algorithm based on particle swarm optimization and differential evolution for global optimization. Sci. World J. 2014, 1–16 (2014).
Google Scholar
Saha, A. K. et al. A hydrological budget (2002–2008) for a large subtropical wetland ecosystem indicates marine groundwater discharge accompanies diminished freshwater flow. Estuaries Coasts 35, 459–474 (2012).
Article CAS Google Scholar
Baffaut, C. et al. Comparative analysis of water budgets across the US long-term agroecosystem research network. J. Hydrol. 588, 125021 (2020).
Article Google Scholar
Kleinman, P. J. A. et al. Advancing the sustainability of US agriculture through long-term research. J. Environ. Qual. 47(6), 1412–1425 (2018).
Article CAS PubMed Google Scholar
McClelland, J. & Rumelhart, D. Explorations in parallel distributed processing (MTT Press, 1988).
Google Scholar
Khaledian, M. R., Isazadeh, M., Biazar, S. M. & Pham, Q. B. Simulating Caspian Sea surface water level by artificial neural network and support vector machine models. Acta Geophys. 68, 553–563 (2020).
Article ADS Google Scholar
Deo, R. C. & Sahin, M. Application of the Artificial Neural Network model for prediction of monthly Standardized Precipitation and Evapotranspiration Index using hydrometeorological parameters and climate indices in eastern Australia. Atmos. Res. 161–162, 65–81 (2015).
Article Google Scholar
Deo, R. C., Tiwari, M. K., Adamowski, J. F. & Quilty, M. J. Forecasting effective drought index using a wavelet extreme learning machine (W-ELM) model. Stoch. Environ. Res. Risk Assess. 31(5), 1211–1240 (2017).
Article Google Scholar
Deo, R. C. et al. Multi-layer perceptron hybrid model integrated with the firefly optimizer algorithm for windspeed prediction of target site using a limited set of neighboring reference station data. Renew. Energy 116, 309–323 (2018).
Article Google Scholar
Ashrafzadeh, A., Ghorbani, M. A., Biazar, S. M. & Yaseen, Z. M. Evaporation process modelling over northern Iran: Application of an integrative data-intelligence model with the krill herd optimization algorithm. Hydrol. Sci. J. 64(15), 1843–1856 (2019).
Article Google Scholar
Aghelpour, P., Mohammadi, B. & Biazar, S. M. Long-term monthly average temperature forecasting in some climate types of Iran, using the models SARIMA, SVR, and SVR-FA. Theor. Appl. Climatol. 138(3–4), 1471–1480 (2019).
Article ADS Google Scholar
Aghelpour, P., Mohammadi, B., Biazar, S. M., Kisi, O. & Sourmirinezhad, Z. A theoretical approach for forecasting different types of drought simultaneously, using entropy theory and machine-learning methods. ISPRS Int. J. Geo-Inf. 9(12), 701 (2020).
Article Google Scholar
Shehadeh, H. A., Idna Idris, M. Y., Ahmedy, I., Ramli, R. & Mohamed Noor, N. The multi-objective optimization algorithm based on sperm fertilization procedure (MOSFP) method for solving wireless sensor networks optimization problems in smart grid applications. Energies 11(1), 97 (2018).
Article Google Scholar
Shehadeh, H. A., Idna Idris, M. Y. & Ahmedy, I. Multi-objective optimization algorithm based on sperm fertilization procedure (MOSFP). Symmetry 9(10), 241 (2017).
Article ADS Google Scholar
Shehadeh, H. A., Ahmedy, I., & Idris, M. Y. I. Empirical study of sperm swarm optimization algorithm. In Intelligent Systems and Applications: Proceedings of the 2018 Intelligent Systems Conference (IntelliSys) Volume 2 (pp. 1082–1104) (Springer International Publishing, 2019).
Shehadeh, H. A. A hybrid sperm swarm optimization and gravitational search algorithm (HSSOGSA) for global optimization. Neural Comput. Appl. 33(18), 11739–11752 (2021).
Article Google Scholar
Khajehzadeh, M. Earth slope stability evaluation subjected to earthquake loading using chaotic sperm swarm optimization. Arab. J. Geosci. 15(15), 1338 (2022).
Article Google Scholar
Ebtehaj, I., Bonakdari, H., Samui, P. & Gharabaghi, B. Multi-depth daily soil temperature modeling: Meteorological variables or time series?. Theor. Appl. Climatol. 151(3–4), 989–1012 (2023).
Article ADS Google Scholar
Li, Q. et al. An attention-aware LSTM model for soil moisture and soil temperature prediction. Geoderma 409, 115651 (2022).
Article ADS Google Scholar
Naganna, S. R. et al. Dew point temperature estimation: Application of artificial intelligence model integrated with nature-inspired optimization algorithms. Water 11(4), 742 (2019).
Article Google Scholar
Davies, A. & Thomas, H. Rates of leaf and tiller production in young spaced perennial ryegrass plants in relation to soil temperature and solar radiation. Ann. Bot. 51(5), 591–597 (1983).
Article Google Scholar
Isazadeh, M., Biazar, S., Ashrafzadeh, A. & Khanjani, R. Estimation of aquifer qualitative parameters in Guilans plain using gamma test and support vector machine and artificial neural network models. J. Environ. Sci. Technol. 21(2), 1–21 (2019).
Google Scholar
Biazar, S. M., Ghorbani, M. A., & Shahedi, K. Uncertainty of artificial neural networks for daily evaporation prediction (case study: Rasht and Manjil Stations) (2019).
Goodwin, P. & Lawton, R. On the asymmetry of the symmetric MAPE. Int. J. Forecast. 15(4), 405–408 (1999).
Article Google Scholar
Tayman, J. & Swanson, D. A. On the validity of MAPE as a measure of population forecast accuracy. Popul. Res. Policy Rev. 18, 299–322 (1999).
Article Google Scholar
Gholami, H., Lotfirad, M., Ashrafi, S. M., Biazar, S. M. & Singh, V. P. Multi-GCM ensemble model for reduction of uncertainty in runoff projections. Stoch. Environ. Res. Risk Assess. 1, 1–12 (2022).
Google Scholar
Willmott, C. J. & Matsuura, K. Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim. Res. 30(1), 79–82 (2005).
Article Google Scholar
Kim, S. T., Jeong, H. I. & Jin, F. F. Mean bias in seasonal forecast model and ENSO prediction error. Sci. Rep. 7(1), 6029 (2017).
Article ADS PubMed PubMed Central Google Scholar
Taylor, K. E. Summarizing multiple aspects of model performance in a single diagram. J. Geophys. Res. Atmos. 106(D7), 7183–7192 (2001).
Article ADS Google Scholar
Shamshirband, S. et al. Comparative analysis of hybrid models of firefly optimization algorithm with support vector machines and multilayer perceptron for predicting soil temperature at different depths. Eng. Appl. Comput. Fluid Mech. 14(1), 939–953 (2020).
Google Scholar
Saha, A. K. et al. Evapotranspiration in a subtropical wetland savanna using low-cost Lysimeter, Eddy Covariance and Modeling approaches. Ecohydrology 15(8), e2475 (2022).
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Soil, Water and Ecosystem Sciences, University of Florida, IFAS/RCREC, Ona, FL, USA
Seyed Mostafa Biazar & Golmar Golmohammadi
Department of Artificial Intelligence and Computer Science, College of Computer Science and Informatics, Amman Arab University, Amman, Jordan
Hisham A. Shehadeh
Department of Water Engineering, University of Tabriz, Tabriz, Iran
Mohammad Ali Ghorbani
Archbold Biological Station, Buck Island Ranch, Lake Placid, FL, 33852, USA
Amartya Saha

Authors

Seyed Mostafa Biazar
View author publications
You can also search for this author in PubMed Google Scholar
Hisham A. Shehadeh
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Ali Ghorbani
View author publications
You can also search for this author in PubMed Google Scholar
Golmar Golmohammadi
View author publications
You can also search for this author in PubMed Google Scholar
Amartya Saha
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.M.B. and H.S., drafted the methodology, interpreted the results. M.A.G.H., and G.G. reviewed the methodology, and the results. M.A.G.H., G.G., and A.S. reviewed the paper with necessary corrections.

Corresponding author

Correspondence to Golmar Golmohammadi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Biazar, S.M., Shehadeh, H.A., Ghorbani, M.A. et al. Soil temperature forecasting using a hybrid artificial neural network in Florida subtropical grazinglands agro-ecosystems. Sci Rep 14, 1535 (2024). https://doi.org/10.1038/s41598-023-48025-4

Download citation

Received: 31 May 2023
Accepted: 21 November 2023
Published: 17 January 2024
DOI: https://doi.org/10.1038/s41598-023-48025-4

This article is cited by

Revolutionizing core muscle analysis in female sexual dysfunction based on machine learning
- Doaa A. Abdel Hady
- Tarek Abd El-Hafeez
Scientific Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.