Developing machine learning algorithms for meteorological temperature and humidity forecasting at Terengganu state in Malaysia

Hanoon, Marwah Sattar; Ahmed, Ali Najah; Zaini, Nur’atiah; Razzaq, Arif; Kumar, Pavitra; Sherif, Mohsen; Sefelnasr, Ahmed; El-Shafie, Ahmed

doi:10.1038/s41598-021-96872-w

Download PDF

Article
Open access
Published: 23 September 2021

Developing machine learning algorithms for meteorological temperature and humidity forecasting at Terengganu state in Malaysia

Marwah Sattar Hanoon^1,2,
Ali Najah Ahmed³,
Nur’atiah Zaini⁴,
Arif Razzaq⁵,
Pavitra Kumar⁶,
Mohsen Sherif^7,8,
Ahmed Sefelnasr⁷ &
…
Ahmed El-Shafie^6,7

Scientific Reports volume 11, Article number: 18935 (2021) Cite this article

16k Accesses
64 Citations
7 Altmetric
Metrics details

Subjects

Abstract

Accurately predicting meteorological parameters such as air temperature and humidity plays a crucial role in air quality management. This study proposes different machine learning algorithms: Gradient Boosting Tree (G.B.T.), Random forest (R.F.), Linear regression (LR) and different artificial neural network (ANN) architectures (multi-layered perceptron, radial basis function) for prediction of such as air temperature (T) and relative humidity (Rh). Daily data over 24 years for Kula Terengganu station were obtained from the Malaysia Meteorological Department. Results showed that MLP-NN performs well among the others in predicting daily T and Rh with R of 0.7132 and 0.633, respectively. However, in monthly prediction T also MLP-NN model provided closer standards deviation to actual value and can be used to predict monthly T with R 0.8462. Whereas in prediction monthly Rh, the RBF-NN model's efficiency was higher than other models with R of 0.7113. To validate the performance of the trained both artificial neural network (ANN) architectures MLP-NN and RBF-NN, both were applied to an unseen data set from observation data in the region. The results indicated that on either architecture of ANN, there is good potential to predict daily and monthly T and Rh values with an acceptable range of accuracy.

Identification of influential weather parameters and seasonal drought prediction in Bangladesh using machine learning algorithm

Article Open access 04 January 2024

Forecasting Air Quality in Taiwan by Using Machine Learning

Article Open access 05 March 2020

Machine learning models for daily net radiation prediction across different climatic zones of China

Article Open access 03 September 2024

Introduction

Air temperature (T) and relative humidity (Rh) are important in microclimate and environmental health research. It plays an essential role in several fields such as weather control, climate influence assessment of agricultural and water systems management. With the current global climate change, there is a need to develop a reliable model capable of accurately capturing the temperature and humidity changes. Many researchers have been developed models for predicting the meteorological time series based on statistical processes^1,2. The clear problem is that these processors could not cope with specific non-stationary signals (time-series) and with signals whose mathematical model isn’t linear³. The difficulty is ascribed to the perspicuous environmental stochastic variables (meteorological processes such as temperature and relative humidity parameters) and the fact that future returns cannot be forecasted with acceptable precision when modelling such settings with high uncertainty.

Several forecasting models based on the univariate auto-regressive moving average representative of temperature and relative humidity was developed⁴. However, these models have a tendency to overestimate low temperature and relative humidity values while underestimating high values, which could lead to poor water resources planning and management. This is due to the fact that the meteorological processes are essential in estimating numerous hydrological parameters and irrigation processes including evaporation, evapotranspiration, infiltration, crop water requirement, and soil moisture, which are important for water resources managers and planners^5,6. As a result, it is critical to creating a forecasting model that is both reliable and devoid of these shortcomings.

Temperature and relative humidity are thought to be extremely spatially distributed, time-varying, stochastic, nonlinear, and difficult to modelled utilizing simple models⁷. While conceptual models are important for understanding meteorological processes, these models experienced many practical circumstances especially when accuracy is the ultimate focus⁸. Instead of building a conceptual model, it could be superior to explore and employ different modelling approach such as data-driven model. Models based on differential equations are employed in the data-driven approach to detect the optimal inputs-outputs mapping deprived of a comprehensive examination of the fundamental configuration of phenomena process. In many areas, data-driven models have been shown to offer accurate predictions^9,10,11. However, because most of these models do not attempt to reflect the nonlinear dynamics that are inherent in meteorological phenomena, they may not always perform well and achieve an acceptable level of forecasting accuracy as expected.

Machine learning modelling approaches have been proposed as an alternate modelling method for nonlinear and dynamic systems due to the fact that the machine learning approaches include effective structure and parameter estimation methodologies^12,13. Since the characteristics of meteorological air temperature and relative humidity, it complex and nonlinear, machine learning (ML) models are powerful when implemented to problems whose resolutions required knowledge that is hard to specify. Unlike the conventional methods to time series analyses and predicting, ML models require a decreased quantity of information to predict the future time series. Based on the available time series, utilizing a suitable tuning algorithm, the internal network parameters are tuned. If necessary, this could also be including the adjustment of the primarily selected network structure to better matching the structure needs via the problem at hand¹⁴. Therefore, Machine learning approaches could be considered as an effective and efficient technique to model meteorological processes, based on their effectiveness in modelling dynamic systems in a variety of applications of science and engineering. The success of this strategy in cases when explicit knowledge of the internal meteorological phenomena is not available is its main advantage. In circumstances when modelling of the entire and/or part of the internal parameter of the meteorological phenomena is not available, machine learning modelling methodologies clearly give a realistic and effective approach for constructing input–output forecasting models. Despite the fact that those models have proven to be effective, it is still unknown which of these machine learning modelling approaches would be the best choice for certain system processes like meteorological processes. As a result, several machine learning modelling approaches must be examined in order to evaluate compare their performance, and hence, select the optimal one.

Recently, in meteorological time series Gradient Boosting tree was introduced as an easily adaptable machine learning approach in various aspects, such as filling gaps produced via missing or incorrect data¹⁵. G.B.T. has applied trend prediction of the asphalt pavement temperature accurately and has great ability in exploring the relationship between the temperature of asphalt pavements and meteorological factors¹⁶. In predicting and analyzing net ecosystem carbon exchange¹⁷ and modeling the energy consumption of commercial buildings G.B.T.¹⁸ was introduced. It is also recommended for daily accurate of reference evapotranspiration estimating in various climatic areas of China and somewhere else with the same climates around the globe¹⁹.

The R.F. approach's major advantages are its capability to generalize, lower sensitivity to parameter values, and built-in cross-validation. The R.F. technique's ability in²⁰ has been examined in simulation long-term monthly air temperature. The results showed that the Random Forest approach is superior compared with the other methods. Also, authors in²¹ used the R.F. model for monthly temperatures predicting Kuala Lumpur in Malaysia using historical time series data from 2000 to 2012. R.F.'s performance was compared with other methods, application instances confirmed good properties of the R.F., which has higher accuracy. In²² R.F. was introduced for downscaling daily mean temperatures in the Pearl River basin in southern China. R.F. was demonstrated that the model has higher efficiency when compared with some alternative models. This result indicated that the Random Forest model is a feasible tool for the statistical downscaling of temperatures.

Neural network models, MLP-NN and RBF-NN, have been remarkable developments in the number and variations of the models established and the models' theoretical understanding in the past few years. The authors in²³ examined the Artificial Neural network in predicting minimal temperatures. They have applied MLP-NN architecture to model the predicting system and back-propagation algorithm to train the networks. Their results found that minimal temperatures could be predicted with an acceptable level of accuracy by employing MLP-NN architecture. ANN has been applied in predicting the air temperatures and relative humidity in a building to reduce energy utilization for air conditioning²⁴. Furthermore, the neural network was employed to estimate the hourly dew point temperatures, dew point temperatures is the temperature at which water vapor in the air condenses into liquid as in²⁵. Also, researchers in²⁶ was applied ANN to predicting monthly mean ambient temperatures.

It should be noted here that this study is the first attempt to investigate the performance of the machine learning modeling to predict the Temperature (T) and the Relative Humidity (Rh) in Terengganu, Malaysia, and hence, it is necessary to examine the performance of different machine learning modeling methods. Therefore, it is essential examine the potential of not only the “new” advanced machine learning modeling methods but also it is preferable to examine the “old” classical methods. In this context, in this study, two relatively new machine learning modeling methods including Gradient Boosting Tree (G.B.T.) and Random forest (R.F.) and two classical including Multi-Layered Perceptron (MLP) and Radial Basis Function (RBF) have been investigated. In fact, it is not mandatory that the advanced machine learning modeling methods outperformed the classical ones as the architecture and the mathematical procedure of the classical ones might be more suitable to detect the interrelationship between inputs and the output for these two variables.

In the light of the research background that has been presented above, it is clear that these meteorological variables under this study (humidity and temperature) have been studied, and several modeling approaches have been developed to predict them, including the utilization of different machine learning methods. However, in the development of these previous models, the model structure was almost the same using classical input pattern in the time series of these two variables. The existing models missed one major step in developing a successful prediction model, proposing different input–output mapping scenarios to detect the most appropriate model’s input(s) patterns. Therefore, in this study, twenty different input combination scenarios have been developed with respect to output patterns for a better choice of the most sensitive input(s) affecting the value of the desired output. In addition to that, most previous studies focused on monthly predicting. However, in the current study, the proposed models' reliability will be examined to capture daily and monthly fluctuations in the humidity and the temperature. Such two significant steps are the major novelty in the current research and could be considered the original research contribution in predicting the humidity and temperature, which could be considered a further step in the meteorological modelling.

To accomplish this goal, different artificial neural network (ANN) architectures (multi-layered perceptron, radial basis function was implemented; then it was compared with the Gradient Boosting tree (G.B.T.) and Random Forest (R.F.) techniques to predict daily and monthly air temperatures and relative humidity based on collected data from the year of 1985 to 2019.

Material and methods

Case study region and meteorological data

The site is located in Kuala Terengganu, Malaysia and covers the largest city in the area²⁷ at latitude 5° 23′ N and longitude 103° 06′ E. Figure 1 shows the location of the study area. Figure 1a has been generated by using Google Map software to identify the location of the study area. Air temperature and relative humidity data is observed and collected to predict them in Kuala Terengganu station accurately and is obtained from the Malaysian Meteorological Department. The observed and collected T and Rh data for the period years from 1985 to 2019. The data from 1985 to 2012 will be used to train the models, and from 2013 to 2019 will be used to test the proposed models. Table 1 shows the simple statistics done to ensure the data. It can be noticed that the height mean of daily air temperature is 30.7 °C highest, and relative humidity is 98.2%. In contrast, mean annual air temperature and relative humidity is 27.41 (°C) and 82.64%, respectively.

Table 1 Simple statistical analysis for the measured air temperature (T) and relative humidity (Rh).

Full size table

Gradient boosting tree (G.B.T.)

Gradient boost tree will be used in this study regression model, an ML approach for regression problems in that the key predicting approach is a combining of some weak predicting models. G.B.T. approach is based on gradual strengthening of the predicting function $F_{b}$, via adding of the estimator $S$. The learning process is when the S is fitting to $\left( {y - F_{b} } \right)$ error (residual) and throughout every alteration $F_{b} + 1$ is adjusting to minimizing error values²⁸:

$$F_{b + 1} \left( x \right) = F_{b} \left( x \right) + S = y - F_{b} \left( x \right)$$

(1)

For this objective, the loss function or ${\Psi }\left( {y,F\left( x \right)} \right)$ is specified and a series of inputs variables or $x = \left\{ {x_{1} , \ldots x_{n} } \right\}$ and a series of outputs values or $y$ are taken into account. A predicting modelling initiate via calculating the $F_{z} \left( x \right)$ as following:

$$F_{z} \left( x \right) = arg\;minimum_{\delta } \mathop \sum \limits_{\iota = 1}^{n} {\Psi }\left( {y_{\iota } ,\delta } \right)$$

(2)

Here, the $b{\text{th}}$ pseudo-residual value for $\iota {\text{th}}$ data set,$\delta_{\iota b}$ computed via: $\delta_{\iota b} = - \left[ {\frac{{\partial \psi \left( {y_{\iota } ,F\left( {x_{\iota } } \right)} \right)}}{{\partial \left( {F\left( {x_{\iota } } \right)} \right)}}} \right]F\left( {X_{\iota } } \right) = F_{b - 1} \left( {x_{\iota } } \right)$ for $\iota = 1, \ldots n$ Subsequently, the weak trainer function like a decision tree $\left( {E_{b} \left( {x_{\iota } } \right)} \right)$ is fitting to $\delta_{\iota b}$ and training based on the $\left\{ {\left( {x_{\iota } ,\delta_{\iota b} } \right.} \right\}_{\iota = 1}^{n}$ training sample. Through resolving a one-dimensional optimization relationship, the multiplier $\delta_{b}$ is computed as following:

$$\delta_{b} = arg\;minimum_{\delta } \mathop \sum \limits_{\iota = 1}^{n} {\Psi }\left( {y_{\iota } ,F_{b - 1} \left( {x_{\iota } } \right) + \delta h_{b} \left( {x_{\iota } } \right)} \right)$$

(3)

where $h_{b}$ is a new tree, the $F_{b} \left( {X_{\iota } } \right)$ a function is then taken = $F_{b - 1} \left( x \right) + \delta_{b} S_{b} \left( x \right)$ and the process is repeating till the second term of the total,$\delta_{b} S_{b} \left( x \right)$ is minimize at iteration $B$ where the final output $F_{B} \left( {x_{\iota } } \right)$ is attained, as shown in Fig. 2. For further details, more explanation can be found in^29,30.

Random forest (R.F.)

R.F. is a tree-based ensemble method, as shown in Fig. 3. It’s the algorithm that learns high-dimensional data. Every tree depends on collecting random variables, whereas, from several regression trees, a forest is growing to put with each other and forming an ensemble³¹. The bias is the same in all the trees, but the variances can be reduced by decreasing the relationship's coefficient³². Random forest for regression is dependent on a random vector, and it is created via grown trees, the trees predictor $d\left( {x,\varphi } \right)$ get numerical amounts. An outputs are numerical values, its assumed that the training sample is statistically independent. The mean square generalizing error of numerical predictor $d\left( x \right)$ could be expresses as following:

$$E_{X,Y} \left( {Y - d\left( x \right)} \right)^{2} = arg\;minimum_{\delta } \mathop \sum \limits_{i = 1}^{k} {\Psi }\left( {y_{i} ,F_{m - 1} \left( {x_{i} } \right) + \delta d_{m} \left( {x_{i} } \right)} \right)$$

(4)

The R.F. forecaster is created via averaging over $k$ of the trees $\left( {d\left( {x,\varphi } \right)} \right)$. More details regarding random forest model theories could be discovered in³¹.

Artificial neural network (ANN)

It’s a computational model of the biological brain. It is included a considerable number of interconnecting neurons similar to the brain. Any neuron can perform just easy computations. However, the architectures of ANN are more simple comparing with a biological neuron. ANN is constructed in layers connecting to either single or many hidden layers where the actual processes is performed by weighted connection. In the hidden layers, any neuron connects to each neuron in the output layer. The result of the processing is obtained from the output layer. Learned ANN can attain via a certain training or learning algorithm that expands according to the learning law, supposed to mimic the biological system's learning mechanism³³. Anyhow, as an assembly of neurons, in order to perform a complex task, ANN could be learning and involving: patterns recognitions, systems identifications, trend predicting, and processes controlling³⁴.

Multilayer perceptron neural network (MLP-NN)

In feedforward network Multilayer perceptron is maybe consider the most popular type. Figure 4 displays the M.L.P. with two hidden layers, input, and output layers. Only neurons acts as buffers in the input layer for the distribution of the signals of the input $x_{\iota }$ to neurons that are existing in hidden layers. In the hidden layers, any Neuron $j$ sum it is signals of the input $x_{\iota }$ after weighted them with the strength of the particular connection $wj_{\iota }$ from input layer, then calculates it is output $y_{j}$ which is consider as a function $f$ of sums up, as follows:

$$y_{j} = f \left( {\sum {wj_{\iota } x_{\iota } } } \right)$$

(5)

where $F$ could be an RBF or hyperbolic tangent or a sigmoidal or simple threshold function. In the output layer, there is a similar calculation for the output of the neuron. The most common training algorithms adopt back-propagation and gradient descent in Multilayer perceptron. It provides a changing $\Delta {\text{wj}}_{{{\iota }}}$ weight of the connections among neurons $\iota$ and ${\text{j}}$:

$$\Delta wj_{\iota } = \eta \delta_{j} x_{\iota }$$

(6)

Here $\delta_{j}$ is the factor that depends on whether neuron $j$ is the input neuron or a hidden neuron and $\eta$ is a parameter known as the learning rate. So, for output neuron:

$$\delta_{j} = \left( {\frac{\partial f}{{\partial net_{j} }}} \right)\left( {y_{j}^{\left( t \right)} - y_{j} } \right)$$

(7)

And for hidden neuron:

$$\delta_{j} = \left( {\frac{\partial f}{{\partial net_{j} }}} \right)\mathop \sum \limits_{q} w_{qj} \delta_{q}$$

(8)

In Eq. (7), $net_{j}$ is an overall weighted total of signals in the input layer to ($j$, $y_{j}^{\left( t \right)}$) neuron is the goal output for neurons $j$. For hidden neurons, there aren’t target outputs. As in Eq. (8), a variation of a desired and factual output of neurons in hidden layers $j$ which are replace via weighted sums of $\delta_{q}$ term previously achieved for neuron $q$ connecting to the output of $j$. Therefore, repeatedly, with output layer starting $\delta$ term for neuron is calculated in every layer then for every connection the weights updated defined. The weights updated processing could occur later than presenting every training pattern or following presenting of entire sets of training patterns. For both cases, a training epoch is considered complete when each training pattern was introduced once to Multilayer perceptron. In even the most trivialities, the M.L.P. must be adequately trained for several epochs. Adding the term momentum consider a standard method for accelerating learning to Eq. (6), allowing the change of weight to influence a new change in weight as below effectively:

$$\Delta wj_{\iota } \left( {k + 1} \right) = \eta \delta_{j} x_{\iota } + \mu \Delta wj_{\iota } \left( k \right)$$

(9)

Here $\mu$ represent a ‘momentum’ coefficient, whereas the $\Delta wj_{\iota } \left( {k + 1} \right)$, $\Delta wj_{\iota } \left( l \right)$ is the change of weights in epochs $\left( {k + 1} \right)$ and $\left( k \right)$ respectively.

Radial basis function neural network (RBF-NN)

RBF neural network is a typical kind of ANN. Figure 5 shows the RBF-NN structure, which involves input, an output layer, and neurons' hidden layers. An input layer neuron received an inputs pattern ($x_{1}$ − $x_{N}$). Whereas a hidden layer neuron provided the sets of activation function, which represent the arbitrary “basis” for input pattern in an input range or space in order to expand in hidden range via manner of nonlinear conversion. A distance is computed between centre of every basis function and the input vector at the input of every hidden neuron. Using activation function to such distance produced the output of hidden neurons. The output of the Radial basis function neural network $y_{1}$ to $y_{p}$ are forming through neuron in output layer as weighted sum up of a hidden layer neurons basis. Usually, the activation function was selected as a standard function that’s positive at it is centre $x$ equal to zero, and then uniform declines to 0 on both sides. A popular option is a Gaussian function:

$$k\left( x \right) = exp\left( { - \frac{{x^{2} }}{2}} \right)$$

(10)

The function above could be shifting to a random centre $x$ equal to $m$, and stretched via varying it is spread $\sigma$ as following:

$$k\left( {\frac{x - m}{\sigma }} \right) = exp\left( { - \frac{{\left( {x - m} \right)^{2} }}{{2\sigma^{2} }}} \right)$$

(11)

The output of RBF-NN: $y_{j }$ is presented as below equation

$$y_{j} = \mathop \sum \limits_{\imath = 1}^{h} wj_{\imath } k\left( {\frac{{ \left\|x - m_{\imath }\right\| }}{{\sigma_{\imath } }}} \right)_{\forall x}$$

(12)

Here $wj_{\imath }$ represent weights of hidden neurons $\imath$ to $j$ which mean the output, $m_{\imath }$ the centre of activation function $\imath$, whereas $\sigma_{\imath }$ a spread of function $\left\|x - m_{\imath }\right\|$ is a norm of $x - m_{\imath }$. The norm could compute in many methods, but the most popular is the Euclidean norm defines as:

$$\left\|x - m_{\imath }\right\| = \sqrt {\left( {x_{1} - m_{\imath } } \right)^{2} + \left( {x_{2} - m_{\imath 2} } \right)^{2} + \cdots + \left( {x_{n} - m_{\imath n} } \right)^{2} }$$

(13)

This norm is giving the distance between point $x$ and point $m_{\imath }$ in N-dimension space. Every point $x$ that’s the same radial distance from $m_{\imath }$ gives a similar value to the norm. Learning (Training) RBF-NN aims to define the neuron weights $wj_{\imath }$, RBF center $m_{\imath }$ and spreads $\sigma_{\imath }$ which allow the network to produce the correct output $y_{j}$ related to the inputs pattern $x$.

Statistical evaluation

In general, there is a need to examine the performance of the developed prediction model and compare the performance achieved from different models through particular statistical index. However, it is essential to use several statistical indices due to the fact that there is a possibility that two or more models could achieve similar or nearly values for particular statistical index, and hence, it is challenging to confirm which model outperformed the others. It should be noted here that each statistical index evaluates the model from single angle of the well-fitting between the model outputs and the desired values. Therefore, it is advisable to examine the model against several statistical indices to fully evaluate the performance each model individually and carry out a robust comparison analysis between them in order to get a solid confirmation about the most appropriate modeling method.

Three measures of performance are using to evaluate the result³⁵: (R) the coefficient of correlation which is range (− 1 to 1), relative error measured (RMSE) Root mean square error and mean absolute error (M.A.E.) which is ranged between zero < M.A.E. < ∞. Higher values of R indicate superior model performance, and lower RMSE and M.A.E. indicate great model performance. According to³⁶ as R value equal to 1 is mean the perfect fit, R more than 0.75 is a very good fit, R equivalent (0.64 to 0.74) is a good fit, R = 0.5 to 0.64 is an acceptable fit, and R less than 0.5 is an unacceptable fit.

$$R = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {M_{{o,{\mathfrak{i}}}} - \overline{{M_{o} }} } \right)\left( {M_{{P,{\mathfrak{i}}}} - \overline{{M_{P} }} } \right)}}{{\sqrt {\left( {M_{{o,{\mathfrak{i}}}} - \overline{{M_{o} }} } \right)^{2} \mathop \sum \nolimits_{i = 1}^{n} \left( {M_{{o,{\mathfrak{i}}}} - \overline{{M_{o} }} } \right)^{2} } }}$$

(14)

$${\text{RMSE}} = \sqrt {\frac{1}{{\text{n}}}\mathop \sum \limits_{{{\text{i}} = 1}}^{{\text{n}}} \left( {M_{{{\text{o}},{\text{i}}}} - M_{{{\text{p}},{\text{i}}}} } \right)^{2} }$$

(15)

$${\text{M}}.{\text{A}}.{\text{E}}. = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left| {\frac{{M_{o,i} - M_{p,i} }}{{M_{o,i} }}} \right| \times 100{ }$$

(16)

Here; $M_{{o,{\mathfrak{i}}}} { }$ represents values of meteorological data in the current observed $\left( i \right)$, $M_{{P,{\mathfrak{i}}{ }}}$ mean the predict values, $\overline{{{\text{M}}_{o} }}$ represent an average value of actual ‘observations’ and $\overline{{{\text{M}}_{P} }}$ refer to the average values of prediction, and $N$ represent a number of the dataset.

The choice of different statistical indices such as the RMSE and R is due to the fact that RMSE is proposed to examine the variance of the residuals, while R is to examine the relative measure of fit. RMSE indicates the absolute fit of the model to the data–how close the observed data points are to the model’s predicted values, which is a good measure of how accurately the model predicts the response. Whereas R is a relative measure of fit and has the useful property that its scale is intuitive and could examine the trend-match between the model outputs and the desired actual values.

From the above, it is clear that these R and RMSE are totally different, first, in terms of the mathematical formulation to calculate them as presented in Eqs. (14) and (15), second, as each one evaluates different measure of the model output compared with the desired observed values. As a result, it is expected that if two or more models provide same level of accuracy against particular statistical index, both could provide completely difference performance according to the other statistical index.

Given the requirements of the machine learning approaches, the raw meteorological dataset will normalize and range from zero to one before put in models by use the below formula¹⁹:

$$M_{n} = \frac{{M_{i} - M_{min} }}{{M_{max} - M_{min} }}$$

(17)

where $M_{n}$ and $M_{i}$ represents the normalizing and raw training—testing dataset; $M_{max}$ and $M_{min}$ represents the maximum and minimal training—testing dataset.

Furthermore, Taylor diagrams (T.D.s) provide a graphical representation of predicting and observed datasets. In the present work, T.D. will utilize for investigating how the model has higher accuracy compared with ML models' alternative. Some specifications of the predicting and actual values are data could be merged into T.D.³⁷. For instance, the standard deviation (S.D.), CC, and RMSE between predicting and observed data could be demonstrated in the Taylor diagrams.

In this paper, we will also apply the method recommended by³⁸ for the uncertainty analysis, which is calculated via (2.5 $xl$, 97.5 $xu$) percent percentiles.

$$Bracketed\;by\;95PPU{ } = { }\frac{1}{h}{ }Count{ }\left( {h{|}xl{ } \le h{ } \le { }xu} \right)*{ }100$$

(18)

where $h$ represents the number of actual data for a testing phase, in Eq. (18), the rate of ‘Bracketed by 95PPU’ is greater or 100 percent at completely actual data for a testing phase is between $xl$ that mean value of lower 95PPU and $xu$ it means value of upper 95PPU.

Result and discussion

This paper's primary purpose is to evaluate machine learning models' performance (MLP-NN, RBF-NN, G.B.T., LR and R.F.) in predicting air temperature and relative humidity at Kuala Terengganu, Malaysia. To predict these meteorological data, quantify the level of correlation of data time series between output and input variables for different times. Two methods were applied to identify the optimal lag of antecedents’ predictor by assuming several lag times. The autocorrelation function (A.C.F.) is a statistical analysis used to evaluate the correlation among adjacent values correlation³⁹. Simultaneously, the partial autocorrelation (PACF) is defined as the partial correlation with its lag values of a time series at the same time didn't consider the effects of intervening lag auto-correlation⁴⁰. A.C.F. applied for monthly input design, while partial autocorrelation function used for daily input design as in Fig. 6. The historical meteorological dataset on T and Rh corresponding to various lags beginning from the original day (t) to 6 days earlier (time t–6) are using as daily input for the models to predicting meteorological data on day $t$ in Table 2. According to PACF, the highest correlation between input and output is moderate 0.66 and 0.58 for both $T_{t}$ and $Rh_{t}$ , there is a declining in correlation when increase daily time span between input and output (denotes t = daily time step) as in Fig. 6. Also, from Fig. 6 it could see the value of A.C.F. between the target time series $T_{m}$ and $T_{m - 1}$, $T_{m - 2}$, $T_{m - 3}$ and $T_{m - 4}$ (where m is month lag) is decreased as increase lag time, where the high correlation gets with 0.75 in lag 1 month. Thus, these input combinations time series are considered in this work as probable inputs to predicting the $T_{m}$ monthly time series. And same procedure accomplished for $Rh_{m}$ also better correlation is moderate with 0.5 between input and output relative humidity variable, and it was found there is a decrease in correlation as increase lag time so the input structures as in Table 2. To predict $Rh_{m}$. In contrast, the target time series T and Rh's dependence characteristics with previous times series of T and Rh have decreased trend via increase the lag time. Therefore, the input design is shown in Table 2.

Table 2 Input design for air temperature and relative humidity.

Full size table

After data analysis and normalization, the meteorological data are divided into sets, 80%, and 20%, for the training and testing phases, respectively. The models' predicting performance developed in this paper is evaluated based on the statistical evaluation (CC, MAE, and RMSE). Table 3 demonstrates the summary of the results obtained in this study of prediction performance indices for the predictive models MLP-NN, RBF-NN, G.B.T., R.F. and L.R.

Table 3 Results of a performance evaluation using the developed ML techniques during testing phase.

Full size table

With regards to daily air temperature, for MLP-NN and G.B.T. methods, the optimal daily air temperature in prediction is obtained for input combination previous 6 days (M6) with good value of R equal to 0.713 and 0.711, respectively. At the same time, R.F. approach shows the best performance during a time horizon equal to 4 days (M4) with R equal to 0.704. The results regarding applying for the LR model list forth in the order of model performance where R value equal to 6.686 for three models (M4–M6). RBF exhibits the lowest performance compared with all other proposed models.

Regarding prediction daily relative humidity, the performance of LR model increase as increase input variables until it almost shows constant values with CC = 0.6 which is the lowest than values get from G.B.T, R.F. and MLP-NN. Also, it could be noticed that the RBF-NN approach did not perform well, where the highest value gets it for R is 0.59 and 0.54 using lead time 1 and 2 days of Rh values (M7 and M8), respectively. In contrast, M12 using the previous 6 days of Rh as input achieves better performance compared with other structures by applying MLP-NN and G.B.T. with a moderate level of accuracy in daily prediction Rh with R 0.6335 and 0.6152, respectively. The lowest values of MAE and RMSE are achieving at M12 with 0.0263 and 0.0394, respectively. The input combination M11 using the previous 5 days' values as input for the R.F. model demonstrates optimum performance with R found to be 0.63 and the lowest values for both M.A.E. and RMSE out of 0.026 0.039.

All monthly prediction air temperature methods achieved better performance for input structure previous 4 months values of T (M16) with good R of 0.846, 0.832, 0.83, and 0.84 for MLP-NN, RBF-NN, G.B.T., and R.F., respectively. Besides, over M16, MLP-NN produces a lower error with M.A.E. = 0.0105 and RMSE = 0.0135 compared to other techniques.

Concerning monthly prediction relative humidity, the highest R with the lowest RMSE were obtained from the RBF-NN algorithm using the input structure previous 4 months of Rh (M20). Also, M20 was superior to the G.B.T. considerably in among all input combinations. At the same time, M19 showed the highest performance compared with other structures for MLP-NN and R.F., with R of 0.6704 and 0.7019, respectively. Finally, we could conclude that ANN-MLP, G.B.T. and R.F. model shows better performance than LR model in all input combination designs for prediction T and Rh in daily and monthly horizon. For better visualization, scatter plots between the actual values and predict T and Rh's value using the most accurate input combinations are shown in Figs. 7 and 8 for station Kuala Terengganu, Malaysia.

It could be observed that both modeling approaches LR and the machine learning models provide nearly same results considering the RMSE as the statistical index. On the other hands, it could be noted that the machine learning model outperformed the LR model while examining the R value as the statistical index. However, in general, there is a potential to improve the predicting accuracy for both variables T and Rh whether for daily or monthly time-increments. From this perspective, the machine learning as modeling approach is more convincer and has stronger potential for further improvement over the LR model to achieve better prediction accuracy for these two variables for both time-increments.

Although Figs. 7 and 8 and Table 3 show the observed and predicted values and evaluation criteria for all models, summarized the comparison results among ML models could not be discussed via these figures and table. Therefore, the Taylor diagram (T.D.) using the most accurate input structures from the above evaluation criteria will compare the techniques presented in this study. The main concept of the Taylor diagram is to presents the closest prediction model with the corresponding actual observation in the 2-dimensional scaling (standard deviation (S.D.) on the polar axis and the R on the radial axis)³⁷. Standard deviation referring to how much on average measurements differ from each other. So, the relative value of standard deviation predicted (SDP) from standard deviation actual (S.D.A.) indicates high precision.

In contrast, versus as far the value of SDP from S.D.A. refers to lower accuracy. Thus, in Figs. 9 and 10 daily T and daily Rh, it can be observed that the MLP-NN was superior compared to other approaches, which have closer SD with 0.81486 to actual SD 1.146252 in daily T and SD = 3.134169 predicted daily Rh to actual SD = 5.001828. An evaluation of actual and predictor T monthly values yielded via the most accurate models was done for station Kuala Terengganu. T.D.s are shown for T in Fig. 11 that proved the slight efficiency of MLP-NN over RBF-NN, G.B.T., and R.F. models. As shown in Fig. 12, RBF-NN predicted Rh m values more accurately than other models with best SDP = 2.010736.

Accuracy improvement (A.I.)

In order to demonstrate the accuracy improvement, we calculated the accuracy improvement of the MLP-NN compared to RBF-NN, G.B.T. and R.F. using the A.I. indicator Eq. (19).

$$AI = \left( {\frac{{CC_{MLP} - CC_{n} }}{{CC_{n} }}} \right)*100$$

(19)

where $CC_{n}$ is the correlation coefficients for three machine learning model, and $CC_{MLP}$ is the correlation coefficient for the MLP-NN model. It can be seen from Fig. 13, a noticeable level of improvement has been achieved when MLP-NN is adopted compared to three other models (RBF-NN, G.B.T., and RF) for both parameters during the daily proposed time horizons. A.I. varies between 0.3% to more than 7%, where the highest A.I. achieved when the MLP-NN model was used to predict daily relative humidity. Even though MLP-NN also outperform with positive values over three models in prediction monthly T, but for prediction monthly Rh it showed negative value over RBF-NN and R.F. models with − 6% and − 7%, respectively as in Fig. 13 due to the MLP-NN has lesser values of R compare with RBF-NN and G.B.T. out of 0.6507, 0.7133, 0.6836, respectively.

Uncertainty analysis

The prediction uncertainty of the proposed models was examined using the 95PPU. The d-factor (20), an indicator of the standard deviation interval's width, was also carried out.

$$d{ - }factor = { }\frac{{\overline{dx} }}{\sigma x}$$

(20)

where σ$x$ is the S.D. of observed data $X$, and $\overline{dx} { }$ is the average distance between the upper and lower bands, which could be computed using the below formula:

$$\overline{dx} = \frac{1}{k}\mathop \sum \limits_{i = 1}^{k} \left( {{ }XU{ } - { }XL{ }} \right)$$

(21)

The conclusions drawn from the analysis of two uncertainty measures can be seen in Table 4.

Table 4 Results of 95PPU and d-factor for best models in prediction daily and monthly.

Full size table

The results show that about 100 percent of the predicted data is within the range of predicted values based on 95PPU, while the d-factors are low. Such results indicate the reliability of the proposed models in predicting the air temperature and relative humidity.

Recall that the purpose of the current research is to investigate the potential of the existing modeling approaches to provide accurate prediction accuracy for T and Rh for different time-increments. Although a simple equation based on the LR modeling approach could provide relatively similar accuracy of the machine learning model based on particular statistical index, it is worth to examine different statistical index to confirm and settle the effectiveness of all the examined models. In addition, as it could be noticed that the achieved accuracy from both model approaches are not the expected outstanding ones, it is essential to keep improving the better one, and hence, there is a need to figure out the modeling approach that has enough potential for further improvement over the others. On the other hand, for the first attempt applications of the machine learning modeling approaches, it is necessity to examine both the “new” advanced and “old” methods, which could be considered as the major value of the current research.

Conclusions

The accuracy of ML models, namely MLP-NN, RBF-NN, G.B.T., L.R. and R.F., were investigated to predict air temperatures and relative humidity in different time horizons (daily and monthly) using historical meteorological data. Different input combinations were investigated with varying times of lag. The results showed that MLP-NN is a leading algorithm compared with other models in predicting monthly relative humidity with a correlation coefficient equal to 0.8462, followed by RBF-NN with a correlation coefficient equal to 0.7113. By applying Accuracy Improvement, 7% of improvement was achieved after proposing MLP-NN. Uncertainty analysis using 95PPU and d-factor was conducted to test the reliability of the MLP-NN model. It can be concluded that the proposed model can be adopted in predicting these two parameters, even with a new dataset. Although the developed MLP model's attained result is acceptable, future works can be explored by hybridizing the MLP-NN model with optimizers for more accuracy.

References

Ridwan, W. M. W. M. et al. Rainfall forecasting model using machine learning methods: Case study Terengganu, Malaysia. Ain Shams Eng. J. https://doi.org/10.1016/j.asej.2020.09.011 (2020).
Article Google Scholar
Chong, K. L. et al. Performance enhancement model for rainfall forecasting utilizing integrated wavelet-convolutional neural network. Water Resour. Manag. 34, 2371–2387 (2020).
Article Google Scholar
Mellit, A., Pavan, A. M. & Benghanem, M. Least squares support vector machine for short-term prediction of meteorological time series. Theor. Appl. Climatol. 111, 297–307 (2013).
Article ADS Google Scholar
Miyata, M. et al. Proceedings: Building simulation 2007. Energy 1968–1974 (2007).
El-Shafie, A., Najah, A., Alsulami, H. M. & Jahanbani, H. Optimized neural network prediction model for potential evapotranspiration utilizing ensemble procedure. Water Resour. Manag. 28, 947–967 (2014).
Article Google Scholar
Ehteram, M. et al. Performance improvement for infiltration rate prediction using hybridized adaptive neuro-fuzzy inferences system (ANFIS) with optimization algorithms. Ain Shams Eng. J. https://doi.org/10.1016/j.asej.2020.08.019 (2020).
Article Google Scholar
Adnan, M. et al. Prediction of relative humidity in a high elevated basin of western Karakoram by using different machine learning models. In Weather Forecasting [Working Title] (IntechOpen, 2021). https://doi.org/10.5772/intechopen.98226.
El-Shafie, A., Mukhlisin, M., Najah, A. A. A. & Taha, M. R. R. Performance of artificial neural network and regression techniques for rainfall-runoff prediction. Int. J. Phys. Sci. 6, 1997–2003 (2011).
Google Scholar
Jumin, E. et al. Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction. Eng. Appl. Comput. Fluid Mech. 14, 713–725 (2020).
Google Scholar
Ehteram, M. et al. Design of a hybrid ANN multi-objective whale algorithm for suspended sediment load prediction. Environ. Sci. Pollut. Res. https://doi.org/10.1007/s11356-020-10421-y (2020).
Article Google Scholar
Sapitang, M., Ridwan, W., Kushiar, K. F., Ahmed, A. N. & El-Shafie, A. Machine learning application in reservoir water level forecasting for sustainable hydropower generation strategy. Sustainability 12, 6121 (2020).
Article Google Scholar
Abobakr Yahya, A. S. et al. Water quality prediction model based support vector machine model for ungauged river catchment under dual scenarios. Water 11, 1231 (2019).
Article Google Scholar
Najah Ahmed, A. et al. Machine learning methods for better water quality prediction. J. Hydrol. 578, 124084 (2019).
Article CAS Google Scholar
Palit, A. K. & Popovic, D. Computational Intelligence in Time Series Forecasting: Theory and Engineering Applications (Springer Science & Business Media, 2006).
MATH Google Scholar
Körner, P., Kronenberg, R., Genzel, S. & Bernhofer, C. Introducing gradient boosting as a universal gap filling tool for meteorological time series. Meteorol. Zeitschrift 27, 369–376 (2018).
Article ADS Google Scholar
Qiu, X., Hong, H., Xu, W. & Yang, Q. Surface Temperature Prediction of Asphalt Pavement Based on APRIORI-GBDT (2020).
Cai, J., Xu, K., Zhu, Y., Hu, F. & Li, L. Prediction and analysis of net ecosystem carbon exchange based on gradient boosting regression and random forest. Appl. Energy 262, 114566 (2020).
Article Google Scholar
Touzani, S., Granderson, J. & Fernandes, S. Gradient boosting machine for modeling the energy consumption of commercial buildings. Energy Build. 158, 1533–1543 (2018).
Article Google Scholar
Fan, J. et al. Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric. For. Meteorol. 263, 225–241 (2018).
Article ADS Google Scholar
Karimi, S. M., Kisi, O., Porrajabali, M., Rouhani-Nia, F. & Shiri, J. Evaluation of the support vector machine, random forest and geo-statistical methodologies for predicting long-term air temperature. ISH J. Hydraul. Eng. 26, 376–386 (2020).
Article Google Scholar
Naing, W. Y. N. & Htike, Z. Z. Forecasting of monthly temperature variations using random forests. ARPN J. Eng. Appl. Sci. 10, 10109–10112 (2015).
Google Scholar
Pang, B., Yue, J., Zhao, G. & Xu, Z. Statistical downscaling of temperature with the random forest model. Adv. Meteorol. 2017, 1–11 (2017).
Article Google Scholar
Shrivastava, G., Karmakar, S., Kowar, M. K. & Guhathakurta, P. Application of artificial neural networks in weather forecasting: A comprehensive literature review. Int. J. Comput. Appl. 51, 17–29 (2012).
Google Scholar
Mba, L., Meukam, P. & Kemajou, A. Application of artificial neural network for predicting hourly indoor air temperature and relative humidity in modern building in humid region. Energy Build. 121, 32–42 (2016).
Article Google Scholar
Zounemat-Kermani, M. Hourly predictive Levenberg–Marquardt ANN and multi linear regression models for predicting of dew point temperature. Meteorol. Atmos. Phys. 117, 181–192 (2012).
Article ADS Google Scholar
Go, M. Daily means ambient temperature prediction using artificial neural network method: A case study of Turkey. Renew. Energy 34, 1158–1161 (2009).
Article Google Scholar
Albani, A. & Ibrahim, M. Z. Preliminary development of prototype of savonius wind turbine for application in low wind speed in Kuala Terengganu, Malaysia. Int. J. Sci. Technol. Res. 2, 102–108 (2013).
Google Scholar
Friedman, J. H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 29, 1189–1232 (2001).
Article MathSciNet Google Scholar
Sharafati, A., Asadollah, S. B. H. S., Motta, D. & Yaseen, Z. M. Application of newly developed ensemble machine learning models for daily suspended sediment load prediction and related uncertainty analysis. Hydrol. Sci. J. 0, 1–21 (2018).
Google Scholar
Boehmke, B. & Greenwell, B. Hands-on Machine Learning with R (Chapman and Hall/CRC, 2020).
MATH Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Hastie, T., Tibshirani, R. & Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Springer Science & Business Media, 2009).
Book Google Scholar
Yilmaz, A. S. & Özer, Z. Pitch angle control in wind turbines above the rated wind speed by multi-layer perceptron and radial basis function neural networks. Expert Syst. Appl. 36, 9767–9775 (2009).
Article Google Scholar
Behrang, M. A., Assareh, E., Ghanbarzadeh, A. & Noghrehabadi, A. R. The potential of different artificial neural network (ANN) techniques in daily global solar radiation modeling based on meteorological data. Sol. Energy 84, 1468–1480 (2010).
Article ADS Google Scholar
Olyaie, E., Banejad, H. & Chau, K. A comparison of various artificial intelligence approaches performance for estimating suspended sediment load of river systems: A case study in United States. Environ. Monit. Assess. https://doi.org/10.1007/s10661-015-4381-1 (2015).
Article PubMed Google Scholar
Moriasi, D. N. et al. Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE 50, 885–900 (2007).
Article Google Scholar
Taylor, K. E. In a single diagram. J. Geophys. Res. 106, 7183–7192 (2001).
Article ADS Google Scholar
Abbaspour, K. C. et al. Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT. J. Hydrol. 333, 413–430 (2007).
Article ADS Google Scholar
McCuen, R. H. Modelling hydrological change: Statistical methods. In Modeling Hydrologic Change: Statistical Methods (Lewis Publishers, 2002).
Al-mukhtar, M. Modelling the root zone soil moisture using artificial neural networks, a case study. Environ. Earth Sci. https://doi.org/10.1007/s12665-016-5929-2 (2016).
Article Google Scholar

Download references

Acknowledgements

The author would like to thank the Department of Meteorology Malaysia (MMD) for providing this study with the data.

Funding

This research was supported by the Ministry of Education (MOE) through Fundamental Research Grant Scheme (FRGS/1/2020/TK0/UNITEN/02/16).

Author information

Authors and Affiliations

College of Technical Engineering, Islamic University, Najaf, Iraq
Marwah Sattar Hanoon
College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
Marwah Sattar Hanoon
Institute of Energy Infrastructure (IEI), Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
Ali Najah Ahmed
Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
Nur’atiah Zaini
College of Science, Al Muthanna University, Samawah, Al-Muthanna, Iraq
Arif Razzaq
Department of Civil Engineering, Faculty of Engineering, Universiti Malaya (UM), 50603 Kuala Lumpur, Malaysia
Pavitra Kumar & Ahmed El-Shafie
National Water and Energy Center, United Arab Emirates University, P.O. Box 15551, Al Ain, United Arab Emirates
Mohsen Sherif, Ahmed Sefelnasr & Ahmed El-Shafie
Civil and Environmental Engineering Department, College of Engineering, United Arab Emirates University, P.O. Box 15551, Al Ain, United Arab Emirates
Mohsen Sherif

Authors

Marwah Sattar Hanoon
View author publications
You can also search for this author in PubMed Google Scholar
Ali Najah Ahmed
View author publications
You can also search for this author in PubMed Google Scholar
Nur’atiah Zaini
View author publications
You can also search for this author in PubMed Google Scholar
Arif Razzaq
View author publications
You can also search for this author in PubMed Google Scholar
Pavitra Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Mohsen Sherif
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Sefelnasr
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed El-Shafie
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Data curation, A.N.A.; Formal analysis, M.S.H., A.R.; Methodology, N.Z. and A.E.-S.; Writing—original draft, M.S.H.; Writing—review and editing, P.K., M.S., A.S. and A.E.-S.

Corresponding author

Correspondence to Ali Najah Ahmed.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hanoon, M.S., Ahmed, A.N., Zaini, N. et al. Developing machine learning algorithms for meteorological temperature and humidity forecasting at Terengganu state in Malaysia. Sci Rep 11, 18935 (2021). https://doi.org/10.1038/s41598-021-96872-w

Download citation

Received: 19 May 2021
Accepted: 17 August 2021
Published: 23 September 2021
DOI: https://doi.org/10.1038/s41598-021-96872-w

This article is cited by

Monthly climate prediction using deep convolutional neural network and long short-term memory
- Qingchun Guo
- Zhenfang He
- Zhaosheng Wang
Scientific Reports (2024)
Global soil respiration estimation based on ecological big data and machine learning model
- Jiangnan Liu
- Junguo Hu
- Kanglai Han
Scientific Reports (2024)
River Water Temperature Prediction Using a Hybrid Model Based on Variational Mode Decomposition (VMD) and Outlier Robust Extreme Learning Machine
- Ehsan Mirzania
- Thendiyath Roshni
- Salim Heddam
Environmental Processes (2024)
Comparative evaluation of machine learning techniques in predicting fundamental meteorological factors based on survey data from 1981 to 2021
- Israa Jasim Mohammed
- Bashar Talib Al-Nuaimi
- Anindita Nath
Spatial Information Research (2024)
Intelligent irrigation scheduling scheme based on deep bi-directional LSTM technique
- R. Jenitha
- K. Rajesh
International Journal of Environmental Science and Technology (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Material and methods

Case study region and meteorological data

Gradient boosting tree (G.B.T.)

Random forest (R.F.)

Artificial neural network (ANN)

Multilayer perceptron neural network (MLP-NN)

Radial basis function neural network (RBF-NN)

Statistical evaluation

Result and discussion

Accuracy improvement (A.I.)

Uncertainty analysis

Conclusions

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links