Hybrid systems using residual modeling for sea surface temperature forecasting

de Mattos Neto, Paulo S. G.; Cavalcanti, George D. C.; de O. Santos Júnior, Domingos S.; Silva, Eraylson G.

doi:10.1038/s41598-021-04238-z

Download PDF

Article
Open access
Published: 11 January 2022

Hybrid systems using residual modeling for sea surface temperature forecasting

Paulo S. G. de Mattos Neto¹,
George D. C. Cavalcanti¹^na1,
Domingos S. de O. Santos Júnior¹^na1 &
…
Eraylson G. Silva¹^na1

Scientific Reports volume 12, Article number: 487 (2022) Cite this article

3156 Accesses
14 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The sea surface temperature (SST) is an environmental indicator closely related to climate, weather, and atmospheric events worldwide. Its forecasting is essential for supporting the decision of governments and environmental organizations. Literature has shown that single machine learning (ML) models are generally more accurate than traditional statistical models for SST time series modeling. However, the parameters tuning of these ML models is a challenging task, mainly when complex phenomena, such as SST forecasting, are addressed. Issues related to misspecification, overfitting, or underfitting of the ML models can lead to underperforming forecasts. This work proposes using hybrid systems (HS) that combine (ML) models using residual forecasting as an alternative to enhance the performance of SST forecasting. In this context, two types of combinations are evaluated using two ML models: support vector regression (SVR) and long short-term memory (LSTM). The experimental evaluation was performed on three datasets from different regions of the Atlantic Ocean using three well-known measures: mean square error (MSE), mean absolute percentage error (MAPE), and mean absolute error (MAE). The best HS based on SVR improved the MSE value for each analyzed series by $82.26\%$, $98.93\%$, and $65.03\%$ compared to its respective single model. The HS employing the LSTM improved $92.15\%$, $98.69\%$, and $32.41\%$ concerning the single LSTM model. Compared to literature approaches, at least one version of HS attained higher accuracy than statistical and ML models in all study cases. In particular, the nonlinear combination of the ML models obtained the best performance among the proposed HS versions.

Recent reductions in aerosol emissions have increased Earth’s energy imbalance

Article Open access 03 April 2024

Global prediction of extreme floods in ungauged watersheds

Article Open access 20 March 2024

The risk of concurrent heatwaves and extreme sea levels along the global coastline is increasing

Article Open access 11 April 2024

Introduction

Climate change is one of humanity’s most critical global challenges since it is harmful to all living beings on Earth. Therefore, it is a crucial subject for all countries, independent of geographical, social, or economic characteristics. Global climate change directly affects the environment and has already had observable effects, such as glacier shrinkage, more intense heat waves, and sea-level rise. According to the Intergovernmental Panel on Climate Change, several regions will be affected in diverse manners over time, leading to significant societal and environmental systems changes. The Sea Surface Temperature (SST) is one of the most significant variables for monitoring the global climate system¹. SST is related to ocean heat content, which directly affects global warming. The SST records are collected using satellites, which read these values mainly of moored and drifting buoys and can be considered the best-known ocean parameter on global scales².

The variability of the SST is correlated with many natural phenomena^1,3,4,5,6. For instance, the El Niño/Southern Oscillation and Indian Ocean Dipole relate to the warming or cooling of SST in predefined areas of the Pacific and Indian oceans, respectively¹. Furthermore, the SST from the Atlantic ocean is connected with the quantity of rain and cloudiness in South America, droughts in Northeastern Brazil, and climate change on the Amazon vegetation^4,7. In this way, SST forecasting can support the decision in many operational applications, such as rainfall monitoring, turtles tracking, tourism, fishing management, and coral bleaching evaluation⁸.

In the time series forecasting literature, statistical and machine learning (ML) models have been widely employed in various domains^9,10,11. Among statistical models, linear methods, such as Autoregressive (AR), Moving Average (MA), and ARIMA models are the most popular due to their simplicity, adaptability, and the Box and Jenkins methodology¹², which provides a well-established design process for time series modeling. The Box & Jenkins methodology used in the design of the linear models guarantees that the linear patterns are properly modeled. However, this class of models is not able to properly model temporal phenomena that present nonlinear patterns. ML models have been highlighted due to their performance, flexibility, nonlinearity and because they are data-driven techniques, allowing temporal modeling without making any a priori assumption¹³. Among the ML models, multilayer perceptron neural networks (MLP), support vector regression (SVR), and long short-term memory (LSTM) are examples of techniques that have reached promising results^11,14,15.

We can highlight the following works that used linear statistical models and nonlinear ML models for SST forecasting. Lins et al.³ employed an SVR to the daily forecast of one year ahead of two different locations of the tropical Atlantic ocean. Salles et al.⁴ applied the ARIMA model to analyze the temporal aggregation of seventeen datasets of SST located in the tropical Atlantic ocean. Tripathi et al.⁵ analyzed the MLP and linear regression techniques in the monthly forecasting of SST located in the Indian Ocean. Mahongo and Deo¹ showed that the Nonlinear Autoregressive with Exogenous Input neural network reached an accurate performance in the SST forecast located in the African seashore of the Indian ocean. Garcia-Gorriz and Garcia-Sanchez⁶ employed a system based on MLP to the monthly forecasting of the SST in the western Mediterranean Sea.

One of the primary objectives in time series analysis and forecasting is to develop accurate systems. Among the approaches that use ML methods, hybrid systems that combine models from the error series modeling have reached promising results in many applications^{16,17,18,19,20,21}. Residuals or error series are obtained from the difference between the time series and its forecasting. Such hybrid systems use the residuals modeling to correct biased forecasts that can occur due to the overfitting, underfitting, or misspecification of models^22,23. Hybrid systems commonly design the time series as a combination of a linear statistical model with a nonlinear ML model or as the combination of ML models. The former aims to model the linear and nonlinear patterns of the time series separately^16,17,20. The latter employs ML models for error series modeling intending to improve the accuracy of an initial nonlinear ML model^19,22,23.

To the best of our knowledge, hybrid systems that perform the residuals modeling were not proposed or evaluated for SST time series forecasting. In SST forecasting, the works proposed in the literature commonly use a single method to model the time series under analysis²⁴. These approaches employ mainly linear statistical models or nonlinear ML models for this task^{3,4,5,24,25,26,27}. To fulfill this gap, we perform an empirical evaluation of hybrid systems that use error series modeling in the context of SST time series forecasting. This experimental analysis is of crucial importance because of the adoption of hybrid systems: (i) it generally leads to more accurate results than single models in complex time series modeling^16,17; (ii) it is an efficient way of dealing with the problem of model selection with little extra effort¹⁹; and (iii) it is an effective manner to correct biased and/or misspecified forecasters^22,23. However, the best combination approach between the forecasters of time series and the residuals still is an open question¹⁹ and not yet investigated in the context of SST forecasting. This work evaluates the performance of hybrid systems combining ML models for SST time series forecasting unprecedentedly. In this sense, the objective of this paper is threefold: (a) evaluate whether the residual modeling is an advantageous approach to increasing ML models’ accuracy for SST time series; (b) evaluate two well-known forms of combination (linear and nonlinear) of the literature employing ML models; (c) analyze, for each SST series, which combination is most suitable.

The main contributions of this work can be summarized as:

Proposal of a hybrid system methodology to improve the accuracy of ML models in the SST forecasting;
The performance evaluation of two hybrid systems in the 1-day ahead forecasting SST using three well-known measures: mean square error (MSE), mean absolute percentage error (MAPE) and mean absolute error (MAE);
The development of two versions of each analyzed hybrid system using well-known ML models: SVR and LSTM;
The hybrid systems employing the SVR achieved on average a percentage gain compared to its respective single model of 80.27%, 61.72%, and 60.21% for MSE, MAPE, and MAE, respectively;
The hybrid systems using the LSTM attained an average percentage gain concerning its respective single model of 73.16%, 57.90%, and 56.98% for MSE, MAPE, and MAE, respectively;
The results show that, in general, the developed hybrid systems overcame literature statistical and ML models in the SST forecasting context.

The remainder of the paper is organized as follows. “Related works” section shows the related works of hybrid systems that deal with residual series modeling. This section describes the evaluated hybrid systems in the SST forecasting: Perturbative approach (“The perturbative approach” section) and NoLiC (“The NoLiC method” section). “PIRATA data set” section presents the data set extracted from the PIRATA project website. “Experimental protocol” section shows the experimental protocol used in this work. In “Simulations and experimental results” section, the results and discussions are presented. Finally, “Discussion” section shows the concluding remarks and suggestions for future works.

Related works

Combining models is one of the most common alternatives to enhance the accuracy of forecasting systems^{16,17,28,29,30,31}. In the literature, there are two well-established approaches: ensembles³² and hybrid systems that perform residuals modeling^16,17. Both theoretical and empirical results indicate that the latter approach is an interesting strategy to increase the robustness and accuracy of the forecasts^{16,17,28,29,30,31}.

The general architecture of a hybrid system that performs the residuals modeling can be divided into three main steps: time series forecasting, error series forecasting, and the combination of the two first steps. Equation (1) shows a general view of this architecture, where the final output of the hybrid system ${\hat{y}}_t$ is given by a function f(.) that combines the forecast of the time series $P_0$ with the forecast of the residuals $P_1$ to estimate $y_t$. $P_0$ is the forecast of the time series given by the ${\text {M}}_0$ model (Eq. 2), and $P_1$ is the forecast of the residual series $(e_{t-1},\dots ,e_{t-n})$ given by the ${\text {M}}_1$ model (Eq. 3). The residual series is calculated as the difference between the predicted and the actual values.

$$\begin{aligned} {\hat{y}}_t = f(P_0,P_1), \end{aligned}$$

(1)

where

$$\begin{aligned} P_0 = {\mathrm{M}}_0(y_{t-1},\dots ,y_{t-m}), \end{aligned}$$

(2)

and

$$\begin{aligned} P_1 = {\mathrm{M}}_1(e_{t-1},\dots ,e_{t-n}), \end{aligned}$$

(3)

where m and n are the time lags used as input to the ${\text {M}}_0$ and ${\text {M}}_1$ models, respectively. The time lags can be defined using the auto-correlation function (ACF), partial auto-correlation function (PACF), or some searching algorithm^31,33.

Based on the general architecture described by the f function in Eq. (1), two classes of hybrid systems have been studied for real-world time series modeling: a combination of linear statistical methods with nonlinear Machine Learning (ML) models and a combination of ML models. For simplicity, the first class is denominated as a hybrid system and the second as a combination of ML models. The hybrid system class is described in “Hybrid systems—combining linear and nonlinear models” section. Techniques that combine ML models are presented in “Combining nonlinear models” section. This section also describes two recent techniques: the perturbative approach²² and NoLiC²³.

Hybrid systems—combining linear and nonlinear models

Linear statistical models have been combined with nonlinear ML models based on the assumption that real-world time series generally present linear and nonlinear patterns^16,17. Thus, in this hybrid system class, statistical models are used as ${\text {M}}_0$, and ML models are employed as ${\text {M}}_1$ intended to deal with linear and nonlinear patterns separately.

The f function that is responsible for combining $P_0$ with $P_1$ can be either linear or nonlinear^16,19,23,34. The linear combination, which is more commonly used in the literature^16,29,35, consists of a non-trainable rule, such as the sum. This combination has been successfully used in several applications, for instance: financial indexes²⁹, wind speed³⁵, groundwater level fluctuations³⁶, the prevalence of schistosomiasis in humans³⁷, particulate matter³⁸, and water quality³⁹.

Zhang¹⁶ showed the linear combination:

$$\begin{aligned} {\hat{y}}_t = P_0 + P_1, \end{aligned}$$

(4)

where $P_0$ is the forecasting of a linear statistical model (${\text {M}}_0$), $P_1$ is the forecasting of an ML model (${\text {M}}_1$) applied to the residual of the time series, and ${\hat{y}}_t$ is the final prediction of the hybrid system performed by the linear combination. In his experiments, $M_0$ was defined as an ARIMA and $M_1$ as an MLP neural network.

Despite being widely used in the literature, the linear combination of the forecasts $P_0$ and $P_1$ can underestimate, or degenerate the accuracy of the initial model (${\text {M}}_0$), since there may be no additive relationship between linear and nonlinear forecasts^17,40,41,42.

Based on this assumption, Khashei and Bijari^17,41,42 proposed a nonlinear combination of the forecasts to overcome the limitations of the linear combination. In their hybrid systems^17,41,42, the function f (Eq. 1) is defined as an ML that receives as input $P_0$, the residual ($(e_{t-1},\dots ,e_{t-n})$), and the time series ($y_{t-1},\dots ,y_{t-m}$), as shown in Eq. (5). They employed an ARIMA model for time series modeling as $P_0$, and an MLP as $P_1$.

$$\begin{aligned} {\hat{y}}_t = f(P_0, e_{t-1},\dots ,e_{t-n},y_{t-1},\dots ,y_{t-m}). \end{aligned}$$

(5)

In general, the nonlinear combination of the forecasts $P_0$ and $P_1$ reaches better results than the linear approach^17,41,43. However, there is no guarantee that the nonlinear combination is the most appropriate for modeling any temporal phenomena⁴². Therefore, the best combination function of the forecasts of the time series (linear component) and the residual series (nonlinear component) is unknown, being still a research challenge in the hybrid systems research field^17,23,42.

Combining nonlinear models

Nonlinear ML models have been combined based on the assumption that adopting only one single model can be inadequate to real-world time series forecasting. The underperforming of a single ML model can occur due to problems caused by overfitting, underfitting, or misspecification^22,23,28.

In Ginzburg and Horn²⁸, two MLPs are combined linearly following the same idea shown in Eq. (4). Thus, the time series forecast ($P_0$) is performed by the first MLP (${\text {M}}_0$), and its residuals are modeled by the second MLP (${\text {M}}_1$), generating the forecast of the residuals ($P_1$). In this sense, the ${\text {M}}_1$ model is employed to uncover and model temporal correlations found in the residuals of ${\text {M}}_0$, thus correcting the original forecast ($P_0$). This premise is based on biological systems that commonly deal with complex tasks through subsystems²⁸. Later stages of consecutive subsystems (networks) refine the response of earlier ones, improving the performance of the entire biological system^28,44. This principle was also successfully employed in atmospheric pollution forecasting^31,45.

The next two subsections show two recent approaches that combine nonlinear models: the perturbative approach²² and NoLiC²³.

The perturbative approach

The linear combination of ML models proposed in²⁸ was generalized in²², which employed the perturbation theory concept that was previously applied in many areas, such as physics, chemistry, and mathematics^46,47. The idea is to initiate the forecasting of a time series using a first estimation (forecast $P_0$). Then, p new forecasts ($P_1 + P_2 + \cdots + P_p$) are added to make a partial forecast ever closer to the real solution P. Mathematically,

$$\begin{aligned} P = P_0 + P_1 + P_2 + \cdots + P_p, \end{aligned}$$

where P is the desired solution (perfect forecasting), $P_0$ is the series forecast, and the term of the major contribution to P, and $P_1, P_2, \dots , P_p$ are the p higher-order terms (residual forecasts). Then, $P_1$ is the forecast of the residuals of ${\text {M}}_0$, $P_2$ is the forecast of the residuals of ${\text {M}}_0$ $+$ ${\text {M}}_1$, $P_3$ is the forecast of the residuals of ${\text {M}}_0$ $+$ ${\text {M}}_1+{\text {M}}_2$, and $P_p$ is the forecast of the residuals of ${\text {M}}_0+{\text {M}}_1+{\text {M}}_2+\cdots +{\text {M}}_{p-1}$. Theoretically, the corrections generated by the residual forecasts ($P_1, P_2, \dots , P_p$) decrease since, at each perturbation i, the residual series $E_i$ present values closer to zero. In practice, the contribution of the residual forecasts ($P_1, P_2, \dots , P_p$) depends on the specification and training of the model $M_i$.

Algorithms 1 and 2 show the training and the testing phases of the perturbative approach²², respectively.

The training phase is divided into two general steps: training the time series forecasting model (lines 5–9) and training the correction models based on the residual series (lines 10–19). The algorithm’s input is the training set of the time series, and the outputs are p residuals (error series E) and $p+1$ trained models ($\{M_0, M_1, \ldots , M_p\}$). The training phase has two stop criteria: the maximum number of perturbations (pMax) or an increase in the error value in the validation set concerning antecedent perturbation (lines 14 to 15).

In line 4, the initial model (${\text {M}}_0$) is trained using the training set Y, generating the time series forecast $P_0$ in line 5. $P_0$ is the main contributor to the final solution P²². In line 8, the first error series ($E_1$) is generated. The error series consists of the difference between the actual series and the estimated values provided by the perturbative approach (P).

After, the perturbative terms are generated. Each ${\text {M}}_i$ model is trained to forecast $E_{i}$ (line 10), which is the difference between Y and P (line 14). At each iteration of the loop (lines 9–15), a new perturbative term is generated (lines 10–11) and added to the final solution (line 12). At the end of the training phase, P is the sum of the $p+1$ forecasts ($P_0, P_1, P_2, \dots , P_p$) of the $p+1$ models (${\text {M}}_0+{\text {M}}_1+{\text {M}}_2 \dots + {\text {M}}_p$). Lines 10 to 14 are executed until the stopping criterion is reached.

The testing phase (Algorithm 2) is divided into two steps: forecast of the time series (line 5) and forecast of the perturbative terms (lines 8–12). Lines 5 and 6 show the generating of $P_0$ and its inclusion in the final output P of the perturbative approach. Lines 8 to 12 show the second part, where the perturbative terms are generated for the test point (observation of the test sample). So, each model (${\text {M}}_1$, ${\text {M}}_2, \dots , {\text {M}}_p$) generate the forecasting of its respective error series (${\text {E}}_1, {\text {E}}_2, \dots , {\text {E}}_p$). This loop is repeated (p) times, which is the number of perturbations defined in the training phase. After, each perturbative term is forecasted (line 9) and added to the solution P using a linear combination (line 10), generating the final forecasting.

The NoLiC method

The NoLiC method²³ employs an adaptive combination of ML models with other techniques using the residual series. This combination method does not presuppose a linear combination as other works^16,22,28. The idea is to find a combination function between $P_0$ and $P_1$ using an ML model that is flexible, capable of performing linear and nonlinear modeling.

The Nonlinear Combination (NoLiC) method is composed of three steps: forecast of the time series ($P_0$), forecast of the residuals ($P_1$), and the combination f(.) of $P_0$ and $P_1$. Figure 1 shows the training and testing phases of the NoLiC method.

The training phase receives as input the training set and generates three trained models (${\text {M}}_0$, ${\text {M}}_1$, and M$_{\mathrm{C}}$) as outputs. Similarly to other works^16,28, the models ${\text {M}}_0$ and ${\text {M}}_1$ are employed to forecast the time series and the residuals, respectively. The ${\text {M}}_1$ model’s training is performed using the residuals of ${\text {M}}_0$ ($E_1 = Y_t - P_0$), generating the forecast of the error series $P_1$. After, the combination model M$_{\mathrm{C}}$ receives as inputs $P_0$ and $P_1$ and is trained with the objective to correct the output of ${\text {M}}_0$, generating a forecast (P) closer to the target (future value of $Y_t$).

In the test phase, the ${\text {M}}_0$ and ${\text {M}}_1$ models receive the lag values of the time series ($Y_q$) and the residuals ($E_q$), respectively. After, ${\text {M}}_0$ and ${\text {M}}_1$ models generate theirs respective forecasts $P_0$ and $P_1$. Then, the trained ML model ${\text {M}}_C$ combines the forecasts of the series and residuals to produce the final forecast P.

Remarks

The combination methods described in “The perturbative approach” and “The NoLiC method” sections have different characteristics. The perturbative method can extract information from more than one residual series. However, this method supposes that the models should be combined using a simple sum rule. In contrast, the NoLiC method supposes a nonlinear combination between the forecasts and the residuals.

The NoLiC method employs an ML model aiming to find a combination more suitable than a simple sum. However, there is no guarantee that the ${\mathrm{M}}_{\mathrm{C}}$ model leads to the best accuracy of the hybrid system. The optimum performance depends on adjusting the parameters and training of the ${\mathrm{M}}_{\mathrm{C}}$ model, which is a complex task since it is related to forecasts of ${\mathrm{M}}_0$ and ${\mathrm{M}}_1$. Thus, investigating how to combine the forecasts of the time series and its error series is a crucial issue in the definition of the hybrid system since it is closely related to its accuracy.

PIRATA data set

The Pilot Research Moored Array in the Tropical Atlantic (PIRATA) was developed by a network of observatories composed of many countries, such as Brazil, France, and the United States. This project has the objective to improve the knowledge about atmospheric variations in the tropical Atlantic Ocean⁴⁸. The climatic variations in this area can influence the development of droughts, floods, severe storms, and even hurricanes, affecting millions of people in South America and Africa³ (Fig. 2).

The project PIRATA has buoys in the ocean, where meteorological variables are collected, such as shortwave radiation, relative humidity, air temperature, and ocean surface temperature. All data are gathered and transmitted by satellite and are available on the project web page (https://www.pmel.noaa.gov/gtmba/pirata).

We aim to perform the forecast of the sea surface temperature of three regions³: $8^\circ$ N $38^\circ$ W (Fig. 3a), $10^\circ$ S $10^\circ$ W (Fig. 3b), and $4^\circ$ N $23^\circ$ W (Fig. 3c). These locations were selected because they have an appropriate amount of data for modeling with ML models. These data sets do not present interruptions and are located in different regions. The selected locations can be seen in Fig. 2, represented by red points. Table 1 shows the characteristics of each time series used in this work.

Table 1 Properties of the SST time series used in this work.

Full size table

Experimental protocol

The experiments evaluate two machine learning models: support vector machines (SVR) and long short-term memory (LSTM). These models were chosen because they reached relevant results regarding accuracy for the SST forecasting task^3,49,50. The SVR and LSTM models are employed as single models as well as in the combination approaches.

The SVR model was successfully employed in the SST forecasting³ and has highlighted results in several other forecasting applications⁵¹. SVR is an interesting choice because it employs a quadratic optimization procedure to solve a convex constrained problem, with a single solution⁵². Therefore, in contrast to methods such as neural networks where several local minima can be achieved, the uniqueness of the solution of SVR is obtained given a set of hyperparameters. To the kernel SVR, the Radial Basis Function (RBF) was selected because it is a well-established kernel function in the time series forecasting area⁵¹. RBF kernel also was successfully employed in the SST forecasting³ and has been widely used in hybrid systems^{18,19,20,29,53}. Besides, the RBF is considered the default SVR kernel in the Sklearn⁵⁴ library, which is now the most popular package for creating SVR models in Python. The RBF’s popularity can be explained by its finite and localized responses across the entire range of the x-axis, so it does not need previous assumptions about the data and adds few parameters to the SVR model (Cost and Gamma)⁵⁵.

LSTM was selected because it is one of the state-of-the-art ML models in time series forecasting. It has outperformed traditional neural networks in several applications⁵⁶. Its ability to deal with short or long-term temporal dependencies can be promising in the SST time series modeling⁵⁷.

For the combination approach, the same model (SVR or LSTM) are employed for all stages. For the perturbative approach, the best number of perturbations (or corrections) was selected based on the MSE value in the validation set having a upper limit of four perturbations. For all models, a grid search approach was performed for selecting the best configuration based on the MSE value in the validation set. The data used in the experimental simulations were scaled into the interval [0.1, 0.9], similar to³.

Table 2 shows the set of parameters investigated for each model in the 1-day-ahead forecasting scenario. The number of input lags used in the grid search was selected based on PACF. For S1, the lags 1, 3, 4, 11, and 15 presented significant linear correlations. For S2, PACF selected the lags 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 28, and 30, and for S3, the lags 1, 2, 3, 5, 17, and 18 presented relevant correlation.

Table 2 Values of the parameters of the SVR and LSTM models.

Full size table

Table 3 shows the values selected for single and combination approaches for 1 day ahead SST forecasting for each study case (S1, S2, and S3 series). It is important to highlight that all combination approaches use the same ${\text {M}}_0$, and the NoLiC method employs the same ${\text {M}}_0$ and ${\text {M}}_1$ of the perturbative approach. So, it is possible to compare the performance of the combination approaches directly. For all series, the perturbation approach employed three perturbations for the SVR model and two perturbations for the LSTM model.

Table 3 Selected parameters for SVR and LSTM in the combination approaches using a grid search in the validation set for 1 day ahead SST forecasting.

Full size table

The performance of the approaches are evaluated using three performance metrics applied to the context of sea surface temperature forecast^3,4: mean square error (MSE), mean absolute percentage error (MAPE) and mean absolute error (MAE). Equations (6), (7) and (8) show the MSE, MAPE and MAE metrics, respectively.

$$\begin{aligned} {\mathrm{MSE}}= & {} \displaystyle \frac{1}{N}\sum _{t=1}^{N}(y_t - {\hat{y}}_t)^2, \end{aligned}$$

(6)

$$\begin{aligned} {\mathrm{MAPE}}= & {} \displaystyle \frac{100}{N}\sum _{t=1}^{N} \left| \frac{y_t - {\hat{y}}_t}{y_t}\right| , \end{aligned}$$

(7)

$$\begin{aligned} {\mathrm{MAE}}= & {} \displaystyle \frac{1}{N}\sum _{t=1}^{N} |y_t - {\hat{y}}_t|, \end{aligned}$$

(8)

where N represents the time series length, $y_t$ the true value at time t and ${\hat{y}}_t$ is the forecast at time t. For all metrics, the lower the values, the better the results. A percentage gain/loss measure (Eq. 9) is used to compare the combination approaches with the single models.

$$\begin{aligned} {\mathrm{PC}} = \displaystyle \frac{{\mathrm{Metric}}_{\mathrm{sm}} - {\mathrm{Metric}}_{\mathrm{comb}}}{{\mathrm{Metric}}_{\mathrm{sm}}}\times 100, \end{aligned}$$

(9)

where ${\mathrm{Metric}}_{\mathrm{sm}}$ and ${\mathrm{Metric}}_{\mathrm{comb}}$ represent the MSE values reached by single models and combination approaches, respectively. In this way, the higher the PC, the better the performance of the combination approach in relation to the single model.

The SVR and LSTM models were implemented in the Python programming language using the Sklearn⁵⁴ and Keras⁵⁹ libraries. The experimental simulations were performed in a computer with a single Intel Core i7-7500 CPU and 20 GB RAM.

The experimental comparison was carried out among the single models, the hybrid approaches, and the following literature models: Exponential Smoothing (ETS)⁶⁰, Convolution LSTM (ConvLSTM)^11,50, and the Nonlinear Autoregressive Exogenous (NARX)¹.

Exponential Smoothing (ETS) is a traditional statistical method employed in time series forecasting^60,61. It is a versatile method due to its ability to model time series with/without trend and seasonality components. However, the ETS can reach a limited performance in forecasting time series that present nonlinear patterns⁶². The experiments with ETS were carried out using the Statsmodel library of Python⁶³.

The Convolutional Long Short-Term Memory (ConvLSTM)^11,50 is a Deep Learning technique able to model spatiotemporal correlations. The ConvLSTM models spatial and temporal patterns using convolution and LSTM layers. This technique attained higher accuracy than other ML models to SST time series forecasting tasks^11,50. On the other hand, its training can be costly computationally due to the number of hyper-parameters that must be adjusted⁶⁴. In this work, the employed ConvLSTM used the configuration suggested in the SST forecasting works^11,50.

Nonlinear Autoregressive with Exogenous Input neural network (NARX) was proposed to model the nonlinear and autoregressive behaviors⁶⁵. NARX model was successfully used to predict SST anomalies in the western Indian Ocean region¹. Despite being able to forecast seasonal anomaly trends, the NARX performance is highly sensitive to parameters specification¹.

Simulations and experimental results

Table 4 shows the results regarding MSE, MAPE and MAE for the test set of the S1, S2, and S3 series, for 1-day ahead forecasting. In that table is possible to compare the performance of the hybrid systems with single and literature models.

Table 4 Comparison in terms of MSE, MAPE, and MAE of the combination approaches with single statistical and Machine Learning models of the literature applied to the SST daily forecasting.

Full size table

For the S1 time series, all hybrid systems improved the accuracy of their respective single model, reaching better MSE, MAPE, and MAE values better than literature models. In particular, the NoLiC employing the LSTM model attained the best result in all considered metrics. Regarding MSE, the hybrid system versions obtained an error of one order of magnitude smaller than their respective single models, for instance, 3.97E−04 for NoLiC+LSTM and 6.89E−04 for LSTM.

The S2 and S3 series follow the same behavior: all hybrid systems versions improved the performance of their respective single models for the evaluated metrics. For the S2 time series, the perturbation approach using the SVR model attained the best performance in terms of MSE, MAPE, and MAE. This hybrid system version, which employed three perturbations ($P_0 + P_1 + P_2 + P_3$), improved the MSE value in two orders of magnitude regarding the single SVR.

Hybrid systems that use the LSTM model deserve special attention for the S3 time series. In this case, the NoLiC attained the best MSE value, while the perturbative approach ($P_0 + P_1 + P_2$) obtained the smallest MAPE and MAE. Both hybrid system versions of single SVR and LSTM improved the MSE value in one order of magnitude. Among the single and literature models, the NARX¹ achieved the best results for the evaluated times series.

Tables 5 and 6 show the percentage difference (Eq. 9) in terms of the MSE metric between the literature models and the perturbative and NoLiC approaches, respectively. The tables show that the hybrid systems improved the performance of both single models for the S1 series. The NoLiC using the LSTM model attained an improvement greater than 65% for all evaluation metrics (Table 5). The versions of the hybrid systems attained a superior performance, at least 20% when compared with the literature. Figure 4a,b show the forecasts of the S1 series test set of the hybrid approaches using SVR and LSTM, respectively. It can be seen that both hybrid systems were able to improve the forecasting of the single models. Both hybrid approaches achieved forecasts closer to the real when compared with the initial model.

Table 5 Percentage difference between the perturbative approach and literature models for MSE, MAPE, and MAE.

Full size table

Table 6 Percentage difference between the NoLic and literature models for MSE, MAPE, and MAE.

Full size table

For S2, Table 5 shows that the perturbative approach reached an improvement higher than 30% in all comparisons. This approach with SVR obtained a percentage gain regarding SVR of 98.93%, 90.56%, and 91.09% for MSE, MAPE, and MAE, respectively. Table 6 shows the NoLiC using LSTM model attained a gain concerning to single LSTM of 98.42%, 88.40% and 88.87% for MSE, MAPE, and MAE, respectively. Figure 4c,d show the forecasts of the S2 series test set of the hybrid approaches using SVR and LSTM, respectively. Both figures show that the hybrid systems improved the forecast of the single models. In both comparisons, it is possible to verify that the forecast of the hybrid systems using SVR or LSTM is closer to the test set of S2 than the respective single model.

Tables 5 and 6 show that the percentage difference between hybrid systems with LSTM and single models is positive in all comparisons for the S3 data set. The NoLiC using SVR obtained the greatest improvement regarding single SVR with 65.03%, 39.50%, and 37.68% for MSE, MAPE, and MAE, respectively. Figure 4e,f show the forecasts of the S2 series test set of the hybrid approaches using SVR and LSTM, respectively. The forecasts obtained by the perturbative and NoLiC approaches are closer to S3 series when compared with the single models. Supplementary Information presents additional analyzes.

Discussion

To verify if there are (or not) significant statistical differences between the hybrid systems and literature approaches, we employed the Diebold–Mariano statistical test⁶⁶. We use MSE since it is the target metric employed to guide the search of the parameters of the models. Table 7 shows that both versions of the perturbative approach attain MSE values statistically different from the single and literature models, i.e., the p value is smaller than the significance level adopted (0.05) in all comparisons. The NoLiC employing LSTM also reached results statistically better than other models. Only the NoLiC version using SVR attained an MSE worse than NARX¹ and ConvLSTM^11,50 models.

Table 7 Results of the comparison of the hybrid systems using SVR and LSTM with single and literature models using Diebold–Mariano hypothesis test.

Full size table

Table 8 shows the execution time (in seconds) of the testing phase calculated over 30 executions. The evaluated approaches presented an execution time smaller than 1 s in all data sets. It is important to highlight that the hybrid systems based on the LSTM model are more costly regarding computational effort than the ones based on SVR. For instance, the SVR’s perturbative approaches were less computationally costly than single LSTM for S1 and S2 series.

Table 8 Testing time in seconds of the single models and combination approaches for 1 day ahead forecasting.

Full size table

The complexity analysis of the hybrid system can be divided into p steps, each one corresponding to the training of a model (${\text {M}}_0, {\text {M}}_1, \ldots , {\text {M}}_p$). The evaluated hybrid systems are trained sequentially, and their training time can be described as ${\text {MT}}_0 + {\text {MT}}_1 + \cdots + {\text {MT}}_p$, where MT is the training time of the model in a specific phase. In this way, the NoLiC approach is approximately three times more expensive than the single models, because the NoLiC uses three models (${\text {M}}_0, {\text {M}}_1$, and $M_{\mathrm{C}}$), and the perturbative approach is approximately p times more expensive than the single models because it uses ${\text {M}}_0, {\text {M}}_1, \ldots , {\text {M}}_p$ models. This work applies two compositions of hybrid systems, using SVR or LSTM. The SVR training process has a complexity of O(lm)^18,67, where l is the size of the data set, and m represents the number of input features. The training process of the LSTM has a complexity of O(W), where W is the total number of parameters⁶⁸.

Conclusion

The Sea Surface Temperature (SST) is an important environmental variable due to its strong relationship to climate, weather, and nature events, such as El Niño. So, the SST accurate forecast can support decisions in several science fields.

In this work, we evaluated two types of hybrid systems intending to improve the performance of single ML models in the task of SST forecast. The hybrid systems are evaluated in the 1-day-ahead forecasting scenario. The purpose was to correct biased and deteriorated forecasts of the ML models by modeling the error series. The Perturbative and NoLiC hybrid approaches employ linear and nonlinear combinations, respectively. For each approach, two versions were generated, one using SVR and another using LSTM as base models. All the models were evaluated in three data sets of different locations in the tropical Atlantic using traditional metrics (MSE, MAPE, and MAE) of the literature. Compared with the ML single models, the hybrid system approaches obtained a significant performance improvement (more than 20%).

Regarding the hybrid systems, it was possible to verify the influence of the combination function in their performance. The linear combination used by the Perturbative approach obtained the best performance in two out of three study cases regarding MSE. Although NoLiC employs a combination function more versatile than the simple sum, it could not overcome the linear combination in most cases. In particular, when the perturbative approach uses the LSTM as the base model, it reached the highest performance in three out of five cases. The LSTM’s best performance compared to the other ML models can be attributed to its ability to capture long-term temporal dependencies due to its recurrent abilities, such as memory cells1. In this way, the LSTM can consider previous training examples on its forecasting process, creating a better understanding of the past data and a more robust combination process.

It is crucial to remark that both single and hybrid models struggled to forecast extreme points. This issue is a challenging task in the time series literature¹². Another is the absence of sufficient extreme cases in the training set, which can bias the training process towards more regular cases and the applied target metric (MSE). The hybrid system’s computational effort is the sum of the costs of modeling its counterparts and depends on each model’s parameters set. The computational cost can be minimized in the test phase, parallelizing the time series and residual forecasting.

For future works, we intend to improve the accuracy of the hybrid systems to better forecast extreme values by automatically searching for the most suitable combination function. Besides, different base models, such as convolutional neural networks, echo state networks, and decision trees for regression, can be investigated.

References

Mahongo, S. & Deo, M. Using artificial neural networks to forecast monthly and seasonal sea surface temperature anomalies in the Western Indian Ocean. Int. J. Ocean Clim. Syst. 4, 133–150 (2013).
Google Scholar
Reynolds, R. W. & Smith, T. M. Improved global sea surface temperature analyses using optimum interpolation. J. Clim. 7, 929–948 (1994).
ADS Google Scholar
Lins, I. D., Araujo, M., das Chagas Moura, M., Silva, M. A. & Droguett, E. L. Prediction of sea surface temperature in the tropical Atlantic by support vector machines. Comput. Stat. Data Anal. 61, 187–198 (2013).
MathSciNet MATH Google Scholar
Salles, R. et al. Evaluating temporal aggregation for predicting the sea surface temperature of the Atlantic Ocean. Ecol. Inform. 36, 94–105 (2016).
Google Scholar
Tripathi, K., Das, I. & Sahai, A. Predictability of sea surface temperature anomalies in the Indian Ocean using artificial neural networks. Indian J. Mar. Sci. 35, 210–220 (2006).
Google Scholar
Garcia-Gorriz, E. & Garcia-Sanchez, J. Prediction of sea surface temperatures in the western Mediterranean Sea by neural networks using satellite observations. Geophys. Res. Lett. 34, 1–6 (2007).
Google Scholar
Cho, J. et al. A study on the relationship between Atlantic Sea surface temperature and Amazonian greenness. Ecol. Inform. 5, 367–378 (2010).
Google Scholar
NOAA. NOAA national oceanic and atmospheric administration (accessed 31 October 2017); https://oceanservice.noaa.gov/facts/sea-surface-temperature.html (2017).
Chatfield, C. The Analysis of Time Series: An Introduction (CRC Press, 2016).
MATH Google Scholar
Michalski, R. S., Carbonell, J. G. & Mitchell, T. M. Machine Learning: An Artificial Intelligence Approach (Springer, 2013).
MATH Google Scholar
Xiao, C. et al. A spatiotemporal deep learning model for sea surface temperature field prediction using time-series satellite data. Environ. Modell. Softw. 120, 104502 (2019).
Google Scholar
Box, G. E., Jenkins, G. M., Reinsel, G. C. & Ljung, G. M. Time Series Analysis: Forecasting and Control (Wiley, 2015).
MATH Google Scholar
Zhang, G., Patuwo, B. E. & Hu, M. Y. Forecasting with artificial neural networks: The state of the art. Int. J. Forecast. 14, 35–62 (1998).
CAS Google Scholar
Sulaiman, J. & Wahab, S. H. Heavy rainfall forecasting model using artificial neural network for flood prone area. IT Converg. Secur. 449, 68–76 (2018).
Google Scholar
Basak, D., Pal, S. & Patranabis, D. C. Support vector regression. Neural Inf. Process. Lett. Rev. 11, 203–224 (2007).
Google Scholar
Zhang, G. P. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 50, 159–175 (2003).
MATH Google Scholar
Khashei, M. & Bijari, M. A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Appl. Soft Comput. 11, 2664–2675 (2011).
Google Scholar
de Oliveira, J. F. & Ludermir, T. B. A hybrid evolutionary decomposition system for time series forecasting. Neurocomputing 180, 27–34 (2016).
Google Scholar
Silva, E. G., Domingos, S. d. O., Cavalcanti, G. D. & de Mattos Neto, P. S. Improving the accuracy of intelligent forecasting models using the perturbation theory. In 2018 International Joint Conference on Neural Networks (IJCNN) 1–7 (IEEE, 2018).
Santos, D. S. d. O. Jr., de Oliveira, J. F. & de Mattos Neto, P. S. An intelligent hybridization of ARIMA with machine learning models for time series forecasting. Knowl. Based Syst. 175, 72–86 (2019).
Google Scholar
de Oliveira, J. F., Silva, E. G. & de Mattos Neto, P. S. A hybrid system based on dynamic selection for time series forecasting. IEEE Trans. Neural Netw. Learn. Syst. 1–13. https://ieeexplore.ieee.org/document/9340584 (2021).
de Mattos Neto, P. S., Ferreira, T. A., Lima, A. R., Vasconcelos, G. C. & Cavalcanti, G. D. A perturbative approach for enhancing the performance of time series forecasting. Neural Netw. 88, 114–124 (2017).
PubMed Google Scholar
de Mattos Neto, P. S., Cavalcanti, G. D. & Madeiro, F. Nonlinear combination method of forecasters applied to PM time series. Pattern Recognit. Lett. 95, 65–72 (2017).
ADS Google Scholar
Haghbin, M., Sharafati, A., Motta, D., Al-Ansari, N. & Noghani, M. Applications of soft computing models for predicting sea surface temperature: A comprehensive review and assessment. Prog. Earth Planet. Sci. 8, 1–19 (2021).
ADS Google Scholar
McDermott, P. L. & Wikle, C. K. Bayesian recurrent neural network models for forecasting and quantifying uncertainty in spatial-temporal data. Entropy 21, 184 (2019).
ADS MathSciNet PubMed Central Google Scholar
Qian, S. et al. Seasonal rainfall forecasting for the Yangtze River Basin using statistical and dynamical models. Int. J. Climatol. 40, 361–377 (2020).
Google Scholar
Sun, Y. et al. Time-series graph network for sea surface temperature prediction. Big Data Res. 25, 100237 (2021).
Google Scholar
Ginzburg, I. & Horn, D. Combined neural networks for time series analysis. In NIPS’93: Proceedings of the 6th International Conference on Neural Information Processing Systems 224–231. https://dl.acm.org/doi/10.5555/2987189.2987218 (1993).
Pai, P.-F. & Lin, C.-S. A hybrid ARIMA and support vector machines model in stock price forecasting. Omega 33, 497–505 (2005).
Google Scholar
Firmino, P. R. A., de Mattos Neto, P. S. & Ferreira, T. A. Error modeling approach to improve time series forecasters. Neurocomputing 153, 242–254 (2015).
Google Scholar
de Mattos Neto, P. S., Cavalcanti, G. D., Madeiro, F. & Ferreira, T. A. An approach to improve the performance of PM forecasters. PLoS ONE 10, e0138507 (2015).
PubMed PubMed Central Google Scholar
Gheyas, I. A. & Smith, L. S. A novel neural network ensemble architecture for time series forecasting. Neurocomputing 74, 3855–3864 (2011).
Google Scholar
Ribeiro, G. H. T., de Mattos Neto, P. S. G., Cavalcanti, G. D. C. & Tsang, I. R. Lag selection for time series forecasting using particle swarm optimization. In The 2011 International Joint Conference on Neural Networks 2437–2444 (2011).
Chen, K.-Y. & Wang, C.-H. A hybrid SARIMA and support vector machines in forecasting the production values of the machinery industry in Taiwan. Expert Syst. Appl. 32, 254–264 (2007).
Google Scholar
Cadenas, E. & Rivera, W. Wind speed forecasting in three different regions of Mexico, using a hybrid ARIMA-ANN model. Renew. Energy 35, 2732–2738 (2010).
Google Scholar
Yan, Q. & Ma, C. Application of integrated ARIMA and RBF network for groundwater level forecasting. Environ. Earth Sci. 75, 396 (2016).
Google Scholar
Zhou, L. et al. Using a hybrid model to forecast the prevalence of schistosomiasis in humans. Int. J. Environ. Res. Public Health 13, 355 (2016).
PubMed PubMed Central Google Scholar
Wongsathan, R. & Seedadan, I. A hybrid ARIMA and neural networks model for PM-10 pollution estimation: The case of Chiang Mai city moat area. Procedia Comput. Sci. 86, 273–276 (2016).
Google Scholar
Faruk, D. Ö. A hybrid neural network and ARIMA model for water quality time series prediction. Eng. Appl. Artif. Intell. 23, 586–594 (2010).
Google Scholar
Taskaya-Temizel, T. & Casey, M. C. A comparative study of autoregressive neural network hybrids. Neural Netw. 18, 781–789 (2005).
PubMed Google Scholar
Khashei, M. & Bijari, M. An artificial neural network (p, d, q) model for timeseries forecasting. Expert Syst. Appl. 37, 479–489 (2010).
MATH Google Scholar
Khashei, M. & Bijari, M. Which methodology is better for combining linear and nonlinear models for time series forecasting?. J. Ind. Syst. Eng. 4, 265–285 (2011).
Google Scholar
Zhu, B. & Wei, Y. Carbon price forecasting with a novel hybrid ARIMA and least squares support vector machines methodology. Omega 41, 517–524 (2013).
Google Scholar
Corrado, P. & Morine, M. J. Analysis of Biological Systems (World Scientific Publishing Company, 2015).
Google Scholar
Ettouney, R. S., Mjalli, F. S., Zaki, J. G., El-Rifai, M. A. & Ettouney, H. M. Forecasting of ozone pollution using artificial neural networks. Manag. Environ. Qual. Int. J. 20, 668–683 (2009).
Google Scholar
Sakurai, J. J. Modern Quantum Mechanics Revised. (Addison Wesley, 1995).
Google Scholar
de Mattos Neto, P. S., Junior, A. R. L., Ferreira, T. A. & Cavalcanti, G. D. An intelligent perturbative approach for the time series forecasting problem. In The 2010 International Joint Conference on Neural Networks (IJCNN) 1–8 (IEEE, 2010).
Bourlès, B. et al. The PIRATA program: History, accomplishments, and future directions. Bull. Am. Meteorol. Soc. 89, 1111–1125 (2008).
ADS Google Scholar
Zhang, Q., Wang, H., Dong, J., Zhong, G. & Sun, X. Prediction of sea surface temperature using long short-term memory. IEEE Geosci. Remote Sens. Lett. 14, 1745–1749 (2017).
ADS Google Scholar
Yang, Y. et al. A CFCC-LSTM model for sea surface temperature prediction. IEEE Geosci. Remote Sens. Lett. 15, 207–211 (2018).
ADS Google Scholar
Sapankevych, N. I. & Sankar, R. Time series prediction using support vector machines: A survey. IEEE Comput. Intell. Mag. 4, 24–38. https://doi.org/10.1109/MCI.2009.932254 (2009).
Article Google Scholar
Smola, A. J. & Schölkopf, B. A tutorial on support vector regression. Stat. Comput. 14, 199–222 (2004).
MathSciNet Google Scholar
de Mattos Neto, P. S. G. et al. A hybrid nonlinear combination system for monthly wind speed forecasting. IEEE Access 8, 191365–191377. https://doi.org/10.1109/ACCESS.2020.3032070 (2020).
Article Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Nisbet, R., Elder, J. & Miner, G. Chapter 8—Advanced Algorithms for Data Mining. In Handbook of Statistical Analysis and Data Mining Applications (eds Nisbet, R. et al.) 151–172 (Academic Press, 2009).
MATH Google Scholar
Mahmoud, A. & Mohammed, A. A survey on deep learning for time-series forecasting. In Machine Learning and Big Data Analytics Paradigms: Analysis, Applications and Challenges (eds Hassanien, A. E. & Darwish, A.) 365–392 (Springer, 2021).
Google Scholar
Li, X. Sea surface temperature prediction model based on long and short-term memory neural network. In IOP Conference Series: Earth and Environmental Science, vol. 658, 012040 (IOP Publishing, 2021).
Wielgosz, M., Skoczeń, A. & Mertik, M. Recurrent neural networks for anomaly detection in the post-mortem time series of LHC superconducting magnets. arXiv:1702.00833 (2017).
Chollet, F. et al. Keras: The python deep learning library. In ASCL ascl–1806 (2018).
Hyndman, R. J. & Khandakar, Y. Automatic time series forecasting: The forecast package for R. J. Stat. Softw. Articles 27, 1–22 (2008).
Google Scholar
Hyndman, R. J. & Athanasopoulos, G. Forecasting: Principles and Practice (OTexts, 2018).
Google Scholar
Lai, K. K., Yu, L., Wang, S. & Huang, W. Hybridizing exponential smoothing and neural network for financial time series predication. In International Conference on Computational Science 493–500 (Springer, 2006).
Seabold, S. & Perktold, J. Statsmodels: Econometric and statistical modeling with python. In Proceedings of the 9th Python in Science Conference, vol. 57, 61 (2010).
Justus, D., Brennan, J., Bonner, S. & McGough, A. S. Predicting the computational cost of deep learning models. In 2018 IEEE International Conference on Big Data (Big Data) 3873–3882 (IEEE, 2018).
Xie, H., Tang, H. & Liao, Y.-H. Time series prediction based on NARX neural networks: An advanced approach. In 2009 International Conference on Machine Learning and Cybernetics, vol. 3, 1275–1279 (IEEE, 2009).
Diebold, F. X. & Mariano, R. S. Comparing predictive accuracy. J. Bus. Econ. Stat. 20, 134–144 (2002).
MathSciNet Google Scholar
Chang, C.-C. & Lin, C.-J. LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2, 27 (2011).
Google Scholar
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
PubMed CAS Google Scholar

Download references

Acknowledgements

The authors thank the National Council for Scientific and Technological Development (CNPQ—Brazil) and FACEPE for financial support.

Author information

These authors contributed equally: George D. C. Cavalcanti, Domingos S. de O. Santos Júnior and Eraylson G. Silva.

Authors and Affiliations

Centro de Informática, Universidade Federal de Pernambuco, Recife, Pernambuco, Brazil
Paulo S. G. de Mattos Neto, George D. C. Cavalcanti, Domingos S. de O. Santos Júnior & Eraylson G. Silva

Authors

Paulo S. G. de Mattos Neto
View author publications
You can also search for this author in PubMed Google Scholar
George D. C. Cavalcanti
View author publications
You can also search for this author in PubMed Google Scholar
Domingos S. de O. Santos Júnior
View author publications
You can also search for this author in PubMed Google Scholar
Eraylson G. Silva
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.S.G.M.N. and G.D.C.C. conceived the experiment(s), E.G.S and D.S.O.S.J. conducted the experiment(s), P.S.G.M.N., E.G.S and D.S.O.S.J. analysed the results. All authors wrote and reviewed the manuscript.

Corresponding author

Correspondence to Paulo S. G. de Mattos Neto.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

de Mattos Neto, P.S.G., Cavalcanti, G.D.C., de O. Santos Júnior, D.S. et al. Hybrid systems using residual modeling for sea surface temperature forecasting. Sci Rep 12, 487 (2022). https://doi.org/10.1038/s41598-021-04238-z

Download citation

Received: 28 August 2021
Accepted: 17 December 2021
Published: 11 January 2022
DOI: https://doi.org/10.1038/s41598-021-04238-z

This article is cited by

Hybrid physics-machine learning models for predicting rate of penetration in the Halahatang oil field, Tarim Basin
- Shengjie Jiao
- Wei Li
- Yinao Su
Scientific Reports (2024)
An error correction system for sea surface temperature prediction
- Ricardo de A. Araújo
- Paulo S. G. de Mattos Neto
- Sergio C. B. Soares
Neural Computing and Applications (2023)
Systematic Literature Review of Various Neural Network Techniques for Sea Surface Temperature Prediction Using Remote Sensing Data
- Lalita Chaudhary
- Shakti Sharma
- Mohit Sajwan
Archives of Computational Methods in Engineering (2023)
Forecasting blood demand for different blood groups in Shiraz using auto regressive integrated moving average (ARIMA) and artificial neural network (ANN) and a hybrid approaches
- Seddigheh Edalat Sarvestani
- Nahid Hatam
- Mohsen Bayati
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Related works

Hybrid systems—combining linear and nonlinear models

Combining nonlinear models

The perturbative approach

The NoLiC method

Remarks

PIRATA data set

Experimental protocol

Simulations and experimental results

Discussion

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links