Development and evaluation of a deep learning approach for modeling seasonality and trends in hand-foot-mouth disease incidence in mainland China

Wang, Yongbin; Xu, Chunjie; Zhang, Shengkui; Yang, Li; Wang, Zhende; Zhu, Ying; Yuan, Juxiang

doi:10.1038/s41598-019-44469-9

Download PDF

Article
Open access
Published: 29 May 2019

Development and evaluation of a deep learning approach for modeling seasonality and trends in hand-foot-mouth disease incidence in mainland China

Yongbin Wang¹^na1,
Chunjie Xu²^na1,
Shengkui Zhang¹,
Li Yang¹,
Zhende Wang¹,
Ying Zhu¹ &
…
Juxiang Yuan¹

Scientific Reports volume 9, Article number: 8046 (2019) Cite this article

3849 Accesses
26 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The high incidence, seasonal pattern and frequent outbreaks of hand, foot, and mouth disease (HFMD) represent a threat for millions of children in mainland China. And advanced response is being used to address this. Here, we aimed to model time series with a long short-term memory (LSTM) based on the HFMD notified data from June 2008 to June 2018 and the ultimate performance was compared with the autoregressive integrated moving average (ARIMA) and nonlinear auto-regressive neural network (NAR). The results indicated that the identified best-fitting LSTM with the better superiority, be it in modeling dataset or two robustness tests dataset, than the best-conducting NAR and seasonal ARIMA (SARIMA) methods in forecasting performances, including the minimum indices of root mean square error, mean absolute error and mean absolute percentage error. The epidemic trends of HFMD remained stable during the study period, but the reported cases were even at significantly high levels with a notable high-risk seasonality in summer, and the incident cases projected by the LSTM would still be fairly high with a slightly upward trend in the future. In this regard, the LSTM approach should be highlighted in forecasting the epidemics of HFMD, and therefore assisting decision makers in making efficient decisions derived from the early detection of the disease incidents.

A method for hand-foot-mouth disease prediction using GeoDetector and LSTM model in Guangxi, China

Article Open access 29 November 2019

Using deep learning to predict the hand-foot-and-mouth disease of enterovirus A71 subtype in Beijing from 2011 to 2018

Article Open access 22 July 2020

Trend analysis and forecast of daily reported incidence of hand, foot and mouth disease in Hubei, China by Prophet model

Article Open access 14 January 2021

Introduction

Hand, foot and mouth disease (HFDM) is a common acute infectious disease in children, the majority (91%) of whom are under 5 years¹. Most show mild symptoms mainly characterized by fever and rash on the hands, feet and mouth. A small minority have more severe complications, such as myocarditis, pulmonary edema and aseptic meningoencephalitis, some of which are fatal². The infections are predominantly caused by coxsackievirus A16 (CVA16) and human enterovirus 71 (EV71), although other viruses can be involved³. The viruses are thought to be predominantly transmitted from child to child by direct and indirect contact, including droplets, droplet nuclei, dust, water and food^1,4. Furthermore, approximately half of the individuals may be coinfected with more than one pathogenic agent⁵. HFMD was first reported in New Zealand in 1957, and since then, millions of cases and many outbreaks have been reported worldwide⁶. However, the worldwide epidemiology of HFMD has dramatically changed during the past decade, especially in East and Southeast Asian countries such as China, Brunei, Malaysia, Mongolia, Singapore and Vietnam⁷, where epidemics and numerous large-scale outbreaks of HFMD have occurred, resulting in enormous burdens of disease and global public health concerns^3,8,9.

In mainland China, after the first reported case of HFMD in Shanghai in 1981¹⁰, several outbreaks have been reported and have caused the deaths of numerous children³. Moreover, HFMD affects more than two million children annually in mainland China¹¹, and the number of cases and deaths caused by HFMD sporadics, epidemics and outbreaks invariably tops the list of monitored class C diseases every year since HFMD was designated as a notifiable disease in 2008^3,12. Since 2016, although an available vaccine that only plays a role in the infection caused by the EV71 virus has been introduced to prevent HFMD¹³, the potentially worsening trend in the reported cases of HFMD has not been reversed. Importantly, it is estimated that there is an increasing risk of ongoing HFMD recurrence in China in recent years¹⁴ and HFMD still continues to exert a significant influence on the general susceptible population. Early detection and advanced warning for the timing, extent and duration of HFMD epidemics will be particularly valuable in formulating effective prevention and intervention strategies to minimize the damage caused by the infection¹⁵. Therefore, a reliable forecasting technique to track the temporal patterns of HFMD is needed.

The existing early warning models for forecasting the morbidity and mortality of infectious diseases mainly consist of linear and nonlinear models, along with their hybrids¹⁶. Additionally, the autoregressive integrated moving average (ARIMA) model is one of the best linear models in terms of performance for a specified time series¹⁷; the nonlinear autoregressive neural network (NAR) approach is among the nonlinear models with arbitrarily expected accuracy that can effectively extract the meaningful dynamic information of a data sequence¹⁶. Both the ARIMA and NAR models are well suited to study the future trends of the morbidity or mortality time series of diseases with stationary short-term dependencies based on the aggregated long trajectories¹⁶. However, long-term trajectory modeling, which is most often encountered in epidemiological prediction, is frequently characterized by non-stationary long-term lags over time. Additionally, when employing an NAR model to connect the preceding information located into the time-varying series to the present assignment, with the growing gap between the past inputs and estimated outputs, the NAR technique will encounter a vanishing or exploding gradient problem during training, which makes it difficult to develop the long-term dependence structure in a time series¹⁸. The long short-term memory (LSTM) architecture, a type of deep learning network that has been extensively studied and applied to quite a few frontier fields, such as voice recognition¹⁹, video classification²⁰ and speech synthesis systems²¹, comprises a cluster of recurrently connected subnets that allow the LSTM method to store and access information over long periods of time, hence mitigating the vanishing or exploding gradient problem²². However, there is a current lack of research focusing specifically on the applicability of LSTM model in the forecasting of infectious diseases with time series analysis. Therefore, motivated by the merits of the LSTM model, the burden of HFMD and the persistently high incidence in mainland China, we aim to forecast the notified incident cases of HFMD with a LSTM model. Meanwhile, the simulating and predictive abilities of the LSTM model were compared with two especially useful estimation models, including the ARIMA and NAR methods, to seek the best-fitting time series modeling technique for HFMD, which will be of great help in initiating guidance planning and effective intervention measures for HFMD-prevention in mainland China.

Results

General information

The time series of notified cases of HFMD included 121 observations from June 2008 to June 2018, and a total of 19,218,824 cases were reported. The monthly average of case notifications was 158,834, which led to an annualized average morbidity rate of 133,894 cases per 100,000 population with a standard error of 10,825 cases during the whole period. Between June 2008 and June 2017, 17,072,500 cases occurred, and the number of morbidity cases increased from 102,223 to 308,789, with an overall increase of 202.074% throughout the past decade. The highest incidence peak of 528,777 cases was observed in May 2014, which was a marginal increase of 417.278% compared to June2008. When the Hodrick-Prescott (HP) decomposition approach was performed to smooth the short-term monthly effect of the observed incidence case series from June 2008 to June 2018, it was found that there were clear seasonal peaks, specifically in May and June of every year, a trough in January and February and an evident 12-month cyclical process. In addition, notwithstanding a slight decline that existed between January 2014 and June 2018, there were still a substantially large number of reported cases each year (Fig. 1 and Supplementary Fig. S1).

The best-performing SARIMA model

Before modeling, the original in-sample observations were examined using the augmented Dickey-Fuller (ADF) test (ADF = −2.3318, P = 0.1642), which is indicative of an obviously non-stationary morbidity case series. Therefore, according to the results from the HP filter and ADF test, the first-order seasonal and non-seasonal differences were used to stabilize the variance and mean to suit the modeling requirement of a stationary sequence (ADF = −4.4129, P = 0.0006). Analysis of the spikes in the ACF and PACF plots at varying lags with the transformed incidence case series for HFMD resulted in the selection of several candidate models by trial and error to further detect the best-performing specification (Supplementary Figs S2 and S3). Finally, taking the error correlations between the ACF and PACF plots comprehensively and taking the AIC, AICc and SBC into consideration (Fig. 2 and Supplementary Table S1), the preferred model of SARIMA(1,1,2)(1,1,0)₁₂ was generated with minimized AIC, AICc and SBC values of 2339.41, 2340.08 and 2352.23, respectively. When the error correlations at lags fell into the estimated 95% confidence bounds (Fig. 2b,c), the Ljung-Box Q test showed that the residuals from the SARIMA model obtained desirable white noise (Fig. 2d and Table 1), and the LM test suggested that no ARCH effect was found at various lags in the residual series (Table 2). Furthermore, the test results of the estimated parameters were also all significant (Supplementary Table S1). Nevertheless, the only complication was that the Q-Q plot of the residuals showed a clear departure from normality at the tails (Supplementary Fig. S4). The specified equation of the SARIMA model was expressed as (1-B)(1-B¹²)X_t = (1 + 0.524B + 0.394B²)ɛ_t/(1 − 0.434B)(1 + 0.6B¹²). In the same way, the reported case series of HFMD from June 2008 to December 2016 and December 2017 was utilized to account for the robustness of the model. The best-fitting SARIMA method constructed using the first 103 in-sample observations was identified as a SARIMA(1,0,1)(1,1,1)₁₂ form, and Supplementary Figs S5–S8 and Tables S2–S5 provide the results of the diagnostic tests for this optimal SARIMA approach; the best-simulating SARIMA approach obtained with the first 115 observed points was still identified as a SARIMA(1,0,1)(1,1,1)₁₂ form, and the identified parameters and diagnostic tests for the preferred approach are given in Supplementary Figs S9–S12 and Tables S6–S9. Then, these optimal methods chosen were utilized to calculate out-of-sample predictions.

Table 1 Ljung-Box Q test of the residuals for the selected three optimal models at different lags.

Full size table

Table 2 ARCH effect of the observations and residuals of the selected three models with LM test at various lags.

Full size table

The best-performing NAR model

To obtain an optimum NAR model, the hidden units and feedback delays, ranging from 10 to 35 and from 2 to 8, respectively, were iteratively examined within in-sample data points. Ultimately, we identified the best-fitting model with 18 hidden neurons and 5 feedback delays dependent on the comprehensive optimum performance indices aside from the fact that a fat-tailed distribution, compared to the normal distribution, should be utilized (Supplementary Fig. S13). As presented in Supplementary Fig. S14, in the preferred model, the minimum MSE values for the training, validation and testing datasets and for the entire dataset were 0.0011, 0.0074, 0.0144 and 0.0038, respectively; the maximum R values of the training, validation, testing subsets and entire dataset were 0.987, 0.931, 0.920 and 0.963, respectively. Moreover, the input-to-error correlations and autocorrelations of the produced residuals were never beyond the estimated 95% uncertainty limits around zero across varying lag times, apart from the one in the ACF plot at lag zero that should occur (Fig. 3 and Table 1). The LM test suggested that the ARCH effect that existed in the original data largely minimized the errors of the NAR model (Table 2). In addition, the response plot of the estimated values from the randomly selected training, validation and testing datasets against their corresponding original observations at different time points showed that the optimal approach could simulate the data points included in the three grouped subsets well because of the small residuals that were mostly located between −0.2 and 0.2 (Fig. 4). Similarly, according to the modeling steps described above, in these two robustness-test datasets, the best-presenting technique fit to the dataset between June 2008 and December 2016 was such an NAR model with 17 hidden neurons and 5 feedback delays. The identified layer architecture and statistical measures for this preferred network are displayed in Supplementary Figs S15–S19 and Tables S4, S5 and S10. The best-fitting model developed utilizing the data from June 2008 to December 2017 was an NAR model with 19 hidden neurons and 6 feedback delays, and with regard to the optimal network, all further diagnostic results can be seen in Supplementary Figs S20–S24 and Tables S8–S10. Afterwards, these best-performing networks were employed to conduct out-of-sample forecasting, and the simulated and forecasted values obtained were back-transformed to the original scale because they were computed on the transformed scale.

The best-performing LSTM model

Generally, the LSTM network with 1 hidden layer surrounded by 1 to 7 hidden units can satisfy the need for time series modeling. Consequently, to attain the optimal modeling parameters for the HFMD series, the range of time steps was set to 1 to 20, and the LSTM model was conducted repeatedly utilizing the activation sigmoid function with time steps ranging from 1 to 20 together with a batch size of 1 using the Adam optimizer technique to stabilize the argument updates to help minimize the loss function of the RMSE. All candidate models were iterated through 300 epochs. Ultimately, we identified that the best-simulating model with 1 hidden layer containing 5 hidden neurons and 11 time steps relied on a minimum training score of RMSE = 0.0031 and a testing score of RMSE = 0.0038 and with maximum R values of the training and validation subsets and of the overall data (0.972, 0.982 and 0.974, respectively) (Supplementary Fig. S25). In addition, as presented in Fig. 5, no overfitting occurred during the training process because of the similar downward trend between the testing and validation datasets before 300-step iterations. The ACF plot of the produced errors revealed no individually evident autocorrelation at varying lags except for the one at lag zero (Fig. 6b), as shown in the normal Q-Q plot (Supplementary Fig. S26), indicating that the forecasted residuals of the LSTM model were normally distributed to a great extent. The Ljung-Box Q test showed that errors did not depart from the assumptions of stochastic white noise (Fig. 6d and Table 1), and the LM test demonstrated that the volatility that existed in the actual observations was essentially eliminated in the residuals of the selected LSTM model as well (Table 2). Therefore, the model chosen is adequate in capturing the dynamic dependences of this time series. Likewise, the in-sample observations used to test the robustness of the model were applied to determine the optimal LSTM network as noted earlier: the network with 1 hidden layer including 6 hidden neurons and 12 time steps was constructed based on the data series from June 2008 to December 2016 was the best-performing; the results of the diagnostic tests for this network are summarized in Supplementary Figs S27–S30 and Tables S4, S5 and S11. The network with 1 hidden layer containing 5 hidden neurons and 11 time steps built on the basis of the data series from June 2008 to December 2017 should be regarded as the best-performing. Supplementary Figs S31–S34 and Tables S8, S9 and S11 offer systematic diagnostic tests for this preferred network. After the optimal LSTM approaches are selected, they can be employed to predict the epidemic trends of HFMD in the upcoming years.

Comparative analysis

Multiple statistical measures were applied to compare the in-sample simulation and out-of-sample predictive accuracies among the three methods. Compared to the SARIMA and NAR models, the minimum values of measures concerning the facets of training and testing were observed in the LSTM technique, aside from the MAE in the simulated stage of the NAR model, and were fitted to the robustness-test dataset from June 2008 to December 2017 (Table 3). For these three constructed models, overall, the curves simulated and predicted by the LSTM model were closer in proximity to the actual values as well (Fig. 7), which further implies that the epidemic trajectories of HFMD can be captured reasonably well by the LSTM technique. Hence, the LSTM technique was re-modeled to recursively achieve multistep-ahead predictions from July 2018 to December 2020 (Table 4); it appears that a slightly potential rising risk in the incident cases of HFMD will be observed in the forecasting period. In the meantime, the uncertainty bands for the resultant forecasts were estimated utilizing simulation of 100 future possible paths by performing bootstrapping with the number of samples of 1000^23,24 (Table 4 and Supplementary Table S12).

Table 3 The comparison results of in-sample simulating and out-of-sample predicted performances for the three models.

Full size table

Table 4 The predictive incident cases of HFMD using the best-presenting LSTM technique from July 2018 to December 2020.

Full size table

Discussion

Since 2008, HFMD has regularly captured wide attention owing to both its high incidence and potential health hazards for millions of children, along with substantial losses to economy every year in mainland China⁸. It is imperative that specific control and intervention planning be introduced and set by the related public health agencies to handle such a wide-ranging issue. However, vital to any initiation planning of HFMD is an accurate prediction of its future temporal patterns. Early detection of HFMD epidemics based on models such as SARIMA and NAR has been a profitable technology for facilitating prevention strategies more effectively^10,11,17. Therefore, in view of the LSTM model’s flexible capacity to learn what to store and what to abandon during information-processing²⁵, to the best of our knowledge, this attempt is the first using an LSTM approach to model the long trajectory behaviors of HFMD incidence in mainland China. Our results imply that the LSTM method has the potential to obtain a clearer perspective of epidemic trends than the SARIMA and NAR models built on the specified predictive horizons. Notably, the LSTM method can make the outcome measurements of MAE, MAPE and RMSE markedly drop by 31.993%, 30.182% and 39.154%, respectively, in the training dataset, and can make their counterparts in the testing dataset decrease by 21.883%, 28.415% and 16.607%, respectively, compared to the SARIMA model. In contrast to the NAR model, the decreased percentages in the training subset for the three indices listed above are 0.310%, 29.091% and 13.001%, respectively, and their counterparts in the testing subset are 69.035%, 30.328% and 128.304%, respectively. Likewise, according to Fig. 7, it was found that the upward or downward trend simulated and forecasted by the LSTM model provides a more reasonable approximation to the reported points, especially for the identification of high incidence peaks, indicating that the LSTM model can adequately capture the essence of the dependence hidden behind the notified case series of HFMD. Similarly, in the two datasets used to account for the uncertainty in the model, the accuracy measurement indices in the LSTM approach also display the lowest error rates among these chosen, optimal methods, particularly in the 18-step ahead predictions, and the performances are as good as can be expected. The scale-dependent measure of MAE showed a lower value in the simulated stage of the NAR method than in the LSTM technique in the 6-step predictions; this result mainly arises from the fact that there are several simulated values that are far away from the observed values. Moreover, it was confirmed that the NAR model is effective in capturing the short-term dynamics of the data. In short, in comparison with the SARIMA and NAR models, the established LSTM approach not only can better explain the seasonal and trend characteristics of HFMD but also is robust with respect to medium-term and long-term forecasts. Apparently, this network can act as an effective tool for recognizing the temporal levels of HFMD incident cases in mainland China in upcoming years. Similar to the recent literature, which has also found that the LSTM method provides good forecasting power for air pollutant concentrations²⁶, financial time series²⁷ and harmful algal blooms in rivers²⁸. From this point of view, our LSTM technique appears to be worthy of being popularized for forecasting the incidence case series of HFMD in other settings in China and even a wide range of simulation applications, such as for all types of contagious diseases or in all time series analyses; however, this conclusion requires further verification. It should, however, be noted that with the increasing development of hybrid techniques, numerous combined methods incorporating linear approaches such as the SARIMA method¹⁷, the gray GM(1,1) model²⁹, the error-trend-seasonal model³⁰ and the exponential smoothing model³¹ and nonlinear techniques such as the back propagation neural network approach³², the generalized regression neural network method³³ and the radical basis function technique³² have already been adopted to serve as early warning tools for infectious diseases, and most have obtained satisfactory results. Consequently, much work will be required to explore the preferred models for detecting and analyzing HFMD morbidity cases in mainland China. In addition, in terms of the modeling measures (MAE, MAPE and RMSE), we found that the simulating and forecasting efficacies of the NAR method were slightly superior to the SARIMA method in the short-term (6-step) predictions, which is consistent with the earlier studies performed by Zhou et al.^34,35, yet is incongruous with the study involving modeling the prevalence of schistosomiasis in Qianjiang¹⁶; in the medium-term (12-step) and long-term (18-step) prediction stages, the NAR method underperforms the SARIMA model in the testing dataset, but this result is not in line with previous work predicting the morbidity of hemorrhagic fever with renal syndrome³⁶ and the daily number of new admission inpatients³⁵. It seems possible that these contrasting results are due to the following: the various characteristics of infectious diseases from different regions and the NAR approach suffer from overfitting, which is a defect inherent in the ANN methods. However, during MATLAB training, a default technique of early stopping was adopted to improve generalization and avoid overfitting. Therefore, being further suggestive of the necessity of constructing forecasting techniques for different infectious diseases in various settings and at different time periods, it is superior to the NAR technique in short-term forecasting.

It is well established that accurate identification of high-risk seasonality plays a pivotal role in timely implementation of prevention strategies and the reasonable allocation of resources for HFMD⁶. In our report, HFMD could occur throughout the year, and larger epidemics could be regularly found every 2 to 3 years. Similar trajectory behaviors were also reported in the studies involved in the regions of Vietnam³⁷, Malaysia³⁸, Hong Kong³⁹, Taiwan⁴⁰ and Singapore⁴¹, but the underlying drivers fail to be fully elucidated. In our study, evident seasonal and cyclical components were observed with the aid of the HP method; for example, every year from April until July, there was peak activity accounting for 59.581% of all notified cases, among which May and June were of particular concern, as they accounted for 60.952% of cases occurring in high-risk seasons. However, between January and February annually, there was a dramatic decline in the reported cases. A similar seasonal distribution was also revealed in recent years in other countries, including Singapore⁴¹, Malaysia³⁸ and most regions of China^6,39,40,42, containing Hong Kong, Taiwan, Shenzhen, Ningbo, Shandong, Zunyi, Guangdong and Guangzhou⁶. Moreover, outbreaks commonly occurred during the 4 months as well^42,43. Additionally, two peaks could be noted in our data from June 2008 to June 2018, the first and stronger peak primarily occurred during the high-risk season, and the weaker peak was chiefly observed from August to November annually; the appearance of the two peaks was also reported in another study of southern China⁶. This seasonal pattern is consistent with that of Hong Kong³⁹, Taiwan⁴⁰ and Vietnam³⁷. The single peak was customarily observed in northern China⁴⁴, and earlier studies on the temporal characteristics of HFMD in Japan³ and Malaysia³⁸ matched that in northern China. This may be pertinent to the different viruses, geographical differences or changing risk (e.g., school attendance, temperature, humidity or other meteorological drivers). Furthermore, in studies of particular regions of China, the leading agents (EV71 and CVA16) are also distributed in various peaks⁴⁵, where the pathogenic agent EV71 is predominant in the stronger peak months. By contrast, CVA16 is more inclined to circulate in the more vulnerable population. These two etiologic factors are notably attenuated in January and February. Regarding the seasonal variations, climatic factors are possibly responsible for such a discrepancy (e.g., the ability of the causative agents to survive outside the host, the variability in the behavior and immune level of the host by climatic factors, and the inclination of people to go outdoors in summer rather than in winter increases the chances of person-to-person contact causing the etiologic agents to more easily achieve transmission among humans by virtue of spreading-factors)⁶.

To understand the epidemic situation in advance of the coming years, the constructed LSTM model with the best-fitting and best-predicting performance was adopted to calculate forecasts for the next two years. The results indicated that although the estimated observations would not show a large amplitude of oscillations relative to the in-sample data obtained, HFMD morbidity cases remained high, among which the highest-risk seasonality seemed to occur in June and July. Similar to prior findings⁶, two apparent seasonal peaks will be observed separately in subsequent Junes and Octobers, in all probability. Thus, due attention and instant action should be paid to these months and a response should be prompted, such as health promotion education; prevention at and control of key locations, particularly in nurseries and schools; vaccination and financial support. In addition, the prevention and control strategies for the rest of the low-risk months should fail to be ignored. In summary, the expected number of cases of HFMD remain present and still comparatively large, demonstrating that China is still afflicted with a chronic threat of HFMD.

Some limitations should be acknowledged in this work. First, no theoretical guidance can be adopted to identify the optimum number of hidden units, feedback delays and other key parameters during the establishment of ANN models. In practice, they are frequently selected by trial and error, and the specific forecasting process is poorly understood. Second, estimating the 95% uncertainty bounds for the predictions remains an additional problem. Third, the aggregated HFMD incidence case data utilized were obtained from nationwide passive infectious disease surveillance. We thus fail to rule out artifactual monitoring biases (e.g., substantial underreporting, misdiagnosis and delay). Fourth, the statistical predictions do not take known drivers into account and lack any epidemiological data other than case numbers and months, owing to their unavailability. Therefore, whether further studies that take these variables into account will have the potential to boost the fit and predictive ability remains to be authenticated. Fifth, albeit the LSTM technique built can be considered to be an instrumental tool for the medium-long-term estimation of future trends in HFMD incidence case data, in applications, this network is expected to be updated in due course with the incident cases to ensure its superiority in predictive performance. Sixth, detailed data on HFMD notifications are missing (e.g., age and sex), which precludes further analysis in the present work. Lastly, the LSTM model was developed based only on nationwide monitoring data over the period from 2008 to2018. These results therefore need to be interpreted with caution, and the analytic results can represent only entire epidemics of HFMD on the Chinese mainland. Remodeling for the region-specific notified HFMD cases time series may act as guidance for the formulation of targeted public health strategies, and whether the model is appropriate to calculate predictions for other kinds of communicable diseases requires further study.

In conclusion, notwithstanding its flaws, our study does indicate that the LSTM model established can provide more accurate predictions, be it in the in-sample dataset or the out-of-sample dataset. For the HFMD notified case time series compared with the individual SARIMA and NAR models, the LSTM model may be a beneficial tool for the early detection and advanced warning of HFMD activities in mainland China and can allow the official government to allocate health resources effectively and appropriately formulate the preventive and control planning for HFMD. Additionally, the number of forecasted incident cases are still relatively large and indeed present in the imminent future, this issue warrants to be resolved urgently and strategically within the effective measures taken.

Materials and Methods

Data collection

In this study, the aggregated monthly and yearly reported cases of HFMD, available from June 2008 to June 2018, were obtained from the notifiable infectious disease monitoring system provided by the Chinese Center for Disease Control and Prevention (CDC) (http://www.nhfpc.gov.cn/jkj/s3578/new_list.shtml). A total of 121 observations covering 11 years were collated and summarized. Subsequently, the whole dataset was split into two blocks to build the models, among which the first 109 data points (from June 2008 to June 2017) were regarded as in-sample modeling horizons, while the remaining 12 data points (from July 2017 to June 2018) were considered as out-of-sample predictive horizons. Since the sample length and time periods adopted to construct the models might have an impact on the forecasting power, two additional data categories were provided to test the robustness of the models developed, among which the first 103 (from June 2008 to December 2016) and 115 data points (from June 2008 to December 2017) were considered as in-sample modeling horizons, while the other 18 (from January 2017 to June 2018) and 6 data points (from January 2018 to June 2018) were used as out-of-sample predictive horizons.

In China, HFMD is clinically diagnosed by physicians, and the laboratory confirmed the diagnosis dependent on the detection of specific nucleic acids, the isolation of enterovirus related to pathogenic factors and the detection of a fourfold change in neutralizing antibodies. In addition, verified cases must be registered within 24 hours, and duplicate cases must be deleted by professionals at the end of the same month. Ethical approval or consent is not required for our present study owing to the public availability of HFMD surveillance data in China.

Constructing the SARIMA model

The Box-Jenkins method of ARIMA(p, d, q) has been the most commonly used statistical forecasting technique for time series data that display no seasonality⁴⁶. However, in applications, particularly in the morbidity time series of diseases, this time series frequently shows marked seasonal and cyclic tendencies³⁰. Consequently, to avert losing significant series traits, a seasonal ARIMA method, specified as SARIMA(p, d, q)(P, D, Q)_s, has been proposed to reveal data with those patterns⁴⁷. In this model, the actual observation can be represented as a linear combination of the prior observation and the error sequence. As such, the secular change and seasonal variation of time series are captured in the SARIMA method as interpretable terms⁴⁸. Although, of note, the linear SARIMA method can also model periodicity, the fitted cyclic change remains invariably symmetric. In our present study, considering the characteristics of HFMD incident case sequences containing evident cyclical and seasonality⁶, a typical SARIMA model will be a useful tool in predicting the future temporal trends¹⁷, among which the seasonal part of HFMD was taken for the predictors and the monthly HFMD incidence time series was used for the dependent variable. The final formula of a SARIMA model can be expressed as:

$$\{\begin{array}{c}\varphi (B){\rm{\Phi }}({B}^{{\rm{s}}}){{\rm{\Delta }}}^{d}{{\rm{\Delta }}}_{s}^{D}{X}_{t}=\theta (B){\rm{\Theta }}({B}^{s}){\varepsilon }_{t}\\ E({\varepsilon }_{t})=0,\,Var({\varepsilon }_{t})={\sigma }_{\varepsilon }^{2},E({\varepsilon }_{t}{\varepsilon }_{s})=0,s\ne t\\ E({X}_{s}{\varepsilon }_{t})=0,{\forall }_{s} < t\end{array}$$

(1)

where B refers to the backshift operator, ɛ_t denotes the errors from HFMD series, S signifies the length of seasonal cycle of HFMD notifications, d and D are the non-seasonal and seasonal differenced times, respectively. In the SARIMA model notation, p and q represent the orders of the non-seasonal autoregressive and moving average models, respectively; P and Q represent the orders of the seasonal autoregressive and moving average models, respectively. ${\nabla }^{{\rm{d}}}={(1-B)}^{d}$, ${\nabla }_{S}^{D}={(1-B)}^{SD}$, $\varphi (B)=1-{\varphi }_{1}B-\cdot \cdot \cdot \,-\,{\varphi }_{p}{B}^{p}$,$\theta (B)=1-{\theta }_{1}B-\cdot \cdot \cdot -{\theta }_{q}{B}^{q}$, ${\rm{\Phi }}({B}^{s})=1-{{\rm{\Phi }}}_{1}{B}^{s}-\cdot \cdot \cdot -{{\rm{\Phi }}}_{P}{B}^{P{\rm{s}}}$, ${\rm{\Theta }}({B}^{s})=1-{{\rm{\Theta }}}_{1}{B}^{s}-\cdot \cdot \cdot -{{\rm{\Theta }}}_{Q}{B}^{Q{\rm{s}}}$.

We utilized the R statistical package (version 3.4.3, R Development Core Team, Vienna, Austria) and SPSS software (version 17.0, IBM Corp, Armonk, NY) to construct the SARIMA model. The development of the SARIMA approach is under the assumption of a stationary incidence time series¹⁰. Therefore, in this research, the ADF test was used to identify whether the actual reported cases and processed data using differencing or a transformation technique accomplished stationarity⁴⁹. Afterwards, the autocorrelation function (ACF) and partial autocorrelation function (PACF) plots, the Schwarz Bayesian criterion (SBC), the Akaike information criterion (AIC) and the corrected Akaike Information criterion (AICc), along with the Lagrangian multiplier (LM) and Ljung-Box Q tests were applied to estimate and diagnose the model⁵⁰. The above mentioned modeling procedures were repeatedly conducted until the best-performing model was ultimately discovered.

Establishing the NAR model

Complexities and challenges in understanding the temporal characteristics of infectious diseases that exist are the complicated nonlinear interactions among different dimensions in real-world scenarios¹⁶. Artificial neural networks (ANNs) can adequately enable arbitrarily intricate non-stationary series to attain any desired accuracy owing to their powerful flexible nonlinear mapping capacity, and they have been considered as a function approximator applied in the domains of environmental forecasting, electrical energy and medicine^10,16. The NAR model is a leading shallow, dynamic recurrent neural network (RNN) that is based on the linear autoregressive model with the ability to time-varying the state of interconnected neurons⁵¹, can be adopted to explore the nonlinear relationship between the response variable and its predictors owing to its network architecture with a hidden layer accompanied by a sigmoid transfer function that allows it to have no restrictions on the parameters that comply with the requirement of stationarity^52,53,54. Furthermore, with the aid of tapped delay lines, the NAR technique also has a short-term memory function for the previous inputs and outputs, which makes its response at any given time rely not only on the present inputs but on the history of the inputs series as well⁵⁵. For a time series with obvious seasonality, when providing suitable observations from the same season as inputs, this network can capture the time series components of periodicity, seasonality and secular trend adequately well with respect to the appropriate inputs that require a multitude of experiments to discover the optimum. Accordingly, this method can provide reliable forecasts for current HFMD incidence case series including linear and nonlinear information. The specified equation of the NAR method can be written as:

$$y(t)=f(y(t-1),\,y(t-2),\,\cdot \cdot \cdot ,\,y(t-d)),$$

(2)

where y(t) represents the predicted points of the HFMD incidence series relied merely on the prior data of lagged period d.

In this work, the graphical user interface (GUI) in MATLAB (Version R2014a, MathWorks, Natick, MA, USA) was employed to automatically create an advanced script prior to modeling an NAR. First, the actual HFMD observations were processed between 0 and 1 using a normalized approach⁵⁶ to facilitate further analysis. Second, the dividerand function was used to randomly divide the in-sample data into training, validation and testing subsets, following ratios of 70%, 15% and 15%, respectively. In the robustness-test data, the abovementioned ratio in the training, validation and testing subsets was also used in the first test dataset, while another commonly used ratio of 80%, 10% and 10% corresponding to the training, validation and testing subsets, respectively, was employed in the second test dataset. Third, the number of hidden neurons and delays d were adjusted by repeated attempts with the Levenberg-Marquardt algorithm in an open feedback loop. The response plot of outputs and targets, correlograms and input-error cross-correlation plots, coupled with the mean square error (MSE) and correlation coefficient (R), were offered to choose the best-fitting model. Finally, the training open-loop form was transformed to a closed loop to achieve a goal of multistep-ahead forecasting (Supplementary Fig. S35).

Establishing the LSTM model

As mentioned above, coincident with the increase in the time lag has been a decrease in the long-term learning ability during the training of an NAR method due to a vanishing or exploding gradient problem, which is a major flaw for NAR method forecasting²⁶. A LSTM model, not the least prevalent and rewarding variant of the conventional RNNs, can overcome this disadvantage encountered in an NAR model as it is capable of maintaining state and identifying traits over the length of the sequences used^26,27. The LSTM technique has a special layered architecture with memory blocks that contain one or more self-connected memory cells^22,57, surrounded by three gating units, including the input, output and forget gates, that can continuously perform write, read and reset operations, to preserve information (Supplementary Fig. S36)^57,58. Such a configuration can keep the states persisting or communicating between updates of the weights with the progress of each epoch; moreover, it can reinforce the RNN by capturing the long-term dynamics of the time series components of periodicity, seasonality and secular trend in addition to the short-term dynamics. The estimated equations of the LSTM model can be defined as:

$${i}_{t}=\sigma ({W}_{xi}{x}_{t}+{W}_{hi}{h}_{t-1}+{W}_{ci}{c}_{t-1}+b{}_{i})$$

(3)

$${f}_{t}=\sigma ({W}_{xf}{x}_{t}+{W}_{hf}{h}_{t-1}+{W}_{cf}{c}_{t-1}+b{}_{f})$$

(4)

$${c}_{t}={f}_{t}{c}_{t-1}+{i}_{t}\,\tanh ({W}_{xc}{x}_{t}+{W}_{hc}{h}_{t-1}+{W}_{ci}{c}_{t-1}+b{}_{c})$$

(5)

$${o}_{t}=\sigma ({W}_{xo}{x}_{t}+{W}_{ho}{h}_{t-1}+{W}_{co}{c}_{t-1}+b{}_{o})$$

(6)

$${h}_{t}={o}_{t}\,\tanh ({c}_{t})$$

(7)

$$y(t)={W}_{yh}(f({W}_{hx}y(t-1)+{W}_{hh}{h}_{t-1}+{b}_{h})+{b}_{y}$$

(8)

where i_t refers to the input gate; f_t is the forget gate; c_t stands for the states of memory cell at time t; o_t represents the output gate; h_t is the hidden states at time t; W_xc, W_xi, W_xf and W_xo are the weight matrices connecting the input signals; x_t, W_hc, W_hi, W_hf and W_ho represent the weight matrices connecting the hidden layer output signals; h_t, W_ci, W_cf and W_co stand for the diagonal matrices connecting the neuron activation functions; b_i, b_c, b_f, b_o, b_h and b_y refer to the bias vectors; σ is the activation function (tanh or sigmoid); y(t) is the predicted points of the HFMD incidence series; W_hx, W_hh, and W_yh are the input-hidden weight matrix, hidden-hidden weight matrix and hidden-output weight matrix, respectively; and ${y}_{t-1}=({y}_{t-1,}{y}_{t-2,\cdots ,}{y}_{t-d})^{\prime} $ is a vector including time steps of the series.

Our prediction project using the LSTM model for regression was conducted with Keras. First, the actual HFMD notifications were rescaled between 0 and 1 using a normalized preprocessing approach. Second, in-sample data were separated into two blocks, 75% of them would be used for the training model, whereas the remaining 25% of the data would be utilized to validate the generalization and to simulate performance. Similarly, in the robustness-test data, the same data classification described above was applied to the first test dataset, while 70% and 30% data points were taken as training and validation subsets, respectively, out of the second test dataset. Third, the back propagation through time (BPTT) algorithm⁵⁹ was adopted for LSTM training with various time steps, and hidden layer neurons used to select the preferred model relied on the minimum root mean square error loss and satisfactory ACF plot²². Finally, the best-fitting architecture was chosen to generate out-of-sample predictions, and then the results should further be transformed to the simulated and forecasted values from the original observations with the inverse transform technique.

Measuring for accuracy

In order to distinguish the stimulation and forecasting accuracies from the selected various models, the root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) were ultimately adopted to measure the performance accuracy.

$${\rm{RMSE}}=\sqrt{\frac{1}{N}\sum _{i=1}^{N}{({X}_{i}-{\bar{X}}_{i})}^{{\rm{2}}}}$$

(9)

$${\rm{MAE}}=\frac{1}{N}\sum _{i=1}^{N}|{X}_{i}-{\bar{X}}_{i}|$$

(10)

$${\rm{MAPE}}=\frac{1}{N}\sum _{i=1}^{N}\frac{|{X}_{i}-{\bar{X}}_{i}|}{{X}_{i}}$$

(11)

Where X_i stands for the actual notified notifications, ${\bar{X}}_{i}$ represents the simulated and predictive values with the selected preferred methods, N is the number of simulations and predictions under the models used.

Data Availability

These data can be extracted as presented in the website of data collection or please contact the first author on reasonable request.

References

Wang, J. F. et al. Hand, foot and mouth disease: spatiotemporal transmission and climate. Int J Health Geogr 10, 25, https://doi.org/10.1186/1476-072X-10-25 (2011).
Article PubMed PubMed Central Google Scholar
Wang, Y. et al. Hand, foot, and mouth disease in China: patterns of spread and transmissibility. Epidemiology 22, 781–792, https://doi.org/10.1097/EDE.0b013e318231d67a (2011).
Article CAS PubMed PubMed Central Google Scholar
Xing, W. et al. Hand, foot, and mouth disease in China, 2008–2012: an epidemiological study. Lancet Infectious Diseases 14, 308–318, https://doi.org/10.1016/S1473-3099(13)70342-6 (2014).
Article PubMed Google Scholar
Wei, J. et al. The effect of meteorological variables on the transmission of hand, foot and mouth disease in four major cities of shanxi province, China: a time series data analysis (2009–2013). PLoS Neglected Tropical Diseases 9, e0003572, https://doi.org/10.1371/journal.pntd.0003572 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ang, L. W. et al. Seroepidemiology of Coxsackievirus A6, Coxsackievirus A16, and Enterovirus 71 Infections among Children and Adolescents in Singapore, 2008–2010. PloS One 10, e0127999, https://doi.org/10.1371/journal.pone.0127999 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zhuang, Z. C. et al. Epidemiological Research on Hand, Foot, and Mouth Disease in Mainland China. Viruses 7, 6400–6411, https://doi.org/10.3390/v7122947 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wang, L. et al. Epidemiologic characteristics of hand, foot, and mouth disease in China from 2006 to 2015. Journal of Infection 73, 512–515, https://doi.org/10.1016/j.jinf.2016.08.007 (2016).
Article PubMed Google Scholar
Zheng, Y. M., Yang, J. & Liao, Q. H. Direct medical costs and influencing factors in severe hand, foot and mouth disease in children aged between six months and five years old. Zhonghua Yu Fang Yi Xue Za Zhi. Chinese Journal of Preventive Medicine 51, 87–92, https://doi.org/10.3760/cma.j.issn.0253-9624.2017.01.017 (2017).
Article CAS PubMed Google Scholar
Ma, E., Chan, K. C., Cheng, P., Wong, C. & Chuang, S. K. The enterovirus 71 epidemic in 2008–public health implications for Hong Kong. International Journal of Infectious Diseases 14, e775–e780, https://doi.org/10.1016/j.ijid.2010.02.2265 (2010).
Article PubMed Google Scholar
Yu, L. et al. Application of a new hybrid model with seasonal auto-regressive integrated moving average (ARIMA) and nonlinear auto-regressive neural network (NARNN) in forecasting incidence cases of HFMD in Shenzhen, China. PloS One 9, e98241, https://doi.org/10.1371/journal.pone.0098241 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Du, Z. et al. Predicting the hand, foot, and mouth disease incidence using search engine query data and climate variables: an ecological study in Guangdong, China. Bmj Open 7, e016263, https://doi.org/10.1136/bmjopen-2017-016263 (2017).
Article PubMed PubMed Central Google Scholar
Zhang, X. et al. Temporal and long-term trend analysis of class C notifiable diseases in China from 2009 to 2014. BMJ Open 6, e011038, https://doi.org/10.1136/bmjopen-2016-011038 (2016).
Article PubMed PubMed Central Google Scholar
Ministry of Science and Technology of the People’s Republic of China. HFMD vaccine developed by China now on sale. Available at, http://www.most.gov.cn/eng/pressroom/ 201605/t20160506_125468.htm (Accessed: August 03 2018) (2016).
National Health and Family Planning Commission of China. Suggestions on prevention and control of HFMD in 2018. Available at, http://www.nhfpc.gov.cn/jkj/s3578/201803/45ae9b 6347dc4b6cb87b8b871c578456.shtml (Accessed: August 03 2018) (2018).
Organization, W. H. A Guide to Clinical Management and Public Health Response for Hand, Foot and Mouth disease (HFMD). Organizational Behavior and Human Decision Processes (2011).
Zhou, L. et al. A hybrid model for predicting the prevalence of schistosomiasis in humans of Qianjiang City, China. PloS One 9, e104875, https://doi.org/10.1371/journal.pone.0104875 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Liu, L., Luan, R. S., Yin, F., Zhu, X. P. & Lu, Q. Predicting the incidence of hand, foot and mouth disease in Sichuan province, China using the ARIMA model. Epidemiology and Infection 144, 144–151, https://doi.org/10.1017/s0950268815001144 (2016).
Article CAS PubMed Google Scholar
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Computation 9, 1735–1780 (1997).
Article CAS Google Scholar
Chen, K. & Huo, Q. Training Deep Bidirectional LSTM Acoustic Model for LVCSR by a Context-Sensitive-Chunk BPTT Approach. IEEE/ACM Transactions on Audio Speech & Language Processing 24, 1185–1193 (2016).
Article Google Scholar
Gao, L., Guo, Z., Zhang, H., Xu, X. & Shen, H. T. Video Captioning With Attention-Based LSTM and Semantic Consistency. IEEE Transactions on Multimedia 19, 2045–2055 (2017).
Article Google Scholar
Zen, H. Acoustic Modeling in Statistical Parametric Speech Synthesis - From HMM to LSTM-RNN. Middle East Policy 15, 125–132 (2015).
Google Scholar
Deng, L. & Yu, D. Deep Learning: Methods and Applications. Foundations & Trends in Signal Processing 7, 197–387 (2014).
Article MathSciNet Google Scholar
Hyndman, R. J., Ord, K., Snyder, R. D. & Koehler, A. B. Prediction intervals for exponential smoothing using two new classes of state space models. Journal of Forecasting 24, 17–37 (2005).
Article MathSciNet Google Scholar
Khosravi, A., Nahavandi, S., Creighton, D. & Atiya, A. F. Comprehensive review of neural network-based prediction intervals and new advances. IEEE Trans Neural Netw 22, 1341–1356, https://doi.org/10.1109/tnn.2011.2162110 (2011).
Article PubMed Google Scholar
D’Informatique, D. E. et al. Long Short-Term Memory in Recurrent Neural Networks. Epfl 9, 1735–1780 (2001).
Google Scholar
Li, X. et al. Long short-term memory neural network for air pollutant concentration predictions: Method development and evaluation. Environmental Pollution (Barking, Essex: 1987) 231, 997–1004, https://doi.org/10.1016/j.envpol.2017.08.114 (2017).
Article CAS Google Scholar
Bao, W., Yue, J. & Rao, Y. A deep learning framework for financial time series using stacked autoencoders and long-short term memory. PloS One 12, e0180944, https://doi.org/10.1371/journal.pone.0180944 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lee, S. & Lee, D. Four Major South Korea’s Rivers Using Deep Learning Models. International Journal of Environmental Research and Public Health 15, https://doi.org/10.3390/ijerph15071322 (2018).
Article Google Scholar
Gan, R., Chen, X., Yan, Y. & Huang, D. Application of a hybrid method combining grey model and back propagation artificial neural networks to forecast hepatitis B in china. Computational and Mathematical Methods in Medicine 2015, 328273, https://doi.org/10.1155/2015/328273 (2015).
Article PubMed PubMed Central Google Scholar
Wang, Y. et al. Temporal trends analysis of human brucellosis incidence in mainland China from 2004 to 2018. Scientific Reports 8, 15901, https://doi.org/10.1038/s41598-018-33165-9 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Al-Sakkaf, A. & Jones, G. Comparison of time series models for predicting campylobacteriosis risk in New Zealand. Zoonoses Public Health 61, 167–174, https://doi.org/10.1111/zph.12046 (2014).
Article CAS PubMed Google Scholar
Zhang, X. et al. Comparative study of four time series methods in forecasting typhoid fever incidence in China. PloS One 8, e63116, https://doi.org/10.1371/journal.pone.0063116 (2013).
Article CAS PubMed PubMed Central ADS Google Scholar
He, F. et al. Construction and evaluation of two computational models for predicting the incidence of influenza in Nagasaki Prefecture, Japan. Scientific Reports 7, 7192, https://doi.org/10.1038/s41598-017-07475-3 (2017).
Article CAS PubMed PubMed Central ADS Google Scholar
Zhou, L. et al. Using a Hybrid Model to Forecast the Prevalence of Schistosomiasis in Humans. International Journal of Environmental Research and Public Health 13, 355, https://doi.org/10.3390/ijerph13040355 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zhou, L., Zhao, P., Wu, D., Cheng, C. & Huang, H. Time series model for forecasting the number of new admission inpatients. BMC Medical Informatics and Decision Making 18, 39, https://doi.org/10.1186/s12911-018-0616-8 (2018).
Article PubMed PubMed Central Google Scholar
Wu, W. et al. Application of nonlinear autoregressive neural network in predicting incidence tendency of hemorrhagic fever with renal syndrome. Zhonghua Liu Xing Bing Xue Za Zhi 36, 1394–1396 (2015).
PubMed Google Scholar
Van Tu, P. et al. Epidemiologic and virologic investigation of hand, foot, and mouth disease, southern Vietnam, 2005. Emerging Infectious Diseases 13, 1733–1741, https://doi.org/10.3201/eid1311.070632 (2007).
Article CAS PubMed PubMed Central Google Scholar
Podin, Y. et al. Sentinel surveillance for human enterovirus 71 in Sarawak, Malaysia: lessons from the first 7 years. BMC Public Health 6, 180, https://doi.org/10.1186/1471-2458-6-180 (2006).
Article PubMed PubMed Central Google Scholar
Yang, B., Lau, E. H., Wu, P. & Cowling, B. J. Transmission of Hand, Foot and Mouth Disease and Its Potential Driving Factors in Hong Kong. Scientific Reports 6, 27500, https://doi.org/10.1038/srep27500 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Chen, K. T., Chang, H. L., Wang, S. T., Cheng, Y. T. & Yang, J. Y. Epidemiologic features of hand-foot-mouth disease and herpangina caused by enterovirus 71 in Taiwan, 1998–2005. Pediatrics 120, e244–252, https://doi.org/10.1542/peds.2006-3331 (2007).
Article PubMed Google Scholar
Ang, L. W. et al. Epidemiology and control of hand, foot and mouth disease in Singapore, 2001–2007. Annals of the Academy of Medicine, Singapore 38, 106–112 (2009).
PubMed Google Scholar
Zhang, Y. et al. An emerging recombinant human enterovirus 71 responsible for the 2008 outbreak of hand foot and mouth disease in Fuyang city of China. Virology Journal 7, 94, https://doi.org/10.1186/1743-422X-7-94 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. et al. An outbreak of hand, foot, and mouth disease associated with subgenotype C4 of human enterovirus 71 in Shandong, China. Journal of Clinical Virology 44, 262–267, https://doi.org/10.1016/j.jcv.2009.02.002 (2009).
Article PubMed Google Scholar
Wang, C. et al. Spatiotemporal Cluster Patterns of Hand, Foot, and Mouth Disease at the County Level in Mainland China, 2008–2012. PloS One 11, e0147532, https://doi.org/10.1371/journal.pone.0147532 (2016).
Article CAS PubMed PubMed Central Google Scholar
Liu, W. et al. Co-circulation and genomic recombination of coxsackievirus A16 and enterovirus 71 during a large outbreak of hand, foot, and mouth disease in Central China. PloS One 9, e96051, https://doi.org/10.1371/journal.pone.0096051 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
He, Z. & Tao, H. Epidemiology and ARIMA model of positive-rate of influenza viruses among children in Wuhan, China: A nine-year retrospective study. International Journal of Infectious Diseases 74, 61–70, https://doi.org/10.1016/j.ijid.2018.07.003 (2018).
Article CAS PubMed Google Scholar
Cortes, F. et al. Time series analysis of dengue surveillance data in two Brazilian cities. Acta Tropica 182, 190–197, https://doi.org/10.1016/j.actatropica.2018.03.006 (2018).
Article PubMed Google Scholar
Arruda, A. G., Vilalta, C., Puig, P., Perez, A. & Alba, A. Time-series analysis for porcine reproductive and respiratory syndrome in the United States. PloS One 13, e0195282, https://doi.org/10.1371/journal.pone.0195282 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wang, C. et al. Epidemiological Features and Forecast Model Analysis for the Morbidity of Influenza in Ningbo, China, 2006–2014. International Journal of Environmental Research and Public Health 14, https://doi.org/10.3390/ijerph14060559 (2017).
Article Google Scholar
Xu, Q. et al. Forecasting the Incidence of Mumps in Zibo City Based on a SARIMA Model. International Journal of Environmental Research and Public Health 14, 925, https://doi.org/10.3390/ijerph14080925 (2017).
Article PubMed Central Google Scholar
Wang, K. W. et al. Hybrid methodology for tuberculosis incidence time-series forecasting based on ARIMA and a NAR neural network. Epidemiology and Infection 145, 1118–1129, https://doi.org/10.1017/S0950268816003216 (2017).
Article CAS PubMed Google Scholar
Adeboye, A., Davies, O., Akinwumi, O., James, N. & Ruffin, M. Seasonality and Trend Forecasting of Tuberculosis Prevalence Data in Eastern Cape, South Africa, Using a Hybrid Model. International Journal of Environmental Research and Public Health 13, 757, https://doi.org/10.3390/ijerph13080757 (2016).
Article Google Scholar
Ansari, M., Othman, F., Abunama, T. & El-Shafie, A. Analysing the accuracy of machine learning techniques to develop an integrated influent time series model: case study of a sewage treatment plant, Malaysia. Environmental Science and Pollution Research International 25, 12139–12149, https://doi.org/10.1007/s11356-018-1438-z (2018).
Article PubMed Google Scholar
Zhang, J., Qiu, H., Li, X. & Niu, J. Real-Time Nowcasting of Microbiological Water Quality at Recreational Beaches: A Wavelet and Artificial Neural Network-Based Hybrid Modeling Approach. 52, 8446–8455, https://doi.org/10.1021/acs.est.8b01022 (2018).
Article CAS ADS Google Scholar
Wu, W. et al. Comparison of Two Hybrid Models for Forecasting the Incidence of Hemorrhagic Fever with Renal Syndrome in Jiangsu Province, China. PloS One 10, e0135492, https://doi.org/10.1371/journal.pone.0135492 (2015).
Article CAS PubMed PubMed Central Google Scholar
Shafaei, M., Adamowski, J., Fakherifard, A., Dinpashoh, Y. & Adamowski, K. A wavelet-SARIMA-ANN hybrid model for precipitation forecasting. Journal of Water & Land Development 28, 27–36, https://doi.org/10.1515/jwld-2016-0003 (2016).
Article Google Scholar
Jimeno Yepes, A. Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation. J Biomed Inform 73, 137–147, https://doi.org/10.1016/j.jbi.2017.08.001 (2017).
Article PubMed Google Scholar
Volkova, S., Ayton, E., Porterfield, K. & Corley, C. D. Forecasting influenza-like illness dynamics for military populations using neural networks and social media. PloS One 12, e0188941, https://doi.org/10.1371/journal.pone.0188941 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bengio, Y., Simard, P. & Frasconi, P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netw 5, 157–166 (2002).
Article Google Scholar

Download references

Acknowledgements

We thank all units involving the collection of HFMD data and the infectious disease monitoring system, coupled with the funders for the present study. This project was supported by the Graduate Student Innovation Fund of Hebei Province (CXZZBS2017130).

Author information

Yongbin Wang and Chunjie Xu contributed equally.

Authors and Affiliations

Department of Epidemiology and Health Statistics, School of Public Health, North China University of Science and Technology, Tangshan, Hebei Province, P.R. China
Yongbin Wang, Shengkui Zhang, Li Yang, Zhende Wang, Ying Zhu & Juxiang Yuan
Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, 100069, P.R. China
Chunjie Xu

Authors

Yongbin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chunjie Xu
View author publications
You can also search for this author in PubMed Google Scholar
Shengkui Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Li Yang
View author publications
You can also search for this author in PubMed Google Scholar
Zhende Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ying Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Juxiang Yuan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.B.W., C.J.X. and J.X.Y. conceived this work, and collected and analyzed the data. S.K.Z., Z.D.W., L.Y. and Y.Z. improved the paper. All authors agree to submit this article. The funders did not take part in the data curation, formal analysis, methodology and improvement of the present manuscript.

Corresponding author

Correspondence to Juxiang Yuan.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Materials

Table S12

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, Y., Xu, C., Zhang, S. et al. Development and evaluation of a deep learning approach for modeling seasonality and trends in hand-foot-mouth disease incidence in mainland China. Sci Rep 9, 8046 (2019). https://doi.org/10.1038/s41598-019-44469-9

Download citation

Received: 07 August 2018
Accepted: 06 March 2019
Published: 29 May 2019
DOI: https://doi.org/10.1038/s41598-019-44469-9

This article is cited by

Trend analysis and prediction of gonorrhea in mainland China based on a hybrid time series model
- Zhende Wang
- Yongbin Wang
- ZiJian Feng
BMC Infectious Diseases (2024)
Study of the influence of meteorological factors on HFMD and prediction based on the LSTM algorithm in Fuzhou, China
- Hansong Zhu
- Si Chen
- Jianming Ou
BMC Infectious Diseases (2023)
Dual-attention-based recurrent neural network for hand-foot-mouth disease prediction in Korea
- Sieun Lee
- Sangil Kim
Scientific Reports (2023)
Estimating the tuberculosis incidence using a SARIMAX-NNARX hybrid model by integrating meteorological factors in Qinghai Province, China
- Wenjuan Liang
- Ailing Hu
- Yongbin Wang
International Journal of Biometeorology (2023)
Weather integrated multiple machine learning models for prediction of dengue prevalence in India
- Satya Ganesh Kakarla
- Phani Krishna Kondeti
- Srinivasa Rao Mutheneni
International Journal of Biometeorology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

General information

The best-performing SARIMA model

The best-performing NAR model

The best-performing LSTM model

Comparative analysis

Discussion

Materials and Methods

Data collection

Constructing the SARIMA model

Establishing the NAR model

Establishing the LSTM model

Measuring for accuracy

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links