Deep learning for bias correction of MJO prediction

Kim, H.; Ham, Y. G.; Joo, Y. S.; Son, S. W.

doi:10.1038/s41467-021-23406-3

Article
Open access
Published: 25 May 2021

Nature Communications volume 12, Article number: 3087 (2021) Cite this article

Abstract

Producing accurate weather prediction beyond two weeks is an urgent challenge due to its ever-increasing socioeconomic value. The Madden-Julian Oscillation (MJO), a planetary-scale tropical convective system, serves as a primary source of global subseasonal (i.e., targeting three to four weeks) predictability. During the past decades, operational forecasting systems have improved substantially, while the MJO prediction skill has not yet reached its potential predictability, partly due to the systematic errors caused by imperfect numerical models. Here, to improve the MJO prediction skill, we blend the state-of-the-art dynamical forecasts and observations with a Deep Learning bias correction method. With Deep Learning bias correction, multi-model forecast errors in MJO amplitude and phase averaged over four weeks are significantly reduced by about 90% and 77%, respectively. Most models show the greatest improvement for MJO events starting from the Indian Ocean and crossing the Maritime Continent.

Introduction

Accurate prediction beyond the two-week limit of atmospheric predictability is extremely valuable to society and the economy. In particular, reliable forecasts in the subseasonal range (i.e., timescale of 3–4 weeks) provide vital information about hazardous weather threats, such as floods, heat waves, and cold spells, which are extremely important for risk managers, stakeholders, and policymakers. The value of subseasonal forecasts has been recognized by society and the scientific community, and tremendous international efforts towards making reliable subseasonal forecasts are underway^1,2.

One of the primary predictability sources for the 3–4 week forecast window in the global climate system is the Madden-Julian Oscillation (MJO)³, the dominant mode of tropical subseasonal variability. The MJO is a planetary-scale organized convection-circulation coupled system with a typical period of 30–60 days, characterized by an eastward propagation, especially during boreal winter. As the MJO develops and propagates, anomalous diabatic heating leads to the formation of an anomalous Rossby wave source. This excites the Rossby waves to propagate into the extratropics, modulating the weather events therein^4,5,6,7. For example, studies have shown significant influences of the MJO on tropical cyclones^8,9, extreme temperature and precipitation^10,11,12, storm tracks^13,14,15, atmospheric blocking events¹⁶, atmospheric rivers^17,18,19, tornadoes²⁰, and weather in the Arctic²¹ and Antarctic²², among many others. Due to its far-reaching global impacts⁷ and quasiperiodic nature, the MJO is recognized as one of the leading sources of global climate predictability for the subseasonal timescale that bridges the gap between the traditional weather (i.e., from one day to 2 weeks) and seasonal (i.e., from 2 months to 1 year) forecast ranges.

Recent advances in theoretical understanding, improved numerical models, and international collaborative activities on field campaigns and forecast experiments have promoted advances in MJO forecasting^23,24,25. Now, the state-of-the-art dynamical forecast systems are able to predict the MJO up to 3 weeks in advance^23,25, a remarkable improvement since the early 2000s. However, due to errors originating from imperfect numerical models, the MJO prediction skill has not reached its theoretical predictability, which is known to be ~7 weeks²⁶. This indicates that there is considerable room for improvement^{23,25,26,27,28}. One of the greatest challenges in current dynamical forecast systems is the fast damping of the MJO signal as the forecast lead time increases, which results in a rapid decrease of forecast skill^25,29,30. This systematic damping of the MJO convection signal is particularly apparent when the MJO starts over the Indian Ocean and is expected to propagate through the Maritime Continent and move further into the western Pacific. The frequency of MJO events not crossing the Maritime Continent in forecast models is more than twice as large as it is in observations³⁰, known as the Maritime Continent prediction barrier^{25,29,30,31,32}. Given that the MJO prediction alone presents considerable systematic biases, the global weather forecast beyond 2 weeks is an even more daunting task.

Model deficiencies in simulating realistic MJO events are partially due to our poor understanding of the underlying physics. Therefore, more efforts on process-level diagnostics are suggested to further improve MJO simulation and prediction²³. Concurrently, post-processing of numerical forecasts has been proven to be a powerful tool to improve forecasts when models display systematic biases³³. A recent study has shown an increase of MJO prediction skill by correcting model biases with a linear statistical model³⁴. Deep learning (DL) techniques have been proven to be a powerful tool for identifying weather and climate patterns^35,36,37, sub-grid scale parameterizations^38,39, weather and climate predictions^{40,41,42,43,44}, and post-processing of numerical weather forecasts (shorter than 7 days^43,44). However, post-processing with DL methods has not yet been applied to MJO forecasts.

In this study, we utilize DL as a bias correction method to improve MJO forecasts. We demonstrate that the DL post-processing substantially reduces the MJO forecast errors from the state-of-the-art dynamical forecasting systems, thus making strides towards improving global extended range forecasts.

Results

Improved MJO prediction with deep learning bias correction

Figure 1 highlights the advantage of Deep learning bias correction (DL-correction) for MJO forecasts. It shows the multi-model mean of predicted Real-time Multivariate MJO indices (RMMs) composited on a phase-space diagram⁴⁵ as a function of initial MJO phases and forecast lead days from day 1 to day 28 (4 weeks). Predictions from the original Subseasonal-to-seasonal (S2S) reforecasts and from the DL-correction for each forecast target year are composited and compared with observations (see “Methods” section). The composite results of individual models are displayed in Supplementary Fig. 2. Several key results demonstrate the benefit of the DL-correction on MJO forecasts throughout all MJO phases. A large discrepancy between S2S reforecasts and observations is clearly shown on day 1. Most S2S models forecast either weaker (phases 2 and 3) or stronger amplitude (phases 6 and 7), or phase (θ) shifts relative to the observations on day 1. The DL-correction reduces those systematic errors, making the day 1 and the following forecasts closer to the observations in all models and throughout all MJO phases (Fig. 1 and Supplementary Fig. 2).

**Fig. 1: Observed and predicted Madden-Julian oscillation (MJO) composites.**

To evaluate the forecast errors quantitatively, the bivariate mean-squared error (BMSE, see “Methods” section) is calculated as a function of initial MJO phases from the composites shown in Fig. 1. The amplitude error (BMSEa) appears in the S2S models from the beginning of the forecasts for most phases, with predominant errors in phases 2 and 3 (Fig. 2a). This amplitude error reduces substantially when DL-correction is applied (Fig. 2c). Whether amplitude errors are large or small in individual S2S models, they all become similar after the DL-correction (Supplementary Fig. 3). For example, during the first 2 weeks, the ECMWF-Cy43r3 has the largest MJO amplitude error in phases 2 and 3, while the NASA-GEOS5 possesses a large amplitude error in phases 6 and 7 (Supplementary Figs. 2 and 3). Nevertheless, these amplitude errors become negligible after DL-correction (Supplementary Fig. 3). The BMSEa averaged over 4 weeks and eight phases is summarized in Fig. 3. After DL-correction, BMSEa in the S2S reforecasts is reduced by about 90% in the multi-model mean and by about 70–94% in individual models (Fig. 3).

**Fig. 2: Forecast errors in multi-model mean S2S reforecasts and DL-corrections.**

**Fig. 3: Averaged forecast errors in individual models.**

The majority of S2S models show large errors emanating from MJO phase (BMSEp) (Fig. 2b, Supplementary Fig. 4), indicating the inability of current forecasting systems to predict the main location of the MJO realistically²⁵. The phase error reduces substantially in all S2S reforecasts by DL-correction (Fig. 2d and Supplementary Fig. 4). The BMSEp averaged over 4 weeks and eight phases is reduced by about 77% after DL-correction in the multi-model mean and by about 45–90% in individual models (Fig. 3). This indicates that, in addition to the amplitude, the MJO location can be better forecasted by applying the DL-correction. Note that when the BMSE is calculated with individual MJO events rather than the composite, the reduction of error is clearly shown as well (Supplementary Fig. 5). Bias correction via the multi-linear regression (MLR, see “Methods” section) model also reduces the forecast errors (Supplementary Fig. 6), but not as much as the DL-correction. The multi-model mean BMSE from the DL-correction is reduced by about 65% compared to the MLR-correction when averaged over 1 week and eight phases and by 24% over 4 weeks (Supplementary Fig. 6). In particular, with DL-correction, BMSEa during the first week is reduced by about 87% compared to the MLR-correction (Supplementary Fig. 6a).

To assess prediction skill and predictability of the MJO, two additional verification metrics are applied (see “Methods” section). The multi-model mean BCOR of the DL-correction is consistently higher than the S2S models up to 4 weeks (Supplementary Fig. 7). An increased signal and reduced noise after the DL-correction, which results in a higher MJO predictability than the original ECMWF-Cy43r3 reforecasts, is also shown (Supplementary Fig. 8).

Eastward propagation of the predicted MJO

As mentioned earlier, dynamical forecasts have struggled to accurately forecast the MJO propagation over the Maritime Continent, especially when the forecast is initialized in the Indian Ocean (phases 2 and 3). Such a prediction barrier can be partly explained by the basic state moisture biases that degrade the physical processes associated with the eastward propagation of the MJO^23,24,46,47. Even the ECMWF-Cy43r3, the best MJO prediction model, simulates an exaggerated Maritime Continent barrier (Supplementary Fig. 2a). Figure 4 shows the reconstructed Outgoing Longwave Radiation (OLR) and zonal wind at 850 hPa (U850) anomalies obtained by projecting the RMMs starting from phase 2 onto the normalized eigenvectors used in RMM calculation^45,48. Compared to the well-organized eastward propagating MJO signal that crosses the Maritime Continent and through the western Pacific in the observations (Fig. 4a), the MJO signal in the ECMWF-Cy43r3 shows fast damping before the convective anomaly reaches the Maritime Continent (~120°E) (Fig. 4b). With DL-correction, however, the MJO anomalies become close to the observations beyond 2 weeks by realistically forecasting both amplitude and phase of the MJO (Fig. 4c). The improved MJO eastward propagation is mostly due to the amplification of the strongly damped MJO signal shown in most of the S2S models (Supplementary Fig. 2).

**Fig. 4: Observed and forecasted Madden-Julian oscillation (MJO) propagation.**

Discussion

This study demonstrates the power of Deep learning to be used as a post-processing tool to correct the systematic biases that evolve during MJO forecasts. The errors emanating from MJO amplitude and phase in the dynamical model forecasts are both reduced substantially by DL-correction. The results show that the performance of poor models becomes comparable to the best model after DL-correction. This implies that the differences in the model’s performance mainly originate from the systematic errors, rather than the random errors, and the DL method effectively works to minimize them regardless of their amplitude as long as the forecast errors are systematic. Given that huge efforts have been made in operational centres to upgrade their models to reach the level of the world’s best model for MJO forecasting (i.e., the ECMWF model), our results show promise for conserving both human and computational resources. Moreover, while the characteristics of systematic biases can change as operational forecast models undergo periodic upgrades, the DL-correction method developed here can be easily adapted to appropriately reflect the changes. Most importantly, because the model is simple, the approach developed in this study can easily be implemented into real-time MJO forecasts, which in turn can help end-user preparedness and eventually protect lives and properties vulnerable to various hazardous extreme events related to the MJO.

However, although the Deep learning approach can assist in correcting model biases, continuous effort towards developing the dynamical forecast system to minimize the inherent errors is the key for making MJO forecasts reach their potential predictability. Note that the improved MJO prediction with DL-correction does not guarantee an improved prediction of MJO-related phenomena such as tropical cyclones, monsoons, and midlatitude teleconnections, because they rely on both the MJO and the background state within the model. Therefore, further improvements in dynamical models and initialization are fundamental to ultimately improve the S2S predictions.

Methods

Hindcasts and validation datasets

In this study, we use long-term reforecasts from the international Subseasonal-to-Seasonal prediction (S2S⁴⁹) and Subseasonal Experiment (SubX⁵⁰) projects, and from the NCAR Community Earth System Model v1⁵¹, which follows the SubX protocol. Hereafter, we refer to these reforecasts as S2S reforecasts for convenience. Supplementary Table 1 provides information on the eight S2S reforecast models, including initialization interval, ensemble size, reforecast period, and sample size. Note that the reforecast periods of ECMWF-Cy43r3 and KMA/UKMO-GloSea5 differ from those of the other models (Supplementary Table 1). These eight models have shown good performance in MJO prediction^24,47.

To identify the MJO events, the Real-time Multivariate MJO (RMM)⁴⁵ index is calculated with the daily mean zonal wind at 850 hPa (U850) and 200 hPa (U200) from the ECMWF Interim Reanalysis⁵² and Outgoing Longwave Radiation (OLR) from the NOAA Advanced Very High-Resolution Radiometer⁵³; these are referred to as observations for brevity. All S2S reforecasts and observations are interpolated onto a 1° longitude by 1° latitude grid. The method for calculating anomalies and RMM indices follows previous studies^24,30.

Deep learning bias-correction model

The Deep learning bias correction (hereafter, DL-correction) model utilizes the Long Short-Term Memory (LSTM), which has been proven to be powerful for time sequence modelling^54,55 (Supplementary Fig. 1). It has a cell state (c_t), which accumulates the information from the previous states (t-1) up to time t. The forget gate (f_t) controls the extent to which the previous cell state (c_{t-1}) is forgotten. The status of an input variable (X) at time t is contained in the updated state (g_t), and the input gate (i_t) determines how much of the updated state is retained in the cell state (c_t). The retained previous cell state and the gated updated state (g_t) are combined into the new cell state (c_t), which then propagates into the final state (h_t) under the control of the output gate (o_t).

In the training period, the input variables (X) are the modelled RMM1 and RMM2 indices from the S2S reforecasts, and the output variables (Y) are the observed RMM1 and RMM2. We inactivated the cell state c_{t-1} and hidden state h_{t-1} to correct the modelled data at time t, to focus on the simultaneous relationship between the input variables (i.e., modelled RMM indices) and the output variables (i.e., observed RMM indices). That is, while the LSTM is often used for predicting the time sequence of the data, we utilized the LSTM to improve the quality of the modelled data by correcting the systematic biases in the S2S models.
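To make the effect of inactivating the previous states concrete, the following is a minimal sketch of one standard LSTM step with c_{t-1} = h_{t-1} = 0. It is an illustration of the textbook gate equations under that assumption, not the authors' TensorFlow implementation; the function names, the parameter layout, and the example weights are hypothetical, and the dense output layer is omitted.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def affine(W, b, x):
    """Return W @ x + b for a weight matrix given as a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def lstm_step_no_memory(x, params):
    """One LSTM step assuming c_{t-1} = h_{t-1} = 0 (hypothetical helper).

    params maps each gate name ("i", "g", "o") to a (W, b) pair acting on x.
    The forget gate multiplies c_{t-1} = 0 and the recurrent weights
    multiply h_{t-1} = 0, so both drop out of the update.
    """
    i = [sigmoid(v) for v in affine(*params["i"], x)]    # input gate i_t
    g = [math.tanh(v) for v in affine(*params["g"], x)]  # updated state g_t
    o = [sigmoid(v) for v in affine(*params["o"], x)]    # output gate o_t
    c = [ik * gk for ik, gk in zip(i, g)]                # cell state c_t = i_t * g_t
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]     # final state h_t
    return h, c

# Illustrative weights only: two inputs (modelled RMM1, RMM2), two hidden nodes.
zeros = ([[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0])
identity = ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
example_params = {"i": zeros, "g": identity, "o": zeros}
h, c = lstm_step_no_memory([1.0, -0.5], example_params)
```

With the previous states set to zero, the step reduces to a gated feed-forward map from the modelled RMMs at time t to the hidden state at time t, which matches the stated focus on the simultaneous relationship.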

The LSTM is trained using the Adaptive Moment Estimation optimizer⁵⁶ and mean-square-error loss to optimize weights and biases. Here, the training and validation sets are the same datasets. One to three hidden layers and 3–100 nodes were tested, but additional hidden layers and nodes did not improve the DL-correction performance. To keep the process as simple and efficient as possible, the final DL-correction model uses one input layer with two nodes, one hidden layer with 10 nodes, and one output layer with two nodes. Note that adding more input variables, such as the leading principal components of OLR and zonal winds, degrades the skill (not shown); hence, only RMMs are used for both input and output variables.

DL-correction procedure

The leave-one-year-out cross-validation (LOOCV) procedure is often used for making predictions on data not used in the training period and is appropriate for a relatively small dataset. For example, to process DL-correction on the target year 1997 in ECMWF-Cy43r3, the modelled/observed RMMs of MJO events from the remaining 19 years (from 1998 to 2016) are used to train the LSTM model. Then, the weighting coefficients and biases obtained during the training period are directly applied to the modelled RMMs of MJO events in 1997 (target year). This results in DL-corrected MJO predictions in 1997. For the target year 1998, MJO events from the remaining 19 years (1997 and from 1999 to 2016) are used to train the LSTM model, and so on. A separate LSTM model is built for every target year, forecast lead time, MJO phase, and forecast model, because each has its own systematic biases. Note that, for given input data sets that were randomly selected, the LOOCV produces very similar results for every target year, indicating that the LSTM model is stable.
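The year-wise splitting described above can be sketched as follows. This is only an illustration of the bookkeeping, using the 1997–2016 ECMWF-Cy43r3 period quoted in the text; the function name is hypothetical and the training step itself is left out.

```python
def leave_one_year_out(years):
    """Yield (training_years, target_year) pairs, one per target year."""
    years = list(years)
    for target in years:
        training = [y for y in years if y != target]
        yield training, target

# ECMWF-Cy43r3 reforecast years quoted in the text: 1997 to 2016 (20 years).
splits = list(leave_one_year_out(range(1997, 2017)))
first_training, first_target = splits[0]
```

Each of the 20 splits trains on 19 years and corrects the one held-out year, so every corrected forecast comes from a model that never saw its target year.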

We also perform the DL-correction in a real forecast manner. The ECMWF-Cy43r3 reforecasts during the first 10 years (1997–2006) are used as the training period to build the LSTM model, and the remaining independent 10 years (2007–2016) are evaluated. In this real forecast procedure, biases are still significantly reduced compared to the raw S2S reforecasts (not shown), but larger than those by the LOOCV approach due to the limited training sample size.

The MJO amplitude (A) and phase (θ) for the observation (A_o and θ_o) and reforecast (A_m and θ_m) are defined as^45,47:

$${{\rm{A}}}_{{\rm{o}}}({\rm{t}})=\sqrt{{{\rm{O}}}_{1}^{2}({\rm{t}})+{{\rm{O}}}_{2}^{2}({\rm{t}})}$$

(1)

$${{\rm{A}}}_{{\rm{m}}}({\rm{t}},{\rm{\tau }})=\sqrt{{{\rm{M}}}_{1}^{2}({\rm{t}},{\rm{\tau }})+{{\rm{M}}}_{2}^{2}({\rm{t}},{\rm{\tau }})}$$

(2)

$${{\rm{\theta }}}_{{\rm{o}}}({\rm{t}})={\tan }^{-1}\left(\frac{{{\rm{O}}}_{2}({\rm{t}})}{{{\rm{O}}}_{1}({\rm{t}})}\right)$$

(3)

$${{\rm{\theta }}}_{{\rm{m}}}({\rm{t}},{\rm{\tau }}\,)={\tan }^{-1}\left(\frac{{{\rm{M}}}_{2}({\rm{t}},{\rm{\tau }})}{{{\rm{M}}}_{1}({\rm{t}},{\rm{\tau }})}\right)$$

(4)

where ${{\rm{O}}}_{1}({\rm{t}})$ and ${{\rm{O}}}_{2}({\rm{t}})$ are the observed RMM1 and RMM2 at time t, and ${{\rm{M}}}_{1}({\rm{t}},{\rm{\tau }})$ and ${{\rm{M}}}_{2}({\rm{t}},{\rm{\tau }})$ are the modelled RMM1 and RMM2 at time t with a lead time of τ days. On the two-dimensional phase-space diagram⁴⁵, the MJO phase is defined as the azimuth of the RMM1 and RMM2 combination and is usually divided into eight phases depending on the location of the MJO convection⁴⁵. The MJO amplitude is determined based on the distance of the azimuth point from the origin, and an MJO event is defined when the observed MJO amplitude (A_o) exceeds 1.0 on initial day 0. Although the MJO is most active during boreal winter and thus has the highest forecast skill in this season²⁵, we use MJO events from all seasons due to the limited sample size. For the same reason, we group two MJO phases (phases 2 & 3, 4 & 5, 6 & 7, and 8 & 1) when training the LSTM model. Note that MJO forecasts of the grouped phases generally possess similar characteristics of errors.
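As a small illustration of Eqs. (1)–(4) and the event definition, the sketch below computes amplitude, phase angle, and phase number from a pair of RMM values. Two assumptions are not stated in the text: the quadrant-aware `atan2` is used in place of the plain inverse tangent, and the phase numbering follows the commonly used phase-space convention in which phases 2 and 3 lie at negative RMM2 and phases 4 and 5 at positive RMM1. All function names are hypothetical.

```python
import math

def amplitude(rmm1, rmm2):
    """MJO amplitude: distance of the (RMM1, RMM2) point from the origin."""
    return math.hypot(rmm1, rmm2)

def phase_angle(rmm1, rmm2):
    """Azimuth of the (RMM1, RMM2) point in radians, in (-pi, pi]."""
    return math.atan2(rmm2, rmm1)

def phase_number(rmm1, rmm2):
    """Map the azimuth to phases 1-8 (assumed convention: phase 1 starts
    at 180 degrees and numbering proceeds counterclockwise in 45-degree steps)."""
    deg = math.degrees(phase_angle(rmm1, rmm2))
    return int((deg + 180.0) // 45.0) % 8 + 1

def is_mjo_event(rmm1, rmm2, threshold=1.0):
    """An event is defined when the observed amplitude exceeds 1.0 on day 0."""
    return amplitude(rmm1, rmm2) > threshold

# Phase pairs that the text groups together for training.
PHASE_GROUPS = {2: "2&3", 3: "2&3", 4: "4&5", 5: "4&5",
                6: "6&7", 7: "6&7", 8: "8&1", 1: "8&1"}

example_group = PHASE_GROUPS[phase_number(-0.5, -1.0)]
```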

The selected MJO events differ among models due to different initialization frequencies and reforecast periods (Supplementary Table 1). The number of MJO events for initial MJO phases 2 & 3 used for training and target period is listed in Supplementary Table 1 as an example, while other phases show similar event counts. All reforecasts used here are the ensemble mean. Applying the DL-correction to individual ensembles and then averaging the results shows lower performance than applying the DL-correction directly to the ensemble mean (not shown). This indicates that the DL-correction is targeted to reduce the systematic forecast errors, and it is obscured by the random errors in the individual ensemble members.

To establish a baseline for assessing the benefit of the DL-correction method, a multi-linear regression (MLR) model, a standard linear approach for post-processing, is compared. The MLR-correction is identical to the DL-correction in that it corrects RMM1 and RMM2 separately using the modelled RMMs as input and observed RMMs as output with the LOOCV procedure.
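For orientation, a linear baseline of this kind can be sketched as an ordinary least-squares fit of one observed RMM component on the two modelled components plus an intercept. This is an assumed form of the regression, since the text does not give its exact specification; the helper names and the synthetic numbers are hypothetical.

```python
def solve_linear(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                factor = M[r][col] / M[col][col]
                M[r] = [v - factor * p for v, p in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]

def fit_mlr(X, y):
    """Least-squares coefficients (a, b, c) for y ~ a*x1 + b*x2 + c."""
    rows = [(x1, x2, 1.0) for x1, x2 in X]
    AtA = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    Aty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(3)]
    return solve_linear(AtA, Aty)

def apply_mlr(coeffs, x1, x2):
    a, b, c = coeffs
    return a * x1 + b * x2 + c

# Synthetic data only: modelled (RMM1, RMM2) pairs and one "observed" component.
X = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, 1.0), (1.0, 3.0)]
y = [2.0 * x1 - 1.0 * x2 + 0.5 for x1, x2 in X]
coeffs = fit_mlr(X, y)
```

In a LOOCV setting, one such fit per output component would be estimated on the training years and applied to the held-out year, mirroring the DL-correction procedure.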

Assessment of MJO predictions

To evaluate the MJO forecast quality, the bivariate correlation coefficient (BCOR)²⁵ and bivariate mean-squared error (BMSE)⁴⁷ are calculated between the predicted and observed RMM indices as a function of forecast lead days as follows:

$${\rm{BCOR}}({\rm{\tau }})=\frac{{\sum }_{{\rm{t}}=1}^{{\rm{N}}}[{{\rm{O}}}_{1}({\rm{t}}){{\rm{M}}}_{1}({\rm{t}},{\rm{\tau }})+{{\rm{O}}}_{2}({\rm{t}}){{\rm{M}}}_{2}({\rm{t}},{\rm{\tau }})]}{\sqrt{{\sum }_{{\rm{t}}=1}^{{\rm{N}}}[{{\rm{O}}}_{1}^{2}({\rm{t}})+{{\rm{O}}}_{2}^{2}({\rm{t}})]}\sqrt{{\sum }_{{\rm{t}}=1}^{{\rm{N}}}[{{\rm{M}}}_{1}^{2}({\rm{t}},{\rm{\tau }})+{{\rm{M}}}_{2}^{2}({\rm{t}},{\rm{\tau }})]}}$$

(5)

$${\rm{BMSE}}({\rm{\tau }})=\frac{1}{{\rm{N}}}\mathop{\sum }\limits_{{\rm{t}}=1}^{{\rm{N}}}({[{{\rm{O}}}_{1}({\rm{t}})-{{\rm{M}}}_{1}({\rm{t}},{\rm{\tau }})]}^{2}+{[{{\rm{O}}}_{2}({\rm{t}})-{{\rm{M}}}_{2}({\rm{t}},{\rm{\tau }})]}^{2})$$

(6)

where N is the number of MJO events. The BMSE can be separated into the error emanating from amplitude error (BMSEa) and phase error (BMSEp)⁴⁷ as:

$${\rm{BMSE}}({\rm{\tau }})={\rm{BMSEa}}({\rm{\tau }})+{\rm{BMSEp}}({\rm{\tau }})$$

(7)

$${\rm{BMSEa}}({\rm{\tau }})=\frac{1}{{\rm{N}}}\mathop{\sum }\limits_{{\rm{t}}=1}^{{\rm{N}}}{[{{\rm{A}}}_{{\rm{m}}}({\rm{t}},{\rm{\tau }})-{{\rm{A}}}_{{\rm{o}}}({\rm{t}})]}^{2}$$

(8)

$${\rm{BMSEp}}({\rm{\tau }})=\frac{1}{{\rm{N}}}\mathop{\sum }\limits_{{\rm{t}}=1}^{{\rm{N}}}2{{\rm{A}}}_{{\rm{m}}}({\rm{t}},{\rm{\tau }}){{\rm{A}}}_{{\rm{o}}}({\rm{t}})\ast \{1-\,\cos [{{\rm{\theta }}}_{{\rm{m}}}({\rm{t}},{\rm{\tau }})-{{\rm{\theta }}}_{{\rm{o}}}({\rm{t}})]\}$$

(9)
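Equations (5)–(9) can be checked numerically with a short sketch. The functions below follow the formulas as written for a single lead time; the sample RMM pairs are made up for illustration and the function names are hypothetical. The decomposition in Eq. (7) follows from the law of cosines, so the sum of the amplitude and phase terms should reproduce the total BMSE up to rounding.

```python
import math

def bcor(obs, mod):
    """Bivariate correlation, Eq. (5); obs and mod are lists of (RMM1, RMM2)."""
    num = sum(o1 * m1 + o2 * m2 for (o1, o2), (m1, m2) in zip(obs, mod))
    den_o = math.sqrt(sum(o1 ** 2 + o2 ** 2 for o1, o2 in obs))
    den_m = math.sqrt(sum(m1 ** 2 + m2 ** 2 for m1, m2 in mod))
    return num / (den_o * den_m)

def bmse(obs, mod):
    """Bivariate mean-squared error, Eq. (6)."""
    return sum((o1 - m1) ** 2 + (o2 - m2) ** 2
               for (o1, o2), (m1, m2) in zip(obs, mod)) / len(obs)

def bmse_amplitude(obs, mod):
    """Amplitude part of the error, Eq. (8)."""
    return sum((math.hypot(*m) - math.hypot(*o)) ** 2
               for o, m in zip(obs, mod)) / len(obs)

def bmse_phase(obs, mod):
    """Phase part of the error, Eq. (9)."""
    total = 0.0
    for (o1, o2), (m1, m2) in zip(obs, mod):
        a_o, a_m = math.hypot(o1, o2), math.hypot(m1, m2)
        dtheta = math.atan2(m2, m1) - math.atan2(o2, o1)
        total += 2.0 * a_m * a_o * (1.0 - math.cos(dtheta))
    return total / len(obs)

# Made-up observed and modelled RMM pairs for three events at one lead time.
obs = [(1.2, -0.8), (-0.5, 1.5), (0.9, 1.1)]
mod = [(0.7, -0.6), (-0.1, 1.0), (1.2, 0.4)]
total = bmse(obs, mod)
parts = bmse_amplitude(obs, mod) + bmse_phase(obs, mod)
```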

The MJO potential predictability is assessed via the signal and noise²⁵ defined as:

$${\rm{Signal}}\,({\rm{\tau }})=\frac{1}{{\rm{N}}}\mathop{\sum }\limits_{{\rm{t}}=1}^{{\rm{N}}}({\overline{{{\rm{M}}}_{1}({\rm{t}},{\rm{\tau }})}}^{2}+{\overline{{{\rm{M}}}_{2}({\rm{t}},{\rm{\tau }})}}^{2})$$

(10)

$${\rm{Noise}}\,({\rm{\tau }})=\frac{1}{{\rm{N}}}\mathop{\sum }\limits_{{\rm{t}}=1}^{{\rm{N}}}\left(\overline{{{\rm{M}}^{{\prime} }_{1}({\rm{t}},{\rm{\tau }})}^{2}}+\overline{{{\rm{M}}^{{\prime} }_{2}({\rm{t}},{\rm{\tau }})}^{2}}\right)$$

(11)

where the overbar denotes the ensemble mean and the prime denotes an individual ensemble member’s deviation from the ensemble mean. In this formulation, the signal refers to the variability of the ensemble mean while the noise refers to the variability of individual forecasts around the ensemble mean (i.e., the forecast spread), and both quantities depend on the forecast lead time (τ). The ECMWF-Cy43r3 is used to estimate the signal and noise due to its relatively large ensemble size and high MJO skill.
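A minimal sketch of the signal and noise estimates at a single lead time is given below. It reads the noise as the ensemble mean of the squared member deviations, which is an interpretation consistent with the description of noise as forecast spread; the data layout, function name, and numbers are hypothetical.

```python
def signal_and_noise(events):
    """Return (signal, noise) for one lead time.

    events[t] is a list of (M1, M2) pairs, one pair per ensemble member,
    for MJO event t. Noise is taken as the ensemble mean of squared
    deviations from the ensemble mean (an assumed reading of Eq. 11).
    """
    signal = 0.0
    noise = 0.0
    for members in events:
        k = len(members)
        mean1 = sum(m1 for m1, _ in members) / k
        mean2 = sum(m2 for _, m2 in members) / k
        signal += mean1 ** 2 + mean2 ** 2
        noise += sum((m1 - mean1) ** 2 + (m2 - mean2) ** 2
                     for m1, m2 in members) / k
    n = len(events)
    return signal / n, noise / n

# Two made-up events with two ensemble members each.
events = [
    [(1.0, 0.0), (3.0, 0.0)],  # members disagree: contributes spread
    [(0.0, 2.0), (0.0, 2.0)],  # members agree: no spread
]
signal, noise = signal_and_noise(events)
```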

Confidence interval

The statistical significance test is performed with ECMWF-Cy43r3 and NCAR-CESM1 only, due to their relatively large ensemble sizes. The confidence interval of the DL-correction results is calculated using the bootstrap method. We randomly select 11 ensemble members from the S2S reforecasts, with replacement, to calculate the ensemble-averaged BMSE. This process is repeated 10,000 times, and the 2.5th and 97.5th percentile values define the 95% confidence interval. The ensemble-averaged BMSE of the DL-correction (Fig. 3, Supplementary Figs. 3, 4) is significant at the 95% confidence level if it lies outside this interval, that is, below the 2.5th or above the 97.5th percentile. The same process is performed for the reconstructed OLR and U850 to check whether the composited anomalies from the DL-correction are significantly different from the S2S forecast results (Fig. 4).
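The resampling described above can be sketched generically. In the sketch, `statistic` stands in for the ensemble-averaged BMSE computed from the resampled members; here it is replaced by a simple mean of made-up per-member values, which is a simplification for illustration only. The function names, the seed, and the linear-interpolation percentile rule are assumptions.

```python
import random

def percentile(sorted_values, q):
    """Linear-interpolation percentile of an ascending list, q in [0, 100]."""
    pos = (len(sorted_values) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (pos - lo)

def bootstrap_interval(members, statistic, n_select=11, n_repeat=10_000, seed=0):
    """Resample members with replacement and return the 2.5th and 97.5th
    percentiles of the statistic over all repetitions."""
    rng = random.Random(seed)
    values = sorted(
        statistic(rng.choices(members, k=n_select)) for _ in range(n_repeat)
    )
    return percentile(values, 2.5), percentile(values, 97.5)

def mean(values):
    return sum(values) / len(values)

# Made-up per-member scores standing in for ensemble members.
member_scores = [0.8, 1.1, 0.9, 1.4, 1.0, 1.2, 0.7, 1.3, 1.0, 0.95, 1.05]
lower, upper = bootstrap_interval(member_scores, mean)

def is_significant(value, lower, upper):
    """A corrected value is significant if it falls outside the interval."""
    return value < lower or value > upper
```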

Data availability

Data related to this paper can be downloaded from: ERA-Interim, http://apps.ecmwf.int/datasets/data/interim_full_daily; NOAA OLR, https://www.esrl.noaa.gov/psd/data/gridded/data.interp_OLR.html; The SubX and NCAR-CESM1 reforecasts, http://iridl.ldeo.columbia.edu/SOURCES/.Models/.SubX; The S2S reforecasts, https://apps.ecmwf.int/datasets/data/s2s/. The data that support the findings of this study are available at https://zenodo.org/record/4601794 and from the corresponding author upon reasonable request.

Code availability

TensorFlow (https://www.tensorflow.org) libraries were implemented to formulate the forecast model using the LSTM. The codes used in the current analysis are available at https://zenodo.org/record/4601794 and from the corresponding author upon reasonable request.

References

E. National Academies of Sciences, Medicine, Next Generation Earth System Prediction: Strategies for Subseasonal to Seasonal Forecasts. (The National Academies Press, Washington, DC, 2016), pp. 350.
Mariotti, A. et al. Windows of opportunity for skillful forecasts subseasonal to seasonal and beyond. B Am. Meteorol. Soc. 101, E608–E625 (2020).
Article Google Scholar
Madden, R. & Julian, P. Further evidence of global-scale, 5-day pressure waves. J. Atmos. Sci. 29, 1464 (1972).
Article ADS Google Scholar
Sardeshmukh, P. D. & Hoskins, B. J. The generation of global rotational flow by steady idealized tropical divergence. J. Atmos. Sci. 45, 1228–1251 (1988).
Article ADS Google Scholar
Hoskins, B. J. & Karoly, D. J. The steady linear response of a spherical atmosphere to thermal and orographic forcing. J. Atmos. Sci. 38, 1179–1196 (1981).
Article ADS Google Scholar
Ferranti, L., Palmer, T. N., Molteni, F. & Klinker, E. Tropical-extratropical interaction associated with the 30–60 day oscillation and its impact on medium and extended range prediction. J. Atmos. Sci. 47, 2177–2199 (1989).
Article ADS Google Scholar
Stan, C. et al. Review of tropical-extratropical teleconnections on intraseasonal time scales. Rev. Geophys. 55, 902–937 (2017).
Article ADS Google Scholar
Klotzbach, P. J. On the Madden–Julian oscillation–atlantic hurricane relationship. J. Clim. 23, 282–293 (2010).
Article ADS Google Scholar
Maloney, E. D. & Hartmann, D. L. Modulation of Eastern North Pacific Hurricanes by the Madden–Julian oscillation. J. Clim. 13, 1451–1460 (2000).
Article ADS Google Scholar
Bond, N. A. & Vecchi, G. A. The influence of the Madden–Julian oscillation on precipitation in Oregon and Washington. Weather Forecast. 18, 600–613 (2003).
Article ADS Google Scholar
Jeong, J., Kim, B., Ho, C. & Noh, Y. Systematic variation in wintertime precipitation in East Asia by MJO-induced extratropical vertical motion. J. Clim. 21, 788–801 (2008).
Article ADS Google Scholar
J.-H. Jeong, C.-H. Ho, B.-M. Kim, W.-T. Kwon, Influence of the Madden-Julian Oscillation on wintertime surface air temperature and cold surges in east Asia. J. Geophys. Res. Atmos. 110, (2005).
Zheng, C., Chang, E. K. M., Kim, H., Zhang, M. & Wang, W. Impacts of the Madden–Julian oscillation on storm-track activity, surface air temperature, and precipitation over North America. J. Clim. 31, 6113–6134 (2018).
Article ADS Google Scholar
Deng, Y. & Jiang, T. Y. Intraseasonal modulation of the North Pacific storm track by tropical convection in boreal winter. J. Clim. 24, 1122–1137 (2011).
Article ADS Google Scholar
Wang, J., Kim, H.-M., Chang, E. K. M. & Son, S.-W. Modulation of the MJO and North Pacific Storm track relationship by the QBO. J. Geophys. Res.: Atmos. 123, 3976–3992 (2018).
Article ADS Google Scholar
Henderson, S. A., Maloney, E. D. & Barnes, E. A. The influence of the Madden-Julian oscillation on Northern Hemisphere winter blocking. J. Clim. 29, 4597–4616 (2016).
Article ADS Google Scholar
Guan, B., Waliser, D. E., Molotch, N. P., Fetzer, E. J. & Neiman, P. J. Does the Madden–Julian oscillation influence wintertime atmospheric rivers and snowpack in the Sierra Nevada? Mon. Weather Rev. 140, 325–342 (2012).
Article ADS Google Scholar
Mundhenk, B. D., Barnes, E. A. & Maloney, E. D. All-season climatology and variability of atmospheric river frequencies over the North Pacific. J. Clim. 29, 4885–4903 (2016).
Article ADS Google Scholar
Guan, B. & Waliser, D. E. Detection of atmospheric rivers: evaluation and application of an algorithm for global studies. J. Geophys Res-Atmos. 120, 12514–12535 (2015).
Article ADS Google Scholar
Tippett, M. K. Robustness of relations between the MJO and U.S. Tornado occurrence. Mon. Weather Rev. 146, 3873–3884 (2018).
Article ADS Google Scholar
Yoo, C., Lee, S. & Feldstein, S. B. Arctic response to an MJO-like tropical heating in an idealized GCM. J. Atmos. Sci. 69, 2379–2393 (2012).
Article ADS Google Scholar
Lee, H.-J. & Seo, K.-H. Impact of the Madden-Julian oscillation on Antarctic sea ice and its dynamical mechanism. Sci. Rep. 9, 10761 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Jiang, X. et al. Fifty years of research on the Madden-Julian oscillation: recent progress, challenges, and perspectives. J. Geophys. Res. Atmos. 125, e2019JD030911 (2020).
ADS Google Scholar
Kim, H., Janiga, M. A. & Pegion, K. MJO propagation processes and mean biases in the SubX and S2S reforecasts. J. Geophys. Res.: Atmospheres 124, 9314–9331 (2019).
Article ADS Google Scholar
Kim, H., Vitart, F. & Waliser, D. E. Prediction of the Madden–Julian oscillation: a review. J. Clim. 31, 9425–9443 (2018).
Article ADS Google Scholar
D. Waliser, in Intraseasonal Variability in the Atmosphere-Ocean Climate System. (Springer Berlin Heidelberg, Berlin, Heidelberg, 2005), pp. 389-423.
C. Zhang, Madden-Julian oscillation. Rev. Geophys. 43, (2005).
Zhang, C. Madden–Julian oscillation: bridging weather and climate. Bull. Am. Meteorol. Soc. 94, 1849–1870 (2013).
Article ADS Google Scholar
Kim, H.-M. et al. MJO propagation across the maritime continent in the ECMWF ensemble prediction system. J. Clim. 29, 3973–3988 (2016).
Article ADS Google Scholar
Vitart, F. Madden—Julian oscillation prediction and teleconnections in the S2S database. Q. J. R. Meteorological Soc. 143, 2210–2220 (2017).
Article ADS Google Scholar
Vitart, F. & Molteni, F. Simulation of the Madden– Julian oscillation and its teleconnections in the ECMWF forecast system. Quart. J. R. Meteorological Soc. 136, 842–855 (2010).
Article ADS Google Scholar
Neena, J. M., Lee, J. Y., Waliser, D., Wang, B. & Jiang, X. Predictability of the Madden–Julian oscillation in the intraseasonal variability hindcast experiment (ISVHE)*. J. Clim. 27, 4531–4543 (2014).
Kang, I. S. & Shukla, J. in The Asian Monsoon, pp. 585–612 (Springer Praxis Books, Springer, Berlin, Heidelberg, 2006).
Wu, J. & Jin, F.-F. Improving the MJO forecast of S2S operation models by correcting their biases in linear dynamics. Geophys. Res. Lett. 48, e2020GL091930 (2021).
Barnes, E. A., Hurrell, J. W., Ebert-Uphoff, I., Anderson, C. & Anderson, D. Viewing forced climate patterns through an AI lens. Geophys. Res. Lett. 46, 13389–13398 (2019).
Toms, B., Kashinath, K., Prabhat, M. & Yang, D. Deep learning for scientific inference from geophysical data: the Madden-Julian oscillation as a test case. arXiv preprint arXiv:1902.04621 (2019).
Barnes, E. et al. Indicator patterns of forced change learned by an artificial neural network. J. Adv. Modeling Earth Syst. 12, e2020MS002195 (2020).
Gentine, P., Pritchard, M., Rasp, S., Reinaudi, G. & Yacalis, G. Could machine learning break the convection parameterization deadlock? Geophys. Res. Lett. 45, 5742–5751 (2018).
Rasp, S., Pritchard, M. S. & Gentine, P. Deep learning to represent subgrid processes in climate models. Proc. Natl. Acad. Sci. 115, 9684–9689 (2018).
Arcomano, T. et al. A machine learning-based global atmospheric forecast model. Geophys. Res. Lett. 47, e2020GL087776 (2020).
Weyn, J. A., Durran, D. R. & Caruana, R. Can machines learn to predict weather? Using deep learning to predict gridded 500-hPa geopotential height from historical weather data. J. Adv. Modeling Earth Syst. 11, 2680–2693 (2019).
Ham, Y.-G., Kim, J.-H. & Luo, J.-J. Deep learning for multi-year ENSO forecasts. Nature 573, 568–572 (2019).
Rasp, S. & Lerch, S. Neural networks for postprocessing ensemble weather forecasts. Mon. Weather Rev. 146, 3885–3900 (2018).
Chapman, W. E., Subramanian, A. C., Delle Monache, L., Xie, S. P. & Ralph, F. M. Improving atmospheric river forecasts with machine learning. Geophys. Res. Lett. 46, 10627–10635 (2019).
Wheeler, M. C. & Hendon, H. H. An all-season real-time multivariate MJO index: Development of an index for monitoring and prediction. Mon. Weather Rev. 132, 1917–1932 (2004).
Jiang, X. Key processes for the eastward propagation of the Madden-Julian Oscillation based on multimodel simulations. J. Geophys. Res. Atmos. 122, 755–770 (2017).
Lim, Y., Son, S.-W. & Kim, D. MJO prediction skill of the subseasonal-to-seasonal prediction models. J. Clim. 31, 4075–4094 (2018).
Kim, H.-M., Webster, P. J., Toma, V. E. & Kim, D. Predictability and prediction skill of the MJO in two operational forecasting systems. J. Clim. 27, 5364–5378 (2014).
Vitart, F. et al. The subseasonal to seasonal (S2S) prediction project database. Bull. Am. Meteorol. Soc. 98, 163–173 (2017).
Pegion, K. et al. The subseasonal experiment (SubX): a multimodel subseasonal prediction experiment. Bull. Am. Meteorol. Soc. 100, 2043–2060 (2019).
Richter, J. et al. Subseasonal prediction with and without a well-represented stratosphere in CESM1. Weather Forecast. 35, 2589–2602 (2020).
Dee, D. P. et al. The ERA-interim reanalysis: configuration and performance of the data assimilation system. Q. J. R. Meteorological Soc. 137, 553–597 (2011).
Liebmann, B. & Smith, C. A. Description of a complete (interpolated) outgoing longwave radiation dataset. Bull. Am. Meteorol. Soc. 77, 1275–1277 (1996).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Sherstinsky, A. Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Phys. D: Nonlinear Phenom. 404, 132306 (2020).
Kingma, D. & Ba, J. Adam: a method for stochastic optimization. International Conference on Learning Representations (2014).

Acknowledgements

H.K. was supported by NSF grant AGS-1652289 and the Brain Pool program funded by the Ministry of Science and ICT through the National Research Foundation of Korea (NRF) 2019H1D3A2A01102234. Y.G.H. and Y.S.J. were supported by the NRF under Grant No. NRF-2020R1A2C2101025. S.W.S. was supported by the Korea Meteorological Administration Research and Development Program under Grant KMI2020-01010.

Author information

Authors and Affiliations

School of Marine and Atmospheric Sciences, Stony Brook University, New York, NY, USA
H. Kim
Department of Oceanography, Chonnam National University, Gwangju, South Korea
Y. G. Ham & Y. S. Joo
School of Earth and Environmental Sciences, Seoul National University, Seoul, South Korea
S. W. Son

Contributions

H.K. designed the research, performed the model runs, analyzed the model output, and generated the figures. H.K., Y.G.H. and Y.S.J. built the LSTM model. H.K., Y.G.H., Y.S.J. and S.W.S. discussed the results and contributed to the writing of the manuscript.

Corresponding author

Correspondence to H. Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Samson Hagos, Aneesh Subramanian and Baoqiang Xiang for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kim, H., Ham, Y.G., Joo, Y.S. et al. Deep learning for bias correction of MJO prediction. Nat Commun 12, 3087 (2021). https://doi.org/10.1038/s41467-021-23406-3

Received: 29 September 2020
Accepted: 22 April 2021
Published: 25 May 2021
DOI: https://doi.org/10.1038/s41467-021-23406-3
