Introduction

Tropical cyclones (TCs) are among the most harmful natural hazards in terms or both economic and human cost. There is, therefore, great interest in the prediction of the seasonal activity and particularly landfall of TCs among many organisations, including governments and business. There is demand for forecasts with longer lead times of landfall to permit planning and preparation1. The South China Sea region is of special importance as the highest value at-risk region to tropical cyclones in the world. The region includes the largest mega-city in the world2, an exposed GDP of over 1 Trillion US$ and accounts for a third of global shipping3. Globally, it is also one of the coastlines most affected by TCs. The seasonal forecasting of TC activity in this region is, therefore, of particular interest. Existing forecasts are typically issued only just before the start of the TC season in March/April and are often only on the basin scale rather than landfall4.

TC activity in the western North Pacific has significant inter-annual variability5 that is related to the El Niño-Southern Oscillation (ENSO) phenomenon6,7. While it is generally accepted that ENSO is not strongly correlated with contemporaneous basin TC counts8, a delayed suppression of TC activity in response to strong El Niño events linked to changes in large-scale circulation has been reported9. All the current seasonal forecasting models for South China landfall frequency are based on some level of ENSO sea surface temperature (SST) predictability10,11,12,13. However, none of these forecasts can be issued before the Spring. Subsurface ocean temperature is a known important variable in seasonal prediction and has been assimilated into dynamical ocean-atmosphere forecasting models since the 1990s14. However, the subsurface ocean temperature has only been used with short lead times of 3 months or less for Indian Summer Monsoon rain15 and seasonal Atlantic hurricane activity16. Here, we show a step change in the forecast lead time of an atmospheric phenomenon, landfalling TCs, to 1 year using equatorial Pacific subsurface ocean temperature.

Results

Subsurface ocean temperature and landfall

We find that Pacific equatorial subsurface ocean temperature anomaly around the date line, between the depths of 100–300 m, is strongly anti-correlated with the following year’s South China tropical cyclone landfall count. The anomaly in prior summer (June–August, JJA-1) was found to have the strongest anti-correlation and is shown in Fig. 1a. The region of significant anti-correlation is largely reduced outside the depth range 100–300 m. This shows that the predictive subsurface ocean temperature signal is isolated and not connected with the surface at that time. Temperature anomaly information from below 100 m near the thermocline depths is needed. In Fig. 1b, c, we show the mean and inter-annual standard deviation of equatorial Pacific summer subsurface temperature, respectively. The maximum anti-correlation occurs at the depth of the thermocline. At this depth the vertical gradient of temperature is a maximum separating the Pacific warm pool water above from cold water below. This is also the depth at which the maximum inter-annual variability occurs.

Fig. 1: Equatorial (5S–5N) Pacific longitude-depth profiles.
figure 1

Pearson’s correlation coefficient (r) of May–November South China landfall counts and the preceding year’s June–August (JJA-1) subsurface ocean temperature anomaly in a. Correlations significant at the 99% confidence level using a two-tailed t-test are shaded. Climatological b mean, and c inter-annual standard deviation of JJA subsurface ocean temperature. Black box shows region selected as subNiño4 predictor index. Dotted white line is the thermocline defined at 20 C.

To further investigate the location of predictability we calculate the correlation of Pacific summer equatorial subsurface ocean temperature anomaly (JJA-1) averaged between depths of 100–300 m with South China TC landfall count (Fig. 2a). The region of correlation significant at the 99% confidence level extends widely in the equatorial Central Pacific centred near the date line, it has a similar lateral extent to the standard Niño 4 index definition, which includes the Pacific warm pool. In the vertical the subNiño4 region is also bounded by the eastward Pacific equatorial undercurrent at the same depths as identified here17. We define the subNiño4 index as the mean JJA-1 subsurface ocean temperature anomaly at the bottom of the warm pool averaged over depths 100–300 m and the Niño 4 region, 160E–150W, 5S–5N. The Pearson’s correlation coefficient (r) of subNiño4 with the next year’s seasonal (May–November) South China landfall count is r = 0.67 (p < 0.001). Therefore, cold (warm) summer subsurface ocean temperatures one year prior lead to high (low) South China landfall counts the next year. The correlation of subNiño4 with landfall counts derived from other TC warning agencies are also all highly significant (p < 0.01) (Supplementary Table 1). We note that the correlation is sensitive to the lifetime maximum intensity of the TCs making landfall (Supplementary Table 1). The prior summer sea surface temperature (SST) in the Pacific is not correlated with the following year’s South China landfall count (Fig. 2b). The basin wide accumulated cyclone energy (ACE) is also correlated with the prior summer subNiño4 index (r = 0.45, p = 0.006). However, the correlation of the subNiño4 index with total number of tropical cyclones in the North West Pacific basin is weak (r = –0.26, p = 0.12).

Fig. 2: Correlation of tropical Pacific summer subsurface ocean temperature and SST with following year’s South China landfall counts.
figure 2

Pearson’s correlation coefficient (r) between May–November South China landfall counts and the preceding year’s June–August (JJA-1) a 100–300 m subsurface ocean temperature, and b sea surface temperature. The correlation coefficient for each variable averaged across the black box (Niño 4) is shown. Correlations significant at the 99% confidence level using a two-tailed t-test are shaded. Yellow markers show position of the landfall events in south China.

The source of predictability a year in advance lies in the temperature below the ocean surface. The depth of the equatorial thermocline on the date line is an alternative good predictor of the next year’s South China landfall count (r = 0.63, p < 0.001) although not as powerful as the subNiño4 index. The magnitude and location of the correlation is robust across three different subsurface ocean temperature products we have examined (Supplementary Fig. 1).

Subsurface ocean temperature and landfall factors

The difference of May–November tropical Pacific SST anomaly for those years following cool and warm subNiño4 anomalies is shown in Fig. 3a. Prior summer negative subNiño4 anomalies lead to a central La Niña type SST anomaly pattern. A cool region extends from the equatorial East Pacific into the Central Pacific with warmer SSTs to the East of the Philippines. This pattern has very high spatial correlation (r = 0.95) to the May-Nov Niño 3.4 SST anomaly field over the mapped region (Supplementary Fig. 2).

Fig. 3: Tropical Pacific anomalies associated with increased South China landfall.
figure 3

May–November anomalies of a sea surface temperature, and b wind at 500 hPa. Anomalies calculated as difference between mean lower and upper subNiño4 JJA-1 tercile years. Lower and upper tercile thresholds are –0.15 and 0.30 C, respectively. Magnitude of wind vector anomaly is shaded, arrows show direction.

We now examine how the prior summer subNiño4 index relates to key variables in the landfall track region during the TC season. The only regionally consistent signal over the entire track region is found in the winds at the steering level (approximately centred at 500 hPa). Wind anomalies over both the South China Sea and Philippine Sea enhance the northwest steering towards landfall on the South China coast (Fig. 3b). The zonal wind anomaly for low-high subNiño4 JJA-1 and May–November Niño 3.4 (Supplementary Fig. 2) also have very high spatial correlation (r = 0.95) over the mapped region for the 500 hPa zonal wind anomaly. Prior summer subNiño4 index anomalies lead to diverse regional changes in SST, vertical wind shear, low-level relative vorticity and mid-level relative humidity (Fig. 4). These other factors are variable and even oppose each other in the two seas. There is TC favourable enhanced mid-level relative humidity and low-level vorticity, but unfavourable lower SST and enhanced vertical wind shear over the the South China Sea. In contrast, the Philippine Sea is characterised by favourable warmer SST and reduced vertical wind shear in the South, but unfavourable reduced low-level vorticity and some reduced mid-level relative humidity in the West. Only the 500 hPa geopotential height and zonal wind anomalies are consistent across the entire region with the enhanced westward steering to South China.

Fig. 4: Regional anomalies associated with low subNiño4 JJA-1 years.
figure 4

May–November anomalies of a SST, b geopotential height at 500 hPa, c relative humidity at 500 hPa, d relative vorticity at 850 hPa, e magnitude of vertical wind shear between 850 hPa and 200 hPa, and f zonal wind at 500 hPa. Anomalies calculated as difference between mean lower and upper subNiño4 JJA-1 tercile years. Lower and upper tercile thresholds are –0.15 and 0.30 C, respectively.

We find that negative subNiño4 anomalies lead to a small shift in genesis location of landfalling storms toward the South China coast (Supplementary Fig. 3) although this difference is not statistically significant (p > 0.1). This lack of significant genesis shift is different to known basin wide genesis shifts under ENSO conditions6,18. The relationship of landfall with the basin storm total itself is weak (r = 0.28, p = 0.09), suggesting why previous studies on basin wide activity may give little insight into South China landfall statistics.

Predictive model performance

We now test the performance of the subNiño4 index as a predictor of South China Landfall count in a regression model validated using a leave-one-out cross-validation. Figure 5 shows time series of observations and cross-validated linear model predictions and an 11-year rolling correlation between the two showing decadal variation of r. The correlation coefficient after cross-validation is r = 0.61 (p < 0.001). The linear model outperforms the Poisson regression model in terms of correlation, root mean square error (RMSE) and skill. This is likely because the relationship between the subNiño4 index and landfall count is more linear than log-linear (Supplementary Fig. 4).

Fig. 5: Time series of number of tropical cyclones making landfall in South China (NLT).
figure 5

Observed value (blue) and cross-validated linear model prediction made using subNiño4 with 90% confidence bounds (orange). An 11-year rolling correlation coefficient (r) the two time series is shown in dashed yellow. The thin dashed line shows the r at which the 11-year rolling correlation is significant at the 95% confidence level using a two-tailed t-test (r = 0.60).

We also perform a double cross-validation as a further test of the robustness of subNiño4 as a predictor of South China landfall. This technique is designed to remove biases due to predictor selection, as well as model fitting, being performed using data also used in validation. The double cross-validation process reveals that the optimum choice for a predictor box (of the same size) is actually 8 East and 4 North of the Niño 4 box, but still at the original subNiño4 100–300 m depth range and JJA-1 time window. The double cross-validated correlation is 0.65 (p < 0.001). This is higher than the simple single cross-validation correlation because the subNiño4 box was not chosen using optimisation methods.

We find that after double cross-validation the model predictions have a skill of 23% against a climatology baseline. This is similar to a previously reported 37% skill in forecasting South China landfall issued in the April just before the TC season13. However, that method was not subjected to double cross-validation and is likely over-fitted with a lower true skill. Our forecast has only a slightly lower skill, but it is much simpler, likely more robust and crucially can be issued a year in advance. This is a step change.

Discussion

Our prior summer subNiño4 index is correlated with the following year’s TC season (May–November) mean Niño 3.4 index (r = 0.53, p < 0.001). The mean TC season Niño 3.4 SST index has the strongest correlation of all traditional ENSO indices with the seasonal South China landfall count (r = –0.46, p =  0.004). The Modoki index has no significant correlation (r = –0.16, p = 0.33). We therefore propose that the subNiño4 index has predictive power because it precedes surface ENSO conditions during the TC season, which in turn influence tropical cyclone steering towards landfall in South China. A more negative summer subNiño4 precedes a La Niña the next year and more landfall in South China.

Current theories of ENSO are intimately associated with subsurface ocean temperature. These originate with Bjerknes19 who proposed a coupled ocean-atmosphere feedback mechanism, which enhances warm east Pacific SST anomalies via ocean processes, including zonal advection, local upwelling, and thermocline feedback. This led to the delayed oscillator model based on Rossby wave reflection at the ocean western boundary20 and the recharge-discharge model21,22 based on discharge of warm water through Sverdrup transport. Observations of subsurface temperature have been shown to lead surface ENSO signals by up to 15 months23,24—the lead time scale we have identified here for TC forecasting. Statistical prediction models of ENSO based on these ideas helped break through the “spring persistence barrier”25,26.

What are the physical mechanisms by which the subNiño4 index becomes a predictor of landfall? The depth of the thermocline and its spatial integral, the warm water volume, are well known as important factors for the evolution of ENSO signals and have previously been used in predictive models27. Here, we are particularly concerned with the relatively shorter ENSO growth phase rather than explaining the longer term reversal. The recharge oscillator theory predicts a time of about 12 months from the West Pacific subsurface anomaly, the subNiño4 region, to reach the surface by advection and upwelling22. Coupled models also confirm the eastward advection and climatalogical tradewinds driving equatorial upwelling of the subsurface temperature anomalies from the subNiño4 region. Specifically they show that the summer subsurface temperature anomaly leads the Niño 3.4 surface region by about 12 months28. Once the anomaly has reached the surface, the SST patterns influence atmospheric circulation patterns through well known zonal shits in atmospheric diabatic heating. We have shown that La Niña type SST patterns cause more northwestward steering wind conditions, which favour landfall of TCs in South China. This steering wind anomaly is consistent with the idealised northwestward Matsuno-Gill type circulation response to enhanced equatorial atmospheric diabatic heating towards the west, which is typical of La Niña29.

Surprisingly, there is significantly more correlation (r =  0.67, p < 0.001) using the subNiño4 index as a predictor almost a year in advance than the Niño 3.4 (the most correlated index out of the standard El Niño indices and the El Niño Modoki) SST anomaly during the season itself (r = 0.46, p = 0.004). This finding is robust for all the different TC agencies. It may be explained by considering the atmospheric response when the subsurface temperature anomaly emerges at the surface. A cool (warm) SST anomaly causes less (more) cloud cover. These cloud anomalies drive surface shortwave radiative heating anomalies and this negative feedback is largest in the summer28. This radiative effect dampens the in-season SST-landfall correlation. The Niño 3.4 SST index in season is not an independent predictor: the very conditions, the diabatic heating anomalies, that favour enhanced TC steering also at the same time dampen the SST through cloudiness anomalies. However, the subNiño4 index is a leading indicator and independent of TC activity. We therefore propose that temperature anomalies at depth, before they are damped at the surface, are a stronger predictor.

The prior summer subNiño4 index also has some potential for predicting the basin wide seasonal ACE. ACE is measure of intensity, duration and number of TCs. The correlation of subNiño4 index with total number is found to be small. As subNiño4 can predict more/less steering to South China it is also therefore a measure of the TC poleward fraction. As more poleward TCs typically have larger duration and higher intensities they also enhance the seasonal ACE. Tracks typically are either more poleward (high ACE) or towards South China (low ACE) so that there is an anti-correlation between basin ACE and South China landfall (r = –0.40, p = 0.01). Therefore, there is also an opposite correlation of the basin ACE compared to TC landfall with the subNiño4 index.

Conclusion

We identify a subNiño4 index as subsurface ocean temperature anomalies averaged between depths of 100–300 m in the Pacific warm pool region. The subNiño4 index is strongly anti-correlated with the following year’s South China tropical cyclone landfall count. Pacific subsurface temperature anomalies precede the surface ENSO conditions and cause atmospheric heating anomalies that modify the steering of TCs onto South China. Surprisingly, the previous summer subNiño4 is more correlated with South China landfall activity than the ENSO signal during the season itself. A prediction based on the observed prior summer subNiño4 index can be issued almost a year in advance and is as skilful as those currently issued in the Spring just before the TC season. Perhaps the forecasts of other atmospheric variables can also be substantially extended further than currently thought using regional subsurface ocean temperature in dynamical and statistical models.

Methods

The primary tropical cyclone landfall data were based on the China Meteorological Administration (CMA) best track data set30 from the International Best Track Archive for Climate Stewardship (IBTrACS, v04r00) World Meteorological Organisation data31,32. Additional TC data from the Joint Typhoon Warning Centre (JTWC), Hong Kong Observatory (HKO), and the Japan Meteorological Agency (JMA) were all obtained via IBTrACS. A South China landfall event was defined at the time and location of the first recorded timestep when the cyclone position was within 200 km of the mainland China coast in the latitude range 18N–23N and longitude greater than 109E. This definition of landfall has the benefit of including cyclones, which affect coastal regions but whose centres do not necessarily reach land. This is more precise than, for example, a common method of counting the number of cyclones entering a box encompassing a coastal region. The South China landfall seasonal count is the number of landfall events occurring in the period May–November, which accounts for 99% of South China landfall events. Only TCs with lifetime maximum intensity >13 m s−1, the speed at which JTWC begin issuing western North Pacific tropical cyclone warnings, are included. A climatology of TCs making landfall in South China is provided in Fig. 6. Seasonal basin total accumulated cyclone energy (ACE) was calculated as the sum of the maximum sustained wind speed squared for each 3-h interval in a given year excluding periods a cyclone is marked as extra-tropical33.

Fig. 6: Climatology of TCs making landfall in South China between 1981–2017.
figure 6

Tracks of typhoons making landfall in South China (blue lines) in the CMA best track database are shown in a with genesis locations (mauve circles) and landfall locations (orange crosses). A distribution of South China landfall event month is shown in b. The distribution of seasonal number of landfalling typhoons (NLT) in South China is shown in c with a Poisson distribution of the same mean (5.32) overlaid for comparison.

The subsurface ocean temperatures are from Met Office Hadley Centre EN4 objective analyses34 with time varying bias adjustments35. The monthly means are on a 1 by 1 grid at a variety of depth levels. The ECMWF ocean reanalysis system36 (ORAS5) and the Global Ocean Data Assimilation System37 (GODAS) provided alternative subsurface ocean temperature data. Thermocline depth was estimated via linear interpolation depth to 20 C. The monthly mean sea surface temperature was from the Hadley Centre Sea Ice and Sea Surface Temperature data set (HadISST)38 on a 1 by 1 grid. The geopotential height and wind speed reanalysis data are from European Centre for Medium-range Weather Forecasts (ECMWF) reanalysis ERA539. Monthly mean fields down-sampled to 1 by 1 resolution were used in this analysis.

Prediction models for regional annual count of TCs making landfall typically use Poisson regression40,41,42 models with parameters estimated using a maximum likelihood technique. However, the assumption in a Poisson regression model that the logarithm of the expected value of the response variable varies with a linear combination of the predictors may not be optimal hence we also fit a linear regression model to our data. These have been used successfully in previous studies to model tropical cyclone landfall counts13,43,44.

In addition to choosing the right model, it is important to assess its performance fairly through cross-validation. We use a double cross-validation method comprising an inner and outer layer. The outer layer is a standard leave-one-out cross-validation43,45, whereby model fitting and validation are performed separately for every year in the data. For each year, data are split into two sets: the training set (all years except the target year), and the validation set (the target year). Model fitting is performed using only the training set and validation only on the validation set. This removes the bias due to validating the model on data which has used also been using in the fitting process and therefore improves assessment of out of sample model error. The inner layer attempts to deal with the problem of biased predictor selection by choosing the optimum predictor from an ensemble of predictors at each stage of the outer cross-validation. This process acknowledges the fact that decisions are made when choosing a predictor based on correlation analysis of the whole data set, whereas a fair analysis chooses a predictor based only on the training set.

The ensemble of predictors used in the inner layer of cross-validation was made by calculating the mean seasonal subsurface ocean temperature anomaly of a box in different positions. We sampled every position available to a box of size 50 longitude by 10 latitude (Niño 4 size) by 200 m depth and spanning 3 months within the region 146E–224E, –11N–11N (at 2 longitude, 1 latitude intervals), 50–350 m (in 50 m intervals) and the months May–September (in 1-month intervals). This creates an ensemble of 1755 predictors to choose from at each stage of the outer leave-one-out process. Extending the ranges beyond those described yields no alternative predictor choices so we believe this cross-validation to be complete.

We calculate skill scores against a climatological mean baseline using,

$${\rm{Skill}}=(1-{\rm{RMSE}}/{{\rm{RMSE}}}_{{\rm{clim}}})\times 100 \% ,$$
(1)

where RMSE is the model prediction root mean squared error, and RMSE\({}_{{\rm{clim}}}\) is the root mean squared error using the climatological mean as the baseline prediction.