Introduction

The tragic sinking of the SS El Faro vessel occurred while it was traveling from Florida to Puerto Rico1. The vessel with a crew of 33 sank about 1140 Hrs UTC on Oct. 1, 2015. As part of their official investigation into the sinking of the El Faro, the National Transportation Safety Board (NTSB) has requested us to carry out an analysis on the occurrence of rogue waves during Hurricane Joaquin around the time and location of the El Faro’s sinking2. Here, we provide a plain presentation of the main results of our analysis avoiding interpretations, considerations or claims that can be drawn from our studies.

The data suggests that the El Faro vessel was drifting at an average speed of approximately 2.5 m/s prior to its sinking2. As a result, El Faro has a higher probability to encounter a rogue wave while drifting over a period of time than that associated with an observer located at a fixed point on the ocean surface. Indeed, the encounter of a rogue wave by a moving vessel is analogous to that of a big wave that a surfer is in search of the surfer’s likelihood to encounter a big wave increases if he moves over a large area instead of staying still. Indeed, if he spans a large area the chances to encounter a large wave increase3, 4. This is a space-time effect very important for ship navigation and it cannot be overlooked. Such an effect is considered in our rogue wave analysis by way of a new probabilistic model for the exceedance probability, or occurrence frequency of a rogue wave encountered by a vessel along its navigation path3, 5. The proposed space-time model provides the basis for the next generation of wave forecast models for a predictive capability of wave extremes and early warnings for shipping companies and others to avoid dangerous areas at risk of rogue waves.

Results

Our rogue wave analysis is focused on the 1-hour sea state of Hurricane Joaquin during which the El Faro vessel sank. This will hereafter be referred to as the El Faro sea state. The wave parameters and statistical models relevant to and required for our analysis are presented in the Methods section.

Metocean parameters of Hurricane Joaquin in the region of the sinking of El Faro

We use the hindcast directional spectra predicted by WAVEWATCH III and describe the wave characteristics of the sea states generated by Hurricane Joaquin at and around the time and location where the El Faro vessel sank6. The top panel on the left of Fig. 1 shows the hourly variation of the significant wave height H s during the event. The top-right panel displays the time history of the dominant wave period T p , and the dominant wave direction, the neutral stability 10-m wind speed U 10 and direction are shown in the bottom-panels, respectively. The red vertical lines delimit the 1–hour interval during which the El Faro vessel sank.

Figure 1
figure 1

WAVEWATCH III parameters history during Hurricane Joaquin around the location where the El Faro vessel sank. (top-left) Hourly variation of the significant wave height H s , (top-right) dominant wave period T p , (bottom-left) dominant wave direction and (bottom-right) normalized U 10/U 10,max wind speed (solid line) and direction (dashed line). Maximum wind speed U 10,max  = 51 m/s. Red vertical lines delimit the 1–hour interval during which the El Faro vessel sank.

The 1-hour sea state experienced by El Faro at and around the time and location of sinking had a significant wave height of H s  ≈ 9 m and the maximum wind speed was U 10,max  = 51 m/s. Waves were multidirectional (short-crested) as indicated by the large values of both the spectral bandwidth ν and angular spreading σ θ as shown in Fig. 2.

Figure 2
figure 2

WAVEWATCH III parameters history during Hurricane Joaquin around the location where the El Faro vessel sank. (top) Hourly variation of the spectral bandwidth ν history, (center) directional spreading σ θ and (bottom) directional factor \(R=\frac{1}{2}{\sigma }_{\theta }^{2}/{\nu }^{2}\). Red vertical lines delimit the 1-hour interval during which the El Faro vessel sank.

In Table 1 we report the metocean parameters of the El Faro sea state in comparison to those of the Draupner, Andrea and Killard rogue sea states7. Note that the four sea states have similar metocean characteristics. However, El Faro is a steeper sea state as the mean wavelengh L 0 is shorter than those observed in the other three cases.

Table 1 Wave parameters and various statistics of the simulated El Faro sea state in comparison to the Andrea, Draupner and Killard rogue sea states7.

Statistical properties of Hurricane Joaquin-generated seas

The relative importance of ocean nonlinearities can be measured by integral statistics such as the coefficients of skewness λ 3 and excess kurtosis λ 40 of the zero-mean surface elevation η(t). The skewness is a measure of asymmetry, and it describes the effects of second-order bound nonlinearities on the geometry and statistics of the sea surface with higher sharper crests and shallower more rounded troughs8,9,10. The excess kurtosis indicates whether the tails of the distribution of surface elevations is heavy- or light-tailed relative to a Gaussian distribution. It comprises a dynamic component \({\lambda }_{40}^{d}\) measuring third-order quasi-resonant wave-wave interactions and a bound contribution \({\lambda }_{40}^{b}\) induced by both second- and third-order bound nonlinearities8,9,10,11,12,13.

In deep waters, the dynamic kurtosis14 depends on the Benjamin-Feir index BFI and the parameter R, a dimensionless measure of the multidirectionality of dominant waves11, 14, 15. For unidirectional (1D) waves R = 0. The bottom panel of Fig. 2 displays the hourly variations of the directional factor R during Hurricane Joaquin near the location where El Faro sank. Around the peak of the hurricane, the generated sea states are quite multidirectional (short-crested) as R > 1. As wave energy also spreads directionally, nonlinear focusing due to modulational instability effects diminishes14, 16,17,18 and becomes essentially insignificant under such realistic oceanic conditions7, 14, 19, 20.

The top panel of Fig. 3 displays the hourly variation of the Tayfun steepness μ (solid line) with associated bounds (dashed lines). The coefficient of excess kurtosis λ 40 mostly due to bound nonlinearities is shown in the center panel and the associated Λ parameter at the bottom. The red vertical lines delimit the 1-hour interval during which the El Faro vessel sank.

Figure 3
figure 3

WAVEWATCH III parameters history during Hurricane Joaquin around the location where the El Faro vessel sank. (top) Hourly variation of the Tayfun steepness μ (solid line) with bounds (dashed lines), (center) excess kurtosis λ 40 and (bottom) nonlinear coefficient Λ ~ 8λ 40/3. Red vertical lines delimit the 1-hour interval during which the El Faro vessel sank.

In Table 1 we compare the statistical parameters of the El Faro sea state and the Draupner, Andrea and Killard rogue sea states (from ref. 7). Note that the El Faro sea state has the largest directional spreading. Moreover, for all the four sea states the associated BFI are less than unity and the maximum dynamic excess kurtosis is of O(10−3) and thus negligible in comparison to the associated bound component. Thus, third-order quasi-resonant interactions, including NLS-type modulational instabilities play an insignificant role in the formation of large waves7, 14 especially as the wave spectrum broadens21 in agreement with oceanic observations available so far9, 22, 23. On the contrary, NLS instabilities have been proven to be effective in the generation of optical rogue waves24.

Higher Order Spectral (HOS) simulations of the El Faro sea state

We have performed Higher-Order pseudo-Spectral (HOS) simulations25, 26 of the El Faro sea state over an area of 4 km × 4 km for a duration of 1 hour (see Methods section for a description of the numerical method). The initial wave field conditions are defined by the WAVEWATCH III hindcast directional spectrum S(f, θ) around the time and region of the El Faro sinking as shown in Fig. 4. This is the result of a balance of the energy fluxes due to wind input (S in ), exact four-wave resonance nonlinearities (S nl ) and dissipation due to wave breaking (S ds ). Wind gustiness and currents are not modeled. Our WW3 hindcast indicates that the flux S in is balanced out by S ds . In particular, around the spectral peak 60% of wind input is lost to dissipation. This offset increases away from the peak. Any wave growth associated with S in  + S ds and S nl is accounted for in the WW3 model. It is the wave growth associated with quasi-resonant and bound harmonics nonlinear effects that is not modeled. In our study, we exploit the HOS wave solver to simulate the El-Faro sea state by accounting for quasi-resonant and bound nonlinearities up to fourth order in wave steepness. An estimate of the most likely rogue wave amplitude is then provided as discussed below. Note that both wind input and wave breaking are somewhat modeled in our HOS simulations as these are initialized with the WW3 spectrum. Clearly, our analysis suggests future studies on the relative importance of possible effects such wind gustiness27 and wave breaking28, 29 on the HOS model results.

Figure 4
figure 4

WAVEWATCH III hindcast directional spectrum S(f, θ) [m 2 s/rad] at approximately the time and location of the El-Faro sinking. The zero direction points North and angles increase westward.

The wavenumber-frequency spectrum S(k, ω) estimated from the HOS simulations is shown in Fig. 5. Here, dashed lines indicate the theoretical dispersion curves related to the first-order (1st) free waves as well as the second (2nd) and third-order (3rd) bound harmonic waves. The HOS predictions indicate that second-order nonlinearities are dominant with a weak effect of third-order nonlinear bound interactions, in agreement with recent studies of rogue sea states7. It appears that fourth-order effects are insignificant.

Figure 5
figure 5

HOS simulations of the El Faro sea state: predicted wavenumber-frequency spectrum S(k, ω) [m 2 s/rad]. Sea state duration of 1 hour over an area of 4 km × 4 km; the wave field is resolved using 1024 × 1024 Fourier modes.

The wave skewness and kurtosis rapidly reach a steady state after a few (mean) wave periods as an indication that third-order quasi-resonant wave-wave interactions are negligible in agreement with theoretical predictions14 and simulations7. Note that the theoretical narrowband (NB) predictions slightly overestimate the simulated values for skewness and excess kurtosis (see Table 1). The same trend is also observed in recent studies on rogue waves7. This is simply because NB approximations do not account for the directionality and finite spectral bandwidth of the El Faro wave spectrum.

Occurrence frequency of a rogue wave by a fixed observer: the return period of a wave whose crest height exceeds a given threshold

To describe the statistics of rogue waves encountered by an observer at a fixed point of the ocean surface, we consider the conditional return period N h (ξ) of a wave whose crest height exceeds the threshold h = ξH s , namely

$${N}_{h}(\xi )=\frac{1}{{\rm{\Pr }}[h > \xi {H}_{s}]}=\frac{1}{P(\xi )},$$
(1)

where P(ξ) is the probability or occurrence frequency of a wave crest height exceeding ξH s as encountered by a fixed observer. In other words, P(ξ) is the probability to randomly pick from a time series observed at a fixed point of the ocean a wave crest that exceeds the threshold ξH s . Equation (1) also implies that the threshold ξH s , with H s  = 4σ, is exceeded on average once every N h (ξ) waves. For weakly nonlinear random seas, the probability P is hereafter described by the third-order Tayfun-Fedele9 (TF), second-order Tayfun8 (T), second-order Forristall30 (F) and the linear Rayleigh (R) distributions (see Methods section).

Our statistical analysis of HOS wave data suggests that second-order effects are the dominant factors in shaping the probability structure of the El Faro sea state with a minor contribution of excess kurtosis effects. Such dominance is seen in Fig. 6, where the HOS numerical predictions of the conditional return period N h (ξ) of a crest exceeding the threshold ξH s are compared against the theoretical predictions based on the linear Rayleigh (R), second-order Tayfun (T) and third-order (TF) models from Eq. (17). It is noted that the HOS predictions are based on a sample population of 106 crests. In particular, N h (ξ) follows from Eq. (1) as the inverse 1/P(ξ) of the empirical probabilities of a crest height exceeding the threshold ξH s . An excellent agreement is observed between simulations and the third-order TF model up to crest amplitudes h/H s  ~ 1.5. For larger amplitudes, the associated confidence bands of the estimated empirical probabilities widen, but TF is still within the bands. Donelan and Magnusson31 suggest that the TF model agrees with the Andrea rogue wave measurements up to h/H s  ~ 1.1, concluding that TF is not suitable to predict larger rogue crest extremes (see their Fig. 7 in ref. 31). Unfortunately, their analysis is based on a much smaller sampled population of ~104 crest heights and they do not report the confidence bands associated with their probability estimates, nor they provide any parameter values to validate their data analysis. The deviation of their data from the TF model is most likely due to the relatively smaller population of crests observed. Note also that TF slightly exceeds both the T and F models as an indication that second-order effects are dominant, whereas the linear R model underestimates the return periods.

Figure 6
figure 6

HOS simulations of the El Faro sea state. Crest height scaled by the significant wave height (ξ) versus conditional return period (N h ) for the (left) Andrea, (center) Draupner and (right) Killard rogue sea states: HOS numerical predictions (□) in comparison with theoretical models: F = Forristall (blue dashed) T = second-order Tayfun (blue solid), TF = third-order (red solid) and R = Rayleigh distributions (red solid). Confidence bands are also shown (light dashes). N h (ξ) is the inverse of the exceedance probability P(ξ) = Pr[h > ξH s ]. Horizontal lines denote the rogue threshold 1.25H s 32 and 1.6H s .

For both third- and fourth-order nonlinearities, the return period N r of a wave whose crest height exceeds the rogue threshold 1.25H s  ≈ 11 m32 is nearly N r  ~ 104 for the El Faro sea state and for the simulated Andrea, Draupner and Killard rogue sea states7. This is in agreement with oceanic rogue wave measurements23, which yield roughly the same return period. Similarly, recent measurements off the west coast of Ireland33 yield N r  ~ 6 · 104. In contrast, N r  ~ 3 · 105 in a Gaussian sea.

Note that the largest simulated wave crest height exceeds the threshold 1.6H s  ≈ 14 m (see Table 1). This is exceeded on average once every 106 waves in a time series extracted at a point in third- and fourth-order seas and extremely rarely in Gaussian seas, i.e. on average once every 109 waves. This implies that rogue waves observed at a fixed point of the ocean are likely to be rare occurrences of weakly random seas, or Tayfun sea states34. Our results clearly confirm that rogue wave generation is the result of the constructive interference (focusing) of elementary waves enhanced by bound nonlinearities in agreement with the theory of stochastic wave groups developed by Fedele and Tayfun (2009)10 as an extension of Boccotti’s (2000) theory of quasi-determinism35. Our conclusions are also in agreement with observations9, 10, 12, 22, recent rogue wave analyses7, 31, 36,37,38,39,40,41 and studies on optical rogue waves caustics analogues42.

Time profile of the simulated rogue waves

The wave profile η with the largest wave crest height (>1.6H s  ≈ 14 m) observed in the time series of the surface fluctuations extracted at points randomly sparse over the simulated El Faro domain is shown in the left panel of Fig. 7. For comparison, the Draupner, Andrea and Killard rogue wave profiles are also shown7. In the same figure, the mean sea level (MSL) below the crests is also shown. The estimation of the MSL follows by low-pass filtering the measured time series of the wave surface with frequency cutoff f c  ~ f p /2, where f p is the frequency of the spectral peak43. An analysis of the kinematics44, 45 of the simulated rogue waves indicate that such waves were nearly incipient breaking28, 29, 44 suggesting that larger rogue events are less likely to occur21, 44. The saturation of the crest height is mainly due to the nonlinear dispersion and it is an energy limiter for rogue waves.

Figure 7
figure 7

Third-order HOS simulated extreme wave profiles η/η max (solid) and mean sea levels (MSL) (dashed) versus the dimensionless time t/T p for (from left to right) El Faro, Andrea, Draupner and Killard waves. η max is the maximum crest height given in Table 1. For comparisons, actual measurements (thick solid) and MSLs (tick dashed) are also shown for Andrea, Draupner and Killard. Note that the simulad El-Faro wave is shown in bold. Note that the Killard MSL is insignificant and the Andrea MSL is not available. T p is the dominant wave period (see Methods section for definitions).

The four wave profiles are very similar suggesting a common generation mechanism of the rogue events. The manner waves are generated by Hurricane Joaquin or the northerly storm of the Draupner, Andrea and Killard sea states, all four waves and their statistics cannot differ in a fundamental way from each other as the spectral shape of the four sea states is similar showing only some variations in terms of directionality or frequency characteristics.

Further, we observe a set-up below the simulated El Faro rogue wave, most likely due to the multidirectionality of the sea state. A set-up is also observed for the actual Draupner rogue wave. Indeed, recent studies showed that Draupner occurred in a crossing sea consisting of swell waves propagating at approximately 80 degrees to the wind sea46, 47. This would explain the set-up observed under the large wave43 instead of the second-order set-down normally expected48.

Space-time statistics of the sea state encountered by El Faro before sinking

The largest crest height of a wave observed in time at a given point of the ocean represents a maximum observed at that point. Clearly, the maximum wave surface height observed over a given area during a time interval, i.e. space-time extreme, is much larger than that observed at a given point. Indeed, in relatively short-crested directional seas such as those generated by hurricanes, it is very unlikely that an observed large crest at a given point in time actually coincides with the largest crest of a group of waves propagating in space-time. In contrast, in accord with Boccotti’s (2000) QD theory35, it is most likely that the sea surface was in fact much higher somewhere near the measurement point.

Space-time wave extremes can be modeled stochastically3, 4 drawing on the theory of Euler Characteristics of random fields49,50,51 and nonlinear wave statistics14. In the following, we present the Fedele’s Space-Time (FST) stochastic model for the prediction of space-time extremes3 that accounts for both second and third-order nonlinearities5. Fedele’s work3, 5 considers a 3-D non-Gaussian field η(x, y, t) in space-time over an area A for a time period of D (see Fig. 8). The area cannot be too large since the wave field may not be homogeneous. The duration should be short so that spectral changes occurring in time are not significant and the sea state can be assumed as stationary. Then, the third-order nonlinear probability \({P}_{{\rm{FST}}}^{(nl)}(\xi ;A,D)\) that the maximum surface elevation \({\eta }_{\max }\) over the area A and during the time interval D exceeds the generic threshold ξH s is described by5

$${P}_{{\rm{FST}}}^{(nl)}(\xi ;A,D)={P}_{{\rm{ST}}}({\xi }_{0};A,D)(1+\Lambda {\xi }_{0}^{2}(4{\xi }_{0}^{2}-1)),$$
(2)

where

$${P}_{{\rm{ST}}}(\xi ;A,D)={\rm{\Pr }}\{{\eta }_{\max } > \xi {H}_{s}\}=(16{M}_{3}{\xi }^{2}+4{M}_{2}\xi +{M}_{1}){P}_{{\rm{R}}}(\xi )$$
(3)

denotes the Gaussian probability of exceedance, and P R(ξ) is the Rayleigh exceedance probability of Eq. (19).

Figure 8
figure 8

(Left) the space-time (xyt) volume spanned by the El Faro vessel (base area A = 241 × 30 m 2) while drifting at the speed of 2.5 m/s over a time interval of D = 10 minutes along the path Γ is that of the slanted parallelepiped V a ; (center) the drifting vessel covers the strip area (1500 × 30 m 2) in the 10-minute interval and the associated space-time volume is that of the parallelepiped V b ; (right) if the vessel would be anchored at a location for the same duration, it would span instead the spacetime volume of the straight parallelepiped V c . The solid red arrowed line denotes the space-time path of El Faro while drifting along the path Γ. The vertical axis is time (t) and the other two axes refer to the space dimensions (x) and (y) respectively.

Here, M 1 and M 2 are the average number of 1-D and 2-D waves that can occur on the edges and boundaries of the volume Ω, and M 3 is the average number of 3-D waves that can occur within the volume3. These all depend on the directional wave spectrum and its spectral moments m ijk defined in the Methods section.

The amplitude ξ relates to ξ 0 via the Tayfun (1980) quadratic equation8

$$\xi ={\xi }_{0}+2\mu {\xi }_{0}^{2}\mathrm{.}$$
(4)

Given the probability structure of the wave surface defined by Eq. (2), the nonlinear mean maximum surface or crest height \({\overline{h}}_{{\rm{FST}}}={\xi }_{{\rm{FST}}}{H}_{s}\) attained over the area A during a time interval D is given, according to Gumbel (1958), by4, 5

$${\xi }_{{\rm{FST}}}={\overline{h}}_{{\rm{FST}}}/{H}_{s}={\xi }_{{\rm{m}}}+2\mu {\xi }_{{\rm{m}}}^{2}+\frac{{\gamma }_{e}(1+4\mu {\xi }_{{\rm{m}}})}{16{\xi }_{{\rm{m}}}-\frac{32{M}_{3}{\xi }_{{\rm{m}}}+4{M}_{2}}{16{M}_{3}{\xi }_{{\rm{m}}}^{2}+4{M}_{2}{\xi }_{{\rm{m}}}+{M}_{1}}-{\rm{\Lambda }}\frac{2{\xi }_{{\rm{m}}}(8{\xi }_{{\rm{m}}}^{2}-1)}{1+{\rm{\Lambda }}{\xi }_{{\rm{m}}}^{2}(4{\xi }_{m}^{2}-1)}},$$
(5)

where the most probable surface elevation value ξ m satisfies P ST(ξ m; A, D) = 1 (see Eq. (2)) and the Euler-Mascheroni constant γ e  ≈ 0.577.

The nonlinear mean maximum surface or crest height h T expected at a point during the time interval D follows from Eq. (5) by setting M 2 = M 3 = 0 and M 1 = N D, where \({N}_{{\rm{D}}}=D/\bar{T}\) denotes the number of waves occurring during D, and \(\bar{T}\) is the mean up-crossing period (see Methods section). The second-order form of the FST model (Λ = 0) has been implemented in WAVEWATCH III52. The linear limit follows from Eq. (5) by setting μ = 0 and Λ = 0.

The statistical interpretations of the probability \({P}_{{\rm{FST}}}^{(nl)}(\xi ;A,D)\) and associated space-time average maximum \({\overline{h}}_{{\rm{ST}}}\) are as follows. Consider an ensemble of N realizations of a stationary and homogeneous sea state of duration D, each of which has similar statistical structure to the El Faro wave field. On this basis, there would be N samples, say \(({\eta }_{\max }^{\mathrm{(1)}},\ldots ,{\eta }_{\max }^{(N)})\) of the maximum surface height \({\eta }_{\max }\) observed within the area A during the time interval D. Then, all the maximum surface heights in the ensemble will exceed the threshold \({\overline{h}}_{{\rm{FST}}}\). Clearly, the maximum surface height exceeds by far such average. Indeed, only in a few number of realizations \(N\cdot {P}_{{\rm{FST}}}^{(nl)}(\xi ;A,D)\) out of the ensemble of N sea states, the maximum surface height exceeds a threshold \(\xi {H}_{s}\gg {\overline{h}}_{{\rm{FST}}}\) much larger than the expected value. To characterize such rare occurrences in third-order nonlinear random seas one can consider the threshold h q  = ξ q H s exceeded with probability q by the maximum surface height \({\eta }_{{\rm{\max }}}\) over an area A during a sea state of duration D. This satisfies

$${P}_{{\rm{FST}}}^{(nl)}({\xi }_{q};A,D)=q\mathrm{.}$$
(6)

The statistical interpretation of h q is as follows: the maximum surface height \({\eta }_{\max }\) observed within the area A during D exceeds the threshold h q only in qN realizations of the above mentioned ensemble of N sea states.

Note that for large areas, i.e. \(\ell \gg {L}_{0}\), our FST model as any other similar models available in literature47, 53,54,55,56 will overestimate the maximum surface height over an area and time interval because they all rely on Gaussianity. This implies that there are no physical limits on the values that the surface height can attain as the Gaussian model does not account for the saturation induced by the nonlinear dispersion21 of ocean waves or wave breaking. Thus, the larger the area A or the time interval D, the greater the number of waves sampled in space-time, and unrealistically large amplitudes are likely to be sampled in a Gaussian or weakly nonlinear Gaussian sea.

This point is elaborated further and demonstrated explicitly by way of the results displayed in Fig. 9. Here, the theoretical (FST) ratio \({\overline{h}}_{{\rm{FST}}}/{\overline{h}}_{{\rm{T}}}\) as a function of the area width \(\ell /{L}_{0}\) is shown for the El Faro, Draupner and Andrea sea states respectively. The FST ratios for Draupner and Andrea are estimated using the European Reanalysis (ERA)-interim data5. For comparisons, the empirical ST ratio from the El Faro HOS simulations together with the experimental observations at the Acqua Alta tower4 are also shown. Recall that \({\overline{h}}_{{\rm{FST}}}\) is the mean maximum surface height expected over the area \({\ell }^{2}\) during a sea state of duration D = 1 hour and \({\overline{h}}_{{\rm{T}}}\) is the mean maximum surface height expected at a point. Clearly, the theoretical FST ratio for El Faro fairly agrees with the HOS simulations for small areas (\(\ell \le {L}_{0}\)), whereas it yields overestimation over larger areas. We argue that the saturation of the HOS FST ratio over larger areas is an effect of the nonlinear dispersion which is effective in limiting the wave growth as a precursor to breaking21, 44.

Figure 9
figure 9

Space-time extremes: theoretical FST ratios \({\overline{h}}_{{\rm{FST}}}/{\overline{h}}_{{\rm{T}}}\) as a function of the area width \(\ell /{L}_{0}\) for El Faro (black), Draupner (red) and Andrea (blue) sea states, where \({\overline{h}}_{{\rm{FST}}}\) is the mean maximum surface height expected over the area \({\ell }^{2}\) during a sea state of duration D = 1 hours and \({\overline{h}}_{{\rm{T}}}\) is the mean maximum surface height expected at a point. For comparisons, the empirical FST ratio from the El Faro HOS simulations (dashed line with circles) together with the experimental observations at the Acqua Alta tower (squares) are also shown4. L 0 is the mean wavelength.

Note that the FST ratios for all the three sea states are nearly the same for \(\ell \le {L}_{0}\). These results are very encouraging as they suggest possible statistical similarities and universal laws for space-time extremes in wind sea states5. Moreover, for \(\ell \sim {L}_{0}\) the mean wave surface maximum expected over the area is 1.35 times larger than that expected at a point in agreement with Acqua Alta sea observations4.

The occurrence frequency of a rogue wave by the El Faro vessel

The data suggests that the El Faro vessel was drifting at an average speed of approximately 2.5 m/s prior to its sinking. This is considered in our analysis as follows. First, define the two events R = “El Faro encounters a rogue wave along its navigation route” and S = “El Faro sinks”. We know that the event S happened. As a result, one should consider the conditional probability

$${\rm{\Pr }}[R|S]=\frac{{\rm{\Pr }}[S|R]\cdot {\rm{\Pr }}[R]}{{\rm{\Pr }}[S]}\mathrm{.}$$
(7)

Here, Pr[S] is the unconditional probability of the event that El Faro sinks. This could be estimated from worldwide statistics of sunk vessels with characteristics similar to El Faro. Pr[S|R] is the conditional probability that El Faro sinks given that the vessel encountered a rogue wave. This probability can be estimated by Monte Carlo simulations of the nonlinear interaction of the vessel with the rogue wave field.

Our rogue wave analysis provides an estimate of the unconditional probability Pr[R] that El Faro encounters a rogue wave along its navigation or drifting route by means of the exceedance probability, or occurrence frequency P e (h). This is the probability that a vessel along its navigation path encounters a rogue wave whose crest height exceeds a given threshold h. The encounter of a rogue wave by a moving vessel is analogous to that of a big wave that a surfer is in search of. His likelihood to encounter a big wave increases if he moves around a large area instead of staying still. This is a space-time effect which is very important for ship navigation and must be accounted for3, 57,58,59,60.

The exceedance probability P e (h) is formulated as follows. Consider a random wave field whose surface elevation at a given point (x, y) in a fixed frame at time t is η(x, y, t). Consider a vessel of area A that navigates through the wave field at a constant speed V along a straight path at an angle β with respect to the x axis. Define also (x e , y e ) as a cartesian frame moving with the ship. Then, the line trajectories of any point (x e , y e ) of the vessel in the fixed frame are given by

$$x={x}_{e}+V\,\cos (\beta )t,\quad y={y}_{e}+V\,\sin (\beta )t,$$
(8)

where for simplicity we assume that at time t = 0 the center of gravity of the vessel is at the origin of the fixed frame.

The surface height η c (t) encountered by the moving vessel, or equivalently the surface fluctuations measured by a wave probe installed on the ship, is

$${\eta }_{c}({x}_{e},{y}_{e},t)=\eta ({x}_{e}+V\,\cos (\beta )t,{y}_{e}+V\,\sin (\beta )t,t),$$
(9)

If η is a Gaussian wave field homogeneous in space and stationary in time, then so is η c with respect to the moving frame (x e , y e , t). The associated space-time covariance is given by

$$\begin{array}{rcl}{\rm{\Psi }}(X,Y,T) & = & \overline{{\eta }_{c}({x}_{e},{y}_{e},t){\eta }_{c}({x}_{e}+X,{y}_{e}+Y,t+T)}\\ & = & \int S(f,\theta )\cos ({k}_{x}X+{k}_{y}Y-2\pi {f}_{e}T)dfd\theta ,\end{array}$$
(10)

where \({k}_{x}=k\,\cos (\theta )\), \({k}_{y}=k\,\sin (\theta )\) and k is the wavenumber associated with the frequency f by way of the wave dispersion relation. As a result of the Doppler effect, the encountered, or apparent frequency is given by57,58,59,60

$${f}_{e}=f-kV\,\cos (\theta -\beta )/(2\pi ),$$
(11)

and S(f, θ) is the directional wave spectrum of the sea state. Note that when the vessel moves faster than waves coming from a direction θ, the apparent frequency f e  < 0 and for an observer on the ship waves appear to move away from him/her. In this case, the direction of those waves should be reversed57, i.e. θ = θ + π, and f e set as positive.

The spectral moments \({m}_{ijk}^{(e)}\) of the encountered random field readily follow from the coefficients of the Taylor series expansion of Ψ(X, Y, T) around (X = 0, Y = 0, T = 0). In particular,

$${m}_{ijk}^{(e)}=\frac{{\partial }^{i+j+k}{\rm{\Psi }}}{\partial {X}^{i}\partial {Y}^{j}\partial {T}^{k}}{|}_{X=Y=T=0}=\int S(f,\theta ){k}_{x}^{i}{k}_{y}^{j}{f}_{e}^{k}{\rm{d}}f{\rm{d}}\theta \mathrm{.}$$
(12)

The nonlinear space-time statistics can then easily processed by using the encountered spectral moments \({m}_{ijk}^{(e)}\) using the FST model3, 5, which is based on Eq. (2) as described above. Note that for generic navigation routes the encountered wave field η c is a non-stationary random process of time. Thus, the associated spectral moments will vary in time. The space-time statistics can be still computed by first approximating the route by a polygonal made of piecewise straight segments along which the random process η c is assumed stationary.

Figure 10 illustrates the HOS and theoretical predictions for the normalized nonlinear threshold h n /H s exceeded with probability 1/n, where n is the number of waves. In particular, consider an observer on the vessel moving along the straight path Γ spanned by El Faro drifting against the dominant sea direction over a time interval of 10 minutes. In space-time the observer spans the solid red line shown in Fig. 8. In this case, he has a probability P e  ~ 3 · 10−4 to encounter a wave whose crest height exceeds the threshold 1.6H s  ≈ 14 m (blue lines), and the expected spatial shape is shown in Fig. 11. If we also account for the vessel size (base area A = 241 × 30 m 2), in space-time El Faro spans the volume of the slanted parallelepiped V a shown in Fig. 8. In this case, the exceedance probability P e (V a ) further increases to 1/400 (black lines in Fig. 10). Note that if the vessel would be anchored at a location for the same duration, in spacetime it would span instead the volume of the vertical parallelepiped V c shown in the same Figure. Note that the two parallelepipeds cover the same space-time volume A × D, with the base area A and height D = 10 min. For the case of the anchored vessel, the associated exceedance probability P e (V c ) is roughly the same as P e (V a ) since El Faro was drifting at a slow speed. Larger drift speeds yield larger P e (V a ) since the vessel encounters waves more frequently than if it was anchored, because of the Doppler effect58, 59. Moreover, the drifting vessel covers the strip area (1500 × 30 m 2) in the 10-minute interval and the associated space-time volume is that of the parallelepiped V b shown in Fig. 8, which has a larger volume than that of V a . As a result, the occurrence frequency P e (V b ) of a rogue wave associated with V b is larger and it increases to ~1/100 (see red lines in Fig. 10). However, El Faro does not visit the entire volume V b , but it only spans the smaller volume V a . Thus, the conditional probability P e (V a |V b ) that the drifting El Faro encounters a rogue wave given that a rogue wave occurred over the larger spacetime volume V b is P e (V a )/P e (V b ) ~ 1/4. Furthermore, a fixed observer has a much lower probability P e  ~ 10−6 to pick randomly from a time series extracted at a point a wave whose crest height exceeds 1.6H s (see Fig. 6, TF model, black solid line). Finally, we observed that the exceedance probability P e (V a ) for the drifting El Faro does not scale linearly with time because of nonlinearities that reduce the natural dispersion of waves. Indeed, assuming that El Faro drifts over a time interval 5 times longer (50 minutes), P e (V a ) just increases roughly by 3 times, ~1/130.

Figure 10
figure 10

HOS (squares) and theoretical (solid lines) predictions for the normalized nonlinear threshold h n /H s exceeded with probability 1/n; i) along the straight path Γ spanned by El Faro while drifting at an estimated approximate average speed of 2.5 m/s over a time interval of 10 minutes (blue), ii) and also accounting for the vessel size (241 × 30 m 2) (black), and over the strip area (1500 × 30 m 2) spanned by the vessel in a 10-minute interval (red). Confidence bands are also shown (light dashes). Horizontal line denotes the threshold 1.6H s  ≈ 14 m, which is exceeded with probability 3 · 10−4, 1/400 and 1/100 for the three cases shown.

Figure 11
figure 11

HOS simulations: expected spatial shape of a rogue wave whose crest height is >1.6H s  ≈ 14 m.

Discussions

Our present studies open a new research direction on the prediction of rogue waves during hurricanes using the WW3 wave model combined with HOS simulations and the new stochastic FST model5 for the prediction of space-time wave extremes3, 4. Any wave growth associated with wind stresses, dissipation due to wave breaking and exact nonlinear resonant four-wave interactions is accounted for in the WW3 model. It is the wave growth associated with quasi-resonant and bound harmonics nonlinearities that is not modeled. Such nonlinear effects can locally increase the wave amplitude over the expected values of the WW3 simulations. In our analysis, quasi-resonant and bound nonlinearities are modeled by way of a HOS wave solver that simulated the sea state around the time and location of the accident. The HOS simulations provided an estimate of the most likely rogue wave amplitude over a given area and time interval indicating that bound nonlinearities are dominant, in agreement with recent rogue-wave studies7. Our analysis also suggests new studies on the possible effects of factors such as wind gustiness27 and wave breaking28, 29 on generating rogue waves and associated statistics.

Methods

Wave parameters

The significant wave height H s is defined as the mean value H 1/3 of the highest one-third of wave heights. It can be estimated either from a zero-crossing analysis or more easily but approximately from the wave omnidirectional spectrum \({S}_{o}(f)={\int }_{0}^{2\pi }S(f,\theta ){\rm{d}}\theta \) as H s  ≈ 4σ, where \(\sigma =\sqrt{{m}_{0}}\) is the standard deviation of surface elevations, m j  = ∫S o (f)f jdf are spectral moments. Further, S(f, θ) is the directional wave spectrum with θ as the direction of waves at frequency f, and the cyclic frequency is ω = 2πf.

The dominant wave period T p  = 2π/ω p refers to the cyclic frequency ω p of the spectral peak. The mean zero-crossing wave period T 0 is equal to 2π/ω 0, with \({\omega }_{0}=\sqrt{{m}_{2}/{m}_{0}}\). The associated wavelength L 0 = 2π/k 0 follows from the linear dispersion relation \({\omega }_{0}=\sqrt{g{k}_{0}\,\tanh ({k}_{0}d)}\), with d the water depth. The mean spectral frequency is defined as ω m  = m 1/m 0 8 and the associated mean period T m is equal to 2π/ω m . A characteristic wave steepness is defined as μ m  = k m σ, where k m is the wavenumber corresponding to the mean spectral frequency ω m 8. The following quantitites are also introduced: \({q}_{m}={k}_{m}d,{Q}_{m}=\,\tanh \,{q}_{m}\), the phase velocity c m  = ω m /k m , the group velocity c g  = c m [1 + 2q m /sinh(2q m )]/2.

The spectral bandwidth \(\nu ={({m}_{0}{m}_{2}/{m}_{1}^{2}-1)}^{1/2}\) gives a measure of the frequency spreading. The angular spreading \({\sigma }_{\theta }=\sqrt{{\int }_{0}^{2\pi }D(\theta ){(\theta -{\theta }_{m})}^{2}{\rm{d}}\theta }\), where \(D(\theta )={\int }_{0}^{\infty }S(\omega ,\theta ){\rm{d}}\omega /{\sigma }^{2}\) and \({\theta }_{m}={\int }_{0}^{2\pi }D(\theta )\theta {\rm{d}}\theta \) is the mean direction. Note that \({\omega }_{0}={\omega }_{m}\sqrt{1+{\nu }^{2}}\).

The wave skewness λ 3 and the excess kurtosis λ 40 of the zero-mean surface elevation η(t) are given by

$${\lambda }_{3}=\overline{{\eta }^{3}}/{\sigma }^{3},\quad {\lambda }_{40}=\overline{{\eta }^{4}}/{\sigma }^{4}-3\,.$$
(13)

Here, overbars imply statistical averages and σ is the standard deviation of surface wave elevations.

For second-order waves in deep water10

$${\lambda }_{3}\approx 3{\mu }_{m}(1-\nu +{\nu }^{2}),$$
(14)

and the following bounds hold61

$$3{\mu }_{m}(1-\sqrt{2}\nu +{\nu }^{2})\le {\lambda }_{3}\le 3{\mu }_{m}.$$
(15)

Here, ν is the spectral bandwidth defined above and the characteristic wave steepness μ m  = k m σ, where k m is the wavenumber corresponding to the mean spectral frequency ω m 8. For narrowband (NB) waves, ν tends to zero and the associated skewness λ 3,NB  = 3μ m 8,9,10.

For third-order nonlinear random seas the excess kurtosis

$${\lambda }_{40}={\lambda }_{40}^{d}+{\lambda }_{40}^{b}$$
(16)

comprises a dynamic component \({\lambda }_{40}^{d}\) due to nonlinear quasi-resonant wave-wave interactions11, 62 and a Stokes bound harmonic contribution \({\lambda }_{40}^{b}\) 63. In deep water it reduces to the simple form \({\lambda }_{40,NB}^{b}=18{\mu }_{m}^{2}=2{\lambda }_{3,NB}^{2}\) 11, 63, 64 where λ 3,NB is the skewness of narrowband waves8.

As for the dynamic component, Fedele14 recently revisited Janssen’s62 weakly nonlinear formulation for \({\lambda }_{40}^{d}\). In deep water, this is given in terms of a six-fold integral that depends on the Benjamin-Feir index \(BFI=\sqrt{2}{m}_{m}/v\) and the parameter \(R={\sigma }_{\theta }^{2}/2{\nu }^{2}\), which is a dimensionless measure of the multidirectionality of dominant waves11, 15. As waves become unidirectional (1D) waves R tends to zero and a random narrowband wave train becomes unstable if BFI > 165.

The Tayfun-Fedele model

We define P(ξ) as the probability that a wave crest observed at a fixed point of the ocean in time exceeds the threshold ξH s . For weakly nonlinear nonlinear seas, this probability can be described by the third-order Tayfun-Fedele model9,

$${P}_{TF}(\xi )={\rm{\Pr }}[h > \xi \,{H}_{s}]=\exp (-\,8{\xi }_{0}^{2})[1+\Lambda {\xi }_{0}^{2}(4{\xi }_{0}^{2}-1)],$$
(17)

where ξ 0 follows from the quadratic equation \(\xi ={\xi }_{0}+2\mu {\xi }_{0}^{2}\)8. Here, the Tayfun wave steepness μ = λ 3/3 is of O(μ m ) and it is a measure of second-order bound nonlinearities as it relates to the skewness λ 3 of surface elevations10. The parameter Λ = λ 40 + 2λ 22 + λ 04 is a measure of third-order nonlinearities and is a function of the fourth order cumulants λ nm of the wave surface η and its Hilbert transform \(\hat{\eta }\) 9. In particular, \({\lambda }_{22}=\overline{{\eta }^{2}{\hat{\eta }}^{2}}/{\sigma }^{4}-1\) and \({\lambda }_{04}=\overline{{\hat{\eta }}^{4}}/{\sigma }^{4}-3\). In our studies Λ is approximated solely in terms of the excess kurtosis as Λ appr = 8λ 40/3 by assuming the relations between cumulants66 λ 22 = λ 40/3 and λ 04 = λ 40. These, to date, have been proven to hold for linear and second-order narrowband waves only12. For third-order nonlinear seas, our numerical studies indicate that Λ ≈ Λ appr within a 3% relative error in agreement with observations67, 68.

For second-order seas, referred to as Tayfun sea states34, Λ = 0 only and P TF in Eq. (17) yields the Tayfun (T) distribution8

$${P}_{T}(\xi )=\exp (-\,8{\xi }_{0}^{2}).$$
(18)

For Gaussian seas, μ = 0 and Λ = 0 and P TF reduces to the Rayleigh (R) distribution

$${P}_{R}(\xi )=\exp (-\,8{\xi }^{2}).$$
(19)

Note that the Tayfun distribution represents an exact result for large second order wave crest heights and it depends solely on the steepness parameter defined as μ = λ 3/310.

The Forristall model

The exceedance probability is given by30

$${P}_{F}(\xi )=\exp (-\,{(\xi /\alpha )}^{\beta }),$$
(20)

where α = 0.3536 + 0.2561S 1 + 0.0800U r , \(\beta =2-1.7912{S}_{1}-0.5302{U}_{r}+0.284{U}_{r}^{2}\) for multi-directional (short-crested) seas. Here, \({S}_{1}=2\pi {H}_{s}/(g{T}_{m}^{2})\) is a characteristic wave steepness and the Ursell number \({U}_{r}={H}_{s}/({k}_{m}^{2}{d}^{3})\), where k m is the wavenumber associated with the mean period T m  = m 0/m 1 and d is the water depth.

Space-Time Statistical Parameters

For space-time extremes, the coefficients in Eq. (3) are given by3, 69

$$\begin{array}{rcl}{M}_{3} & = & 2\pi \frac{D}{\overline{T}}\frac{{\ell }_{x}}{\overline{{L}_{x}}}\frac{{\ell }_{y}}{\overline{{L}_{y}}}{\alpha }_{xyt},\\ {M}_{2} & = & \sqrt{2\pi }(\frac{D}{\overline{T}}\frac{{\ell }_{x}}{\overline{{L}_{x}}}\sqrt{1-{\alpha }_{xt}^{2}}+\frac{D}{\overline{T}}\frac{{\ell }_{y}}{\overline{{L}_{y}}}\sqrt{1-{\alpha }_{yt}^{2}}+\frac{{\ell }_{x}}{\overline{{L}_{x}}}\frac{{\ell }_{y}}{\overline{{L}_{y}}}\sqrt{1-{\alpha }_{xy}^{2}}),\\ {M}_{1} & = & {N}_{D}+{N}_{x}+{N}_{y},\end{array}$$

where

$${N}_{D}=\frac{D}{\overline{T}},\quad {N}_{x}=\frac{{\ell }_{x}}{\overline{{L}_{x}}},\quad {N}_{y}=\frac{{\ell }_{y}}{\overline{{L}_{y}}}$$

are the average number of waves occurring during the time interval D and along the x and y sides of length \({\ell }_{x}\) and \({\ell }_{y}\) respectively. They all depend on the mean period \(\overline{T}\), mean wavelengths \(\overline{{L}_{x}}\) and \(\overline{{L}_{y}}\) in x and y directions:

$$\overline{T}=2\pi \sqrt{\frac{{m}_{000}}{{m}_{002}}},\quad \overline{{L}_{x}}=2\pi \sqrt{\frac{{m}_{000}}{{m}_{200}}},\quad \overline{{L}_{y}}=2\pi \sqrt{\frac{{m}_{000}}{{m}_{020}}}$$

and

$${\alpha }_{xyt}=\sqrt{1-{\alpha }_{xt}^{2}-{\alpha }_{yt}^{2}-{\alpha }_{xy}^{2}+2{\alpha }_{xt}{\alpha }_{yt}{\alpha }_{xy}}.$$

Here,

$${m}_{ijk}=\iint {k}_{x}^{i}{k}_{y}^{j}{f}^{k}S(f,\theta )dfd\theta $$

are the moments of the directional spectrum S(f, θ) and

$${\alpha }_{xt}=\frac{{m}_{101}}{\sqrt{{m}_{200}{m}_{002}}},\quad {\alpha }_{yt}=\frac{{m}_{011}}{\sqrt{{m}_{020}{m}_{002}}},\quad {\alpha }_{xy}=\frac{{m}_{110}}{\sqrt{{m}_{200}{m}_{020}}}.$$

The Higher Order Spectral (HOS) method

The HOS, developed independently by Dommermuth & Yue25 and West et al.26 is a numerical pseudo-spectral method, based on a perturbation expansion of the wave potential function up to a prescribed order of nonlinearities M in terms of a small parameter, the characteristic wave steepness. The method solves for nonlinear wave-wave interactions up to the specified order M of a number N of free waves (Fourier modes). The associated boundary value problem is solved by way of a pseudo-spectral technique, ensuring a computational cost which scales linearly with M 2 Nlog(N)70, 71. As a result, high computational efficiency is guaranteed for simulations over large spatial domains. In our study we used the West formulation26, which accounts for all the nonlinear terms at a given order of the perturbation expansion. The details of the specific algorithm are given in Fucile70 and Fedele et al.2. The wave field is resolved using 1024 × 1024 Fourier modes on a spatial area of 4000 m × 4000 m. Initial conditions for the wave potential and surface elevation are specified from the directional spectrum as an output of WAVEWATCH III72.

Data Availability

All the publicly available data and information about the El Faro accident are posted on the National Transportation Safety Board (NTSB) website1.