Introduction

Subject to non-linear processes, the climate system often shows quasi-non-deterministic behavior1. Therefore, detecting the influence of multidecadal signals in regional climate systems is challenging and essential to estimate their interaction with global climate change on multidecadal to multi-centennial time scales. This work investigates the influence of multidecadal sea surface temperature (SST) fluctuations in the North Atlantic Ocean (the so-called Atlantic Multidecadal Variability or AMV for short2) on Northern European temperature, focusing on the Baltic Sea.

During the 20th century, the North Atlantic changed between anomalous warm and cold SST on multidecadal time scales. The main driver of these fluctuations, which are summarized as the AMV, is still heavily debated, but its origin is likely a combination of internal variability and response to external forcing3,4,5. Both observational and modeling studies agree that AMV influences the climate in Europe6. The AMV has been linked to altered temperature and precipitation patterns across Europe6,7,8,9. It strongly impacts summer temperatures in western and central Europe3. During the negative AMV phase in the 1960s–1990s, summers were cooler across large parts of Europe, whereas the recent positive phase resulted in warmer springs, summers, and autumns in Northern Europe3,6,10.

During winter, the NAO is the most prominent mode of climate variability in the North Atlantic sector11. In winter, it is responsible for more than 80% of the sea level pressure (SLP) variability over the North Atlantic and Europe12. Recent studies argue for a dynamic coupling between the NAO and the AMV7,8,9,13,14,15. Strong zonal winds over the North Atlantic (likely associated with positive NAO anomalies) lead to anomalous heat loss in the Labrador Sea, enhancing deep-water formation and strengthening the Atlantic meridional overturning circulation (AMOC). Stronger AMOC-driven poleward heat transport helps to compensate for the initial cooling, leading to basin-wide warming on multidecadal time scales. The atmosphere then adjusts to the Atlantic warming. The years following the AMV maximum are characterized by negative NAO conditions, a weakening AMOC, and, consequently, colder SSTs over the Atlantic which in turn feed back onto AMV, North Atlantic temperatures, and the AMOC3,7,9,16,17. Hence, the AMOC and the AMV are influenced by the time-integrated effect of the NAO. At the same time, the resulting ocean response likely affects the NAO9.

Traditionally, the AMV signal and the associated multidecadal variability within the North Atlantic basin were analyzed by spatially averaging SST anomalies north of the equator (0°–60°N, 0°–80°W). Trenberth and Shea2 proposed to remove the global SST signal instead of linear detrending18, which allows isolating variability within the North Atlantic. While the estimation of the amplitude and phase of the AMV are sensitive to the detrending method19, the method proposed by Trenberth and Shea2 has been proven suitable for analyzing multidecadal variability. It may cause an underestimation of the variability in the North Atlantic, but it also removes the impact of global warming20. However, Wills et al.14 argued that the choice of the averaging region is problematic, as no physical mechanism has been proposed that is limited to the area between the equator and 60°N. To avoid any a priori assumption about the spatial and temporal structure, Wills et al.14 developed a method called low-frequency component analysis (LFCA), which allows for finding spatial anomaly patterns with the highest ratio of low frequency to the total variance. Using LFCA, they disentangled influences of high-frequency variability, multidecadal variability, and long-term global warming.

While the AMV’s impact on long-term land climate has been investigated6,10,21, its effects on coastal regions and large water bodies of marginal seas have not yet been studied in detail. The Baltic Sea has limited water exchange with the North Atlantic, and its variability is mainly controlled by atmospheric forcing. Its large water body (21,500 km3) filters high-frequency temperature fluctuations and is therefore well-suited to study long-term atmospheric teleconnections.

Börgel et al.22 found that the Baltic Sea, like many other regional climate systems, is influenced by the AMV. They estimated that multidecadal salinity changes of 1 g/kg—about 10% of the Baltic Sea’s mean salinity—could be linked to the AMV. Moreover, SST and sea ice cover were also found to be influenced by the AMV. Kniebusch et al.17 analyzed the impact of the AMV on the SST in the Baltic Sea and found that 58% of the decadal SST variability of the Baltic Sea could be linked to the AMV when the global warming signal is removed. It is estimated that the AMV’s impact on the SST of the Baltic Sea on annual time scales is relatively small, with a 5% explained variance, assuming that winter and annual SST responses to AMV variability are similar23.

Multiple studies found that a positive AMV is followed by a negative NAO state3,9,13,15,16,17, which is likely associated with colder and drier winters in Northern Europe24. However, there is no analysis focusing on the seasonality of the AMV and its impact on the temperature in the Baltic Sea region.

Therefore, this study aims to estimate the seasonal impact of the AMV on temperature variability in Northern Europe, specifically the Baltic Sea region. To assess the low-frequency variability, we use LFCA. This method avoids possible inaccuracies25 since it takes non-linearities into account. This allows us to analyze the dominant low-frequency signals within the Baltic Sea without prior assumptions of linearity, which do not necessarily apply to teleconnections8.

Results

The AMV

We use LFCA to extract the dominant modes of variability within the North Atlantic SST. We compute the two leading low-frequency components and low-frequency patterns (LFCs and LFPs) of the monthly SST anomalies within the Atlantic basin, retaining 30 empirical orthogonal functions (EOFs) explaining roughly 90% of the total variance within the North Atlantic (Fig. 1). The first LFP (Fig. 1a) shows a nearly uniform warming pattern and resembles the North Atlantic response to global warming captured by the LFC1 (Fig. 1c)26. The second LFP (Fig. 1b) indicates a warming signal north of 0° with a maximum warming pattern in the subpolar gyre, representing the averaging region of the established AMV definition by Trenberth and Shea2. The pattern looks like the regression pattern associated with Trenberth and Shea-based AMV index2. The corresponding second LFC (Fig. 1d) shows multidecadal temperature fluctuations with a positive period between the 1930s and 1960s, which transitions to a negative phase around 1960 and back into a positive phase in the 1990s. This second LFC shows a high correlation of 0.9 with the established AMV definition of Trenberth and Shea2 when applying a 10-y running mean. However, compared to the original AMV definition by Enfield et al.18, it has different amplitudes and phases, suggesting that linear detrending is not capturing the internal component of the North Atlantic variability.

Fig. 1: Low-frequency component analysis of the Atlantic sea surface temperature (SST) anomalies from ERSST.
figure 1

The first and second low-frequency patterns (a, b) (LFP) and low-frequency components (c, d) (LFC) retaining 30 EOFs and a 10-y low-pass cutoff are shown. LFC1 (global warming in the Atlantic) shows a 1-y running mean in black, whereas the gray line shows monthly fluctuations. LFC2 (Atlantic Multidecadal Variability, AMV) shows a 1-y running mean in black and monthly changes in gray. The AMV definitions of Enfield et al. (2001) and Trenberth and Shea (2006a) have been added in red and green, respectively.

In contrast to the Trenberth and Shea-based AMV index, LFCs are not low-pass filtered and can capture the interaction between high-frequency atmospheric variability and the inertia-driven response of the ocean14,26. Hence, in the following, the second LFC can be interpreted as the AMV signal without filtering out processes that operate on shorter time scales than decades.

The impact of the AMV on regional climate systems such as the Baltic Sea

The AMV influences the Baltic Sea22. Often, when analyzing the response of a regional climate system to climate variability, a linear response is assumed. However, besides the North Atlantic influence, the Baltic Sea is influenced by many other processes with various time scales.

If two assumptions hold, namely that (a) the AMV influences the Baltic Sea and (b) the LFCA proves to be a suitable method to discriminate between different physical forcing mechanisms, it should be possible to disentangle the AMV signal from the total SST variability in the Baltic Sea using LFCA. The second hypothesis (b) is supported by the fact that unlike a time series, e.g., discriminating between frequency bands only, the LFCA method takes spatial information into account and can disentangle processes leaving different spatial footprints. The idea is to distill the AMV-related response within the Baltic Sea, likely influenced by non-linear processes, and therefore get more meaningful results compared to a simple (linear) correlation with the AMV signal.

The Baltic Sea’s complex dynamics and the different spatial resolutions of the data products influence the number of EOFs retained. We increased the number of EOFs (N) kept to 200, capturing over 95% of the monthly SST variance. This way, we include variability on smaller spatial scales with significant low-frequency power. This improves isolating patterns of multidecadal variability and filtering out variability that acts on shorter time scales26. We use monthly SST anomalies from the regional ocean model RCO simulating the Baltic Sea for our analysis27. The results are robust to changes between N = 100 and N = 250.

The two dominant LFCs/LFPs that maximize the ratio of low frequency to the total variance in the Baltic Sea are a basin-wide warming response associated with global warming (LFC/LFP1 Fig. 2a, c) and multidecadal temperature fluctuations (LFC/LFP2), shown in Fig. 2. The multidecadal fluctuations of the second LFC (Fig. 2d) project onto a dipole LFP pattern (Fig. 2b), with a warming response in the northeastern part of the Baltic Sea and a cooling reaction within the rest of the Baltic Sea. On an annual timescale, the second LFC of the Baltic Sea SST shows a correlation of 0.72 with the second LFC within the North Atlantic. After applying a 10-y running mean to both signals, we find a correlation of 0.93 between LFC2 in the North Atlantic and the second LFC of the Baltic Sea, indicating the close relationship between both signals. Further, all but the second Baltic LFC show only a small correlation (<0.1) with the North Atlantic LFC2. Even when we consider a possible relationship between the North Atlantic LFC2 and the Baltic LFC1 due to the high amplitude of Baltic LFC1, the results prove robust. This holds independent of the number of LFCs considered.

Fig. 2: Low-frequency component analysis of the Baltic Sea sea surface temperature anomalies, extracted from the regional climate model RCO.
figure 2

The first and second low-frequency patterns (a, b) (LFP) and low-frequency components (c, d) (LFC) using 200 EOFs retained and a 10-y low-pass cutoff are shown. LFC1 (global warming signal in the Baltic Sea) shows a 1-y running mean in black, whereas the gray line shows monthly fluctuations. LFC2 (Baltic Multidecadal Variability, BMV) shows a 1-y running mean in black and monthly changes in gray. The LFC2 (Atlantic Multidecadal Variability, AMV) computed for the Atlantic Basin (Fig. 1) is shown in red for comparison.

Still, we decided to use a linear combination of the first two Baltic LFCs and weigh them according to the previously computed correlation coefficients, accounting for the small contribution of the North Atlantic LFC2 to Baltic LFC1. Including the Baltic LFC1 increases the correlation between the North Atlantic LFC2 and the combination of the first two Baltic LFCs on decadal time scales to 0.98. In the following, the resulting signal is defined as Baltic Multidecadal Variability (BMV), and the LFC2 of the Atlantic as AMV.

While the AMV and the BMV are closely coupled on yearly and decadal time scales, on monthly time scales, the correlation drops to 0.55 (not shown). The seasonal cycle was removed from the data. This indicates a close coupling between the AMV and the BMV in the low-frequency band but a de-coupling of the related intra-annual variability.

Regressing monthly SST anomalies onto the BMV (Fig. 3a), the AMV (NA LFC2) (Fig. 3b), and the traditional Trenberth and Shea-based AMV index (Fig. 3c) results in similar spatial regression patterns. All regression patterns show a uniform warming pattern in the Baltic Sea. However, the AMV and the Trenberth and Shea-based AMV index show a relatively weak amplitude of about 0.2 °C. In contrast, the BMV relates to a strong positive response of up to 0.6 °C, tripling the estimated response by the AMV. Compared to the linear regression based on the Trenberth and Shea-based AMV index, we find that the LFCA-based indices increase the statistically significant area.

Fig. 3: Baltic Sea sea surface temperature regression patterns using different AMV indices.
figure 3

Linear regression of Baltic Sea sea surface temperature anomalies onto the BMV (a) (LFC2 Fig. 2), the AMV (b) (LFC2 Fig. 1), and AMV Trenberth and Shea2 (c). Only statistically significant regions (p < 0.05) are colored.

The LFCA results in a higher low-frequency to total variance ratio (r = 0.78) for the AMV than the BMV, where only about 40% of the signal consists of low-frequency variability (not shown). Accordingly, the AMV signal shows a strong autocorrelation, indicating a higher persistence than the BMV. By contrast, the autocorrelation of the BMV decreases significantly within the first year (not shown). The difference in persistence in the Baltic Sea and the different amplitudes in Fig. 3 suggests that the BMV-related response has a strong seasonality that is not captured by the linear regression of the Baltic Sea SST onto the AMV.

To better understand this seasonal influence, the seasonal SST response associated with the BMV variability is analyzed (Fig. 4). In some regions, up to 40% of the SST variability in winter (Fig. 4a) can be attributed to the BMV, corresponding to temperature fluctuations of 1.2 °C per BMV standard deviation. The most substantial influence is found in the eastern part of the Gulf of Finland, parts of the central Baltic Sea, and the northern Bothnian Sea. These regions share interannual changes in sea ice coverage. During spring (Fig. 4b), there is no significant response. During summer (Fig. 4c), only small regions in the northern part explain more than 5% of the SST variability that can be linked to the BMV for this season. Lastly, the response in autumn (Fig. 4d) shows warming in the northern part of the Baltic Sea and the Gulf of Finland with 10–20% explained variance. The annual mean pattern (Fig. 3) resembles the winter and autumn patterns most strongly, with some summer contributions. We find similar results for the regression patterns of the AMV index (Supplementary Figure 1) and the traditional AMV index by Trenberth and Shea (Supplementary Fig. 2). However, the statistically significant areas are smaller, and there is little to no explained variance.

Fig. 4: Seasonal sea surface temperature response associated with BMV variability.
figure 4

Linear regression of seasonal sea surface temperature anomalies onto the seasonal component of the BMV (winter (a), spring (b), summer (c), and autumn (d)). The hatching indicates the explained variance. Dots (.) correspond to >5%, side hatching (/) to >10%, stars (*) to >20%, and backward side hatching (\) to >30%, explained variance. Only statistically significant changes are colored (p < 0.05; Student’s t-test).

To analyze the relationship between the AMV, the BMV, and the Baltic Sea during winter, we regress SLP, surface winds (u10, v10), surface air temperature (SAT), and SST onto the AMV and the BMV indices. We consider lags −20 to −5 years, −5 to 5 years, and 5 to 20 years. The regression analysis applies to positive and negative states, but for clarity, the following description is given for a positive event. We find that the regression patterns of the AMV and the BMV are nearly identical (not shown). Therefore, we focus on the response to the AMV (Fig. 5).

Fig. 5: Regression patterns of ERA20-C associated with AMV variability.
figure 5

Regression of winter sea level pressure (a), surface winds (b), surface air temperature (c), and sea surface temperature onto the AMV (d) around an AMV maximum. The left column panel shows lags of −20 to −5 years, the middle column panel shows lags of −5 to 5 years, and the right column panel shows a lag of 5–20 years. The dotted area indicates statistical significance (p < 0.05).

For lags −20 to −5 years, we find a negative SLP anomaly over Greenland and Iceland and a positive SLP anomaly south of 50°N in the Atlantic, spanning across Europe (Fig. 5a). The wind regression pattern shows increased westerlies in the subpolar region (40°–60°N, 20°–60°W, Fig. 5b) and increased south-westerlies near Greenland–Iceland–Norwegian (GIN) Seas and the Iceland Basin. While there is an ongoing debate about which high-latitude deep-water formation regions control the AMOC and its variability, the increase in westerlies over the Labrador Sea is associated with enhanced upward turbulent heat fluxes resulting in a lagged strengthening of the AMOC7,9,16, which enhances poleward heat transport16. More recently, the GIN Seas have also been identified as important regions for deep-water formation28. Hence, it is likely that the enhanced upward turbulent heat fluxes in the GIN seas also contribute to the stronger deep-water formation, strengthening the AMOC. The SAT response shows a near basin-wide warming also reaching the Baltic Sea region (Fig. 5c). In contrast, the SST shows a negative response. At lag −5 to 5 years, the atmosphere shows a weakly negative NAO response in agreement with reduced westerlies and cooling over central Europe. However, the northern part of Europe still shows slightly positive SAT and SST, likely related to the large oceanic inertia and enhanced northward oceanic heat transport. After an AMV maximum at lags 5 to 20 years, stronger negative NAO conditions and a decrease in westerlies likely result in a strong cooling of air temperatures over Europe and the Baltic Sea. In contrast, the larger inertia of the North Atlantic still shows higher SST in the North Atlantic and parts of the Baltic Sea, despite a cooling effect of the atmosphere. A zoom on the Baltic Sea can be found in the supplementary (Supplementary Fig. 3).

We repeated the same analysis for autumn (Supplementary Fig. 4). During autumn, the atmospheric response is noisier but yields a uniform warming response at lags −20 to 20 around an AMV maximum that also influences the Baltic Sea region.

Discussion

Our study focuses on the emergence of multidecadal fluctuations in regional climate systems. With an LFCA, we have been able to detect multidecadal SST fluctuations in the Baltic Sea (BMV) that bear a strong similarity with the AMV, showing a correlation higher than 0.95 on decadal time scales. We identify the BMV as an important contributor to Baltic Sea SST variability during winter. The impact of the BMV on the Baltic Sea SST variability differs drastically from season to season. While we find a substantial impact of up to 40% explained variance locally during winter, it remains mostly below 5% during spring and summer.

In contrast to other parts of Europe where the AMV also has a substantial impact during summer, for the Baltic Sea, we identify the winter season to be under the strongest influence of the AMV by far. The winter influence is likely caused by the NAO’s out-of-phase relationship with the AMV: The impact of the time-integrated NAO forcing on the AMV and AMOC links to basin-wide warming in the North Atlantic (Fig. 5d), which maintains warmer SST despite already weak negative NAO conditions at lag −5 to 5 years. Some studies have already analyzed the AMV’s impact on Northern Europe22,23. Ruprich-Robert et al.8 found a possible Northern European warming response to the AMV during winter, which they did not discuss in detail. Their study used two different GCMs (GFDL CM2.1 and NCAR CESM1) and restored North Atlantic SST to observe anomalies associated with AMV variability. In addition, the oscillatory relationship between the time-integrated NAO and the AMOC is discussed by Omrani et al.9 and supports the strong temperature impact found during winter. Omrani et al.9 argue that during and just after an AMV+ event, when sea ice melts in the Arctic peaks, Eurasia is warmer due to poleward oceanic heat transport. They found that the warming response also includes the Baltic Sea area. Using the regional atmospheric forcing, the associated regional SAT response to AMV/BMV variability also supports this warming response (Supplementary Figs. 5 and 6). We find the strongest impact in the Baltic Sea in regions where sea ice is formed. Further analysis linked decreasing sea ice to positive BMV phases (not shown) and reducing the ice-snow albedo in regions where sea ice is formed (Supplementary Fig. 7). In fact, reduced precipitation during winter over Northern Europe was related to positive AMV+ phases7 leading to less snowfall, also supporting the reduction of the ice-snow albedo over the Baltic Sea. Hence, there may be a link between the BMV signal and the yearly sea ice cover in the Baltic Sea, supporting earlier hints of changing sea ice cover22.

Summarizing, our results suggest that the ocean is the main source of inertia that maintains the anomalous warm SST in the North Atlantic through increased northward oceanic heat transport, influencing the North Atlantic and the Baltic Sea. This warming is likely amplified by a smaller ice/snow albedo, causing even stronger warming due to decreasing sea ice cover in the Baltic Sea.

In our study, we discriminate between the AMV and the BMV. The difference in seasonal dynamics in the Baltic Sea becomes apparent when regressing the AMV signal onto the Baltic Sea SST (Fig. 3). The AMV projects a weak warming response for the Baltic Sea. In contrast, the BMV that emerges from the LFCA analysis of the Baltic Sea SST signal shows a three times stronger response. This is likely related to seasonal dynamics within the Baltic Sea that cannot be captured by regressing the SST onto the AMV signal. The strong seasonality of the Baltic Sea also influences the low frequency to total variance ratio (around 40%) of the BMV, which is relatively low compared to the AMV (78%). Analyzing the seasonal SST response associated with the AMV (Supplementary Figure 1) and the traditional AMV definition by Trenberth and Shea (Supplementary Fig. 2) shows that all indices show the strongest impact during winter but have little to no explained variance.

We find that using LFCA allows for analyzing the seasonal response of a regional climate system associated with AMV variability. In that way, we have been able to show that the winter season in the Baltic Sea and the SAT in Northern Europe are likely under the strong influence of the AMV, which has not been discussed in detail. Further, we find that the seasonal response within the Baltic Sea is strongly heterogonous and non-linear, as the spring season is not influenced at all. We thus propose to consider multi-decadal variations of Baltic Sea SST linked to but also partly autonomous from the AMV. While we focus on the effects of the AMV on the Baltic Sea, many other regional climate systems could be analyzed using LFCA without exclusively focusing on the AMV.

In summary, multidecadal fluctuations in the Baltic Sea, which are very similar to the AMV and for which we introduce the term BMV, have the greatest influence on SST in winter. The BMV turns out to be better suited for analyzing the teleconnection with the North Atlantic and the Baltic Sea than the original AMV signal.

Methods

Regional ocean model

This work considers 1900–2008 using a historical climate simulation27. The model used for this study is the Rossby Centre Ocean model (RCO) coupled with the Swedish Coastal and Ocean Biogeochemical Model (RCO-SCOBI)29,30. RCO is a Bryan-Cox-Semtner primitive equation ocean model with a free surface. An open boundary in the northern Kattegat limits the model domain. RCO uses a two-equation closure k-epsilon turbulence model. The ocean model is coupled with a Hibler-type multi-category sea ice model. The bathymetry is based on topography data by Seifert and Kayser31. The horizontal resolution of the model is 3.7 km. Further, RCO uses 83 vertical levels with a layer thickness of 3 m resulting in a maximum depth of 250 m. RCO is forced by High-Resolution atmospheric Forcing Fields (HiResAFF)32. A more detailed description of the atmospheric fields is given in27. A detailed assessment of this simulation can be found in27. Furthermore, Kniebusch et al.23 focused on the SST signal in RCO. They discovered that RCO slightly underestimates the SST but is still within the standard deviation of the observations.

Extended reconstructed SST (ERSST)

For the North Atlantic, we use observed SST from 1900 to 2008. The ERSST dataset has a 2° × 2° horizontal grid resolution and covers the period 1854 to the present (Huang et al., 2017). It uses COADS 3.0, which combines SST from Argo floats (above 5 m), Hadley Centre Ice-SST version 2 (HadISST2) ice concentration (1854–2015), and NCEP ice concentration (2016–present). Like14, we decided to use the period 1900 to 2008 as the data gets more reliable after the 19th century.

ERA-20C

For the North Atlantic domain, we use atmospheric fields from ERA-20C, an atmospheric reanalysis from the 20th century covering the period 1900–201033. ERA-20C was created as part of the ERA-CLIM project (see http://era-clim.eu) and is based on ECMWF’s Integrated Forecast System.

It assimilates surface pressure and surface winds only and has a horizontal resolution of roughly 125 km (spectral truncation T159) and 91 vertical levels between the surface and 0.01 hPa. For the analysis, SLP, SAT, SST, and zonal winds at 700 hPa have been used.

Low-frequency component analysis

LFCA26 allows for identifying modes of low-frequency variability. LFCA solves for linear combinations of EOFs that maximize the ratio of low frequency to the total variance. In other words, LFCA maximizes the fraction of a signal defined as low frequency. For a detailed description, the reader is referred to the original work26. However, in the following, a short introduction to LFCA is given based on the description in Wills et al.26.

LFCA starts with a conventional principal component analysis (PCA) that identifies spatial patterns explaining large parts of the variance in the data. For the study, the data is converted to an \({\rm{n}}\times p\) spatio-temporal matrix \({\bf{X}}\) with zero time mean.

The EOF can be computed by solving for the eigenvectors \({{\boldsymbol{a}}}_{{\boldsymbol{k}}}\) of the sample covariance matrix

$${\bf{C}}=\frac{1}{(n-1)}{{\boldsymbol{X}}}^{{\boldsymbol{T}}}{\boldsymbol{X}},$$
(1)

such that \({\boldsymbol{C}}{{\boldsymbol{a}}}_{{\boldsymbol{k}}}={\sigma }_{k}^{2}{{\boldsymbol{a}}}_{{\boldsymbol{k}}}\).

The EOFs are normalized (\({||}{{\boldsymbol{a}}}_{{\boldsymbol{k}}}{||}=1),\) so that the eigenvalue \({\sigma }_{k}^{2}\) gives the variance associated with the kth EOF. The kth principal component (PC) is then defined by projecting the kth EOF onto the data matrix \({\bf{X}}\)

$${\rm{P}}{{\rm{C}}}_{{\rm{k}}}={\sigma }_{k}^{-1}{\bf{X}}{{\bf{a}}}_{{\bf{k}}}.$$
(2)

For the LFCA, a linear Lanczos lowpass filter (L) with a cutoff frequency \({T}^{-1}\) is used34, which allows obtaining a lowpass filtered data matrix \(\widetilde{{\bf{X}}}\,{\boldsymbol{=}}\,{\rm{L}}\left({\rm{T}}\right){\bf{X}}\). In our analysis, we define T to be 10 years.

The projection of \(\widetilde{{\bf{X}}}\) onto the \({\rm{kth}}\) EOF results in a lowpass filtered \(k{\rm{th}}\) principal component

$$\overline{{\bf{PC}}}={\sigma }_{k}^{-1}\widetilde{{\bf{X}}}{{\bf{a}}}_{{\bf{k}}}.$$
(3)

Then, LFCA searches for linear combinations uk of the first N EOFs a1, …, aN,

$${{\bf{u}}}_{{\bf{k}}}=\left[\frac{{{\bf{a}}}_{{\bf{1}}}}{{{\rm{\sigma }}}_{1}}\frac{{{\bf{a}}}_{{\bf{2}}}}{{{\rm{\sigma }}}_{2}}\ldots \frac{{{\bf{a}}}_{{\bf{N}}}}{{{\rm{\sigma }}}_{{\rm{N}}}}\right]{{\bf{e}}}_{{\bf{k}}}$$
(4)

where the square brackets indicate a \({\rm{p}}\times {\rm{N}}\) matrix. The coefficients ekk shall be chosen in such a way that the ratio \({r}_{k}\) of low-frequency to total variance is maximized. The ratio is defined as

$${r}_{k}=\frac{{\left(\widetilde{{\bf{X}}}{{\bf{u}}}_{{\bf{k}}}\right)}^{{\rm{T}}}\widetilde{{\bf{X}}}{{\bf{u}}}_{{\bf{k}}}}{{\left({\boldsymbol{X}}{{\boldsymbol{u}}}_{{\boldsymbol{k}}}\right)}^{T}{\boldsymbol{X}}{{\boldsymbol{u}}}_{{\boldsymbol{k}}}}=\frac{{{\boldsymbol{u}}}_{{\boldsymbol{k}}}^{T}{L}^{T}{\boldsymbol{C}}L{{\boldsymbol{u}}}_{{\boldsymbol{k}}}}{{{\boldsymbol{u}}}_{{\boldsymbol{k}}}^{T}{\boldsymbol{C}}{{\boldsymbol{u}}}_{{\boldsymbol{k}}}}.$$
(5)

The coefficient vectors \({e}_{k}\) are defined as normalized such that \(\left|\left|{{\boldsymbol{e}}}_{{\boldsymbol{k}}}\right|\right|=1\). The normalization factors \({\sigma }_{k}^{-1}\) then give the linear combinations \({u}_{k}\) unit variance like the ak have.

Using the equations above and the definition of a PC, it can be shown that the coefficient vectors \({{\boldsymbol{e}}}_{{\boldsymbol{k}}}\) are eigenvectors of the covariance matrix

$${{\boldsymbol{R}}}_{{\boldsymbol{ij}}}={cov}\left(\widetilde{{\rm{P}}{{\rm{C}}}_{{\rm{i}}}},\widetilde{{\rm{P}}{{\rm{C}}}_{{\rm{j}}}}\right)\text{for}\,i,j\epsilon \left[1,N\right].$$
(6)

\({\boldsymbol{R}}\) has \(N\) eigenvectors \({{\boldsymbol{Re}}}_{{\boldsymbol{k}}}={r}_{k}{{\boldsymbol{e}}}_{{\boldsymbol{k}}}\), and they are sorted by the magnitude of the eigenvalues rk.

The kth so-called LFC can be computed by projecting the unfiltered data onto the linear combination vectors \({u}_{k}\).

$${LF}{C}_{k}={\boldsymbol{X}}{{\boldsymbol{u}}}_{{\boldsymbol{k}}}$$
(7)

the associated so-called LFP can be computed by the regression of the unfiltered data onto the \(k{\rm{th}}\) LFC,

$${{\boldsymbol{LFP}}}_{k}={{\boldsymbol{X}}}^{{\boldsymbol{T}}}{LF}{C}_{k}$$
(8)