Abstract
Identifying regions important for spreading and mediating perturbations is crucial to assess the susceptibilities of spatiotemporal complex systems such as the Earth’s climate to volcanic eruptions, extreme events or geoengineering. Here a datadriven approach is introduced based on a dimension reduction, causal reconstruction, and novel network measures based on causal effect theory that go beyond standard complex network tools by distinguishing direct from indirect pathways. Applied to a data set of atmospheric dynamics, the method identifies several strongly uplifting regions acting as major gateways of perturbations spreading in the atmosphere. Additionally, the method provides a stricter statistical approach to pathways of atmospheric teleconnections, yielding insights into the Pacific–Indian Ocean interaction relevant for monsoonal dynamics. Also for neuroscience or power grids, the novel causal interaction perspective provides a complementary approach to simulations or experiments for understanding the functioning of complex spatiotemporal systems with potential applications in increasing their resilience to shocks or extreme events.
Similar content being viewed by others
Introduction
A complex system’s susceptibility to perturbations may crucially depend on where such a perturbation enters the system and how it is propagated. In the climate system, perturbations such as volcanic eruptions, extreme events^{1,2} or anthropogenic manipulations such as air pollution and geoengineering^{3,4} may have very different global effects if the region they occur in is strongly connected globally. The huge volcanic eruption of Mt. Pinatubo in June 1991 had a large impact on global climate^{5} also because it is located in a climatologically sensitive region tied to atmospheric teleconnections, the tropical western Pacific^{6}. Similarly, epileptic seizures in the brain^{7}, blackouts in power grids^{8,9}, epidemic spreading^{10,11} or the failure of certain banks in the financial system^{12,13} are key examples where subprocesses have a high cumulative effect on the whole complex system when perturbed, making them gateways of external influences spreading in the system. How can such subprocesses or regions be identified? Through which subprocesses are perturbations mainly mediated? These questions are key for understanding the dynamics and functioning of these systems, predicting their behaviour under perturbations and could help to make them more resilient. One way to address this problem is via active experiments, for example invasive brain stimulations in neuroscience (bearing ethical concerns) or by computer simulations, for example, in epidemic spreading models^{11} or tracer experiments in climate^{14}. Such simulations are, however, only possible if the underlying physical equations are known and even then the corresponding calculations, for example in climate research, are computationally expensive and may not adequately represent important processes^{15}. Here we follow the complementary approach of using the data alone to retrieve information about the interaction dynamics of the complex system (exploiting passive or natural experiments). Datadriven analysis within the framework of complex networks^{16} is a very active field of current research and has, among others, been applied to study the structure and function of complex systems in neuroscience^{17,18,19,20} and more recently also in climate research^{21,22,23,24,25,26}.
To identify how important individual subprocesses are in spreading and mediating perturbations in such spatiotemporal complex systems with time series typically given on a spatial grid, the subprocesses or nodes first need to be reconstructed since the gridded time series are often not the variables of interest. Secondly, the analysis should be based on a network that more faithfully than pairwise statistical associations represents possible pathways of perturbation propagation, requiring a causal definition of network links able to distinguish direct from indirect interactions. Last, even if all links in a network were established to be ‘statistically causal’, the toolbox of classical network measures is not rich enough for quantifying gateways and mediators of perturbations. Essentially, these measures—with many originating from the social sciences^{27}—are based on a different definition of links, for example, two persons knowing each other, as opposed to dynamical interactions in a complex system. Hence, what is needed are quantitative measures that take into account the relative importance of causal pathways on which perturbations propagate in a complex system’s interaction network.
Here, we present such an approach based on three steps: First, a dimension reduction of a gridded data set using the Varimax approach^{28,29} to a set of components representing relevant subprocesses defining the network’s nodes. Secondly, a (multivariate) causal reconstruction of the network’s links based on a causal discovery algorithm^{30,31,32} and, thirdly, a causal interaction quantification utilizing Pearl’s causal effect theory^{33,34,35} to construct a causally weighted directed network on which we define network measures that are better suited for quantifying key regions of causal perturbation spread and mediation compared to classical network measures such as the node degree and betweenness centrality^{36}. The extent to which such a datadriven analysis allows for a causal interpretation depends on the included variables, time resolution of the data and assumptions such as stationarity. We demonstrate the potentials of our method on a global data set of surface pressure as a representative characteristic of atmospheric variability. Applied to test specific hypotheses, we find that within this pressure system the climatic interaction mechanism between the East Pacific limb of the El Niño Southern Oscillation (ENSO)^{37,38}, and the Arabian Sea region, relevant for the Indian Monsoon system, is mainly mediated via the Indonesian archipelago. This application of our method also incorporates more rigorously the concept of atmospheric teleconnections, which were previously defined based on pairwise correlations^{39,40}. As an exploratory tool, the method identifies several strongly uplifting regions of major convergence of lowlevel air masses and highlevel air uplifts above the tropical oceans. These subprocesses integrate incoming perturbations at the surface and transport them vertically into the higher troposphere from which they again influence other surface processes via atmospheric downdrafts. This mechanism explains the key importance of these regions as gateways of perturbation spread along causal pathways in the atmosphere. Our approach is of substantial value for several applications. In climate research it may allow to more efficiently allocate resources to understand, monitor and forecast these important subprocesses. For other applications, like epileptic seizure prevention, it may help to more reliably identify which brain regions are seizure foci to concentrate counter measures on. In summary, the novel causal interaction perspective provides a complementary firststep approach towards model simulations and experiments to better understand the dynamics and functioning of complex spatiotemporal systems and may help to inform design and engineering processes aiming at increasing their resilience.
Results
Dimension reduction and causal reconstruction
In the following, we explain and illustrate each of the three steps (see Fig. 1) with a climate example. More technical details are given in Methods.
In climate research, spatiotemporal data sets are typically given on a regular grid. Here we consider a reanalysis data set of surface pressures^{41} for the period 1948–2012. At a resolution of 2.5° in latitude and longitude, the data set consists of 10,512 grid points with 3,339 samples for each time series on a weekly timescale. But towards an interpretation of perturbation propagation or information transfer^{42}, such individual grid points are not the entity of interest, because they do not represent distinct climatological processes. For example, processes like ENSO require a special decomposition of the data fields for an efficient description of their spatiotemporal structure^{43}. Also from a statistical perspective, a large number of variables with comparatively few observations presents an estimation problem^{44}. The first step of our approach, therefore, is aimed at obtaining a small set of components that represent relevant subprocesses of the complex system. We choose Varimaxrotated principal components^{28,45} here, combined with a subsequent significance test^{29,46} to exclude components merely representing noise. For the atmospheric pressure data set, this dimension reduction algorithm yields a set of 60 components (cf. Methods and ref. 29). As shown for selected components in Fig. 2 (all components shown in Supplementary Figs 1, 2), the corresponding regions well represent several important climatological subprocesses. As further discussed in Methods, all components are anomalized (seasonal cycle removed from the mean and variance) and standardized. Here we study intraseasonal interactions at a weekly time resolution.
Given that an external perturbation occurs in one of these components representing a certain subprocess: On which paths can it propagate and which other subprocesses can it possibly reach? Suppose a perturbation enters in subprocess X in the example in Fig. 1c, then it can only reach nodes further ‘downstream’ on causal paths like W_{1}, W_{2} and Y, but not Z_{1}, even though they are statistically associated. Such spurious links can be unveiled by causal discovery algorithms^{30,47} which iteratively test whether an association can be explained by another process in the network. Note that this notion of causality is only to be interpreted with respect to the included variables and unobserved drivers can still cause spurious links. The causal reconstruction steps are detailed in Methods (see also Supplementary Fig. 3). In essence, for the second step of our approach we employ a causal discovery algorithm adapted to time series^{31,32} and a subsequent thresholding step to study the robustness of all further analyses at different link densities. This approach yields the causal time series graph^{31,32,48} which is a special type of a graphical model^{49} and encodes the conditional dependencies of the components at different time lags. For the example of the ENSO—Indian Ocean teleconnection studied next, Fig. 3 depicts two different representations: Fig. 3b shows the time series graph on which causal paths and the measures of causal effect (the third step of our approach) are based, while the aggregated causal network shown in Fig. 3a can be better visualized.
In our climate application, we consider time lags up to τ_{max}=4 weeks, since we are interested in atmospheric interactions where dependencies typically decay within a month^{50}, but our results are robust also for longer time lags. Contemporaneous associations (possibly because of unobserved common drivers or faster interactions) can be represented as undirected links in the time series graph^{31,32}, but these are not taken into account here since they are not regarded as causal. In the following, we discuss results for a link density of 20% in the causal network consisting of N=60 components, but our main results are also robust for link densities between 10 and 50% and other parameters of the method.
Quantifying causal effect
The causal time series graph allows to qualitatively determine which causal paths a perturbation can possibly take. Now we employ measures to quantify the causal effect of hypothetical perturbations and their mediation along causal paths, exemplified on the teleconnection mechanism by which component No. 1 in the East Pacific ENSO region influences component No. 33 describing surface pressures in the Arabian Sea with relevance also for the Indian Monsoon rainfall^{37,38,43} (see component regions in Fig. 2).
We approach this problem by using measures of causal effect in the framework of structural equation modelling^{33,35}. With the reconstructed time series graph as a causal hypothesis, we fit a linear regression model to the multivariate component time series X with nonzero coefficients for every link in the time series graph. The standardized regression coefficient for a direct causal link between two components at lag τ (in weeks) is then called the path coefficient^{33,51}. This makes the time series graph a causally weighted directed network. The matrices of path coefficients between all components are shown in Supplementary Fig. 4.
Rather than studying only causal effects between adjacent nodes in the causal time series graph, here we are interested in the total causal effect (CE) also along indirect causal paths. Under certain assumptions (see Methods), the CE between two components i and j at lag τ, denoted , can be evaluated by summing over the products of the path coefficients along each causal path^{35} and carries the causal interpretation as the expected change in X^{j} (in units of its s.d. and relative to the unperturbed regime) at time t if X^{i} was perturbed at time t–τ by a one s.d. delta peak. The matrices of CEs between all components are shown in Supplementary Fig. 5. Similarly, the mediated causal effect (MCE) via another component k can be measured by summing only over those paths that pass through component k (at any lag). These measures are now illustrated for the teleconnection mechanism between components Nos. 1 and 33.
In the atmospheric pressure system we find 31 indirect causal paths between the Eastern ENSO component No. 1 and the Arabian Sea component No. 33 at a lag of 3 weeks (only a selection via components No. 0 above the Indonesian archipelago and No. 53 over East Asia shown in Fig. 3a). The total CE sums up to −0.08±0.01 here, implying that a perturbation of 1 s.d. in the East Pacific yields a decrease in No. 33 of ∼8% in units of its s.d. (note that component No. 1 has deviations of several s.d. from the seasonal mean). Further, we find that component No. 0 above the Indonesian archipelago mediates −0.053±0.006 of that effect, here resulting from the causal chain No. 1 No. 0 No. 33. This MCE explains more than 60% of the total CE while other paths, for example via East Asia (No. 53), contribute less than 10%. For comparison, counting just the fraction of causal paths passing through a given node, in analogy to betweenness centrality^{36}, here yields 1 from a total of 31 paths for No. 0 and, for example, 8 for component No. 35, even though the latter’s mediated effect is much weaker. We estimated the CE for a link density of 20% here, in Supplementary Table 1 we show that our results are largely robust to this choice. On the other hand, we find that the interaction was much weaker in the first half of the data set (1948–1980).
Climatologically, our present analysis implies that of the many possible climatic mechanisms linking sealevel pressure anomalies in the ENSO region to pressure variability west of India, only the mechanism via No. 0 is relevant, at least within the intraseasonal timescale of the atmospheric surface pressure system and integrated over all seasons. Note that conclusions about an effect of ENSO on the Indian Monsoon are also complicated by the apparent nonstationarity of this relationship^{52}. More detailed analyses taking into account additional climatological variables such as temperature, only certain seasons (for example, during El Niño phases), and filtering out nonrelevant time scales (such as from oceanic drivers) can provide more accurate estimates of CEs for more specific climatological hypotheses.
Causal gateways and mediators
The foregoing case study introduced the CE measures and is an example of causal modelling for testing specific hypotheses about interaction mechanisms. Now we study aggregated node measures based on the causally weighted directed network in a more exploratory analysis to identify the importance of components as gateways for spreading and mediating hypothetical perturbations in the network.
As aggregated firstorder measures of CE, we consider the matrix of CEs between all pairs of components (Supplementary Fig. 5) taken at the lag with maximum absolute effect . Then we define the mean along each column as the average causal effect (ACE) that a component has on the rest of the system and the rowmean as the average causal susceptibility (ACS) as a measure of how sensitive a component is to perturbations entering in other parts of the system. To measure how strong a subprocess mediates CEs spreading throughout the complex system, we propose the average mediated causal effect (AMCE) of a component k by averaging the previously defined MCE over all interaction pairs with causal paths through k (more details in Fig. 4 and Methods). As opposed to a pathbased network measure like betweenness centrality^{36}, AMCE depends not so much on the number of paths through a given component, but more on how strong the CE along these paths is.
In Fig. 4 we depict ACE, ACS and AMCE for the atmospheric pressure system. Although the distribution of susceptibilities is quite broad (Fig. 4c), few components have a very strong ACE (red nodes Nos. 0, 1, 2, 18) and are also rather susceptible. These components reach a large fraction of processes (node size in Fig. 4a,c) and correspond to processes in the tropical oceans (No. 0 over Indonesia and the East Indian Ocean, No. 2 in the Atlantic, Nos. 1 and 18 in the East and West Pacific). A one s.d. perturbation entering these processes has a large effect of up to 0.3 on other processes and each of them drives more than 10 other processes with a CE of at least 0.1 (Supplementary Fig. 6). These components, thus, act as major gateways of perturbation spread and also belong to the most susceptible processes being causally driven by ∼20–30% of the other components with an ACS above 0.05.
Figures 4b,d demonstrate that there is not much correlation between the fraction of interactions with a path through a certain component (node size) and its AMCE (R^{2}=0.36). Components Nos. 0, 1, 2, 18 (but also Nos. 26 and 48) are the most dominant causal mediators being involved in more than 80% of all interactions with an AMCE between 0.0015 and 0.002. Note that the average nonzero CE between any pair is only about 0.02. In Supplementary Figs. 7, 8 we show that these results are robust for different link densities and other parameters of the method, in particular the maximum lag τ_{max} and the significance level in the algorithm. The results are also robust if only the first (1948–1980) or second half (1981–2012) of the data set is used.
Climatologically, components Nos. 0, 1, 2, 18 correspond to major convergence regions with ascending motion of air masses (cf. Fig. 4e)^{43,53}. In particular, component No. 0 (and to a smaller degree No. 18) represents the uplifting western limb of the Walker circulation over Indonesia. This region is one of the strongest atmospheric convergence zones where moist air masses rise up and affect global tropical and extratropical climate via teleconnections in the upper troposphere. Component No. 2, located in the tropical Atlantic, also features strong uplifting deep convection^{43,53}. The core ENSO region represented by component No. 1 plays a double role depending on the state of the ENSO system^{37,38}. During normal ENSO conditions (depicted in Fig. 4e) it is a region of descending upper tropospheric air masses and, thus, not as much governed by surface pressures. During El Niño events, on the other hand, it is a region of strong uplifts. These effects are mixed in our analysis and more detailed studies can further distinguish seasons to obtain a more precise picture of seasonal climatic interactions. In summary, these strongly uplifting regions integrate incoming perturbations at the surface and transport them vertically into the higher troposphere from which they again influence other surface processes via atmospheric downdrafts. This mechanism explains the key importance of these regions as causal gateways and mediators of perturbations spreading in global climate via atmospheric teleconnections. Our analysis considered delta peak perturbations of 1 s.d. Since often perturbations reach extreme deviations of several s.d.^{1,2,54}, and, even more importantly, multiple perturbations can accumulate, these findings reflect the large global influence of these regions.
Discussion
We have introduced a threestep approach for the analysis of multivariate spatiotemporal data sets, consisting of a dimension reduction, causal reconstruction, and CE quantification to identify subprocesses in complex systems that are important gateways for spreading and mediating perturbations entering the system in one subprocess. While this approach lends itself also to other spatiotemporal complex systems such as the brain^{18}, for applications to financial data or food webs, the causal reconstruction can already be applied to, for example, economic indices or species abundances, and in complex systems like power grids or transportation networks, the first two steps of our approach could be skipped since the network structure is naturally given.
The causal quantification approach takes classical analyses of functional brain networks^{18} or climate networks^{21,23}, which were previously mostly based on pairwise association measures, to a new level. Consequently, our proposed node measures can be seen as dynamical and causal alternatives to classical measures for functional networks such as the degree or betweenness centrality. In Supplementary Fig. 9 and Supplementary Note 1 we compare our results with these measures where we find that the latter have only weak predictive power (R^{2}≈0.4) for perturbation propagation and mediation. The pathway analysis goes substantially beyond pure causal network reconstructions^{24,32} and also provides, for the first time, a stricter statistical approach for characterizing atmospheric teleconnections, which are of paramount importance for studies of climate change and in particular climate extremes^{55}, and which were previously formulated more phenomenologically based on (lagged) correlations^{39,40,56}.
Like any datadriven approach our method is limited by several assumptions: causal sufficiency^{30,33} assumes that the common drivers of all variables are taken into account and the causal Markov condition assumes all ‘error terms’ of the nodes in the time series graph to be independent. In our climate analysis, for example, we only excluded common drivers from within the pressure system, but on larger monthly time scales the underlying sea surface temperature field certainly interacts with the faster atmospheric pressure field over the oceans^{57} and can confound the assessment of CEs. It is, therefore, important to interpret CEs only relative to the variables that were taken into account. A further complication are CEs that are faster than the weekly resolution considered here, which appear as contemporaneous in our analysis, but are not taken into account since they are not regarded as causal links. Our weekly time resolution reflects a balance between resolving causal directionality and a multiple testing problem if too many lags are considered (in our example 30 days). Also the interplay of different time scales^{58,59} could be further addressed, for example by singular spectrum analysis^{45}, and one could possibly also account for timevarying time delays of interactions^{60}. To estimate the time series graph and CEs from the observed time series, we assume stationarity such that these properties do not change over time. More detailed research questions can take into account nonstationarity, for example, due to seasonality in climate (here we used the whole time series). While the linear approach can also be adapted from deltapeak perturbations to more general scenarios with multiple or different types of perturbations^{35}, the perturbations must be small enough to conserve the dynamics and causal structure of the system such that the conditional distributions remain the same^{35}. The effects of large unprecedented perturbations cannot be predicted from observed data alone. We introduced the method using simple linear measures here, but the framework can to some extent also be implemented with nonlinear quantifiers, for example using informationtheoretic measures^{42,61}.
We see the proposed method as a complementary firststep approach towards model simulations and experiments to help guide decision making in several ways: in climate, the knowledge of subprocesses or regions with large perturbative effect, either as gateways or mediators, can help to optimally design computationally expensive simulations such as tracer experiments^{14}, geoengineering impact assessments^{3,4}, or extreme event attribution studies^{62}. Such experiments allow to conduct counterfactual analyses, for example, with and without anthropogenic influences, to conclude on necessary and sufficient CEs^{63}. In neuroscience, it could help to optimize therapeutic interventions for preventing seizures by targeting selected brain regions with large CE or mediating CE. In power grids, nodes with strong mediating effect are the ones that one would best block to prevent a blackout perturbation from spreading throughout the network. Summarizing, the novel causal interaction perspective provides a general approach to better understand the possible influence of perturbations on complex spatiotemporal systems and may guide further research to inform design and engineering processes aiming at increasing their resilience against shocks or extreme events.
Methods
Data and software availability
The climatological reanalysis data set^{41} studied here can be downloaded from http://www.esrl.noaa.gov/psd/data/gridded/data.ncep.reanalysis.html. Code for the dimension reduction step is available from coauthor M. Vejmelka at https://github.com/vejmelkam/ndwclimate/blob/master/scripts. A Python software script by J. Runge to estimate the causal network can be obtained from http://tocsy.pikpotsdam.de/tigramite.php.
Dimension reduction
Our dimension reduction approach is based on Varimaxrotated principal components^{28,45} and a subsequent significance test to eliminate components merely representing noise^{29}. As further discussed in ref. 29, the rotation of principal components maximizes the sum of the variances of the squared principal component weights (loadings) and better represents regionally confined processes than principal components. The data preprocessing steps to obtain the component weights are discussed in more detail in ref. 29, here we give a brief summary: Monthly gridded time series are first anomalized to remove the annual cycle not only from the seasonal means but also from the seasonal variance. After a linear detrending, the covariance matrix is estimated on cosinetransformed data to account for the area a grid point represents (poles are excluded), and the eigenvectors are computed. These are then rotated using the Varimax criterion^{28} and a limited number of components is selected based on a comparison of eigenvalues of original data (not components) to those from surrogate data which preserve the autocorrelation structure, but destroy dependencies between the grid point time series. Here this algorithm yields N=60 significant components. Finally, the component weight matrix is multiplied with the daily original gridded time series (that have been preprocessed by anomalization in mean and variance, standardization, linear detrending and cosine transform as above), and the daily component time series are aggregated to a weekly resolution which reflects a balance between causal time resolution and the multiple testing problem in the causal reconstruction step. In contrast to principal components, where the diagonal entries corresponding to the eigenvalues can be interpreted as the explained variability, for rotated principal components, the offdiagonal entries are not zero anymore and one cannot simply attribute an ‘explained variance’ to each component. We enumerate the components by the entry on the diagonal starting with the largest value (component No. 0). Here monthly time series were used for the extraction of the components for computational reasons. Carrying out the decomposition directly on the daily or weekly time series might have provided a slightly different set of components, as the decomposition would also take into account higher frequency variability. In Supplementary Figs 1, 2 we show the loadings and time series of all components.
Causal reconstruction
To reconstruct the causal network from the component time series, we utilize a causal discovery algorithm^{31,32} which is based on the PC algorithm (named after its inventors Peter Spirtes and Clark Glymour^{30}). This algorithm can be used in an informationtheoretic framework^{31} as well as employing linear partial correlation^{24,32}. Here we choose the linear approach as a firstorder approximation. The significance level used in the causal algorithm is not a very reliable indicator for the final significance level of causal links because links are tested sequentially and, therefore, Bonferroni corrections cannot be easily applied. To overcome this problem, we use the causal algorithm only as a variable selection for a subsequent ‘causal regression’. The time series graph reconstruction, thus, consists of three steps as described below.
First, variable selection of the causal parents: The parents of each component j are selected with the causal algorithm described in ref. 32 and Supplementary Note 2. The algorithm’s parameters here are: maximum time lag τ_{max}=4 weeks, (twotailed) significance level α=0.001 (Student’s ttest), initial number of conditions n_{0}=3. For the causal algorithm to consistently converge to the true parents, one needs to assume causal sufficiency and the causal Markov condition, that is, the independence of error terms driving each subprocess, and faithfulness^{30} which guarantees that the graph entails all conditional independence relations true for the underlying process and can be violated in certain pathological cases^{30}. Since we estimate partial correlations from time series data here, we also assume stationarity. Ref. 64^{64} discusses the computational complexity of the algorithm. In Supplementary Fig. 3b we show the distribution of the number of parents for every component (black dashed line) and in Supplementary Tables 2, 3 we list the parents for all components (median number of parents is 8).
Secondly, estimating the causal regression matrix: the lagged causal regression matrix C(τ) of shape (N, N, τ_{max}) is estimated using the above selected parents by
where r denotes the standardized regression coefficient of component in the multiple regression model of on using ordinary least squares regression. For infinite sample sizes, these ‘causal’ regression coefficients would be nonzero only for the parents of each component as estimated with the algorithm. In Supplementary Fig. 3a we show the sorted coefficients for every component j. The largest coefficients are typically associated with the past lag of a component. After a sharp decay, most of the coefficients have absolute values below 0.1
Thirdly, constructing the time series graph: The causal time series graph is constructed from thresholding the causal regression matrix with crosslinks (i j) and autolinks (i=j) defined by
with the threshold θ chosen to obtain a given link density in the corresponding aggregated causal network (Supplementary Fig. 9b), that is, , where autolinks are not counted and multiple links between two components are only counted once. For the link density ρ_{dens}=0.2 analysed here, there are 708 such links, while the time series graph, thresholded at θ=0.0585, has a link density of 0.062. Because of the assumed stationarity, the subscript t in equation (2) can be dropped. For the network analysed in the main article at 20% link density, the median of parents in the time series graph is 14 which determines the nonzero coefficients in the CEestimation model (4) described below. Supplementary Fig. 3b shows the distribution of parents for different link densities.
Causal effect estimation
There are different ways to use the reconstructed causal network to further quantify causal interactions between subprocesses. We call the general idea to use the time series graph for quantifying general causal interactions the Tigramite approach (time series graphbased measures of information transfer), which is also the abbreviation of the accompanying software package. Here we consider a measure I to quantify the linear CE of perturbations for its reliable estimation and interpretability, generalizations in an informationtheoretic framework are discussed in refs 42, 61.
The approach is based on the CE estimator for multivariate time series proposed in ref. 35 in a linear application of Pearl’s causal framework^{33}, considering deltapeak perturbations (called atomic interventions in ref. 35). Within this framework, the CE of a perturbation of setting to x* on is given by
where denotes the expectation. It is important to note that the dooperator does not pertain to a conditional expectation, but refers to the experiment of intervening in the system and forcing the variable to a certain value. From observational data, this effect can only be estimated (or identified) under certain assumptions^{33,35}. Here we assume a linear model based on the reconstructed time series graph with all relevant variables (or confounders) included (thus, satisfying the backdoor criterion^{33,35}):
Note that we do not fit a full autoregressive model here, but a sparse one where we estimate only those coefficients corresponding to causal links in the time series graph (including cross and autolinks). A standardized coefficient Φ_{ji}(τ) is called path coefficient^{34,51} and stands for the change in the expectation of (in units of its s.d.) induced by raising by 1 s.d., while keeping all other parents of constant. The matrices of path coefficients between all components are shown in Supplementary Fig. 4. Then the CE of a perturbation x*=1, that is, a 1 s.d. for the standardized component time series, is given by^{35}
where Ψ(τ) can be iteratively computed from matrix products of the estimated coefficient matrices Φ(τ) by
for example,
where is the identity matrix. An entry Ψ_{ji}(τ) here yields the sum over the products of path coefficients along all causal paths as explained for the climate example in Fig. 3. The framework also allows to encompass more complex types of perturbations such as the effect of multiple perturbations^{35}. In Supplementary Fig. 5 we depict the matrices of CEs between all pairs of components for all considered link densities.
The MCE through a component k is defined as the sum over the products of path coefficients only along causal paths through k. From the matrices Ψ it can be derived as
where Ψ^{(k)}(τ) is computed from equation (6) with modified path coefficient matrices Φ^{(k)}(τ) where all links towards component k are set to zero,
which blocks all paths through component k at any lag. The causal interpretation is that an indirect effect via the component X^{k} measures the change we would see in while holding constant and setting component X^{k} to whatever value it would have obtained under a unit change in ^{33,65}.
Aggregated measures
For the aggregated causal effect measures ACE and ACS we are interested not so much in the lag at which the interaction occurs and, therefore, base these measures on the lag with maximum effect:
quantifies by how much (in units of its s.d.) any of the N–1 remaining components is changed on average by a one unit increase in component i (at the lag with maximum absolute effect). This serves as a quantitative measure of how much a component is a gateway of perturbations. , on the other hand, measures by how much a component j is changed on average by a one unit perturbation in any of the N–1 remaining components. Further, we denote the fraction of components that i is influencing with by and as the fraction of components that j is affected by, at any lag within 0<τ≤τ_{max}. Normalizing ACE and ACS by these quantities results in a measure that is not as robust as desired because it depends on the chosen threshold. In Supplementary Fig. 7 we show ACE versus ACS for different network link densities and reconstruction parameters (maximum lag τ_{max}, significance level α), and also for different segments of the data set.
The AMCE is based on causal paths passing through a given node:
where is the set of interactions between all nonidentical pairs i, j k at all lags 0<τ≤τ_{max} where k is an intermediate component (at any lags) and denotes its cardinality. Here we take the absolute value , but one could further distinguish between enhancing (where the sign of MCE equals that of CE) and counteracting (opposite signs) effects. In general, there can be maximally c_{max}=(N–1)(N–2)=3,422 interacting nonidentical pairs and in Fig. 4b,d we depict the fraction of interaction pairs where a component k is an intermediate node as the size of the nodes. In Supplementary Fig. 8 we show ACE versus AMCE as in Fig. 4d for different setups.
Uncertainty quantification
To estimate the standard errors for all causal effect measures considered above, we employ a residualbased bootstrap procedure^{66}. Each bootstrap surrogate is constructed from running model (4) with a joint random sample (with replacement) of the original multivariate residual time series and with the original coefficient matrices Φ(τ),
From this bootstrap surrogate time series, Φ*(τ) is estimated from which the other quantities are derived. We use 200 bootstrap surrogates here to estimate the standard errors of all quantities defined above (given as ± in the main text as well as Supplementary Table 1 and as error bars in scatter plots in Fig. 4 and in the Supplementary Figs 7, 8).
Additional information
How to cite this article: Runge, J. et al. Identifying causal gateways and mediators in complex spatiotemporal systems. Nat. Commun. 6:8502 doi: 10.1038/ncomms9502 (2015).
References
Rahmstorf, S. & Coumou, D. Increase of extreme events in a warming world. Proc. Natl Acad. Sci. USA 108, 17905–17909 (2011) .
Ghil, M. et al. Extreme events: dynamics, statistics and prediction. Nonlin. Process. Geophys. 18, 295–350 (2011) .
Orgis, T. et al. Influence of interactive stratospheric chemistry on largescale air mass exchange in a global circulation model. Eur. Phys. J. Spec. Top. 174, 257–269 (2009) .
Vaughan, N. E. & Lenton, T. M. A Review of Geoengineering Proposals. Climatic Change 109, 745–790 (2011) .
Bassett, G. W. & Lin, Z. Breaking global temperature records after Mt. Pinatubo. Climatic Change 25, 179–184 (1993) .
Trenberth, K. E., Fasullo, J. T., Branstator, G. & Phillips, A. S. Seasonal aspects of the recent pause in surface warming. Nat. Clim. Change 4, 911–916 (2014) .
Zubler, F. et al. Detecting functional hubs of ictogenic networks. Brain Topogr. 28, 305–317 (2014) .
Albert, R., Albert, I. & Nakarado, G. L. Structural vulnerability of the North American power grid. Phys. Rev. E 69, 025103 (2004) .
Menck, P. J., Heitzig, J., Kurths, J. & Schellnhuber, H. J. How dead ends undermine power grid stability. Nat. Commun. 5, 3969 (2014) .
Newman, M. E. J. Spread of epidemic diseases on networks. Phys. Rev. E 66, 016128 (2002) .
Klemm, K., Serrano, M. A., Egulluz, V. M. & Miguel, M. S. A measure of individual role in collective dynamics. Sci. Rep. 2, 292 (2012) .
Lenzu, S. & Tedeschi, G. Systemic risk on different interbank network topologies. Physica A 391, 4331–4341 (2012) .
Haldane, A. G. & May, R. M. Systemic risk in banking ecosystems. Nature 469, 351–355 (2011) .
Brovkin, V. et al. Geoengineering climate by stratospheric sulfur injections: Earth system vulnerability to technological failure. Climatic Change 92, 243–259 (2009) .
Stocker, T. & Qin, D. Climate Change 2013: The Physical Science Basis Cambridge University Press (2013) .
Newman, M. E. J. Networks: An Introduction Oxford University Press (2010) .
Friston, K. J. Functional and effective connectivity in neuroimaging: a synthesis. Hum. Brain Mapp. 2, 56–78 (1994) .
Bullmore, E. & Sporns, O. Complex brain networks: graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci. 10, 186–198 (2009) .
Schinkel, S., ZamoraLopez, G., Dimingen, O., Sommer, W. & Kurths, J. Functional network analysis reveals differences in the semantic priming task. J. Neurosci. Methods 197, 333–339 (2011) .
Simpson, S. L., Bowman, F. D. & Laurienti, P. J. Analyzing complex functional brain networks: fusing statistics and network science to understand the brain. Stat. Surv. 7, 1–36 (2013) .
Tsonis, A. A., Swanson, K. L. & Wang, G. On the role of atmospheric teleconnections in climate. J. Climate 21, 2990–3001 (2008) .
Yamasaki, K., Gozolchiani, A. & Havlin, S. Climate networks around the globe are significantly affected by El Nino. Phys. Rev. Lett. 100, 228501 (2008) .
Donges, J. F., Zou, Y., Marwan, N. & Kurths, J. The backbone of the climate network. Europhys. Lett. 87, 48007 (2009) .
EbertUphoff, I. & Deng, Y. Causal discovery for climate research using graphical models. J. Climate 25, 5648–5665 (2012) .
Deng, Y. & EbertUphoff, I. Weakening of atmospheric information flow in a warming climate in the Community Climate System Model. Geophys. Res. Lett. 41, 193–200 (2014) .
Boers, N., Bookhagen, B., Barbosa, H., Marwan, N. & Kurths, J. Prediction of extreme floods in the eastern Central Andes based on a complex networks approach. Nat. Commun. 5, 5199 (2014) .
Freeman, L. C. A set of measures of centrality based on betweenness. Sociometry 40, 35–41 (1977) .
Kaiser, H. F. The varimax criterion for analytical rotation in factor analysis. Psychometrika 23, 187–200 (1958) .
Vejmelka, M. et al. Nonrandom correlation structures and dimensionality reduction in multivariate climate data. Climate Dyn. 44, 2663–2682 (2015) .
Spirtes, P., Glymour, C. & Scheines, R. Causation, Prediction, and Search The MIT Press (2000) .
Runge, J., Heitzig, J., Petoukhov, V. & Kurths, J. Escaping the curse of dimensionality in estimating multivariate transfer entropy. Phys. Rev. Lett. 108, 258701 (2012) .
Runge, J., Petoukhov, V. & Kurths, J. Quantifying the strength and delay of climatic interactions: The ambiguities of cross correlation and a novel measure based on graphical models. J. Climate 27, 720–739 (2014) .
Pearl, J. Causality: Models, Reasoning, and Inference Cambridge University Press (2000) .
Pearl, J. Linear models: a useful ‘microscope’ for causal analysis. J. Causal Inference 1, 155–170 (2013) .
Eichler, M. & Didelez, V. On Grangercausality and the effect of interventions in time series. Lifetime Data Anal. 16, 3–32 (2010) .
Freeman, L. C. Centrality in social networks conceptual clarification. Soc. Networks 1, 215–239 (1978) .
Philander, S. G. H. ElNiño and the Southern Oscillation Academic press (1990) .
Cane, M. A. The evolution of El Niño, past and future. Earth Planet. Sci. Lett. 230, 227–240 (2005) .
Wallace, J. M. & Gutzler, D. S. Teleconnections in the geopotential height field during the Northern Hemisphere winter. Mon. Weather Rev. 109, 784–812 (1981) .
Ghil, M. & Mo, K. Intraseasonal Oscillations in the Global Atmosphere. Part I: Northern Hemisphere and Tropics. J. Atmos. Sci 48, 752–779 (1991) .
Kalnay, E. et al. The NCEP/NCAR 40year reanalysis project. Bull. Am. Meteorol. Soc. 77, 437–471 (1996) .
Runge, J. Quantifying information transfer and mediation along causal pathways in complex systems, Preprint at http://arxiv.org/abs/1508.03808 [stat.ME] (2015) .
Webster, P. J. et al. Monsoons: processes, predictability, and the prospects for prediction. J. Geophys. Res. Oceans 103, 14451–14510 (1998) .
Hastie, T., Tibshirani, R. & Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction 2nd edn Springer (2009) .
Vautard, R. & Ghil, M. Singular spectrum analysis in nonlinear dynamics, with applications to paleoclimatic time series. Physica D 35, 395–424 (1989) .
Hlinka, J. et al. Reliability of inference of directed climate networks using conditional mutual information. Entropy 15, 2023–2045 (2013) .
Spirtes, P. & Glymour, C. An algorithm for fast recovery of sparse causal graphs. Soc. Sci. Comput. Rev. 9, 62–72 (1991) .
Eichler, M. Graphical modelling of multivariate time series. Probab. Theory Relat. Fields 153, 233–268 (2012) .
Lauritzen, S. L. Graphical Models Oxford University Press (1996) .
Storch, H. V. & Zwiers, F. W. Statistical Analysis in Climate Research Cambridge University Press (1999) .
Wright, S. The method of path coefficients. Ann. Math. Stat. 5, 161–215 (1934) .
Kumar, K. K., Rajagopalan, B. & Cane, M. A. On the weakening relationship between the Indian Monsoon and ENSO. Science 284, 2156–2159 (1999) .
Hosking, J. S., Russo, M. R., Braesicke, P. & Pyle, J. A. Tropical convective transport and the Walker circulation. Atmos. Chem. Phys. 12, 9791–9797 (2012) .
Petoukhov, V., Rahmstorf, S., Petri, S. & Schellnhuber, H. J. Quasiresonant amplification of planetary waves and recent Northern Hemisphere weather extremes. Proc. Natl Acad. Sci. USA 110, 5336–5341 (2013) .
Lau, W. & Kim, K. The 2010 Pakistan flood and Russian heat wave: teleconnection of hydrometeorological extremes. J. Hydrometeor. 13, 392–403 (2012) .
Ghil, M. & Robertson, A. W. ‘Waves’ versus ‘particles’ in the atmosphere's phase space: a pathway to longrange forecasting? Proc. Natl Acad. Sci. USA 99, 2493–2500 (2002) .
Lau, K. & Yang, S. in Encyclopedia of Atmospheric Sciences ed. Holton J. R. 2505–2510Academic Press (2003) .
Paluš, M. Multiscale atmospheric dynamics: crossfrequency phaseamplitude coupling in the air temperature. Phys. Rev. Lett. 112, 078702 (2014) .
Moron, V., Robertson, A. W., Qian, J.H. & Ghil, M. Weather types across the Maritime Continent: from the diurnal cycle to interannual variations. Front. Environ. Sci. 2, 65 (2015) .
Coluzzi, B., Ghil, M., Hallegatte, S. & Weisbuch, G. Boolean delay equations on networks in economics and the geosciences. Int. J. Bifurcat. Chaos 21, 3511–3548 (2011) .
Runge, J., Heitzig, J., Marwan, N. & Kurths, J. Quantifying causal coupling strength: a lagspecific measure for multivariate time series related to transfer entropy. Phys. Rev. E 86, 061121 (2012) .
Pall, P. et al. Anthropogenic greenhouse gas contribution to flood risk in England and Wales in autumn 2000. Nature 470, 382–385 (2011) .
Hannart, A., Pearl, J., Otto, F., Naveau, P. & Ghil, M. Causal counterfactual theory for the attribution of weather and climaterelated events. Bull. Am. Meteor. Soc. Early online release at http://dx.doi.org/10.1175/BAMSD1400034.1 (2015) .
Runge, J., Donner, R. & Kurths, J. Optimal modelfree prediction from multivariate time series. Phys. Rev. E 91, 052909 (2015) .
VanderWeele, T. Explanation in causal inference: methods for mediation and interaction Oxford University Press (2015) .
Hardle, W., Horowitz, J. & Kreiss, J.P. Bootstrap methods for time series. Int. Stat. Rev. 71, 435–459 (2003) .
Acknowledgements
J.R. received support by the German National Academic Foundation (Studienstiftung), a Humboldt University Postdoctoral Fellowship, and the German Federal Ministry of Science and Education (Young Investigators Group CoSyCC^{2}, grant no. 01LN1306A). J.F.D. thanks the Stordalen Foundation and BMBF (project GLUES) for financial support. D.H. has been funded by grant ERCCZ CORES LL1201 of the Czech Ministry of Education. M.P. and N.J. received funding from the Czech Science Foundation project No. P3031402634S and from the Czech Ministry of Education, Youth and Sports, project No. DAAD1530. J.H. was supported by the Czech Science Foundation project GA1323940S and Czech Health Research Council project NV1529835A. We thank Mary Lindsey from the National Oceanic and Atmospheric Administration for her kind help with Fig. 4e. NCEP Reanalysis data provided by NOAA/OAR/ESRL PSD, Boulder, Colorado, USA, from their web site at http://www.esrl.noaa.gov/psd/.
Author information
Authors and Affiliations
Contributions
J.R. designed the study, J.R., N.J. and D.H. prepared the data, J.R. carried out the analysis and prepared the manuscript. All authors discussed the results and contributed to editing the manuscript. N.M., M.P. and J.K. supervised the study.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Information
Supplementary Figures 19, Supplementary Tables 13, Supplementary Notes 12 and Supplementary References (PDF 9318 kb)
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Runge, J., Petoukhov, V., Donges, J. et al. Identifying causal gateways and mediators in complex spatiotemporal systems. Nat Commun 6, 8502 (2015). https://doi.org/10.1038/ncomms9502
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/ncomms9502
This article is cited by

Megacities are causal pacemakers of extreme heatwaves
npj Urban Sustainability (2024)

Causal inference for time series
Nature Reviews Earth & Environment (2023)

Stratocumulus adjustments to aerosol perturbations disentangled with a causal approach
npj Climate and Atmospheric Science (2023)

Global warming overshoots increase risks of climate tipping cascades in a network model
Nature Climate Change (2023)

Coupling the Causal Inference and Informer Networks for Shortterm Forecasting in Irrigation Water Usage
Water Resources Management (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.