Causal decomposition in the mutual causation system

Yang, Albert C.; Peng, Chung-Kang; Huang, Norden E.

doi:10.1038/s41467-018-05845-7

Download PDF

Article
Open access
Published: 23 August 2018

Causal decomposition in the mutual causation system

Nature Communications volume 9, Article number: 3378 (2018) Cite this article

17k Accesses
40 Citations
51 Altmetric
Metrics details

Subjects

Matters Arising to this article was published on 23 May 2022

This article has been updated

Abstract

Inference of causality in time series has been principally based on the prediction paradigm. Nonetheless, the predictive causality approach may underestimate the simultaneous and reciprocal nature of causal interactions observed in real-world phenomena. Here, we present a causal-decomposition approach that is not based on prediction, but based on the covariation of cause and effect: cause is that which put, the effect follows; and removed, the effect is removed. Using empirical mode decomposition, we show that causal interaction is encoded in instantaneous phase dependency at a specific time scale, and this phase dependency is diminished when the causal-related intrinsic component is removed from the effect. Furthermore, we demonstrate the generic applicability of our method to both stochastic and deterministic systems, and show the consistency of causal-decomposition method compared to existing methods, and finally uncover the key mode of causal interactions in both modelled and actual predator–prey systems.

Partial cross mapping eliminates indirect causal influences

Article Open access 26 May 2020

Fast and effective pseudo transfer entropy for bivariate data-driven causal inference

Article Open access 19 April 2021

Inferring causation from time series in Earth system sciences

Article Open access 14 June 2019

Introduction

Since the philosophical inception of causality by Galilei¹ and Hume² that cause must precede the effect in time, the scientific criteria for assessing causal relationships between two time series have been dominated by the notion of prediction, as proposed by Granger³. Namely, the causal relationship from variable A to variable B is inferred if the history of variable A is helpful in predicting the value of variable B, rather than using information from the history of variable B alone.

Granger causality is based on the time dependency between cause and effect⁴. As discussed by Sugihara et al.⁵, Granger causality is critically dependent on the assumption that cause and effect are separable³. While the separability is often satisfied in linear stochastic systems where Granger causality works well, it might not be applicable in nonlinear deterministic systems where separability appears to be impossible because both cause and effect are embedded in a non-separable higher dimension trajectory^6,7. Consequently, Sugihara et al.⁵ proposed the convergent cross-mapping (CCM) method based on state-space reconstruction. In this context, cause and effect are state dependent, and variable A is said to causally influence variable B, although counterintuitive, if the state of variable B can be used to predict the state of variable A in the embedded space, and this predictability improves (i.e., converges) as the time series length increases.

Existing methods of detecting causality in time series are predominantly based on the Bayesian⁸ concept of prediction. However, cause and effect are likely simultaneous⁹. The succession in time of the cause and effect is produced because the cause cannot achieve the totality of its effect in one moment. At the moment when the effect first manifests, it is always simultaneous with its cause. Moreover, most real-world causal interactions are reciprocal; examples include predator–prey relationships and the physiologic regulation of body functions. In this sense, predictive causality may fail because the attempt to estimate the effect with the history of cause is compromised as the history of the cause is already simultaneously influenced by the effect itself, and vice versa.

Another constraint of the generalised prediction framework is that it requires a priori knowledge of the extent of past history that may influence and predict the future, such as the time lag between cause and effect in Granger’s paradigm, or the embedding dimensions in state-space reconstructions such as CCM. Furthermore, a causality assessment is incomplete if it is based exclusively on time dependency or state dependency. Time series commonly observed in nature, including those from physiologic system or spontaneous brain activity, contain oscillatory components within specific frequency bands^10,11. Identification of frequency-specific causal interaction is essential to understand the underlying mechanism^12,13. Furthermore, the application of either linear Granger causality or the nonlinear CCM method alone is insufficient to accommodate the complex causal compositions typically observed in real-world data blending with oscillatory stochastic and deterministic mechanisms.

Here, we present a causal-decomposition analysis that is not based on prediction, and more importantly, is neither based on time dependency nor state dependency, but based on the instantaneous phase dependency between cause and effect. The causal decomposition essentially involves two assumptions: (1) any cause–effect relationship can be quantified with instantaneous phase dependency between the source and target decomposed as intrinsic components at specific time scale, and (2) the phase dynamics in the target originating from the source are separable from the target itself. We define the cause–effect relationship between two time series according to the covariation principle of cause and effect¹: cause is that which put, the effect follows; and removed, the effect is removed; thus, variable A causes variable B if the instantaneous phase dependency between A and B is diminished when the intrinsic component in B that is causally related to A is removed from B itself, but not vice versa. To achieve this, we use the ensemble empirical mode decomposition (ensemble EMD)^14,15,16 to decompose a time series into a finite number of intrinsic mode functions (IMFs) and identify the causal interaction that is encoded in instantaneous phase dependency between two time series at a specific time scale. We validate the causal-decomposition method with both stochastic and deterministic systems and illustrate its application to ecological time series data of prey and predators.

Results

Illustration of the causal-decomposition method

Figure 1 depicts how the causal decomposition can be used to identify the predator–prey causal relationship of Didinium and Paramecium¹⁷. Briefly, we decomposed the time series of Didinium and Paramecium into two set of IMFs, and determined the instantaneous phase coherence¹⁸ between comparable IMFs from the two time series (Fig. 1a). Orthogonality and separability tests were performed to determine the ensemble EMD parameter (i.e., added noise level) that minimises the nonorthogonal leakage and root-mean-square of the correlation between the IMFs, thereby ensuring the orthogonality and separability of the IMFs (Fig. 1d, e). Subsequently, we removed one of the IMFs (e.g., IMF 2) from Paramecium (Fig. 1b; subtract IMF 2 from the original Paramecium signal) and redecomposed the time series. We then calculated the phase coherence between the original IMFs of Didinium and redecomposed IMFs of Paramecium. This decomposition and redecomposition procedure was repeated for IMF 2 of Didinium (Fig. 1c) and generalised to all IMF pairs. This procedure enabled us to examine the differential effect of removing a causal-related IMF on the redistribution of phase dynamics in cause-and-effect variables. The relative ratio of variance-weighted Euclidian distance between the phase coherence of the original IMFs (i.e., Fig. 1a) and redecomposed IMFs (i.e., Fig. 1b, c) is therefore an indicator of causal strength (Fig. 1f), where a ratio of 0.5 indicates either no causality is detected or no difference in causal strength in the case of reciprocal causation, and a ratio approaching 0 or 1 indicates a strong causal influence from either variable A or variable B, respectively.

Application to deterministic and stochastic models

Figure 2 depicts the causal-decomposition analysis in both deterministic⁵ and stochastic¹⁰ models given in Eqs. 9 and 10. The IMF with a causal influence identifies the key mechanism of the model data in stochastic (Fig. 2a) and deterministic (Fig. 2b) systems. These results indicate that the causal-decomposition method is suitable for separating causal interactions not only in the stochastic system, but also in the deterministic model where non-separability is generally assumed in the state space. Furthermore, we validated and compared the causal decomposition with existing causality methods in uncorrelated white noise with varying lengths, showing the consistency of causal decomposition in a short time series and under conditions where no causal interaction should be inferred (Fig. 3a). In addition, we assessed the effect of down-sampling (Fig. 3b) and temporal shift (Fig. 3c) of a time series on causal decomposition and existing methods, showing that causal decomposition is less vulnerable to spurious causality due to sampling issues³ and is independent of temporal shift, which is significantly confounded with the predictive causality method¹⁹.

Validation of causal-decomposition analysis

We generated 10,000 pairs of uncorrelated white noise time-series observations with varying lengths (L = 10–1000) and calculated causality based on various methods (Fig. 3a). Causal decomposition exhibited a consistent pattern of causal strengths at 0.5 (the error bar denotes the standard error of causality assessment here and in the other panels), indicating that no spurious causality was detected, even in the case of the short noise time series. Causality in the CCM methods was indicated by the difference in correlations obtained from cross-mapping the embedded state space. In the case of uncorrelated white noise, the difference of correlation should be approximately zero, indicating no causality. However, the CCM method detects spurious causality with differences of up to 0.4 in the crossmap correlations in the short time series, and the difference between the correlations decreased as the signal length increased. A high percentage or intensity of spurious causality was also observed in Granger’s causality and mutual information from the mixed embedding (MIME) method²⁰.

Next, we assessed the effect of down-sampling on the various causality methods (Fig. 3b). The stochastic and deterministic models shown in Fig. 2 are used (the corresponding colour for each variable is shown in the figure). The time series were down-sampled by a factor 1 to 10. For Factor 1, the time series were identical to the original signals. The down-sampling procedure destroyed the causal dynamics in both models and made causal inference difficult in predictive causality analysis¹⁹. Causal-decomposition analysis revealed a consistent pattern of the absence of causality when the causal dynamics were destroyed as the down-sampling factor was >2. However, spurious causality was detected with the predictive causality methods when the signals were down-sampled.

Finally, we evaluated the effect of temporal shift on the causality measures (Fig. 3c). Temporal shift (both lagged or advanced up to 20 data points) was applied to both the stochastic and deterministic time series. Causal decomposition exhibited a stable pattern of causal strength independent of a temporal shift up to 20 data points. CCM reduced its crossmap ability to detect causa interaction in the bi-directional deterministic system as temporal shift increased in either direction, and is unable to show differences in crossmap ability in the anterograde temporal shift in stochastic system. As anticipated, Granger’s causality showed the opposite patterns of causal interaction in anterograde and retrograde temporal shift in both deterministic and stochastic system. MIME lost its predictability when the temporal shift is beyond 5 data points and was inconsistent in stochastic system.

Quantifying predator and prey relationship

Figure 4 shows the results of applying causal decomposition to ecosystem data from the Lotka Volterra predator–prey model^21,22 (Eq. 11; Fig. 4a), wolf and moose data from Isle Royale National Park²³ (Fig. 4b), and the Canada lynx and snowshoe hare time series reconstructed from historical fur records of Hudson’s Bay Company²⁴ (Fig. 4c). The causal decomposition invariantly identifies the dominant causal role of the predator in the IMF, which is consistent with the classic Lotka Volterra predator–prey model. Previously, the causality of such autonomous differential equation models was understood only in mathematical terms because there is no prediction-based causal factor²⁵, yet our results indicated that the causal influence of this model can be established through the decomposition of instantaneous phase dependency.

Comparison of causal assessment in ecosystem data

Figure 5 shows the comparison of causality assessment in these predator and prey data using different methods. In general, results showed that neither the Granger nor CCM methods consistently identify predator–prey interactions in these data, indicating that the predator–prey relationship does not exclusively fit either the stochastic or deterministic chaos paradigms. The CCM result showed a top–down causal interaction between lynx and hare, and Didinium and Paramecium interactions¹⁷, which the latter was consistent with the data presented by Suigihara et al.⁵ However, CCM method could not be used to detect causal interaction in the Lotka Volterra predator–prey model, and it exhibited a cross-over of correlations in the wolf and moose data. Granger’s causality detected top–down causal interaction in the Lotka Volterra predator–prey model and wolf and moose data, but the bottom-up causal interaction was observed in Didinium and Paramecium data, which the latter was also observed in the supplementary data in Sugihara et al.⁵ The inconsistency in causal strength was also observed in the results obtained with the MIME method.

Discussion

An interdisciplinary problem of detecting causal interactions between oscillatory systems solely from their output time series has attracted considerable attention for a long time. The motivation of causal-decomposition analysis is that the inference of causality that is largely dependent on the temporal precedence principle is of concern. In other words, observing the past with a limited period is insufficient to infer causality because that history is already biased. Instead, we followed another fundamental criterion of causal assessment proposed by Galilei¹—covariation of cause and effect: cause is that which put, the effect follows; and removed, the effect is removed. In this statement, however, the prediction of time series based on the past history is neither required or implied. Therefore, the complex dynamical process between cause and effect should be delineated through the decomposition of intrinsic causal components inherited in causal interactions.

It is noteworthy that our approach is essentially different by combing EMD with existing causality methods, such as assessing Granger’s causality between paired IMFs of economic time series²⁶, applying CCM to detect the nonlinear coupling of decomposed brain wave data²⁷, or measuring time dependency between IMFs decomposed from stock market data²⁸. The decomposition of time series with EMD alone may improve the separability of intrinsic components embedded in the time series data, but does not avoid the constraints inherited from the existing prediction-based causality methods. Furthermore, our approach does not neglect the temporal precedence principle, but emphasises the instantaneous relationship of causal interaction, and is thus more amenable to detecting simultaneous or reciprocal causation, which is not fully accounted for by predictive methods.

Because our causal strengths measurement is relative, it detects differential causality rather than absolute causality. Differential causality adds to the philosophical concept of mutual causality that all causal effects are not equal, and it may fit the emerging research data better than linear and unidirectional causal theories do. In addition, causal decomposition using EMD fundamentally differs from the spectral extension of Granger’s causality²⁹ in that the latter involves the prior knowledge of history (e.g., autoregressive model order) and is susceptible to non-stationary artefacts. Furthermore, without resorting to frequency-domain decomposition, EMD bypasses the linear and stationary assumptions, and the limitation of uncertainty principle imposed on data characteristics as in Fourier analysis, and results in more precise phase and amplitude definition³⁰.

The operational definition of causal decomposition is in accordance with Granger’s assumption on separability³ but in a more complete form. We note that such definition is distinct from non-separability assumed by CCM. Clearly, CCM is developed under the constraints of perfect deterministic system, in which the state of cause is encoded in effect that is not separable from effect itself. The state-space reconstruction approach such as CCM may be applicable to certain ecosystem data, such as predator and prey interactions, in which they represent non-separable components of the ecosystem³¹, but is unlikely to generalise to all causal interactions being studied³². It is noteworthy that the effect of temporal shift on the CCM shown in Fig. 3c is relevant to the extended CCM to detect time-delayed causal interactions³³. The extended CCM has been shown to capture bi-directional causal interactions in the deterministic system. However, in the real-world data, the time-delayed causal interaction has to be achieved by the arbitrary temporal shift of time series data, and the interpretation of such results is still of concern, as demonstrated in our Fig. 3c.

Several limitations should be considered in interpreting the causal strength presented in this paper. First, the causal decomposition represents a form of statistical causality and does not imply the true causality, which requires the inclusion of all variables to conclude the existence of causal relationship³. Second, the causal decomposition is limited to the pairwise measurement in the current form, but we do not exclude the possibility of the extension of the current method to multivariate systems (e.g., functional brain networks) with the employment of multivariate EMD^34,35 in the future. In that case, we have to define and work with the absolute causal strength matrix. Then the redecomposition would be from one to many. Although the causal principle remains the same, the computation would be time consuming.

The use of EMD overcomes the difficulty of signal decomposition in nonlinear and non-stationary data, and it is applicable to both stochastic and deterministic systems in that the intrinsic components in the latter remain separable in the time domain. Furthermore, the central element in causal-decomposition analysis is the decomposition and redecomposition procedure, and we do not exclude the use of other signal decomposition methods³⁶ to detect causality in a similar manner. Therefore, the development of causal decomposition is not to complement existing methods, but to explore the use of covariation principle of cause and effect for assessing causality. With the potential of the extension of ensemble EMD to multivariate EMD^34,35, we anticipate that this causal decomposition approach will assist with revealing causal interactions in complex networks not accounted for by current methods.

Methods

Causal relationship based on instantaneous phase dependency

We define the cause–effect relationship between Time Series A and Time Series B according to the fundamental criterion of causal assessment proposed by Galilei¹: cause is that which put, the effect follows; and removed, the effect is removed; thus, variable A causes variable B if the instantaneous phase dependency between A and B is diminished when the intrinsic component in B that is causally related to A is removed from B itself, but not vice versa.

$${\mathrm{Coh}}\left( {A,B\prime } \right) < {\mathrm{Coh}}\left( {A,B} \right)\sim {\mathrm{Coh}}\left( {A\prime ,B} \right)$$

(1)

where Coh denotes the instantaneous phase dependency (i.e., coherence) between the intrinsic components of two time series, and the accent mark represents the time series where the intrinsic components relevant to cause effect dynamics were removed. The realisation of this definition requires two key treatments of the time series. First, the time series must be decomposed into intrinsic components to recover the cause–effect relationship at a specific time scale and instantaneous phase. Second, a phase coherence measurement is required to measure the instantaneous phase dependency between the intrinsic components decomposed from cause–effect time series.

Empirical mode decomposition

To achieve this, we decompose a time series into a finite number of IMFs by using the ensemble EMD^14,15,16 technique. Ensemble EMD is an adaptive decomposition method originated from EMD (i.e., the core of Hilbert–Huang Transform) for separating different modes of frequency and amplitude modulations in the time domain^14,15.

Briefly, EMD is implemented through a sifting process to decompose the original time-series data into a finite set of IMFs. The sifting process comprises the following steps: (1) connecting the local maxima or minima of a targeted signal to form the upper and lower envelopes by natural cubic spline lines; (2) extracting the first prototype IMF by estimating the difference between the targeted signal and the mean of the upper and lower envelopes; and (3) repeating these procedures to produce a set of IMFs that were represented by a certain frequency–amplitude modulation at a characteristic time scale. The decomposition process is completed when no more IMFs could be extracted, and the residual component is treated as the overall trend of the raw data. Although IMFs are empirically determined, they remain orthogonal to one another, and may therefore contain independent physical meanings^15,37.

The IMF decomposed from EMD enables us to use Hilbert transform to derive physically meaningful instantaneous phase and frequency^14,29. For each IMF, they represent narrow-band amplitude and frequency-modulated signal S(t), and can be expressed as

$$S\left( t \right) = A\left( t \right){{\cos}}\emptyset \left( t \right)$$

(2)

where instantaneous amplitude A and phase ∅ can be calculated by applying the Hilbert transform, defined as S_H = $\frac{1}{\pi }{\int} {\frac{{S(t\prime )}}{{t - t\prime }}{\mathrm{d}}t\prime }$; A(t) = $\sqrt {S^2\left( t \right) + S_H^2(t)}$; and ∅(t) = ${\mathrm{arctan}}\left( {{\textstyle{{S_H\left( t \right)} \over {S\left( t \right)}}}} \right)$. The instantaneous frequency is then calculated as the derivative of the phase function ω(t) = d∅(t)/dt.

Thus, the original signal X can be expressed as the summation of all IMFs and residual r,

$$X(t) = \mathop {\sum}\nolimits_{j = 1}^k {A_j\left( t \right) \, {{\exp}}\left( {i\mathop {\scriptstyle\int }\omega_{j}(t){\mathrm{d}}t} \right) + r}$$

(3)

where k is the total number of IMFs, A_j(t) is the instantaneous amplitude of each IMF; and ω_j(t) is the instantaneous frequency of each IMF. Previous literature have shown that IMFs derived with EMD can be used to delineate time dependency³⁸ or phase dependency^{37,39,40,41,42} in nonlinear and non-stationary data.

The ensemble EMD^15,16,43 is a noise-assisted data analysis method to further improve the separability of IMFs during the decomposition and defines the true IMF components S_j(t) as the mean of an ensemble of trials, each consisting of the signal plus white noise of a finite amplitude.

$$S_j\left( t \right) = \lim _{{\mathrm{N}} \to \infty }\mathop {\sum}\nolimits_{k = 1}^N {\left\{ {S_j\left( t \right) + r \times w_k(t)} \right\}} $$

(4)

where w_k(t) is the added white noise, and k is the kth trial of the jth IMF in the noise-added signal. The magnitude of the added noise r is critical to determining the separability of the IMFs (i.e., r is a fraction of a standard deviation of the original signal). The number of trials in the ensemble N must be large so that the added noise in each trial is cancelled out in the ensemble mean of large trials (N = 1000 in this study). The purpose of the added noise in the ensemble EMD is to provide a uniform reference frame in the time–frequency space by projecting the decomposed IMFs onto comparable scales that are independent of the nature of the original signals. With the ensemble EMD method, the intrinsic oscillations of various time scales can be separated from nonlinear and non-stationary data with no priori criterion on the time–frequency characteristics of the signal. Hence, the use of ensemble EMD could complement the constraints of separability in Granger’s paradigm⁴⁴ and potentially capture simultaneous causal relationships not accounted for by predictive causality methods.

Orthogonality and separability of IMFs

Because r is the only parameter involved in the causal-decomposition analysis, the strategy of selecting r is to maximise the separability while maintaining the orthogonality of the IMFs, thereby avoiding spurious causal detection resulting from poor separation of a given signal. We calculated the nonorthogonal leakage¹⁴ and root-mean-square (RMS) of the pairwise correlations of the IMFs for each r with an increment of 0.05 in the uniform space between 0.05 and 1. A general guideline for selecting r in this study is to minimise the RMS of the pairwise correlations of the IMFs (ideally under 0.05) while maintaining the nonorthogonal leakage also under 0.05.

Phase coherence

Next, the Hilbert transform is applied to calculate the instantaneous phase of each IMF and to determine the phase coherence between the corresponding IMFs of two time series¹⁸. For each corresponding pair of IMFs from the two time series, denoted as S_1j(t) and S_2j(t), and can be expressed as

$${S}_{1j} (t) = {A}_{1j} (t){{\cos}}{\emptyset}_{1j}(t) \, {\mathrm{and}} \, {S}_{2j}(t) = {A}_{2j}(t) {{\cos}}{\emptyset}_{2j} {(t)},$$

(5)

where A_1j, ∅_1j can be calculated by applying the Hilbert transform, defined as $S_{1jH} = \frac{1}{\pi }{\int} {\frac{{S_{1j}(t\prime )}}{{t - t^\prime }}{\mathrm{d}}t\prime }$, and $A_{1j}\left( t \right) = \sqrt {S_{1j}^2(t) + S_{1jH}^2(t)} $, and ∅_1j(t) = arctan$\left( {\frac{{S_{1{\mathrm{j}}H}\left( t \right)}}{{S_{1j}\left( t \right)}}} \right)$; and similarly applied for S_2jH, A_2j, and ∅_2j. The instantaneous phase difference is simply expressed as ∆ ∅_12j(t) = ∅_2j(t)∅_1j(t). If two signals are highly coherent, then the phase difference is constant; otherwise, it fluctuates considerably with time. Therefore, the instantaneous phase coherence Coh measurement can be defined as

$${\mathrm{Coh}}\left( {S_{1j},S_{2j}} \right) = \frac{1}{T}\left| {{\int}_{\hskip-5pt 0}^T {e^{i\Delta \emptyset _{12j}(t)}{\mathrm{d}}t} } \right|$$

(6)

Note that the integrand (i.e., $e^{i\Delta \emptyset _{12{\mathrm{j}}}(t)}$) is a vector of unit length on the complex plane, pointing toward the direction which forms an angle of $\Delta \emptyset _{12j}(t)$ with the +x axis. If the instantaneous phase difference varies little over the entire signal, then the phase coherence is close to 1. If the instantaneous phase difference changes markedly over the time, then the coherence is close to 0, resulting from adding a set of vectors pointing in all possible directions. This phase coherence definition allows the instantaneous phase dependency to be calculated without being subjected to the effect of time lag between cause and effect (i.e., the time precedence principle), thus avoiding the constraints of time lag in predictive causality methods¹⁰.

Causal decomposition between two time series

With the decomposition of the signals by ensemble EMD and measurement of the instantaneous phase coherence between the IMFs, the most critical step in the causal-decomposition analysis is again based on Galilei’s principle: the removal of an IMF followed by redecomposition of the time series (i.e., the decomposition and redecomposition procedure). If the phase dynamic of an IMF in a target time series is influenced by the source time series, removing this IMF in the target time series (i.e., subtract an IMF from the original target time series) with redecomposition into a new set of IMFs results in the redistribution of phase dynamics into the emptied space of the corresponding IMF. Furthermore, because the causal-related IMF is removed, redistribution of the phase dynamics into the corresponding IMF would be exclusively from the intrinsic dynamics of the target time series, which is irrelevant to the dynamics of the source time series, thus reducing the instantaneous phase coherence between the paired IMFs of the source time series and redecomposed target time series. By contrast, this phenomenon does not occur when a corresponding IMF is removed from the source time series because the dynamics of that IMF are intrinsic to the source time series and removal of that IMF with redecomposition would still preserve the original phase dynamics from the other IMFs. Therefore, this decomposition and redecomposition procedure enables quantifying the differential causality between the corresponding IMFs of two time series.

Because each IMF represents a dynamic process operating at a distinct time scale, we treat the phase coherence between the paired IMFs as the coordinates in a multidimensional space, and quantify the variance-weighted Euclidean distance between the phase coherence of the paired IMFs decomposed from the original signals as well as the paired original and redecomposed IMFs, which are expressed as follows:

$$\begin{array}{l}D\left( {S_{1j} \to S_{2j}} \right) = \left\{ {\mathop {\sum}\nolimits_{j = 1}^m {W_j\left[ {{\mathrm{Coh}}\left( {S_{1j},S_{2j}} \right) - {\mathrm{Coh}}\left( {S_{1j},S_{2j}^\prime } \right)} \right]^2} } \right\}^{\frac{1}{2}}\\ D\left( {S_{2j} \to S_{1j}} \right) = \left\{ {\mathop {\sum}\nolimits_{j = 1}^m {W_j\left[ {{\mathrm{Coh}}\left( {S_{1j},S_{2j}} \right) - {\mathrm{Coh}}\left( {S_{1j}^\prime ,S_{2j}} \right)} \right]} ^2} \right\}^{\frac{1}{2}}\\ W_j = \left( {{\mathrm{Var}}_{1j} \times {\mathrm{Var}}_{2j}} \right){\mathrm{/}}\mathop {\sum}\nolimits_{j = 1}^m {\left( {{\mathrm{Var}}_{1j} \times Var_{2j}} \right)} \end{array}$$

(7)

The range of D represents the level of absolute causal strength and is between 0 and 1. The relative causal strength between IMF S_1j and S_2j can be quantified as the relative ratio of absolute cause strength $D\left( {S_{1j} \to S_{2j}} \right)$ and $D\left( {S_{2j} \to S_{2j}} \right)$, expressed as follows:

$$\begin{array}{l}C\left( {S_{1j} \to S_{2j}} \right) = D\left( {S_{1j} \to S_{2j}} \right) \Big/ \left[ {D\left( {S_{1j} \to S_{2j}} \right) + D\left( {S_{2j} \to S_{1j}} \right)} \right]\\ C\left( {S_{2j} \to S_{1j}} \right) = D\left( {S_{2j} \to S_{1j}} \right)\Big/\left[ {D\left( {S_{1j} \to S_{2j}} \right) + D\left( {S_{2j} \to S_{1j}} \right)} \right].\end{array}$$

(8)

This decomposition and redecomposition procedure is repeated for each paired IMF to obtain the relative causal strengths at each time scale, where a ratio of 0.5 indicates either that there is no causal relationship or equal causal strength in the case of reciprocal causation, and a ratio toward 1 or 0 indicates a strong differential causal influence from one time series to another. To avoid a singularity when both $D\left( {S_{1j} \to S_{2j}} \right)$ and $D\left( {S_{2j} \to S_{1j}} \right)$ approach zero (i.e., no causal change in phase coherence with the redecomposition procedure), D + 1 is used to calculate the relative causal strength when both absolute causal strength D values are <0.05.

In summary, causal decomposition comprises the following three key steps: (1) decomposition of a pair of time series A and B into two sets of IMFs (e.g., IMFs A and IMFs B) and determining the instantaneous phase coherence between each paired IMFs; (2) removing an IMF in a given time series (e.g., time series A), performing the redecomposition procedure to generate a new set of IMFs (IMF A′) and recalculating the instantaneous phase coherence between the original IMFs (IMFs B) and redecomposed IMFs (IMFs A′); and (3) determining the absolute and relative causal strength by estimating the deviation of phase coherence from the phase coherence of the original time series (IMFs A vs. IMFs B) to either of the redecomposed time series (e.g., IMFs A′ vs. IMF B).

Validation of causal strength

To validate the causal strength, a leave-one-sample-out cross-validation is performed for each causal-decomposition test. Briefly, we delete a time point for each leave-one-out test and obtain a distribution of causal strength for all runs where the total number of time points is <100, or a maximum of 100 random leave-one-out tests where the total number of time points was higher than 100. A median value of causal strength is observed.

Deterministic and stochastic model data

The deterministic model was used in accordance with Sugihara et al.⁵ based on a coupled two-species nonlinear logistic difference system, expressed as follows (initial value x(1) = 0.2, and y(1) = 0.4):

$$\begin{array}{l}x\left( {t + 1} \right) = x\left( t \right)\left[ {3 . 8 - 3 . 8x\left( t \right) - 0 . 02y\left( t \right)} \right]\\ y\left( {t + 1} \right) = y\left( t \right)\left[ {3 . 5 - 3 . 5y\left( t \right) - 0 . 1x\left( t \right)} \right]\end{array}$$

(9)

For the stochastic model, we used part of the example shown in Ding et al.¹⁰ for Granger causality, which is expressed as follows (using a random number as the initial value).

$$\begin{array}{l}x\left( {t + 1} \right) = 0 . 95\sqrt 2 x\left( t \right) - 0 . 9025x\left( {t - 1} \right) + w_1(t)\\ y\left( {t + 1} \right) = 0 . 5x\left( {t - 1} \right) + w_2(t)\end{array}$$

(10)

Ecological data and validation

We assessed the causality measures in both modelled and actual predator and prey systems. The Lotka Volterra predator–prey model^21,22 is expressed as follows:

$${\mathrm{d}}x/{\mathrm{d}}t = \alpha x - \beta xy \\ {\mathrm{d}}y/{\mathrm{d}}t = \delta xy - \gamma y$$

(11)

where x and y denote the prey and the predator, respectively (α = 1, β = 0.05, δ = 0.02, γ = 0.5 were used in this study).

Experimental data on Paramecium and Didinium are available online⁴⁵, and these were obtained by scanning the graphics in Veilleux¹⁷ and digitising the time series. Wolf and moose field data are available online at the United States Isle Royale National Park²³. The lynx and hare data were reconstructed from fur trading records obtained from Hudson’s Bay Company²⁴. The benchmark time series⁴⁶ was reconstructed from various sources in two periods (the 1844–1904 data were reconstructed from fur records, whereas the 1905–1935 data were derived from questionnaires)²⁴. We used the fur-record time series between the year 1900 and 1920 for illustrative purposes.

Comparison with other causality methods

We compared causal decomposition with CCM, Granger’s causality, and MIME method²⁰. The detail of the calculation of CCM⁵, Granger causality¹⁰, and MIME²⁰ has been documented in the literature. Of note, both the CCM and Granger causality involve the selection of lag order. In this paper, the lag order (i.e., embedding dimension) of 3 was chosen for the application of CCM method to the ecosystem data⁵, and the lag order in the Granger causality was selected by the Bayesian information Criterion. The MIME is an entropy-based causality method which also employs the time precedence principle⁴⁷ and is equivalent to Granger’s causality in certain conditions⁴⁸.

Code availability

The source code for the causal-decomposition analysis (including ensemble EMD (http://rcada.ncu.edu.tw/research1.htm)) is implemented in Matlab (Mathworks Inc., Natick, MA, USA), and the current version (causal-decomposition-analysis-v1.0) or any future versions of the codes will be available at GitHub.

Data availability

The Didinium and Paramecium data that support the findings of this study are available in http://robjhyndman.com/tsdldata/data/veilleux.dat. Wolf and moose field data are available online at the United States Isle Royale National Park. Lynx and hare data are available online at https://github.com/bblais/Systems-Modeling-Spring-2015-Notebooks/tree/master/data/Lynx%20and%20Hare%20Data.

Change history

06 September 2018
This Article was originally published without the accompanying Peer Review File. This file is now available in the HTML version of the Article; the PDF was correct from the time of publication.

References

Galilei, G. On Motion and on Mechanics (The University of Wisconsin Press, Madison, 1960).
Hume, D. A Treatise of Human Nature (Clarendon Press, Oxford, 2007).
Granger, C. Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37, 428–438 (1969).
MATH Google Scholar
Wiener, N. in Modern Mathematics for the Engineer (ed. Beckenbach, E. F.) 165–211 (McGrawHill, New York, 1956).
Sugihara, G. et al. Detecting causality in complex ecosystems. Science 338, 496–500 (2012).
Article ADS PubMed MATH CAS Google Scholar
Takens, F. Dynamical Systems and Turbulence (Springer-Verlag, Heidelberg, 1981).
Deyle, E. R. & Sugihara, G. Generalized theorems for nonlinear state space reconstruction. PLoS ONE 6, e18295 (2011).
Article ADS PubMed PubMed Central CAS Google Scholar
Aitchison, J. & Dunsmore, I. R. Statistical Prediction Analysis (Cambridge University Press, Cambridge, 1980).
Kant, I. The Critique of Pure Reason 2nd edn (The University of Adelaide Library, Adelaide, 1787).
Ding, M., Chen, Y., Bressler, S. L. in Handbook of Time Series Analysis: Recent Theoretical Developments and Applications (eds Schelter B., Winterhalder, M. & Timmer, J.) Ch. 17 (Wiley-VCH, Weinheim, 2006).
Buzsaki, G. Rhythms of the Brain (Oxford University Press, Oxford, 2006).
Chen, Y., Bressler, S. L. & Ding, M. Frequency decomposition of conditional Granger causality and application to multivariate neural field potential data. J. Neurosci. Methods 150, 228–237 (2006).
Article PubMed Google Scholar
Baccala, L. A. & Sameshima, K. Partial directed coherence: a new concept in neural structure determination. Biol. Cybern. 84, 463–474 (2001).
Article PubMed MATH CAS Google Scholar
Huang, N. E. et al. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. Math. Phys. Eng. Sci. 454, 903–995 (1998).
Article ADS MathSciNet MATH Google Scholar
Wu, Z., Huang, N. E., Long, S. R. & Peng, C. K. On the trend, detrending, and variability of nonlinear and nonstationary time series. Proc. Natl Acad. Sci. USA 104, 14889–14894 (2007).
Article ADS PubMed CAS Google Scholar
Wu, Z. H. & Huang, N. E. Ensemble empirical mode decomposition: a noise assisted data analysis method. Adv. Adapt. Data Anal. 1, 1–41 (2008).
Article Google Scholar
Veilleux, B. G. The analysis of a predatory interaction between Didinium and Paramecium. MSc thesis, Univ. of Alberta (1976).
Tass, P. et al. Detection of n:m phase locking from noisy data: application to magnetoencephalography. Phys. Rev. Lett. 81, 3291–3294 (1998).
Article ADS CAS Google Scholar
Kirchgassner G., Wolters J. & Hassler U. Introduction to Modern Time Series Analysis (Springer, Berlin, 2013).
Vlachos, I. & Kugiumtzis, D. Nonuniform state-space reconstruction and coupling detection. Phys. Rev. 82, 016207 (2010).
ADS Google Scholar
Lotka, A. J. Elements of Physical Biology (Williams & Wilkins, Baltimore, 1925).
Volterra, V. Variations and Fluctuations of the Number of Individuals in Animal Species Living Together (McGrawHill, New York, 1931).
Vucetich, J. A. & Peterson, R. O. The population biology of Isle Royale wolves and moose: an overview. Wolves & Moose of Isle Royal, http://www.isleroyalewolf.org/data/data/home.html (2012)
Stenseth, N. C., Falck, W., Bjornstad, O. N. & Krebs, C. J. Population regulation in snowshoe hare and Canadian lynx: asymmetric food web configurations between hare and lynx. Proc. Natl Acad. Sci. USA 94, 5147–5152 (1997).
Article ADS PubMed CAS Google Scholar
Tuma, N. B. & Hannan, M. T. Social Dynamics Models and Methods (Academic Press, Orlando, 1984).
Jiang, L. & Bai, L. Revisiting the Granger causality relationship between energy consumption and economic growth in China: a multi-timescale decomposition approach. Sustainability 9, 2299 (2017).
Article Google Scholar
Schiecke, K. et al. Advanced nonlinear approach to quantify directed interactions within EEG activity of children with temporal lobe epilepsy in their time course. EPJ Nonlinear Biomed. Phys. 5, 3 (2017).
Article Google Scholar
Nava, N., Di Matteo, T. & Aste, T. Dynamic correlations at different time-scales with empirical mode decomposition. Phys. A 502, 534–544 (2018).
Article Google Scholar
Geweke, J. Measurement of linear dependence and feedback between multiple time series. J. Am. Stat. Assoc. 77, 304–324 (1982).
Article MathSciNet MATH Google Scholar
Fogedby, H. C. On the phase space approach to complexity. J. Stat. Phys. 69, 411–425 (1992).
Article ADS MathSciNet MATH Google Scholar
Pikovsky, A., Rosenblum, M. & Kurths, J. Synchronization: A Universal Concept in Nonlinear Sciences (Cambridge University Press, Cambridge, 2003).
McCracken, J. M. & Weigel, R. S. Convergent cross-mapping and pairwise asymmetric inference. Phys. Rev. E 90, 062903 (2014).
Ye, H., Deyle, E. R., Gilarranz, L. J. & Sugihara, G. Distinguishing time-delayed causal interactions using convergent cross mapping. Sci. Rep. 5, 14750 (2015).
Article ADS PubMed PubMed Central CAS Google Scholar
Rehman, N. & Mandic, D. P. Multivariate empirical mode decomposition. Proc. Math. Phys. Eng. Sci. 2010, 1291–1302 (2009).
MATH Google Scholar
Zhang, Y. et al. Noise-assisted multivariate empirical mode decomposition for multichannel EMG signals. Biomed. Eng. Online 16, 107 (2017).
Article PubMed PubMed Central Google Scholar
Shimizu, A., Hyvarinen, S., Kano, Y. & Hoyer, P. O. Discovery of non-gaussian linear causal models using ICA In Proc. of the Twenty-First Conference on Uncertainty in Artificial Intelligence (eds Dechter, R. & Richardson, T. S.) 525–533 (AUAI Press, Corvallis, 2005).
Lo, M. T., Novak, V., Peng, C. K., Liu, Y. & Hu, K. Nonlinear phase interaction between nonstationary signals: a comparison study of methods based on Hilbert–Huang and Fourier transforms. Phys. Rev. 79, 061924 (2009).
Article MathSciNet CAS Google Scholar
Cummings, D. A. et al. Travelling waves in the occurrence of dengue haemorrhagic fever in Thailand. Nature 427, 344–347 (2004).
Article ADS PubMed CAS Google Scholar
Novak, V. et al. Multimodal pressure-flow method to assess dynamics of cerebral autoregulation in stroke and hypertension. Biomed. Eng. Online 3, 39 (2004).
Article PubMed PubMed Central Google Scholar
Hu, K., Lo, M. T., Peng, C. K., Liu, Y. & Novak, V. A nonlinear dynamic approach reveals a long-term stroke effect on cerebral blood flow regulation at multiple time scales. PLoS Comput. Biol. 8, e1002601 (2012).
Article MathSciNet PubMed PubMed Central CAS Google Scholar
Sweeney-Reed, C. M. & Nasuto, S. J. A novel approach to the detection of synchronisation in EEG based on empirical mode decomposition. J. Comput. Neurosci. 23, 79–111 (2007).
Article PubMed CAS Google Scholar
Cho, D., Min, B., Kim, J. & Lee, B. EEG-Based prediction of epileptic seizures using phase synchronization elicited from noise-assisted multivariate empirical mode decomposition. IEEE Trans. Neural Syst. Rehabil. Eng. 25, 1309–1318 (2017).
Article PubMed Google Scholar
Wu, Z. & Huang, N. E. A study of the characteristics of white noise using the empirical mode decomposition method. Proc. Roy. Soc. Lond. A 460, 1597–1611 (2004).
Article ADS MATH Google Scholar
Yu, L., Li, J., Tang, L. & Wang, S. Linear and nonlinear Granger causality investigation between carbon market and crude oil market: a multi-scale approach. Energy Econ. 51, 300–311 (2015).
Article Google Scholar
Hyndman, R. J. Time Series Data Library. http://robjhyndman.com/tsdldata/data/veilleux.dat (2012)
MacLulich, D. A. Fluctuations in the Number of the Varying Hare (Lepus americanus) (Univ. of Toronto Press, Toronto, 1937).
Schreiber, T. Measuring information transfer. Phys. Rev. Lett. 85, 461–464 (2000).
Article ADS PubMed CAS Google Scholar
Amblard, P. & Michel, O. J. J. The relation between Granger causality and directed information theory: a review. Entropy 15, 113–143 (2014).
Article ADS MathSciNet MATH Google Scholar

Download references

Acknowledgements

This work was supported by the Ministry of Science and Technology (MOST) of Taiwan (Grant MOST 101-2314-B-075-041-MY3; 104-2314-B-075-078-MY2). We thank Dr. Shih-Jen Tsai, Dr. Shuu-Jiun Wang, Dr. Susan Shur-Fen Gau, Dr. Ching-Po Lin, Dr. Chang-Wei Wu, and Dr. Zhaohua Wu for valuable discussions.

Author information

Authors and Affiliations

Division of Interdisciplinary Medicine and Biotechnology, Beth Israel Deaconess Medical Center/Harvard Medical School, Boston, MA, 02215, USA
Albert C. Yang & Chung-Kang Peng
Institute of Brain Science, National Yang-Ming University, 11221, Taipei, Taiwan
Albert C. Yang
Department of Psychiatry, Taipei Veterans General Hospital, 11217, Taipei, Taiwan
Albert C. Yang
Center for Dynamical Biomarkers and Translational Medicine, National Central University, 32001, Chungli, Taiwan
Norden E. Huang
Key Laboratory of Data Analysis and Applications, First Institute of Oceanography, SOA, 266061, Qingdao, China
Norden E. Huang

Authors

Albert C. Yang
View author publications
You can also search for this author in PubMed Google Scholar
Chung-Kang Peng
View author publications
You can also search for this author in PubMed Google Scholar
Norden E. Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.C.Y. developed the causal-decomposition method, performed the computational analysis, and wrote the manuscript. C.K.P and N.E.H. gave critical comments and contributed to manuscript writing. All authors discussed the results and approved the manuscript.

Corresponding author

Correspondence to Albert C. Yang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yang, A.C., Peng, CK. & Huang, N.E. Causal decomposition in the mutual causation system. Nat Commun 9, 3378 (2018). https://doi.org/10.1038/s41467-018-05845-7

Download citation

Received: 12 January 2018
Accepted: 20 July 2018
Published: 23 August 2018
DOI: https://doi.org/10.1038/s41467-018-05845-7

This article is cited by

Big Data in Earth system science and progress towards a digital twin
- Xin Li
- Min Feng
- Huadong Guo
Nature Reviews Earth & Environment (2023)
Detection of intermuscular coordination based on the causality of empirical mode decomposition
- Carlos Cruz-Montecinos
- Xavier García-Massó
- Claudio Tapia
Medical & Biological Engineering & Computing (2023)
Understanding the dynamic of government expenditures for disability and other social benefits: evidence from a Lotka–Volterra model for the Netherlands
- Chiara Natalie Focacci
- Peter Mascini
- Romke van der Veen
Quality & Quantity (2023)
Comments on identifying causal relationships in nonlinear dynamical systems via empirical mode decomposition
- Chun-Wei Chang
- Stephan B. Munch
- Chih-hao Hsieh
Nature Communications (2022)
Reply To: Comments on identifying causal relationships in nonlinear dynamical systems via empirical mode decomposition
- Albert C. Yang
- Chung-Kang Peng
- Norden E. Huang
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.