Detecting Causality from Nonlinear Dynamics with Short-term Time Series

Ma, Huanfei; Aihara, Kazuyuki; Chen, Luonan

doi:10.1038/srep07464

Download PDF

Article
Open access
Published: 12 December 2014

Detecting Causality from Nonlinear Dynamics with Short-term Time Series

Huanfei Ma^1,2,
Kazuyuki Aihara² &
Luonan Chen^2,3

Scientific Reports volume 4, Article number: 7464 (2014) Cite this article

13k Accesses
60 Citations
15 Altmetric
Metrics details

Subjects

Abstract

Quantifying causality between variables from observed time series data is of great importance in various disciplines but also a challenging task, especially when the observed data are short. Unlike the conventional methods, we find it possible to detect causality only with very short time series data, based on embedding theory of an attractor for nonlinear dynamics. Specifically, we first show that measuring the smoothness of a cross map between two observed variables can be used to detect a causal relation. Then, we provide a very effective algorithm to computationally evaluate the smoothness of the cross map, or “Cross Map Smoothness” (CMS) and thus to infer the causality, which can achieve high accuracy even with very short time series data. Analysis of both mathematical models from various benchmarks and real data from biological systems validates our method.

Song lyrics have become simpler and more repetitive over the last five decades

Article Open access 28 March 2024

Spike sorting with Kilosort4

Article Open access 08 April 2024

Principal component analysis

Article 22 December 2022

Introduction

Detecting causal relationship between variables, especially from observed time series, has attracted great attentions from multiple-disciplines. Though there still has been no universally accepted definition of causality, various measures of causality have been reported and extensively studied. Among a variety of methods based on linear regression, Granger causality¹ is undoubtedly a widely accepted definition and method to detect causal relationship between different factors. However, the Granger method mainly focuses on linear models and needs a key condition of separability, namely, assumes that the driven information from causative factors can be removed from the effects². In fact, it has been noted that the causality in Granger sense may be not suitable to detect directional coupling between nonlinear systems³, especially deterministic dynamical systems with weak or moderate couplings. To quantify causal relationship between the intertwined variables of nonlinear systems, methods developed from transfer entropy⁴, conditional mutual information⁵, recurrence plots^6,7 and nonlinear extension of Granger causality^8,9 have been proposed and extensively studied. Moreover, various kinds of mutual nonlinear cross map methods based on state space reconstruction (SSR) technique have been also studied both theoretically^10,11,12,13 and numerically¹⁴ for a long time. In particular, recently the mutual cross map method has been successfully applied to solve complex relationship in ecological systems^2,15.

Though these methods have been demonstrated to correctly identify causal relations for many systems, they all require sufficiently long time series to achieve a reasonable result. This stems from the fact that training regression models, calculating correlations, determining transition probabilities and finding nearest neighbors all need a sufficiently large training set or a large number of samples.

However, in practical situations, the measured time series are always limited rather than sufficiently long and sometimes are even rather short, e.g., the high throughput microarray or RNA-seq data for gene expressions of a biological process are typically measured less than 20 time points due to both experimental and economical constraints¹⁶. Though various methods based on Bayesian inference, regression analysis, econometrics models and standard similarity measures have been used to analyze such short time series data^17,18,19, inferring genetic networks from short data is still regarded as an ‘ill-posed’ inverse problem and a challenging task^20,21. On the other hand, in some occasions even though long-term data can be measured, only short (recent) pieces can correctly reflect the causal relation between subsystems due to the nonstationary and fast switching property of the concerned systems. Therefore it is in urgent need of developing new methods to detect causality based on short-term data or a small number of samples.

In contrast to the traditional knowledge that short-term data cannot provide enough information to infer the causal relation, here we show that we can detect causality from very short time series in an accurate manner by exploiting global information of data. Specifically, we propose a measurement “Cross Map Smoothness” (CMS) based on the embedding theory of attractors^22,23 in this paper, which can not only detect causal relationship but also derive a cross map between any two observed variables even with short-term time series data and then we provide an efficient algorithm to construct such a map for inferring the causal relation. The key idea behind our method is that measuring smoothness of a cross map between two observed variables implies causal relations, which can be computationally achieved even with short time series data, comparing with the traditional methods, e.g., the nearest neighbor method. Analysis of mathematical models from various benchmarks validates our results and real data from biological systems confirms the method can be used to infer genetic networks from short data.

Methods

To begin, we revisit the mutual cross map method based on state space reconstruction. Consider two scalar time series x(t) and y(t) measured from two variables x and y in an unknown nonlinear dynamical system. With appropriately chosen embedding dimension L and proper delay τ^24,25, one can obtain time delayed coordinate vectors x(t) = [x(t), x(t − τ), …, x(t − (L − 1)τ)]^T and y(t) = [y(t), y(t − τ), …, y(t − (L − 1)τ)]^T respectively. According to delayed embedding theory²², the set of vectors x(t) forms the reconstructed attractor M_x and one can define M_y in an analogous way. For each point y(t₀) on M_y, one can find its k nearest neighbors y(t_y₁), y(t_y₂), …, y(t_yk) with time indices t_y₁, t_y₂, …, t_yk. Moreover, one can define the mutual neighbors for x(t₀) ∈ M_x as x(t_y₁), x(t_y₂), …, x(t_yk) and the map from nearest neighbors to mutual neighbors is defined as cross map Φ_yx: M_y → M_x. In the case that x is causally influencing y, or x is a driving factor of y (i.e., x → y), the information of x is included in the dynamics of y and thus two close states on M_y correspond to two close states on M_x, or explicitly, the mutual neighbors of x(t₀) are also in the neighborhood of x(t₀). Inversely, in the case that y has no influence over x, the dynamics of x is insensitive to the state of y and the mutual neighbors of y(t₀) are not necessarily to be close to y(t₀), as illustrated in Figs. 1(a) and (b). Therefore, the geometry property of mutual neighbors can be used to detect causality^2,10,11. The details of mutual neighbors and cross map are revisited in Supplementary Information.

Here, it should be stressed that one requirement is essential, that is, the nearest neighbors for both x and y are required to be sufficiently close to the true neighborhood so that the local geometric information can be correctly measured. If this requirement is not fulfilled, contradictory results may be derived, e.g., the studies of refs. 10 and 11 used different assumptions for mutual predictions which both gave the same result. In fact, due to the computational way of state space reconstruction using delayed embedding technique, sufficiently long time series are required to guarantee that the nearest neighbors on the reconstructed attractor converge to the true neighborhood. Figure 2 shows the relationship between nearest neighbors and the time series length, where the nearest neighbors found on the attractor reconstructed from short time series (Fig. 2(b)) are apparently far away from the true neighborhood of the underlying center point (Fig. 2(a)). Detailed discussions and explanations on the necessity of convergence of nearest neighbors can be further referred to ref. 2. Thus, detecting causality based on nearest neighbors and mutual neighbors essentially requires sufficiently long time series data to make reliable causality detection.

Here, we notice that the key idea behind the method of finding mutual neighbors is actually measuring the smoothness of the map. Specifically, if x causally influences y, the nearest neighbors of y(t₀) are mapped to close states of x(t₀), i.e., the cross map Φ_yx: M_y → M_x maps the neighborhood of y(t₀) to the neighborhood of x(t₀), which actually implies that Φ_yx is locally smooth around y(t₀), as shown in Fig. 1(a). On the reverse direction, if y has no influence over x, the image of x(t₀)'s neighborhood under the cross map Φ_xy: M_x → M_y is not necessarily the neighborhood of y(t₀), thus the cross map Φ_xy is not necessarily smooth around x(t₀), as shown in Fig. 1(b). If the cross map Φ is locally smooth in the neighborhood of every point on the attractor, then the map is globally smooth on the whole attractor and vice versa. Thus, the global smoothness of Φ_yx and Φ_xy can be built from local properties, as illustrated in Figs. 1(c) and (d). Moreover, when the coupling strength increases, information becomes more distinct in the causally influenced variables. As a result, their attractors will contain stronger historical information from the causes. Thus, within one system, the relative smoothness can indicate the relative strength of causative effectiveness.

Therefore, finding mutual nearest neighbors is equivalent to measuring the smoothness of the cross map Φ, i.e., the smoothness of Φ_yx indicates the strength of causative effectiveness from x to y. While mutual neighbors only use the local information around one point, we propose a new framework, i.e., Cross Map Smoothness (CMS), to measure the smoothness of Φ using global information and consequently we can detect causality even from short time series in an accurate manner. In other words, instead of finding nearest neighbors which requires a large number of samples, we computationally evaluate the smoothness of the cross map by designing an efficient algorithm for the global attractor.

Our fundamental idea is based on the fact that any smooth map can be approximated by a neural network ²⁶ while training a neural network to approximate an unsmooth map will fail with large training errors, as illustrated in Figs. 3(a)–(d). Furthermore, the training errors reflects the relative smoothness and thus can be a measure of the relative strength of causative effectiveness. Therefore we can train the neural network to approximate the map Φ_yx, using the whole set of data y(t) on M_y as input and the whole set of data x(t) on M_x as output. Thus the training error (i.e., the measurement of the relative smoothness) between Φ and indicates the strength of the causative influence from x to y. The sketch of the Cross Map Smoothness (CMS) with the neural network (NN) method is illustrated in Figs. 3(e) and (f).

Thus, we propose the Cross Map Smoothness(CMS) algorithm using a Radial Basis Function (RBF) network to detect causality between two variables x and y. The details of the algorithm is listed in Supplementary Information Section 2. Here we adopt a leave-one-out strategy to fully use the short time series, i.e., we train one RBF network based on each leave-one-out data set and make prediction on the one test point. Finally, we compute the causality index R_xy based on the the normalized training error, which measures the causative effective strength from x to y.

Results

To validate the method, several representative examples are considered as benchmarks.

Theoretical model validation

Let us begin with several representative causality patterns which can be used as motifs in many complex situations. We first consider two coupled variables with both unidirectional and bidirectional couplings in the following form,

where r_x = 3.7 and r_y = 3.8 are two coefficients. Here γ_xy and γ_yx are two coupling parameters and indicate the strength of causative effectiveness. In the first case, we set γ_xy = 0 and γ_yx = 0.32, which implies that X and Y have a driving-response relation, namely there is unidirectional causality from X to Y, while the inverse is not true. We use time series with length of 20 time points and apply the CMS method to the data set. The causality detection result obtained by CMS Algorithm is shown in Fig. 4(a), where nonzero R_xy and zero R_yx clearly fit with the unidirectional causality pattern. Then we set γ_yx = 0.1 and γ_xy = 0.02 which makes it a mutually coupled system and thus there is mutual causative effectiveness between X and Y. With the same setting in the first case, we detect the causality between X and Y using CMS, as shown in Fig. 4(b). The detected result with R_xy = 0.69 and R_yx = 0.32 not only shows the bidirectional causality between X and Y but also indicates that the relative strength of the causative effectiveness from X to Y is stronger than the inverse direction.

Here it is stressed that within one system when all the other conditions are the same, as the coupling strength increases, information becomes more distinct in the causally influenced variables and consequently larger causality indices will be detected. However, this strength of causative effectiveness is relative but not absolute, i.e., the relation is not always monotonous between the coupling parameter values and the coupling strength. Therefore, the detected index reflects the relative strength of the causative effectiveness between different pairs of variables within one system. To this end, we consider varying coupling strength values. In the unidirectional case, we fix γ_xy = 0 and vary the values of γ_yx in the range [0, 0.32]. For each value of γ_yx, we generate data and use the CMS algorithm to detect causal relations with the same setting. The result is shown in Fig. 5(a) where the detected R_yx is always zero while R_xy firstly jumps from zero to nonzero, then shows an ascending trend as the coupling strength increases. In the bidirectional case, we consider the following varying form: γ_yx = α, γ_xy = 0.2 − α where α is a varying factor in the range [0, 0.2] measuring the coupling strength in both directions. With the same setting as in the unidirectional case, the detected result is shown in Fig. 5(b) where zero causality indices reflect zero couplings and the detected causality indices R_xy and R_yx show ascending or descending trend as the coupling strength varies in the same way. Moreover, for small α, it clearly shows R_yx > R_xy which coincides with the fact of relative stronger causative effectiveness from X to Y and vice versa.

Then we consider a more complicated system involving three variables as follows:

where γ_ij are coupling parameters. With particular settings of the coupling parameters, shown in Supplementary Information, the causal relations between the three variables can show fan-out or fan-in patterns, as shown in Figs. 4(c) and (d). For the fan-out case in Fig. 4(c), there are two unidirectional couplings from Y₁ to Y₂ and Y₃, while Y₂ and Y₃ have no direct relationship with each other. Since Y₂ and Y₃ are both driven by the common source from Y₁, the dynamics of Y₂ and Y₃ both contain the information from Y₁. Thus the time series Y₂(t) and Y₃(t) are correlated but have no causality between them, which is a difficult situation for causality detection⁷. Here we apply the CMS method to the time series with length of 20 points and get the mutual relationship between Y₁, Y₂ and Y₃, which is shown in Fig. 4(c), where only the detected causality over significance threshold is shown. The result in Fig. 4(c) indicates that we can detect causality from Y₁ to Y₂ and Y₃ but not vice versa. Furthermore, there is no causal relationship detected between Y₂ and Y₃, which confirms that our method is effective for common source causality pattern even with short-term data. As for the fan-in case in Fig. 4(d), there are two unidirectional couplings to Y₃, i.e., Y₃ is driven by both Y₁ and Y₂ simultaneously. With the same setting as the fan-out case, we use CMS to detect the mutual relationship between the three variables, as shown in Fig. 4(d). The result illustrates that we can correctly detect the fan-in causality pattern. Here we stress that the strength of the detected nonzero causality in Fig. 4(d) is much weaker than the previous cases. Actually, a fan-in motif is generally considered as a difficult pattern to infer¹⁷, this is mainly due to the fact that the dynamics of Y₃ are affected by both Y₁ and Y₂ at the same time, which weakens the effect of each single driving force.

The above cases validate that our method can be effective for discrete-time dynamical systems. To test our method with continuous-time systems, we consider the Lorenz system driven by a chaotic signal from the Rössler system, which was used as a benchmark in ref. 11. We use the standard parameter values with which the coupled system has chaotic dynamics. We assume that 50 time points with an even measurement interval are observed from the systems and use CMS to detect the causality between the two systems. The detected causality indices are R_LR = 0 and R_RL = 0.27, which clearly shows the unidirectional causal relation from the Rössler system to the Lorenz system.

Here we note that though we use the normalized error in CMS Algorithm, the final causality index R_YX for non-causal situation does not reach exactly zero. Therefore in order to decide whether the causality relation exists, we set a threshold value ξ based on the significance test^18,27,28,29. Our statistical analysis is based on the permutation test: we run 1000 independent permutations uniformly at random, shuffle the time points according to the permutations and run CMS on the shuffled data. With the empirical distribution, we estimate the threshold as ξ = 0.001 at a significance level p < 0.05, i.e., we treat a causality index below 0.001 as zero. The details of the significant test is shown in Supplementary Information.

The above illustrations show that our method is effective even for the situations where only short-term data can be obtained. On the other hand, in many occasions, owing to strong nonstationary and irregular behavior of many real-world systems, the causal relationship between system variables may switch quickly and thus even if a long time series is available the causality detected based on the long-term time series is meaningless. To consider such problems, we assume that the coupling parameters in system (1) are no longer constant but switching at random intervals between two sets of values so that the causal relationship between the two variables change from time to time, as shown in Fig. 6. We assume that a time series of 1000 time points are measured for the system and it is obvious that using the whole time series to calculate one constant causality between X and Y will yield a false result. Therefore we use a time window of 20 time points moving along the whole time series and use the CMS method to detect causality from every short piece of data during one time window. Figure 6 shows the detected result, where the dashed square waves represent the random switching of the coupling parameters between zero and nonzero values and the solid lines represent the detected strengths of causative effectiveness over each time window. The switching value of γ_xy decides whether or not there is causative effectiveness from Y to X and it is clear that the detected causality from Y to X coincides with the square wave of γ_xy quite well in Fig. 6(a). The similar result can be observed for the causality from X to Y and the switching values of γ_yx in Fig. 6(b), which confirms the effectiveness of our method for such causality-varying situations.

Unraveling gene regulatory networks

How to infer gene regulatory interactions from transcriptomics time-resolved data and further unravel the gene regulatory network (GRN) is of paramount importance to gain a deeper insight into the complexity and functions of the underlying biological systems. Due to the limit of experiment technique and other constraints, usually only very short-term and often noisy timeresolved measurements can be available in gene expressions. Though various methods based on Bayesian inference, regression analysis, econometrics models and standard similarity measures have been used to analyze such short time series data^17,18,19, inferring genetic networks from short data is still regarded as an ‘ill-posed’ inverse problem and a challenging task^20,21.

Here, we note that in a gene regulatory network, the regulation mechanism obeys some biochemistry rules and thus the regulatory dynamics can be described by standard kinetics models, such as Michaelis-Menten and Hill kinetics^30,31. Therefore, the regulatory interactions can be measured by causal relationship in nonlinear dynamical systems and the proposed CMS method can be particularly suitable for such a task, i.e., reverse engineering GRN from short-term data.

Since in the real time-resolved expression data, e.g., microarray chip data, not every regulatory subnetwork contains information of all the participating genes, particularly over a specific time period and a specific condition of interest. These facts render it challenging to give a comprehensive evaluation of network inference with real time-resolved expression data. On the other hand, it is widely accepted these years to evaluate inference methods using standard synthetically generated data sets^17,18,19. Therefore, before applying our method with real data, we give a comprehensive validation with synthetically generated data sets of the bacterium E. coli³⁰, as described in ref. 32.

Specifically, we consider a subsystem consisting of 50 genes picked out randomly from the whole network, whose regulatory relation is shown in Fig. 7(a). The regulation network of the selected subsystem consists of several clusters, as illustrated in Fig. 7(a), which can well approximate the statistical properties of the whole network³⁰. Here each node's dynamics is governed by Michaelis-Menten or Hill kinetics, so that the simulated gene expression time series are similar to real microarray measurements. We assume that only 10 time points are available for each gene expression time series to simulate the real experimental measurements and use CMS to detect causality between each pair of genes. In order to evaluate the inference efficiency of CMS to detect GRN structure, we use the resulting receiver operating characteristics (ROC) curves, which plot the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings. Actually, ROC curves show the relative trade-offs between benefits of correctly inferred links (TPR) and the drawbacks of incorrectly inferred links (FPR) with different values of threshold to identify a link. The ROC curve for noise-free data is shown in Fig. 7(b), from which it can be concluded that the ROC curve is very close to the perfect classification; i.e., when FPR is controlled to be less than 2.5%, TPR can reach more than 90% and the area under curve (AUC) is 0.98 which is very close to be one. Therefore, the CMS method can achieve a very good result for noise-free data. Meanwhile, generally noise-free expression data cannot be obtained in real experiment and thus we need to further test the robustness of the method to include various kinds of noise in the time series. Here we consider three kinds of noise simultaneously, namely, the biological noise, the experimental noise and noise on correlated inputs with three different noise intensities, namely 0.1, 0.2 and 0.3. Figure 7(b) shows the three ROC curves for data with noisy perturbations, where we can conclude that though the ROC curves for noisy data are lower than the ROC curve for noise-free data, the accuracy is still very high; especially all the three ROC curves have AUC values around 0.9.

Moreover, to consider the influences of network properties such as size and degree, we further test two additional data sets for S.cerevisiae with 100 and 150 genes respectively. The selected subsystems are shown in Figs. 8(a) and (b) and based on 10 output of the gene expressions, we apply CMS to infer the network interactions respectively. The ROC curves for both non-noise and noise disturbed cases are given in Figs. 8(c) and (d). Here, it is noticed that there are two central genes which regulate other downstream genes at the same time for the selected subnetworks of S.cerevisiae, which makes fan-in and fan-out motifs abundant. As discussed in Figs. 4(c) and (d), the fan-in motif may lead the causality less significant to be detected, making it more difficult to detect the true network.

Next, we test our method with real gene expression data. Here we consider the data of the laboratory rat (Rattus norvegicus) cultured cells sampled from suprachiasmatic nucleus (SCN) for studying circadian rhythm, where the gene expression profiles are measured with Affimetrix microarray (Genechip Rat Genome 230 2.0)^33,34,35. To elucidate the gene regulation network architecture, we select the data set consisting of 16 measured time points after the drug perturbation in the 19th hour. For the mammalian circadian clocks, it has been identified that there are around 17 genes involved in the core regulation network, where the transcriptional circuits are formed by regulation of E/E′ boxes, DBP/E4BP4 binding elements and RevErbA/ROR binding elements respectively^36,37. Moreover, besides the gene-level interactions, there are also regulation interactions at the protein level; e.g., the transcription factor Clock is phosphorylated by PFK family genes and the crytochrome genes Cry1 and Cry2 are phosphorylated by MAPK family genes³³. Therefore, we consider the 17 core circadian genes as well as 18 kinase genes, whose relations are depicted in Fig. 9(a). With 16 time points measured for each gene's expression, we apply the CMS method to detect the regulation relation between all the selected 35 genes. As a comparison, we also apply IOTA, partial IOTA¹⁸ and CCM² to the same data set, where IOTA is a newly proposed permutation-based asymmetric association measure to detect regulatory links from very short time series and CCM is a mutual cross map-based method. Based on the core regulation network in ref. 36, we carry out the ROC analysis for the regulation detection and the results are shown in Fig. 9(b).

Here, we stress that inference of GRN based on only one single short-term data set is a challenging task due to the extremely short measurements. The existing methods for GRN inference can usually reach an AUC around 0.7 for synthetical data but only around 0.5 for real experimental data²¹. The CCM method, which relies on finding nearest neighbors and thus requires long-term data for the convergence, has AUC around 0.5 in Fig. 9(b). Particularly, the core transcriptional circuits of mammalian circadian clocks consist of complexly integrated regulatory loops involving three kinds of middle elements: E/E′ boxes, DBP/E4BP4 binding elements and RevErbA/ROR binding elements. Therefore in the circadian data set we used here, there are many fan-in motifs and the relation between two interacting subsystems may no longer be monotonic and thus the IOTA method and the partial IOTA method may lead to a false result, as shown in Fig. 9(b) where the AUC of the IOTA method is less than 0.5.

As shown for the both synthetical GRN data and the real GRN data, our CMS method which is particularly effective for short-term data can achieve a very good result in a much accurate way in ROC analysis.

Discussion

In this section, we discuss several issues related to CMS' application in real situations. First a question arises naturally: how short can be the time series so that CMS can be effective? Or explicitly, what is the lower bound of the length of time series that CMS requires to guarantee a reliable result. Intuitively, the longer the time series can be observed, the more information the data can provide. Therefore just as pointed out in ref. 2, one should consider the convergence of the causality index over different lengths of time series and use the limit value as the truly detected causality index. Here we use system (1) with unidirectional causality setting as a benchmark to test the lower bound for the CMS method. The result is shown in Fig. 10(a), which indicates that with around 20 time points, it is enough for CMS to distinguish zero and nonzero causality. Meanwhile, as a comparison, we also test the CCM² method for the same data set. Since the CCM uses the convergence of nearest neighbors to detect causality, it needs much longer data length to converge. As shown in Fig. 10(b) as well as the inset there, CCM cannot distinguish zero and nonzero causality with short length of data and with as long as 2000 time points, CCM can give a trend of convergence for nonzero causality though still no convergence for zero causality is achieved. Therefore, we conclude that CMS method can be effective for short-term data with length n ~ O(10) while the existing neighborhood-based method requires data with length n ~ O(10³) to reach a reasonable result.

Then we compare our method with some representative existing methods for causality detection. Here we choose two kinds of methods for comparison, namely, the mutual cross map based on nearest neighbors and the composition alignment method. For the former, we use the newly proposed CCM² and for the latter, we use IOTA which is purposely designed for inferring gene networks from short time series¹⁸. We test all the numerical results in our paper with the same condition for the two methods and the comparison results for the theoretical models are shown in Fig. 11. Moreover, for the gene networks, we consider several criteria to compare the methods, i.e., we consider the area under the ROC curve (AUC(ROC)), the Youden index (YOUDEN = max(the true positive rate - the false positive rate)) and the area under the Precision/Recall curve (AUC(PvsR)) which is based on the comparison between the true edges and the inferred ones. The results for these ROC analysis are shown in Table 1. Generally, it is suggested that a method has an excellent performance if conditions AUC(ROC) > 0.8; YOUDEN > 0.5 and AUC(PvsR) > 0.05 are satisfied simultaneously¹⁹. We highlighted the scores with the excellent performances in Table 1. Clearly, we see that CMS performs well in all the three cases. Since CCM needs long term data for convergence, the accuracy of results by CCM based on short term data here is poor. As for IOTA, it is specifically designed for gene network inference and one crucial point of the IOTA approach lies on the assumption that two interacting genes have monotonic relationship. Therefore, for a general nonlinear dynamical system or gene expression which does not obey the monotonic assumption, the IOTA method may fail.

Table 1 Comparison results for three method on four GRN cases, where three criteria are listed and the scores with the excellent performances are highlighted

Full size table

The above comparison results also imply that R_xy designed in CMS Algorithm can indicate the relative probability or the strength of causality taking values between 0 and 1. Therefore, we can use the CMS index R to detect whether there is causality and how strong the causal relation is between two variables, as shown in the gray scale used in Fig. 4 and Fig. 11. Note that though the CMS method uses the prediction error to measure causality, it is different from the measurement of the Granger method which uses a series y(t) to predict y(t) by constructing x(t) → y(t) correlation; our method uses a series y(t) to predict x(t) by constructing a y(t) → x(t) map, for inferring the causality from x(t) to y(t). It is also noted that the prediction error based methods cannot detect autoregulation, i.e., the causal effect from one variable to itself.

In conclusion, based on the state space reconstruction theory for nonlinear dynamics, we have developed a new method of CMS to detect causality between variables, even with short observed time series. The key idea of our method is to detect causative effectiveness by measuring the smoothness of the cross map between two observed variables rather than finding the nearest neighbors, thereby avoiding the requirement of long-term time series data. The method is validated with both theoretical benchmark models and real-world data from gene networks. Our method is particularly effective in situations where only short-term data are available, such as high throughput biological data. In this paper we adopted a neural network model to train a smooth map and other methods constructing a smooth map can be also used in a similar way. As a future topic, we will consider to extend this method further to detect the causal relations of the measured variables just before the critical transitions^38,39,40 and high dimensional measured variables of nonlinear dynamics⁴¹.

References

Granger, C. W. J. Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37, 424–438 (1969).
Article Google Scholar
Sugihara, G. et al. Detecting causality in complex ecosystems. Science 338, 496–500 (2012).
Article CAS ADS PubMed Google Scholar
Haufe, S., Nikulin, V. V., Mller, K.-R. & Nolte, G. A critical assessment of connectivity measures for EEG data: A simulation study. NeuroImage 64, 120–133 (2013).
Article PubMed Google Scholar
Schreiber, T. Measuring information transfer. Phys. Rev. Lett. 85, 461 (2000).
Article CAS ADS PubMed Google Scholar
Paluš, M., Komárek, V., Hrnčíř, Z. & Štěrbová, K. Synchronization as adjustment of information rates: detection from bivariate time series. Phys. Rev. E 63, 046211 (2001).
Article ADS Google Scholar
Feldhoff, J. H., Donner, R. V., Donges, J. F., Marwan, N. & Kurths, J. Geometric detection of coupling directions by means of inter-system recurrence networks. Phys. Lett. A 376, 3504–3513 (2012).
Article CAS ADS Google Scholar
Hirata, Y. & Aihara, K. Identifying hidden common causes from bivariate time series: a method using recurrence plots. Phys. Rev. E 81, 016203 (2010).
Article ADS Google Scholar
Chen, Y., Rangarajan, G., Feng, J. & Ding, M. Analyzing multiple nonlinear time series with extended Granger causality. Phys. Lett. A 324, 26–35 (2004).
Article CAS ADS MathSciNet Google Scholar
Ancona, N., Marinazzo, D. & Stramaglia, S. Radial basis function approach to nonlinear Granger causality of time series. Phys. Rev. E 70, 056221 (2004).
Article ADS Google Scholar
Schiff, S. J., So, P., Chang, T., Burke, R. E. & Sauer, T. Detecting dynamical interdependence and generalized synchrony through mutual prediction in a neural ensemble. Phys. Rev. E 54, 6708 (1996).
Article CAS ADS Google Scholar
Quyen, M. L. V., Martinerie, J., Adam, C. & Varela, F. J. Nonlinear analyses of interictal EEG map the brain interdependences in human focal epilepsy. Physica D 127, 250–266 (1999).
Article ADS Google Scholar
Hlaváčková-Schindler, K., Paluš, M., Vejmelka, M. & Bhattacharya, J. Causality detection based on information-theoretic approaches in time series analysis. Phys. Rep. 441, 1–46 (2007).
Article ADS Google Scholar
Chicharro, D. & Andrzejak, R. G. Reliable detection of directional couplings using rank statistics. Phys. Rev. E 80, 026217 (2009).
Article ADS Google Scholar
Arnhold, J., Grassberger, P., Lehnertz, K. & Elger, C. A robust method for detecting interdependences: application to intracranially recorded EEG. Physica D 134, 419–430 (1999).
Article ADS Google Scholar
Deyle, E. R. et al. Predicting climate effects on Pacific sardine. Proc. Nat. Acad. Sci. USA 110, 6430–6435 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Wang, Y., Joshi, T., Zhang, X. & Chen, L. Inferring gene regulatory networks from multiple microarray datasets. Bioinformatics 22, 2413–2420 (2006).
Article CAS PubMed Google Scholar
Marbach, D. et al. Revealing strengths and weaknesses of methods for gene network inference. Proc. Nat. Acad. Sci. USA 107, 6286–6291 (2010).
Article CAS ADS PubMed PubMed Central Google Scholar
Hempel, S., Koseska, A., Kurths, J. & Nikoloski, Z. Inner composition alignment for inferring directed networks from short time series. Phys. Rev. Lett. 107, 054101 (2011).
Article CAS ADS PubMed Google Scholar
Hempel, S., Koseska, A., Nikoloski, Z. & Kurths, J. Unraveling gene regulatory networks from time-resolved gene expression data–a measures comparison study. BMC bioinformatics 12, 292 (2011).
Article PubMed PubMed Central Google Scholar
Wang, X., Wu, M., Li, Z. & Chan, C. Short time-series microarray analysis: Methods and challenges. BMC Systems Biology 2, 58 (2008).
Article PubMed PubMed Central Google Scholar
Wang, M. et al. LegumeGRN: A Gene Regulatory Network Prediction Server for Functional and Comparative Studies. PloS one 8, e67434 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Takens, F. Dynamical Systems and Turbulence Rand, D. A. & Young, L. S. (ed.) 366–381 (Springer-Verlag, New York, 1981).
Sauer, T., Yorke, J. A. & Casdagli, M. Embedology. Journal of statistical Physics 65, 579–616 (1991).
Article ADS MathSciNet Google Scholar
Kantz, H. & Schreiber, T. Nonlinear time series analysis (Cambridge University press, Cambridge, 2004).
Kennel, M. B. et al. Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys. Rev. A 45, 3403–3411 (1992).
Article CAS ADS PubMed Google Scholar
Park, J. & Sandberg, I. W. Universal approximation using radial-basis-function networks. Neural Comput. 3, 246–257 (1991).
Article CAS PubMed Google Scholar
Nichols, T. E. & Holmes, A. P. Nonparametric permutation tests for functional neuroimaging: A primer with examples. Hum. Brain Mapp. 15, 1–25 (2002).
Article PubMed Google Scholar
Luo, Q., Ge, T. & Feng, J. Granger causality with signal-dependent noise. Neuroimage 57, 1422–1429 (2011).
Article PubMed Google Scholar
Guo, S. et al. Uncovering interactions in the frequency domain. Plos. Comput. Bio. 4, e1000087 (2008).
Article MathSciNet Google Scholar
Van den Bulcke, T. et al. SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms. BMC bioinformatics 7, 43 (2006).
Article PubMed PubMed Central Google Scholar
Hofmeyr, J.-H. S. & Cornish-Bowden, H. The reversible Hill equation: how to incorporate cooperative enzymes into metabolic models. Comput. Appl. Biosci. 13, 377–385 (1997).
CAS PubMed Google Scholar
Shen-Orr, S. S., Milo, R., Mangan, S. & Alon, U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64–68 (2002).
Article CAS PubMed Google Scholar
Wang, Y., Zhang, X.-S. & Chen, L. A network biology study on circadian rhythm by integrating various omics data. OMICS 13, 313–324 (2009).
Article CAS PubMed Google Scholar
Kawaguchi, S. et al. Establishment of cell lines derived from the rat suprachiasmatic nucleus. Biochem. Bioph. Res. Co. 355, 555–561 (2007).
Article CAS Google Scholar
Morioka, R. et al. Phase Shifts of Circadian Transcripts in Rat Suprachiasmatic Nucleus. In The Second International Symposium on Optimization and Systems Biology. 109C114 (World Publishing Corporation, Lijiang, China, 2008).
Ueda, H. R. et al. System-level identification of transcriptional circuits underlying mammalian circadian clocks. Nat. Genet. 37, 187–192 (2005).
Article CAS PubMed Google Scholar
Ko, C. H. & Takahashi, J. S. Molecular components of the mammalian circadian clock. Hum. Mol. Genet. 15, R271–R277 (2006).
Article CAS PubMed Google Scholar
Chen, L. et al. Detecting early-warning signals for sudden deterioration of complex diseases by dynamical network biomarkers. Sci. Rep. 2, 342 (2012).
Article PubMed PubMed Central Google Scholar
Liu, R. et al. Early diagnosis of complex diseases by molecular biomarkers, network biomarkers and dynamical network biomarkers. Medicinal Research Reviews 34, 455–478 (2013).
Article CAS PubMed Google Scholar
Liu, R. et al. Identifying critical transitions of complex diseases based on a single sample. Bioinformatics 30, 1579–1586 (2014).
Article CAS PubMed Google Scholar
Ma, H. et al. Predicting Time-Series from Short-Term High-Dimensional Data. Int. J. Bifurcat. Chaos 24, 1430033 (2014).
Article MathSciNet Google Scholar

Download references

Acknowledgements

We thank Prof. Hajimi Tei on the gene expression data and prior information collection. This research was supported by the Aihara Project, the FIRST program from JSPS, initiated by CSTP. This work was also supported by the NSFC (Nos. 61134013, 91029301, 61072149, 11301366 and 11326035), the Knowledge Innovation Program of CAS (No. KSCX2-EW-R-01) and 863 project (No. 2012AA020406).

Author information

Authors and Affiliations

School of Mathematical Sciences, Soochow University, China
Huanfei Ma
Collaborative Research Center for Innovative Mathematical Modelling, Institute of Industrial Science, The University of Tokyo, Japan
Huanfei Ma, Kazuyuki Aihara & Luonan Chen
Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, China
Luonan Chen

Authors

Huanfei Ma
View author publications
You can also search for this author in PubMed Google Scholar
Kazuyuki Aihara
View author publications
You can also search for this author in PubMed Google Scholar
Luonan Chen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.M., L.C. and K.A. conceived the research; H.M. and L.C. performed the experiments, analyzed the data and prepared the figures; All the authors wrote the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Information for Detecting Causality from Nonlinear Dynamics with Short-term Time Series

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/

Reprints and permissions

About this article

Cite this article

Ma, H., Aihara, K. & Chen, L. Detecting Causality from Nonlinear Dynamics with Short-term Time Series. Sci Rep 4, 7464 (2014). https://doi.org/10.1038/srep07464

Download citation

Received: 17 February 2014
Accepted: 25 November 2014
Published: 12 December 2014
DOI: https://doi.org/10.1038/srep07464

This article is cited by

A study of the effect of influential spreaders on the different sectors of Indian market and a few foreign markets: a complex networks perspective
- Anwesha Sengupta
- Shashankaditya Upadhyay
- Prasanta K. Panigrahi
Journal of Computational Social Science (2023)
Fast and effective pseudo transfer entropy for bivariate data-driven causal inference
- Riccardo Silini
- Cristina Masoller
Scientific Reports (2021)
Convergent cross sorting for estimating dynamic coupling
- Leo Breston
- Eric J. Leonardis
- Andrea A. Chiba
Scientific Reports (2021)
Partial cross mapping eliminates indirect causal influences
- Siyang Leng
- Huanfei Ma
- Luonan Chen
Nature Communications (2020)
Field transcriptome revealed a novel relationship between nitrate transport and flowering in Japanese beech
- Akiko Satake
- Kazutaka Kawatsu
- Qingmin Han
Scientific Reports (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.