Parsimonious description for predicting high-dimensional dynamics

Hirata, Yoshito; Takeuchi, Tomoya; Horai, Shunsuke; Suzuki, Hideyuki; Aihara, Kazuyuki

doi:10.1038/srep15736

Download PDF

Article
Open access
Published: 29 October 2015

Parsimonious description for predicting high-dimensional dynamics

Yoshito Hirata^1,2,3,
Tomoya Takeuchi¹,
Shunsuke Horai¹,
Hideyuki Suzuki^2,3 &
…
Kazuyuki Aihara^1,2,3

Scientific Reports volume 5, Article number: 15736 (2015) Cite this article

1669 Accesses
12 Citations
Metrics details

Subjects

Abstract

When we observe a system, we often cannot observe all its variables and may have some of its limited measurements. Under such a circumstance, delay coordinates, vectors made of successive measurements, are useful to reconstruct the states of the whole system. Although the method of delay coordinates is theoretically supported for high-dimensional dynamical systems, practically there is a limitation because the calculation for higher-dimensional delay coordinates becomes more expensive. Here, we propose a parsimonious description of virtually infinite-dimensional delay coordinates by evaluating their distances with exponentially decaying weights. This description enables us to predict the future values of the measurements faster because we can reuse the calculated distances and more accurately because the description naturally reduces the bias of the classical delay coordinates toward the stable directions. We demonstrate the proposed method with toy models of the atmosphere and real datasets related to renewable energy.

Data-driven discovery of intrinsic dynamics

Article 08 December 2022

The probabilistic backbone of data-driven complex networks: an example in climate

Article Open access 13 July 2020

Cost function for low-dimensional manifold topology assessment

Article Open access 25 August 2022

Introduction

Nonlinear time series analysis^1,2,3, or time series analysis based on dynamical systems theory^4,5,6, has been developed intensively in the last 35 years. The most important result is the method of delay coordinates^7,8,9: Suppose that we can observe a scalar time series from a target system whose dimension is m. If we construct d-dimensional vectors , called delay coordinates, by using successive scalar measurements, then it is generally true that if d > 2m, the states x(t−d+1) for the underlying dynamical system and the vectors with delay coordinates , are one-to-one on the attractor, or a set of states the trajectory of the underlying dynamical system is attracted after the transient. Although this statement is supported how large the dimension m for the underlying dynamics is, practically the method of delay coordinates cannot be used for high-dimensional dynamics partly because the delay coordinates are distorted toward the stable directions on the attractor¹⁰ and partly because the calculation of delay coordinates becomes expensive.

The core part of the proposed method came from the idea of weighted delay coordinates⁴. In the weighted delay coordinates, the distortion of high-dimensional delay coordinates toward the stable directions is avoided by reducing weights of the past observations exponentially. If we denote the decay rate by λ (0 < λ ≤ 1), the weighted delay coordinates can be written as . Berry et al.¹⁰ uses the weighted delay coordinates for dimension reduction.

Our idea is to virtually use infinite-dimensional weighted delay coordinates, namely , where we extend the time axis toward the minus infinity and assume s(t) = 0 for t ≤ 0 for simplicity. As we will show in the Methods section, the distance between and in norm is calculated efficiently and successively. To make the distance converge, we just need to set 0 < λ < 1. Thus, by combining with the Lorenz’s method of analogues^11,2, we construct a method of time series prediction (see the details in the Methods section).

First, we tested the proposed time series prediction on Lorenz’96 I model^12,13, which is the minimum model of the atmosphere (see Supplementary Information for the details of the numerical experiment). When we compared the performance of the proposed method with that of the conventional 10-dimensional delay coordinates, we found that the proposed method tended to achieve the higher correlation coefficient (see Supplementary Information for the definition) between the prediction and the corresponding actual values up to 5 steps ahead (Fig. 1(a,c)). The proposed method was significantly better than the persistent prediction, where we let the current values be the prediction for the future values. For one of the 100 tested time series, we compared the speed of calculations. The proposed method only consumed 0.72 seconds, while the conventional 10-dimensional delay coordinates consumed 14.66 seconds. We used a laptop computer with Intel Core i7 CPU(3GHz) with 16.0GB memory. The programs were implemented in MATLAB. We did not use the recursive formula of equation (4) in Methods here. Thus, the proposed method is faster and more accurate in prediction than the conventional delay coordinates.

Second, we tested the proposed time series prediction on Lorenz’96 II model^12,13. In the Lorenz’96 II model, there are two types of variables: the slow variables correspond to the upper-layer of the atmosphere and the fast variables correspond to the layer close to the surface of the earth (see Supplementary Information for the details of this numerical experiment). We found that the proposed method tended to have the greater correlation coefficient than the conventional 10-dimensional delay coordinates up to 10 steps ahead (Fig. 1(b,d)). In one of the 100 tested time series, the computational time required for the proposed method was 0.80 seconds with the laptop computer, while that required for the conventional delay coordinates was 20.16 seconds; we did not use the recursive formula of equation (4) here too.

Third, we applied the proposed time series prediction to the sunshine duration at a single point of Fuchu, Japan (see Supplementary Information for the details of the numerical experiment). The result presented in Fig. 2 shows that the proposed method achieved the higher correlation coefficient than the persistent prediction and the prediction using 1 day periodicity when the prediction steps were greater than or equal to 1.2 hours. To make the prediction for the 10 minutes dataset spanning 5 years of 2008–2012, it only took 3.8 hours with a desktop computer with 2.7 GHz 12-Core Intel Xenon E5 with 64 GB memory. Therefore, the proposed method can run in the real time.

We also applied the proposed time series prediction to the wind speed data at Fuchu, Japan (See the Supplementary Information for the details of this numerical experiment). We found that the proposed method achieved the greater correlation coefficient than the persistent prediction and the prediction using 1 day periodicity when the prediction steps were between 4 and 15.8 hours (Fig. S1). In addition, we needed 3.8 hours with the desktop computer to complete the prediction for the dataset between 2008 and 2012. Thus, for this wind speed dataset, the prediction can also be done in the real time.

By using the proposed method, we can circumvent problems of the current standard practice for obtaining delay coordinates², which is, for example, to decide the delay by the first minimum of mutual information¹⁴ and the embedding dimension by false nearest neighbors¹⁵. Instead of choosing these two parameters, we need to choose the decay rate λ, by which the proposed method shows the robust performance as demonstrated in Fig. 3 and S2. Thus, the proposed method will make it easier to apply nonlinear time series prediction. We can even remove the choice of λ by combing the proposed method with the expert advice algorithm^16,17,18 (see Supplementary Information for the details). Thus, the proposed method is suitable for automating some time series prediction tasks.

The proposed method is robust against the observational noise. Even if we increase the noise level up to 10% of the standard deviation of the original time series, the correlation coefficient between the prediction using the proposed method and the actual values was significantly higher than that between the prediction using the conventional delay coordinates and the actual values (See Fig. 4). The proposed method naturally filters out observational noise when it is included in the measurements. Moreover, even if we evaluate the prediction with the root mean square errors, the proposed method is superior to the conventional delay coordinates for short-term predictions (Fig. 1(e,f)). Thus, our results could have the robustness to some extent in terms of ways for evaluating the prediction.

If we increased the size of database, then the prediction performance became better (Fig. S3). In addition, even if we used different numbers of neighbors for making the prediction, the performance was robust and did not change much (Fig. S4).

The lower the minimal Lyapunov exponent σ₁ for the underlying dynamics is, the higher the optimal λ is (Fig. S5). But, judging from the values of the optimal λ for the prediction, the reconstructed space was not reduced to the most stable direction because ¹⁰. Therefore, the list of distances contained the information of more than the one-dimensional space. From this viewpoint, the proposed method provides a convenient description for the high-dimensional dynamics.

It is easy to further extend the proposed infinite-dimensional weighted delay coordinates to multivariate time series^{19,20,21,22,23,24} or point processes^{25,26,27,28,29}. Let W be a set of states. If we define the state at time t by and a distance function on these states by , then the distance between and on such infinite dimensional weighted delay coordinates can be defined as

This distance, called the Fréchet product metric³⁰, satisfies the three conditions for the metric: (i) and if and only if ; (ii) ; (iii) . Therefore, if a given time series is multidimensional, we may choose the Euclidean distance between times and as D to obtain . We will discuss this extension in our future communication.

The proposed description might also be useful in inferring a network structure. This is an open question and we are also interested in developing the method in this direction.

Comparing with the traditional delay coordinates, the proposed infinite-dimensional weighted delay coordinates can produce more accurate time series prediction faster. As there is an increasing demand for real-time prediction for a big dataset especially in the field of renewable energy such as photovoltaic and wind powers, we hope that the proposed method helps to introduce more renewable energy into the power grids so that we can reduce CO₂ emissions.

Methods

Suppose that a scalar time series is given successively. Denote by N the size of the database. Let be the infinite-dimensional weighted delay coordinates, where λ is a decay rate. We define a distance between and as

We use λ = 0.5 if not mentioned.

This distance is convenient because we can reuse the previous calculations of distances to obtain distances for a pair of its future infinite-dimensional weighted delay coordinates. Namely, observe the following relation:

When we use the conventional d-dimensional delay coordinates (λ = 1), the similar recursive logic may be applied to simplify the calculation of distances as follows:

We combine the relation of equation (3) with Lorenz’s method of analogues^2,11. In the Lorenz’s method of analogues, we find close matches in the past and average their following points as prediction for the future. Let us construct prediction of up to P steps ahead. Suppose that the current time is at time t and we have the following datasets: a list of distances and a list of the corresponding observed values . First, we find the set of indices for the K smallest distances among and provide the p steps ahead prediction for each p by . We used K = 10 if not mentioned. Second, we update the list of distances by

We also update the list of the corresponding values by

for 1 ≤ i < N and

Therefore, to run the proposed method, each time we obtain a measurement, we only need to store 2N floating numbers, compare N floating numbers, conduct N additions, N subtractions and N multiplications. Compare these numbers with the case using the traditional d-dimensional delay coordinates: If we do not use the recursive formula of equation (4), we need dN comparisons for comparing element-wisely N pairs of d dimensional delay coordinates, dN subtractions and (d−1)N additions (The computation for finding nearest neighbors is in the same order).

Additional Information

How to cite this article: Hirata, Y. et al. Parsimonious description for predicting high-dimensional dynamics. Sci. Rep. 5, 15736; doi: 10.1038/srep15736 (2015).

References

Abarbanel, H. D. I. Analysis of Observed Chaotic Data (Springer-Verlag, 1996).
Kantz, H. & Schreiber T. Nonlinear Time Series Analysis (Cambridge University Press, 2003).
Small, M. Applied Nonlinear Time Series Analysis: Applications in Physics, Physiology and Finance (World Scientific, 2005).
Alligood, K. T., Sauer, T. D. & Yorke, J. A. Chaos: An Introduction to Dynamical Systems (Springer, 1996).
Katok, A. & Hasselblatt, B. Introduction to the Modern Theory of Dynamical Systems (Cambridge University Press, 1996).
Strogatz, S. H. Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry and Engineering (Westview Press, 2014).
Packard, N. H., Crutchfield, J. P., Farmer, J. D. & Shaw, R. S. Geometry from a time series. Phys. Rev. Lett. 45, 712–715 (1980).
Article ADS Google Scholar
Takens, F. Detecting strange attractors in turbulence. Lect. Notes Math. 898, 366–381 (1981).
Article MathSciNet Google Scholar
Sauer, T., Yorke, J. A. & Casdagli, M. Embeddology. J. Stat. Phys. 65, 579–616 (1991).
Article ADS Google Scholar
Berry, T., Cressman, J. R., Gregurić-Ferenček, Z. & Sauer, T. Time-scale separation from diffusion-mapped delay coordinates. SIAM J. Appl. Dyn. Syst. 12, 618–649 (2013).
Article MathSciNet Google Scholar
Lorenz, E. N. Atmospheric predictability as revealed by naturally occurring analogues. J. Amtos. Sci. 26, 636–646 (1969).
ADS Google Scholar
Lorenz, E. N. Predictability: A problem partly solved. in Proceedings of the Seminar on Predictability, Vol. 1, pp.1–18 (ECMWF, 1996).
Google Scholar
Hansen J. A. & Smith, L. A. The role of operational constraints in selecting supplementary observations. J. Atmos. Sci. 57, 2859–2871 (2000).
Article ADS Google Scholar
Fraser, A. M. & Swinny, H. L. Independent coordinates for strange attractors from mutual information. Phys. Rev. A 33, 1134- (1986).
Article CAS ADS MathSciNet Google Scholar
Kennel, M. B., Brown, R. & Abarbanel, H. D. I. . Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys. Rev. A 45, 3403–3411 (1992).
Article CAS ADS Google Scholar
Cesa-Bianchi, N. & Lugosi, G. Prediction, Learning and Games (New York, Cambridge Univ. Press, 2006).
Chernov, A. & Zhdanov, F. Prediction with expert advice under discounted loss. Proc. Of AIT 2010, Lecture Notes in Artificial Intelligence 6331, 255–269 (2010).
MathSciNet MATH Google Scholar
Morino, K., Hirata, Y., Tomioka, R., Kashima, H., Yamanishi, K., Hayashi, N., Egawa, S. & Aihara, K. Predicting disease progression from short biomarker series using expert advice algorithm. Sci. Rep. 5, 8953 (2015).
Article CAS ADS Google Scholar
Cao, L., Mees, A. & Judd, K. Dynamics from multivariate time series. Physica D 121, 75–88 (1998).
Article ADS Google Scholar
Bocaletti, S., Valladares, D. L., Pecora, L. M., Geffert, H. P. & Carroll, T. Reconstructing embedding spaces of coupled dynamical systems from multivariate data. Phys. Rev. E 65, 035204(R) (2002).
Article ADS Google Scholar
Garcia, S. P. & Almeida J. S. Multivariate phase space reconstruction by nearest neighbor embedding with different time delays. Phys. Rev. E 72, 027205 (2005).
Article ADS Google Scholar
Hirata, Y., Suzuki, H. & Aihara, K. Reconstructing state spaces from multivariate data using variable delays. Phys. Rev. E 74, 026202 (2006).
Article ADS MathSciNet Google Scholar
Pecora, L. M., Moniz, L., Nichols, J. & Carroll, T. L. A unified approach to attractor reconstruction. Chaos 17, 013110 (2007).
Article ADS MathSciNet Google Scholar
Vlachos, I. & Kugiumtzis, D. Nonuniform state-space reconstruction and coupling detection. Phys. Rev. E 82, 016207 (2010).
Article ADS Google Scholar
Victor, J. D. & Purpura, K. P. Metric-space analysis of spike trains: theory, algorithms and application. Network 8, 127–164 (1997).
Article Google Scholar
van Rossum, M. C. W. A novel spike distance. Neural Comp. 13, 751–763 (2001).
Article CAS Google Scholar
Kreuz, T., Haas, J. S., Morelli, A., Abarbanel, H. D. I. & Politi, A. Measuring spike train synchrony. J. Neurosci. Methods 165, 151–161 (2007).
Article Google Scholar
Schoenberg, F. P. & Tranbarger, K. E. Description of earthquake aftershock sequences using prototype point patterns. Environmetrics 19, 271–286 (2008).
Article MathSciNet Google Scholar
Suzuki, S. Hirata, Y. & Aihara, K. Definition of distance for marked point process data and its application to recurrence plot-based analysis of exchange tick data of foreign currencies. Int. J. Bifurcat. Chaos 20, 3699–3708 (2010).
Article Google Scholar
Deza, M. M. & Deza, E. Encyclopedia of distances. 2nd edition (Springer, 2012).
Kantz, H. A robust method to estimate the maximal Lyapunov exponent of a time series. Phys. Lett. A 185, 77–87 (1994).
Article ADS Google Scholar

Download references

Acknowledgements

We thank the Japan Meteorological Agency for providing us the datasets of sunshine duration and wind speed. The used datasets are commercially available from Japan Meteorological Business Support Center. This manuscript is partially based on results obtained from a project commissioned by the New Energy and Industrial Technology Development Organization (NEDO). This research is also partially supported by Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency (JST).

Author information

Authors and Affiliations

Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo, 153-8505, Japan
Yoshito Hirata, Tomoya Takeuchi, Shunsuke Horai & Kazuyuki Aihara
Graduate School of Information Science and Technology, The University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
Yoshito Hirata, Hideyuki Suzuki & Kazuyuki Aihara
CREST, JST, 4-1-8 Honcho, Kawaguchi, Saitama, 332-0012, Japan
Yoshito Hirata, Hideyuki Suzuki & Kazuyuki Aihara

Authors

Yoshito Hirata
View author publications
You can also search for this author in PubMed Google Scholar
Tomoya Takeuchi
View author publications
You can also search for this author in PubMed Google Scholar
Shunsuke Horai
View author publications
You can also search for this author in PubMed Google Scholar
Hideyuki Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Kazuyuki Aihara
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.H. and K.A. made the ground design of this study. Y.H., T.T., S.H. and K.A. designed the study of wind speed data. Y.H., H.S. and K.A. designed the study of sunshine duration data. Y. H. and T.T. created the method. Y.H. and T.T. wrote the programs and analyzed the datasets. All authors contributed to interpreting the results and writing the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Hirata, Y., Takeuchi, T., Horai, S. et al. Parsimonious description for predicting high-dimensional dynamics. Sci Rep 5, 15736 (2015). https://doi.org/10.1038/srep15736

Download citation

Received: 22 May 2015
Accepted: 29 September 2015
Published: 29 October 2015
DOI: https://doi.org/10.1038/srep15736

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.