Introduction

Predicting the future states of a complex dynamical system is a challenging task across various disciplines1,2,3. System details are often unknown, and only time series data are accessible. Therefore, a variety of data-driven techniques have been designed for the prediction task4,5, including traditional statistical models (e.g., autoregressive integrated moving average (ARIMA))6, state space-based methods (e.g., sequential locally weighted global linear maps (S-maps)7 and multiview embedding (MVE))8, machine learning algorithms (e.g., support vector machine (SVM)9, long short-term memory (LSTM)10, and reservoir computing (RC))11,12, and state-of-the-art combination frameworks (e.g., the multitask learning-based Gaussian process regression machine (MT-GPRM)13, randomly distributed embedding (RDE)14, and the autoreservoir neural network (ARNN)15). These advanced approaches have shown potential for several significant tasks, e.g., one-step and multistep-ahead predictions of a target time series variable16.

Despite considerable efforts in the study of prediction tasks in complex systems, designing a generalized framework that predicts all components of a complex system remains unsolved. Real-world systems often consist of many interconnected units, e.g., multiple spatiotemporal observations in climate systems17 and thousands of functionally connected neurons in the brain18; they therefore output a large number of time series variables, and the interactions between these variables intrinsically drive the dynamical evolution of the system. A practical way to predict complex systems (especially high-dimensional ones), as an approximation, is to study the dynamics of a subset of units, e.g., representative observations19. However, identifying such representative variables remains a challenging task. Moreover, one should be cautious about ignoring ‘unimportant variables’, in which small perturbations may be amplified and propagated to all components, producing substantial changes in system behavior (known as cascading effects)20,21. Instead, the capacity to predict the future states of all components can help to better estimate the future behavior of a complex system. However, many existing approaches exhibit typical limitations for this task. (a) Uncertainty of the predictor: for a target variable, the predictors are often selected empirically22, e.g., several target-related observations. If all the remaining variables are taken as predictors, redundant information may degrade performance (e.g., noise and variables irrelevant to the target15), especially in high-dimensional real-world systems8. (b) Uncertainty of the predictive model: for different targets, some approaches train different models (e.g., the predictive models for \({y}_{1}\) and \({y}_{2}\) are completely independent)23,24, so an N-dimensional system may require N models, leading to high computational cost. (c) The difficulty of forecasting multiple observations typically leads to methods being verified on only a single or a few observations13,14,23. Therefore, designing a unified and reliable framework to forecast all components of complex systems remains an open and challenging issue.

In this work, we develop a data-driven and model-free framework that combines manifold learning and delay embedding, namely, feature-to-reconstructed manifold mapping (FRMM). The FRMM framework yields reliable predictions for all components via a generalized and practical predictor, i.e., the system’s low-dimensional representation from manifold learning (feature embedding). The theoretical foundation of FRMM is the well-established observation that high-dimensional systems often contain redundant information and that their essential dynamics or structures can be characterized by low-dimensional representations25,26,27; e.g., the meaningful structure of a 4096-dimensional image (64 pixels by 64 pixels) can be characterized in a three-dimensional manifold with two pose variables and an azimuthal lighting angle28. Such low-dimensional representations can be identified by two powerful techniques: feature embedding and delay embedding. (a) Feature embedding finds a low-dimensional representation by preserving the geometric features (e.g., nearest-neighbor relations) of the original system as much as possible25. (b) Delay embedding reconstructs a structure isomorphic to the original system from a single time series29. Since the low-dimensional representations from the two approaches (in different coordinates) are both isomorphic to the original system, prediction becomes possible through a one-to-one mapping between them. Additionally, in a dynamical system, each time series variable can reconstruct a low-dimensional representation via delay embedding30. Therefore, the low-dimensional representation from feature embedding can serve as a generalized predictor to identify the future dynamics of all components in complex systems.

Results

Low-dimensional representation from delay embedding

According to Takens’ embedding theory, a low-dimensional attractor can be reconstructed from a single time series of a high-dimensional dynamical system30. In particular, for an N-dimensional system M, one can reconstruct a topologically isomorphic manifold \({M}_{{x}_{i}}\) (namely, the reconstructed manifold) from every time series \({x}_{i}(t)\) within the system (\(i=1,2,\cdots,N\), \(t=1,2,\cdots,L\), where L is the length of the series), and each state point on \({M}_{{x}_{i}}\) is represented as \({\tilde{X}}_{i}(t)=({x}_{i}(t),{x}_{i}(t+\tau ),\cdots,{x}_{i}(t+(E-1)\tau ))\), where E is the embedding dimension and \(\tau\) is the time lag. For example, the attractors of the 3-dimensional Lorenz and Rössler systems are reconstructed in 2-dimensional space from individual time series (Fig. 1a, b, d, e).

Fig. 1: Low-dimensional embeddings of complex systems.
figure 1

The dynamical structure of the 3-dimensional Lorenz system (a) is represented in 2-dimensional space via delay embedding (b) (from time series \(x\), where \(E=2\), \(\tau=10\)) and feature embedding (c) (i.e., the diffusion map algorithm). Analogously, one can find 2-dimensional representations (e and f) of the 3-dimensional Rössler system (d). Even with additive noise, their low-dimensional embeddings can still be found (cf. SI Appendix Fig. S1).
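As a concrete illustration, the following minimal sketch reconstructs the 2-dimensional shadow attractor of Fig. 1b from the single series \(x(t)\); the integration settings and transient cutoff are illustrative assumptions rather than the exact settings used in this work.

```python
# A minimal sketch of delay embedding: simulate the Lorenz system, then
# reconstruct a 2-D shadow attractor from the single series x(t)
# with E = 2 and tau = 10, as in Fig. 1b.
import numpy as np
from scipy.integrate import solve_ivp

def lorenz(t, s, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    x, y, z = s
    return [sigma * (y - x), x * (rho - z) - y, x * y - beta * z]

t_eval = np.arange(0, 130, 0.01)
sol = solve_ivp(lorenz, (0, 130), [0.1, 0.1, 0.1], t_eval=t_eval, rtol=1e-8)
x = sol.y[0, 10000:]              # single observed series; transients discarded

E, tau = 2, 10                    # embedding dimension and time lag
L = len(x) - (E - 1) * tau
# Each row is one reconstructed state X~(t) = (x(t), x(t+tau), ..., x(t+(E-1)tau)).
M_x = np.column_stack([x[j * tau : j * tau + L] for j in range(E)])
print(M_x.shape)                  # (L, E): points on the reconstructed manifold
```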

\({M}_{{x}_{i}}\) has a topological structure isomorphic to that of the original system M. This implies that for every state point \(X(t)\) on M, one can find a corresponding state point \({\tilde{X}}_{i}(t)\) on \({M}_{{x}_{i}}\) through a smooth mapping \({\varphi }_{i}\). According to Takens30, \({\varphi }_{i}\) is a one-to-one mapping; we can therefore identify the corresponding state point \(X(t)\) on M for every \({\tilde{X}}_{i}(t)\) on \({M}_{{x}_{i}}\) via the inverse mapping \({\varphi }_{i}^{-1}({\tilde{X}}_{i}(t))\). These relations are summarized in Eq. (1):

$${\varphi }_{i}:M\to {M}_{{x}_{i}},{\varphi }_{i}(X(t))={\tilde{X}}_{i}(t),{{\varphi }_{i}}^{-1}({\tilde{X}}_{i}(t))=X(t),$$
(1)

where \(X(t)=({x}_{1}(t),{x}_{2}(t),\cdots,{x}_{N}(t))\) and \(X(t)\in M,{\tilde{X}}_{i}(t)\in {M}_{{x}_{i}}\).

Low-dimensional representation from feature embedding

Delay embedding reconstructs low-dimensional representations of the original system. Such low-dimensional representations can also be obtained from manifold learning algorithms. For example, based on the diffusion map algorithm31,32, we find 2-dimensional representations whose structures are equivalent to those of the 3-dimensional Lorenz and Rössler systems (Fig. 1c, f). These techniques (feature embedding) map a high-dimensional system into a low-dimensional space by retaining its essential geometric features, e.g., points that neighbor each other in the high-dimensional space remain adjacent in the low-dimensional representation. Since the embedding is a one-to-one mapping31, it can be written as Eq. (2):

$$\phi :M\to {M}_{0},\phi (X(t))=Y(t),{\phi }^{-1}(Y(t))=X(t),$$
(2)

where \({M}_{0}\) represents an E-dimensional manifold (namely, the feature manifold), and \(Y(t)\in {M}_{0},X(t)\in M\). Because real-world systems show diverse dynamical structures and geometric features, five alternative algorithms are provided to identify the feature embedding, i.e., isometric feature mapping (ISOMAP)28, locally linear embedding (LLE)33, Laplacian34, diffusion map31, and local tangent space alignment (LTSA)35. More details are provided in Methods.
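For comparison with the delay-embedding sketch above, a feature embedding of the same trajectory can be computed with off-the-shelf manifold learning tools. The sketch below uses scikit-learn’s ISOMAP as a stand-in; the diffusion map used for Fig. 1c is available in third-party packages (e.g., pydiffmap), and the neighborhood size is an illustrative assumption.

```python
# A minimal feature-embedding sketch: map the 3-D Lorenz trajectory to a 2-D
# feature manifold by preserving local geometry (here with ISOMAP; the
# diffusion map used in the main text is available in e.g. pydiffmap).
from sklearn.manifold import Isomap

X = sol.y[:, 10000:].T                 # (n_samples, 3) post-transient trajectory
M_0 = Isomap(n_neighbors=12, n_components=2).fit_transform(X)
# Rows of M_0 are the feature-manifold points Y(t), aligned in time with x(t).
```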

Prediction via mapping between low-dimensional representations

Through delay embedding and feature embedding, a high-dimensional system is represented by two low-dimensional manifolds: the reconstructed manifold (\({M}_{{x}_{i}}\)) and the feature manifold (\({M}_{0}\)). Because both are isomorphic to the original system, there is a one-to-one mapping between them. Then, for every state point \(Y(t)\) on \({M}_{0}\), we can find its corresponding state point \({\tilde{X}}_{i}(t)\) on \({M}_{{x}_{i}}\) via a smooth mapping \({\psi }_{i}\):

$${\psi }_{i}:{M}_{0}\to {M}_{{x}_{i}},{\psi }_{i}(Y(t))={\tilde{X}}_{i}(t),i=1,2,\cdots,N,$$
(3)

where \({\psi }_{i}(x)={\varphi }_{i}{\phi }^{-1}(x)\) (see Eqs. (1) and (2)).

Noting that \({\tilde{X}}_{i}(t)=({x}_{i}(t),{x}_{i}(t+\tau ),\cdots,{x}_{i}(t+(E-1)\tau ))\), we deduce a spatiotemporal transformation from state points on \({M}_{0}\) to a temporal series (the final component of \({\tilde{X}}_{i}(t)\)):

$${\mathop{\psi }\limits^{\frown {}}}_{i}({y}_{1}(t),{y}_{2}(t),\cdots,{y}_{E}(t))={x}_{i}(t+(E-1)\tau ),i=1,2,\cdots,N,t=1,2,\cdots,L,$$
(4)

where \(({y}_{1}(t),{y}_{2}(t),\cdots,{y}_{E}(t))\in {M}_{0}\) and \({x}_{i}(t+(E-1)\tau )\in {\tilde{X}}_{i}(t)\in {M}_{{x}_{i}}\). When \(t=L\), Eq. (4) yields at most the \((E-1)\tau\)-step-forward dynamics of each variable \({x}_{i}(t)\) once \({\mathop{\psi }\limits^{\frown {}}}_{i}\) is identified (more details of \({\mathop{\psi }\limits^{\frown {}}}_{i}\) are provided in Methods). In this work, we employ classical Gaussian process regression to train every \({\mathop{\psi }\limits^{\frown {}}}_{i}\) (cf. SI Appendix Chapter 1.3)36. To guarantee robustness, we validate the performance by randomly dividing the observed series into a training set and a test set (i.e., cross-validation). Two widely used metrics measure the performance: the Pearson correlation between observed and predicted values (\(\rho\)) and the root mean square error (RMSE) normalized by the standard deviation of the input series15. The main architecture of FRMM is given in Fig. 2.

Fig. 2: Sketch of FRMM framework.
figure 2

To forecast all components of an N-dimensional system (a), we find its E-dimensional representations from delay embedding (b) and feature embedding (c). Thus, the N-dimensional dynamical system M is represented by two isomorphic low-dimensional manifolds (i.e., the feature manifold \({M}_{0}\) and the reconstructed manifold \({M}_{{x}_{i}}\)). This isomorphism implies a one-to-one mapping between \({M}_{0}\) and \({M}_{{x}_{i}}\). It is then possible to find a mapping \({\mathop{\psi }\limits^{\frown {}}}_{i}\) (\(i=1,2,\cdots,N\)) from the feature manifold \({M}_{0}\) to the final coordinate of the reconstructed manifold \({M}_{{x}_{i}}\). Therefore, the feature manifold \({M}_{0}\) can be utilized as a generalized predictor to find the future dynamics (purple elements) of all components in complex systems (d).
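To make the training step concrete, the following minimal sketch (reusing `x`, `E`, `tau`, `L`, and `M_0` from the snippets above; the kernel choice and the random 50/50 split are illustrative assumptions) fits \({\mathop{\psi }\limits^{\frown {}}}\) for the Lorenz variable \(x\) with Gaussian process regression and evaluates it on the held-out half.

```python
# A minimal sketch of training psi-hat for one component: regress the final
# delay coordinate x(t + (E-1)tau) on the feature-manifold point Y(t), using
# Gaussian process regression and a random 50/50 train/test split.
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel
from sklearn.model_selection import train_test_split

Y = M_0[:L]                            # predictors Y(t), t = 1..L
s = x[(E - 1) * tau :]                 # responses x(t + (E-1)tau), t = 1..L

Y_tr, Y_te, s_tr, s_te = train_test_split(Y, s, test_size=0.5, random_state=0)
gpr = GaussianProcessRegressor(kernel=RBF() + WhiteKernel()).fit(Y_tr, s_tr)
s_pred = gpr.predict(Y_te)             # predictions to compare against s_te
```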

Performance of FRMM on model systems

To illustrate the mechanism of the FRMM framework, we start with the benchmark Lorenz system15. For the 3-dimensional ordinary Lorenz system (see Methods), FRMM first identifies its 2-dimensional manifolds (i.e., the feature manifold and the reconstructed manifold) via feature embedding and delay embedding (Fig. 1a–c). Both manifolds are isomorphic in structure to the original system, which implies an isomorphism between the feature manifold and the reconstructed manifold. The feature manifold can then be utilized as a generalized predictor for all three units. Following generic cross-validation (see Methods), we validate the performance by randomly selecting 50% of the data as training samples and using the rest as test samples. Since the embedding dimension and time lag are \(E=2\) and \(\tau=10\), FRMM yields reliable 10-step-ahead (\(T=(E-1)\tau\)) predictions for all units, where the average \(\rho\) reaches 0.91 and the average error remains low (\(RMSE=0.38\)). Likewise, FRMM achieves accurate 10-step-ahead predictions for all units in the Rössler system (SI Appendix Fig. S2).

To further evaluate the FRMM framework on high-dimensional systems, we select the 90-dimensional coupled Lorenz system as a benchmark (see Methods)14. Traditional regression-based predictions encounter the “curse of dimensionality”, and neural network-based frameworks that take all observations as input incur relatively high computational costs. Owing to sensitivity to initial states as well as complex nonlinear interactions between units, predicting all components is indeed a challenging task. Our FRMM framework embeds the 90-dimensional Lorenz system into a relatively low-dimensional space (\(E=11\)) via feature embedding and delay embedding and sets the feature manifold as a generalized predictor to find the future states of all components (Fig. 3d–f). FRMM performs reliably: all \(\rho\) values exceed 0.6, and all errors are below 0.8 (Fig. 3f). The average \(\rho\) reaches 0.74, and the average RMSE is 0.6.

Fig. 3: Model systems.
figure 3

a–c Performance on the 3-dimensional Lorenz system. d–f Performance on the 90-dimensional coupled Lorenz system, where the prediction accuracies of all 90 components are distributed in f; the red dotted line marks the average values of \(\rho\) and RMSE. g–i Predictions for the 90-dimensional coupled Lorenz system with time-varying dynamics. The results demonstrate that the FRMM framework can output reliable multistep-ahead predictions for all components of the Lorenz system. Note: We validate the accuracy by randomly selecting 50% of the series as training samples (green shaded area) and using the rest as test samples. k indexes the randomly selected data points. STD represents the standard deviation of the observed series.

Real-world systems are often influenced by various external factors and may exhibit time-dependent dynamics; e.g., the couplings among components are not constant but time-varying. For this case, we evaluate the FRMM by increasing the time-varying parameter of the 90-dimensional Lorenz system by 0.2 after every ten time intervals (see Methods)15. FRMM remains reliable, with an average \(\rho\) of 0.73 and RMSE of 0.63 (Fig. 3g–i).

Performance of FRMM on real-world systems

To illustrate FRMM on real-world systems, we use several benchmark datasets across different disciplines, including climate, neuroscience, finance, and traffic (details of all datasets are provided in SI Appendix Chapter 1.2). a) For the climate system, we consider the Indian monsoon, a phenomenon with dramatic influence on India’s agriculture and economy37. Skillful ahead-of-time prediction of this phenomenon is important but remains challenging due to the complex spatiotemporal interactions among multiple observations. To this end, we select the lower-level (850 hPa) zonal daily wind component from region IMI2 (70°E–90°E, 20°N–30°N), provided on a spatial grid with a resolution of \({1}^{\circ }\times {1}^{\circ }\)38. The wind speeds interact spatially and form a 231-dimensional subsystem. FRMM delivers reliable 20-day-ahead predictions for all observations, with average \(\rho\) and RMSE of 0.86 and 0.41, respectively (Fig. 4a–c and Supplementary Fig. S3). Additionally, FRMM is verified on monthly observations in the same region (SI Appendix Fig. S4).

Fig. 4: Performance in real-world datasets.
figure 4

Predictions of daily wind speed (m/s) (a–c), per-second EEG signals (d–f), daily exchange rates (g–i), and traffic speed at 5-minute intervals (j–l). By randomly selecting 50% of the data as test samples, FRMM is shown to deliver accurate multistep predictions in representative real-world systems, where the prediction horizons are \(T=20\) (a–i) and \(T=10\) (j–l).

b) In neuroscience, electroencephalogram (EEG) signals have been extensively used to study the underlying mechanisms of the human brain as well as some typical diseases39. Ahead-of-time predictions of EEG signals are therefore expected to deliver efficient early warnings for related diseases. EEG signals are often captured from different regions of the brain and show spatiotemporal dynamics. To test FRMM, we utilize a 64-dimensional subsystem consisting of EEG series recorded from 64 channels of a healthy participant40. By setting the low-dimensional (\(E=5\)) feature manifold of the system as a generalized predictor, we achieve accurate 20-second-ahead predictions for all signals, with average \(\rho\) and RMSE of 0.92 and 0.42 (Fig. 4d–f and Supplementary Fig. S5).

c) Financial systems are complex systems influenced by numerous internal and external factors through various channels, resulting in high uncertainty and instability, which in turn makes prediction difficult41. We start with a 70-dimensional subsystem from the foreign exchange market, comprising the daily closing prices of 70 currencies against the US dollar. FRMM delivers accurate 20-day-ahead predictions for all observations, with average \(\rho\) and RMSE of 0.89 and 0.41 (Fig. 4g–i and Supplementary Fig. S6). In addition, FRMM outputs reliable predictions for 46 stock indices from global stock markets; see SI Appendix Fig. S7.

d) Finally, we demonstrate FRMM on a 207-dimensional traffic system consisting of traffic speeds collected from 207 loop detectors in Los Angeles County. Forecasting the time evolution of this traffic system is challenging due to the complex spatial and temporal dependencies among its elements42. FRMM achieves 10-step-ahead predictions (\(\rho \ge 0.6\)) for 86% (179) of the components, with average \(\rho\) and RMSE of 0.78 and 0.68 (the averages over all components are 0.73 and 0.7) (Fig. 4j–l and Supplementary Fig. S8). As shown in Fig. 4l, FRMM exhibits relatively poor performance for a few components whose time series involve many abrupt changes, like tipping points, possibly caused by rush hours or accidents.

Discussion

Robustness tests

Generally, a predictive model performs better with longer training samples and shorter test samples, and performance decreases sharply when training on short samples but testing on long ones. Our FRMM performs robustly even when trained on a short sample (10% (140)) and verified on a longer test sample (90% (1260)), with average \(\rho\) and RMSE of 0.69 and 0.75 (Fig. 5a, Supplementary Figs. S10 and S11). In addition, FRMM is robust under additive noise of increasing strength (Fig. 5b and Supplementary Fig. S12). Regarding the length of the input data, FRMM also outputs reliable predictions from short input series (Fig. 5c). Regarding the prediction step, FRMM can reach at most \((E-1)\tau\) steps. In theory, any \(\tau\) can be employed to reconstruct an isomorphic attractor if the observed series is long enough30; in practice, only limited data are accessible, and reconstructions deteriorate as \(\tau\) grows. Therefore, our framework cannot make long-term predictions. Even so, it achieves up to 30-step-ahead predictions for all components of the 90-dimensional ordinary Lorenz system (Fig. 5d).

Fig. 5: Robustness tests.
figure 5

We conduct tests on the length of the training sample (a), additive noise (b), the length of the input series (c), and the prediction step (d). The 90-dimensional ordinary Lorenz system is used here, and the performance over all components is shown as violin plots. FRMM performs better with longer training samples and shorter test samples, but it remains reliable when training on short samples and testing on long ones (a). Given a target variable, FRMM can achieve multistep-ahead predictions, but even longer horizons remain challenging (d). Overall, the results demonstrate that the FRMM framework is robust to several fundamental factors. \(\eta\) represents the proportion of randomly selected training samples; as in cross-validation, the longer the training sample, the shorter the test sample. \(\sigma\) represents the strength of the additive white noise. L is the length of the observed series. T denotes the prediction step.

Remark on feature embedding

Identifying the low-dimensional feature manifold is critical for our prediction. However, embedding a false feature manifold (one whose topology is not equivalent to that of the original system) may result in poor prediction, because the one-to-one mapping between the feature manifold and the reconstructed manifold no longer holds. For example, LLE and Laplacian fail to identify the 2-dimensional feature manifold of the 3-dimensional Lorenz system, resulting in poor predictions (e.g., of variable z) when they are plugged into the FRMM framework (Fig. 6a, b). Conversely, besides the diffusion map algorithm, ISOMAP and LTSA also recover a faithful feature manifold of the original attractor and can be used to perform accurate predictions for all variables by substituting for the diffusion map in the FRMM framework (Fig. 6c, d); however, the diffusion map outperforms ISOMAP and LTSA. Because real-world systems exhibit diverse dynamical structures and only their time series are available, there is no golden rule for selecting an optimal feature embedding algorithm (more discussion is given in SI Appendix Chapter 1.7). Nevertheless, the five powerful techniques provided in this work cover many high-dimensional systems.

Fig. 6: Performance with other feature embedding techniques.
figure 6

Several representative algorithms are considered, including LLE (a), Laplacian (b), ISOMAP (c), and LTSA (d). The LLE and Laplacian algorithms fail to find faithful low-dimensional representations of the Lorenz attractor, resulting in poor performance for some components (e.g., variable z). Because LTSA and ISOMAP preserve the fundamental geometry of the original attractor, FRMM yields reliable predictions for all components with them. Several algorithms can thus be utilized for feature embedding in the 3-dimensional Lorenz system, and the integration of the diffusion map performs best among them (Fig. 3a).

Comparison with traditional methods

Many existing predictive models perform well when trained on long samples and verified on short samples, but their performance often decreases sharply with short training samples and long test samples. We first compare robustness with respect to the lengths of the training and test samples against some traditional methods (e.g., ARIMA6, MVE8, SVM9, LSTM10, and RC11) and several advanced delay embedding-based frameworks (e.g., ARNN15, RDE14, and MT-GPRM13). As depicted in Fig. 7, ARIMA fails to predict variable z of the Lorenz system even when trained on long samples, whereas the other methods predict it accurately. Although performance decreases as the training sample shortens and the test sample lengthens, FRMM remains comparatively robust. Second, we compare the performance of FRMM with the aforementioned methods across different datasets; the results indicate that FRMM yields relatively better predictions (Table 1). In summary, FRMM is shown to be more reliable for predicting all components of complex systems.

Fig. 7: Comparison of robustness with classic predictive models concerning the length of the training sample, including ARIMA, SVM, MVE, LSTM, RC, ARNN, RDE, and MT-GPRM.
figure 7

We compare the robustness of one-step-ahead prediction of variable z from the 3-dimensional Lorenz system. Many methods show reliable predictions when given long training samples, while our FRMM remains robust when training on short samples and verifying on long samples. \(\eta\) gives the proportion of the training sample.

Table 1 Comparison of performance with several classic approaches

Final remark on FRMM framework

Our data-driven and model-free framework (FRMM) has been illustrated on both representative models and real-world systems, and it has several advantages. First, FRMM performs predictions by mapping between low-dimensional representations, which is well grounded in the theory that the topological structure of a high-dimensional dynamical system can be characterized in low-dimensional spaces via delay embedding and feature embedding. Second, FRMM removes the uncertainties of the predictor and the predictive model: the feature manifold serves as a generalized predictor for the future states of all components, and Gaussian process regression is utilized as a fixed tool to train all mappings between the embedded manifolds. Third, many existing predictive models directly fit relations between the time series of a system and may perform poorly because correlations estimated from time series are not constant43 (the fitted parameters of a model are often time-varying); instead, FRMM finds the mapping between low-dimensional representations of a system, a mapping whose existence is supported by the underlying isomorphism. In summary, FRMM overcomes the curse of dimensionality, offers higher interpretability, and shows potential for application in various fields.

Gaussian process regression is applied to find the mapping between the feature manifold and the reconstructed manifold. We note that this mapping can also be trained by neural network algorithms; e.g., ARNN utilizes reservoir computing to train a mapping from the original attractor to the delay attractor. Despite their satisfactory performance, neural networks often rely on rather large training samples. Moreover, some artificial neural networks behave as black boxes with hidden internal details. More importantly, the trade-offs between accuracy, cost, and interpretability need to be balanced in practical applications. On this basis, integrating Gaussian process regression into our FRMM framework appears more satisfactory.

FRMM is developed on the basis of a popular framework, namely spatiotemporal information (STI) transformation44. Several advanced STI-based methods (e.g., MT-GPRM13, RDE14, and ARNN15) have been proposed to predict various complex systems. FRMM shows distinctive characteristics and meaningful improvements compared with many existing STI-based methods. We clarify these from three aspects: the prediction task, the architecture, and the theoretical foundation.

Regarding the prediction task, predicting all components of a complex system remains unsolved. Though some existing STI-based frameworks have the potential to address this issue, their abilities are often certified on partial components and not fully tested on all units of complex systems. Note that verifying a predictive model on only a few observations of a complex system may be risky. Taking the generic 3-dimensional Lorenz system as an example (Eq. (16)), it is possible to predict variables x and y with a linear regression model, but such a model fails to predict variable z45; it would be uncritical to conclude that a linear regression model can predict the Lorenz system. In this respect, FRMM is faithful and exhibits higher potential for predicting all components of complex systems.

Regarding the architecture, the main difference between FRMM and other STI-based frameworks is the selection of the predictor. Some STI-based frameworks set the original system as the fixed predictor (e.g., MT-GPRM), while others use different predictors for different targets; e.g., for each target variable, ARNN finds several highly related components as predictors. FRMM focuses on the system’s fundamental dynamics and sets the system’s low-dimensional feature manifold as a fixed and generalized predictor, which provides an efficient predictor when predicting different components of complex systems.

Regarding the theoretical foundation, many existing STI-based frameworks construct the STI equation from non-delay embedding and delay embedding, which originates from the fact that a complex system can be approximately represented in different coordinates. Generally, the non-delay embedding of a complex system can be approximated in a space of either low or high dimension (e.g., MT-GPRM sets all selected observations as a representation of the original system, whereas RDE finds non-delay embeddings by randomly selecting several observations). The theoretical foundation of FRMM is the well-accepted finding that a high-dimensional system often carries redundant information and that the system’s fundamental dynamics (e.g., the topology of complex systems) are preserved in low-dimensional manifolds31,32,33,34,35. The FRMM framework focuses on the low-dimensional dynamics of complex systems, identified by feature embedding and delay embedding. Feature embedding is conducted by powerful manifold learning algorithms, which automatically extract and preserve the fundamental topology of the original system in a low-dimensional space. Thus, FRMM rests on a different theoretical foundation from existing STI-based frameworks. Additionally, identifying the fundamental dynamics of a high-dimensional system helps reduce the negative impact of its redundant information and is beneficial for better predictions. This is also supported by the relatively higher performance and robustness of FRMM on many real-world datasets (Table 1 and Fig. 7).

However, like other STI-based frameworks, FRMM cannot predict different components synchronously; in other words, for each target variable, one needs to train a separate mapping. Additionally, FRMM has limitations in situations where a system experiences abrupt, rapid, and even irreversible transitions (known as tipping points)46,47. The behavior of such a system shifts between contrasting states, and historical rules often no longer hold once the system crosses the threshold, leading to poor predictions by our framework. The phenomenon of critical transitions, often caused by diverse external factors, is reported in numerous real-world systems. Despite this shortfall, the FRMM framework may also inspire the identification of tipping points: e.g., the occurrence of poor performance may indicate an underlying shift in the system.

Methods

FRMM framework

Given an N-dimensional system with time series \({x}_{i}(t)\)(\(i=1,2,\cdots,N,t=1,2,\cdots,L\)), we aim to predict all units within the system. For this task, the main structure of our FRMM framework is listed as follows:

(1) For each target variable \({x}_{i}(t)\), we estimate the embedding dimension E (cf. SI Appendix Chapter 3.3) and time lag \(\tau\) using the false nearest neighbor algorithm and the mutual information function, respectively48,49. This allows reconstructing an isomorphic manifold \({M}_{{x}_{i}}\) in an E-dimensional space (E is usually much smaller than N). The reconstructed manifold \({M}_{{x}_{i}}\) is given as

$${M}_{{x}_{i}}=\left[\begin{array}{cccc}{x}_{i}(1) & {x}_{i}(1+\tau ) & \cdots & {x}_{i}(1+(E-1)\tau )\\ \vdots & \vdots & & \vdots \\ {x}_{i}(h) & {x}_{i}(h+\tau ) & \cdots & {x}_{i}(L)\\ \vdots & \vdots & & \vdots \\ {x}_{i}(L) & {x}_{i}(L+\tau ) & \cdots & {x}_{i}(L+(E-1)\tau )\end{array}\right],$$
(5)

where L is the length of the series and \(h=L-(E-1)\tau\). Note that the elements from \({x}_{i}(1)\) to \({x}_{i}(L)\) are observed values from the original system, while the others (\({x}_{i}(L+\tau ),\cdots,{x}_{i}(L+(E-1)\tau )\)) are unknown; our goal is to predict them. According to Takens, each time series variable can be used to reconstruct an E-dimensional manifold30, which provides the fundamental basis for predicting all components in complex systems.
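For concreteness, a minimal sketch of the lag selection: we assume the common criterion of taking \(\tau\) at the first local minimum of the time-delayed mutual information49, estimated with a simple 2-D histogram (the bin count is an illustrative choice); the embedding dimension E would be estimated analogously with the false nearest neighbor algorithm48 (not shown).

```python
import numpy as np

def delayed_mutual_information(x, lag, bins=32):
    # Histogram estimate of the mutual information I(x(t); x(t + lag)).
    p_xy, _, _ = np.histogram2d(x[:-lag], x[lag:], bins=bins)
    p_xy /= p_xy.sum()
    p_x, p_y = p_xy.sum(axis=1), p_xy.sum(axis=0)
    nz = p_xy > 0
    return np.sum(p_xy[nz] * np.log(p_xy[nz] / np.outer(p_x, p_y)[nz]))

def select_tau(x, max_lag=50):
    # tau = first local minimum of the delayed mutual information.
    mi = [delayed_mutual_information(x, lag) for lag in range(1, max_lag + 1)]
    for k in range(1, len(mi) - 1):
        if mi[k] < mi[k - 1] and mi[k] <= mi[k + 1]:
            return k + 1               # lags are 1-based
    return int(np.argmin(mi)) + 1      # fallback: global minimum
```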

(2) Moreover, a low-dimensional manifold of the system can also be identified by preserving its fundamental geometric features (feature embedding). Because real-world systems often behave with different dynamical structures and have various geometric features, we provide several techniques to find low-dimensional representations, i.e., isometric feature mapping (ISOMAP), locally linear embedding (LLE), Laplacian, diffusion map, and local tangent space alignment (LTSA).

We select LLE as an example to clarify the main idea of low-dimensional embedding. Given an N-dimensional system with observed vectors \({X}_{i}=({x}_{1i},{x}_{2i},\cdots,{x}_{Ni})\), we approximate each point by a linear function of its K nearest neighbors (e.g., \(K=8\)):

$${\tilde{X}}_{i}=\mathop{\sum }\limits_{j=1}^{N}{w}_{ij}{X}_{j},\mathop{\sum }\limits_{j}{w}_{ij}=1,$$
(6)

where \({w}_{ij}\) measures the weight between the ith point and jth point. To find the optimal set of weights \(\mathop{W}\limits^{\frown {}}=({\mathop{w}\limits^{\frown {}}}_{ij})\), we minimize the loss function

$$\mathop{W}\limits^{\frown {}}={{\mbox{arg}}}\,\min {\mathop{\sum }\limits_{i=1}^{N} \left\Vert {X}_{i}-\mathop{\sum }\limits_{j=1}^{N}{w}_{ij}{X}_{j} \right\Vert }^{2}.$$
(7)

We expect the local geometry in the original space to be preserved in their low-dimensional manifold. Therefore, we fix the matrix \(\mathop{W}\limits^{\frown {}}=({\mathop{w}\limits^{\frown {}}}_{ij})\) and find the low-dimensional embedding by solving

$$\mathop{Y}\limits^{\frown {}}={{\mbox{arg}}}\mathop{\min }\limits_{Y}{\mathop{\sum }\limits_{i=1}^{N} \left\Vert {Y}_{i}-\mathop{\sum }\limits_{j=1}^{N}{\mathop{w}\limits^{\frown {}}}_{ij}{Y}_{j} \right\Vert }^{2},$$
(8)

where \({Y}_{i}\) represents the points in the low-dimensional manifold. Then, the bottom E nonzero eigenvectors (from Eq. (8)) provide the low-dimensional embedding \({M}_{0}\)

$${M}_{0}=\left[\begin{array}{ccc}{y}_{1}(1) & \cdots & {y}_{E}(1)\\ \vdots & \ddots & \vdots \\ {y}_{1}(L) & \ldots & {y}_{E}(L)\end{array}\right],$$
(9)

where \(y(t)\in {Y}_{i}\). Consequently, each N-dimensional observation \({X}_{i}\) is mapped to an E-dimensional point \({Y}_{i}\).
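These steps are available in standard libraries. A minimal sketch with scikit-learn, assuming X is the \(L\times N\) matrix of observed states (one row per time point) and E is the chosen embedding dimension:

```python
# A minimal LLE sketch: fit reconstruction weights over K = 8 nearest
# neighbors (Eqs. 6-7) and take the bottom nonzero eigenvectors as the
# E-dimensional feature coordinates (Eqs. 8-9).
from sklearn.manifold import LocallyLinearEmbedding

lle = LocallyLinearEmbedding(n_neighbors=8, n_components=E)
M_0 = lle.fit_transform(X)             # (L, E): one feature point Y(t) per row
```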

(3) The N-dimensional system is thus embedded into an E-dimensional space by two different approaches, which suggests a one-to-one mapping between \({M}_{{x}_{i}}\) and \({M}_{0}\):

$${\psi }_{i}:{M}_{0}\to {M}_{{x}_{i}},{\psi }_{i}(Y(t))={\tilde{X}}_{i}(t),i=1,2,\cdots,N,$$
(10)

where \(Y(t)\in {M}_{0},{\tilde{X}}_{i}(t)\in {M}_{{x}_{i}}\). From Eq. (10), we infer

$${\psi }_{i}\left(\begin{array}{cccc}{y}_{1}(1) & {y}_{2}(1) & \cdots & {y}_{E}(1)\\ \vdots & \vdots & & \vdots \\ {y}_{1}(h) & {y}_{2}(h) & \cdots & {y}_{E}(h)\\ \vdots & \vdots & & \vdots \\ {y}_{1}(L) & {y}_{2}(L) & \cdots & {y}_{E}(L)\end{array}\right)=\left(\begin{array}{cccc}{x}_{i}(1) & {x}_{i}(1+\tau ) & \cdots & {x}_{i}(1+(E-1)\tau )\\ \vdots & \vdots & & \vdots \\ {x}_{i}(h) & {x}_{i}(h+\tau ) & \cdots & {x}_{i}(L)\\ \vdots & \vdots & & \vdots \\ {x}_{i}(L) & {x}_{i}(L+\tau ) & \cdots & {x}_{i}(L+(E-1)\tau )\end{array}\right).$$
(11)

Since a one-to-one mapping between the feature manifold and the reconstructed manifold holds, it is possible to train a mapping from the feature manifold to each coordinate of the reconstructed manifold. In particular, to obtain the longest prediction horizon, we seek the mapping from the feature manifold to the final coordinate of the reconstructed manifold (i.e., \({x}_{i}(t+(E-1)\tau ),t=1,2,\cdots,L\)). These steps can also be expressed mathematically, as follows.

Based on Eq. (11), we deduce the form (12)

$${\mathop{\phi }\limits^{\frown {}}}_{i}\left(\begin{array}{cccc}{x}_{i}(1) & {x}_{i}(1+\tau ) & \cdots & {x}_{i}(1+(E-1)\tau )\\ \vdots & \vdots & & \vdots \\ {x}_{i}(h) & {x}_{i}(h+\tau ) & \cdots & {x}_{i}(L)\\ \vdots & \vdots & & \vdots \\ {x}_{i}(L) & {x}_{i}(L+\tau ) & \cdots & {x}_{i}(L+(E-1)\tau )\end{array}\right)=\left(\begin{array}{c}{x}_{i}(1+(E-1)\tau )\\ \vdots \\ {x}_{i}(L)\\ \vdots \\ {x}_{i}(L+(E-1)\tau )\end{array}\right).$$
(12)

In Eq. (12), \({\mathop{\phi }\limits^{\frown {}}}_{i}\) is simply the projection onto the final column, given by the transform in Eq. (13):

$$\left(\begin{array}{cccc}{x}_{i}(1) & {x}_{i}(1+\tau ) & \cdots & {x}_{i}(1+(E-1)\tau )\\ \vdots & \vdots & & \vdots \\ {x}_{i}(h) & {x}_{i}(h+\tau ) & \cdots & {x}_{i}(L)\\ \vdots & \vdots & & \vdots \\ {x}_{i}(L) & {x}_{i}(L+\tau ) & \cdots & {x}_{i}(L+(E-1)\tau )\end{array}\right)\left[\begin{array}{c}0\\ \vdots \\ 0\\ 1\end{array}\right]=\left(\begin{array}{c}{x}_{i}(1+(E-1)\tau )\\ \vdots \\ {x}_{i}(L)\\ \vdots \\ {x}_{i}(L+(E-1)\tau )\end{array}\right).$$
(13)

According to Eqs. (11) and (12), we deduce Eq. (14):

$${\mathop{\psi }\limits^{\frown {}}}_{i}\left(\begin{array}{cccc}{y}_{1}(1) & {y}_{2}(1) & \cdots & {y}_{E}(1)\\ \vdots & \vdots & \cdots & \vdots \\ {y}_{1}(h) & {y}_{2}(h) & \cdots & {y}_{E}(h)\\ \vdots & \vdots & \cdots & \vdots \\ {y}_{1}(L) & {y}_{2}(L) & \cdots & {y}_{E}(L)\end{array}\right)=\left(\begin{array}{c}{x}_{i}(1+(E-1)\tau )\\ \vdots \\ {x}_{i}(L)\\ \vdots \\ {x}_{i}(L+(E-1)\tau )\end{array}\right),$$
(14)

where \({\mathop{\psi }\limits^{\frown {}}}_{i}(x)={\mathop{\phi }\limits^{\frown {}}}_{i}{\psi }_{i}(x)\). Equation (14) suggests a mapping from the feature manifold to the final coordinate of the reconstructed manifold.

All the elements of \({M}_{0}\) (the left matrix in Eq. (14)) are obtained via manifold learning algorithms, whereas some components of \({M}_{{x}_{i}}\) are unknown, i.e., \({x}_{i}(L+\tau ),\cdots,{x}_{i}(L+(E-1)\tau )\) (the right matrix in Eq. (14)); these represent the future dynamics of the variable \({x}_{i}(t)\). Once \({\mathop{\psi }\limits^{\frown {}}}_{i}\) is identified, one can obtain at most the \((E-1)\tau\)-step-forward dynamics of a selected variable \({x}_{i}(t)\).

In a dynamical system, each time series variable can be used to reconstruct a low-dimensional embedding (i.e., \({M}_{{x}_{i}}\), where \(i=1,2,\cdots,N\)). This approach thus enables the construction of N mappings (i.e., \({\mathop{\psi }\limits^{\frown {}}}_{i}\), \(i=1,2,\cdots,N\)) from \({M}_{0}\) to the final coordinate of \({M}_{{x}_{i}}\), which yields multistep predictions for all units in high-dimensional dynamical systems. In this work, we use the Gaussian process regression algorithm to identify every \({\mathop{\psi }\limits^{\frown {}}}_{i}\).
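The sketch below summarizes this step for one component, assuming `M_0` is the \(L\times E\) feature manifold of Eq. (9), `x_i` is the observed series of length L for component i (a hypothetical variable name), and an illustrative Gaussian process kernel; predicting the rows of Eq. (14) with unknown responses yields the forward dynamics.

```python
# Fit psi-hat_i on the h rows of Eq. (14) with known responses, then predict
# the remaining (E-1)*tau rows, i.e., the future values of x_i beyond t = L.
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

h = L - (E - 1) * tau                       # number of fully observed rows
gpr = GaussianProcessRegressor(kernel=RBF() + WhiteKernel())
gpr.fit(M_0[:h], x_i[(E - 1) * tau :])      # responses x_i(t+(E-1)tau), t = 1..h
future = gpr.predict(M_0[h:])               # at most (E-1)*tau steps ahead
```

Looping this fit over \(i=1,2,\cdots,N\) produces the N mappings \({\mathop{\psi }\limits^{\frown {}}}_{i}\) with the single fixed predictor \({M}_{0}\).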

(4) We validate the performance by randomly dividing the observed series (\({x}_{i}(1+(E-1)\tau ),{x}_{i}(2+(E-1)\tau ),\cdots,{x}_{i}(L)\)) into a training set and a test set (i.e., cross-validation). The Pearson correlation between observed and predicted values (\(\rho\)) and the normalized root mean square error (RMSE) are applied to measure the performance:

$$\rho=\frac{{{\mathrm{cov}}}(x,\tilde{x})}{{\eta }_{x}{\eta }_{\tilde{x}}},\quad RMSE=\frac{\sqrt{\frac{1}{n}\sum {(x-\tilde{x})}^{2}}}{{\eta }_{x}},$$
(15)

where \(x\) and \(\tilde{x}\) are the original and predicted data, respectively. \({\eta }_{x}\) represents the standard deviation of the series \(x\).
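A minimal sketch of Eq. (15):

```python
import numpy as np

def accuracy(x_obs, x_pred):
    # Pearson correlation and RMSE normalized by the standard deviation
    # of the observed series, as in Eq. (15).
    rho = np.corrcoef(x_obs, x_pred)[0, 1]
    rmse = np.sqrt(np.mean((x_obs - x_pred) ** 2)) / np.std(x_obs)
    return rho, rmse
```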

Benchmark model systems

The coupled Lorenz system is defined in Eq. (16), where the ith (\(i=1,2,\cdots,N\)) subsystem is coupled to the (i−1)th subsystem via \(c\). To close the ring, we set i−1 to N for i = 1.

$${\dot{x}}_{i}= \sigma (t)({y}_{i}-{x}_{i})+c{x}_{i-1},\\ {\dot{y}}_{i}= a{x}_{i}-{y}_{i}-{x}_{i}{z}_{i},\\ {\dot{z}}_{i}= b{z}_{i}+{x}_{i}{y}_{i},$$
(16)

where \(\sigma (t)\) is a time-varying parameter, and a and b are set to generic values, i.e., \(a=28\), \(b=-8/3\).

For the time-invariant case (\(\sigma (t)\equiv 10\)), Eq. (16) depicts an ordinary Lorenz system. In particular, we obtain a 3-dimensional Lorenz system when \(N=1\) and \(c=0\), and we define a 90-dimensional coupled Lorenz system when \(N=30\) and \(c=0.1\).

For the time-varying case, we set \(\sigma (t)\) to increase from an initial value of 10 by 0.2 after every ten time intervals, i.e., \(\sigma (t)=10+0.2\lfloor t/10\rfloor\).

To generate the discrete data, we set the initial state to 0.1, and the output series has a length of 1500. The first 100 data points are discarded to avoid transient dynamics. We embed the 3-dimensional ordinary Lorenz system into a 2-dimensional space with \(E=2,\tau=10\). For the 90-dimensional ordinary and time-varying Lorenz systems, the embedding dimension and time lag are \(E=11\) and \(\tau=1\), respectively. The diffusion map algorithm is used to find their low-dimensional representations.
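A minimal simulation sketch of Eq. (16) for the ordinary 90-dimensional case (\(N=30\), \(c=0.1\), \(\sigma (t)\equiv 10\)); the solver tolerances and sampling interval are illustrative assumptions.

```python
import numpy as np
from scipy.integrate import solve_ivp

N, a, b, c = 30, 28.0, -8.0 / 3.0, 0.1

def coupled_lorenz(t, s, sigma=10.0):
    x, y, z = s[:N], s[N:2 * N], s[2 * N:]
    dx = sigma * (y - x) + c * np.roll(x, 1)   # x_{i-1} drives x_i; ring closure
    dy = a * x - y - x * z
    dz = b * z + x * y
    return np.concatenate([dx, dy, dz])

t_eval = np.arange(0, 160, 0.1)
sol = solve_ivp(coupled_lorenz, (0, 160), 0.1 * np.ones(3 * N),
                t_eval=t_eval, rtol=1e-6)
series = sol.y[:, 100:]        # 90 coupled series; early transients discarded
# For the time-varying case, sigma(t) = 10 + 0.2 * np.floor(t / 10) can be
# substituted inside coupled_lorenz (our reading of the schedule in the text).
```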

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.