Extrapolating tipping points and simulating non-stationary dynamics of complex systems using efficient machine learning

Köglmayr, Daniel; Räth, Christoph

doi:10.1038/s41598-023-50726-9

Article
Open access
Published: 04 January 2024

Scientific Reports volume 14, Article number: 507 (2024)

Abstract

Model-free and data-driven prediction of tipping point transitions in nonlinear dynamical systems is a challenging and outstanding task in complex systems science. We propose a novel, fully data-driven machine learning algorithm based on next-generation reservoir computing to extrapolate the bifurcation behavior of nonlinear dynamical systems using stationary training data samples. We show that this method can extrapolate tipping point transitions. Furthermore, it is demonstrated that the trained next-generation reservoir computing architecture can be used to predict non-stationary dynamics with time-varying bifurcation parameters. In doing so, post-tipping point dynamics of unseen parameter regions can be simulated.

Introduction

Small perturbations in a complex system can dramatically change its evolution¹. A lack of precision in determining the exact state of the system can lead to an amplified lack of certainty about the future behavior of the system. This is the case even when we know its true governing equations and the exact boundary conditions. How can we then deal with complex systems where, in addition, we do not know the governing equations and must rely solely on observational data?

In recent years, promising and remarkably efficient machine learning methods were proposed that use observational data as training data to autonomously generate a model that can explain the data^2,3,4. One prominent example is a recurrent neural network method called reservoir computing^5,6 (RC). A reservoir computer creates a high-dimensional nonlinear representation of the observed dynamical system and synchronizes it with the corresponding input data. The synchronized representation is then trained on the desired output target so that the reservoir computer becomes an autonomous dynamical system whose output dynamics resemble that of the analyzed system. This way, it can achieve cutting-edge performance in predicting short- and long-term behavior of chaotic systems and outperforms other machine learning approaches like LSTMs or DNNs^7,8. In September 2021, Gauthier et al. published the next-generation reservoir computing architecture (NG-RC), highlighting its lack of randomness, its smaller number of hyperparameters, the smaller amount of required training data, and its gain in speed compared to the traditional approach⁹. In traditional reservoir computing, randomly initialized matrices are used to feed the input variables of the dynamical system into a high-dimensional state space that is nonlinearized by applying a nonlinear activation function. The NG-RC uses a library of unique polynomials of time-shifted input variables to achieve a nonlinear dimensionality expansion. In both cases, the resulting state space is consistently trained on the desired output target using ridge regression to become an autonomous dynamical system. Both methods can generally be deployed with small state spaces, which, combined with the computationally cheap regression, lead to highly efficient algorithms.

So far, these algorithms have been used mainly for analyzing stationary dynamical systems, where the boundary conditions of the system are assumed to be fixed, i.e., time-independent. In this case, the qualitative behavior of the system, such as periodicity or chaoticity, remains the same over time. However, in most real-world systems, the boundary conditions can change over time, possibly leading to a qualitative change in the behavior of the system, e.g., from stable periodicity to chaos or from chaos to system collapse. These systems are called non-stationary dynamical systems, and the boundary condition under which the system undergoes such a critical transition is termed a tipping point. Extrapolating tipping points is of great interest in many scientific fields. A prominent example is the evolution of the climate system influenced by atmospheric greenhouse gas concentrations, for which several tipping points are predicted¹⁰. Irrgang et al. surveyed the role of artificial intelligence for earth system modeling. They highlighted the concern that current classic earth system models might not be capable of predicting future abrupt climate changes¹¹. Hence, data-driven methods that capture the underlying physics seem suitable to augment classic models.

Reservoir computing-based methods for analyzing non-stationary dynamical systems and, thus, for the possible data-driven extrapolation of tipping points are still in their early stages. Two different approaches are emerging in the current literature, which mainly differ from each other in the form of their training data. One approach directly uses non-stationary time series to train the reservoir computer^12,13,14. The other approach uses several stationary data samples from different boundary conditions, i.e., bifurcation parameters, to train the reservoir computer^15,16,17. The latter uses the multifunctional capabilities of reservoir computing, which are complemented by an additional parameter channel to test for new and unseen parameter regions. It has been shown that a reservoir computer can be optimized to predict several different dynamical systems with a single trained architecture^{18,19,20,21,22}. Kim et al. showed that, with this approach and the additional parameter channel, reservoirs can learn to interpolate and extrapolate translations, linear transformations, and bifurcations of the Lorenz attractor¹⁵. In¹⁶, Kong et al. statistically evaluated the parametrization of a system collapse, or global bifurcations, of a chaotic food chain model and a generic power system model. Kong et al. were also able to reconstruct bifurcation diagrams of driven chaotic systems¹⁷.

In this work a framework for parameter-aware next-generation reservoir computing is developed. By means of the examples used in¹⁶, the functionality of the developed method is demonstrated and it is shown that the method is capable of accurately reconstructing bifurcation diagrams and simulating non-stationary dynamics, even in situations where the data is limited and the parameterization of the training data is far from the global bifurcation of interest.

Results

Recently, a new type of RC called next-generation reservoir computing (NG-RC) has been introduced for the analysis of dynamical systems. In its functional core, the algorithm first collects the time-shifted input variables of the time series data to be analyzed into a vector. In a second step, each unique polynomial combination of certain orders of the entries in the previously collected vector is determined and appended. In this way, the feature vector is created. During training, the linear mapping of the feature vector to the corresponding next time series data point is optimized using ridge regression. Due to this minimal architecture, the NG-RC features excellent speed and lacks any randomness. Besides these operational advantages, it has been shown in several publications that NG-RC requires significantly less training data than the already data sparing traditional reservoir computer^9,23,24,25, which makes NG-RC a highly efficient method for analyzing and predicting dynamical systems.

The algorithm proposed in this paper models an additional input channel for a bifurcation parameter into the NG-RC architecture by adding the parameter times a scaling parameter to each entry of the feature vector (see Methods). This allows the algorithm to learn a dynamical system also in terms of its bifurcation parameter. After training, the parameter can be varied so that the prediction of the algorithm can be tested for unseen parameter regions. The parameter-aware next-generation reservoir computing architecture is applied below to two systems of ordinary differential equations, a generic power system model²⁶ and a chaotic food chain model²⁷. Both systems were examples for statistical evaluation of tipping points using traditional reservoir computing¹⁶. Moreover, their equations contain terms that cannot be directly represented by the polynomial structure of the feature vector, making them informative test systems for the parameter-aware NG-RC.

For the generic power system model, the NG-RC architecture is used to predict the bifurcation diagram and to extrapolate the tipping point. To evaluate its prediction quality, the largest Lyapunov exponents are compared with those of the model equations. The Lyapunov exponent is a measure of the long-term statistical behavior, or statistical climate, of a time series and indicates how chaotic or periodic a time series is (see Methods). It is also demonstrated that the correct choice of the scaling parameter is important and affects the quality of the prediction. For the chaotic food chain model, the influence of the scaling parameter on the extrapolation is further investigated. For that, 15 bifurcation diagrams predicted by the same NG-RC architecture are shown, which only differ in a slightly different scaling parameter. Furthermore, it is shown that the trained NG-RC can be used to simulate non-stationary dynamics in unseen parameter regions, capturing the main behavior of the dynamics even after passing through a tipping point.

Power system model

Dobson and Chiang formulated a set of generic equations to model the collapse of electrical power systems. Such a collapse can be caused by the dynamic response of the system to disturbances, which may lead to a progressive drop in voltage, causing what is known as a “voltage collapse” or blackout²⁶. In the upper plot of Fig. 1, the bifurcation diagram of the generic equations of the power system model is scattered in red. The corresponding Lyapunov exponents are plotted to measure the dynamic behavior of the system. It evolves from a periodic dynamic to a chaotic one for increasing bifurcation parameters. In some areas, it shows periodic windows. The system collapses at the critical bifurcation parameter $Q_{1c}=2.989820$, and the total voltage drops to zero. The presented NG-RC architecture aims to reconstruct the bifurcation diagram with matching Lyapunov exponents. In this example, seven training data samples are taken from different and widely separated regions of the bifurcation diagram to be analyzed. The bifurcation parameters of these samples are highlighted as vertical green dashed lines.

Results

The parameter-aware NG-RC architecture was applied and tested with different scaling parameters. The reconstructed bifurcation diagram of the best performing architecture is scattered in the middle plot of Fig. 1 in green, and its corresponding Lyapunov exponents are plotted in blue. It captures the main dynamical behaviors of the model. Between the area of training data samples, the architecture interpolates the periodic windows even though none of the samples were set in a similar region. In extrapolating the dynamics, the architecture captures the periodic window starting at $Q_1=2.989784$ and predicts the system collapse at $Q_{1c}=2.989819$. The dynamical properties of the model were successfully captured for a set of scaling parameters in the range of $\gamma \in [0.6,1.05]$. The corresponding minimum and maximum Lyapunov exponents of these parameters are plotted in the lower plot of Fig. 1. The Lyapunov exponents of the model equations lie well within this region. The Lyapunov exponents for $\gamma =1.1$ are plotted in yellow, showing that the NG-RC architecture could not predict the dynamical properties of the model. For some parameters of the training data, the Lyapunov exponents of the predicted dynamics differ strongly from those of the training data, which allows for direct validation of prediction performance given the applied scaling parameter. This makes the introduced scaling parameter a functional new hyperparameter, which is worthwhile tuning in this setup. Its functionality is further investigated in the next example.

Chaotic food chain model

McCann and Yodzis showed that ecosystem behavior, when it transitions to chaotic transient dynamics, can cause sudden and unpredictable disappearance of populations. Due to the realistic and nonlinear functional response properties of productive environments, sudden and unexpected jumps to other dynamical population density attractors may occur, potentially causing the disappearance of a population²⁷. To model this, they used a three-species food chain model with a resource density R, a consumer density C, and a predator density P. The resource-carrying capacity K of the environment is taken as the bifurcation parameter. The bifurcation diagram of the model equations is scattered in Fig. 2 in red and with a larger K space in Fig. 3. This system of equations shows rich bifurcation structures. When the resource-carrying capacity K reaches a critical value of $K_{c_1}=1.00050$, the chaotic oscillating predator density P suddenly drops to 0, and the predator population disappears. For $K_{c_2}=1.04$, this density reappears in a reverse manner. Both system behaviors are global bifurcations. Notably, there is another one at $K_{c_3}=0.96075$, where the predator density performs a sudden jump. This time the NG-RC architecture aims to reconstruct the bifurcation diagram given stationary training data samples that are confined to a smaller part of the bifurcation diagram than those taken in the previous example. The performance of the extrapolation of tipping points is evaluated regarding the introduced scaling parameter. Non-stationary dynamics are simulated, passing through the tipping point $K_{c_3}$.

Results

The prediction performance of the NG-RC architecture was investigated for different scaling parameters. The best performing architecture with $\gamma =0.4$ is scattered in Fig. 2 in green. The nearest tipping point from the parameterization of the training data at $K_{c_3}=0.96075$ and the subsequent transition from chaoticity to periodicity at $K=0.983$ are accurately predicted. The tipping point at $K_{c_1}=1.00050$ was predicted with $K=0.99875$. The bifurcation that is farthest away, $K_{c_2}=1.04$, was extrapolated with $K=1.027$ (see Fig. 3). A qualitative difference between the prediction and the model is that the predicted trajectory after the tipping point at $K=0.99875$ goes to minus infinity, whereas the real one goes to zero. Clear topological differences can be seen between the training data samples at $K=0.935$ and $K=0.94$. This area incorrectly shows the properties of a periodic window. This also applies to the interpolation for different scaling parameters. Thus, the effect of the scaling parameter on the interpolation capabilities between the parameter space of training data samples is limited. As expected, accurate extrapolation of possible tipping points becomes more difficult the further they are from the parameterization of the training data. Looking at the evolution of the bifurcation diagrams for different scaling parameters in Fig. 3 was instructive to see the influence of the scaling parameter on the extrapolation capability. From $\gamma =0.3$ on to $\gamma =0.4$, the increasing scaling parameter stretches the bifurcation topology in the range of $K \in [0.95,1]$. Interestingly, there is a qualitative change in the bifurcation diagram when the scaling parameter stretches it over the actual tipping point at $K_{c_1}=1.00050$. The tipping point prediction is lost, and the NG-RC transforms the two parts of the bifurcation diagram into a continuous one. 
This behavior generally allows practical parameter tuning of the scaling parameter by introducing validation data to determine the necessary degree of stretching. Moreover, the trained NG-RC architecture for $\gamma =0.4$ is used to simulate non-stationary dynamics using Eq. (15). In Fig. 4, the bifurcation parameter switches from $K=0.955$ to $K=0.965$ over the tipping point at $K_{c_3}=0.96075$. The predicted trajectory captures this transition. Using the identical trained NG-RC architecture, a sinusoidal and linearly increasing function of K is taken as another example. The result is shown in Fig. 5. The dominant dynamical behaviors concerning the bifurcation parameter regions are captured in the prediction. These examples illustrate the applicability of this architecture, which enables the simulations of non-stationary dynamics as a function of a time-varying bifurcation parameter.

Discussion

A machine learning method based on parameter-aware next-generation reservoir computing was presented to investigate the bifurcation behavior of dynamical systems. It was shown that tipping points can be extrapolated. Moreover, the trained architecture can be used to simulate non-stationary dynamics and thereby also post-tipping point dynamics. The success of reservoir computing and next-generation reservoir computing relies on optimizing them into dynamical systems whose dynamics resemble that of the analyzed system. The presented implementation of the bifurcation parameter provided a functional input channel that allowed the investigation of the system dynamics in unseen parameter regions. It is noteworthy that, on the one hand, this method can capture high sensitivities of the analyzed dynamical system to the bifurcation parameter. This was shown in the generic power system model, where the sixth decimal place of the bifurcation parameter partly determines the dynamic behavior. On the other hand, this method integrated the bifurcation parameter, implemented by adding it times a scaling parameter to the feature space of NG-RC, so that the optimized NG-RC architecture was able to simulate and predict dynamics where the bifurcation parameter appears as an inverse parameter in the governing equation. This generally extends the applicability of this approach and was shown in the chaotic food chain example. Although both system equations contain terms that cannot be directly represented by the polynomial structure of the feature vector, the proposed NG-RC architecture was able to interpolate and extrapolate the system behaviors, which further supports its applicability.

So far, there are few publications in which reservoir computing methods are used to determine bifurcation diagrams of dynamical systems and their tipping points. In Kim et al.¹⁵, parameter-aware reservoir computing was used to accurately extrapolate the period doubling bifurcations of the Lorenz system around $\rho \approx 100$. For this, 4 training samples with 250000 training steps and 50000 synchronization steps were used, resulting in 1200000 data points. In Kong et al.¹⁷, the bifurcation diagram of a driven Lorenz-96 system was predicted with parameter-aware RC. This was done using 4 training samples with 140000 training steps and 800 synchronization steps each, resulting in 563000 data points. In the results presented here, the parameter-aware NG-RC required 70014 data points to train the architecture on the power system model and 175112 data points for the chaotic food chain model. A general statement that NG-RC requires significantly less training data than traditional RC, even in the case of parameter-aware extrapolation, would be overstated due to the lack of a direct comparison of the two methods. However, the results presented here provide a first tendency that the required training data can significantly be reduced. In terms of setting up a working architecture, the parameter-aware RC approach has eight tunable hyperparameters¹⁷, while the proposed NG-RC architecture has six, most of which are far less comprehensive to optimize. In addition, the NG-RC works completely without randomness. Instead of random matrices, the polynomial architecture generally ensures higher interpretability and, together with the previously mentioned points, a more direct setup to deploy a working architecture without stochastic realizations of the reservoir system. In the context of this work, the NG-RC architectures were not extensively optimized nor comprehensively investigated concerning the minimal required training data. 
However, if the tendency holds, further applications emerge. Since most real-world dynamical systems are of non-stationary nature, the less training data needed, the better non-stationary data samples can be approximated as stationary data, which can improve the prediction of tipping points based on non-stationary time series data. Consequently, the parameter-aware NG-RC proposed here is an efficient, model-free, and data-driven method for extrapolating the behavior of dynamical systems and simulating non-stationary dynamics.

Methods

The method presented here is based on next-generation reservoir computing. Its architecture is extended by an input channel for a bifurcation parameter of a dynamical system. To this end, the parameter, multiplied by a scaling parameter, is added to each element of the NG-RC feature vector. The new feature vector is then extended with orders of itself. These steps are presented in detail below. A condensed mathematical description of the NG-RC is used, so that the applied architecture and its hyperparameters can be written as one equation. For the power system model results, the architecture with its hyperparameters is described in Eq. (23), and for the chaotic food chain model in Eq. (27).

Next-generation reservoir computing

The d-dimensional data points $\textbf{x} \in \mathbb {R}^d$ of the input data $\textbf{X}=(\textbf{x}_0,\ldots ,\textbf{x}_n)$ are transformed with a polynomial multiplication dictionary $\textbf{P}$ into a higher dimensional state space. The unique polynomials of certain orders O, included in $\textbf{P}^{[O]}$, are denoted by an index. For illustration purposes, we consider a two-dimensional input data point $\textbf{x}_i=(x_{i,1}, x_{i,2})^T$ and transform it with the unique polynomials of order 1 and 2,

$$\begin{aligned} \textbf{P}^{[1,2]}(\textbf{x}_i)= \begin{pmatrix} x_{i,1}\\ x_{i,2}\\ x_{i,1}^2\\ x_{i,2}^2\\ x_{i,1}x_{i,2} \end{pmatrix}. \end{aligned}$$

(1)
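As a minimal sketch of Eq. (1), the unique monomials of the chosen orders can be enumerated with index combinations with replacement. The function name `poly_dictionary` is hypothetical and not taken from the paper, and the second-order terms come out in a different order than in Eq. (1) (the cross term sits between the squares), which does not affect the subsequent linear regression.

```python
import numpy as np
from itertools import combinations_with_replacement

def poly_dictionary(v, orders=(1, 2)):
    """Return all unique monomials of the given orders built from the entries of v.

    Hypothetical helper illustrating P^[O]; the ordering of terms within one
    order follows itertools and differs from the ordering printed in Eq. (1).
    """
    v = np.asarray(v, dtype=float)
    feats = []
    for order in orders:
        # each index tuple (i1 <= i2 <= ...) corresponds to one unique monomial
        for idx in combinations_with_replacement(range(len(v)), order):
            feats.append(np.prod(v[list(idx)]))
    return np.array(feats)

# two-dimensional example point x_i = (2, 3)
example = poly_dictionary([2.0, 3.0])
print(example)  # entries: x1, x2, x1^2, x1*x2, x2^2
```

For a two-dimensional point this yields five features, matching the dimension of the vector in Eq. (1).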

Further, Gauthier et al. introduced a time shift expansion $\textbf{L}_k^s$ of the input data. The k value indicates the number of past data points with which the current data point is concatenated. The s value indicates how far these points are separated in time. Following the previous example

$$\begin{aligned} \textbf{P}^{[1,2]}\left( \textbf{L}^{s=1}_{k=2}(\textbf{x}_i)\right) = \textbf{P}^{[1,2]}\left( \begin{pmatrix} x_{i,1}\\ x_{i,2}\\ x_{i-1,1}\\ x_{i-1,2} \end{pmatrix}\right) = \begin{pmatrix} x_{i,1}\\ x_{i,2}\\ x_{i-1,1}\\ \vdots \\ x_{i,1}x_{i-1,2}\\ x_{i,2}x_{i-1,2}\\ \end{pmatrix}=\textbf{r}_{i+1}, \end{aligned}$$

(2)

where $\textbf{r}_{i+1} \in \mathbb {R}^N$ defines the feature vector with feature space dimension N. By concatenating this vector with powers of itself, higher-order features can be included in a computationally cheap way. For this purpose, an additional post-processing operator $\textbf{q}_{[O_{states}]}(\textbf{r})$ is introduced, where $O_{states}$ specifies which orders of the feature vector are to be concatenated. Defining $\odot$ as the Hadamard product, $\oplus$ as the vector concatenation operation, and specifying that for $0 \in O_{states}$ a bias term of dimension one is concatenated, the feature space can be extended, for example, for $O_{states}=[0,1,2]$, as shown below,

$$\begin{aligned} \textbf{q}_{[0,1,2]}(\textbf{r}_{i+1})&=1 \oplus \textbf{r}_{i+1} \oplus (\textbf{r}_{i+1} \odot \textbf{r}_{i+1})=(1,r_{i+1,1}, \ldots ,r_{i+1,N},r^2_{i+1,1}, \ldots ,r^2_{i+1,N})^T=\widetilde{\textbf{r}}_{i+1} \in \mathbb {R}^{2N+1} \end{aligned}$$

where $\widetilde{\textbf{r}}_{i+1} \in \mathbb {R}^{\widetilde{N}}$ defines the expanded feature vector with dimension $\widetilde{N}$ . This vector is then mapped with a readout matrix $\textbf{W}_{out}$ onto the desired output target $\textbf{y}_{i+1}$. During the training process, this mapping is optimized. In the training phase of the NG-RC, the input training data $\textbf{X}$ of length T is transformed into the feature matrix

$$\begin{aligned} \textbf{R}=\textbf{q}_{[O_{states}]}(\textbf{P}^{[O]}(\textbf{L}^{s}_{k}(\textbf{X}))) \end{aligned}$$

(3)

accordingly. Note that, due to the k and s values, a warm-up time of $\delta t=ks$ is needed, where entries of the feature matrix at time $t < \delta t$ are not defined. Consequently, the output target matrix $\textbf{Y}$ needs to be adjusted. The output target matrix $\textbf{Y}$ is defined in the scope of this work as

$$\begin{aligned} \textbf{Y}=(\Delta \textbf{x}_{\delta t+1}, \,\ldots \,, \Delta \textbf{x}_T)^T \end{aligned}$$

(4)

with $\Delta \textbf{x}_i =\textbf{x}_i-\textbf{x}_{i-1}$, such that the mapping is optimized to fulfill

$$\begin{aligned} \textbf{x}_{i+1}=\textbf{x}_i+\textbf{W}_{out} \widetilde{\textbf{r}}_{i+1}. \end{aligned}$$

(5)
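The construction of the feature matrix and the increment targets in Eqs. (2) to (5) can be sketched as follows. This is an illustrative reading, not the authors' code: all function names are hypothetical, time is stored along the columns of `R` and `Y` (an assumption chosen so that the matrix products in the ridge regression formula are dimensionally consistent), and the warm-up length is taken as $\delta t=ks$ as stated in the text.

```python
import numpy as np
from itertools import combinations_with_replacement

def time_shift(X, i, k, s):
    """L_k^s: concatenate X[i] with the k-1 earlier points spaced s steps apart."""
    return np.concatenate([X[i - j * s] for j in range(k)])

def poly_dictionary(v, orders):
    """P^[O]: unique monomials of the given orders of the entries of v."""
    return np.array([np.prod(v[list(c)]) for o in orders
                     for c in combinations_with_replacement(range(len(v)), o)])

def q_operator(r, o_states):
    """q_[O_states]: concatenate element-wise powers of r; order 0 adds a bias of 1."""
    return np.concatenate([np.ones(1) if o == 0 else r ** o for o in o_states])

def build_training_matrices(X, k, s, orders, o_states):
    """Feature matrix R (features x time) and increment targets Y (dims x time)."""
    warmup = k * s                       # warm-up time delta_t = k*s from the text
    idx = range(warmup, len(X) - 1)      # last point has no successor to predict
    R = np.stack([q_operator(poly_dictionary(time_shift(X, i, k, s), orders),
                             o_states) for i in idx], axis=1)
    Y = np.stack([X[i + 1] - X[i] for i in idx], axis=1)   # Delta x_{i+1}
    return R, Y

# toy input: 10 two-dimensional points (values are arbitrary placeholders)
X_demo = np.arange(20.0).reshape(10, 2)
R_demo, Y_demo = build_training_matrices(X_demo, k=2, s=1,
                                         orders=(1, 2), o_states=(0, 1, 2))
print(R_demo.shape, Y_demo.shape)
```

With $d=2$, $k=2$ and orders 1 and 2, the polynomial dictionary has $4+10=14$ entries, so the expanded feature vector has $2\cdot 14+1=29$ entries.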

The readout matrix $\textbf{W}_{out}$ is learned via ridge regression by optimizing

$$\begin{aligned} \textbf{W}_{out}=\textbf{Y}\textbf{R}^T(\textbf{R}\textbf{R}^T+\beta \textbf{I})^{-1}. \end{aligned}$$

(6)

Matrix $\textbf{I}$ is an identity matrix, and $\beta$ is the regression parameter. In this setup, the NG-RC is optimized to become a one-step-ahead integrator that drives the trajectory according to

$$\begin{aligned} \textbf{x}_{i+1}=\textbf{x}_i+\textbf{W}_{out} \textbf{q}_{[O_{states}]}(\textbf{P}^{[O]}(\textbf{L}^s_k(\textbf{x}_i))). \end{aligned}$$

(7)
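A compact sketch of the training step in Eq. (6) and the autonomous iteration in Eq. (7) is given below. It is tested on a toy system that is not part of the paper, a planar rotation whose increments are exactly linear in the state, so that the prediction error can be checked easily. All names and the values of `alpha`, `beta`, `k`, `s` and the orders are assumptions for illustration; the linear system is solved with `numpy.linalg.solve` instead of forming the inverse explicitly.

```python
import numpy as np
from itertools import combinations_with_replacement

def features(X, i, k, s, orders, o_states):
    """q_[O_states](P^[O](L_k^s(x_i))) for the point X[i]."""
    v = np.concatenate([X[i - j * s] for j in range(k)])           # time shifts
    r = np.array([np.prod(v[list(c)]) for o in orders
                  for c in combinations_with_replacement(range(len(v)), o)])
    return np.concatenate([np.ones(1) if o == 0 else r ** o for o in o_states])

def train(X, k, s, orders, o_states, beta):
    """Ridge regression readout W_out = Y R^T (R R^T + beta I)^-1, Eq. (6)."""
    idx = range(k * s, len(X) - 1)
    R = np.stack([features(X, i, k, s, orders, o_states) for i in idx], axis=1)
    Y = np.stack([X[i + 1] - X[i] for i in idx], axis=1)
    A = R @ R.T + beta * np.eye(R.shape[0])          # symmetric, so solve A W^T = R Y^T
    return np.linalg.solve(A, R @ Y.T).T

def predict(W_out, history, n_steps, k, s, orders, o_states):
    """Iterate x_{i+1} = x_i + W_out * features(x_i), Eq. (7), from known history."""
    traj = [np.asarray(x, dtype=float) for x in history]
    for _ in range(n_steps):
        r = features(traj, len(traj) - 1, k, s, orders, o_states)
        traj.append(traj[-1] + W_out @ r)
    return np.array(traj[len(history):])

# toy data (assumption): rotation by a fixed angle per step
alpha = 0.1
A_rot = np.array([[np.cos(alpha), -np.sin(alpha)],
                  [np.sin(alpha),  np.cos(alpha)]])
data = [np.array([1.0, 0.0])]
for _ in range(299):
    data.append(A_rot @ data[-1])
data = np.array(data)
train_X, test_X = data[:200], data[200:]

cfg = dict(k=2, s=1, orders=(1, 2), o_states=(0, 1))
W_out = train(train_X, beta=1e-6, **cfg)
pred = predict(W_out, train_X[-2:], n_steps=100, **cfg)
max_err = np.max(np.abs(pred - test_X))
print(W_out.shape, max_err)
```

The prediction needs at least $(k-1)s+1$ known points as history, which is why the last two training points are passed to `predict`.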

Multifunctionality with input channel

Multifunctionality setup

To include the data of n trajectories into the training process of the NG-RC, the feature matrix of every trajectory $\textbf{X}_m$ for $m=1,...,n$ is calculated with

$$\begin{aligned} \textbf{R}_m=\textbf{q}_{[O_{states}]}(\textbf{P}^{[O]}(\textbf{L}^{s}_{k}(\textbf{X}_{m}))). \end{aligned}$$

(8)

The resulting feature matrices are concatenated to

$$\begin{aligned} \textbf{R}_M=\textbf{R}_1 \oplus \textbf{R}_2 \,\ldots \, \oplus \textbf{R}_n. \end{aligned}$$

(9)

The output target matrix for each trajectory $\textbf{X}_m$ must also be concatenated to

$$\begin{aligned} \textbf{Y}_M=\textbf{Y}_1 \oplus \textbf{Y}_2 \,\ldots \, \oplus \textbf{Y}_n. \end{aligned}$$

(10)

This way, the identical training routine can be applied so that

$$\begin{aligned} \textbf{W}_{out}=\textbf{Y}_M\textbf{R}_M^T(\textbf{R}_M\textbf{R}_M^T+\beta \textbf{I})^{-1} \end{aligned}$$

(11)

is optimized via ridge regression. Provided the training is successful, the $\textbf{W}_{out}$ can be used to predict the different trajectories

$$\begin{aligned} \textbf{x}_{m,i+1}=\textbf{x}_{m,i}+\textbf{W}_{out} \textbf{q}_{[O_{states}]}(\textbf{P}^{[O]}(\textbf{L}^s_k(\textbf{x}_{m,i}))). \end{aligned}$$

(12)

Multifunctionality setup with input channel

In the scope of this work, however, we use this architecture to modulate the bifurcation parameter for multiple stationary dynamics of a system into the feature vector so that the NG-RC can be tested on predicting the dynamics for new and unseen bifurcation parameters. To this end, for every stationary dynamic $\textbf{X}_m$ in the training data, determined by its bifurcation parameter $\theta _m$, we add to each element in the corresponding feature vector the bifurcation parameter $\theta _m$ multiplied by a scaling parameter $\gamma$,

$$\begin{aligned} \textbf{R}_m=\textbf{q}_{[O_{states}]}(\textbf{P}^{[O]}(\textbf{L}^{s}_{k}(\textbf{X}_{m}))+\gamma \theta _m). \end{aligned}$$

(13)

This concept can be trained similarly by optimizing Eq. (11). The NG-RC then drives the trajectory according to

$$\begin{aligned} \textbf{x}_{i+1}=\textbf{x}_i+\textbf{W}_{out} \textbf{q}_{[O_{states}]}(\textbf{P}^{[O]}(\textbf{L}^s_k(\textbf{x}_i))+\gamma \theta ). \end{aligned}$$

(14)

In addition, this structure allows the bifurcation parameter to be changed at every prediction step,

$$\begin{aligned} \textbf{x}_{i+1}=\textbf{x}_i+\textbf{W}_{out} \textbf{q}_{[O_{states}]}(\textbf{P}^{[O]}(\textbf{L}^s_k(\textbf{x}_i))+\gamma \theta _i). \end{aligned}$$

(15)

Provided that the trained architecture can predict the dynamical behavior of the system for various unseen bifurcation parameters successfully, i.e., reconstruct its bifurcation diagram, a reasonable motivation for simulating non-stationary processes can be derived from Eq. (15).
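The parameter channel of Eqs. (13) to (15) can be illustrated with a deliberately simple toy system that is not taken from the paper: the Euler map of $\dot{x}=-(x^2+\theta )$, which has a stable fixed point only for $\theta <0$ and tips at $\theta =0$. This system is chosen as an assumption because, for $\gamma =1$, its increment is an exact linear function of the features $x^2+\gamma \theta$, so extrapolation to unseen $\theta$ is exact up to the regularization. The systems studied in the paper are not of this form, and there $\gamma$ has to be tuned. All names and numerical values below are hypothetical.

```python
import numpy as np

DT = 0.05     # Euler step of the toy map (assumption)
GAMMA = 1.0   # scaling parameter; exact for this toy system only

def toy_step(x, theta):
    """Euler map of dx/dt = -(x^2 + theta)."""
    return x - DT * (x ** 2 + theta)

def features(x, theta, gamma=GAMMA):
    """q_[0,1](P^[1,2](x) + gamma*theta) for a scalar state with k = 1."""
    r = np.array([x, x ** 2]) + gamma * theta   # parameter added to every entry
    return np.concatenate([np.ones(1), r])

def simulate(x0, thetas):
    """Ground-truth trajectory for a sequence of parameter values."""
    xs = [x0]
    for th in thetas:
        xs.append(toy_step(xs[-1], th))
    return np.array(xs)

# training: one trajectory per fixed parameter value, all far from the tipping point
R_blocks, Y_blocks = [], []
for th in [-1.0, -0.8, -0.6]:
    xs = simulate(2.0, [th] * 200)
    R_blocks.append(np.stack([features(x, th) for x in xs[:-1]], axis=1))
    Y_blocks.append(np.diff(xs)[None, :])
R_M = np.concatenate(R_blocks, axis=1)   # Eq. (9): concatenate along time
Y_M = np.concatenate(Y_blocks, axis=1)   # Eq. (10)

beta = 1e-8
W_out = np.linalg.solve(R_M @ R_M.T + beta * np.eye(R_M.shape[0]),
                        R_M @ Y_M.T).T   # Eq. (11)

def predict(x0, thetas):
    """Eq. (15): the parameter may change at every prediction step."""
    xs = [x0]
    for th in thetas:
        xs.append(xs[-1] + (W_out @ features(xs[-1], th)).item())
    return np.array(xs)

# unseen parameters: hold theta = -0.2, then switch across the tipping point to 0.1
theta_path = np.concatenate([np.full(100, -0.2), np.full(120, 0.1)])
x_start = np.sqrt(0.2)                   # fixed point for theta = -0.2
pred = predict(x_start, theta_path)
truth = simulate(x_start, theta_path)
max_err = np.max(np.abs(pred - truth))
print(max_err, pred[-1])
```

After the switch the state leaves the former fixed point and becomes negative, which the trained readout reproduces even though no training sample had $\theta >-0.6$. The horizon is kept short on purpose, since the toy system diverges in finite time once $\theta >0$.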

Lyapunov exponent

Because chaotic systems are sensitive to initial conditions, evaluating dynamical predictions only on their deviation from the ground truth, i.e., on their short-time behavior, cannot capture essential features of dynamical systems. To determine the systematic behavior of a dynamical system, it is necessary to determine the long-term properties of the trajectory. These are referred to as the statistical climate of the system, and measuring them indicates how chaotic or periodic the system is. Lyapunov exponents $\lambda _{i}$ measure the temporal complexity of the dynamical system by measuring the average divergence rate of nearby points in phase space. This characterizes the sensitivity to initial conditions for each dimension i and quantifies the time scale on which the system becomes unpredictable^28,29. If at least one Lyapunov exponent is positive, the system is considered chaotic. The magnitude of the largest Lyapunov exponent $\lambda _{max}$ can then be taken to measure the degree of chaoticity the system exhibits. In the context of this work, the Rosenstein algorithm is used to calculate the largest Lyapunov exponent³⁰.
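To illustrate how the sign of the largest Lyapunov exponent separates chaotic from periodic behavior, the following sketch uses the logistic map, a system that does not appear in the paper. It is not the Rosenstein algorithm: for a one-dimensional map with a known derivative, the exponent can be estimated directly as the orbit average of $\ln |f'(x)|$, whereas the Rosenstein algorithm estimates it from the divergence of nearest neighbors in a measured time series. The function name and all numerical settings are assumptions.

```python
import math

def logistic_lyapunov(r, x0=0.2, n_transient=1000, n_steps=20000):
    """Estimate the Lyapunov exponent of x -> r*x*(1-x) as the mean of ln|f'(x)|."""
    x = x0
    for _ in range(n_transient):          # discard the transient
        x = r * x * (1.0 - x)
    total = 0.0
    for _ in range(n_steps):
        total += math.log(abs(r * (1.0 - 2.0 * x)))   # f'(x) = r*(1 - 2x)
        x = r * x * (1.0 - x)
    return total / n_steps

lam_chaotic = logistic_lyapunov(4.0)    # analytical value for r = 4 is ln 2
lam_periodic = logistic_lyapunov(3.2)   # stable period-2 orbit
print(lam_chaotic, lam_periodic)
```

A positive estimate indicates exponential divergence of nearby trajectories, while a negative one indicates convergence onto a periodic orbit.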

Power system model

Model equations

The model consists of four ordinary differential equations.

$$\begin{aligned} \dot{\delta }_m= & {} \omega , \end{aligned}$$

(16)

$$\begin{aligned} M\dot{\omega }= & {} -d_{m}\omega +P_m-E_mY_m\sin (\delta _m-\delta )V, \end{aligned}$$

(17)

$$\begin{aligned} K_{qw}\dot{\delta }= & {} -K_{qv2}V^2-K_{qv}V+Q(\delta _m,\delta ,V)-Q_0-Q_1, \end{aligned}$$

(18)

$$\begin{aligned} TK_{qw}K_{pv}\dot{V}= & {} K_{pw}K_{qv2}V^2 +(K_{pw}K_{qv}-K_{qw}K_{pv})V +K_{qw}[P(\delta _m,\delta ,V)-P_0-P_1]\nonumber \\{} & {} -K_{pw}[Q(\delta _m,\delta ,V)-Q_0-Q_1] \end{aligned}$$

(19)

where

$$\begin{aligned} P(\delta _m,\delta ,V)=&-E'_0Y'_0V\sin (\delta )+E_mY_mV\sin (\delta _m-\delta ) , \\ Q(\delta _m,\delta ,V)=&-E'_0Y'_0V\cos (\delta ) -(Y'_0+Y_m)V^2+E_mY_mV\cos (\delta _m-\delta ). \end{aligned}$$

The real power demand P and the reactive power demand Q of the system appear in the differential equations of the load voltage V and the motor frequency $\delta$. The variable ${\delta }_m$ describes the angle dynamics between two generators and $\omega$ the speed of a generator rotor. For a more technical description, we suggest the paper by Dobson et al.²⁶. Following the model parameterization used for the traditional reservoir computing approach¹⁶,

$$\begin{aligned} E'_0= & {} \frac{E_0}{(1+C^2Y_0^{-2}-2CY_0^{-1}\cos (\theta _0))^{\frac{1}{2}}}, \end{aligned}$$

(20)

$$\begin{aligned} Y'_0= & {} Y_0(1+C^2Y_0^{-2}-2CY_0^{-1}\cos (\theta _0))^{\frac{1}{2}}, \end{aligned}$$

(21)

$$\begin{aligned} \theta '_0= & {} \theta _0+\tan ^{-1} \left( \frac{CY_0^{-1}\sin (\theta _0)}{1-CY_0^{-1}\cos (\theta _0)}\right) \end{aligned}$$

(22)

are set as constants and $K_{pw}=0.4$, $K_{pv}=0.3$, $K_{qw}=-0.03$, $K_{qv}=-2.8$, $K_{qv2}=2.1$, $T=8.5$, $P_0=0.6$, $Q_0=0.3$, $P_1=0$, $Y_0=3.33$, $Y_m=5$, $P_m=1$, $d_m=0.05$, $\theta _0=0$, $E_m=1.05$, $M=0.01464$, $C=3.5$, $E_0=1$, $Q_0=1.3$.

$Q_1$ is taken as the bifurcation parameter. It determines the load reactive power demand of the system. The model bifurcation diagram was created using Runge-Kutta 4 (RK4), starting each time from $\textbf{x}_0=(\delta _{m,0},\omega _0,\delta _0,V_0)^T=(0.17,0.05,0.05,0.83)^T$ for $T=10000$ time steps with time step size $\Delta t=0.05$ and a bifurcation parameter step size of $\Delta Q_1=0.000001$.

NG-RC architecture

The training data is generated identically for each system parameter in

$$\begin{aligned} \textbf{Q}^{train}_1=\,&[2.98953, 2.98956, 2.98960, 2.98964, 2.98967, 2.98969, 2.98975]. \end{aligned}$$

The following NG-RC architecture

$$\begin{aligned} \textbf{R}_{Q_{1,m}}=\textbf{q}_{[0,1,2,3]}(\textbf{P}^{[1,2,3]}(\textbf{L}^{2}_{2}(\textbf{X}_{m}))+\gamma Q^{train}_{1,m}), \end{aligned}$$

(23)

is used and trained with a regression parameter of $\beta =10^{-8}$. The expanded feature vectors have dimension $\widetilde{N}=493$. The warm-up times of length $\delta t=4$ are simulated with RK4.
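The stated feature dimension can be checked by a short count. The reading of the operators used here is an assumption inferred from the notation and the reported dimensions, not a restatement of their definitions: $\textbf{L}^{2}_{2}$ is taken to stack two delayed copies of the four-dimensional state, $\textbf{P}^{[1,2,3]}$ to collect all unique monomials of orders 1 to 3 in these entries, and $\textbf{q}_{[0,1,2,3]}$ to contribute one constant entry plus the element-wise powers 1 to 3 of the parameter-shifted vector. The number of unique monomials of order p in n variables is $\binom{n+p-1}{p}$, which gives

$$\begin{aligned} n_{\textbf{L}}&=4\cdot 2=8, \\ n_{\textbf{P}}&=\binom{8}{1}+\binom{9}{2}+\binom{10}{3}=8+36+120=164, \\ \widetilde{N}&=1+3\,n_{\textbf{P}}=493, \end{aligned}$$

in agreement with the value quoted above. Under the same assumed reading, the warm-up length $\delta t=4$ equals the product of the number of delays and the delay spacing.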

Chaotic food chain model

Model equations

The three-species food chain model with a resource density R, a consumer density C, and a predator density P is modeled as follows

$$\begin{aligned} \dot{R}= & {} R\left( 1-\frac{R}{K}\right) -\frac{x_cy_cCR}{R+R_0}, \end{aligned}$$

(24)

$$\begin{aligned} \dot{C}= & {} x_cC\left( \frac{y_cR}{R+R_0}-1\right) -\frac{x_py_pPC}{C+C_0}, \end{aligned}$$

(25)

$$\begin{aligned} \dot{P}= & {} x_pP\left( \frac{y_pC}{C+C_0}-1\right) \end{aligned}$$

(26)

with $x_c=0.4$, $y_c=2.009$, $x_p=0.08$, $y_p=2.876$, $R_0=0.16129$, $C_0=0.5$ and the resource-carrying capacity K is taken as the bifurcation parameter¹⁶. The model bifurcation diagram was created using RK4, starting each time from the initial state $\textbf{x}_0=(R(0),C(0),P(0))^T=(0.6,0.35,0.9)^T$ for $T=25000$ time steps with time step size $\Delta t=0.1$ and a bifurcation parameter step size of $\Delta K=0.00025$.
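A minimal sketch of how a single trajectory of Eqs. (24)–(26) can be generated with a classical RK4 scheme is given below. The parameter values, the initial state, and the step size are the ones listed above; the function names, the constant names `R_HALF` and `C_HALF` (used for the model constants $R_0$ and $C_0$ to keep them apart from the initial state), the choice $K=0.94$ from the training range, and the shortened run of 2000 steps are illustrative assumptions.

```python
import numpy as np

# Model constants as listed in the text.
X_C, Y_C = 0.4, 2.009
X_P, Y_P = 0.08, 2.876
R_HALF, C_HALF = 0.16129, 0.5  # R_0 and C_0 of the model equations


def food_chain_rhs(state, K):
    """Right-hand side of the three-species food chain model, Eqs. (24)-(26)."""
    R, C, P = state
    dR = R * (1.0 - R / K) - X_C * Y_C * C * R / (R + R_HALF)
    dC = X_C * C * (Y_C * R / (R + R_HALF) - 1.0) - X_P * Y_P * P * C / (C + C_HALF)
    dP = X_P * P * (Y_P * C / (C + C_HALF) - 1.0)
    return np.array([dR, dC, dP])


def rk4_trajectory(rhs, x0, K, dt, n_steps):
    """Integrate dx/dt = rhs(x, K) with the classical fourth-order Runge-Kutta scheme."""
    traj = np.empty((n_steps + 1, len(x0)))
    traj[0] = x0
    for n in range(n_steps):
        x = traj[n]
        k1 = rhs(x, K)
        k2 = rhs(x + 0.5 * dt * k1, K)
        k3 = rhs(x + 0.5 * dt * k2, K)
        k4 = rhs(x + dt * k3, K)
        traj[n + 1] = x + dt / 6.0 * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return traj


# Initial state and step size from the text; K and n_steps are illustrative.
x0 = np.array([0.6, 0.35, 0.9])
traj = rk4_trajectory(food_chain_rhs, x0, K=0.94, dt=0.1, n_steps=2000)
print(traj.shape, traj.min(axis=0), traj.max(axis=0))
```

Repeating such a run over a grid of K values and recording, for example, the local extrema of one variable after a transient is one common way to assemble a bifurcation diagram.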

NG-RC architecture

The training data is generated identically for each parameter in

$$\begin{aligned} \textbf{K}^{train}=[0.92, 0.925, 0.93, 0.935, 0.94, 0.945, 0.95]. \end{aligned}$$

The following NG-RC architecture

$$\begin{aligned} \textbf{R}_{K_{m}}=\textbf{q}_{[0,1,2,3]}(\textbf{P}^{[1,2]}(\textbf{L}^{4}_{4}(\textbf{X}_{m}))+\gamma K^{train}_{m}) \end{aligned}$$

(27)

is applied. The expanded feature vectors have dimension $\widetilde{N}=271$. A regression parameter of $\beta =10^{-3}$ is used. The warm-up times of length $\delta t=16$ are simulated with RK4.
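To make the nested notation of Eq. (27) more concrete, the following sketch builds a feature matrix under an assumed reading of the operators. This reading is inferred from the notation and from the reported dimension, and is not taken from the authors' code: $\textbf{L}^{4}_{4}$ stacks four copies of the state delayed by multiples of four steps, $\textbf{P}^{[1,2]}$ forms all unique monomials of orders 1 and 2, the term $\gamma K$ is added to every entry, and $\textbf{q}_{[0,1,2,3]}$ concatenates a constant with the element-wise powers 1 to 3. The function names, the value of `gamma`, and the random placeholder data are hypothetical.

```python
import numpy as np
from itertools import combinations_with_replacement


def delay_embed(X, k, s):
    """Stack k copies of the state, delayed by 0, s, ..., (k - 1) * s steps."""
    T = X.shape[0]
    start = (k - 1) * s
    return np.hstack([X[start - j * s: T - j * s] for j in range(k)])


def unique_monomials(L, orders):
    """All unique monomials of the given orders in the columns of L."""
    cols = []
    for p in orders:
        for combo in combinations_with_replacement(range(L.shape[1]), p):
            cols.append(np.prod(L[:, list(combo)], axis=1))
    return np.column_stack(cols)


def ngrc_features(X, param, k, s, orders, q_orders, gamma):
    """Assumed reading of q_[...](P^[...](L^k_s(X)) + gamma * param)."""
    shifted = unique_monomials(delay_embed(X, k, s), orders) + gamma * param
    blocks = []
    for q in q_orders:
        if q == 0:
            blocks.append(np.ones((shifted.shape[0], 1)))  # single constant entry
        else:
            blocks.append(shifted ** q)  # element-wise power
    return np.hstack(blocks)


# Placeholder data standing in for a (time steps x 3) food chain trajectory.
rng = np.random.default_rng(0)
X = rng.random((100, 3))
features = ngrc_features(X, param=0.94, k=4, s=4, orders=(1, 2),
                         q_orders=(0, 1, 2, 3), gamma=0.1)
print(features.shape)
```

With these assumptions the number of columns is $1+3\cdot (12+78)=271$, which matches the stated $\widetilde{N}$; the agreement supports the assumed reading but does not prove it.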

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

References

1. Lorenz, E. N. Deterministic nonperiodic flow. J. Atmos. Sci. 20, 130–141 (1963).
2. Brunton, S. L., Proctor, J. L. & Kutz, J. N. Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proc. Natl. Acad. Sci. 113, 3932–3937 (2016).
3. Ma, H., Haluszczynski, A., Prosperino, D. & Räth, C. Identifying causality drivers and deriving governing equations of nonlinear complex systems. Chaos: Interdiscip. J. Nonlinear Sci. 32, 103128 (2022).
4. Huang, Y., Mabrouk, Y., Gompper, G. & Sabass, B. Sparse inference and active learning of stochastic differential equations from data. Sci. Rep. 12, 21691 (2022).
5. Lukoševičius, M. & Jaeger, H. Reservoir computing approaches to recurrent neural network training. Comput. Sci. Rev. 3, 127–149 (2009).
6. Jaeger, H. & Haas, H. Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication. Science 304, 78–80 (2004).
7. Bompas, S., Georgeot, B. & Guéry-Odelin, D. Accuracy of neural networks for the simulation of chaotic dynamics: Precision of training data vs precision of the algorithm. Chaos: Interdiscip. J. Nonlinear Sci. 30, 113118 (2020).
8. Chattopadhyay, A., Hassanzadeh, P. & Subramanian, D. Data-driven predictions of a multiscale Lorenz 96 chaotic system using machine-learning methods: Reservoir computing, artificial neural network, and long short-term memory network. Nonlinear Process. Geophys. 27, 373–389 (2020).
9. Gauthier, D. J., Bollt, E., Griffith, A. & Barbosa, W. A. Next generation reservoir computing. Nat. Commun. 12, 1–8 (2021).
10. Lenton, T. M. et al. Tipping elements in the Earth’s climate system. Proc. Natl. Acad. Sci. 105, 1786–1793 (2008).
11. Irrgang, C. et al. Towards neural earth system modelling by integrating artificial intelligence in earth system science. Nat. Mach. Intell. 3, 667–674 (2021).
12. Lim, S. H., Theo Giorgini, L., Moon, W. & Wettlaufer, J. S. Predicting critical transitions in multiscale dynamical systems using reservoir computing. Chaos: Interdiscip. J. Nonlinear Sci. 30, 123126 (2020).
13. Patel, D., Canaday, D., Girvan, M., Pomerance, A. & Ott, E. Using machine learning to predict statistical properties of non-stationary dynamical processes: System climate, regime transitions, and the effect of stochasticity. Chaos: Interdiscip. J. Nonlinear Sci. 31, 033149 (2021).
14. Patel, D. & Ott, E. Using machine learning to anticipate tipping points and extrapolate to post-tipping dynamics of non-stationary dynamical systems. arXiv:2207.00521 (2022).
15. Kim, J. Z., Lu, Z., Nozari, E., Pappas, G. J. & Bassett, D. S. Teaching recurrent neural networks to infer global temporal structure from local examples. Nat. Mach. Intell. 3, 316–323 (2021).
16. Kong, L.-W., Fan, H.-W., Grebogi, C. & Lai, Y.-C. Machine learning prediction of critical transition and system collapse. Phys. Rev. Res. 3, 013090 (2021).
17. Kong, L.-W., Weng, Y., Glaz, B., Haile, M. & Lai, Y.-C. Reservoir computing as digital twins for nonlinear dynamical systems. Chaos: Interdiscip. J. Nonlinear Sci. 33 (2023).
18. Flynn, A., Tsachouridis, V. A. & Amann, A. Multifunctionality in a reservoir computer. Chaos: Interdiscip. J. Nonlinear Sci. 31, 013125 (2021).
19. Flynn, A., Herteux, J., Tsachouridis, V. A., Räth, C. & Amann, A. Symmetry kills the square in a multifunctional reservoir computer. Chaos: Interdiscip. J. Nonlinear Sci. 31, 073122 (2021).
20. Flynn, A. et al. Exploring the limits of multifunctionality across different reservoir computers. In 2022 International Joint Conference on Neural Networks (IJCNN), 1–8 (IEEE, 2022).
21. Herteux, J. & Räth, C. Breaking symmetries of the reservoir equations in echo state networks. Chaos: Interdiscip. J. Nonlinear Sci. 30, 123142 (2020).
22. Lu, Z. & Bassett, D. S. Invertible generalized synchronization: A putative mechanism for implicit learning in neural systems. Chaos: Interdiscip. J. Nonlinear Sci. 30, 063133 (2020).
23. Barbosa, W. A. & Gauthier, D. J. Learning spatiotemporal chaos using next-generation reservoir computing. Chaos: Interdiscip. J. Nonlinear Sci. 32 (2022).
24. Gauthier, D. J., Fischer, I. & Röhm, A. Learning unseen coexisting attractors. Chaos: Interdiscip. J. Nonlinear Sci. 32 (2022).
25. Haluszczynski, A., Koeglmayr, D. & Räth, C. Controlling dynamical systems to complex target states using machine learning: Next-generation vs. classical reservoir computing. In 2023 International Joint Conference on Neural Networks (IJCNN), 1–7 (IEEE, 2023).
26. Dobson, I. & Chiang, H.-D. Towards a theory of voltage collapse in electric power systems. Syst. Control Lett. 13, 253–262 (1989).
27. McCann, K. & Yodzis, P. Nonlinear dynamics and population disappearances. Am. Nat. 144, 873–879 (1994).
28. Wolf, A., Swift, J. B., Swinney, H. L. & Vastano, J. A. Determining Lyapunov exponents from a time series. Physica D 16, 285–317 (1985).
29. Shaw, R. Strange attractors, chaotic behavior, and information flow. Zeitschrift für Naturforschung A 36, 80–112 (1981).
30. Rosenstein, M. T., Collins, J. J. & De Luca, C. J. A practical method for calculating largest Lyapunov exponents from small data sets. Physica D 65, 117–134 (1993).

Acknowledgements

D.K. gratefully acknowledges the funding provided by Allianz Global Investors (AGI).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

German Aerospace Center (DLR), Institute for AI Safety and Security, 89081, Ulm, Germany
Daniel Köglmayr & Christoph Räth

Contributions

C.R. initiated the research. D.K. and C.R. designed the study. D.K. conducted the calculations. C.R. and D.K. interpreted and evaluated the findings. All authors reviewed the manuscript.

Corresponding author

Correspondence to Daniel Köglmayr.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Köglmayr, D., Räth, C. Extrapolating tipping points and simulating non-stationary dynamics of complex systems using efficient machine learning. Sci Rep 14, 507 (2024). https://doi.org/10.1038/s41598-023-50726-9

Received: 14 September 2023
Accepted: 23 December 2023
Published: 04 January 2024
DOI: https://doi.org/10.1038/s41598-023-50726-9
