Introduction

Tasks where one time series needs to be transformed into another include time series forecasting1,2, pattern generation3,4 and pattern recognition5,6,7,8. In online time series processing both the given data and the desired transformed data are functions of time, which separates it from approaches such as first recording the data and processing it later. Instead, the objective is to realize the time dependent function which, for a given timestep and the input time series up to that step, returns the corresponding element of the output time series. Such tasks are also known as temporal tasks. When successful, online time series processing facilitates, e.g., the processing of arbitrarily long sequences of data since the inputs are continuously processed into outputs. This is possible in particular for tasks that can be solved by so called fading memory functions, which are functions well approximated by continuous functions of only a finite number of past inputs9. Under typically mild conditions the input time series can be used to drive random dynamical systems such that their internal variables become such fading memory functions, which can then be combined to approximate the desired output by training a simple, even linear readout function. This is known as reservoir computing (RC), a powerful approach to solving temporal tasks thanks to its remarkably low training cost10,11 combined with state-of-the-art performance12. Furthermore, classical or quantum physical systems are also amenable to being used as the dynamical system13,14,15, paving the way to harvesting computational power from essentially random physical systems with fading memory and complex dynamics. In RC such systems are usually called reservoirs.

After recent seminal works investigating the suitability of the transverse-field Ising model for RC purposes16,17, there has been a surge of interest in the quantum case in particular. Indeed, the initial model has been refined and analyzed in several ways18,19,20,21, whereas new proposals have introduced RC based on quantum circuits22,23, nuclear magnetic resonance (NMR) systems24 and continuous variable quantum systems25. The results have been promising, suggesting that both in the discrete and continuous variable case quantum reservoirs may have an advantage over their classical counterparts in terms of how rapidly the potential reservoir performance improves with size16,20,25. One of the biggest hurdles is in fact the extraction of the classical output from the quantum systems: not only does a single measurement reveal only a tiny amount of information about a quantum system in an unknown state, it also alters the state and therefore competes with the inputs in driving the reservoir dynamics. For certain special systems, such as NMR systems, an enormous number of copies of the reservoir is naturally available, which has been proposed to allow one to bypass the measurement back-action problem when collective input injections and measurements can be carried out24. In general, repeatedly initializing and subsequently measuring a quantum system extracts classical information out of it—for example, the value of an observable—however, such an approach is far from ideal for time series processing for two reasons. Firstly, carrying out the repetitions anew for every element in the output time series one wishes to learn introduces severe overhead, and secondly, it is hard to imagine how such a protocol could run in an online mode, continuously producing elements of the output time series. All in all, output extraction is a major challenge in exploiting the potential of quantum reservoirs for classical time series processing.

Here we lay down an alternative, RC inspired path that can fully harness the quantumness of the reservoir while largely sidestepping the measurement problem. Namely, we introduce online time series processing with random fading memory quantum systems where both the input and the desired output time series consist of quantum information. The main advantages over previous related work in quantum RC are two-fold: the role of quantumness is clearer as the tasks are by construction impossible for a classical reservoir, and no measurements are required after the reservoir has been trained since the output can remain quantum. That being said, the proposed scheme considers fundamentally different tasks and as such cannot replace quantum RC, and therefore does not solve the measurement problem of quantum RC either. Specifically, we consider random networks of interacting quantum harmonic oscillators as the reservoir. Taking inspiration from RC, we train only the interaction Hamiltonian between the network and the carriers of input information. The main difference with RC is how the output is formed: in RC it is a trained function of reservoir observables, whereas here the output is imprinted directly on the quantum systems acting as carriers of data. This also affects the training process, as will be seen.

We illustrate the possibilities of the proposed model with three different temporal tasks. The short term quantum memory (STQM) task is the quantum analog of the short term memory task26 commonly used as a benchmark in classical RC—the objective is to recall past inputs that are no longer available using the memory of the reservoir. Another common task is the channel equalization task27, where the input time series is transmitted through a noisy nonlinear channel that also mixes the time series with various echoes of itself, and the objective is to recover the original time series from the distorted one. Here we generalize it to inverting the transformation caused by a quantum channel. Despite being generalizations of classical tasks, the quantum cases will be seen to have notable differences. Furthermore, we introduce a task without a classical counterpart which we call the entangler. Here the objective is to create entanglement between different initially uncorrelated systems by letting each of them in turn interact once with the reservoir, but never with each other. We find that all these tasks can be solved using random untrained networks of interacting quantum harmonic oscillators; remarkably, not even the network initial state needs to be controlled. Finally, we briefly discuss partial generalizations, i.e. cases where only the input or only the output time series is quantum.

Results

The model

In RC a reservoir is a dynamical system that can be steered by an input time series to a trajectory in its state space determined by the inputs alone, i.e. its internal variables become completely determined by the input history at the limit of many inputs. If the variables can be monitored then the response of the reservoir at different timesteps can be post-processed to achieve a desired transformation from the input time series to an output time series. Importantly, for sufficiently complex reservoirs nontrivial transformations can be achieved by cheap post-processing, such as a linear combination of the variables.

Here the reservoir is a network of N unit mass quantum harmonic oscillators interacting with springlike couplings. Units are chosen such that \(\hbar =1\) and \(k_B=1\). Let \({\textbf{p}}^\top =\{p_1,p_2,\ldots ,p_N\}\) and \({\textbf{q}}^\top =\{q_1,q_2,\ldots ,q_N\}\) be the vectors of momentum and position operators of the oscillators. The reservoir Hamiltonian \(H_R\) is

$$\begin{aligned} H_R=\dfrac{{\textbf{p}}^\top {\textbf{p}}}{2}+\dfrac{{\textbf{q}}^\top (\varvec{\Delta }_{\varvec{\omega }}^2+{\textbf{L}}){\textbf{q}}}{2}, \end{aligned}$$
(1)

where the diagonal matrix \(\varvec{\Delta }_{\varvec{\omega }}\) holds the oscillator frequencies \(\varvec{\omega }^\top =\{\omega _1,\omega _2,\ldots ,\omega _N\}\) and the symmetric matrix \({{\textbf{L}}}\) has elements \({{\textbf{L}}}_{ij}=\delta _{ij}\sum _k g_{ik}-(1-\delta _{ij})g_{ij}\). Here \(g_{ij}\ge 0\) are interaction strengths between the reservoir oscillators. Aside from the oscillator frequencies, there is a one-to-one correspondence between \(H_R\) and weighted simple graphs. Indeed, \({{\textbf{L}}}\) can be interpreted as the Laplace matrix of such a graph, and a given graph with Laplace matrix \({{\textbf{L}}}\) defines \(H_R\) through Eq. (1).
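For concreteness, the construction of the potential matrix \(\varvec{\Delta }_{\varvec{\omega }}^2+{\textbf{L}}\) in Eq. (1) can be sketched in a few lines of NumPy. The snippet below is our own illustration; the variable names and the example size \(N=4\) are arbitrary, while the frequency and coupling intervals match the values used later in the numerical experiments.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 4                                  # number of reservoir oscillators
omega = np.full(N, 0.25)               # bare frequencies

# Symmetric coupling matrix with zero diagonal, g_ij drawn from [0, 0.2]
g = np.triu(rng.uniform(0.0, 0.2, (N, N)), k=1)
g = g + g.T

# Laplace matrix: L_ij = delta_ij * sum_k g_ik - (1 - delta_ij) * g_ij
L = np.diag(g.sum(axis=1)) - g

V = np.diag(omega**2) + L              # potential matrix of Eq. (1)
```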

We consider temporal quantum tasks (analogous to temporal tasks in RC), which we define in this work as follows. The input time series \({{\textbf{s}}}=\{\ldots ,\rho _{m-1}^{I},\rho _{m}^{I},\rho _{m+1}^{I},\ldots \}\) consists of quantum states \(\rho _{m}^{I}\) where m indicates the timestep. In general these can be states of multimode continuous variable quantum systems, however we assume that apart from their states the systems are identical, i.e. each system has the same Hamiltonian \(H_S\). The input time series is processed by the reservoir into output time series \({{\textbf{o}}}=\{\ldots ,\rho _{m-1}^{O},\rho _{m}^{O},\rho _{m+1}^{O},\ldots \}\) by letting each system in turn interact with the reservoir for some time \(\Delta t\) according to an interaction Hamiltonian \(H_I\) coupling every reservoir oscillator to every subsystem. The order of interactions is given by the timesteps. Consequently \({{\textbf{o}}}\) is the image of \({{\textbf{s}}}\) and reservoir initial conditions under a transformation induced by the full Hamiltonian \(H=H_R+H_S+H_I\) and the interaction time \(\Delta t\). In a temporal quantum task we attempt to realize a given transformation from \({{\textbf{s}}}\) to \({{\textbf{o}}}\) in this way. Besides the uncorrelated case one may also consider correlations between the systems at different timesteps, and we will return to this point later.

In the special case where \(H_S\) consists of M unit mass quantum harmonic oscillators and the interactions in \(H_I\) are springlike couplings, H has the same general form as \(H_R\). The transformation induced by H and \(\Delta t\) on the operators of the reservoir and input system is now linear and can be given in terms of a symplectic matrix \({{\textbf{S}}}\). Let \({{\textbf{x}}}^R_k\) be the form of the reservoir operators after the k-th input has been processed, let \({{\textbf{x}}}^I_k\) be the operators of the k-th input and let \({{\textbf{x}}}^O_k\) be the operators of the k-th output. Now

$$\begin{aligned} \begin{pmatrix} {{\textbf{x}}}^R_{k+1} \\ {{\textbf{x}}}^O_{k+1} \end{pmatrix}= {{\textbf{S}}} \begin{pmatrix} {{\textbf{x}}}^R_{k} \\ {{\textbf{x}}}^I_{k+1} \end{pmatrix}= \begin{pmatrix} {{\textbf{A}}} &{} {{\textbf{B}}} \\ {{\textbf{C}}} &{} {{\textbf{D}}} \end{pmatrix} \begin{pmatrix} {{\textbf{x}}}^R_k \\ {{\textbf{x}}}^I_{k+1} \end{pmatrix}, \end{aligned}$$
(2)

where the symplectic matrix has been divided into blocks such that \({{\textbf{A}}}\) is \(2N\times 2N\) and \({{\textbf{D}}}\) is \(2M\times 2M\). By iterating this equation we immediately get the form of both the reservoir and input modes for some timestep m:

$$\begin{aligned} {\left\{ \begin{array}{ll} {{\textbf{x}}}^R_m={{\textbf{A}}}^m{{\textbf{x}}}^R_0+\sum _{k=1}^m{{\textbf{A}}}^{m-k}{{\textbf{B}}}{{\textbf{x}}}^I_k,\\ {{\textbf{x}}}^O_m={{\textbf{C}}}{{\textbf{x}}}^R_{m-1}+{{\textbf{D}}}{{\textbf{x}}}^I_{m}, \end{array}\right. } \end{aligned}$$
(3)

where \({{\textbf{x}}}_0^R\) is the initial form of the reservoir modes. The form of the output modes for some timestep m as a function of \({{\textbf{x}}}_0^R\) and input history is then

$$\begin{aligned} \begin{aligned} {{\textbf{x}}}^O_m&={{\textbf{C}}}{{\textbf{A}}}^{m-1}{{\textbf{x}}}^R_0+{{\textbf{D}}}{{\textbf{x}}}^I_{m}+{{\textbf{C}}}\sum _{k=1}^{m-1}{{\textbf{A}}}^{m-k-1}{{\textbf{B}}}{{\textbf{x}}}^I_k \\&\approx {{\textbf{D}}}{{\textbf{x}}}^I_{m}+{{\textbf{C}}}\sum _{k=1}^{m-1}{{\textbf{A}}}^{m-k-1}{{\textbf{B}}}{{\textbf{x}}}^I_k \quad \text {when }\rho ({{\textbf{A}}})<1\text { and }m\gg 1, \end{aligned} \end{aligned}$$
(4)

where the first line is exact and the second line an approximation which holds when the spectral radius \(\rho ({{\textbf{A}}})\)—not to be confused with quantum states—is less than 1 and enough inputs have been processed. Given that the equations of motion are conveniently expressed in terms of the operators, in the rest of this manuscript we will simply write \({{\textbf{s}}}=\{\ldots ,{{\textbf{x}}}_{m-1}^{I},{{\textbf{x}}}_{m}^{I},{{\textbf{x}}}_{m+1}^{I},\ldots \}\) and \({{\textbf{o}}}=\{\ldots ,{{\textbf{x}}}_{m-1}^{O},{{\textbf{x}}}_{m}^{O},{{\textbf{x}}}_{m+1}^{O},\ldots \}\).

When \(\rho ({{\textbf{A}}})<1\) the output time series \({{\textbf{o}}}\) becomes independent of the initial conditions at the limit of an infinitely long input history, which is known as the echo state property1 in the RC literature. In fact, it can be shown25 that satisfying the spectral radius condition also gives the reservoir the so-called fading memory property9, which guarantees that \({{\textbf{x}}}^R_m\) and therefore \({{\textbf{x}}}^O_m\) become well-approximated by a continuous function of only a finite number of past inputs at the limit \(m\gg 1\). This not only ensures that the initial conditions can be ignored but also prevents any physical quantities from diverging: the reservoir state never leaves the state space as long as all input states are physical, not even in the limit \(m\rightarrow \infty \). At variance, if \(\rho ({{\textbf{A}}})\ge 1\) then, e.g., reservoir excitations may diverge. Finally, in the special case where \({{\textbf{A}}}\) is nilpotent with some index n there is a sudden death of reservoir memory in which \({{\textbf{x}}}^R_m\) is a function of exactly n previous inputs. As the index of a nilpotent matrix is always at most its order28, \(n\le 2N\).
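Since the map of Eq. (2) is linear, its action on, e.g., the first moments of the reservoir and input states can be iterated directly; a minimal sketch (our own illustration, not code from the original study) that also checks the spectral radius condition reads:

```python
import numpy as np

def spectral_radius(A):
    return np.abs(np.linalg.eigvals(A)).max()

def run(A, B, C, D, inputs, xR0):
    """Iterate Eq. (2) on first moments; both x^O_{k+1} and x^R_{k+1}
    are formed from the pre-interaction reservoir moments x^R_k."""
    assert spectral_radius(A) < 1, "echo state / fading memory violated"
    xR, outputs = xR0, []
    for xk in inputs:
        xR, xO = A @ xR + B @ xk, C @ xR + D @ xk   # both use the old xR
        outputs.append(xO)
    return outputs
```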

RC inspired quantum time series processing

The system given in Eq. (3) can be harnessed for RC by considering only \({{\textbf{x}}}^R_m\) when it is assumed that the states in \({{\textbf{s}}}\) are in fact functions of the elements of a classical time series25. When \(\rho ({{\textbf{A}}})<1\) there is fading memory and the observables of the reservoir become well-approximated by continuous functions of only a finite number of past inputs. Different transformations to a classical output time series can then be accomplished by training a simple function of the reservoir observables such as first moments, second moments or covariances of \({{\textbf{x}}}^R_m\). The resulting RC can be analyzed with contemporary RC theory since the latter is agnostic to the mechanism that creates the functions of the input. Importantly, \(H_R\) is not trained and can be random since \(\rho ({{\textbf{A}}})<1\) can typically be achieved by just tuning \(\Delta t\).

Figure 1

(a) A time series of quantum systems with operators \({{\textbf{x}}}^I\) is processed into another time series where the transformed operators are indicated by \({{\textbf{x}}}^O\). Each system interacts one after another with a random network of oscillators—later called the reservoir—with operators \({{\textbf{x}}}^R\). In general the reservoir state depends on the states of systems it has interacted with, in turn making \({{\textbf{x}}}^O\) a function of all previous \({{\textbf{x}}}^I\). Different transformations can be achieved by tuning only the interaction terms, indicated by dashed black lines. (b) In the short term quantum memory task the objective is to transform \({{\textbf{x}}}^I\) at timestep m into \({{\textbf{x}}}^I\) at timestep \(m-\tau \) where \(\tau \ge 0\) is a delay. This is possible when the sought state can be distilled out of the reservoir memory, stored in \({{\textbf{x}}}^R\). (c) In the quantum channel equalization task the input is transformed by a random, uncontrollable system with operators \({{\textbf{x}}}^{Ch}\) into the distorted input \({{\textbf{x}}}^D\). The reservoir is to recover the original input from the distorted one. (d) In the entangler task the reservoir is to entangle the systems in the time series. Like before the systems never directly interact and only one of them interacts with the reservoir at any given time. Here there is only one reservoir oscillator and only a part of the input and output time series is shown for simplicity, whereas entanglement is indicated by solid black lines.

The scheme for using the reservoir to process temporal quantum information instead is shown in Fig. 1a. Here \({{\textbf{s}}}\) itself will be the input while \({{\textbf{o}}}\) consists of quantum information encoded in \({{\textbf{x}}}^O_m\). Thanks to fading memory, each \({{\textbf{x}}}^O_m\) is completely determined by \({{\textbf{x}}}^I_i\) where \(i\le m\) and the initial reservoir state can be ignored. To achieve different transformations \({{\textbf{s}}}\mapsto {{\textbf{o}}}\), the matrix \({{\textbf{S}}}\) induced by \(H=H_R+H_S+H_I\) and \(\Delta t\) must be changed while preserving \(\rho ({{\textbf{A}}})<1\). Whereas the number of parameters in H is proportional to \((N+M)^2\), we take an approach inspired by RC and attempt to achieve online quantum time series processing by only training the NM interaction terms in \(H_I\), leaving both \(H_R\) and \(H_S\) fixed. We will provide strong numerical evidence that this is in practice enough to succeed in many different temporal quantum tasks, even with random \(H_R\).
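To make explicit how \(H_I\) enters \({{\textbf{S}}}\): for any quadratic Hamiltonian \(H=\frac{1}{2}{{\textbf{x}}}^\top {{\textbf{M}}}{{\textbf{x}}}\) the Heisenberg equations of motion give \({{\textbf{S}}}=e^{\varvec{\Omega }{{\textbf{M}}}\Delta t}\), where \(\varvec{\Omega }\) is the symplectic form. A sketch of this construction and of the block extraction follows; the code is our own, and the \((q,p)\) operator ordering and function names are our conventions rather than the paper's.

```python
import numpy as np
from scipy.linalg import expm

def propagator(V, dt):
    """Symplectic matrix S = expm(Omega M dt) for H = p.p/2 + q.V.q/2,
    in the operator ordering (q_1..q_n, p_1..p_n)."""
    n = V.shape[0]
    Omega = np.block([[np.zeros((n, n)), np.eye(n)],
                      [-np.eye(n), np.zeros((n, n))]])
    M = np.block([[V, np.zeros((n, n))],
                  [np.zeros((n, n)), np.eye(n)]])
    return expm(Omega @ M * dt)

def blocks(S, N, M_in):
    """Split S into the A, B, C, D blocks of Eq. (2), with the first N
    modes forming the reservoir and the remaining M_in modes the input."""
    n = N + M_in
    iR = np.r_[0:N, n:n + N]           # reservoir (q, p) indices
    iS = np.r_[N:n, n + N:2 * n]       # input (q, p) indices
    return (S[np.ix_(iR, iR)], S[np.ix_(iR, iS)],
            S[np.ix_(iS, iR)], S[np.ix_(iS, iS)])
```

In this picture, training amounts to optimizing the entries of V that couple reservoir and input rows while enforcing \(\rho ({{\textbf{A}}})<1\) on the extracted block.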

This result bears an uncanny similarity to the fact that in RC with fading memory systems it is enough to train only the final weights in the readout layer, even though the situations are quite different. Indeed, unlike in RC, here the reservoir dynamics cannot be separated from training since tuning \(H_I\) changes the dynamics. We are in fact not aware of any theoretical results that could be used to explain the phenomenon. In the following we provide a brief overview of the general purpose training process. See “Methods” for further details.

First the input time series \({{\textbf{s}}}\) is divided into three phases: the preparation, training and test phases. The role of the preparation phase is to get rid of the influence of the reservoir initial state. During the training phase the performance is monitored using a cost or objective function, which varies depending on the task but for a fixed input and reservoir is completely determined by \(H_I\); the specific forms will be introduced along with the respective tasks. The interaction Hamiltonian is varied to optimize this function using a simple stochastic function optimizer. The spectral radius condition is enforced by providing the optimizer with initial points only for which the condition is satisfied; during minimization, points that violate the condition can be expected to perform worse and are therefore discarded. Unless the time between inputs \(\Delta t\) is fixed by the task, training may be repeated for different choices of \(\Delta t\). In the test phase the trained \(H_I\) and the best \(\Delta t\) are used, and the reservoir output is collected to check the performance using a task dependent figure of merit. In this way the trained reservoir is exposed to new input and must be able to generalize beyond the specific inputs of the training phase to succeed. These are the results shown in the figures.

Examples of temporal quantum tasks

Parameter values used in numerical experiments

In all numerical experiments reported throughout this section we consider as the reservoir random completely connected networks of N identical oscillators with a bare frequency \(\omega _0=0.25\). Each coupling strength \(g_{ij}\in [0,0.2]\) between the reservoir oscillators is chosen uniformly at random. The M input modes are also oscillators of this kind. Although in principle having \(\rho ({{\textbf{A}}})<1\) is enough for fading memory, in practice we impose the limit \(\rho ({{\textbf{A}}})<0.99\) to avoid issues with finite precision numerics. The influence of the initial state of the reservoir is washed out during the preparation phase and is therefore irrelevant; in practice we use the ground state of \(H_R\). For each different case we show results of 100 random realizations of the reservoir and, when applicable, of other quantities such as the input time series \({{\textbf{s}}}\). In all cases the lengths of the preparation, training and test phases are 40, 80 and 40, respectively. The values of the time between inputs \(\Delta t\) and the input states themselves vary and will be reported along with the tasks.

Short term quantum memory task

The short term memory task is a paradigmatic task in classical RC where the input \(s_k\) at some timestep k is a real scalar or vector, and the target is \(s_{k-\tau }\) where \(\tau \) is the delay. Checking the performance in this task for different delays is often used to gauge how much linear memory a reservoir has. The short term quantum memory (STQM) task is its direct generalization where \(s_k\) is a state of a quantum system. Specifically,

$$\begin{aligned} {\left\{ \begin{array}{ll} {{\textbf{x}}}^I_k, &{} (\text {input at timestep }k) \\ {{\textbf{x}}}^O_k\approx {{\textbf{x}}}^I_{k-\tau }. &{} (\text {target at timestep }k) \end{array}\right. } \end{aligned}$$
(5)

The objective is to achieve the transformation \({{\textbf{s}}}\mapsto {{\textbf{o}}}\) defined by Eq. (5) by training \(\Delta t\) and Hamiltonian \(H_I\). \(H_R\) is assumed to be random but fixed and \({{\textbf{x}}}_0^R\) can be arbitrary. Unlike in the classical case where the amount of information grows linearly with input size, here the growth is more rapid as the reservoir must be able to delay also the correlations and entanglement between different input modes in \({{\textbf{x}}}_k^I\). We mention in passing that the process tomography of such a delay map in the discrete variable case was considered in Ref.29.

It should be pointed out that for single-mode input states the task can in principle be done exactly for any delay \(\tau \) by concatenating \(N=\tau \) single oscillator reservoirs, which can be compared to deep RC. For \(\tau =1\) the states of the reservoir and input must be swapped; the required interaction strength and time can be solved analytically. Clearly using a single reservoir oscillator with double the interaction time solves the task for \(\tau =0\) since the states are swapped twice, while using a sequence of swaps with N different single mode reservoirs achieves a delay \(\tau =N\). For \(M>1\) and \(\tau >0\) the task can be done by using \(N=M\tau \) non-interacting reservoir oscillators, provided that the input systems are likewise non-interacting. Here we show that the task can in fact be solved by using a single random and fixed reservoir and letting the input interact only once with it, which can be compared to ordinary (shallow) RC.

The simplicity of this task makes it amenable to a special purpose training procedure. Indeed, from Eq. (4) the following conditions for solving the task can be read off:

$$\begin{aligned} {\left\{ \begin{array}{ll} {{\textbf{D}}}\approx {{\textbf{I}}},\ {{\textbf{C}}}{{\textbf{A}}}^t{{\textbf{B}}}\approx {{\textbf{0}}}\ \forall t\ge 0 &{} \text {if }\tau =0,\\ {{\textbf{C}}}{{\textbf{A}}}^{\tau -1}{{\textbf{B}}}\approx {{\textbf{I}}},\ {{\textbf{D}}}\approx {{\textbf{C}}}{{\textbf{A}}}^{t\ne \tau -1}{{\textbf{B}}}\approx {{\textbf{0}}}&{} \text {if }\tau >0. \end{array}\right. } \end{aligned}$$
(6)

When satisfied, the contributions from the incorrect timesteps are suppressed while the contribution from the correct one is enhanced. The conditions in Eq. (6) concern the full symplectic matrix induced by all three Hamiltonians, yet they must be satisfied by training only one of them, \(H_I\). In practice the cost function given by Eq. (11) in “Methods” is minimized to train the reservoir. Additionally, the training of \(H_I\) is repeated for \(\Delta t= 2\pi /\omega _0,4\pi /\omega _0,\ldots ,8\pi /\omega _0\) and the best value is chosen for the testing phase. Notably, training is input state independent; although the training phase in \({{\textbf{s}}}\) could therefore be omitted, it is kept for the sake of consistency. It should be stressed that since the task is linear in the involved modes for any state, if training is successful the reservoir can also delay non-Gaussian M-mode states.

Figure 2

Results for STQM task. Different reservoir sizes N, input sizes M and delays \(\tau \) are considered, and for each different set of values 100 random reservoirs were used. (a) Delay \(\tau \) is varied for fixed N and M and all results are shown with a box plot. The box plot shows the minimum (lower whisker), maximum (upper whisker), median (line between boxes) and the first and third quartiles (beginning of lower box and end of the upper box, respectively). The fidelity achieved by random guessing is indicated by the dashed horizontal line. (b) Delay \(\tau \) is fixed while both N and M are varied. Here only the median value is shown. See text for details.

Results of numerical experiments are shown in Fig. 2. The input time series \({{\textbf{s}}}\) consists of random zero mean M-mode Gaussian states generated by acting with a random symplectic matrix on a thermal state of non-interacting oscillators; it should be stressed that in general this creates correlated states. For details and the parameter values used see “Methods”. The reservoir parameters are as specified previously. Different values of N, M and delay \(\tau \) are considered, whereas the figure of merit is the (Uhlmann) fidelity averaged over the test phase, which can be calculated in closed form for arbitrary Gaussian states by applying, e.g., the results of Ref.30.
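For reference, in the single mode case the fidelity between two zero mean Gaussian states reduces to a function of the two covariance matrices alone. The sketch below is ours and uses the well known single mode formula in the convention where the vacuum covariance matrix is \({{\textbf{I}}}/2\); the paper itself relies on the general multimode results of Ref.30.

```python
import numpy as np

def fidelity_1mode(s1, s2):
    """Uhlmann fidelity of two zero-mean single-mode Gaussian states
    with 2x2 covariance matrices s1, s2 (vacuum = I/2, hbar = 1)."""
    delta = np.linalg.det(s1 + s2)
    lam = 4 * (np.linalg.det(s1) - 0.25) * (np.linalg.det(s2) - 0.25)
    return 1.0 / (np.sqrt(delta + lam) - np.sqrt(lam))

# Sanity checks: identical states give 1, vacuum vs. thermal gives 1/(n+1)
vac = 0.5 * np.eye(2)
assert np.isclose(fidelity_1mode(vac, vac), 1.0)
assert np.isclose(fidelity_1mode(vac, 3.5 * np.eye(2)), 0.25)  # n_th = 3
```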

In panel a the reservoir size N and input size M have been fixed and the delay \(\tau \) is varied. The performance is excellent especially for small delays \(\tau =0,1,2\), where the median values are \({\bar{F}}>0.999999\), \({\bar{F}}>0.9999\) and \({\bar{F}}>0.964\), respectively. Even the worst performance is decent for small delays, and it is conceivable that repeating the training with a different seed for the random number generator might improve it significantly. In panel b the delay is fixed and N and M are varied; the diagonal of the array shows the case \(N=M\). As expected, the reservoir struggles significantly with the task when \(N<M\), whereas for \(N>M\) the performance is mostly good. Although one might expect performance along the diagonal where \(N=M\) to remain roughly constant, in practice it is found to deteriorate as these parameters increase. This could be because the training method is not guaranteed to find the global optimum, whereas the optimization problem becomes increasingly challenging as the number of parameters increases. Remarkably, a random reservoir can be trained to delay even multimode states by tuning only \(H_I\) and \(\Delta t\).

Quantum channel equalization

This task is inspired by the channel equalization task, where the input time series is distorted by transmission through a classical channel with fading memory and the objective is to invert the transformation caused by the channel and restore the original time series. The quantum counterpart is presented in Fig. 1c, where the original time series consists of states of quantum systems with operators \({{\textbf{x}}}^I\) that are transformed by an interaction with some fixed but random system with operators \({{\textbf{x}}}^{Ch}\), which constitutes the channel. We assume the channel to induce a linear transformation of the modes given by some symplectic matrix, i.e. the relevant Hamiltonians are quadratic, and furthermore that it has fading memory, which implies that \({{\textbf{x}}}^{Ch}\) is well-approximated by a function of a finite number of past inputs. The transformed operators of the input system are denoted by \({{\textbf{x}}}^D\). A reservoir with operators \({{\textbf{x}}}^R\) is trained to recover the original time series from \({{\textbf{x}}}^D\). The equations of motion can be recast in the same general form as before, as explained in “Methods”.

It should be stressed that simply training the reservoir to perform the inverse of the channel symplectic matrix will not work since the channel acts on \({{\textbf{x}}}^{Ch}\) and \({{\textbf{x}}}^I\) but the reservoir symplectic matrix \({{\textbf{S}}}\) acts on \({{\textbf{x}}}^R\) and \({{\textbf{x}}}^D\). The transformed input \({{\textbf{x}}}^D_k\) at some timestep k in general does not contain full information about any of the previous inputs because part of the information is in \({{\textbf{x}}}^{Ch}\) and in general also in the correlations between the channel and the distorted input. Since it is well known that unknown quantum states can neither be cloned nor amplified, the task as given is in fact impossible—in stark contrast with its classical counterpart. To have any hope of success, the task must be modified.

Here we will do this using techniques inspired by classical RC. Instead of a single copy of the input state we will transmit a product state of \({\mathfrak {s}}\) copies, but still require the reservoir to distill only a single copy of the original input, thus giving the reservoir additional quantum information to work with. We call this spatial multiplexing at order \({\mathfrak {s}}\). Additionally, we transmit the same product state \({\mathfrak {m}}\) times, one copy after another, requiring the reservoir to output the original input only after the \({\mathfrak {m}}\)-th copy. We call this temporal multiplexing at order \({\mathfrak {m}}\). More formally,

$$\begin{aligned} {\left\{ \begin{array}{ll} \bigoplus _{i=1}^{{\mathfrak {s}}}{{\textbf{x}}}_k^I,\,{\mathfrak {m}} \text { copies sequentially}&{} \text {(inputs to channel at timestep }k)\\ {{\textbf{x}}}^D_{k,1},\,{{\textbf{x}}}^D_{k,2},\ldots {{\textbf{x}}}^D_{k,{\mathfrak {m}}}\,\text {sequentially} &{} \text {(inputs to reservoir at timestep }k) \\ {{\textbf{x}}}^O_k\approx {{\textbf{x}}}^I_{k}, &{} \text {(target at timestep }k) \end{array}\right. } \end{aligned}$$
(7)

Here the timesteps are to be understood as the points where we switch from one set of identical copies to another, as determined by the original unaltered input time series \({{\textbf{s}}}\). As a final remark before moving on, if \({{\textbf{s}}}\) is given but unknown, neither spatial nor temporal multiplexing can be used, making the task again impossible. It must be assumed that there is a source that directly generates states according to some \({\mathfrak {s}}\) and some \({\mathfrak {m}}\), or alternatively that the states are known to the sender, who may then prepare the copies. It may be asked if some other modifications could help solve the task even for unknown \({{\textbf{s}}}\), but this is outside the scope of the present work.

Figure 3

Results for the quantum channel equalization task. Reservoir, input and channel sizes are fixed to \(N=3\), \(M=1\) and \(C=2\), respectively, while the orders of spatial and temporal multiplexing of the input are varied. In the former product states of identical copies of the input are used instead of single mode states, and in the latter, identical copies of the input are injected sequentially. (a) The two multiplexings are used separately with results shown with a box plot as in Fig. 2. The fidelity achieved by random guessing is indicated by the dashed horizontal line. (b) The two multiplexings are used together. In all cases results of 100 random realizations of the reservoir, input and channel have been used.

With the analysis complete for now, we move on to numerical experiments. The channel has \(C=2\) oscillators and its Hamiltonian has the same general form as the reservoir Hamiltonian. The channel is taken to interact with the inputs such that the spectral radius of the relevant block in the symplectic matrix is at most 0.95. The reservoir size is \(N=3\). The input time series consists of random zero mean Gaussian states as in the STQM task, however for simplicity we focus only on the single mode case where \(M=1\). Unlike before, the time between inputs is taken to be fixed by the channel and is therefore kept at a constant value, chosen to be \(\Delta t=1.5\pi /\omega _0\). Results are shown in Fig. 3.

In panel a spatial and temporal multiplexing are considered separately. At \({\mathfrak {s}}={\mathfrak {m}}=1\) both reduce to the original formulation of the task, which was already concluded to be unsolvable; still, performance exceeds that of random guessing. Performance increases quickly with \({\mathfrak {s}}\) and slowly with \({\mathfrak {m}}\). Indeed, already at \({\mathfrak {s}}=2\) results tend to be better than at \({\mathfrak {m}}=5\). That being said, unlike increasing \({\mathfrak {m}}\), increasing \({\mathfrak {s}}\) increases the number of terms in \(H_I\), which, e.g., makes training slower. In panel b spatial and temporal multiplexing are considered together. Curiously, performance does not always increase with \({\mathfrak {m}}\) and \({\mathfrak {s}}\). For example, both \({\mathfrak {m}}=3,\;{\mathfrak {s}}=4\) and \({\mathfrak {m}}=4,\;{\mathfrak {s}}=3\) lead to a better performance than \({\mathfrak {m}}=4,\;{\mathfrak {s}}=4\). Moreover, spatial multiplexing alone achieves a performance not too far off from the best case. A possible interpretation is that since at every timestep some quantum information permanently leaves the reservoir, temporal multiplexing also introduces a mechanism which hinders the task and is therefore not always beneficial.

Entangler

In this task the inputs \({{\textbf{s}}}_k\) are taken to be uncorrelated single mode systems in the vacuum state, whereas the target time series consists of entangled states. Although more complicated patterns can be envisioned, here we focus on entangling systems with a fixed delay \(\tau \). That is to say, the goal is to entangle \({{\textbf{s}}}_k\) with \({{\textbf{s}}}_{k-\tau }\) by training \(H_I\). For \(\tau =1\) the target is then a chain of systems with nearest neighbor connections where the connections represent entanglement, for \(\tau =2\) a chain with next nearest neighbors connected, and so on. Importantly, we assume that only one input interacts with the reservoir at any given timestep and all the others are unavailable, and furthermore that there are never any direct interactions between the systems in \({{\textbf{s}}}\). In fact, if we imagine that the systems are periodically generated by a source of vacuum states, then the input \({{\textbf{s}}}_{k+1}\) does not even exist at some timestep k. A system or a device that can solve the task can turn a source of uncorrelated states into one of entangled states.

Figure 4

Results for entangler task. For each different case 100 random networks were created, whereas the figure of merit is average logarithmic negativity. (a) Delay is varied for fixed N and all results are shown with a box plot as in Fig. 2. Delay 1 corresponds to nearest neighbors, 2 to next nearest neighbors and so on. (b) Median logarithmic negativity when N and delay are varied. See text for details.

Much like in the STQM task, in certain special cases and allowing for a time-dependent \(H_I\) the task can be solved analytically and exactly, as depicted in Fig. 1d, where only the case \(\tau =1\) is considered for simplicity. At each timestep the reservoir and the ancilla are first entangled using an interaction Hamiltonian of one form and then their states are swapped using a different interaction Hamiltonian; in both cases one can solve analytically for the precise form of the interactions and the interaction times. In fact, since the operations commute the order does not matter. Every application of the entangling gate creates a link in Fig. 1d, which is later swapped to the next input system. The role of the reservoir oscillator is to provide short term quantum memory. Without it, the task becomes impossible.

Here we solve the task with the RC inspired approach by training the time-independent \(H_I\) to maximize the entanglement, as quantified by the logarithmic negativity31 between \({{\textbf{s}}}_k\) and \({{\textbf{s}}}_{k-\tau }\). Like in the STQM task, training is repeated for \(\Delta t= 2\pi /\omega _0,4\pi /\omega _0,\ldots ,8\pi /\omega _0\) and the best value is chosen for the testing phase. To succeed the reservoir must simultaneously create entanglement and re-distribute the quantum information correctly, as explained previously. Results are shown in Fig. 4. In panel a the reservoir size is fixed to \(N=3\), the delay \(\tau \) is varied, and the logarithmic negativity averaged over the systems in the test phase is shown. The logarithmic negativity achieved for the shortest delay is between 0.35 and 0.4, which corresponds to that of a twin beam state with two-mode squeezing parameter \(0.175\le s\le 0.2\). Performance decreases slowly up to \(\tau =N\), but then collapses for delays \(\tau >N\), bearing a striking similarity to panel a of Fig. 2. One may interpret this as the reservoir being able to remember up to N single mode states before running out of memory. The same behaviour can be observed in panel b, where both N and the delay \(\tau \) are varied and median performances are shown.

Partial generalizations

In all previously introduced tasks both the input and the output time series consist of quantum information, however one may consider partial generalizations where one of them is still classical. Here we briefly illustrate the possibilities with two simple examples.

Figure 5

Partial generalizations. (a) Results for predictive quantum state preparation where the input is classical but the target output is quantum. The reservoir is trained to prepare the quantum state according to future classical inputs which it must deduce from previous inputs. Specifically, the inputs are thermal states where the number of thermal excitations follows the well known Santa Fe time series, and the targets are squeezed vacuum states where the squeezing parameter is to coincide with the number of thermal excitations of a future input. (b) Results for von Neumann entropy detection where the input is quantum but the target output is classical. Here the reservoir is trained to estimate the determinants of input covariance matrices—which completely determine, e.g., the von Neumann entropy—for different delays. Random product states of \(M=10\) identical single mode states are used as input. If reservoir observables are available for a single timestep, then the von Neumann entropies of multiple previous inputs can be estimated with a very low error.

If the output is quantum but \({{\textbf{s}}}\) is classical, say, a time series of systems in thermal states, one may follow the framework used previously. As an example task we consider predictive quantum state preparation, where the reservoir is trained to prepare a given quantum state—here, squeezed vacuum—based on future classical inputs. For arbitrary \({{\textbf{s}}}\) this is of course impossible, but if \({{\textbf{s}}}\) is at least approximately predictable then the task can in principle be solved. Here we consider the Santa Fe chaotic time series, a dataset recorded from a far-infrared laser in a chaotic state32,33, often used to benchmark the predictive power of classical reservoirs. Specifically, we normalize the Santa Fe time series and take as \({{\textbf{s}}}\) single mode thermal states such that the number of thermal excitations \((n_{\textrm{th}})_k\) follows the normalized time series, while the target is a squeezed vacuum state with squeezing parameter \(r_k=(n_{\textrm{th}})_{k+a}\), where \(a\ge 0\) is the advance, or the number of timesteps into the future the reservoir must be able to predict.

Results are shown in Fig. 5a, where the average fidelity between the target state and the actual output state is shown for different values of the advance. There is very little spread for most values since only the reservoir is randomized between different realizations. Interestingly, there is an abrupt change in behavior when the advance a exceeds the number of reservoir oscillators N. Even then the fidelity remains decent but there is considerably more spread in the performance.

In the opposite case where the output is classical information, say, about the properties of the states carried by the input systems in \({{\textbf{s}}}\), the approach where the classical output is formed from reservoir observables can be used, as outlined previously in “The model” section. Let \(\sigma ({{\textbf{x}}}_k^R)\) be the reservoir covariance matrix at some timestep k. It can be shown25 that

$$\begin{aligned} \begin{aligned} \sigma ({{\textbf{x}}}_m^R)&={{\textbf{A}}}^m\sigma ( {{\textbf{x}}}_0^R)({{\textbf{A}}}^\top )^m+\sum _{k=1}^m{{\textbf{A}}}^{m-k}{{\textbf{B}}}\sigma ( {{\textbf{x}}}_k^I){{\textbf{B}}}^\top ({{\textbf{A}}}^\top )^{m-k}\\ {}&\approx \sum _{k=1}^m{{\textbf{A}}}^{m-k}{{\textbf{B}}}\sigma ( {{\textbf{x}}}_k^I){{\textbf{B}}}^\top ({{\textbf{A}}}^\top )^{m-k}\quad \text {when } \rho ({{\textbf{A}}})<1\text { and }m\gg 1, \end{aligned} \end{aligned}$$
(8)

which holds for any number M of input modes. In principle, the elements of \(\sigma ({{\textbf{x}}}_k^R)\) can be estimated by performing measurements on multiple copies of the reservoir that have processed identical inputs \({{\textbf{s}}}\), which however introduces substantial overhead. Analyzing just how much overhead is incurred is beyond the scope of this work, but it has recently been done in Ref.34, which also proposes a possible implementation; in what follows, it is assumed that the exact values of the elements of \(\sigma ({{\textbf{x}}}_k^R)\) are available. That being said, since the target is some function of reservoir observables the full Hamiltonian can remain constant, decoupling training from the dynamics. Indeed, once the elements are available, multiple trained functions can be used to estimate a number of different features of \({{\textbf{s}}}\).
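Numerically, the sum in Eq. (8) is most conveniently evaluated as a recursion; a minimal sketch (ours), valid when each input is uncorrelated with the reservoir before it interacts, as is the case for product state inputs:

```python
import numpy as np

def propagate_covariance(A, B, input_covs, sigma0):
    """Iterate sigma_{k+1} = A sigma_k A^T + B sigma^I_{k+1} B^T,
    the recursive form of Eq. (8). With rho(A) < 1 the influence
    of the initial covariance sigma0 decays during the preparation phase."""
    sigma, history = sigma0, []
    for sI in input_covs:
        sigma = A @ sigma @ A.T + B @ sI @ B.T
        history.append(sigma)
    return history
```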

As an example task, we consider as input random single mode Gaussian states and as target \(\det (\sigma ({{\textbf{x}}}^I_{k-\tau }))\), the determinant of the single mode covariance matrix for some delay \(\tau \ge 0\). This is an important quantity that, e.g., completely determines the purity, the amount of thermal excitations and the von Neumann entropy of the state. Below we give an overview of the conditions under which this task was simulated; for full details, see “Methods”.

We consider a reservoir of size \(N=20\) and \(M=10\) input modes such that the input is in a random product state of identical single mode states. Furthermore, the Hamiltonian is such that only two reservoir oscillators interact with each input mode, and there are no interactions between these triplets of two reservoir oscillators and a single input oscillator. The interaction strengths are random but fixed and the spectral radius condition is satisfied by tuning \(\Delta t\). One may observe from Eq. (8) that \(\sigma ({{\textbf{x}}}_k^R)\) is linear in \(\sigma ({{\textbf{x}}}^I_{k-\tau })\) for any delay \(\tau \), unlike the determinant; this problem can be overcome by considering trained linear combinations of products of pairs of elements of \(\sigma ({{\textbf{x}}}_k^R)\). Here the output is a trained linear combination of products of distinct pairs of elements of the first row of \(\sigma ({{\textbf{x}}}_k^R)\), with training carried out as in Ref.25. Finally, unlike elsewhere, we consider preparation, training and test phases of lengths 500, 2000 and 500, respectively.
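A sketch of this readout (ours; a plain least squares fit stands in for the training procedure of Ref.25):

```python
import numpy as np
from itertools import combinations

def features(sigmaR):
    """Products of distinct pairs of elements of the first row of the
    reservoir covariance matrix, plus a constant bias term."""
    row = sigmaR[0]
    pairs = [row[i] * row[j] for i, j in combinations(range(len(row)), 2)]
    return np.array([1.0] + pairs)

def train_readout(reservoir_covs, targets):
    """Least squares fit of the linear readout weights; the prediction
    for a new covariance matrix sigma is then features(sigma) @ w."""
    X = np.array([features(s) for s in reservoir_covs])
    w, *_ = np.linalg.lstsq(X, np.array(targets), rcond=None)
    return w
```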

Results are shown in Fig. 5b, where the normalized mean squared error (NMSE) between the actual von Neumann entropy and that computed from the determinants estimated by the reservoir is shown. As can be seen, the NMSE is very small for all considered delays, indicating an excellent agreement between the actual and predicted values.

Discussion

In this work we have introduced an RC inspired model for online processing of time series consisting of quantum information. Importantly, we have found that with just a judicious choice of the interaction Hamiltonian, random instances of the model starting from any initial state can solve a variety of different tasks with high performance. The scheme might be further developed by considering, e.g., how training also the time between inputs affects the performance, or by considering the case of non-Gaussian states or operations, which can be expected to lead to nonlinear memory35 where \({{\textbf{x}}}^O_m\) can be nonlinear in \({{\textbf{x}}}^I_i\) for \(i\le m\). One may also consider the prospects of a proof-of-principle experimental implementation, since the general form of the reservoir Hamiltonian can in principle be realized in a multimode optics platform36. We have also briefly illustrated the possibilities of partial generalizations of classical temporal tasks to cases where either the input or the target time series remains classical. Looking at the bigger picture, it is interesting to compare and contrast two distinct situations: when the output time series is to be quantum, and when it is to be classical.

In the former case the output extraction problem hindering previous related work vanishes but engineering freedom is preserved: control of only a small subset of all parameters is sufficient for high performance. Furthermore, if the reservoir Hamiltonian is random but known, then measurements are not needed even in the training stage provided one can simulate the dynamics. For the considered model in particular, an unknown Hamiltonian can be probed first37. It can be imagined that a classical RC augmented with a state preparation mechanism could emulate the case where the input is classical, but otherwise there is genuine quantumness: in the single shot case it is clear that no classical RC can emulate its quantum counterpart since the input data would first have to be transformed into classical information. That being said, training the interaction Hamiltonian is in general somewhat costly even when the dynamics can be simulated. If simulation is not possible, the cost or objective function must be estimated with measurements, which should be expected to be a very challenging optimization problem in its own right, by comparison with, e.g., variational quantum algorithms38,39,40,41,42. Finally, another advantage of RC, multitasking, is lost: in RC the same reservoir processing the same input can simultaneously solve many different tasks by using differently trained readout functions, whereas in the quantum case any attempt to multitask will inevitably affect performance because there is only so much quantum information available for forming the output.

When instead the output is classical, multitasking is possible and the training cost is minimal. The challenges are two-fold: the output extraction problem, and pinpointing what role exactly quantumness plays aside from providing a larger state space. Moreover, even if the output extraction problem can be solved, the specific way it is solved may dictate which quantum systems are ultimately suitable. As we demonstrate here with von Neumann entropy detection, the case where the input is quantum might, however, be of particular interest, since thanks to the memory and multitasking a plethora of information concerning multiple past input states can be distilled even if the reservoir observables are known only for one or a few timesteps. This may be compared to recent proposals where information about only a single quantum state is extracted with the help of a larger quantum system and supervised machine learning43,44,45.

Indeed, the results have created fertile ground for further work in the direction where at least one of the time series is quantum. To the best of our knowledge there is currently little work on such temporal tasks, however the inverse problem of performing tomography of an unknown temporal quantum map has recently been considered in a spin system29. Comparisons may also be made with a recent proposal to train a quantum system to induce quantum gates between qubits46; its temporal generalization might consider gates between inputs at different timesteps, for example. Indeed, temporal quantum tasks could also be tackled in the discrete variable case with, e.g., spins or superconducting qubits. Since the Hilbert space dimension grows exponentially with the number of systems, it is conceivable that they might permit a wider range of tasks than Gaussian states also in the case of quantum time series processing, and investigating the matter is an interesting avenue of further research. In particular, single nitrogen vacancy center spins in diamond have long coherence times, whereas coherent information exchange could be mediated by polaritons as recently proposed47. One may also consider hybrid quantum systems exhibiting phenomena with potential applications in quantum information processing such as coherent perfect absorption48, higher-order exceptional points49 and Kerr nonlinearity50, or quasiparticle systems such as magnons, for which a proposal for a well-controlled system with tunable parameters has been made51.

Methods

Generation of random zero mean Gaussian states

In the single mode case where \(M=1\) the states may be parameterized in terms of the thermal excitations \(n_{\textrm{th}}\), the magnitude of squeezing r and the phase of squeezing \(\varphi \). For the displacement we consistently use \(\alpha =0\), so that the input first moments vanish—the state is then completely characterized by its covariance matrix, which reads

$$\begin{aligned} \sigma ({{\textbf{x}}}^I)=\frac{2n_{\text {th}}+1}{2}\begin{pmatrix} (\cosh {(2r)}+\cos {(\varphi )}\sinh {(2r)})/\omega &{} \sin {(\varphi )}\sinh {(2r)} \\ \sin {(\varphi )}\sinh {(2r)} &{} (\cosh {(2r)}-\cos {(\varphi )}\sinh {(2r)})\omega \end{pmatrix}, \end{aligned}$$
(9)

which is a covariance matrix of a squeezed thermal state.
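A sketch of sampling such a state (ours; the sampling intervals, quoted at the end of this subsection for the STQM and channel equalization tasks, are used as defaults):

```python
import numpy as np

def random_squeezed_thermal_cm(rng, omega=0.25):
    """Random single-mode covariance matrix drawn according to Eq. (9),
    with n_th in [0, 10], r in [0, 1] and phi in [0, 2*pi]."""
    nth = rng.uniform(0.0, 10.0)
    r = rng.uniform(0.0, 1.0)
    phi = rng.uniform(0.0, 2 * np.pi)
    ch, sh = np.cosh(2 * r), np.sinh(2 * r)
    pre = (2 * nth + 1) / 2
    return pre * np.array(
        [[(ch + np.cos(phi) * sh) / omega, np.sin(phi) * sh],
         [np.sin(phi) * sh, (ch - np.cos(phi) * sh) * omega]])
```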

In the multimode case where \(M>1\) the state is essentially parameterized by M thermal excitations, each independently and uniformly distributed, and M squeezing parameters, also independently and uniformly distributed, in the following way. We begin from the product state of M single mode thermal states, each with their own thermal excitations. Then we act with a random basis change, apply single mode squeezing of the position to all the modes with random magnitudes, and finally act on the resulting state with another random basis change. The random basis changes are built from Haar random \(M\times M\) unitary matrices; let such a matrix be \({{\textbf{U}}}\). Then by construction

$$\begin{aligned} {{\textbf{O}}}=\begin{pmatrix} \textrm{Re}({{\textbf{U}}}) &{} \textrm{Im}({{\textbf{U}}})\\ -\textrm{Im}({{\textbf{U}}}) &{} \textrm{Re}({{\textbf{U}}}) \end{pmatrix} \end{aligned}$$
(10)

is orthogonal and also a symplectic matrix w.r.t. the chosen ordering of operators.
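A sketch of this construction (ours; the Haar random unitary is drawn with the standard QR decomposition trick):

```python
import numpy as np

def haar_unitary(M, rng):
    """Haar-random M x M unitary via QR decomposition of a Ginibre matrix."""
    Z = (rng.normal(size=(M, M)) + 1j * rng.normal(size=(M, M))) / np.sqrt(2)
    Q, R = np.linalg.qr(Z)
    d = np.diagonal(R)
    return Q * (d / np.abs(d))           # fix the phases of the columns

def orthogonal_symplectic(M, rng):
    """Orthogonal symplectic basis change of Eq. (10),
    in the operator ordering (q_1..q_M, p_1..p_M)."""
    U = haar_unitary(M, rng)
    return np.block([[U.real, U.imag], [-U.imag, U.real]])
```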

For both STQM and channel equalization tasks we have chosen the intervals to be \(n_{\textrm{th}}\in [0,10]\) and \(r\in [0,1]\). In the single mode case \(\varphi \in [0,2\pi ]\). These intervals also apply to the input states used for von Neumann entropy detection task shown in Fig. 5b. In the entangler task the input states are always single mode vacuum states.

Training

Cost function minimization

A simple stochastic function optimizer called differential evolution (DE) is used. It treats the cost function as a black box, allowing it to, e.g., attempt to optimize functions whose gradients (first derivatives) or Hessians (second derivatives) either do not exist or are impractical to calculate. Specifically, the implementation of Wolfram Mathematica 11.2 is used, which is described in Ref.52. Here we give an overview of the method and the parameter values used; for full details consult the reference.

DE iterates a population of points \(\{x_1,x_2,\ldots ,x_d\}\). At each iteration a new population is created from the old one as follows. For each \(x_j\) in the old population, three other old points \(x_w\), \(x_u\) and \(x_v\) are chosen randomly and a point \(x_s=x_w+s(x_u-x_v)\) is formed where \(s\in {\mathbb {R}}\) is a parameter called scaling factor. Then a new point \(x_j^{new}\) is created by taking each element either from \(x_j\) or \(x_s\) with probabilities p and \(1-p\), respectively, where the parameter p is called cross probability. Finally, the new point \(x_j^{new}\) replaces \(x_j\) if \(f(x_j^{new})\) is better than \(f(x_j)\), where f is a given cost or objective function. The stopping criterion is met when both \(|f(x_j^{new})-f(x_j)|\) and \(\Vert x_j^{new}-x_j\Vert \) are sufficiently small.
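For illustration, the iteration just described can be written out directly; the following minimal version is ours and replaces the stopping criterion of Ref.52 with a fixed number of iterations:

```python
import numpy as np

def differential_evolution(f, pop, s=0.05, p=0.4, iters=200, seed=None):
    """Minimize f over a population of points (the rows of pop)."""
    rng = np.random.default_rng(seed)
    pop = np.array(pop, dtype=float)
    vals = np.array([f(x) for x in pop])
    d = len(pop)
    for _ in range(iters):
        for j in range(d):
            w, u, v = rng.choice([i for i in range(d) if i != j], 3,
                                 replace=False)
            xs = pop[w] + s * (pop[u] - pop[v])       # shifted point
            mask = rng.random(pop.shape[1]) < p       # True: keep x_j element
            xnew = np.where(mask, pop[j], xs)
            fnew = f(xnew)
            if fnew < vals[j]:                        # replace if better
                pop[j], vals[j] = xnew, fnew
    return pop[vals.argmin()]
```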

We initialize the population by generating 30NM points, each corresponding to a different interaction Hamiltonian \(H_I\) in which each interaction strength \(g_{nm}\) between a reservoir oscillator \(n\in \{1,\ldots ,N\}\) and an input mode \(m\in \{1,\ldots ,M\}\) is uniformly and independently distributed in \(g_{nm}\in [0,0.2]\), subject to the spectral radius condition \(\rho ({{\textbf{A}}})\le 0.99\). In the event that some point does not satisfy \(\rho ({{\textbf{A}}})\le 0.99\) it is generated anew.

We consistently use a scaling factor \(s=0.05\) and a cross probability \(p=0.4\), i.e. rather small shifts are used to create the shifted points \(x_s\), and when forming \(x_j^{new}\) the elements are slightly more likely to be picked from \(x_s\). We settled on these values through a simple lattice search. All other settings use the default values listed in Ref.52.

Cost function of the STQM task

The cost function is

$$\begin{aligned} {\left\{ \begin{array}{ll} f(H_I,\Delta t)=\Vert {{\textbf{D}}}-{{\textbf{I}}}\Vert +1/\Vert H_I\Vert _\infty &{} \text {if }\tau =0,\\ f(H_I,\Delta t)=0.5\Vert {{\textbf{D}}}\Vert +5\Vert {{\textbf{C}}}{{\textbf{A}}}^{\tau -1}{{\textbf{B}}}-{{\textbf{I}}}\Vert &{} \text {if }\tau >0, \end{array}\right. } \end{aligned}$$
(11)

where \(\Vert \cdot \Vert \) is the Frobenius norm and where, with a slight abuse of notation, we have indicated by \(\Vert H_I\Vert _\infty \) the maximum coupling strength between a reservoir oscillator and the input oscillator(s). The point of the term \(1/\Vert H_I\Vert _\infty \) is to prevent the training from converging to the trivial solution \(H_I={{\textbf{0}}}\). The factors 0.5 and 5 control the relative importance of minimizing the norm of \({{\textbf{D}}}\) and achieving \({{\textbf{C}}}{{\textbf{A}}}^{\tau -1}{{\textbf{B}}}\approx {{\textbf{I}}}\); these values were chosen after some trial and error. While the function does not feature all of the relevant terms in Eq. (6), numerical experiments suggest that including more terms leads to worse results.
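In code, Eq. (11) takes only a few lines; a sketch (ours), taking the blocks of the symplectic matrix and the maximum coupling \(\Vert H_I\Vert _\infty \) as arguments:

```python
import numpy as np

def stqm_cost(A, B, C, D, tau, g_max):
    """Cost function of Eq. (11); norms are Frobenius norms and
    g_max plays the role of the maximum coupling strength."""
    I = np.eye(D.shape[0])
    if tau == 0:
        return np.linalg.norm(D - I) + 1.0 / g_max
    CAB = C @ np.linalg.matrix_power(A, tau - 1) @ B
    return 0.5 * np.linalg.norm(D) + 5.0 * np.linalg.norm(CAB - I)
```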

Objective functions of the quantum channel equalization and entangler tasks

Unlike in the relatively simple STQM task, in these tasks there is no obvious way to derive conditions on the reservoir symplectic matrix. This is why the objective function is the task dependent figure of merit—the fidelity between the reservoir output and the original input in channel equalization, and the logarithmic negativity in the entangler—evaluated during the training phase.

Additional details about the quantum channel equalization task

Let us write down the transformations caused by the channel and the reservoir at some timestep k. The interaction between input and channel modes induces a symplectic matrix \({{\textbf{S}}}'\). Its action on all of the relevant modes reads

$$\begin{aligned} \begin{pmatrix} {{\textbf{x}}}^{Ch}_{k+1} \\ {{\textbf{x}}}^R_{k} \\ {{\textbf{x}}}^D_{k+1} \end{pmatrix}= \begin{pmatrix} {{\textbf{A}}}' &{} {{\textbf{0}}} &{} {{\textbf{B}}}' \\ {{\textbf{0}}} &{} {{\textbf{I}}} &{} {{\textbf{0}}} \\ {{\textbf{C}}}' &{} {{\textbf{0}}} &{} {{\textbf{D}}}' \end{pmatrix} \begin{pmatrix} {{\textbf{x}}}^{Ch}_{k} \\ {{\textbf{x}}}^R_{k} \\ {{\textbf{x}}}^I_{k+1} \end{pmatrix}, \end{aligned}$$
(12)

where \({{\textbf{S}}}'\) has already been divided into blocks such that \({{\textbf{A}}}'\) is \(2C\times 2C\) and \({{\textbf{D}}}'\) is \(2M\times 2M\). Nothing happens to the reservoir modes since there is no interaction between the reservoir and the channel. The reservoir processes \({{\textbf{x}}}^D_{k+1}\) according to

$$\begin{aligned} \begin{pmatrix} {{\textbf{x}}}^{Ch}_{k+1} \\ {{\textbf{x}}}^R_{k+1} \\ {{\textbf{x}}}^O_{k+1} \end{pmatrix}= \begin{pmatrix} {{\textbf{I}}} &{} {{\textbf{0}}} &{} {{\textbf{0}}} \\ {{\textbf{0}}} &{} {{\textbf{A}}} &{} {{\textbf{B}}} \\ {{\textbf{0}}} &{} {{\textbf{C}}} &{} {{\textbf{D}}} \end{pmatrix} \begin{pmatrix} {{\textbf{x}}}^{Ch}_{k+1} \\ {{\textbf{x}}}^R_{k} \\ {{\textbf{x}}}^D_{k+1} \end{pmatrix}. \end{aligned}$$
(13)

Combining these two transformations we get

$$\begin{aligned} \begin{pmatrix} {{\textbf{x}}}^{Ch}_{k+1} \\ {{\textbf{x}}}^R_{k+1} \\ {{\textbf{x}}}^O_{k+1} \end{pmatrix}= \begin{pmatrix} {{\textbf{A}}}' &{} {{\textbf{0}}} &{} {{\textbf{B}}}' \\ {\textbf{BC}}' &{} {{\textbf{A}}} &{} {\textbf{BD}}' \\ {\textbf{DC}}' &{} {{\textbf{C}}} &{} {\textbf{DD}}' \end{pmatrix} \begin{pmatrix} {{\textbf{x}}}^{Ch}_{k} \\ {{\textbf{x}}}^R_{k} \\ {{\textbf{x}}}^I_{k+1} \end{pmatrix}, \end{aligned}$$
(14)

where the intermediate form \({{\textbf{x}}}^D\) of the input modes has been eliminated. The dynamics now follows Eqs. (2)–(4) with the replacements

$$\begin{aligned} {{\textbf{x}}}_k^R\mapsto {{\textbf{x}}}_k^{Ch}\oplus {{\textbf{x}}}_k^R,\quad {{\textbf{A}}}\mapsto \begin{pmatrix} {{\textbf{A}}}' &{} {{\textbf{0}}} \\ {\textbf{BC}}' &{} {{\textbf{A}}} \end{pmatrix},\quad {{\textbf{B}}}\mapsto \begin{pmatrix} {{\textbf{B}}}' \\ {\textbf{BD}}' \end{pmatrix},\quad {{\textbf{C}}}\mapsto \begin{pmatrix} {\textbf{DC}}'&{{\textbf{C}}} \end{pmatrix},\quad {{\textbf{D}}}\mapsto {\textbf{DD}}', \end{aligned}$$
(15)

that is to say the channel and the reservoir may be treated together as if they formed a new, larger reservoir. This simplifies the equations of motion and the simulation of the dynamics. Although one may now consider Eq. (6) to solve the task, in practice the performance is very poor because only the reservoir blocks are controllable, hence the modifications of Eq. (7).
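A sketch of assembling the replacement blocks of Eq. (15) (ours; the primed arguments are the channel blocks):

```python
import numpy as np

def augmented_blocks(A, B, C, D, Ap, Bp, Cp, Dp):
    """Combine channel (primed) and reservoir blocks as in Eq. (15),
    so that the pair can be simulated like a single larger reservoir."""
    zero = np.zeros((Ap.shape[0], A.shape[1]))
    A_new = np.block([[Ap, zero], [B @ Cp, A]])
    B_new = np.vstack([Bp, B @ Dp])
    C_new = np.hstack([D @ Cp, C])
    D_new = D @ Dp
    return A_new, B_new, C_new, D_new
```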

Additional details about the von Neumann entropy detection task

Let \(\rho \) be a single mode Gaussian state. Then its von Neumann entropy is defined as \(S_V(\rho )=-\textrm{Tr}(\rho \textrm{ln}(\rho ))\). It can be shown53 that

$$\begin{aligned} S_V(\rho )=n_{\textrm{th}}\textrm{ln}\left( \frac{n_{\textrm{th}}+1}{n_{\textrm{th}}}\right) +\textrm{ln}(n_{\textrm{th}}+1) \end{aligned}$$
(16)

where \(n_{\textrm{th}}\) is the amount of thermal excitations of the state \(\rho \). This quantity in turn is connected to the determinant of the associated covariance matrix \(\sigma \) through

$$\begin{aligned} \textrm{Det}(\sigma )=(0.5+n_{\textrm{th}})^2, \end{aligned}$$
(17)

which can be seen by direct calculation starting from, e.g., Eq. (9). In Fig. 5b the actual \(S_V(\rho )\) is compared to that computed from the estimated determinant of the input covariance matrix using Eqs. (16) and (17). The minimum value of the determinant is 0.25, reached by pure states. To enforce physical values and for convenience, any estimated \(\textrm{Det}(\sigma )\le 0.251\) is set to \(\textrm{Det}(\sigma )=0.251\).
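For completeness, the post-processing chain from an estimated determinant to the von Neumann entropy via Eqs. (16) and (17), including the clipping just described, can be sketched as follows (our own code):

```python
import numpy as np

def entropy_from_det(det_est):
    """Von Neumann entropy from an estimated covariance matrix
    determinant via Eqs. (17) and (16); unphysical estimates
    below 0.251 are clipped as described above."""
    det_est = max(det_est, 0.251)
    nth = np.sqrt(det_est) - 0.5                 # invert Eq. (17)
    return nth * np.log((nth + 1) / nth) + np.log(nth + 1)  # Eq. (16)
```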