Coherent Ising machines—optical neural networks operating at the quantum limit

Yamamoto, Yoshihisa; Aihara, Kazuyuki; Leleu, Timothee; Kawarabayashi, Ken-ichi; Kako, Satoshi; Fejer, Martin; Inoue, Kyo; Takesue, Hiroki

doi:10.1038/s41534-017-0048-9

Download PDF

Review Article
Open access
Published: 15 December 2017

Coherent Ising machines—optical neural networks operating at the quantum limit

Yoshihisa Yamamoto^1,2,
Kazuyuki Aihara ORCID: orcid.org/0000-0002-4602-9816³,
Timothee Leleu³,
Ken-ichi Kawarabayashi⁴,
Satoshi Kako^1,4,
Martin Fejer²,
Kyo Inoue⁵ &
…
Hiroki Takesue⁶

npj Quantum Information volume 3, Article number: 49 (2017) Cite this article

25k Accesses
135 Citations
25 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 01 February 2018

This article has been updated

Abstract

In this article, we will introduce the basic concept and the quantum feature of a novel computing system, coherent Ising machines, and describe their theoretical and experimental performance. We start with the discussion how to construct such physical devices as the quantum analog of classical neuron and synapse, and end with the performance comparison against various classical neural networks implemented in CPU and supercomputers.

The power of quantum neural networks

Article 24 June 2021

The physics of optical computing

Article 09 October 2023

Realization of a quantum neural network using repeat-until-success circuits in a superconducting quantum processor

Article Open access 21 November 2023

Introduction

Optimization problems with discrete and continuous variables, some of which belong to NP-hard or NP-complete class in complexity theory, are ubiquitous in numerous important areas, including operations and scheduling, drug discovery, wireless communications, finance, integrated circuit design, compressed sensing, and machine learning. Despite rapid advances in both algorithms and digital computer technology, even modest sized NP-hard/NP-complete problems that arise in practice may be very difficult to solve on modern digital computers. One alternative to current von-Neumann-computer-based methods for solving such problems is a neural network realized with analog electronic circuits and other systems.^1,2,3,4 Another approach of contemporary interest is the adiabatic quantum computation (AQC)⁵ and quantum annealing (QA).^6,7 Sophisticated AQC/QA devices are already under development,^8,9,10,11 but providing dense connectivity between qubits remains a major challenge¹² with serious implications for the efficiency of AQC/QA systems.¹³

Networks of degenerate optical parametric oscillators (DOPOs) are an alternative physical system, with an unconventional operating mechanism,^14,15,16,17 for solving the Ising problem,^18,19,20 and by extension many other combinatorial optimization problems.²¹ Formally, the N-spin Ising problem is to find the configuration of spins ${\mathrm{\sigma }}_i \in \left\{ { - 1, + 1} \right\}\,\left( {i = 1, \ldots ,N} \right)$ that minimizes the energy function ${\it H} = - \mathop {\sum}\nolimits_{1 \le i < j \le N} {J_{ij}\sigma _i\sigma _j} - \mathop {\sum}\nolimits_{1 \le i \le N} {h_i\sigma _i}$, where the particular problem instance being solved is specified by the N × N matrix J (with elements J_ij) and the N-dimensional vector h (with elements h_i).

In DOPO Ising machines, the up-spin and down-spin are represented by the positive and negative in-phase amplitudes, i.e. $\sigma _i = X_i{\mathrm{/}}\left| {X_i} \right|$. Then, the energy function H is mapped to the effective loss, photon decay rate, of the DOPO network by implementing the mutual coupling proportional to J_ij between two DOPOs and injecting the dc field proportional to h_i to individual DOPO.¹⁴ The ground state of the Ising Hamiltonian corresponds to an oscillation mode with the minimum network loss. At the pump rate below an oscillation threshold, every DOPO is in a squeezed vacuum state which is interpreted as linear superposition of σ_i = +1 (positive amplitude or up-spin) and σ_i = −1 (negative amplitude or down-spin) so that the probability amplitudes for 2^N spin configurations are all identical, as shown in Fig. 1a. At the threshold pump rate, the ground state with a minimum loss reaches a threshold first and oscillates as a single stable mode, which triggers the stimulated emission of photons and cross-gain saturation for all the other modes, as shown in Fig. 1b. Thus, the solution of the Ising problem can be found by simply detecting the single oscillation mode.¹⁴

The input states into all DOPOs are vacuum states, $\left| 0 \right\rangle _1\left| 0 \right\rangle _2, \cdots \left| 0 \right\rangle _N$. The essential quantum operations are the collective (not individual) symmetry breaking at DOPO threshold, which translates all DOPOs to either positive amplitude $\left| \alpha \right\rangle$ or negative amplitude $\left| { - \alpha } \right\rangle$ coherent state in a correlated way. Finally, the computational results are read out by optical homodyne detectors. A block diagram of the CIM operation is shown in Fig. 1c.

We have realized such a system with a scalable architecture that uses a measurement-feedback circuit in place of optical delay lines, which were used in earlier experiments.^15,22,23 Our measurement-feedback-based DOPO Ising machine allows connections between any spin and any other spin, and is fully programmable.^24,25 Fully connected superconducting Ising machines have been also proposed recently.^26,27

Classical vs. quantum neural networks

Nonlinear devices that have a characteristic input–output relation shown in Fig. 2a represent a typical gain function of neurons in classical neural networks (CNN).^1,2 When an input signal level is weak, the neuron linearly amplifies this input signal and thus compensates for unavoidable linear loss in the network. When the input signal level exceeds a certain threshold value a_th, however, the output signal level is clamped at a constant value b_s. This nonlinear input–output relation is essential for CNN to find a stable operating point which manifests a solution for a given mathematical problem. In this model, each neuronal state is expressed by a continuous variable a_i, which obeys a continuous time evolution governed by

$$\frac{{\rm d}}{{{\rm d}t}}a_i = - a_i + f\left( {a_i} \right) - \frac{{\partial V}}{{\partial a_i}} + g_i,$$

(1)

where the first term of R.H.S. of Eq. (1) expresses the linear loss and the second term describes self-feedback with the nonlinear gain function f shown in Fig. 2a. Note that a time is normalized by the decay rate of a neuron excitation. The third term represents the mutual coupling among neurons and the interaction potential V implements a given mathematical problem. We emphasize that a simultaneous mutual coupling between neurons without introducing undesired instability or oscillation is made possible by the gradient descent character of the third term. The nonlinear gain function f is often implemented as a coupling coefficient for the third term of mutual coupling instead of the second term of self-feedback.^1,2 An essential feature of the CNN is, however, independent of the location of the nonlinear gain function. Finally, the fourth term, g_i, expresses a noise driving force associated with the decay and the nonlinear gain for a neuron excitation.

The classical picture behind Eq. (1) is that each neuron has an unpredictable but definite value a_i at a given time even though the network is randomly driven by internal and/or external noise. This is a correct physical picture of a neural network when thermal noise is much larger than quantum zero-point noise i.e. k_BT$\gg$ħω. However, if the opposite is true, i.e. ħω$\gg$k_BT, the interpretation of Eq. (1) must be altered. It should be considered as a c-number stochastic differential equation (CSDE), which is derived from and fully equivalent to the density operator master equation for the quantum neural network.^16,17 Now, the variable a_i does not represent any physical quantity of the neuron but rather it is merely an eigenvalue (α or β) of the coherent states used in the quasi-probability density to expand the density operator, such as a positive P(α,β) representation¹⁶ or Wigner representation.¹⁷ In order to compute the expectation value of any physical quantity of the neuron, we must first reconstruct the density operator by an ensemble of many trajectories of a_i generated by the CSDE. Whenever ħω$\gg$k_BT, we must take this quantum approach to reach a correct answer.

A quantum neuron (or quantum neural network) is characterized by the following three properties and in this way, it is distinct from a classical counterpart:

1.
A quantum neuron is prepared in a superposition state of different neural excitations so that a quantum parallel search can be implemented.
2.
A network of quantum neurons makes a decision to reach a final computational result through correlated and collective symmetry breaking at a critical point of phase transition.
3.
A network of quantum neurons amplifies the above quantum solution to a classical signal via bosonic final state stimulation.

As shown in the subsequent sections, we will show the above three properties are indeed realized by DOPO and their networks.

The most important advantage of the DOPO-based quantum neural network compared to its classical counterpart is that each neuron (DOPO) is prepared in linear superposition of different in-phase amplitude eigenstates, i.e. squeezed vacuum state and so a quantum parallel search can be implemented to the whole optimization process. This is the crucial computational resource for CIM.

Quantum neurons

Degenerate optical parametric amplifiers and oscillators

Any phase-sensitive amplifier (PSA) has an input–output relation similar to that shown in Fig. 2a due to its inherent gain saturation effect. We focus here on the degenerate optical parametric amplifier (DOPA) based on the two photon emission process shown in Fig. 2b. A particular device is composed of a second-order nonlinear crystal placed in an optical cavity. The nonlinear crystal absorbs one pump photon at a frequency 2ω_s and simultaneously emits two signal photons at a frequency ω_s. A pertinent interaction Hamiltonian²⁸ is expressed by

$$H = \hbar \kappa \left( {\hat a_{\rm s}^{\dagger 2}\hat a_{\rm p} + \hat a_{\rm s}^2\hat a_{\rm p}^\dagger } \right).$$

(2)

Here $\hat a_{\rm s}( {\hat a_{\rm s}^\dagger } )$ and $\hat a_{\rm p}( {\hat a_{\rm p}^\dagger } )$ are the annihilation (creation) operators for the signal and pump fields, and κ is a parametric coupling constant. The simultaneously generated two signal photons are a conjugate pair in a sense that they have a positive correlation in the in-phase amplitudes $\hat X( { = ( {\hat a_{\rm s} + \hat a_{\rm s}^\dagger } ){\mathrm{/}}2} )$ and a negative correlation in the quadrature-phase amplitudes $\hat P( { = ( {\hat a_{\rm s} - \hat a_{\rm s}^\dagger } ){\mathrm{/}}2i} )$. As a result of such positive and negative correlations in $\hat X$ and $\hat P$, constructive and destructive interference happens between the two signal photons along the X-axis and P-axis, respectively. As a result of such quantum interference, if a DOPA is input by an external vacuum state (zero-point fluctuation), the zero-point fluctuation is amplified and deamplified along the X-axis and P-axis, respectively, as shown in Fig. 2c. The resulting state is called a squeezed vacuum state, which is a minimum uncertainty wave packet and satisfies the Heisenberg uncertainty principle with equality, $\left\langle {\Delta \hat X^2} \right\rangle \left\langle {\Delta \hat P^2} \right\rangle = 1{\mathrm{/}}16$, just as the vacuum state.

The phase-sensitive amplification/deamplification mentioned above is not a rare phenomenon in nature. One classical example is a swing driven by a person as shown in Fig. 3a, where a person (corresponding to the pump in DOPA) completes a full cycle (up–down–up) while a swing (corresponding to the signal in DOPA) makes only a half cycle (left-to-right). Note that the phase of the pump and that of the signal should be mutually locked in order to realize an amplitude amplification process, as shown in Fig. 3b. This corresponds to the situation that the in-phase amplitude $\hat X$ is amplified. You can easily imagine from your childhood memory what you did in order to stop the swing when you must go home. You stood up at the center of the swing period and crouched at both ends. Then, the swing (signal) amplitude is attenuated. This corresponds to the situation that the quadrature-phase amplitude $\hat P$ is deamplified.

The OPO has a long history of development at Stanford University.²⁹ In the coherent Ising machine (CIM), a periodically polled lithium-niobate (PPLN) waveguide device is used as such a PSA for signal pulses. The first experimental demonstration of deamplification (squeezing) and amplification (anti-squeezing) for optical pulses in vacuum states using this particular device was reported in 1995.³⁰ Figure 3c shows the suppressed quadrature-phase amplitude noise (squeezing) and enhanced in-phase amplitude noise (anti-squeezing) by the PPLN waveguide DOPA. It is a general property of the squeezing devices that the degree of squeezing (vacuum noise deamplification) is reduced by experimental system’s imperfections, in particular, a linear optical loss, but that the degree of anti-squeezing (vacuum noise amplification) is not reduced so much by linear optical loss. Suppose a PSA output has a 20 dB of squeezing, $4\left\langle {\Delta \hat P_{{\mathrm{in}}}^2} \right\rangle = 10^{ - 2}$, for the quadrature-phase amplitude and 20 dB of anti-squeezing, $4\left\langle {\Delta \hat X_{{\mathrm{in}}}^2} \right\rangle = 10^2$, for the in-phase amplitude. An optical linear loss L degrades the degree of squeezing and anti-squeezing according to

$$\left\langle {\Delta \hat X_{{\mathrm{out}}}^2} \right\rangle = L\left\langle {\Delta \hat X_{{\mathrm{in}}}^2} \right\rangle + \left( {1 - L} \right)\frac{1}{4},\,\left\langle {\Delta \hat P_{{\mathrm{out}}}^2} \right\rangle = L\left\langle {\Delta \hat P_{{\mathrm{in}}}^2} \right\rangle + \left( {1 - L} \right)\frac{1}{4}.$$

(3)

If there is a 3 dB of linear loss after the PSA, the degree of squeezing is only ~3 dB while the degree of anti-squeezing is still ~17 dB. Note the Ising spin is represented by the in-phase amplitude rather than the quadrature-phase amplitude. Therefore, the quantum parallel search in the CIM is based on the anti-squeezed quantum noise of the in-phase amplitude rather than the squeezed quantum noise of the quadrature-phase amplitude, so that the operation of CIM is inherently robust against an optical loss.

The input–output relation of the DOPA for an increasing signal power above the saturation level approximately reproduces the nonlinear response function f(a) shown in Fig. 2a.³¹ A physical mechanism behind this gain saturation is the depletion of the pump power, which triggers a reverse energy flow, i.e. two signal photons are absorbed simultaneously in the nonlinear crystal to generate one pump photon. When the input signal power becomes sufficiently large, the reverse energy flow from the signal to the pump is switched on and the linear amplification process must stop. In our case of constructing quantum neurons, a related but slightly different nonlinear effect is employed to amplify a solution state at a microscopic level to that at a macroscopic level, which we will explain next.

If the parametric amplifier gain exceeds the signal decay rate from the cavity, the system can sustain a steady-state field by creating a finite average amplitude along the X-direction. The center position of a quantum wave packet in the (X,P) phase space is no more $X = P = 0$ but there emerges a non-zero $X$ value, as shown in Fig. 2d. This is the oscillation threshold point. The device pumped at above the oscillation threshold is called a DOPO. The probability for creating a positive $X$ value and negative $X$ value is randomly selected by 50–50% probability, which is known as spontaneous symmetry breaking and is an ubiquitous feature for any second-order phase transition phenomena.³² In the language of dynamical systems theory, the DOPO threshold represents the supercritical pitchfork bifurcation. The two stable states with positive and negative $X$ values correspond to the firing and non-firing states of a classical neuron or up-spin and down-spin states of an Ising problem. A similar mechanism, dissipative quantum bifurcation machine, was discussed recently in the context of Josephson junction qubits.^33,34,35 When the input signal level to the DOPA is varied while the pump rate is fixed at below the oscillation threshold, the output signal level is first linearly amplified and then eventually reaches the oscillation state with a clamped amplitude due to gain saturation. In this way, the nonlinear input/output relation shown in Fig. 2a is realized in the transition from DOPA to DOPO in our quantum neurons.

Why are DOPOs referred to as quantum neurons and how are they different from classical neurons? We will present the answers to these important questions in the next section.

Linear superposition

A set of photon number eigenstates $\left| n \right\rangle$ can expand an arbitrary state of the field as an orthonormal set. The wavefunction of a squeezed vacuum state (see Fig. 2c) can be mathematically constructed as a superposition of photon number eigenstates with even eigenvalues:

$$\left| {{\it{\psi }}_{{\rm sv}}} \right\rangle = c_0\left| 0 \right\rangle + c_2\left| 2 \right\rangle + c_4\left| 4 \right\rangle + \cdots .$$

(4)

A simple physical reasoning behind Eq. (4) is explained as follows: The photon number of an intense pump field exhibits large quantum uncertainty, typically on the order of $\left\langle {\Delta \hat n^2} \right\rangle \sim \left\langle {\hat n} \right\rangle$ for a coherent pump field. Therefore, we cannot extract which-path information, even in principle, whether the number of absorbed pump photons in the nonlinear crystal is zero, one, two … at a given time. Because one pump photon is converted into two signal photons, those cases correspond to the output signal field in $\left| 0 \right\rangle ,\,\left| 2 \right\rangle ,\,\left| 4 \right\rangle ,\, \cdots$ photon number eigenstates. Because of the lack of which-path information, the proper expression for the quantum state of the output signal field of DOPA must be a superposition of those states. Except for an irrelevant phase factor, the probability amplitudes c₀,c₂,c₄,$\cdots$ have an identical phase, for instance, positive real numbers.³⁶ In the (X, P) phase space, the different photon number eigenstates constructively interfere with each other along the X-axis, while they destructively interfere with each other along the P-axis.

Alternatively, a squeezed vacuum state can be mathematically constructed as a superposition of quadrature amplitude eigenstates $\left| X \right\rangle$, which are the eigenstates of the Hermitian operator $\hat X$:

$$\left| {{\it{\psi }}_{{\rm sv}}} \right\rangle = {\int}_{\kern-6pt - \infty }^\infty {C\left( X \right)\left| X \right\rangle {\rm d}{X}.}$$

(5)

It is not difficult to show that the different in-phase amplitude eigenstates constructively interfere with each other in the small P-regions centered at $P = 0$, while they destructively interfere with each other in the large P-regions. Therefore, more anti-squeezing (enhanced quantum noise) along the X-axis realizes more squeezing (reduced quantum noise) along the P-axis.

It is worth pointing out that a statistical mixture of in-phase amplitude eigenstates

$$\hat \rho _{{\mathrm{mix}}} = {\int}_{\kern-6pt - \infty }^\infty {P\left( X \right)\left| X \right\rangle \left\langle X \right|{\rm d}X}$$

(6)

instead of Eq. (5), covers the same range of the eigenvalues X but requires an infinite amount of energy to realize, since the localization of the state along the P-axis near $P = 0$ is now impossible due to the lack of phase coherence between different $\left| X \right\rangle$ eigenstates. The infinite uncertainty along the P-axis means an infinite energy. A squeezed vacuum state, represented by Eq. (5) and produced by the DOPA, allows a quantum parallel search with a finite energy, while a classical mixed state, represented by Eq. (6), does not allow such a quantum parallel search.

A particularly unique quantum feature of the DOPA/DOPO is that the aforementioned superposition survives not only below the threshold but also above the threshold to some extent if the cavity loss is small. At well above the oscillation threshold, the DOPO produces either 0-phase or π-phase coherent field as shown in Fig. 2d. However, a DOPO at just above the oscillation threshold maintains the superposition of 0-phase and π-phase states, in spite of finite photon leakage from the resonator.^37,38,39 This is possible because a hypothetical measurement performed for a leaked signal field cannot identify which phase the DOPO selects due to the enhanced quantum noise along the X-axis and also small separation between the two states. The anti-squeezed in-phase amplitude noise realizes a so-called “quantum erasure” for this hypothetical which-path measurement.

The evidence for the above statement is shown in Fig. 4a.⁴⁰ Here we consider a CIM consisting of synchronously pumped DOPO pulses circulating in a fiber ring cavity.^24,25 The density matrix elements $\left\langle {\left. X \right|\hat \rho \left| {X^\prime} \right.} \right\rangle$ in the (X+X′, X−X′) phase space for two out-of-phase (anti-ferromagnetically) coupled DOPOs are plotted, where the pump rate is just above the threshold. In Fig. 4a, the in-phase amplitude probability distribution $\left\langle {\left. X \right|\hat \rho \left| X \right.} \right\rangle$, which is given as the horizontal slice at X−X′ = 0, and the quantum coherence $\left\langle {\left. X \right|\hat \rho \left| { - X} \right.} \right\rangle$ between the two in-phase amplitude eigenstates $\left| X \right\rangle$ and $\left| { - X} \right\rangle$, which is given as the vertical slice at X+X' = 0, clearly show macroscopically separated coherent states $\left| \alpha \right\rangle$ and $\left| { - \alpha } \right\rangle$ maintain quantum coherence.⁴⁰ Figure 4b compares the density matrix elements $\left\langle {\left. X \right|\hat \rho \left| {X^\prime} \right.} \right\rangle$ for a superposition state, $1{\mathrm{/}}\sqrt 2 \left( {\left| \alpha \right\rangle + \left| { - \alpha } \right\rangle } \right)$, and a mixed state, $\hat \rho = 1/2\left( {\left| \alpha \right\rangle \left\langle \alpha \right| + \left| { - \alpha } \right\rangle \left\langle { - \alpha } \right|} \right)$, in the same (X+X′, X−X′) phase space, respectively. We can conclude from Fig. 4a that the two DOPOs are approximately in Schrödinger’s cat states and yet their centers of gravity are negatively correlated due to out-of-phase coupling. That is, the DOPO1 (above) is biased toward a positive amplitude, while the DOPO2 (below) is biased toward a negative amplitude. Figure 4c shows the Wigner function of the DOPO state. The oscillatory behavior and negative values of the Wigner function are another manifestation of the quantum coherence between $\left| \alpha \right\rangle$ and $\left| { - \alpha } \right\rangle$ states.⁴⁰ Such a highly non-classical feature is quickly destroyed by increasing a cavity loss. However, it should be emphasized that the superposition of $\left| X \right\rangle$ eigenstates with positive and negative eigenvalues in a squeezed vacuum state, represented by Eq. (4), is rather robust against a cavity loss as mentioned above so that the resulting quantum parallel search survive in a low-Q cavity.

Amplitude and phase error correction capability given by phase-sensitive amplification

Fluctuations are induced in the DOPO fields by the external phase and amplitude noise injection. Both the amplitude and phase of the DOPO field are continuous variables so that the standard error detection and error correction techniques cannot be applied. Fortunately, the phase-sensitive amplification/deamplification mechanism of DOPO can stabilize the phase to either 0 or π as long as the phase error is small compared to ±π/2.¹⁵ Moreover, the amplitude error can be also suppressed by the gain saturation mechanism of the DOPO. If the signal amplitude is increased to above the steady-state value, the pump amplitude is more strongly depleted which, in turn, recovers the steady-state amplitude through a reduced parametric gain. The opposite is true when the signal amplitude is decreased to below the steady-state value. The deterministic amplitude decay due to optical linear loss can be compensated for by the average gain. In this way, the output amplitude is stabilized through the balance between the linear cavity loss and the saturated gain in DOPO, while the phase is stabilized by the phase-sensitive deamplification of DOPO. This is schematically shown in Fig. 5.

The DOPO provides a unique opportunity as a robust analog memory against external noise injection. We can stably store the analog information at quantum limited accuracy with the above-mentioned mechanisms.

Quantum synapses

We wish to implement a target Hamiltonian as a cost function (effective loss) of the network consisting of quantum neurons. Our target Hamiltonian is the following Ising model:⁴¹

$$H = - \mathop {\sum}\limits_{i < j} {J_{ij}\sigma _i\sigma _j - \mathop {\sum}\limits_i {h_i\sigma _i,} }$$

(7)

where the Ising spin σ_i takes either +1 (up-spin) or −1 (down-spin). As described above, we wish to represent σ_i = 1 or σ_i = −1 by the positive or negative in-phase amplitude, which corresponds to 0-phase or π-phase oscillation, respectively (Fig. 2d). The Ising coupling J_ij and the local field h_i take continuous (real) values, the magnitude of which are determined by mapping a given combinatorial optimization problem on the Ising model.^13,21 The three-dimensional Ising model and the two-dimensional Ising model with local fields belong to the NP-hard class in complexity theory.¹⁸ Therefore, you can imagine that many hard problems in the real world can be solved through the Ising model.^13,21 In order to implement the cost function described in Eq. (7) as the effective loss of the DOPO network, pairs of DOPOs must be coupled with the coupling constant J_ij and also the constant optical field h_i must be injected into each DOPO.¹⁴

Optical delay line coupling scheme

In the configuration shown in Fig. 6a, N independent DOPOs are simultaneously realized as N optical pulses circulating in a single fiber ring cavity with an internal PSA which is driven externally by pump pulse trains. Figure 6a shows one experimental scheme to implement the Ising coupling J_ij in the DOPO network, where a part of each DOPO pulse circulating in a fiber ring resonator is picked-off at every round trip by the output coupler, amplified by an external PSA, split into multiple optical delay lines including intensity and phase modulators and then injected back to the target DOPO pulse at appropriate timing.^15,22,23

The external PSA is provided by another DOPA and boosts the in-phase amplitude $\hat X_{}^{}$ of a picked-off pulse before it is attenuated by the beam splitter and modulator loss. Using this technique, we can implement a relatively large Ising coupling constant (J_ij ≲ 1) even if the system size is fairly large (N ≳ 10⁴). Using such N−1 optical delay lines, any (jth) pulse can be connected to any other (ith) pulse with a coupling constant J_ij. Please note that asymmetric coupling J_ij ≠ J_ji can be easily implemented in this scheme and that ~N² mutual coupling constants can be realized by (N−1) optical delay lines. Such an all-optical coupling scheme has been experimentally demonstrated for N = 4 and 16 pulses using free-space optics with a 4.8 m ring cavity and 250 MHz and 1 GHz repetition frequencies, respectively^15,22 and for N = 10⁴ pulses using a planar lightwave circuit and fiber optics with 1 km fiber ring cavity and 2 GHz repetition frequency.²³ In the latter system, for example, the optical fiber length is 1 km so that a round trip time is 5 μs while the pulse interval is 0.5 ns, which leads to N = 10⁴ DOPO pulses inside a single ring cavity.

This optical delay line coupling machine is capable of producing the quantum noise correlation among the internal DOPO pluses.^16,17 In the section Quantum features of CIM, we will discuss this point in detail using the two measures for evaluating quantum noise correlation: quantum entanglement and discord.

Measurement-feedback coupling scheme

An alternative coupling scheme to implement the Ising coupling J_ij is shown in Fig. 6b.^24,25 Instead of directly connecting the DOPO pulses with optical delay lines, we can measure approximately the in-phase amplitude of the internal DOPO pulse by the optical balanced homodyne detectors. If the inferred in-phase amplitude of the jth DOPO pulse is represented by $\tilde X_j$, the feedback pulse to the ith DOPO pulse should have an in-phase amplitude proportional to $\mathop {\sum }\nolimits_j J_{ij}\tilde X_j$. The complicated task of the synchronous computation of the vector-vector multiplication between J_ij and $\tilde X_j$, which must be completed in the pulse interval of 100 ps – 1 ns, is achieved by a single measurement-feedback circuit consisting of an analog-to-digital converter, a field programmable gate array (FPGA), a digital-to-analog converter and optical amplitude/phase modulators.^24,25 The feedback pulse used as an input to the optical modulator and the local oscillator pulse used for optical homodyne detection are both provided by a part of the pump laser output, as shown in Fig. 6b.

Such a measurement-feedback coupling scheme (Fig. 6b) is equivalent to an optical delay line coupling scheme (Fig. 6a) except for the following advantage/disadvantage. The advantage of the measurement-feedback scheme is that all-to-all coupling of the order of ~N² connections can be implemented by a single measurement-feedback circuit, so that the daunting task of constructing N−1 optical delay lines and stabilizing their delay lengths (or optical phase) with an error much less than the optical wavelength can be avoided. In addition, many-body Ising-type interactions, such as ${\it H} = - \mathop {\sum }K_{ijk}\sigma _i\sigma _j\sigma _k$, can be readily implemented. The disadvantage of the measurement-feedback scheme is that the FPGA circuit must complete ~Ο(N) vector–vector multiplication and addition within each pulse interval. This imposes a limitation on the machine size N and the pulse repetition frequency. On the other hand, the optical delay line coupling scheme enjoys its inherent high-speed operation with a pulse repetition frequency limited only by optical device performance.

There is a subtle but important difference in the operational principles from quantum mechanical viewpoints between the optical delay line coupling and measurement-feedback coupling schemes. The measurement-feedback scheme does not produce any entanglement among DOPO pulses. The total density operator stays in the product state of individual DOPO pulse density operators, because the coupling is provided by the local operation and classical communications (LOCC). However, the measurement-feedback scheme is capable of implementing the non-unitary state reduction to pump out a spurious entropy from the network and prepare each DOPO pulse close to the Heisenberg limit (minimum uncertainty wave packet) induced by quantum measurements. The wave packet is actually non-Gaussian at above threshold, which contributes to enhanced quantum tunneling in the quantum parallel search process.

Figure 6c shows schematically this scenario. Suppose the two DOPOs are connected by the anti-ferromagnetic Ising coupling constant J₂₁ < 0. At the pump rate near threshold and before measurement feedback, the two DOPOs are independently in squeezed vacuum states. If the DOPO1 is measured and a positive amplitude is obtained ($\tilde X_1 > 0$), the center position of the DOPO1 wave packet is shifted to the positive side and the variance is reduced, but a long tail exists toward X₁ = 0 and X₁ < 0 (non-Gaussian). Simultaneously, the center position of the DOPO2 wave packet is shifted to the negative side by injecting a feedback pulse with an amplitude of $J_{21}\tilde X_1$. In this way, the anti-ferromagnetic order is implemented in the two quantum states even with a single measurement-feedback step.

Mapping of an Ising Hamiltonian to DOPO network loss

In this section, the in-phase amplitude x_j(t) of the jth DOPO is considered as a classical variable. The simple classical picture presented in this section still captures an essential part of the mapping principle.

Pitchfork bifurcation

We consider in this section simple equations of motion that capture the essential computational principle for the sake of clarity. We consider the dynamics of N analog variables x_j, with x_j ∈]−∞, +∞[and j∈{1,⋯, N}, given as follows:

$$\dot x_j\left( t \right) = - \frac{{\partial V}}{{\partial x_j}},$$

(8)

with

$$V = \mathop {\sum }\limits_l \left[ {V_b\left( {x_j} \right)} \right] + {\it{\epsilon }}V_l\left( x \right),$$

(9)

where V_b(x_j) is the archetype bistable potential in x_j with $V_{\rm b}\left( {x_j} \right) = - \frac{1}{2}\alpha x_j^2 + \frac{1}{4}x_j^4$ and V_l(x) the analog version of the Ising Hamiltonian with V_l(x) = −∑_jω_jlx_jx_l. Here t = (γ_s/2)τ is the unitless time normalized by the signal field lifetime 2/γ_s, γ_s is the signal photon decay rate, p = α + 1 = F_p/F_th is the unitless pump rate normalized by the solitary DOPO threshold pump photon flux $F_{{\rm th}} = \gamma _{\rm s}\sqrt {\gamma _{\rm p}} {\mathrm{/4}}\kappa$, γ_p is the pump photon decay rate, κ is the parametric coupling constant, x_j = X_j/A_s is the normalized in-phase amplitude, and $A_{\rm s} = \sqrt {\gamma _{\rm s}\gamma _{\rm p}{\mathrm{/2}}\kappa ^2}$ is the saturation amplitude which is equal to the actual DOPO oscillation amplitude at the normalized pump rate p = 2. α = −1 + p is the bifurcation parameter given by the normalized decay rate (−1) and linear gain (p) for the signal field. Moreover, the parameter $\epsilon$ is small enough such that $0 < \epsilon \ll 1$. Note that the Ising coupling constant ω_jl can be arbitrarily chosen under this constraint on the parameter ε. Equation (8) describes the dynamics of a gradient descent system. Note that the equations of motion for a solitary DOPO are given as $\dot x_j = \alpha x_j - x_j^3$ when ε = 0, which represents the normal form of the supercritical pitchfork bifurcation.

In the uncoupled case, i.e., ε = 0, the potential V_b is monostable with x_j = 0 and bistable with x_j = ±$\sqrt \alpha$ when α < 0 and α > 0, respectively. In the coupled case when $0 < \epsilon \ll 1$ and α is large enough, the state of each variable x_j can be considered as approximately binary and can be mapped to the Ising spin. In particular, the Ising spin configuration {σ_j}_j is associated to the state {x_j}_j of the system described in Eq. (8) by considering that σ_j is equal to the sign of x_j, i.e., σ_j = x_j/|x_j|.

When α increases over a critical threshold value, there is a breaking of symmetry and the states x_j become either positive or negative. The emergence of the first non-zero stable steady state can be studied by considering linear stability analysis at the origin, i.e., the Jacobian matrix J associated with Eq. (8) and computed at the state x_j = 0, ∀_ij. This Jacobian matrix is given as follows:

$$J = \alpha I + \epsilon {\mathrm{\Omega ,}}$$

(10)

where Ω is the matrix with components ω_jl and I the identity matrix.^14,42 The destabilization of the zero state corresponds to the emergence of the first non-zero stable steady state after the symmetry breaking and occurs when the maximal eigenvalue of the Jacobian matrix J, noted λ₁, is equal to zero. The eigenvalue λ₁ is given as $\lambda _1 = \alpha + {\it{\epsilon }}\mu _1$, where μ₁(>0) is the maximal eigenvalue of the matrix Ω, which we consider unique for the sake of simplicity. The first non-zero steady state becomes stable when λ₁ = 0, i.e., when the bifurcation parameter α is equal to α_c given as follows:^14,42

$$\alpha _c = - {\it{\epsilon }}\mu _1.$$

(11)

As the bifurcation parameter α increases further above the threshold α_c, other states become stable and the system develops a complex attractor landscape. Note that Eq. (11) suggests that α_c < 0 in general.

Condition for successful mapping

The linear stability analysis of the zero state allows determining the bifurcation parameter α_c for which the zero state becomes unstable. Moreover, the eigenvector associated with the eigenvalue λ₁ indicates the locations of the first non-zero stable steady states just after the bifurcation has occurred but, in general case, these locations do not correspond to the ground-state configurations of the Ising Hamiltonian. However, it can be noticed that, under the particular condition that all analog amplitudes |x_j| are equal, the first non-zero stable steady states are mapped rigorously to the ground-state configurations of the Ising Hamiltonian.⁴² Indeed, the steady-state condition of Eq. (8) is given as follows when x_j = σ_jx, ∀_ij, with x > 0:

$$\begin{array}{ccccc}\\ & \dot x_j = 0 \Rightarrow \alpha x_j - x_j^3 + {\it{\epsilon }}\mathop {\sum }\limits_{l \ne j} \omega _{jl}x_l = 0\\ \\ & \Rightarrow \alpha - x^2 + {\it{\epsilon }}\mathop {\sum }\limits_{l \ne j} \omega _{jl}\sigma _l\sigma _j = 0.\\ \end{array}$$

(12)

Thus, the homogeneous squared amplitude x² can be written as follows:

$$x^2 = \alpha - \frac{{2{\it{\epsilon }}}}{N}H,$$

(13)

with the Ising Hamiltonian H without a dc field. When ${\mathrm{\alpha }} - \frac{{2{\it{\epsilon }}}}{N}{\it H} < 0$, the only solution of Eq. (12) is x = 0. As α increases, the first non-zero steady state appears at the minimal value of the function $\frac{{2{\it{\epsilon }}}}{N}{\it H}$ as shown in Eq. (13). As shown in Fig. 1a, b, the ground-state configurations of the Ising Hamiltonian appear as a single oscillation mode. In a nutshell, the first non-zero stable steady states of Eq. (12) encode for the ground-state configurations of the Ising Hamiltonian if the steady-state amplitudes |x_j|,∀j, are all equal.

Effect of the amplitude heterogeneity

In more general case, the amplitudes |x_j| are not equal and the mapping between the first stable steady-state configurations and the minima of the Ising Hamiltonian does not hold.¹⁴ In practice, approximate solutions to various combinatorial optimization problems can still be obtained under this approximate mapping.^14,42 In order to quantify the distance between the case of exact mapping and more general case, i.e., when amplitudes are homogeneous and heterogeneous, respectively, we quantify the amplitude heterogeneity using the quantity δ_x defined as follows:

$${\delta _x} = \frac{\sqrt{\langle{\delta_j^2}\rangle}}{\langle{x_j^2}\rangle},$$

(14)

with $\delta _j = x_j^2 - \langle x^2\rangle$ and $\langle x^2\rangle = \frac{1}{N}\mathop {\sum }\limits_j x_j^2$. The quantity δ_x corresponds to the coefficient of variation of the squared amplitudes. For α > 0 and $\epsilon \ll 1$, the amplitudes x_j can be expressed as $x_j = x_j^{\left( 0 \right)} + {\it{\epsilon }}x_j^{\left( 1 \right)} + {\mathrm{O}}\left( {{\it{\epsilon }}^2} \right)$ with $x_j^{(0)} = \sigma _j\sqrt \alpha$ and $x_j^{(1)} = \left( {1/2\sqrt \alpha } \right)\mathop {\sum }\limits_l \omega _{jl}\sigma _l$ using the perturbation theory in order to find an approximate solution to the steady-state equation (12). Thus, the squared amplitudes can be expressed as follows in the general case:

$$\begin{array}{ccccc}\\ x_j^2 = & \hskip-13pt \left[ {x_j^{(0)} + {\it{\epsilon }}x_j^{(1)} + O\left( {{\it{\epsilon }}^2} \right)} \right]^2\\ \\ \,\,\,\,= & \hskip-6pt \alpha + {\it{\epsilon }}\mathop {\sum }\limits_l \omega _{jl}\sigma _j\sigma _i + O\left( {{\it{\epsilon }}^2} \right).\\ \end{array}$$

(15)

Using Eq. (15), we can express $x_j^2$ as follows:

$$x_j^2 = \left\langle {x_j^2} \right\rangle + \delta _j + O\left( {{\it{\epsilon }}^2} \right),$$

(16)

with $\delta _j = {\it{\epsilon }}\left( {\sigma _jh_j - \left\langle {\sigma _jh_j} \right\rangle } \right)$, h_j = ∑_l ≠ j ω_jlσ_l, and $\left\langle {x^2} \right\rangle = \alpha - \frac{{2\varepsilon }}{N}H$. We can remark that the homogeneous case, for which there exists exact mapping between the first non-zero stable steady states and the ground-state configurations of the Ising Hamiltonian, is the limit of the general case when δ_j → 0. Thus, the approximate mapping of the Ising Hamiltonian results from the variations of the squared amplitudes δ_j.

Experimentally, δ_j → 0 can be achived by additional measurement-feedback control. That is, CIM measures $x_j^2$ at each round trip and computes $\delta _j = x_j^2 - \alpha + \frac{{2\varepsilon }}{N}H$. This error signal is negatively fed back to the jth pump pulse or directly to the jth DOPO pulse to suppress δ_j.

Figure 7a shows the modulated threshold gain α_c due to the mutual coupling ω_ij and the success probability P_s of finding a ground state vs. the normalized pump rate p for an N = 8 Ising problem, in which each vertex has three edges and there is no Zeeman term (MAX-CUT-3 problem).¹⁴ In this problem, each vertex has three edges with the constant weight ω_ij = −0.1 as shown in Fig. 7b. At high pump rates p ≥ 1.3, the possible value of α_c is minimum for the ground state and increases monotonically with the order of excited states. That is, the mapping is successful. However, the threshold gain α_c of the first excited state decreases to below that of the ground state at p < 1. This unexpected result stems from the heterogeneity of the DOPO amplitudes. Figure 7b shows the steady-state amplitude (by circle size) and phase (by color) of each DOPO for the first excited state at p ≲ 1. Five of the total eight DOPOs mutually couple without destructive interference so that they can store large amplitudes. On the other hand, the remaining three DOPOs mutually couple with destructive interference. By keeping amplitudes in these three DOPOs small, the whole network realizes an effective field decay rate even smaller than that of the ground state. Consequently, the first excited state oscillates at a smaller pump power than the ground state, which leads to the malfunction of the DOPO network.

At above the threshold (p ≥ 1), the first excited state is metastable so that the oscillation at the first excited state switches to the oscillation at the ground state by quantum tunneling with a finite lifetime.⁴³ This is why the success probability P_s increases from p = 1 to p = 1.3. However, if the pump rate is too high (p > 1.3), the potential barrier becomes too high to tunnel from a local minimum to a global minimum, which stabilizes the false oscillation mode at the first excited state. This is the reason why P_s decreases at p > 1.3 in Fig. 7a.

We remark that a technique to ensure equal amplitudes of DOPOs is actually realized by using an error detection and correction method so that the ground state of the Ising Hamiltonian is selected by a single oscillation mode.

Experimental CIMs

Optical delay line coupling CIM

The first experimental CIM implements N = 4 Ising spins with a free-space multiple-pulse DOPO and achieves all-to-all connections with N−1(=3) optical delay lines.¹⁵ The Ising coupling constants are uniform and anti-ferromagnetic type (J_ij < 0). When the pump power is increased gradually, the machine finds one of three degenerate ground states of this complete graph with equal probabilities as shown in Fig. 8a. On the other hand, if the Ising coupling is switched-off, the machine randomly picks 1 of the 8 states by spontaneous symmetry breaking.

The second experimental CIM implements N = 16 Ising spins with a free-space multiple-pulse DOPO and connect them sparsely to realize one-dimensional ring or Möbius-Ladder graph configuration using two or three optical delay lines, respectively.²² The Ising coupling constants are uniform. Again, the machine finds one of the degenerate ground states with almost equal probabilities. These two CIMs have a 100% success probability to find ground states and achieve the random sampling capability for the degenerate ground states.

The third experimental CIM implements N = 10,000 Ising spins with a fiber DOPO and realizes one-dimensional ring.²³ The success probability as a function of system size N normalized by the correlation length x₀ and pump rate p normalized by the threshold pump rate p₀ agrees fairly well with the theoretical model⁴⁸ as shown in Fig. 8b.

Measurement-feedback CIM as exact solvers

A setup of the measurement-feedback based CIM, where N = 100 DOPO pulses are fully connected by a single measurement-feedback circuit, is installed at Stanford University.²⁴ A fraction (~10%) of each pulse intensity is picked off and its in-phase amplitude $\hat X_j$ is measured by balanced optical homodyne detectors. Then, a feedback signal for the ith DOPO pulse, $\mathop {\sum }\limits_j J_{ij}\tilde X_j$, is computed by an FPGA. This signal is imposed on the feedback optical pulse for the ith DOPO pulse by intensity and phase modulators. Finally, the injection coupler combines the feedback optical pulse and the ith DOPO pulse. Similarly, a feedback optical pulse for the jth DOPO pulse, whose amplitude is proportional to $\mathop {\sum }\limits_i J_{ji}\tilde X_i$ is prepared and injected to the jth DOPO pulse. After one round trip, the full Ising Hamiltonian is implemented via N injected feedback pulses. Note that asymmetric Ising coupling J_ij ≠ J_ji can be readily implemented in this scheme. On the other hand, a Zeeman term h_i can be implemented by injecting a constant optical pulse which is produced by appropriate amplitude and phase modulation of the master oscillator output.

Figure 9a shows the observed success probability of finding a ground state of the Möbius-Ladder graph as a function of the graph size.²⁴ Multiple 100-run batches were performed for each graph size and the standard deviations are plotted as error bars. Figure 9a also show the histograms of obtaining the low-energy excited states as well as the ground states. In the case of N = 100, the CIM can find the ground state out of ~10³⁰ solution candidates with a probability of 21 ± 9%. Note that the amplitude heterogeneity is not actively suppressed in this experiment. It is theoretically predicted that the success probability is ~100% for the problem size up to N = 100 when the DOPO amplitudes are kept uniform.

Measurement-feedback CIM as approximate solvers

A similar setup of the larger-scale CIM, where N = 2000 DOPO pulses are fully connected by a single measurement-feedback circuit, is implemented at NTT.²⁵ Major changes from the Stanford CIM²⁴ are the fiber length increased from 300 m to 1 km and the pulse repetition frequency increased from 100 MHz to 1 GHz.

Figure 9b shows the observed Ising energy vs. computation time for the MAX-CUT problems in N = 2000 complete graphs with all-to-all connections.²⁵ It is impossible to find the ground state for this size of the complete graph with a current digital computer. We used the 87.8%, even in worst case, approximate solution obtained by the semi-definite programing (SDP)²⁰ as a benchmark target. The experimental CIM reaches this target with 70 μs,²⁵ while the simulated annealing (SA) implemented in the state of art Central Processing Unit (CPU) reaches the same target with 2.1 ms. Table 1 compares the computational time of experimental CIM with those of four different types of classical algorithms implemented on either CPU or supercomputer (PEZY-Shobu) at Riken.⁴⁴ The results summarized in Fig. 9b and Table 1 well constitute the evidence for a novel speedup of CIM. Figure 9c shows the histograms of the final energies (or cut values) for the SDP which achieves the best theoretical accuracy at the present time, SA which is the most popular heuristic at the present time, and CIM. The CIM also shows the better performance than SA and SDP in accuracy, too. The computation times for SDP, SA, and CIM are 100 s, 50 ms, and 5 ms, respectively, in this accuracy measurement.

Table 1 Four types of classical neural network models implemented as algorithms in CPU or supercomputer (PEZY-Shobu) and the computational time to reach the 87.8% accuracy level provided by Goemans–Williamson semi-definite programing⁴⁴

Full size table

Quantum features of CIM

Optical neural network at the quantum limit

One of the unique features of optical neural networks is the continuous crossover of their operational modes from quantum limit to classical limit. In this section, we will present various numerical results showing the difference between the two regimes and shedding a light on the quantum-to-classical crossover.

Figure 10a shows the trajectory of the variances $2\left\langle {\Delta \hat X^2} \right\rangle$ and $2\left\langle {\Delta \hat P^2} \right\rangle$ for the DOPO field in the measurement-feedback CIM consisting of two anti-ferromagnetically coupled spins.⁴⁰ A minimum uncertainty state at the Heisenberg limit satisfies $\left\langle {\Delta \hat X^2} \right\rangle \left\langle {\Delta \hat P^2} \right\rangle = 1{\mathrm{/}}16$, which is shown by the dashed line in Fig. 10a. The CIM with a high-Q or low-Q cavity, in which a round trip loss is either 10% (−0.5 dB) or 50% (−3dB), continuously excites the quantum states which are close to the Heisenberg limit, as shown by red or blue line in Fig. 10a. On the other hand, optical neural networks at thermal noise limit (k_BT$\gg$ħω) should operate in the classical regime defined by $2\left\langle {\Delta \hat X^2} \right\rangle \ge 1{\mathrm{/}}2$ and $2\left\langle {\Delta \hat P^2} \right\rangle \ge 1{\mathrm{/}}2$, which is shown by the shaded area in Fig. 10a.

The success probability P_s of finding the ground state of an N = 16 one-dimensional Ising spin model, in which only nearest neighbor anti-ferromagnetic interaction exists, is numerically evaluated for various values of the temperature parameter n_th = k_BT/ħω and the result is shown in Fig. 10b.¹⁷ Here the optical delay line coupling CIM is assumed. In this numerical simulation, the pump rate is abruptly increased from p = 0 to p = p₀ at t = 0. In the case of ħω$\gg$k_BT (quantum noise limit), the squeezed vacuum state allows the quantum parallel search during a transient time before the steady-state amplitude is formed. The transient time decreases with the final pump rate p₀, so that the maximum success probability is achieved at a final pump rate just above the threshold value, $p_0 \simeq p_{{\rm th}} = 1$, which allows the DOPO network to have a sufficient time to search for the solution by creating the quantum noise correlation. If the final pump rate p₀ is far above the threshold, the coherent field with random 0-phase or π-phase is formed quickly in each DOPO through spontaneous symmetry breaking, before the quantum search establishes the sufficient quantum noise correlation and identifies a correct solution. The quantum tunneling is not strong enough to overcome the potential barrier separating 0-phase and π-phase when the oscillation field is strong. In this way, the DOPO network is trapped in one of the excited states (local minima). This reasoning explains the monotonic decrease in P_s for n_th$\ll$1 as a function of p(≥ 1) in Fig. 10b.

In the case of ħω$\ll$k_BT (thermal noise limit), the squeezed thermal state, which is formed before reaching the threshold, does not allow the quantum parallel search during a transient time before the steady-state amplitude is formed. This is because the quantum coherence between the states $\left| X \right\rangle$ and $\left| { - X} \right\rangle$ is ruined by the destructive interference among different photon number eigenstates in a thermal state. The maximum success probability is achieved at a final pump rate p₀$\gg$p_th = 1, where the coherent mean-field is produced and searches for a solution. This mode of operation requires that the mean-field amplitude is larger than the thermal noise amplitude. This is an operation regime of classical neural network (CNN).

Three-step quantum computation at criticality

The three-step quantum computation of the optical delay line coupling CIM is illustrated in Fig. 11a. The success rates to find either one of the two degenerate ground states in N = 16 one-dimensioned Ising spins with anti-ferromagnetic coupling are plotted as a function of normalized computation time t/t_c, where t_c is a round trip time.¹⁷ The pump rate is linearly increased from below threshold to above threshold. The success rate for a random guess is $P_{\rm s} = (1{\mathrm{/}}2)^{16} \simeq 10^{ - 5}$. After a few round trips, the success rates are increased by two orders of magnitude due to the formation of quantum noise correlation and this trend continues to t/t_c = 60, where the average photon number per DOPO pulse reaches n = 1 and the collective symmetry breaking is kicked-in. One ground state is selected, while the other is not. The probability to find the selected ground state increases exponentially while that to find the unselected ground state decreases exponentially. This exponential increase in the success rate is made possible by the bosonic final state stimulation and the associated cross-gain saturation. The quantum parallel search at below threshold, the collective symmetry breaking at threshold, and the quantum-to-classical crossover at above threshold are the three major steps in CIM.

Quantum entanglement

In the above example of an N = 16 one-dimensional ring consisting of anti-ferromagnetically coupled Ising spins, the ground state should have a negative correlation between the in-phase amplitudes X_i and X_i±1 of neighboring DOPOs. This observation motivates us to define an EPR-like operator,

$$\hat u_ + = \hat X_1 + \hat X_2 + \cdots + \hat X_{16},$$

(17)

$$\hat v_ - = \hat P_1 - \hat P_2 - \cdots - \hat P_{16},$$

(18)

to demonstrate the entanglement that exists in the DOPO network. Since $\hat u_ +$and $\hat v_ -$ commute, the simultaneous eigenstate for $\hat u_ +$ and $\hat v_ -$ should exist and such a state satisfies $\left\langle {\Delta \hat u_ + ^2} \right\rangle = \left\langle {\Delta \hat v_ - ^2} \right\rangle = 0$. On the other hand, if all DOPOs are independent (separable), it is shown that $\left\langle {\Delta \hat u_ + ^2} \right\rangle + \left\langle {\Delta \hat v_ - ^2} \right\rangle \ge 8$.¹⁷ This means that if $\left\langle {\Delta \hat u_ + ^2} \right\rangle + \left\langle {\Delta \hat v_ - ^2} \right\rangle < 8$, such a system has the entanglement due to mutual coupling. Figure 11b demonstrates that the optical delay line coupling CIM indeed establishes the quantum entanglement in the system over a wide range of the computation time (or pump rate).¹⁷

Quantum coherence

Figure 12a–d show the contour maps of the density matrix elements $\left\langle {X\left| {\hat \rho } \right|X^{\prime}} \right\rangle$ of the measurement-feedback CIM consisting of two DOPOs with anti-ferromagnetic coupling as a function of normalized computation time N = t/t_c.⁴⁰ In a high-Q cavity with a round trip loss of 0.1% (Fig. 12a), the DOPO state near threshold (t/t_c = 60) is indeed in a Schrödinger cat-like state. However, in a low-Q cavity with a round trip loss of 50% (Fig. 12d), the DOPO state evolves from a vacuum state at t/t_c = 0, a squeezed vacuum state at threshold to a coherent state at above threshold. The quantum states of the DOPO seem to stay always in Gaussian states in Fig. 12d, which is actually not the case as we will discuss in the next section. In any case, the quantum coherence $\left\langle {X\left| {\hat \rho } \right| - X} \right\rangle$, represented by the vertical slice, always exists in the CIM, no matter how large the cavity loss is.

Non-Gaussian wavepackets

The archetype bistable potential $V_b\left( {x_j} \right) = - \frac{1}{2}\alpha x_i^2 + \frac{1}{4}x_i^4$ at above threshold has a steep potential for large |x_j| values but a shallow potential barrier for small |x_j| values when the DOPO is pumped not far above the threshold, i.e. 0 < α$\lesssim$1. As a result of this asymmetric potential profile, the DOPO wave packet has a rapidly decaying tail for large |x_j| values and slowly decaying tail for small |x_j| values.

Figure 13a shows the time evolution of skewness $\left\langle {\Delta \hat X^3} \right\rangle$ of the two DOPO fields.⁴⁵ At below threshold, $\left\langle {\Delta \hat X^3} \right\rangle$ is close to zero, which is expected for a Gaussian state. At threshold, however, $\left\langle {\Delta \hat X^3} \right\rangle$ of the two DOPO fields depart to opposite directions, which indicates that the internal DOPO state has a slowly decaying tail toward a central potential and a rapidly decaying tail toward outer potential. Such a non-Gaussian state allows more frequent quantum tunneling before the final decision is made as demonstrated in the simulation result shown in Fig. 13b.⁴⁵ This switching behavior is a characteristic quantum parallel search of CIM, which is in sharp contrast to that of a hypothetical machine supporting Gaussian states (Fig. 13c).⁴⁵ In this case, the two DOPOs never switch their polarities due to the suppressed quantum tunneling.

Gottesman–Knill theorem

Not all quantum dynamics are difficult to simulate by classical digital computers. Some of representative and important quantum processes, including entangled state generation and purification, can be efficiently simulated by classical methods, so that such a quantum system alone is unlikely to achieve a computation power exceeding the current states of art in digital computing technology.

Gottesman and Knill were the first to point out this subtle distinction between classical and quantum information processing.⁴⁶ The statement of the Gottesman–Knill theorem can be summarized as follows:

If a quantum process starts with

1.
computational basis states, such as ground states $\left| 0 \right\rangle _1\left| 0 \right\rangle _2 \cdots \left| 0 \right\rangle _N$,
2.
employs a limited set (Clifford group) of unitary gates such as Hadamard gates, phase gates and controlled-NOT gates,and ends with
3.
projective measurements along the computational basis states $\left\{ {\left| 0 \right\rangle \left| 1 \right\rangle } \right\}$,

such a quantum process can be efficiently simulated by classical digital computers. A reader who is familiar with the famous Shor’s factoring algorithm knows that it requires implementation of a fractional phase which is not included in the above Clifford group constraint, so that the Shor algorithm is outside of the above limitation.

A continuous variable (harmonic oscillator) version of the above theorem was developed by Bartlett et al.⁴⁷ The statement of this theorem runs as follows:

If a quantum process with harmonic oscillators starts with

1.
Gaussian states, such as coherent states $\left| \alpha \right\rangle _1\left| \alpha \right\rangle _2 \cdots \left| \alpha \right\rangle _N$,
2.
employs a limited set of unitary gates such as squeezing gates and displacement gates,and ends with
3.
projective measurements of one quadrature amplitude (by homodyne detection) or two quadrature amplitudes (by heterodyne detection),

such a quantum process can be efficiently simulated by classical digital computers.

If we consider the above theorem against the CIM, we can identify the gain saturation (or two photon absorption) and single photon loss as the two essential quantum dynamics which make the CIM difficult to simulate efficiently by classical methods. The gain saturation term, $- x_j^3$, and the dissipation term, αx_j, in Eq. (12) produce a non-Gaussian state and provide the non-classical nature to CIM. Indeed, as shown in Fig. 13d, the success rate to find a ground state of N = 16 one-dimensional Ising spin problem based on the exact theory is higher than that based on the Gaussian approximation.⁴⁵

Summary

Optical neural networks can operate at the quantum limit (k_BT/ħω$\ll$1) at room temperatures and realize a quantum parallel search based on squeezed vacuum states. Such CIM are composed of the two constituent devices: quantum neurons and quantum synapses.

Quantum neurons are provided by degenerate optical parametric amplifiers/oscillators, which realize the quantum parallel search at below the oscillation threshold, the decision making at the threshold based on collective symmetry breaking, and the quantum-to-classical amplification of computational results at above the threshold using bosonic final state stimulation. Quantum synapses are provided by either direct coupling with optical delay lines or indirect coupling with a measurement-feedback circuit. They utilize distinct computational resources: quantum noise correlation (entanglement) in the optical delay line coupling CIM and measurement-induced wave packet reduction to non-Gaussian states in the measurement-feedback CIM.

CIM can solve various combinatorial optimization problems by mapping them on NP-hard Ising problems. Optical neural networks at the quantum limit (CIM) can outperform those at classical regime by exploiting superposition states for quantum parallel search of solutions. The gain saturation and single photon loss are the two indispensable elements to make CIM hard to simulate by classical methods. These two dissipation processes and associated fluctuations from external reservoirs are crucial resources to accelerate the search process.

CIM is based on single-mode squeezing and oscillation at degenerate frequency. This preferred mode of operation is made possible by employing a pump rate not far above the oscillation threshold and a narrow bandpass filter in a fiber ring cavity. Nonlinear dynamics of DOPO in various parameter ranges, including crossover from degenerate to non-degenerate modes, are presented in refs. 31, 48, 49.

Another type of optical neural networks, coherent XY machines, have been recently demonstrated using coupled lasers,^50,51 non-degenerate OPOs,⁵² and polaritons.⁵³ The Kuramoto model and the continuous optimization problems, in general, can be potentially implemented on these machines.

Data and code availability

All relevant data and codes are available from the authors upon request.

Change history

01 February 2018
A correction to this article has been published and is linked from the HTML version of this article.

References

Hopfield, J. J. & Tank, D. W. Computing with neural circuits: a model. Science 233, 625–633 (1986).
Article ADS MATH Google Scholar
Tank, D. & Hopfield, J. Simple ‘neural’ optimization networks: an A/D converter, signal decision circuit, and a linear programming circuit. IEEE Trans. Circ. Syst. 33, 533–541 (1986).
Article Google Scholar
Ercsey-Ravasz, M. & Toroczkai, Z. Optimization hardness as transient chaos in an analog approach to constraint satisfaction. Nat. Phys. 7, 966–970 (2011).
Article Google Scholar
Qian, L., Winfree, E. & Bruck, J. Neural network computation with DNA strand displacement cascades. Nature 475, 368–372 (2011).
Article Google Scholar
Farhi, E. et al. A quantum adiabatic evolution algorithm applied to random instances of an NP-complete problem. Science 292, 472–475 (2001).
Article ADS MathSciNet MATH Google Scholar
Kadowaki, T. & Nishimori, H. Quantum annealing in the transverse Ising model. Phys. Rev. E 58, 5355–5363 (1998).
Article ADS Google Scholar
Brooke, J., Bitko, D., Rosenbaum, T. F. & Aeppli, G. Quantum annealing of a disordered magnet. Science 284, 779–781 (1999).
Article ADS Google Scholar
Johnson, M. W. et al. Quantum annealing with manufactured spins. Nature 473, 194–198 (2011).
Article ADS Google Scholar
Rønnow, T. F. et al. Defining and detecting quantum speedup. Science 345, 420–424 (2014).
Article ADS Google Scholar
Boixo, S. et al. Evidence for quantum annealing with more than one hundred qubits. Nat. Phys. 10, 218–224 (2014).
Article Google Scholar
Barends, R. et al. Digitized adiabatic quantum computing with a superconducting circuit. Nature 534, 222–226 (2016).
Article ADS Google Scholar
Bunyk, P. I. et al. Architectural considerations in the design of a superconducting quantum annealing processor. IEEE Trans. Appl. Supercond. 24, 1700110 (2014).
Article Google Scholar
Rieffel, E. G. et al. A case study in programming a quantum annealer for hard operational planning problems. Quant. Inform. Process. 14, 1–36 (2015).
Article ADS MATH Google Scholar
Wang, Z., Marandi, A., Wen, K., Byer, R. L. & Yamamoto, Y. Coherent Ising machine based on degenerate optical parametric oscillators. Phys. Rev. A 88, 063853 (2013).
Article ADS Google Scholar
Marandi, A., Wang, Z., Takata, K., Byer, R. L. & Yamamoto, Y. Network of time-multiplexed optical parametric oscillators as a coherent Ising machine. Nat. Photonics 8, 937–942 (2014).
Article ADS Google Scholar
Takata, K., Marandi, A. & Yamamoto, Y. Quantum correlation in degenerate optical parametric oscillators with mutual injections. Phys. Rev. A 92, 043821 (2015).
Article ADS Google Scholar
Maruo, D., Utsunomiya, S. & Yamamoto, Y. Truncated Wigner theory of coherent Ising machines based on degenerate optical parametric oscillator network. Phys. Scr. 91, 083010 (2016).
Article ADS Google Scholar
Barahona, F. On the computational complexity of Ising spin glass models. J. Phys. Math. Gen. 15, 3241–3253 (1982).
Article ADS MathSciNet Google Scholar
Istrail, S. Proc. Thirty-second Annual ACM Symposium on Theory of Computing, STOC, 87–96 (2000).
Haribara, Y., Utsunomiya, S. & Yamamoto, Y. Computational principle and performance evaluation of coherent Ising machine based on degenerate optical parametric oscillator network. Entropy 18, 151 (2016).
Article ADS Google Scholar
Lucas, A. Ising formulations of many NP problems. Front. Phys. 2, 5 (2014).
Article Google Scholar
Takata, K. et al. A 16-bit coherent Ising machine for one-dimensional ring and cubic graph problems. Sci. Rep. 6, 34089 (2016).
Article ADS Google Scholar
Inagaki, T. et al. Large-scale Ising spin network based on degenerate optical parametric oscillators. Nat. Photonics 10, 415–419 (2016).
Article ADS Google Scholar
McMahon, P. L. et al. A fully programmable 100-spin coherent Ising machine with all-to-all connections. Science 354, 614 (2016).
Article ADS MathSciNet Google Scholar
Inagaki, T. et al. A coherent Ising machine for 2000-node optimization problems. Science 354, 603 (2016).
Article ADS Google Scholar
Nigg, S. E., Lörch, N. & Tiwari, R. P. Robust quantum optimizer with full connectivity. Sci. Adv. 3, e1602273 (2017).
Article ADS Google Scholar
Puri, S., Andersen, C. K., Grimsmo, A. L. & Blais, A. Quantum annealing with all-to-all connected nonlinear oscillators. Nat. Commun. 8, 15785 (2017).
Article ADS Google Scholar
Walls, D. F. & Milburn, G. J. Quantum Optics (Springer, Berlin-Heidelberg, 2008).
Byer, R. L., Oshman, M. K., Young, J. F. & Harris, S. E. Visible CW parametric oscillator. Appl. Phys. Lett. 13, 109 (1968).
Article ADS Google Scholar
Serkland, D. K., Fejer, M. M., Byer, R. L. & Yamamoto, Y. Squeezing in a quasi-phase-matched LiNbO_3 waveguide. Opt. Lett. 20, 1649 (1995).
Article ADS Google Scholar
Hamerly, R. et al. Reduced models and design principles for half-harmonic generation in synchronously pumped optical parametric oscillators. Phys. Rev. A 94, 063809 (2016).
Article ADS Google Scholar
Nambu, Y. Nobel Lecture: Spontaneous symmetry breaking in particle physics: a case of cross fertilization. Rev. Mod. Phys. 81, 1015 (2009).
Article ADS Google Scholar
Goto, H. Bifurcation-based adiabatic quantum computation with a nonlinear oscillator network. Sci. Rep. 6, 21686 (2016).
Article ADS Google Scholar
Goto, H. Universal quantum computation with a nonlinear oscillator network. Phys. Rev. A 93, 050301 (R) (2016).
Article ADS MathSciNet Google Scholar
Goto, H., Lin, Z. & Nakamura, Y. Dissipative quantum bifurcation machine: quantum heating of coupled nonlinear oscillators. arXiv:1707.00986 [quant-ph] (2017).
Mandel, L. & Wolf, E. Optical Coherence and Quantum Optics (Cambridge University Press, 1995).
Wolinsky, M. & Carmichael, H. J. Quantum noise in the parametric oscillator: from squeezed states to coherent-state superpositions. Phys. Rev. Lett. 60, 1836 (1988).
Article ADS Google Scholar
Krippner, L., Munro, W. J. & Reid, M. D. Transient macroscopic quantum superposition states in degenerate parametric oscillation: calculations in the large-quantum-noise limit using the positive P representation. Phys. Rev. A 50, 4330 (1994).
Article ADS Google Scholar
Drummond, P. D., Dechoum, K. & Chaturvedi, S. Critical quantum fluctuations in the degenerate parametric oscillator. Phys. Rev. A 65, 033806 (2002).
Article ADS Google Scholar
Yamamura, A., Aihara, K. & Yamamoto, Y. Quantum model for coherent Ising machines: Discrete-time measurement feedback formulation. Phys. Rev. A 96, 053834 (2017).
Ising, E. Beitrag zur theorie des ferromagnetismus. Z. Phys. 31, 253–258 (1925).
Article ADS Google Scholar
Leleu, T., Yamamoto, Y., Utsunomiya, S. & Aihara, K. Combinatorial optimization using dynamical phase transitions in driven-dissipative systems. Phys. Rev. E 95, 022118 (2017).
Article ADS Google Scholar
Kinsler, P. & Drummond, P. D. Quantum dynamics of the parametric oscillator. Phys. Rev. A 43, 6194 (1991).
Article ADS Google Scholar
Haribara, Y., Ishikawa, H., Utsunomiya, S., Aihara, K. & Yamamoto, Y. Performance evaluation of coherent Ising machines against classical neural networks. Quant. Sci. Tech. 2, 044002 (2017).
Shoji, T., Aihara, K. & Yamamoto, Y. Quantum model for coherent Ising machine: Stochastic differential equations with replicator dynamics. Phys. Rev. A 96, 053833 (2017).
Gottesman, D. The Heisenberg representation of quantum computers. arXiv:quant-ph/9807006 (1998).
Bartlett, S. D., Sanders, B. C., Braunstein, S. L. & Nemoto, K. Efficient classical simulation of continuous variable quantum information processes. Phys. Rev. Lett. 88, 097904 (2002).
Article ADS Google Scholar
Hamerly, R. et al. Topological defect formation in 1D and 2D spin chains realized by network of optical parametric oscillators. Int. J. Mod. Phys. B 30, 1630014 (2016).
Article ADS MathSciNet MATH Google Scholar
Roslund, J., Medeiros de Araújo, R., Jiang, S., Fabre, C. & Treps, N. Wavelength-multiplexed quantum networks with ultrafast frequency combs. Nature Photonics 8, 109 (2014).
Nixon, M., Ronen, E., Friesem, A. A. & Davidson, N. Observing geometric frustration with thousands of coupled lasers. Phys. Rev. Lett. 110, 184102 (2013).
Article ADS Google Scholar
Tamate, S., Yamamoto, Y., Marandi, A. McMahon, P. & Utsunomiya, S. Simulating the classical XY model with a laser network. arXiv:1608.00358 [physics.optics] (2016).
Takeda, Y. et al. Boltzmann sampling for an XY model using a non-degenerate optical parametric oscillator network. Quant. Sci. Tech. 3, 014004 (2018).
Berloff, N. G., Kalinin, K., Silva, M., Langbein, W. & Lagoudakis P. G. Realizing the XY Hamiltonian in polariton simulators. arXiv:1607.06065 [cond-mat.mes-hall] (2016).

Download references

Acknowledgements

The authors wish to thank R. L. Byer, H. Mabuchi, R. D. Drummond, P. McMahon, A. Marandi, C. Langrock, T. Inagaki, T. Honjo, K. Inaba, R. Hamerly, Y. Haribara, D. Maruo, K. Takata, Z. Wang, A. Yamamura, and T. Shoji for their useful discussions. This research was supported by ImPACT Program of Council for Science, Technology and Innovation (Cabinet Office, Government of Japan).

Author information

Authors and Affiliations

Japan Science and Technology Agency, K’s Gobancho 7, Gobancho, Chiyoda-ku, Tokyo, 102-0076, Japan
Yoshihisa Yamamoto & Satoshi Kako
E. L. Ginzton Laboratory, Stanford University, Stanford, CA, 94305, USA
Yoshihisa Yamamoto & Martin Fejer
Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo, 153-8505, Japan
Kazuyuki Aihara & Timothee Leleu
National Institute of Informatics, 2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo, 101-8403, Japan
Ken-ichi Kawarabayashi & Satoshi Kako
Division of Electrical, Electronic and Information Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
Kyo Inoue
NTT Basic Research Laboratories, NTT Corporation, 3-1 Morinosato Wakamiya, Atsugi, Kanagawa, 243-0198, Japan
Hiroki Takesue

Authors

Yoshihisa Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Kazuyuki Aihara
View author publications
You can also search for this author in PubMed Google Scholar
Timothee Leleu
View author publications
You can also search for this author in PubMed Google Scholar
Ken-ichi Kawarabayashi
View author publications
You can also search for this author in PubMed Google Scholar
Satoshi Kako
View author publications
You can also search for this author in PubMed Google Scholar
Martin Fejer
View author publications
You can also search for this author in PubMed Google Scholar
Kyo Inoue
View author publications
You can also search for this author in PubMed Google Scholar
Hiroki Takesue
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.Y. wrote the manuscript, K.A., K.K., T.L., S.K., M.F., K.I., and H.T. critically read and revised the manuscript.

Corresponding author

Correspondence to Yoshihisa Yamamoto.

Ethics declarations

Competing interests

The authors declare that they have no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Change history: A correction to this article has been published and is linked from the HTML version of this article.

A correction to this article is available online at https://doi.org/10.1038/s41534-018-0061-7.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yamamoto, Y., Aihara, K., Leleu, T. et al. Coherent Ising machines—optical neural networks operating at the quantum limit. npj Quantum Inf 3, 49 (2017). https://doi.org/10.1038/s41534-017-0048-9

Download citation

Received: 30 July 2017
Revised: 31 October 2017
Accepted: 03 November 2017
Published: 15 December 2017
DOI: https://doi.org/10.1038/s41534-017-0048-9

This article is cited by

Quantum computing for several AGV scheduling models
- Liang Tang
- Chao Yang
- Yiyun Guo
Scientific Reports (2024)
Solving Boltzmann optimization problems with deep learning
- Fiona Knoll
- John Daly
- Jess Meyer
npj Unconventional Computing (2024)
A benchmarking study of quantum algorithms for combinatorial optimization
- Krishanu Sankar
- Artur Scherer
- Yoshihisa Yamamoto
npj Quantum Information (2024)
Training an Ising machine with equilibrium propagation
- Jérémie Laydevant
- Danijela Marković
- Julie Grollier
Nature Communications (2024)
A spinwave Ising machine
- Artem Litvinenko
- Roman Khymyn
- Johan Åkerman
Communications Physics (2023)