Abstract
In this article, we will introduce the basic concept and the quantum feature of a novel computing system, coherent Ising machines, and describe their theoretical and experimental performance. We start with the discussion how to construct such physical devices as the quantum analog of classical neuron and synapse, and end with the performance comparison against various classical neural networks implemented in CPU and supercomputers.
Introduction
Optimization problems with discrete and continuous variables, some of which belong to NPhard or NPcomplete class in complexity theory, are ubiquitous in numerous important areas, including operations and scheduling, drug discovery, wireless communications, finance, integrated circuit design, compressed sensing, and machine learning. Despite rapid advances in both algorithms and digital computer technology, even modest sized NPhard/NPcomplete problems that arise in practice may be very difficult to solve on modern digital computers. One alternative to current vonNeumanncomputerbased methods for solving such problems is a neural network realized with analog electronic circuits and other systems.^{1,2,3,4} Another approach of contemporary interest is the adiabatic quantum computation (AQC)^{5} and quantum annealing (QA).^{6,7} Sophisticated AQC/QA devices are already under development,^{8,9,10,11} but providing dense connectivity between qubits remains a major challenge^{12} with serious implications for the efficiency of AQC/QA systems.^{13}
Networks of degenerate optical parametric oscillators (DOPOs) are an alternative physical system, with an unconventional operating mechanism,^{14,15,16,17} for solving the Ising problem,^{18,19,20} and by extension many other combinatorial optimization problems.^{21} Formally, the Nspin Ising problem is to find the configuration of spins \({\mathrm{\sigma }}_i \in \left\{ {  1, + 1} \right\}\,\left( {i = 1, \ldots ,N} \right)\) that minimizes the energy function \({\it H} =  \mathop {\sum}\nolimits_{1 \le i < j \le N} {J_{ij}\sigma _i\sigma _j}  \mathop {\sum}\nolimits_{1 \le i \le N} {h_i\sigma _i}\), where the particular problem instance being solved is specified by the N × N matrix J (with elements J_{ ij }) and the Ndimensional vector h (with elements h_{ i }).
In DOPO Ising machines, the upspin and downspin are represented by the positive and negative inphase amplitudes, i.e. \(\sigma _i = X_i{\mathrm{/}}\left {X_i} \right\). Then, the energy function H is mapped to the effective loss, photon decay rate, of the DOPO network by implementing the mutual coupling proportional to J_{ ij } between two DOPOs and injecting the dc field proportional to h_{ i } to individual DOPO.^{14} The ground state of the Ising Hamiltonian corresponds to an oscillation mode with the minimum network loss. At the pump rate below an oscillation threshold, every DOPO is in a squeezed vacuum state which is interpreted as linear superposition of σ_{ i } = +1 (positive amplitude or upspin) and σ_{ i } = −1 (negative amplitude or downspin) so that the probability amplitudes for 2^{N} spin configurations are all identical, as shown in Fig. 1a. At the threshold pump rate, the ground state with a minimum loss reaches a threshold first and oscillates as a single stable mode, which triggers the stimulated emission of photons and crossgain saturation for all the other modes, as shown in Fig. 1b. Thus, the solution of the Ising problem can be found by simply detecting the single oscillation mode.^{14}
The input states into all DOPOs are vacuum states, \(\left 0 \right\rangle _1\left 0 \right\rangle _2, \cdots \left 0 \right\rangle _N\). The essential quantum operations are the collective (not individual) symmetry breaking at DOPO threshold, which translates all DOPOs to either positive amplitude \(\left \alpha \right\rangle\) or negative amplitude \(\left {  \alpha } \right\rangle\) coherent state in a correlated way. Finally, the computational results are read out by optical homodyne detectors. A block diagram of the CIM operation is shown in Fig. 1c.
We have realized such a system with a scalable architecture that uses a measurementfeedback circuit in place of optical delay lines, which were used in earlier experiments.^{15,22,23} Our measurementfeedbackbased DOPO Ising machine allows connections between any spin and any other spin, and is fully programmable.^{24,25} Fully connected superconducting Ising machines have been also proposed recently.^{26,27}
Classical vs. quantum neural networks
Nonlinear devices that have a characteristic input–output relation shown in Fig. 2a represent a typical gain function of neurons in classical neural networks (CNN).^{1,2} When an input signal level is weak, the neuron linearly amplifies this input signal and thus compensates for unavoidable linear loss in the network. When the input signal level exceeds a certain threshold value a_{th}, however, the output signal level is clamped at a constant value b_{ s }. This nonlinear input–output relation is essential for CNN to find a stable operating point which manifests a solution for a given mathematical problem. In this model, each neuronal state is expressed by a continuous variable a_{ i }, which obeys a continuous time evolution governed by
where the first term of R.H.S. of Eq. (1) expresses the linear loss and the second term describes selffeedback with the nonlinear gain function f shown in Fig. 2a. Note that a time is normalized by the decay rate of a neuron excitation. The third term represents the mutual coupling among neurons and the interaction potential V implements a given mathematical problem. We emphasize that a simultaneous mutual coupling between neurons without introducing undesired instability or oscillation is made possible by the gradient descent character of the third term. The nonlinear gain function f is often implemented as a coupling coefficient for the third term of mutual coupling instead of the second term of selffeedback.^{1,2} An essential feature of the CNN is, however, independent of the location of the nonlinear gain function. Finally, the fourth term, g_{ i }, expresses a noise driving force associated with the decay and the nonlinear gain for a neuron excitation.
The classical picture behind Eq. (1) is that each neuron has an unpredictable but definite value a_{ i } at a given time even though the network is randomly driven by internal and/or external noise. This is a correct physical picture of a neural network when thermal noise is much larger than quantum zeropoint noise i.e. k_{B}T\(\gg\)ħω. However, if the opposite is true, i.e. ħω\(\gg\)k_{B}T, the interpretation of Eq. (1) must be altered. It should be considered as a cnumber stochastic differential equation (CSDE), which is derived from and fully equivalent to the density operator master equation for the quantum neural network.^{16,17} Now, the variable a_{ i } does not represent any physical quantity of the neuron but rather it is merely an eigenvalue (α or β) of the coherent states used in the quasiprobability density to expand the density operator, such as a positive P(α,β) representation^{16} or Wigner representation.^{17} In order to compute the expectation value of any physical quantity of the neuron, we must first reconstruct the density operator by an ensemble of many trajectories of a_{ i } generated by the CSDE. Whenever ħω\(\gg\)k_{B}T, we must take this quantum approach to reach a correct answer.
A quantum neuron (or quantum neural network) is characterized by the following three properties and in this way, it is distinct from a classical counterpart:

1.
A quantum neuron is prepared in a superposition state of different neural excitations so that a quantum parallel search can be implemented.

2.
A network of quantum neurons makes a decision to reach a final computational result through correlated and collective symmetry breaking at a critical point of phase transition.

3.
A network of quantum neurons amplifies the above quantum solution to a classical signal via bosonic final state stimulation.
As shown in the subsequent sections, we will show the above three properties are indeed realized by DOPO and their networks.
The most important advantage of the DOPObased quantum neural network compared to its classical counterpart is that each neuron (DOPO) is prepared in linear superposition of different inphase amplitude eigenstates, i.e. squeezed vacuum state and so a quantum parallel search can be implemented to the whole optimization process. This is the crucial computational resource for CIM.
Quantum neurons
Degenerate optical parametric amplifiers and oscillators
Any phasesensitive amplifier (PSA) has an input–output relation similar to that shown in Fig. 2a due to its inherent gain saturation effect. We focus here on the degenerate optical parametric amplifier (DOPA) based on the two photon emission process shown in Fig. 2b. A particular device is composed of a secondorder nonlinear crystal placed in an optical cavity. The nonlinear crystal absorbs one pump photon at a frequency 2ω_{s} and simultaneously emits two signal photons at a frequency ω_{s}. A pertinent interaction Hamiltonian^{28} is expressed by
Here \(\hat a_{\rm s}( {\hat a_{\rm s}^\dagger } )\) and \(\hat a_{\rm p}( {\hat a_{\rm p}^\dagger } )\) are the annihilation (creation) operators for the signal and pump fields, and κ is a parametric coupling constant. The simultaneously generated two signal photons are a conjugate pair in a sense that they have a positive correlation in the inphase amplitudes \(\hat X( { = ( {\hat a_{\rm s} + \hat a_{\rm s}^\dagger } ){\mathrm{/}}2} )\) and a negative correlation in the quadraturephase amplitudes \(\hat P( { = ( {\hat a_{\rm s}  \hat a_{\rm s}^\dagger } ){\mathrm{/}}2i} )\). As a result of such positive and negative correlations in \(\hat X\) and \(\hat P\), constructive and destructive interference happens between the two signal photons along the Xaxis and Paxis, respectively. As a result of such quantum interference, if a DOPA is input by an external vacuum state (zeropoint fluctuation), the zeropoint fluctuation is amplified and deamplified along the Xaxis and Paxis, respectively, as shown in Fig. 2c. The resulting state is called a squeezed vacuum state, which is a minimum uncertainty wave packet and satisfies the Heisenberg uncertainty principle with equality, \(\left\langle {\Delta \hat X^2} \right\rangle \left\langle {\Delta \hat P^2} \right\rangle = 1{\mathrm{/}}16\), just as the vacuum state.
The phasesensitive amplification/deamplification mentioned above is not a rare phenomenon in nature. One classical example is a swing driven by a person as shown in Fig. 3a, where a person (corresponding to the pump in DOPA) completes a full cycle (up–down–up) while a swing (corresponding to the signal in DOPA) makes only a half cycle (lefttoright). Note that the phase of the pump and that of the signal should be mutually locked in order to realize an amplitude amplification process, as shown in Fig. 3b. This corresponds to the situation that the inphase amplitude \(\hat X\) is amplified. You can easily imagine from your childhood memory what you did in order to stop the swing when you must go home. You stood up at the center of the swing period and crouched at both ends. Then, the swing (signal) amplitude is attenuated. This corresponds to the situation that the quadraturephase amplitude \(\hat P\) is deamplified.
The OPO has a long history of development at Stanford University.^{29} In the coherent Ising machine (CIM), a periodically polled lithiumniobate (PPLN) waveguide device is used as such a PSA for signal pulses. The first experimental demonstration of deamplification (squeezing) and amplification (antisqueezing) for optical pulses in vacuum states using this particular device was reported in 1995.^{30} Figure 3c shows the suppressed quadraturephase amplitude noise (squeezing) and enhanced inphase amplitude noise (antisqueezing) by the PPLN waveguide DOPA. It is a general property of the squeezing devices that the degree of squeezing (vacuum noise deamplification) is reduced by experimental system’s imperfections, in particular, a linear optical loss, but that the degree of antisqueezing (vacuum noise amplification) is not reduced so much by linear optical loss. Suppose a PSA output has a 20 dB of squeezing, \(4\left\langle {\Delta \hat P_{{\mathrm{in}}}^2} \right\rangle = 10^{  2}\), for the quadraturephase amplitude and 20 dB of antisqueezing, \(4\left\langle {\Delta \hat X_{{\mathrm{in}}}^2} \right\rangle = 10^2\), for the inphase amplitude. An optical linear loss L degrades the degree of squeezing and antisqueezing according to
If there is a 3 dB of linear loss after the PSA, the degree of squeezing is only ~3 dB while the degree of antisqueezing is still ~17 dB. Note the Ising spin is represented by the inphase amplitude rather than the quadraturephase amplitude. Therefore, the quantum parallel search in the CIM is based on the antisqueezed quantum noise of the inphase amplitude rather than the squeezed quantum noise of the quadraturephase amplitude, so that the operation of CIM is inherently robust against an optical loss.
The input–output relation of the DOPA for an increasing signal power above the saturation level approximately reproduces the nonlinear response function f(a) shown in Fig. 2a.^{31} A physical mechanism behind this gain saturation is the depletion of the pump power, which triggers a reverse energy flow, i.e. two signal photons are absorbed simultaneously in the nonlinear crystal to generate one pump photon. When the input signal power becomes sufficiently large, the reverse energy flow from the signal to the pump is switched on and the linear amplification process must stop. In our case of constructing quantum neurons, a related but slightly different nonlinear effect is employed to amplify a solution state at a microscopic level to that at a macroscopic level, which we will explain next.
If the parametric amplifier gain exceeds the signal decay rate from the cavity, the system can sustain a steadystate field by creating a finite average amplitude along the Xdirection. The center position of a quantum wave packet in the (X,P) phase space is no more \(X = P = 0\) but there emerges a nonzero \(X\) value, as shown in Fig. 2d. This is the oscillation threshold point. The device pumped at above the oscillation threshold is called a DOPO. The probability for creating a positive \(X\) value and negative \(X\) value is randomly selected by 50–50% probability, which is known as spontaneous symmetry breaking and is an ubiquitous feature for any secondorder phase transition phenomena.^{32} In the language of dynamical systems theory, the DOPO threshold represents the supercritical pitchfork bifurcation. The two stable states with positive and negative \(X\) values correspond to the firing and nonfiring states of a classical neuron or upspin and downspin states of an Ising problem. A similar mechanism, dissipative quantum bifurcation machine, was discussed recently in the context of Josephson junction qubits.^{33,34,35} When the input signal level to the DOPA is varied while the pump rate is fixed at below the oscillation threshold, the output signal level is first linearly amplified and then eventually reaches the oscillation state with a clamped amplitude due to gain saturation. In this way, the nonlinear input/output relation shown in Fig. 2a is realized in the transition from DOPA to DOPO in our quantum neurons.
Why are DOPOs referred to as quantum neurons and how are they different from classical neurons? We will present the answers to these important questions in the next section.
Linear superposition
A set of photon number eigenstates \(\left n \right\rangle\) can expand an arbitrary state of the field as an orthonormal set. The wavefunction of a squeezed vacuum state (see Fig. 2c) can be mathematically constructed as a superposition of photon number eigenstates with even eigenvalues:
A simple physical reasoning behind Eq. (4) is explained as follows: The photon number of an intense pump field exhibits large quantum uncertainty, typically on the order of \(\left\langle {\Delta \hat n^2} \right\rangle \sim \left\langle {\hat n} \right\rangle\) for a coherent pump field. Therefore, we cannot extract whichpath information, even in principle, whether the number of absorbed pump photons in the nonlinear crystal is zero, one, two … at a given time. Because one pump photon is converted into two signal photons, those cases correspond to the output signal field in \(\left 0 \right\rangle ,\,\left 2 \right\rangle ,\,\left 4 \right\rangle ,\, \cdots\) photon number eigenstates. Because of the lack of whichpath information, the proper expression for the quantum state of the output signal field of DOPA must be a superposition of those states. Except for an irrelevant phase factor, the probability amplitudes c_{0},c_{2},c_{4},\(\cdots\) have an identical phase, for instance, positive real numbers.^{36} In the (X, P) phase space, the different photon number eigenstates constructively interfere with each other along the Xaxis, while they destructively interfere with each other along the Paxis.
Alternatively, a squeezed vacuum state can be mathematically constructed as a superposition of quadrature amplitude eigenstates \(\left X \right\rangle\), which are the eigenstates of the Hermitian operator \(\hat X\):
It is not difficult to show that the different inphase amplitude eigenstates constructively interfere with each other in the small Pregions centered at \(P = 0\), while they destructively interfere with each other in the large Pregions. Therefore, more antisqueezing (enhanced quantum noise) along the Xaxis realizes more squeezing (reduced quantum noise) along the Paxis.
It is worth pointing out that a statistical mixture of inphase amplitude eigenstates
instead of Eq. (5), covers the same range of the eigenvalues X but requires an infinite amount of energy to realize, since the localization of the state along the Paxis near \(P = 0\) is now impossible due to the lack of phase coherence between different \(\left X \right\rangle\) eigenstates. The infinite uncertainty along the Paxis means an infinite energy. A squeezed vacuum state, represented by Eq. (5) and produced by the DOPA, allows a quantum parallel search with a finite energy, while a classical mixed state, represented by Eq. (6), does not allow such a quantum parallel search.
A particularly unique quantum feature of the DOPA/DOPO is that the aforementioned superposition survives not only below the threshold but also above the threshold to some extent if the cavity loss is small. At well above the oscillation threshold, the DOPO produces either 0phase or πphase coherent field as shown in Fig. 2d. However, a DOPO at just above the oscillation threshold maintains the superposition of 0phase and πphase states, in spite of finite photon leakage from the resonator.^{37,38,39} This is possible because a hypothetical measurement performed for a leaked signal field cannot identify which phase the DOPO selects due to the enhanced quantum noise along the Xaxis and also small separation between the two states. The antisqueezed inphase amplitude noise realizes a socalled “quantum erasure” for this hypothetical whichpath measurement.
The evidence for the above statement is shown in Fig. 4a.^{40} Here we consider a CIM consisting of synchronously pumped DOPO pulses circulating in a fiber ring cavity.^{24,25} The density matrix elements \(\left\langle {\left. X \right\hat \rho \left {X^\prime} \right.} \right\rangle\) in the (X+X′, X−X′) phase space for two outofphase (antiferromagnetically) coupled DOPOs are plotted, where the pump rate is just above the threshold. In Fig. 4a, the inphase amplitude probability distribution \(\left\langle {\left. X \right\hat \rho \left X \right.} \right\rangle\), which is given as the horizontal slice at X−X′ = 0, and the quantum coherence \(\left\langle {\left. X \right\hat \rho \left {  X} \right.} \right\rangle\) between the two inphase amplitude eigenstates \(\left X \right\rangle\) and \(\left {  X} \right\rangle\), which is given as the vertical slice at X+X' = 0, clearly show macroscopically separated coherent states \(\left \alpha \right\rangle\) and \(\left {  \alpha } \right\rangle\) maintain quantum coherence.^{40} Figure 4b compares the density matrix elements \(\left\langle {\left. X \right\hat \rho \left {X^\prime} \right.} \right\rangle\) for a superposition state, \(1{\mathrm{/}}\sqrt 2 \left( {\left \alpha \right\rangle + \left {  \alpha } \right\rangle } \right)\), and a mixed state, \(\hat \rho = 1/2\left( {\left \alpha \right\rangle \left\langle \alpha \right + \left {  \alpha } \right\rangle \left\langle {  \alpha } \right} \right)\), in the same (X+X′, X−X′) phase space, respectively. We can conclude from Fig. 4a that the two DOPOs are approximately in Schrödinger’s cat states and yet their centers of gravity are negatively correlated due to outofphase coupling. That is, the DOPO1 (above) is biased toward a positive amplitude, while the DOPO2 (below) is biased toward a negative amplitude. Figure 4c shows the Wigner function of the DOPO state. The oscillatory behavior and negative values of the Wigner function are another manifestation of the quantum coherence between \(\left \alpha \right\rangle\) and \(\left {  \alpha } \right\rangle\) states.^{40} Such a highly nonclassical feature is quickly destroyed by increasing a cavity loss. However, it should be emphasized that the superposition of \(\left X \right\rangle\) eigenstates with positive and negative eigenvalues in a squeezed vacuum state, represented by Eq. (4), is rather robust against a cavity loss as mentioned above so that the resulting quantum parallel search survive in a lowQ cavity.
Amplitude and phase error correction capability given by phasesensitive amplification
Fluctuations are induced in the DOPO fields by the external phase and amplitude noise injection. Both the amplitude and phase of the DOPO field are continuous variables so that the standard error detection and error correction techniques cannot be applied. Fortunately, the phasesensitive amplification/deamplification mechanism of DOPO can stabilize the phase to either 0 or π as long as the phase error is small compared to ±π/2.^{15} Moreover, the amplitude error can be also suppressed by the gain saturation mechanism of the DOPO. If the signal amplitude is increased to above the steadystate value, the pump amplitude is more strongly depleted which, in turn, recovers the steadystate amplitude through a reduced parametric gain. The opposite is true when the signal amplitude is decreased to below the steadystate value. The deterministic amplitude decay due to optical linear loss can be compensated for by the average gain. In this way, the output amplitude is stabilized through the balance between the linear cavity loss and the saturated gain in DOPO, while the phase is stabilized by the phasesensitive deamplification of DOPO. This is schematically shown in Fig. 5.
The DOPO provides a unique opportunity as a robust analog memory against external noise injection. We can stably store the analog information at quantum limited accuracy with the abovementioned mechanisms.
Quantum synapses
We wish to implement a target Hamiltonian as a cost function (effective loss) of the network consisting of quantum neurons. Our target Hamiltonian is the following Ising model:^{41}
where the Ising spin σ_{ i } takes either +1 (upspin) or −1 (downspin). As described above, we wish to represent σ_{ i } = 1 or σ_{ i } = −1 by the positive or negative inphase amplitude, which corresponds to 0phase or πphase oscillation, respectively (Fig. 2d). The Ising coupling J_{ ij } and the local field h_{ i } take continuous (real) values, the magnitude of which are determined by mapping a given combinatorial optimization problem on the Ising model.^{13,21} The threedimensional Ising model and the twodimensional Ising model with local fields belong to the NPhard class in complexity theory.^{18} Therefore, you can imagine that many hard problems in the real world can be solved through the Ising model.^{13,21} In order to implement the cost function described in Eq. (7) as the effective loss of the DOPO network, pairs of DOPOs must be coupled with the coupling constant J_{ ij } and also the constant optical field h_{ i } must be injected into each DOPO.^{14}
Optical delay line coupling scheme
In the configuration shown in Fig. 6a, N independent DOPOs are simultaneously realized as N optical pulses circulating in a single fiber ring cavity with an internal PSA which is driven externally by pump pulse trains. Figure 6a shows one experimental scheme to implement the Ising coupling J_{ ij } in the DOPO network, where a part of each DOPO pulse circulating in a fiber ring resonator is pickedoff at every round trip by the output coupler, amplified by an external PSA, split into multiple optical delay lines including intensity and phase modulators and then injected back to the target DOPO pulse at appropriate timing.^{15,22,23}
The external PSA is provided by another DOPA and boosts the inphase amplitude \(\hat X_{}^{}\) of a pickedoff pulse before it is attenuated by the beam splitter and modulator loss. Using this technique, we can implement a relatively large Ising coupling constant (J_{ ij } ≲ 1) even if the system size is fairly large (N ≳ 10^{4}). Using such N−1 optical delay lines, any (jth) pulse can be connected to any other (ith) pulse with a coupling constant J_{ ij }. Please note that asymmetric coupling J_{ ij } ≠ J_{ ji } can be easily implemented in this scheme and that ~N^{2} mutual coupling constants can be realized by (N−1) optical delay lines. Such an alloptical coupling scheme has been experimentally demonstrated for N = 4 and 16 pulses using freespace optics with a 4.8 m ring cavity and 250 MHz and 1 GHz repetition frequencies, respectively^{15,22} and for N = 10^{4} pulses using a planar lightwave circuit and fiber optics with 1 km fiber ring cavity and 2 GHz repetition frequency.^{23} In the latter system, for example, the optical fiber length is 1 km so that a round trip time is 5 μs while the pulse interval is 0.5 ns, which leads to N = 10^{4} DOPO pulses inside a single ring cavity.
This optical delay line coupling machine is capable of producing the quantum noise correlation among the internal DOPO pluses.^{16,17} In the section Quantum features of CIM, we will discuss this point in detail using the two measures for evaluating quantum noise correlation: quantum entanglement and discord.
Measurementfeedback coupling scheme
An alternative coupling scheme to implement the Ising coupling J_{ ij } is shown in Fig. 6b.^{24,25} Instead of directly connecting the DOPO pulses with optical delay lines, we can measure approximately the inphase amplitude of the internal DOPO pulse by the optical balanced homodyne detectors. If the inferred inphase amplitude of the jth DOPO pulse is represented by \(\tilde X_j\), the feedback pulse to the ith DOPO pulse should have an inphase amplitude proportional to \(\mathop {\sum }\nolimits_j J_{ij}\tilde X_j\). The complicated task of the synchronous computation of the vectorvector multiplication between J_{ ij } and \(\tilde X_j\), which must be completed in the pulse interval of 100 ps – 1 ns, is achieved by a single measurementfeedback circuit consisting of an analogtodigital converter, a field programmable gate array (FPGA), a digitaltoanalog converter and optical amplitude/phase modulators.^{24,25} The feedback pulse used as an input to the optical modulator and the local oscillator pulse used for optical homodyne detection are both provided by a part of the pump laser output, as shown in Fig. 6b.
Such a measurementfeedback coupling scheme (Fig. 6b) is equivalent to an optical delay line coupling scheme (Fig. 6a) except for the following advantage/disadvantage. The advantage of the measurementfeedback scheme is that alltoall coupling of the order of ~N^{2} connections can be implemented by a single measurementfeedback circuit, so that the daunting task of constructing N−1 optical delay lines and stabilizing their delay lengths (or optical phase) with an error much less than the optical wavelength can be avoided. In addition, manybody Isingtype interactions, such as \({\it H} =  \mathop {\sum }K_{ijk}\sigma _i\sigma _j\sigma _k\), can be readily implemented. The disadvantage of the measurementfeedback scheme is that the FPGA circuit must complete ~Ο(N) vector–vector multiplication and addition within each pulse interval. This imposes a limitation on the machine size N and the pulse repetition frequency. On the other hand, the optical delay line coupling scheme enjoys its inherent highspeed operation with a pulse repetition frequency limited only by optical device performance.
There is a subtle but important difference in the operational principles from quantum mechanical viewpoints between the optical delay line coupling and measurementfeedback coupling schemes. The measurementfeedback scheme does not produce any entanglement among DOPO pulses. The total density operator stays in the product state of individual DOPO pulse density operators, because the coupling is provided by the local operation and classical communications (LOCC). However, the measurementfeedback scheme is capable of implementing the nonunitary state reduction to pump out a spurious entropy from the network and prepare each DOPO pulse close to the Heisenberg limit (minimum uncertainty wave packet) induced by quantum measurements. The wave packet is actually nonGaussian at above threshold, which contributes to enhanced quantum tunneling in the quantum parallel search process.
Figure 6c shows schematically this scenario. Suppose the two DOPOs are connected by the antiferromagnetic Ising coupling constant J_{21} < 0. At the pump rate near threshold and before measurement feedback, the two DOPOs are independently in squeezed vacuum states. If the DOPO1 is measured and a positive amplitude is obtained (\(\tilde X_1 > 0\)), the center position of the DOPO1 wave packet is shifted to the positive side and the variance is reduced, but a long tail exists toward X_{1} = 0 and X_{1} < 0 (nonGaussian). Simultaneously, the center position of the DOPO2 wave packet is shifted to the negative side by injecting a feedback pulse with an amplitude of \(J_{21}\tilde X_1\). In this way, the antiferromagnetic order is implemented in the two quantum states even with a single measurementfeedback step.
Mapping of an Ising Hamiltonian to DOPO network loss
In this section, the inphase amplitude x_{ j }(t) of the jth DOPO is considered as a classical variable. The simple classical picture presented in this section still captures an essential part of the mapping principle.
Pitchfork bifurcation
We consider in this section simple equations of motion that capture the essential computational principle for the sake of clarity. We consider the dynamics of N analog variables x_{ j }, with x_{ j } ∈]−∞, +∞[and j∈{1,⋯, N}, given as follows:
with
where V_{b}(x_{ j }) is the archetype bistable potential in x_{ j } with \(V_{\rm b}\left( {x_j} \right) =  \frac{1}{2}\alpha x_j^2 + \frac{1}{4}x_j^4\) and V_{ l }(x) the analog version of the Ising Hamiltonian with V_{ l }(x) = −∑_{ j }ω_{ jl }x_{ j }x_{ l }. Here t = (γ_{s}/2)τ is the unitless time normalized by the signal field lifetime 2/γ_{s}, γ_{s} is the signal photon decay rate, p = α + 1 = F_{p}/F_{th} is the unitless pump rate normalized by the solitary DOPO threshold pump photon flux \(F_{{\rm th}} = \gamma _{\rm s}\sqrt {\gamma _{\rm p}} {\mathrm{/4}}\kappa\), γ_{p} is the pump photon decay rate, κ is the parametric coupling constant, x_{ j } = X_{ j }/A_{s} is the normalized inphase amplitude, and \(A_{\rm s} = \sqrt {\gamma _{\rm s}\gamma _{\rm p}{\mathrm{/2}}\kappa ^2}\) is the saturation amplitude which is equal to the actual DOPO oscillation amplitude at the normalized pump rate p = 2. α = −1 + p is the bifurcation parameter given by the normalized decay rate (−1) and linear gain (p) for the signal field. Moreover, the parameter \(\epsilon\) is small enough such that \(0 < \epsilon \ll 1\). Note that the Ising coupling constant ω_{ jl } can be arbitrarily chosen under this constraint on the parameter ε. Equation (8) describes the dynamics of a gradient descent system. Note that the equations of motion for a solitary DOPO are given as \(\dot x_j = \alpha x_j  x_j^3\) when ε = 0, which represents the normal form of the supercritical pitchfork bifurcation.
In the uncoupled case, i.e., ε = 0, the potential V_{b} is monostable with x_{ j } = 0 and bistable with x_{ j } = ±\(\sqrt \alpha\) when α < 0 and α > 0, respectively. In the coupled case when \(0 < \epsilon \ll 1\) and α is large enough, the state of each variable x_{ j } can be considered as approximately binary and can be mapped to the Ising spin. In particular, the Ising spin configuration {σ_{ j }}_{ j } is associated to the state {x_{ j }}_{ j } of the system described in Eq. (8) by considering that σ_{ j } is equal to the sign of x_{ j }, i.e., σ_{ j } = x_{ j }/x_{ j }.
When α increases over a critical threshold value, there is a breaking of symmetry and the states x_{ j } become either positive or negative. The emergence of the first nonzero stable steady state can be studied by considering linear stability analysis at the origin, i.e., the Jacobian matrix J associated with Eq. (8) and computed at the state x_{ j } = 0, ∀_{ ij }. This Jacobian matrix is given as follows:
where Ω is the matrix with components ω_{ jl } and I the identity matrix.^{14,42} The destabilization of the zero state corresponds to the emergence of the first nonzero stable steady state after the symmetry breaking and occurs when the maximal eigenvalue of the Jacobian matrix J, noted λ_{1}, is equal to zero. The eigenvalue λ_{1} is given as \(\lambda _1 = \alpha + {\it{\epsilon }}\mu _1\), where μ_{1}(>0) is the maximal eigenvalue of the matrix Ω, which we consider unique for the sake of simplicity. The first nonzero steady state becomes stable when λ_{1} = 0, i.e., when the bifurcation parameter α is equal to α_{ c } given as follows:^{14,42}
As the bifurcation parameter α increases further above the threshold α_{ c }, other states become stable and the system develops a complex attractor landscape. Note that Eq. (11) suggests that α_{ c } < 0 in general.
Condition for successful mapping
The linear stability analysis of the zero state allows determining the bifurcation parameter α_{ c } for which the zero state becomes unstable. Moreover, the eigenvector associated with the eigenvalue λ_{1} indicates the locations of the first nonzero stable steady states just after the bifurcation has occurred but, in general case, these locations do not correspond to the groundstate configurations of the Ising Hamiltonian. However, it can be noticed that, under the particular condition that all analog amplitudes x_{ j } are equal, the first nonzero stable steady states are mapped rigorously to the groundstate configurations of the Ising Hamiltonian.^{42} Indeed, the steadystate condition of Eq. (8) is given as follows when x_{ j } = σ_{ j }x, ∀_{ ij }, with x > 0:
Thus, the homogeneous squared amplitude x^{2} can be written as follows:
with the Ising Hamiltonian H without a dc field. When \({\mathrm{\alpha }}  \frac{{2{\it{\epsilon }}}}{N}{\it H} < 0\), the only solution of Eq. (12) is x = 0. As α increases, the first nonzero steady state appears at the minimal value of the function \(\frac{{2{\it{\epsilon }}}}{N}{\it H}\) as shown in Eq. (13). As shown in Fig. 1a, b, the groundstate configurations of the Ising Hamiltonian appear as a single oscillation mode. In a nutshell, the first nonzero stable steady states of Eq. (12) encode for the groundstate configurations of the Ising Hamiltonian if the steadystate amplitudes x_{ j },∀j, are all equal.
Effect of the amplitude heterogeneity
In more general case, the amplitudes x_{ j } are not equal and the mapping between the first stable steadystate configurations and the minima of the Ising Hamiltonian does not hold.^{14} In practice, approximate solutions to various combinatorial optimization problems can still be obtained under this approximate mapping.^{14,42} In order to quantify the distance between the case of exact mapping and more general case, i.e., when amplitudes are homogeneous and heterogeneous, respectively, we quantify the amplitude heterogeneity using the quantity δ_{ x } defined as follows:
with \(\delta _j = x_j^2  \langle x^2\rangle\) and \(\langle x^2\rangle = \frac{1}{N}\mathop {\sum }\limits_j x_j^2\). The quantity δ_{ x } corresponds to the coefficient of variation of the squared amplitudes. For α > 0 and \(\epsilon \ll 1\), the amplitudes x_{ j } can be expressed as \(x_j = x_j^{\left( 0 \right)} + {\it{\epsilon }}x_j^{\left( 1 \right)} + {\mathrm{O}}\left( {{\it{\epsilon }}^2} \right)\) with \(x_j^{(0)} = \sigma _j\sqrt \alpha\) and \(x_j^{(1)} = \left( {1/2\sqrt \alpha } \right)\mathop {\sum }\limits_l \omega _{jl}\sigma _l\) using the perturbation theory in order to find an approximate solution to the steadystate equation (12). Thus, the squared amplitudes can be expressed as follows in the general case:
Using Eq. (15), we can express \(x_j^2\) as follows:
with \(\delta _j = {\it{\epsilon }}\left( {\sigma _jh_j  \left\langle {\sigma _jh_j} \right\rangle } \right)\), h_{ j } = ∑_{l ≠ j} ω_{ jl }σ_{ l }, and \(\left\langle {x^2} \right\rangle = \alpha  \frac{{2\varepsilon }}{N}H\). We can remark that the homogeneous case, for which there exists exact mapping between the first nonzero stable steady states and the groundstate configurations of the Ising Hamiltonian, is the limit of the general case when δ_{ j } → 0. Thus, the approximate mapping of the Ising Hamiltonian results from the variations of the squared amplitudes δ_{ j }.
Experimentally, δ_{ j } → 0 can be achived by additional measurementfeedback control. That is, CIM measures \(x_j^2\) at each round trip and computes \(\delta _j = x_j^2  \alpha + \frac{{2\varepsilon }}{N}H\). This error signal is negatively fed back to the jth pump pulse or directly to the jth DOPO pulse to suppress δ_{ j }.
Figure 7a shows the modulated threshold gain α_{ c } due to the mutual coupling ω_{ ij } and the success probability P_{s} of finding a ground state vs. the normalized pump rate p for an N = 8 Ising problem, in which each vertex has three edges and there is no Zeeman term (MAXCUT3 problem).^{14} In this problem, each vertex has three edges with the constant weight ω_{ ij } = −0.1 as shown in Fig. 7b. At high pump rates p ≥ 1.3, the possible value of α_{ c } is minimum for the ground state and increases monotonically with the order of excited states. That is, the mapping is successful. However, the threshold gain α_{ c } of the first excited state decreases to below that of the ground state at p < 1. This unexpected result stems from the heterogeneity of the DOPO amplitudes. Figure 7b shows the steadystate amplitude (by circle size) and phase (by color) of each DOPO for the first excited state at p ≲ 1. Five of the total eight DOPOs mutually couple without destructive interference so that they can store large amplitudes. On the other hand, the remaining three DOPOs mutually couple with destructive interference. By keeping amplitudes in these three DOPOs small, the whole network realizes an effective field decay rate even smaller than that of the ground state. Consequently, the first excited state oscillates at a smaller pump power than the ground state, which leads to the malfunction of the DOPO network.
At above the threshold (p ≥ 1), the first excited state is metastable so that the oscillation at the first excited state switches to the oscillation at the ground state by quantum tunneling with a finite lifetime.^{43} This is why the success probability P_{s} increases from p = 1 to p = 1.3. However, if the pump rate is too high (p > 1.3), the potential barrier becomes too high to tunnel from a local minimum to a global minimum, which stabilizes the false oscillation mode at the first excited state. This is the reason why P_{s} decreases at p > 1.3 in Fig. 7a.
We remark that a technique to ensure equal amplitudes of DOPOs is actually realized by using an error detection and correction method so that the ground state of the Ising Hamiltonian is selected by a single oscillation mode.
Experimental CIMs
Optical delay line coupling CIM
The first experimental CIM implements N = 4 Ising spins with a freespace multiplepulse DOPO and achieves alltoall connections with N−1(=3) optical delay lines.^{15} The Ising coupling constants are uniform and antiferromagnetic type (J_{ ij } < 0). When the pump power is increased gradually, the machine finds one of three degenerate ground states of this complete graph with equal probabilities as shown in Fig. 8a. On the other hand, if the Ising coupling is switchedoff, the machine randomly picks 1 of the 8 states by spontaneous symmetry breaking.
The second experimental CIM implements N = 16 Ising spins with a freespace multiplepulse DOPO and connect them sparsely to realize onedimensional ring or MöbiusLadder graph configuration using two or three optical delay lines, respectively.^{22} The Ising coupling constants are uniform. Again, the machine finds one of the degenerate ground states with almost equal probabilities. These two CIMs have a 100% success probability to find ground states and achieve the random sampling capability for the degenerate ground states.
The third experimental CIM implements N = 10,000 Ising spins with a fiber DOPO and realizes onedimensional ring.^{23} The success probability as a function of system size N normalized by the correlation length x_{0} and pump rate p normalized by the threshold pump rate p_{0} agrees fairly well with the theoretical model^{48} as shown in Fig. 8b.
Measurementfeedback CIM as exact solvers
A setup of the measurementfeedback based CIM, where N = 100 DOPO pulses are fully connected by a single measurementfeedback circuit, is installed at Stanford University.^{24} A fraction (~10%) of each pulse intensity is picked off and its inphase amplitude \(\hat X_j\) is measured by balanced optical homodyne detectors. Then, a feedback signal for the ith DOPO pulse, \(\mathop {\sum }\limits_j J_{ij}\tilde X_j\), is computed by an FPGA. This signal is imposed on the feedback optical pulse for the ith DOPO pulse by intensity and phase modulators. Finally, the injection coupler combines the feedback optical pulse and the ith DOPO pulse. Similarly, a feedback optical pulse for the jth DOPO pulse, whose amplitude is proportional to \(\mathop {\sum }\limits_i J_{ji}\tilde X_i\) is prepared and injected to the jth DOPO pulse. After one round trip, the full Ising Hamiltonian is implemented via N injected feedback pulses. Note that asymmetric Ising coupling J_{ ij } ≠ J_{ ji } can be readily implemented in this scheme. On the other hand, a Zeeman term h_{ i } can be implemented by injecting a constant optical pulse which is produced by appropriate amplitude and phase modulation of the master oscillator output.
Figure 9a shows the observed success probability of finding a ground state of the MöbiusLadder graph as a function of the graph size.^{24} Multiple 100run batches were performed for each graph size and the standard deviations are plotted as error bars. Figure 9a also show the histograms of obtaining the lowenergy excited states as well as the ground states. In the case of N = 100, the CIM can find the ground state out of ~10^{30} solution candidates with a probability of 21 ± 9%. Note that the amplitude heterogeneity is not actively suppressed in this experiment. It is theoretically predicted that the success probability is ~100% for the problem size up to N = 100 when the DOPO amplitudes are kept uniform.
Measurementfeedback CIM as approximate solvers
A similar setup of the largerscale CIM, where N = 2000 DOPO pulses are fully connected by a single measurementfeedback circuit, is implemented at NTT.^{25} Major changes from the Stanford CIM^{24} are the fiber length increased from 300 m to 1 km and the pulse repetition frequency increased from 100 MHz to 1 GHz.
Figure 9b shows the observed Ising energy vs. computation time for the MAXCUT problems in N = 2000 complete graphs with alltoall connections.^{25} It is impossible to find the ground state for this size of the complete graph with a current digital computer. We used the 87.8%, even in worst case, approximate solution obtained by the semidefinite programing (SDP)^{20} as a benchmark target. The experimental CIM reaches this target with 70 μs,^{25} while the simulated annealing (SA) implemented in the state of art Central Processing Unit (CPU) reaches the same target with 2.1 ms. Table 1 compares the computational time of experimental CIM with those of four different types of classical algorithms implemented on either CPU or supercomputer (PEZYShobu) at Riken.^{44} The results summarized in Fig. 9b and Table 1 well constitute the evidence for a novel speedup of CIM. Figure 9c shows the histograms of the final energies (or cut values) for the SDP which achieves the best theoretical accuracy at the present time, SA which is the most popular heuristic at the present time, and CIM. The CIM also shows the better performance than SA and SDP in accuracy, too. The computation times for SDP, SA, and CIM are 100 s, 50 ms, and 5 ms, respectively, in this accuracy measurement.
Quantum features of CIM
Optical neural network at the quantum limit
One of the unique features of optical neural networks is the continuous crossover of their operational modes from quantum limit to classical limit. In this section, we will present various numerical results showing the difference between the two regimes and shedding a light on the quantumtoclassical crossover.
Figure 10a shows the trajectory of the variances \(2\left\langle {\Delta \hat X^2} \right\rangle\) and \(2\left\langle {\Delta \hat P^2} \right\rangle\) for the DOPO field in the measurementfeedback CIM consisting of two antiferromagnetically coupled spins.^{40} A minimum uncertainty state at the Heisenberg limit satisfies \(\left\langle {\Delta \hat X^2} \right\rangle \left\langle {\Delta \hat P^2} \right\rangle = 1{\mathrm{/}}16\), which is shown by the dashed line in Fig. 10a. The CIM with a highQ or lowQ cavity, in which a round trip loss is either 10% (−0.5 dB) or 50% (−3dB), continuously excites the quantum states which are close to the Heisenberg limit, as shown by red or blue line in Fig. 10a. On the other hand, optical neural networks at thermal noise limit (k_{B}T\(\gg\)ħω) should operate in the classical regime defined by \(2\left\langle {\Delta \hat X^2} \right\rangle \ge 1{\mathrm{/}}2\) and \(2\left\langle {\Delta \hat P^2} \right\rangle \ge 1{\mathrm{/}}2\), which is shown by the shaded area in Fig. 10a.
The success probability P_{s} of finding the ground state of an N = 16 onedimensional Ising spin model, in which only nearest neighbor antiferromagnetic interaction exists, is numerically evaluated for various values of the temperature parameter n_{th} = k_{B}T/ħω and the result is shown in Fig. 10b.^{17} Here the optical delay line coupling CIM is assumed. In this numerical simulation, the pump rate is abruptly increased from p = 0 to p = p_{0} at t = 0. In the case of ħω\(\gg\)k_{B}T (quantum noise limit), the squeezed vacuum state allows the quantum parallel search during a transient time before the steadystate amplitude is formed. The transient time decreases with the final pump rate p_{0}, so that the maximum success probability is achieved at a final pump rate just above the threshold value, \(p_0 \simeq p_{{\rm th}} = 1\), which allows the DOPO network to have a sufficient time to search for the solution by creating the quantum noise correlation. If the final pump rate p_{0} is far above the threshold, the coherent field with random 0phase or πphase is formed quickly in each DOPO through spontaneous symmetry breaking, before the quantum search establishes the sufficient quantum noise correlation and identifies a correct solution. The quantum tunneling is not strong enough to overcome the potential barrier separating 0phase and πphase when the oscillation field is strong. In this way, the DOPO network is trapped in one of the excited states (local minima). This reasoning explains the monotonic decrease in P_{s} for n_{th}\(\ll\)1 as a function of p(≥ 1) in Fig. 10b.
In the case of ħω\(\ll\)k_{B}T (thermal noise limit), the squeezed thermal state, which is formed before reaching the threshold, does not allow the quantum parallel search during a transient time before the steadystate amplitude is formed. This is because the quantum coherence between the states \(\left X \right\rangle\) and \(\left {  X} \right\rangle\) is ruined by the destructive interference among different photon number eigenstates in a thermal state. The maximum success probability is achieved at a final pump rate p_{0}\(\gg\)p_{th} = 1, where the coherent meanfield is produced and searches for a solution. This mode of operation requires that the meanfield amplitude is larger than the thermal noise amplitude. This is an operation regime of classical neural network (CNN).
Threestep quantum computation at criticality
The threestep quantum computation of the optical delay line coupling CIM is illustrated in Fig. 11a. The success rates to find either one of the two degenerate ground states in N = 16 onedimensioned Ising spins with antiferromagnetic coupling are plotted as a function of normalized computation time t/t_{ c }, where t_{ c } is a round trip time.^{17} The pump rate is linearly increased from below threshold to above threshold. The success rate for a random guess is \(P_{\rm s} = (1{\mathrm{/}}2)^{16} \simeq 10^{  5}\). After a few round trips, the success rates are increased by two orders of magnitude due to the formation of quantum noise correlation and this trend continues to t/t_{ c } = 60, where the average photon number per DOPO pulse reaches n = 1 and the collective symmetry breaking is kickedin. One ground state is selected, while the other is not. The probability to find the selected ground state increases exponentially while that to find the unselected ground state decreases exponentially. This exponential increase in the success rate is made possible by the bosonic final state stimulation and the associated crossgain saturation. The quantum parallel search at below threshold, the collective symmetry breaking at threshold, and the quantumtoclassical crossover at above threshold are the three major steps in CIM.
Quantum entanglement
In the above example of an N = 16 onedimensional ring consisting of antiferromagnetically coupled Ising spins, the ground state should have a negative correlation between the inphase amplitudes X_{ i } and X_{i±1} of neighboring DOPOs. This observation motivates us to define an EPRlike operator,
to demonstrate the entanglement that exists in the DOPO network. Since \(\hat u_ +\)and \(\hat v_ \) commute, the simultaneous eigenstate for \(\hat u_ +\) and \(\hat v_ \) should exist and such a state satisfies \(\left\langle {\Delta \hat u_ + ^2} \right\rangle = \left\langle {\Delta \hat v_  ^2} \right\rangle = 0\). On the other hand, if all DOPOs are independent (separable), it is shown that \(\left\langle {\Delta \hat u_ + ^2} \right\rangle + \left\langle {\Delta \hat v_  ^2} \right\rangle \ge 8\).^{17} This means that if \(\left\langle {\Delta \hat u_ + ^2} \right\rangle + \left\langle {\Delta \hat v_  ^2} \right\rangle < 8\), such a system has the entanglement due to mutual coupling. Figure 11b demonstrates that the optical delay line coupling CIM indeed establishes the quantum entanglement in the system over a wide range of the computation time (or pump rate).^{17}
Quantum coherence
Figure 12a–d show the contour maps of the density matrix elements \(\left\langle {X\left {\hat \rho } \rightX^{\prime}} \right\rangle\) of the measurementfeedback CIM consisting of two DOPOs with antiferromagnetic coupling as a function of normalized computation time N = t/t_{ c }.^{40} In a highQ cavity with a round trip loss of 0.1% (Fig. 12a), the DOPO state near threshold (t/t_{ c } = 60) is indeed in a Schrödinger catlike state. However, in a lowQ cavity with a round trip loss of 50% (Fig. 12d), the DOPO state evolves from a vacuum state at t/t_{ c } = 0, a squeezed vacuum state at threshold to a coherent state at above threshold. The quantum states of the DOPO seem to stay always in Gaussian states in Fig. 12d, which is actually not the case as we will discuss in the next section. In any case, the quantum coherence \(\left\langle {X\left {\hat \rho } \right  X} \right\rangle\), represented by the vertical slice, always exists in the CIM, no matter how large the cavity loss is.
NonGaussian wavepackets
The archetype bistable potential \(V_b\left( {x_j} \right) =  \frac{1}{2}\alpha x_i^2 + \frac{1}{4}x_i^4\) at above threshold has a steep potential for large x_{ j } values but a shallow potential barrier for small x_{ j } values when the DOPO is pumped not far above the threshold, i.e. 0 < α\(\lesssim\)1. As a result of this asymmetric potential profile, the DOPO wave packet has a rapidly decaying tail for large x_{ j } values and slowly decaying tail for small x_{ j } values.
Figure 13a shows the time evolution of skewness \(\left\langle {\Delta \hat X^3} \right\rangle\) of the two DOPO fields.^{45} At below threshold, \(\left\langle {\Delta \hat X^3} \right\rangle\) is close to zero, which is expected for a Gaussian state. At threshold, however, \(\left\langle {\Delta \hat X^3} \right\rangle\) of the two DOPO fields depart to opposite directions, which indicates that the internal DOPO state has a slowly decaying tail toward a central potential and a rapidly decaying tail toward outer potential. Such a nonGaussian state allows more frequent quantum tunneling before the final decision is made as demonstrated in the simulation result shown in Fig. 13b.^{45} This switching behavior is a characteristic quantum parallel search of CIM, which is in sharp contrast to that of a hypothetical machine supporting Gaussian states (Fig. 13c).^{45} In this case, the two DOPOs never switch their polarities due to the suppressed quantum tunneling.
Gottesman–Knill theorem
Not all quantum dynamics are difficult to simulate by classical digital computers. Some of representative and important quantum processes, including entangled state generation and purification, can be efficiently simulated by classical methods, so that such a quantum system alone is unlikely to achieve a computation power exceeding the current states of art in digital computing technology.
Gottesman and Knill were the first to point out this subtle distinction between classical and quantum information processing.^{46} The statement of the Gottesman–Knill theorem can be summarized as follows:
If a quantum process starts with

1.
computational basis states, such as ground states \(\left 0 \right\rangle _1\left 0 \right\rangle _2 \cdots \left 0 \right\rangle _N\),

2.
employs a limited set (Clifford group) of unitary gates such as Hadamard gates, phase gates and controlledNOT gates,and ends with

3.
projective measurements along the computational basis states \(\left\{ {\left 0 \right\rangle \left 1 \right\rangle } \right\}\),
such a quantum process can be efficiently simulated by classical digital computers. A reader who is familiar with the famous Shor’s factoring algorithm knows that it requires implementation of a fractional phase which is not included in the above Clifford group constraint, so that the Shor algorithm is outside of the above limitation.
A continuous variable (harmonic oscillator) version of the above theorem was developed by Bartlett et al.^{47} The statement of this theorem runs as follows:
If a quantum process with harmonic oscillators starts with

1.
Gaussian states, such as coherent states \(\left \alpha \right\rangle _1\left \alpha \right\rangle _2 \cdots \left \alpha \right\rangle _N\),

2.
employs a limited set of unitary gates such as squeezing gates and displacement gates,and ends with

3.
projective measurements of one quadrature amplitude (by homodyne detection) or two quadrature amplitudes (by heterodyne detection),
such a quantum process can be efficiently simulated by classical digital computers.
If we consider the above theorem against the CIM, we can identify the gain saturation (or two photon absorption) and single photon loss as the two essential quantum dynamics which make the CIM difficult to simulate efficiently by classical methods. The gain saturation term, \( x_j^3\), and the dissipation term, αx_{ j }, in Eq. (12) produce a nonGaussian state and provide the nonclassical nature to CIM. Indeed, as shown in Fig. 13d, the success rate to find a ground state of N = 16 onedimensional Ising spin problem based on the exact theory is higher than that based on the Gaussian approximation.^{45}
Summary
Optical neural networks can operate at the quantum limit (k_{B}T/ħω\(\ll\)1) at room temperatures and realize a quantum parallel search based on squeezed vacuum states. Such CIM are composed of the two constituent devices: quantum neurons and quantum synapses.
Quantum neurons are provided by degenerate optical parametric amplifiers/oscillators, which realize the quantum parallel search at below the oscillation threshold, the decision making at the threshold based on collective symmetry breaking, and the quantumtoclassical amplification of computational results at above the threshold using bosonic final state stimulation. Quantum synapses are provided by either direct coupling with optical delay lines or indirect coupling with a measurementfeedback circuit. They utilize distinct computational resources: quantum noise correlation (entanglement) in the optical delay line coupling CIM and measurementinduced wave packet reduction to nonGaussian states in the measurementfeedback CIM.
CIM can solve various combinatorial optimization problems by mapping them on NPhard Ising problems. Optical neural networks at the quantum limit (CIM) can outperform those at classical regime by exploiting superposition states for quantum parallel search of solutions. The gain saturation and single photon loss are the two indispensable elements to make CIM hard to simulate by classical methods. These two dissipation processes and associated fluctuations from external reservoirs are crucial resources to accelerate the search process.
CIM is based on singlemode squeezing and oscillation at degenerate frequency. This preferred mode of operation is made possible by employing a pump rate not far above the oscillation threshold and a narrow bandpass filter in a fiber ring cavity. Nonlinear dynamics of DOPO in various parameter ranges, including crossover from degenerate to nondegenerate modes, are presented in refs. 31, 48, 49.
Another type of optical neural networks, coherent XY machines, have been recently demonstrated using coupled lasers,^{50,51} nondegenerate OPOs,^{52} and polaritons.^{53} The Kuramoto model and the continuous optimization problems, in general, can be potentially implemented on these machines.
Data and code availability
All relevant data and codes are available from the authors upon request.
Change history
01 February 2018
A correction to this article has been published and is linked from the HTML version of this article.
References
 1.
Hopfield, J. J. & Tank, D. W. Computing with neural circuits: a model. Science 233, 625–633 (1986).
 2.
Tank, D. & Hopfield, J. Simple ‘neural’ optimization networks: an A/D converter, signal decision circuit, and a linear programming circuit. IEEE Trans. Circ. Syst. 33, 533–541 (1986).
 3.
ErcseyRavasz, M. & Toroczkai, Z. Optimization hardness as transient chaos in an analog approach to constraint satisfaction. Nat. Phys. 7, 966–970 (2011).
 4.
Qian, L., Winfree, E. & Bruck, J. Neural network computation with DNA strand displacement cascades. Nature 475, 368–372 (2011).
 5.
Farhi, E. et al. A quantum adiabatic evolution algorithm applied to random instances of an NPcomplete problem. Science 292, 472–475 (2001).
 6.
Kadowaki, T. & Nishimori, H. Quantum annealing in the transverse Ising model. Phys. Rev. E 58, 5355–5363 (1998).
 7.
Brooke, J., Bitko, D., Rosenbaum, T. F. & Aeppli, G. Quantum annealing of a disordered magnet. Science 284, 779–781 (1999).
 8.
Johnson, M. W. et al. Quantum annealing with manufactured spins. Nature 473, 194–198 (2011).
 9.
Rønnow, T. F. et al. Defining and detecting quantum speedup. Science 345, 420–424 (2014).
 10.
Boixo, S. et al. Evidence for quantum annealing with more than one hundred qubits. Nat. Phys. 10, 218–224 (2014).
 11.
Barends, R. et al. Digitized adiabatic quantum computing with a superconducting circuit. Nature 534, 222–226 (2016).
 12.
Bunyk, P. I. et al. Architectural considerations in the design of a superconducting quantum annealing processor. IEEE Trans. Appl. Supercond. 24, 1700110 (2014).
 13.
Rieffel, E. G. et al. A case study in programming a quantum annealer for hard operational planning problems. Quant. Inform. Process. 14, 1–36 (2015).
 14.
Wang, Z., Marandi, A., Wen, K., Byer, R. L. & Yamamoto, Y. Coherent Ising machine based on degenerate optical parametric oscillators. Phys. Rev. A 88, 063853 (2013).
 15.
Marandi, A., Wang, Z., Takata, K., Byer, R. L. & Yamamoto, Y. Network of timemultiplexed optical parametric oscillators as a coherent Ising machine. Nat. Photonics 8, 937–942 (2014).
 16.
Takata, K., Marandi, A. & Yamamoto, Y. Quantum correlation in degenerate optical parametric oscillators with mutual injections. Phys. Rev. A 92, 043821 (2015).
 17.
Maruo, D., Utsunomiya, S. & Yamamoto, Y. Truncated Wigner theory of coherent Ising machines based on degenerate optical parametric oscillator network. Phys. Scr. 91, 083010 (2016).
 18.
Barahona, F. On the computational complexity of Ising spin glass models. J. Phys. Math. Gen. 15, 3241–3253 (1982).
 19.
Istrail, S. Proc. Thirtysecond Annual ACM Symposium on Theory of Computing, STOC, 87–96 (2000).
 20.
Haribara, Y., Utsunomiya, S. & Yamamoto, Y. Computational principle and performance evaluation of coherent Ising machine based on degenerate optical parametric oscillator network. Entropy 18, 151 (2016).
 21.
Lucas, A. Ising formulations of many NP problems. Front. Phys. 2, 5 (2014).
 22.
Takata, K. et al. A 16bit coherent Ising machine for onedimensional ring and cubic graph problems. Sci. Rep. 6, 34089 (2016).
 23.
Inagaki, T. et al. Largescale Ising spin network based on degenerate optical parametric oscillators. Nat. Photonics 10, 415–419 (2016).
 24.
McMahon, P. L. et al. A fully programmable 100spin coherent Ising machine with alltoall connections. Science 354, 614 (2016).
 25.
Inagaki, T. et al. A coherent Ising machine for 2000node optimization problems. Science 354, 603 (2016).
 26.
Nigg, S. E., Lörch, N. & Tiwari, R. P. Robust quantum optimizer with full connectivity. Sci. Adv. 3, e1602273 (2017).
 27.
Puri, S., Andersen, C. K., Grimsmo, A. L. & Blais, A. Quantum annealing with alltoall connected nonlinear oscillators. Nat. Commun. 8, 15785 (2017).
 28.
Walls, D. F. & Milburn, G. J. Quantum Optics (Springer, BerlinHeidelberg, 2008).
 29.
Byer, R. L., Oshman, M. K., Young, J. F. & Harris, S. E. Visible CW parametric oscillator. Appl. Phys. Lett. 13, 109 (1968).
 30.
Serkland, D. K., Fejer, M. M., Byer, R. L. & Yamamoto, Y. Squeezing in a quasiphasematched LiNbO_3 waveguide. Opt. Lett. 20, 1649 (1995).
 31.
Hamerly, R. et al. Reduced models and design principles for halfharmonic generation in synchronously pumped optical parametric oscillators. Phys. Rev. A 94, 063809 (2016).
 32.
Nambu, Y. Nobel Lecture: Spontaneous symmetry breaking in particle physics: a case of cross fertilization. Rev. Mod. Phys. 81, 1015 (2009).
 33.
Goto, H. Bifurcationbased adiabatic quantum computation with a nonlinear oscillator network. Sci. Rep. 6, 21686 (2016).
 34.
Goto, H. Universal quantum computation with a nonlinear oscillator network. Phys. Rev. A 93, 050301 (R) (2016).
 35.
Goto, H., Lin, Z. & Nakamura, Y. Dissipative quantum bifurcation machine: quantum heating of coupled nonlinear oscillators. arXiv:1707.00986 [quantph] (2017).
 36.
Mandel, L. & Wolf, E. Optical Coherence and Quantum Optics (Cambridge University Press, 1995).
 37.
Wolinsky, M. & Carmichael, H. J. Quantum noise in the parametric oscillator: from squeezed states to coherentstate superpositions. Phys. Rev. Lett. 60, 1836 (1988).
 38.
Krippner, L., Munro, W. J. & Reid, M. D. Transient macroscopic quantum superposition states in degenerate parametric oscillation: calculations in the largequantumnoise limit using the positive P representation. Phys. Rev. A 50, 4330 (1994).
 39.
Drummond, P. D., Dechoum, K. & Chaturvedi, S. Critical quantum fluctuations in the degenerate parametric oscillator. Phys. Rev. A 65, 033806 (2002).
 40.
Yamamura, A., Aihara, K. & Yamamoto, Y. Quantum model for coherent Ising machines: Discretetime measurement feedback formulation. Phys. Rev. A 96, 053834 (2017).
 41.
Ising, E. Beitrag zur theorie des ferromagnetismus. Z. Phys. 31, 253–258 (1925).
 42.
Leleu, T., Yamamoto, Y., Utsunomiya, S. & Aihara, K. Combinatorial optimization using dynamical phase transitions in drivendissipative systems. Phys. Rev. E 95, 022118 (2017).
 43.
Kinsler, P. & Drummond, P. D. Quantum dynamics of the parametric oscillator. Phys. Rev. A 43, 6194 (1991).
 44.
Haribara, Y., Ishikawa, H., Utsunomiya, S., Aihara, K. & Yamamoto, Y. Performance evaluation of coherent Ising machines against classical neural networks. Quant. Sci. Tech. 2, 044002 (2017).
 45.
Shoji, T., Aihara, K. & Yamamoto, Y. Quantum model for coherent Ising machine: Stochastic differential equations with replicator dynamics. Phys. Rev. A 96, 053833 (2017).
 46.
Gottesman, D. The Heisenberg representation of quantum computers. arXiv:quantph/9807006 (1998).
 47.
Bartlett, S. D., Sanders, B. C., Braunstein, S. L. & Nemoto, K. Efficient classical simulation of continuous variable quantum information processes. Phys. Rev. Lett. 88, 097904 (2002).
 48.
Hamerly, R. et al. Topological defect formation in 1D and 2D spin chains realized by network of optical parametric oscillators. Int. J. Mod. Phys. B 30, 1630014 (2016).
 49.
Roslund, J., Medeiros de Araújo, R., Jiang, S., Fabre, C. & Treps, N. Wavelengthmultiplexed quantum networks with ultrafast frequency combs. Nature Photonics 8, 109 (2014).
 50.
Nixon, M., Ronen, E., Friesem, A. A. & Davidson, N. Observing geometric frustration with thousands of coupled lasers. Phys. Rev. Lett. 110, 184102 (2013).
 51.
Tamate, S., Yamamoto, Y., Marandi, A. McMahon, P. & Utsunomiya, S. Simulating the classical XY model with a laser network. arXiv:1608.00358 [physics.optics] (2016).
 52.
Takeda, Y. et al. Boltzmann sampling for an XY model using a nondegenerate optical parametric oscillator network. Quant. Sci. Tech. 3, 014004 (2018).
 53.
Berloff, N. G., Kalinin, K., Silva, M., Langbein, W. & Lagoudakis P. G. Realizing the XY Hamiltonian in polariton simulators. arXiv:1607.06065 [condmat.meshall] (2016).
Acknowledgements
The authors wish to thank R. L. Byer, H. Mabuchi, R. D. Drummond, P. McMahon, A. Marandi, C. Langrock, T. Inagaki, T. Honjo, K. Inaba, R. Hamerly, Y. Haribara, D. Maruo, K. Takata, Z. Wang, A. Yamamura, and T. Shoji for their useful discussions. This research was supported by ImPACT Program of Council for Science, Technology and Innovation (Cabinet Office, Government of Japan).
Author information
Affiliations
Contributions
Y.Y. wrote the manuscript, K.A., K.K., T.L., S.K., M.F., K.I., and H.T. critically read and revised the manuscript.
Corresponding author
Correspondence to Yoshihisa Yamamoto.
Ethics declarations
Competing interests
The authors declare that they have no competing financial interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Change history: A correction to this article has been published and is linked from the HTML version of this article.
A correction to this article is available online at https://doi.org/10.1038/s4153401800617.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yamamoto, Y., Aihara, K., Leleu, T. et al. Coherent Ising machines—optical neural networks operating at the quantum limit. npj Quantum Inf 3, 49 (2017) doi:10.1038/s4153401700489
Received
Revised
Accepted
Published
DOI
Further reading

Integrated Coherent Ising Machines Based on SelfPhase Modulation in Microring Resonators
IEEE Journal of Selected Topics in Quantum Electronics (2020)

Annealing by simulating the coherent Ising machine
Optics Express (2019)

Simulating frustrated antiferromagnets with quadratically driven QED cavities
Physical Review A (2019)

Persistent Coherent Beating in Coupled Parametric Oscillators
Physical Review Letters (2019)

Equation Planting: A Tool for Benchmarking Ising Machines
Physical Review Applied (2019)