Effective implementation of $$\text{L}{0}$$ -regularised compressed sensing with chaotic-amplitude-controlled coherent Ising machines

Gunathilaka, Mastiyage Don Sudeera Hasaranga; Kako, Satoshi; Inui, Yoshitaka; Mimura, Kazushi; Okada, Masato; Yamamoto, Yoshihisa; Aonishi, Toru

doi:10.1038/s41598-023-43364-8

Download PDF

Article
Open access
Published: 26 September 2023

Effective implementation of $\text{L}{0}$-regularised compressed sensing with chaotic-amplitude-controlled coherent Ising machines

Mastiyage Don Sudeera Hasaranga Gunathilaka¹,
Satoshi Kako²,
Yoshitaka Inui²,
Kazushi Mimura^1,4,
Masato Okada⁵,
Yoshihisa Yamamoto^2,3 &
…
Toru Aonishi^1,5

Scientific Reports volume 13, Article number: 16140 (2023) Cite this article

917 Accesses
2 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Coherent Ising machine (CIM) is a network of optical parametric oscillators that can solve large-scale combinatorial optimisation problems by finding the ground state of an Ising Hamiltonian. As a practical application of CIM, Aonishi et al., proposed a quantum-classical hybrid system to solve optimisation problems of $l_0$-regularisation-based compressed sensing. In the hybrid system, the CIM was an open-loop system without an amplitude control feedback loop. In this case, the hybrid system is enhanced by using a closed-loop CIM to achieve chaotic behaviour around the target amplitude, which would enable escaping from local minima in the energy landscape. Both artificial and magnetic resonance image data were used for the testing of our proposed closed-loop system. Compared with the open-loop system, the results of this study demonstrate an improved degree of accuracy and a wider range of effectiveness.

Slowing quantum decoherence of oscillators by hybrid processing

Article Open access 15 June 2022

Variational quantum algorithm for experimental photonic multiparameter estimation

Article Open access 28 February 2024

Maximum information states for coherent scattering measurements

Article 21 January 2021

Introduction

Compressed sensing (CS) is a method of reconstructing a high-dimensional signal or image based on highly downsampled measurements.

There has been considerable interest in it across a wide range of fields and applications. Such as in the field of astronomy, a possible way to transmit data to Earth from spacecraft¹ has been attempted. And there are proposed methods with CS on astronomical image compression and in compression on remotely sensed data^2,3 as well. And in radar technologies for the reconstruction of the target image CS has been used⁴. On the other hand in the medical field using embedded compression using CS to improve energy efficiency in Electrocardiogram (ECG) machines has been proposed⁵.

$$\begin{aligned} \hat{x} = \mathop {{\text {argmin}}}\limits _{x \in \mathbb {R}^N}\Vert x\Vert _{p} \ \ subject \ to \ y = Ax . \end{aligned}$$

(1)

The above equation shows an observed signal $y \in \mathbb {R}^M$, an observation matrix $A \in \mathbb {R}^{M\times N}$, and a source signal $x \in \mathbb {R}^N$. Hereafter, the ratio of the number of non-zero entries in x to N is defined as the sparseness a, and the ratio of M to N is defined as the compression ratio $\alpha$. Since $l_1$-norm CS is a convex optimisation problem, there are many efficient algorithms for optimisation of $l_1$-norm CS that are widely applied in the real-world problems mentioned above. However, there has been a suggestion that $l_0$-norm CS should outperform $l_1$-norm CS since the $l_1$-norm penalty does not lead to any solution shrinkage^6,7. In the thermodynamic limit N, M $\longrightarrow$ $\infty$ with $\alpha = M/N$ kept fixed, an $l_0$-norm CS’s threshold for a, determining whether or not the problem has a solution with no error, is larger than that of $l_1$-norm CS’s^6,7. Nonetheless, the optimisation in $l_0$-norm CS is challenging since it involves combinatorial optimisation.

Numerous attempts have been made to overcome the issue in $l_0$-norm CS optimisations. $l_0$-norm CS can be formulated as a two-fold optimisation^8,9.

$$\begin{aligned} (\hat{R}, \hat{\sigma }) = \mathop {{\text {argmin}}}\limits _{\sigma \in \{0,1\}^{N}}\mathop {{\text {argmin}}}\limits _{R\in \mathbb {R}^{N}} \left( \Vert y - A(\sigma \circ R)\Vert _{2}^{2}\right) \ \ subject \ to \ \Vert \sigma \Vert _{0} \le \Omega . \end{aligned}$$

(2)

Here $R \in \mathbb {R}^N$ and $\sigma \in \left\{ {0,1}\right\} ^N$ correspond to the source signal and support vector, respectively. Especially, each entry in the support vector taking either 0 or 1 represents whether each entry in the source signal is zero or non-zero. The condition $\Vert \sigma \Vert _{0} \le \Omega$ is a sparsity-inducing prior for constraining the number of non-zero entries to be $\Omega$. Therefore, the optimisation with respect to $\sigma$ can be regarded as a quadratic-constrained binary optimisation problem to find a ground state of a two-state Potts Hamiltonian. Based on this formulation, simulated annealing (SA) algorithm has been attempted⁶. On the other hand, Aonishi et al., attempted to solve optimisation problems of $l_0$-norm CS with a quantum-classical hybrid approach. $l_0$-norm CS implemented with the hybrid system is given as a regularisation form as follows¹⁰.

$$\begin{aligned} (R, \sigma ) = \mathop {{\text {argmin}}}\limits _{\sigma \in \{0,1\}^{N}}\mathop {{\text {argmin}}}\limits _{R\in \mathbb {R}^{N}} \left( \frac{1}{2} \Vert y - A(\sigma \circ R)\Vert _{2}^{2} + {\lambda } \Vert \sigma \Vert _{0}\right) . \end{aligned}$$

(3)

The element-wise representation of Eq. (3) gives the following Hamiltonian.

$$\begin{aligned} \mathscr {H} = \sum _{r<r'}^{N}\sum _{k = 1}^{M} A_{r}^{k}A_{r'}^{k}R_{r}R_{r'}\sigma _{r}\sigma _{r'} - \sum _{r=1}^{N}\sum _{k =1}^{M} y^{k}A_{r}^{k}R_{r}\sigma _{r} + {\lambda } \sum _{r = 1}^{N} \sigma _r , \end{aligned}$$

(4)

where an element $A^k$ in A, an element $y^k$ in y, an element $R_r$ in R and an element $\sigma _r$ in $\sigma$. Optimisation with respect to $\sigma$ in Eq. (4) is a quadratic unconstrained binary optimisation (QUBO) problem, which is implementable with a quantum machine such as the coherent Ising machine (CIM)^10,11,12,13. In the quantum-classical hybrid approach to conducting $l_0$-regularised CS, $\sigma$ is optimised by the CIM while R is optimised by a Classical Digital Processor (CDP) (see Fig. 1).

The CIM architecture in the hybrid approach was an open-loop (OL) CIM with the Zeeman term. The hybrid approach with the OL-CIM is hereafter referred to as OL-CIM-CDP. Note that the OL means the lack of feedback loop for amplitude control described below. It has been reported that the imbalance in the size of the interaction term and the Zeeman term degrades the system performance¹⁴. To balance these terms, for the local field, the measured-amplitudes were binarised. OL-CIM-CDP in this formulation outperformed SA on the regularisation form¹⁰.

The close-loop CIM, in which the amplitudes of optical parametric oscillator (OPO) pulses are controlled to a target value, have been proposed to improve the performance of CIM’s ground-state search^15,16. Especially, introducing auxiliary nonlinear dynamics forcefully trying to equalise to a target value results in chaotic behaviour around the target in the CIM which may result in escaping from local minima in the energy landscape. This chaotic method is referred to as chaotic amplitude control (CAC)^{15,16,17,18,19}. Recently, Inui et al., have proposed an approach to efficiently incorporate the Zeeman terms in CAC-CIM by scaling the Zeeman terms with target amplitude to match that of the interaction term¹⁶.

In this paper, following Inui et al.’s approach, we modify the CAC-CIM for performing QUBO in $l_0$-regularised CS and attempt to improve the performance of the hybrid CIM-CDP system by replacing the OL-CIM with the CAC-CIM with the Zeeman term (see Fig. 1). The hybrid system proposed here is hereafter referred to as CAC-CIM-CDP. Firstly, to demonstrate the effectiveness of CAC-CIM for performing QUBO in the support estimation, we compare the performance of CAC-CIM to those of OL-CIM and SA. Then, to demonstrate the effectiveness of CAC-CIM-CDP for performing an alternating minimisation, we compare the performance of CAC-CIM-CDP to that of OL-CIM-CDP on artificial random data, as well as magnetic resonance imaging (MRI) data.

Results

Alternating minimisation algorithm

Alternating minimisation procedures on CAC-CIM-CDP and OL-CIM-CDP are summarised in Algorithm 1 and Algorithm 2, respectively. This type of minimisation suggests the back-and-forth optimisation performed between the CIM and CDP. CIM passes the optimisation results to the CDP after optimising the support, as shown in Fig. 1. The CDP then optimises the signal and sends the resulting signal to the CIM for support optimisation. In Algorithm 1 and Algorithm 2, indicate the number of iterations of alternating minimisation, the initial values and the integration interval for stochastic differential equations (SDEs) of CIM and so on. The schedules of the pump rate, threshold and target amplitude are given in “Section Schedule of pump rate, threshold and target amplitude for optimisation in CIM”. The computational time of CIM scales exponentially with the size of the problem N as exp$(O(\sqrt{N}))$^20,21.

Outline of the CIM models and injection field for QUBO on support estimation

On CIM, $l_0$-regularised CS is performed by updating the injection field dictated by the local field, which is determined by the gradient of the QUBO Hamiltonian Eq. (4) with respect to the spin coordinates. Aonishi et al., proposed OL-CIM-CDP, which is based on an open-loop injection scheme¹⁰. They used the CIM model expressed as the Wigner stochastic differential equation (W-SDE) Eq. (13) and Eq. (14) (in Methods) with the following injection field.

$$\left( \frac{dc_{r}}{dt}\right) _{inj,r} = \left( \left| {h_r}\right| - \eta \right) .$$

(5)

$$\begin{aligned}{} & {} h_{r} = -{\sum _{r' = 1 (\ne r)}^{N}\sum _{k = 1}^{M}} A_r^k A_{r'}^k R_{r'}H(c_{r'}) + \sum _{k=1}^M A_{r}^k y^{k}, \end{aligned}$$

(6)

Here, $h_r$ is the local field expressed as Eq. (6). $R_r$ is the signal value estimated by the CDP. $c_r$ is the in-phase amplitude of the r-th OPO pulse, and $H(c_r)$ is the binarised in-phase amplitude by the Heaviside step function as proposed in the discrete simulated bifurcation²². $\eta$ is the threshold which is related to the $l_0$-regularisation parameter $\lambda$ by $\eta = \sqrt{2\lambda }$ according to the Maxwell rule (see¹⁰ for a detailed explanation). In the local field Eq. (6), the mutual interaction is $\tilde{J}_{rr'} = -\sum _{k = 1}^M A_r^k A_{r'}^k$ and the Zeeman term is $\sum _{k=1}^M A_{r}^k y^{k}$. Substituting the observation model Eq. (26) (in “Section Observation model for compressed sensing”) into Eq. (6) when $w_{noise} = 0$ (no observation noise), the local field Eq. (6) can be expressed as follows.

$$\begin{aligned} h_r = -{\sum _{r' = 1 (\ne r)}^{N}\sum _{k = 1}^{M}} A_{r}^{k}A_{r'}^{k}R_{r'}H\left( c_{r'}\right) + {\sum _{r' = 1}^{N}\sum _{k = 1}^{M}}A_{r}^{k}A_{r'}^{k}x_{r'} \xi _{r'}, \end{aligned}$$

(7)

where $x_r$ is the true signal value, $\xi _r$ is the true support taking 1 or 0. The Zeeman term in the second term of Eq. (7) can be regarded as the matched filter, in which $A^T A$ is calculated. The mutual interaction term in the first term plays a role in removing off-diagonal elements ($r \ne r'$) corresponding to cross-talk noise in the Zeeman term, which are induced by the cross-correlation among the column vectors $A_1,\ldots ,A_N$ in A. To obliterate the cross-talk noise, the in-phase amplitude $c_r$ needs to be the same as the amplitude of $\xi _r$ if $R_r=x_r$. Hence, $c_r$ is binarised to either 1 or 0. In Fig. 2e, a typical evolution of $c_r$ in the open-loop-type W-SDE is illustrated. $c_r$ does not keep the same amplitude as that of $\xi _r$ and increases with increasing the pump rate.

In this paper, we propose CAC-CIM-CDP, based on a closed-loop injection scheme with CAC. The idea of CAC for CIM was first introduced by Leleu et al.,¹⁷. It simply states that forcefully trying to equalise the amplitudes of the system to a specific value (in CAC, target amplitude $\tau$) may result in a chaotic behaviour in the system which may result in escaping from local minima in the energy landscape. In this paper, we used two CIM models expressed as W-SDE Eqs. (15) and (16) and Positive-P stochastic differential equation (P-SDE) Eqs. (17)–(19) (in “Section Stochastic differential equation in OL-CIM-CDP and CAC-CIM-CDP”) commonly having the following injection field with CAC feedback.

$$\begin{aligned}{} & {} \left( \dfrac{d\mu _{r}}{dt}\right) _{inj,r} = je_r\left( R_rh_r - \dfrac{\eta ^2}{{2}}\sqrt{\dfrac{\tau }{g^2}}\right) , \end{aligned}$$

(8)

$$\begin{aligned}{} & {} {\dfrac{d}{dt}e_{r} = -\beta \left( g^2\tilde{\mu }_{r}^2 - \tau \right) e_{r}}, \end{aligned}$$

(9)

$$\begin{aligned}{} & {} \tilde{\mu }_{r} = \mu _{r} + \sqrt{\frac{1}{4j}}W_{R,r}, \end{aligned}$$

(10)

$$\begin{aligned}{} & {} h_r = -{\sum _{r' = 1 (\ne r)}^{N}\sum _{k = 1}^{M}} A_{r}^{k}A_{r'}^{k}R_{r'}\dfrac{1}{2}\left( \tilde{\mu }_{r'} + \sqrt{\dfrac{\tau }{g^2}} \right) {+} \sum _{k = 1}^{M} \sqrt{\dfrac{\tau }{g^2}}{A_{r}^{k}y^{k}}, \end{aligned}$$

(11)

where $h_r$ is the local field expressed as Eq. (11), $e_r$ is the auxiliary variable for the error feedback in the CAC feedback loop, and $\tau$ indicates the target amplitude for the CAC. $R_r$ is the signal value estimated by the CDP, which is the same as that of OL-CIM-CDP. $\eta$ is the threshold given by $\eta = \sqrt{2\lambda }$, which is introduced to keep consistency with OL-CIM-CDP. As described in “Section Stochastic differential equation in OL-CIM-CDP and CAC-CIM-CDP” in Methods, j is the normalised out-coupling rate for optical homodyne measurement, and $g^2$ is the nonlinear saturation parameter of the CIM which determines the abrupt jump of the photon number at the OPO threshold and the amplitude of the quantum noise present in CIM. $\tilde{\mu }_r$ implies the measured-amplitude, and ${W_{R,r}}$ is the independent real Gaussian noise process, which is the same as that in W-SDE (15) and P-SDE (17). In the local field Eq. (11), the mutual interaction is $\tilde{J}_{rr'} = -\sum _{k = 1}^M A_r^k A_{r'}^k$ and the Zeeman term is $h_r^z = \sqrt{{\tau /g^2}}\sum _{k=1}^M A_r^k y^k$. Substituting the observation model Eq. (26) into Eq. (11) when $w_{noise} = 0$ (no observation noise), the local field Eq. (11) can be expressed as follows.

$$\begin{aligned} h_r = -{\sum _{r' = 1 (\ne r)}^{N}\sum _{k = 1}^{M}} A_{r}^{k}A_{r'}^{k}R_{r'}\dfrac{1}{2}\left( \tilde{\mu }_{r'} + \sqrt{\dfrac{\tau }{g^2}} \right) { +} {\sum _{r' = 1}^{N}\sum _{k = 1}^{M}} \sqrt{\dfrac{\tau }{g^2}}A_{r}^{k}A_{r'}^{k}x_{r'} \xi _{r'}. \end{aligned}$$

(12)

In Fig. 2a,b, the typical evolution of normalised measured-amplitude $g\tilde{\mu }_{r}$ are shown. The corresponding error evolution is indicated in Fig. 2c,d. Due to the CAC feedback loop, as shown in Fig. 2a,b, if the squared-amplitude of DOPO is smaller than $\tau$, $e_r$ exponentially increases and vice-versa, and the measured-amplitude $\tilde{\mu }_{r'}$ is maintained around $\sqrt{{\tau /g^2}}$. Therefore, because $1/2(\tilde{\mu }_{r'}+\sqrt{{\tau /g^2}})$ in Eq. (12) can take around 0 or $\sqrt{{\tau /g^2}}$, the mutual interaction term and the Zeeman term scales are balanced, and crosstalk noise, i.e. off-diagonal elements, is eliminated from the Zeeman term as described in OL-CIM-CDP. Moreover, as shown in Fig. 2a,b, it is important to note that intermediate solutions are destabilised. By doing so, CAC introduced CIM is able to keep searching for an answer until the maximum run-time has been reached. By taking the support vector that is generated by CIM at the end of each trajectory, we are evaluating the solution to estimate the support for the simulations in this paper.

Comparison with simulated annealing

Here our purpose is to demonstrate that CAC feedback is effective on CIM by comparing CAC-CIM to OL-CIM and SA. We follow the Metropolis algorithm for $l_0$-regularised CS stated in¹⁰. As same as in¹⁰, 1000 samples of the observation matrix and source signal and true support vector are randomly generated according to “Section Simulations with artificial random data” under $N = 500$, $\alpha = a = 0.6$, $w_{noise} = 0$ (no observation noise). With the same observation matrices, source signals, and support vectors in all models, we statistically evaluate how well CAC-CIM estimates support in comparison to OL-CIM and SA when all $R_r$ are fixed to be the source signal $x_r$. To measure the support estimation quality, we used the direction cosine defined as ${\sum _{r=1}^N \xi _r \sigma _r}/{\sqrt{\sum _{r=1}^N \xi _r \sum _{r=1}^N \sigma _r}}$ where $\left( \xi _1,\ldots , \xi _N\right)$ is the true support vector and $\left( \sigma _1,\ldots , \sigma _N\right)$ is the estimated one. When the estimation is perfect, the direction cosine is equal to 1. We selected $\eta = 0.05$ corresponding to $l_0$-regularisation parameter $\lambda = \eta ^2/2 =0.00125$ as in¹⁰.

First, we evaluate the temporal profiles of the optimisation processes for the support estimation in CAC-CIM (Wigner), CAC-CIM (Positive-P), OL-CIM and SA. The upper three graphs (from left to right, CAC-CIM (Wigner), CAC-CIM (Positive-P) and OL-CIM respectively) in Fig. 3a show the change in the direction cosine of the three CIM models depending on the runtime on the CPU and the wall-clock time of physical CIM. The term physical CIM refers to the CIMs that are available in laboratories physically^23,24. Although we are using Wigner and Positive-P functions to approximate the behaviour of such machines for numerical simulation, physical CIMs are actual machines designed to solve combinatorial optimisation problems as physical computations. Recently a 100, 000-spin physical CIM was proposed by Honjo et al²⁴. We consider that time-step-to-solution is in $10^4$-order for physical CIM¹⁹. For CAC-CIM (Wigner), and CAC-CIM (Positive-P) models, $20\times$ photon’s lifetimes of integral interval (with 1000 time-steps) for the SDEs are about 105ms and 68ms of run-time respectively, and for OL-CIM, $5\times$ photon’s lifetime of integral interval (with 50 time-steps) for the SDE is about 11ms. The physical CIM’s wall-clock time for this optimisation is roughly estimated to be around 0.5ms, which can be estimated from the round-trip time of $N=500$ and the time-steps-to-solution for the Sherrington-Kirkpatrick problem with $N = 500$¹⁹. The direction cosine of these CIM models converged to about 1 by these run-times. The lower two graphs in Fig. 3a show the change in the direction cosine of SA depending on the runtime on CPU under constant temperature at $T=0$ and exponential cooling scheduling from $T=0.02$ to 0.00002. We adjusted the Monte-Carlo steps of SA (bottom two graphs of Fig. 3a) to accompany the wall-clock time of physical CIM (0.5ms) and the run-time of CAC-CIM (Wigner) (105ms). In our computational environment, the number of Monte Carlo steps for SA with runtimes of 0.5ms and 105ms is about 230 and 46000 steps, respectively. In SA, the direction cosine converged to about 1 by 105ms, while that did not by 0.5ms.

Next, we compare the histogram of the final states of direction cosines in CAC-CIM (Wigner), CAC-CIM (Positive-P), OL-CIM and SA. The upper three graphs in Fig. 3b indicate the histogram of the three CIM models (CAC-CIM (Wigner), CAC-CIM (Positive-P), OL-CIM, respectively), while the lower three graphs in Fig. 3b show the histograms of SA. The first two histograms, from the left, illustrate run-times of 105 ms with zero temperature and exponential cooling schedules, respectively. In the last graph from the left, run-times of 0.5ms are indicated for both zero-temperature (blue bars) and exponential cooling schedules (orange bars). Comparing these graphs, the proportion of the direction cosines of CAC-CIM (Wigner) and CAC-CIM (Positive-P) close to 1 is higher than those of OL-CIM and SA. The two-sample one-sided Kolmogorov-Smirnov test suggests that the histograms of the final direction cosines of CAC-CIM (Wigner) and CAC-CIM (Positive-P) are significantly biased toward 1 compared with all of those of OL-CIM and SA (P-value < 0.0001).

The above results thus demonstrate that CAC-CIM outperformed OL-CIM on support vector estimation and outperformed SA within the same run-time.

Comparison with ground state predicted with statistical mechanics on alternating minimisation

We compare CAC-CIM-CDP’s capability to find the ground state with that of OL-CIM-CDP. In our previous study, we derived the macroscopic parameter equation (MSE) (Eq. (26)-(28) in¹⁰) using a non-equilibrium statistical mechanics method to show the performance limit of OL-CIM-CDP. This statistical mechanics method is based on artificial random data which makes it possible to apply mean-field theory to obtain the MSEs. In the limit of the saturation parameter $g^2 \rightarrow 0$, the CAC-Wigner-type SDEs and CAC-Positive-P SDEs in the steady state are consistent with a two-state Potts spin system defined by the QUBO Hamiltonian Eq. (4). Additionally, the MSEs in this limit are also similar to those for the two-state Potts spin system from Eq. (4), and thus can predict the ground state of the Hamiltonian in the thermodynamic limit N, M $\longrightarrow$ $\infty$ with the compression rate $\alpha = M/N$ fixed¹⁰. Using a comparison of CAC-CIM-CDP and OL-CIM-CDP solutions to a solution of the MSEs in the limit of $g^2 \rightarrow 0$, we demonstrate the effectiveness of CAC feedback on the alternating minimisation for optimising the Hamiltonian.

The precondition for applying statistical mechanics is that the values of all entries in the observation model Eq. (26), which is the premise of Eq. (3) and Eq. (4), are randomly determined as described in “Section Simulations with artificial random data”. To compare solutions of the models with the ground state predicted with statistical mechanics, 10 samples of the observation matrix and source signal and true support vector are randomly generated according to “Section Simulations with artificial random data” under $N = 2000$ and various values of $a, \alpha$ and $\nu$. Here $\nu$ indicates the standard deviation of the observation noise ($w_{noise}$). Then, we execute Algorithms 1 and 2 for the alternating minimisation in CAC-CIM-CDPs (Wigner and Positive-P) and OL-CIM-CDP sharing the same samples of observation matrices, source signals and support vectors. Here for Fig. 4, $\eta _{init} = 0.6$ and $\eta _{init} = 0.8$ was used for CAC-CIM-CDP models and OL-CIM-CDP respectively. $\eta _{end}$ was set to 0.18 in Fig. 4a,b while in Fig. 4c,d $\eta _{end}$ was set to 0.35.

The marks in Fig. 4 show the averaged root-mean-square-error (RMSE) calculated as $\sqrt{1/N \sum _{r=1}^N \left( R_r\sigma _r - x_r\xi _r\right) ^2}$ of sampled solutions obtained from OL-CIM-CDP, Wigner and Positive-P of CAC-CIM-CDPs. Here $\sigma _r$ is calculated as stated in Eq. (20). The black solid lines in Fig. 4 indicate RMSE at the ground state corresponding to successful signal retrieval, which is predicted with statistical mechanics. RMSEs of Wigner and Positive-P CAC-CIM-CDPs tend to keep a better consistency with that of the ground state compared to OL-CIM-CDP for various values of $a, \alpha$ and v. Especially as shown in Fig. 4b,d, RMSE of OL-CIM-CDP tend to deviate gradually from that of the ground state as increasing a, while both Wigner and Positive-P CAC-CIM-CDPs keep up a better consistency with the theoretical prediction.

Application to sparse MRI

We evaluate the performance of CAC-CIM-CDP, OL-CIM-CDP and LASSO²⁵ on MRI data. LASSO is a popular $l_1$ method for MRI data reconstruction^26,27,28.

In the following numerical experiment, we used two different-sized sparse images ($64\times 64$ and $128\times 128$ pixels) spanned by a Haar basis function. Detailed explanations of the two images we used as the source images are given in “Section Simulations with MRI data” in Methods. In accordance with our previous work¹⁰, we sought to reconstruct the two images from the undersampled k-space data and by solving the optimisation problem defined in Eq. (27) (see “Section Simulations with MRI data”). To realise the optimisation problem in Eq. (27) on CIM, the Haar wavelet transform coefficients are estimated with the mutual interaction term and the Zeeman term constructed according to Eq. (28) and (29) in “Section Simulations with MRI data”. The compression rate of the k-space data from the $64\times 64$ and $128\times 128$ images is 0.4 and 0.3 respectively. And the sparseness of the images is 0.212 and 0.178 respectively. As the solver for CDP, we used the Conjugate Gradient Descent method (further details on CDP optimisation refer to “Section Optimisation in CDP”).

In Fig. 5a,b, for 10 simulations the average RMSE value is indicated for each threshold $\eta$ for $64\times 64$ and $128\times 128$ images respectively. As for the minimum RMSE in the $64\times 64$ case, LASSO (black line), OL-CIM-CDP (red), CAC-CIM-CDP (Wigner) (green) and CAC-CIM-CDP (Positive-P)’s (blue) can be stated as, 0.0292, 0.0216, 0.0182 and 0.0182 respectively (for the corresponding reconstructions see Fig. 6). In the $128\times 128$ case, the minimum RMSE is 0.0276, 0.0242, 0.0209 and 0.0209 respectively (for the corresponding reconstructions see Fig. 7). Comparing the RMSE values acquired it is clear that CAC-CIM-CDP models have a better average performance compared to the other approaches in both image sizes. And even after reaching the optimal reconstruction for the given parameters, CAC-CIM-CDP tends to keep up a minimal error rate compared to LASSO and OL-CIM-CDP. This indicates that the effective range of CAC-CIM-CDP is much wider than OL-CIM-CDP. In both image sizes, the Wigner and Positive-P variations of CAC-CIM-CDP produce identical RMSE results.

In Figs. 6 and 7 the minimal RMSE constructions are shown for LASSO, OL-CIM-CDP, CAC-CIM-CDP (Wigner) and CAC-CIM-CDP (Positive-P). In Fig. 7, only CAC-CIM-CDP (Positive-P)’s reconstruction is shown because it is clear that both CAC-CIM-CDP (Wigner) and CAC-CIM-CDP (Positive-P)’s performance is identical. In the $64\times 64$ image reconstruction when RMSE values are compared, CAC-CIM-CDP models have better reconstruction accuracy. The enlarged portions indicate the difference in pixel identification of each model compared to the initial resized image. Considering both simulations it is clear that even though the system size increases, proposing models have the upper hand in performing an accurate reconstruction compared to other models.

Discussion

In this paper, we have proposed an improved CIM approach to solve $l_0$-regularised compressed sensing problems. Finding a way to improve $l_0$-Regularised Compressed Sensing reconstruction accuracy was the motivation behind this research. Although Zeeman term realisation with CAC has been proposed, this is the first time it has been applied to a practical data analysis method and to large-scale combinatorial optimisation problems involving more than $N=4096$. Furthermore, CAC-CIM SDEs are more accurate models of measurement-feedback CIMs than Aonishi et al.,’s OL-CIM SDEs.

The proposed algorithm has shown that it can outperform the previously proposed algorithm accuracy-wise in all the simulations performed. With the OL-CIM algorithm, the CIM model in use was lacking the CAC feedback for chaotically exploring solutions. Therefore, CAC-CIM has been able to provide convergence to a better solution than OL-CIM. One factor to emphasise here is that CAC does not guarantee convergence to the ground state. Even the ground state is reached, due to the forceful equalisation to $\tau$ may prevent from stopping there. Even though this is the case in this paper, CAC has been shown to be effective especially when the problem instances are relatively harder in both artificial random data and MRI data.

Effect of system size on performance

The introduction of CAC has previously been shown to have better performance with small-scale frustrated Ising problem instances¹⁶. In this manuscript, we have demonstrated the applicability of CAC for real-world combinatorial optimisation problems (in this case Compressed sensing) where the problem instances with a Zeeman term are mapped to a QUBO formulation that is large-scale. The simulations with random artificial data on various system sizes are illustrated in Supplementary note 1. Even though the performance increase is present, in very large system sizes such as in $128\times 128$, it is clear that the RMSE gap between CAC-CIM-CDP and OL-CIM-CDP is smaller compared to $64\times 64$. This poses the question that whether there is a system-size threshold for CAC-CIM-CDP in the very-large-scale regime. Considering the MRI-based simulations require 4096 and 16384 DOPO pulses to operate (compared to 16 DOPOs in theoretical simulations in¹⁶), the system size of CAC’s applicability is largely improved. Yet the system-size-wise dependency is yet to be explored.

Advantages of CAC-CIM architecture

With the use of CDP, the problem which involves quadratic optimisation has been solved in this hybrid system. As shown in the schematic illustration of the CAC-CIM-CDP in Fig. 1, proposing approach performs an alternating minimisation between the CIM and CDP. It is clear considering the results stated in “Section Application to sparse MRI” that CAC-CIM-CDP has outperformed OL-CIM-CDP and the generally used approach LASSO which is an $l_1$-regularised method for solving compressed sensing problems. It is interesting to see that advancements in CIM architecture can offer better results in real-world problem instances.

CAC-CIM-CDP (Wigner) versus CAC-CIM-CDP (Positive-P)

Even though this paper introduces two variants (Wigner and Positive-P) of CAC-CIM-CDP, the performances have been almost identical between the models. However, we encountered a deviation when the problem instances become harder i.e. sparseness/compression ratio becomes higher when $w_{noise} = 0$. The results are presented in Supplementary note 2. As the models approach a threshold point for optimal reconstruction (a critical sparseness/compression ratio), beyond that the producing RMSE values are somewhat different between the models. Performance-wise it is hard to state that one model is better than the other. Because the significance of Wigner and Positive-P lies in the density matrix approximation and how it behaves with a large quantum noise presence. We discuss this in Supplementary note 2.

CIM and simulated bifurcation

Aonishi et al., proposed a quantum-classical hybrid system composed of a general quantum machine and CDP (Fig. 1 in¹⁰). Using the quantum machine to optimise $\sigma$ and the CDP for optimising R, this system solves the two-fold optimisation problem by alternately performing two minimisation processes. There are several quantum machines which can be used for optimising $\sigma$, including quantum annealers (QA), quantum approximate optimisation algorithms (QAOA), simulated bifurcation (SB), and CIM. It is likely that CIM and SB will be the most suitable machines for this task because they can connect densely connected networks necessary to optimise $\sigma$, have similar performance (e.g. time to solution^19,21), and can be fast simulated with hardware such as FPGAs. It would be interesting to see how SB and CAC-CIM-CDP’s performance differs when implemented on the same hardware in large-scale simulations such as the ones reported in this paper.

Future improvements to the CAC-CIM-CDP

Simultaneous minimisation

One of the major bottlenecks the proposed model (CAC-CIM-CDP) has is the alternating minimisation process between the CIM and CDP. This is a time-consuming operation. As a future direction to this model, we plan to improvise the CIM system to accommodate quadratic optimisation problems and perform simultaneous minimisation using only the CIM to solve compressed sensing problems. We believe that the use of “CIM-only” will have a positive effect on accuracy as well.

CAC-CIM-CDP with large quantum noise

While this manuscript solely focuses on combining CAC with CIM for solving CS problems more accurately, the considered quantum noise present in the CIM is very low ($g^2 = 10^{-7}$). This opens up a problem of whether CAC-CIM-CDP can keep up the performance with a large quantum noise presence. For small-scale frustrated Ising Hamiltonians, this has been previously explored in¹⁶ ($N = 16$) where it has shown a decrease in success probability for larger $g^2$ terms. This result is consistent with CAC-CIM-CDP as well as shown in Supplementary note Fig. S4 for MRI simulations. Recent advances in CIM research have led to the introduction of a method known as Negative Parametric Gain (NPG), which accommodates higher quantum noise and at the same time as maintaining a higher probability of success²⁹. This method considers a negative starting pump rate with large injection field feedback. NPG has shown promising results in the theoretical simulations²⁹. We are planning to improve the endurance of the CAC-CIM-CDP with NPG for a larger quantum noise presence.

CAC-CIM-CDP with the mean-field CIM model

As it is obvious from the perspective of numerical simulations, CAC-CIM-CDP SDEs are computationally costly to simulate. Even though the shown results are acquired using a GPU implementation of the SDEs, as a digital simulator, field-programmable gate arrays (FPGAs) are more suitable (less energy cost, faster processing etc). As a future direction, we plan on implementing the mean-field CIM SDEs^17,19 with CAC on an FPGA to perform compressed sensing simulations. Due to the fact that CAC-CIM-CDP has relatively low noise present in the system, we believe that the mean-field SDEs will have approximately the same or better results but with faster simulation times. This is mainly due to the simplicity and the negligence of the noise terms in the mean-field CIM SDEs.

Methods

Stochastic differential equation in OL-CIM-CDP and CAC-CIM-CDP

Wigner-type

The CIM model based on the Wigner formulation was introduced in^30,31. The c-number Heisenberg Langevin equation³⁰ was used to overcome the higher computational cost of simulating the direct density matrix formulation of CIM and it has been found to be equivalent to the truncated Wigner SDEs. The density operator master equation expanded by the Wigner function results in the Kramers-Moyal series including third-order terms. In order to derive the Langevin equation, we neglect third-order terms¹⁶. Then, we can formulate the following Wigner SDEs used for OL-CIM-CDP.

$$\begin{aligned}{} & {} \begin{aligned} \frac{d}{dt}c_r =&\left[ -1 + p - {\left( c_r^2 + s_r^2\right) } \right] c_r + \widetilde{K}\left( \dfrac{dc_{r}}{dt}\right) _{inj,r}+\\&{g}\sqrt{\left( c_r^2 + s_r^2\right) + \frac{1}{2}} W_{r,1}, \end{aligned} \end{aligned}$$

(13)

$$\frac{d}{dt}s_r = \left[ -1 - p - {\left( c_r^2 + s_r^2\right) }\right] s_r + {g}\sqrt{\left( c_r^2 + s_r^2\right) + \frac{1}{2}} W_{r,2}.$$

(14)

Here, in-phase and quadrature-phase normalised amplitudes are represented as $c_r$ and $s_r$ respectively. p is the normalised pump rate. If p is above the oscillation threshold $(p > 1)$, each of the OPO pulses is either in the 0-phase state or $\pi$-phase state. The last terms of the upper and lower equations express the vacuum fluctuations injected from external reservoirs and the pump fluctuations coupled to the OPO system via gain saturation¹⁰. $W_{r,1}$ and $W_{r,2}$ are independent real Gaussian noise processes satisfying $\langle W_{r,k} (t)\rangle =0$ and $\langle W_{r,k}(t) W_{r',l} (t')\rangle = \delta _{rr'} \delta _{kl} \delta (t-t')$. g indicates the saturation parameter. $(dc_r/dt)_{inj,r}$ is the optical injection field, which only considers the in-phase amplitudes for the calculations. The injection field is defined in Eq. (5) and Eq. (6). $\tilde{K}$ indicates the normalised feedback strength.

Focusing on the behaviour of the OPO pulses only in the in-phase direction, the Wigner-type SDE, which is used for CAC-CIM-CDP, can be stated as,

$$\frac{d}{dt}\mu _{r} = - \left( 1 -p + j\right) \mu _{r} - g^2\mu _{r}^3 + \sqrt{j}\left( V_{r} - \frac{1}{2}\right) W_{R,r} + {{K}}\left( \frac{d\mu _{r}}{dt}\right) _{inj,r},$$

(15)

$$\frac{d}{dt}V_{r} = -2 \left( 1 -p + j\right) V_{r} - 6g^2\mu _{r}^2V_{r} + 1 + j + 2g^2\mu _{r}^2 - 2j\left( V_{r} -\frac{1}{2}\right) ^2.$$

(16)

Here $\mu _r$ and $V_r$ are the mean-amplitudes and the variance of the r-th DOPO pulse. $(d\mu _r/dt)_{inj,r}$ is the optical injection field defined in Eqs. (8)–(11). $W_{R,r}$ is independent real Gaussian noise processes satisfying $\langle W_{R,r} (t)\rangle =0$ and $\langle W_{R,r} (t)W_{R,r'} (t')\rangle =\delta _{rr'}\delta (t-t')$. g, p, j and K indicate the saturation parameter, pump rate, the normalised out-coupling rate for optical homodyne measurement and the feedback strength, respectively.

Positive-P-type

Positive-P (P-P) representation³² is a generalised form of Glauber-Sudarshan P representation. When the density operator master equations are expanded using the P-P distribution function, the resulting Kramers-Moyal series only consists of first and second-order terms. Due to this factor, there is no truncation needed to derive the Langevin equation. Because of this one can argue that P-P SDEs might be a better candidate for density operator approximations. The effectiveness of P-P SDEs has been demonstrated on CIMs with higher quantum noise presence¹⁶. We can formulate the P-P-type SDEs we used for CAC-CIM-CDP.

$$\begin{aligned}{} & {} \begin{aligned} \dfrac{d}{dt}\mu _{r} =&- \left( 1 -p + j\right) \mu _{r} - {g^2\mu _{r}\left( \mu _{r}^{2} + 2n_r + m_r\right) } + \sqrt{j}\left( m_r + n_r\right) W_{R,r}\\&+ {{K}}\left( \frac{d\mu _{r}}{dt}\right) _{inj,r}, \end{aligned} \end{aligned}$$

(17)

$${\frac{d}{dt}n_{r} = -2 \left( 1 + j\right) n_r + 2pm_r - 2g^{2}\mu _{r}^2\left( 2n_r + m_r\right) } {- j\left( m_r + n_r\right) ^{2}},$$

(18)

$$\begin{aligned}{} & {} \begin{aligned} \dfrac{d}{dt}m_{r} =&-2 \left( 1 + j\right) m_r + 2p n_r - 2g^{2}\mu _{r}^{2}\left( 2m_r + n_r\right) + p\\&- g^{2}\left( \mu _{r}^{2} + m_r\right) - j\left( m_r + n_r\right) ^{2}. \end{aligned} \end{aligned}$$

(19)

Here $\mu _r$ corresponds to the mean-amplitude, $m_r$ and $n_r$ represent variances of quantum fluctuations of the r-th DOPO pulse. $(d\mu _r/dt)_{inj,r}$ is the optical injection field defined in Eqs. (8)–(11). $W_{R,r}$ is independent real Gaussian noise processes satisfying $\langle W_{R,r} (t)\rangle =0$ and $\langle W_{R,r} (t)W_{R,r'} (t')\rangle =\delta _{rr'}\delta (t-t')$. g,p, j and K are the same as those for the Wigner model.

Optimisation in CDP

The CDP performs the optimisation of the Hamiltonian (Eq. 4) with respect to $R_r$ for a support vector $\sigma$ given by CIM. $\sigma$ is obtained by binarising the measured-amplitude ($\tilde{\mu }_r$) defined in Eq. (10) (CAC-CIM-CDP) or in-phase amplitude $c_r$ (OL-CIM-CDP) with the Heaviside function stated as,

$$\begin{aligned} \sigma _{r} = Heaviside\left( x_r\right) = {\left\{ \begin{array}{ll} 1,&{} \ \left( x_r > 0\right) \\ 0,&{} \ \left( x_r \le 0\right) . \end{array}\right. } \end{aligned}$$

(20)

The CDP solve the following system of equations, which is satisfied the stationary point that minimises $\mathbb {H}$ with respect to r.

$$\begin{aligned}{} & {} R_{r}\sum _{k = 1}^{M} \left( A_{r}^{k}\right) ^2 = \sigma _{r}\mathbb {H}_{r}, \end{aligned}$$

(21)

$$\begin{aligned}{} & {} \mathbb {H}_{r} = -\sum _{r' = 1 (\ne r)}^{N}\sum _{k = 1}^{M} A_{r}^{k}A_{r'}^{k}R_{r'}\sigma _{r'} + \sum _{k =1}^{M} A_{r}^{k}y^{k}. \end{aligned}$$

(22)

Here, $\mathbb {H}_r$ in Eq. (22) is the local field of the CDP, which is the same as Eq. (4) and (11). For the simulations, we used the Jacobi method or Conjugate Gradient Descent (CGD) method as the CDP optimiser. During the optimisation in the CDP, all $\sigma _r$ are fixed.

Schedule of pump rate, threshold and target amplitude for optimisation in CIM

A rough parameter search was used to determine the schedules for each of the following parameters in the experiments. The pump rate p for both Wigner and P-P type CAC-CIM-CDPs was scheduled depending on the time t as follows.

$$\begin{aligned} p = (p_{thr} - d) + \frac{2d}{1+e^{-\left( \dfrac{t-4}{2}\right) }} . \end{aligned}$$

(23)

Here, $p_{thr} = 1$ for all simulations of both Wigner and P-P type CAC-CIM-CDPs. For artificial random data and MRI data simulations, d was set at 0.6 and 0.4 respectively.

In accordance with¹⁰, the pump rate p for OL-CIM-CDP was scheduled depending on the time t as follows.

$$\begin{aligned} p = 1.5 \times \left( \dfrac{t}{5}\right) ^2 . \end{aligned}$$

(24)

The pump rate becomes equal to 1.5 when $t=5$. We used this pump rate schedule for all simulations of OL-CIM-CDP. In both CAC-CIM-CDP and OL-CIM-CDP, the threshold $\eta$ was scheduled depending on the alternating iteration time i as follows.

$$\begin{aligned} \eta _{i} = \max \left[ \eta _{init} \left( 1 - \dfrac{i}{velo}\right) , \eta _{end}\right] . \end{aligned}$$

(25)

Here $velo = 51$ for all simulations of both CAC-CIM-CDP and OL-CIM-CDP in artificial random data. For the MRI data, $velo = 31$ and $velo = 11$ were used in OL-CIM-CDP and CAC-CIM-CDP respectively. For synthesised random data (Figs. 3, 4, and Supplementary note Fig. S1), the threshold $\eta$ was linearly lowered from $\eta _{init}$ to $\eta _{end}$ as the alternating minimisation proceeds. $\eta _{init}$ and $\eta _{end}$ are adjusted to maximise the performance of those models. On the other hand, for MRI data (Figs. 5, 6, 7, Supplementary note Fig. S4 and Supplementary note Fig. S5), the threshold $\eta$ was constant by setting as $\eta _{init} = \eta _{end}$. The values of $\eta _{init}$ and $\eta _{end}$ used for each simulation are shown in the figure captions.

In both Wigner and P-P type CAC-CIM-CDPs, the target amplitude of CAC, $\tau$, was constant with respect to the time t. For the simulations in Fig. 4a and Fig. 4b, $\tau = 0.21$ was used while in Fig. 4c and Fig. 4d $\tau$ was set to 0.15. For other simulations, $\tau$ was 1. For all numerical simulations of OL-CIM-CDP and CL-CIM-CDP, SDEs were integrated using the Euler–Maruyama method.

Observation model for compressed sensing

The observation model that is the premise of Eq. (3) and Eq. (4) is defined as follows.

$$\begin{aligned} \begin{bmatrix} y^{1} \\ y^{2} \\ \vdots \\ y^{M} \end{bmatrix} = \begin{bmatrix} A_{1}^{1} &{} A_{2}^{1} &{} \cdots &{} A_{N}^{1} \\ A_{1}^{2} &{} A_{2}^{2} &{} \cdots &{} A_{N}^{2} \\ \vdots &{} \vdots &{} \ddots &{} \vdots \\ A_{1}^{M} &{} A_{2}^{M} &{} \cdots &{} A_{N}^{M} \end{bmatrix} \begin{bmatrix} \xi _{1}x_{1} \\ \xi _{2}x_{2} \\ \vdots \\ \xi _{N}x_{N} \end{bmatrix} + \begin{bmatrix} w_{noise}^{1} \\ w_{noise}^{2} \\ \vdots \\ w_{noise}^{M} \end{bmatrix} . \end{aligned}$$

(26)

Here, $A \in \mathbb {R}^{N\times M}$ is the observation matrix, $y\in \mathbb {R}^M$ implies the observation signal, $x\in \mathbb {R}^N$ and $\xi \in (0,1)^N$ are the true source signal and true support, respectively. $w_{noise}\in \mathbb {R}^M$ indicates the observation noise satisfying $\langle w_{noise}^{k}\rangle =0$ and $\langle w_{noise}^{k} w_{noise}^{k'}\rangle =\nu ^2 \delta _{kk'}$. $\nu ^2$ is the variance of the observation noise.

Simulations with artificial random data

To verify the performance of the proposed models statistically and moreover compare those results with ground states predicted with statistical mechanics¹⁰, we used many samples of artificial random data $y\in \mathbb {R}^M$ synthesised from the observation model Eq. (26) in which the values of all entries were randomly determined as follows. Each entry of the observation matrix $A \in \mathbb {R}^{M\times N}$ is randomly generated from an independent and identical normal distribution with the variance of 1/M, which satisfies $\langle A_r^k\rangle =0$ and ${\langle A_r^k A_{r'}^{k'}\rangle = 1/M \delta _{rr'} \delta _{kk'}}$.

Each entry of the true source signal $x\in \mathbb {R}^N$ is randomly generated from an independent and identical normal distribution with the variance of 1, which satisfies $\langle x_r\rangle = 0$ and $\langle x_r x_{r'}\rangle = \delta _{rr'}$. $a\times N$ elements of $\xi \in (0,1)^N$ are randomly selected and assigned 1 while others are assigned 0. a is the sparseness defined in the Introduction.

We set the time increment for CIM $\Delta t$ to 0.02 up to 20$\times$ the photon’s lifetime with $g^2 = 10^{-7}$ for CL-CIM-CDP. For OL-CIM-CDP, CIM time increment $\Delta t$ was set to 0.1 with 5$\times$ the photon’s lifetime with $g^2 = 10^{-7}$. As for the CDP, we used the Jacobi method where time increment $\Delta t_c$ was set to 0.1 with 100 iterations.

Simulations with MRI data

To evaluate the performance of the proposed models on realistic data, we used MRI data provided from the fastMRI datasets³³. The initial brain MRI used here was a $320\times 320$ image. To reduce the problem size, we resized the image to $64\times 64$ and $128\times 128$ images with the BILINEAR interpolation method. We applied the Haar-wavelet transform (HWT) to the two different-sized images and in Fig. 6 and Fig. 7 we set 78.8% and 82.2% of the HWT coefficients to zero to create two different-sized sparse images ($64\times 64$ and $128\times 128$ pixels) spanned by Haar basis functions with a sparseness of 0.212 and 0.178, respectively. Then, we applied the discrete Fourier transform (DFT) to the two different-sized sparse images to obtain $64\times 64$ and $128\times 128$ k-space data, respectively. Finally, we undersampled 1638 and 4915 points from the $64\times 64$ and $128\times 128$ k-space data at random red points (Fig. 6b and Fig. 7b) to create two observation signals with a compression rate of 0.4 and 0.3 respectively.

In accordance with our previous work, we sought to reconstruct the source signals from the undersampled k-space data by solving the following optimisation problem with CAC-CIM-CDP and OL-CIM-CDP.

$$\begin{aligned} x = {\text {argmin}}(\Vert y - SFx\Vert _{2}^{2} + \dfrac{1}{2}\gamma \Vert \Delta _{v}x\Vert _{2}^{2} + \dfrac{1}{2}\gamma \Vert \Delta _{h}x\Vert _{2}^{2} + \lambda \Vert \Psi x\Vert _{0}) . \end{aligned}$$

(27)

Here, x is a source signal, and y is the observation signal constructed through the above steps. F indicates the DFT matrix and $\Psi$ is the HWT matrix. F and $\Psi$ are orthogonal matrices and their transpose matrices correspond to inverse DFT and inverse HWT, respectively. S is an undersampling matrix executing undersampling at random red points shown in Fig. 6b and Fig. 7b. $\Delta _{v}$ and $\Delta _{h}$ are the matrices discretely representing the vertical and horizontal second-order derivative operators, respectively. $\gamma$ and $\lambda$ are the $l_2$ and $l_0$ regularisation parameters.

To implement the optimisation problem in Eq. (27) on CIM, we estimate the HWT coefficients instead of the pixel values of the image. Applying the HWT $r = \Psi x$ to Eq. (27), the mutual interaction matrix J and the Zeeman term vector $h^z$ for CIM are given as

$$\begin{aligned}{} & {} h^{z} = SF\Psi ^{T}y, \end{aligned}$$

(28)

$$\begin{aligned}{} & {} \tilde{J} = \Psi F^{T}S^{T}SF\Psi ^{T} + \gamma \Psi \Delta _{v}^{T}\Delta _{v}\Psi ^{T} + \gamma \Psi \Delta _{h}^{T}\Delta _{h}\Psi ^{T}. \end{aligned}$$

(29)

Here, the observation matrix is given as $A = SF\Psi ^T$. The second and third terms in $\tilde{J}$ are from the $l_2$ regularisation terms. After the alternating minimisation, the output of the CDP, r, is transformed to the image, x, with the inverse HWT $x=\Psi ^T r$. $\gamma$ is set to 0.0001. $\tilde{K}$ for OL-CIM-CDP was set to 0.25 while K for CAC-CIM-CDP was 0.01. In MRI simulations CIM time increment $\Delta t$ was set to 0.02 up to 20$\times$ the photon’s lifetime with $g^2 = 10^{-7}$ for CL-CIM-CDP. For OL-CIM-CDP, CIM time increment $\Delta t$ was set to 0.1 with 5$\times$ the photon’s lifetime with $g^2 = 10^{-7}$. For the CDP, we used the Conjugate Gradient Descent method, with 10,000 max iterations. Here we use LASSO’s solution as the initial condition for the CIM simulation.

Data availability

The data generated and/or analysed during this study are not publicly available for legal/ethical reasons. But M.D.S.H.G. can provide the raw data if formally requested.

The magnetic resonance imaging (MRI) images that we have used in our numerical experiments are from the dataset of³³

References

Bobin, J., Starck, J.-L. & Ottensamer, R. Compressed sensing in astronomy. IEEE J. Sel. Top. Signal Process. 2(5), 718–726. https://doi.org/10.1109/JSTSP.2008.2005337 (2008).
Article ADS Google Scholar
Zhang, Y., Jiang, J. & Zhang, G. Compression of remotely sensed astronomical image using wavelet-based compressed sensing in deep space exploration. Remote Sens. 13, 2. https://doi.org/10.3390/rs13020288 (2021).
Article Google Scholar
Zhou, W.-P., Li, Y., Liu, Q.-S., Wang, G.-D. & Liu, Y. Fast compression and reconstruction of astronomical images based on compressed sensing. Res. Astron. Astrophys. 14(9), 1207 (2014).
Article ADS Google Scholar
Herman, M. A. & Strohmer, T. High-resolution radar via compressed sensing. IEEE Trans. Signal Process. 57(6), 2275–2284. https://doi.org/10.1109/TSP.2009.2014277 (2009).
Article ADS MathSciNet MATH Google Scholar
Mamaghanian, H., Khaled, N., Atienza, D. & Vandergheynst, P. Compressed sensing for real-time energy-efficient ECG compression on wireless body sensor nodes. IEEE Trans. Biomed. Eng. 58(9), 2456–2466. https://doi.org/10.1109/TBME.2011.2156795 (2011).
Article PubMed Google Scholar
Obuchi, T., Nakanishi-Ohno, Y., Okada, M. & Kabashima, Y. Statistical mechanical analysis of sparse linear regression as a variable selection problem. J. Stat. Mech. Theory Exp. 2018(10), 103401. https://doi.org/10.1088/1742-5468/aae02c (2018).
Article MathSciNet Google Scholar
Kabashima, Y., Wadayama, T. & Tanaka, T. A typical reconstruction limit for compressed sensing based on $l_p$-norm minimization. J. Stat. Mech. Theory Exp. 2009(09), L09003. https://doi.org/10.1088/1742-5468/2009/09/l09003 (2009).
Article Google Scholar
Louizos, C., Welling, M. & Kingma, D. P. Learning sparse neural networks through $l_0$ regularization (2017). arXiv:1712.01312.
Nakanishi-Ohno, Y., Obuchi, T., Okada, M. & Kabashima, Y. Sparse approximation based on a random overcomplete basis. J. Stat. Mech. Theory Exp. 2016(6), 063302. https://doi.org/10.1088/1742-5468/2016/06/063302 (2016).
Article MathSciNet MATH Google Scholar
Aonishi, T., Mimura, K., Okada, M. & Yamamoto, Y. L0 regularization-based compressed sensing with quantum-classical hybrid approach (2021). arXiv:2102.11412.
Mohseni, N., McMahon, P. L. & Byrnes, T. Ising machines as hardware solvers of combinatorial optimization problems. Nat. Rev. Phys. 4(6), 363–379 (2022).
Article Google Scholar
Matsumoto, N., Hamakawa, Y., Tatsumura, K. & Kudo, K. Distance-based clustering using qubo formulations. Sci. Rep. 12(1), 1–10 (2022).
Article Google Scholar
Tanahashi, K., Takayanagi, S., Motohashi, T. & Tanaka, S. Application of Ising machines and a software development for Ising machines. J. Phys. Soc. Jpn. 88(6), 061010. https://doi.org/10.7566/JPSJ.88.061010 (2019).
Article ADS Google Scholar
Aonishi, T., Mimura, K., Okada, M. & Yamamoto, Y. Statistical mechanics of CDMA multiuser detector implemented in coherent Ising machine. J. Appl. Phys. 124(23), 233102. https://doi.org/10.1063/1.5041998 (2018).
Article ADS CAS Google Scholar
Kako, S. et al. Coherent Ising machines with error correction feedback. Adv. Quantum Technol. 3(11), 2000045. https://doi.org/10.1002/qute.202000045 (2020).
Article Google Scholar
Inui, Y., Gunathilaka, M. D. S. H., Kako, S., Aonishi, T. & Yamamoto, Y. Control of amplitude homogeneity in coherent Ising machines with artificial Zeeman terms. Commun. Phys. 5(1), 154. https://doi.org/10.1038/s42005-022-00927-x (2022).
Article Google Scholar
Leleu, T., Yamamoto, Y., McMahon, P. L. & Aihara, K. Destabilization of local minima in analog spin systems by correction of amplitude heterogeneity. Phys. Rev. Lett. 122, 040607. https://doi.org/10.1103/PhysRevLett.122.040607 (2019).
Article ADS CAS PubMed Google Scholar
Leleu, T., Yamamoto, Y., Utsunomiya, S. & Aihara, K. Combinatorial optimization using dynamical phase transitions in driven-dissipative systems. Phys. Rev. E 95, 022118. https://doi.org/10.1103/PhysRevE.95.022118 (2017).
Article ADS MathSciNet PubMed Google Scholar
Reifenstein, S., Kako, S., Khoyratee, F., Leleu, T. & Yamamoto, Y. Coherent Ising machines with optical error correction circuits. Adv. Quantum Technol. 4(11), 2100077. https://doi.org/10.1002/qute.202100077 (2021).
Article CAS Google Scholar
Hamerly, R. et al. Experimental investigation of performance differences between coherent Ising machines and a quantum annealer. Sci. Adv. 5(5), eaau0823. https://doi.org/10.1126/sciadv.aau0823 (2019).
Article ADS PubMed PubMed Central Google Scholar
Leleu, T. et al. Scaling advantage of chaotic amplitude control for high-performance combinatorial optimization. Commun. Phys. 4(1), 266 (2021).
Article Google Scholar
Goto, H. et al. High-performance combinatorial optimization based on classical mechanics. Sci. Adv. 7(6), eabe7953. https://doi.org/10.1126/sciadv.abe7953 (2021).
Article ADS PubMed Google Scholar
Takesue, H., Inagaki, T., Inaba, K., Ikuta, T. & Honjo, T. Large-scale coherent Ising machine. J. Phys. Soc. Jpn. 88(6), 061014. https://doi.org/10.7566/JPSJ.88.061014 (2019).
Article ADS Google Scholar
Honjo, T. et al. 100,000-spin coherent Ising machine. Sci. Adv. 7(40), eabh0952 (2021).
Article ADS PubMed PubMed Central Google Scholar
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996).
MathSciNet MATH Google Scholar
Dimopoulos, V., Desmet, W. & Deckers, E. Sparse damage detection with complex group lasso and adaptive complex group lasso. Sensors 22(8), 2978 (2022).
Article ADS PubMed PubMed Central Google Scholar
Na, S. et al. Compressed sensing radar detectors based on weighted lasso. arXiv preprintarXiv:2306.17372 (2023).
Berk, A., Brugiapaglia, S. & Hoheisel, T. Lasso reloaded: a variational analysis perspective with applications to compressed sensing. arXiv preprint arXiv:2205.06872 (2022).
Ng, E. et al. Efficient sampling of ground and low-energy Ising spin configurations with a coherent Ising machine. Phys. Rev. Res. 4, 013009. https://doi.org/10.1103/PhysRevResearch.4.013009 (2022).
Article CAS Google Scholar
Wang, Z., Marandi, A., Wen, K., Byer, R. L. & Yamamoto, Y. Coherent Ising machine based on degenerate optical parametric oscillators. Phys. Rev. A 88, 063853. https://doi.org/10.1103/PhysRevA.88.063853 (2013).
Article ADS CAS Google Scholar
Inui, Y. & Yamamoto, Y. Noise correlation and success probability in coherent Ising machines (2020). arXiv:2009.10328.
Drummond, P. D. & Gardiner, C. W. Generalised p-representations in quantum optics. J. Phys. A Math. Gen. 13(7), 2353. https://doi.org/10.1088/0305-4470/13/7/018 (1980).
Article ADS MathSciNet Google Scholar
Zbontar, J. et al. fastMRI: An open dataset and benchmarks for accelerated MRI (2018). arXiv:1811.08839.

Download references

Funding

This work is supported by the Japan Science and Technology Agency through its ImPACT program, NTT Research Inc. And Authors acknowledges the support of the NSF CIM Expedition award (CCF-1918549).

Author information

Authors and Affiliations

School of Computing, Tokyo Institute of Technology, Yokohama, Kanagawa, Japan
Mastiyage Don Sudeera Hasaranga Gunathilaka, Kazushi Mimura & Toru Aonishi
Physics and Informatics Laboratories, NTT Research Inc., 940 Stewart Dr, Sunnyvale, CA, 94085, USA
Satoshi Kako, Yoshitaka Inui & Yoshihisa Yamamoto
E. L. Ginzton Laboratory, Stanford University, Stanford, CA, 94305, USA
Yoshihisa Yamamoto
Graduate School of Information Sciences, Hiroshima City University, Hiroshima, Japan
Kazushi Mimura
Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan
Masato Okada & Toru Aonishi

Authors

Mastiyage Don Sudeera Hasaranga Gunathilaka
View author publications
You can also search for this author in PubMed Google Scholar
Satoshi Kako
View author publications
You can also search for this author in PubMed Google Scholar
Yoshitaka Inui
View author publications
You can also search for this author in PubMed Google Scholar
Kazushi Mimura
View author publications
You can also search for this author in PubMed Google Scholar
Masato Okada
View author publications
You can also search for this author in PubMed Google Scholar
Yoshihisa Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Toru Aonishi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.D.S.H.G., and T.A., modelled the system, performed the numerical simulations for the proposed models and wrote the manuscript. M.D.S.H.G., T.A., and S.K., worked on the evaluation of the models. K.M., and M.O., provided feedback on numerical simulations. S.K., Y.I., and Y.Y., helped with the physics of CIM and provided feedback.

Corresponding author

Correspondence to Mastiyage Don Sudeera Hasaranga Gunathilaka.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gunathilaka, M.D.S.H., Kako, S., Inui, Y. et al. Effective implementation of $\text{L}{0}$-regularised compressed sensing with chaotic-amplitude-controlled coherent Ising machines. Sci Rep 13, 16140 (2023). https://doi.org/10.1038/s41598-023-43364-8

Download citation

Received: 14 March 2023
Accepted: 22 September 2023
Published: 26 September 2023
DOI: https://doi.org/10.1038/s41598-023-43364-8

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.