Introduction

Many combinatorial optimization problems can be described by the Ising model or quadratic unconstrained binary optimization (QUBO) formulations1. Ising machines, which are special-purpose computers for solving combinatorial optimization problems, have attracted significant interest in recent years. Inspired by the first quantum annealer2, several devices have been developed, such as digital processors based on simulated annealing (SA)3,4,5,6,7,8,9, those based on simulated bifurcation (SB)10,11,12, coherent Ising machines implemented with pulsed lasers13,14,15,16,17,18, and other types of optical Ising machines19,20,21. Most Ising machines accept an objective function in the form of a Hamiltonian formulated as an Ising model or a QUBO. Ising machines return binary solutions that minimize the objective function, although they do not always return optimal solutions because of their heuristic nature. Recent research on the application of Ising machines has shifted from simple combinatorial optimization to hybrid methods using both an Ising machine and a general-purpose computer22,23,24,25,26,27. Most hybrid methods are iterative methods that offload the sampling or combinatorial optimization step to an Ising machine.

Clustering is a technique for grouping data such that the members in each cluster have similar characteristics. Although there are various types of clustering, this work focuses on non-hierarchical and distance-based clustering. Because clustering can be formulated as a combinatorial optimization problem, several algorithms using Ising machines have been developed28,29,30,31,32,33,34,35,36. Clustering methods using a quantum computer and a quantum annealer have also been proposed and investigated37,38. A typical simple method using an Ising machine minimizes the distances between data points in the same cluster, resulting in cluster sizes that are approximately equal34,35. Suppose, for example, that a small group lies far away from other, larger groups. Using this method, part of a large group is then merged into the small group so that the number of members in each group becomes almost equal. This implies that the method is not suitable for clustering unevenly distributed data.

In this work, we propose a clustering method using an Ising machine that applies to unevenly distributed data. We compare the clustering of two data sets, one uniformly distributed and the other unevenly distributed, using the simple method and the proposed method. Employing the average silhouette coefficient, we evaluate clustering performance. The proposed method provides better clustering results than the simple method, especially for unevenly distributed data. The proposed method is a hybrid algorithm that solves discrete optimization problems iteratively. The discrete optimization described in a QUBO formulation is performed by an Ising machine, while a general-purpose (conventional) computer calculates parameters for the QUBO formulation in each iteration.

The proposed method could be implemented in one of three possible ways: (i) discrete optimization executed on a remote Ising machine provided as a cloud service, (ii) discrete optimization undertaken using a local Ising machine, and (iii) all computations undertaken on a local general-purpose computer. The first approach incurs a high communication cost with the Ising machine, and so is unsuitable for hybrid algorithms. The second implementation option offloads the computation of discrete optimization steps to the accelerator attached to a local server. This approach has the benefit of a much lower communication cost with the Ising machine than the first approach. In this work, we compare the second and third implementations. The Ising machine used is an SB-based machine implemented with a field-programmable gate array (FPGA)12. The low latency of the SB-based machine is advantageous for executing the proposed method. Because the method requires the iterative computation of discrete optimization, a low communication cost between an Ising machine and a general-purpose computer is essential for high performance.

This work demonstrates that the proposed hybrid method using an Ising machine provides high-quality results in clustering problems. The method’s effectiveness suggests a new use case that takes advantage of a low-latency Ising machine. Moreover, this work strongly motivates the development of iterative hybrid algorithms. Although hybrid algorithms using an Ising machine and a general-purpose computer take extra time for communication, they have many practical applications. The reduction in communication cost enables iterative hybrid algorithms to solve practical optimization problems efficiently. The proposed method will also be applicable as a quantum-classical hybrid algorithm once the communication cost is reduced significantly.

Related work

We briefly review some recent work on clustering methods using Ising machines and quantum computers. Simple distance-based clustering is a typical application of an Ising machine or a quantum computer. Simple clustering methods based on the distance between data points34,35,37 and graph partitioning33 using a quantum annealer still attract some interest. Several methods inspired by classical clustering have been proposed and developed recently. For example, quantum-assisted clustering based on similarities28, K-means29,30,31 and K-Medoids32 clustering on a quantum annealer, as well as K-means clustering on a gate-model quantum computer38, have been proposed. K-means-like clustering on a digital Ising machine has also been investigated36.

Models and algorithms

We examine two clustering methods. One is the typical simple method in which the Hamiltonian is given as

$$\begin{aligned} H = \sum _{g=1}^G \sum _{i<j} d_{ij} x_{i,g}x_{j,g} + \alpha \sum _{i=1}^N \left( \sum _{g=1}^G x_{i,g} - 1 \right) ^2, \end{aligned}$$
(1)

where \(d_{ij}\) is the distance between points i and j, N is the number of points, and G is the number of groups. The sum \(\sum _{i<j}\) is taken over all combinations that satisfy \(1\le i<j \le N\). Here, we refer to this method as the simple-cost method. When point i belongs to group g, \(x_{i,g}=1\); otherwise, \(x_{i,g}=0\). The first term of the Hamiltonian sums up all the distances between point pairs in each group. The second term of the Hamiltonian requires each point to belong to exactly one group, and \(\alpha\) is a positive constant. This term gives a penalty when a point belongs to more than one group or does not belong to any group. Because the first term grows with the number of point pairs in each group, it dominates the Hamiltonian, and each group tends to have almost the same number of points.
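
As a minimal illustration (ours, not part of the original formulation), the following sketch builds the QUBO matrix for Eq. (1), assuming a NumPy distance matrix and the variable mapping \(x_{i,g}\rightarrow\) index \(iG+g\); the constant offset \(\alpha N\) from the penalty term is dropped.

```python
import numpy as np

def simple_cost_qubo(dist, G, alpha):
    """Upper-triangular QUBO matrix for Eq. (1).

    dist  : (N, N) symmetric matrix of pairwise distances d_ij
    G     : number of groups
    alpha : penalty weight for the one-group-per-point constraint
    """
    N = dist.shape[0]
    Q = np.zeros((N * G, N * G))
    # First term: distances between point pairs assigned to the same group.
    for g in range(G):
        for i in range(N):
            for j in range(i + 1, N):
                Q[i * G + g, j * G + g] += dist[i, j]
    # Second term: alpha * (sum_g x_{i,g} - 1)^2, expanded using x^2 = x.
    for i in range(N):
        for g in range(G):
            Q[i * G + g, i * G + g] += -alpha          # linear part
            for h in range(g + 1, G):
                Q[i * G + g, i * G + h] += 2 * alpha   # quadratic part
    return Q
```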

To resolve the issue, we propose another Hamiltonian described by

$$\begin{aligned} H=\sum _{g=1}^G\frac{\sum _{i<j} d_{ij}x_{i,g}x_{j,g}}{N_g(N_g-1)} + \alpha \sum _{i=1}^N \left( \sum _{g=1}^G x_{i,g} - 1 \right) ^2, \end{aligned}$$
(2)

where \(N_g=\sum _{i=1}^N x_{i,g}\) is the number of points in group g. The number of nonzero terms in \(\sum _{i<j} d_{ij}x_{i,g}x_{j,g}\) equals the number of point pairs in group g, i.e., \(N_g(N_g-1)/2\). Therefore, \(\sum _{i<j} d_{ij}x_{i,g}x_{j,g}/[N_g(N_g-1)]\) is the average distance between the point pairs in group g, where a factor of 2 is omitted for simplicity. In other words, the first term on the right-hand side of Eq. (2) expresses the sum of the average distances between the point pairs in each group. The dominance of the number of point pairs in each group, which biases Eq. (1) toward equal-sized groups, is thus removed in Eq. (2). It will be experimentally shown later (in Fig. 4) that this formulation works well, especially for unevenly distributed data points.
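
For reference, the first term of Eq. (2) can be evaluated for a given assignment as in the following sketch (ours); it assumes an (N, G) binary assignment matrix and a symmetric distance matrix with zero diagonal, and skips groups with fewer than two members, which contribute no pairs.

```python
import numpy as np

def fractional_cost(x, dist):
    """First term of Eq. (2): sum over groups of the average pairwise
    distance within each group (factor of 2 omitted, as in the text).

    x    : (N, G) binary matrix, x[i, g] = 1 if point i belongs to group g
    dist : (N, N) symmetric distance matrix with zero diagonal
    """
    cost = 0.0
    for g in range(x.shape[1]):
        members = np.flatnonzero(x[:, g])
        n_g = len(members)
        if n_g < 2:
            continue  # no point pair in this group
        pair_sum = dist[np.ix_(members, members)].sum() / 2.0  # sum over i < j
        cost += pair_sum / (n_g * (n_g - 1))
    return cost
```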

Ising machines cannot directly minimize non-QUBO formulations such as Eq. (2). Instead, we employ a hybrid algorithm in which another Hamiltonian described by a QUBO formulation is iteratively minimized. We refer to this method as the iterative fractional-cost method.

The iterative fractional-cost method originates from the hybrid parametric method proposed to solve the vehicle routing problem24. The hybrid parametric method is an extension of an inexact parametric algorithm to solve fractional programming problems39. Instead of minimizing the original fractional objective function, the hybrid method iteratively solves the corresponding parametric problem in the discrete-optimization step, using a quantum annealer or an Ising machine.

The algorithm of the iterative fractional-cost method is as follows; a code sketch is given after the list.

  1.

    Set the error parameter \(\delta\), the iteration counter n as \(n=0\), and parameter \(\lambda\) as an initial value \(\lambda _0=0\).

  2.

    Minimize the following Hamiltonian with an Ising machine.

    $$\begin{aligned} H = \sum _{g=1}^G \left[ \sum _{i<j}^N d_{ij} x_{i,g} x_{j,g} -\lambda _n \left( \sum _{i=1}^N x_{i,g}\right) \left( \sum _{i=1}^N x_{i,g}-1\right) \right] + \alpha \sum _{i=1}^N\left( \sum _{g=1}^G x_{i,g} - 1 \right) ^2, \end{aligned}$$
    (3)

    where \(\{x_{i,g}\}\) represents binary variables. Let \(\{\hat{x}_{i,g}\}\) denote an obtained solution.

  3.

    If \(\{\hat{x}_{i,g}\}\) is a feasible solution, set \(\lambda\) for the next iteration as

    $$\begin{aligned} \lambda _{n+1} = \sum _{g=1}^G \frac{\sum _{i<j} d_{ij}\hat{x}_{i,g}\hat{x}_{j,g}}{\hat{N}_g(\hat{N}_g-1)}, \end{aligned}$$
    (4)

    where \(\hat{N}_g=\sum _i \hat{x}_{i,g}\). Otherwise, terminate with no solution.

  4.

    If \(|\lambda _{n+1} -\lambda _n| \le \delta\), terminate with \(\{\hat{x}_{i,g}\}\) as the final solution. Otherwise, if \(n+1 < n_\mathrm{max}\), set \(n\rightarrow n+1\) and return to step 2; if \(n+1=n_\mathrm{max}\), terminate with no solution.
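
The following sketch puts steps 1–4 together. It is an illustration under our own conventions, not the authors' implementation: an (N, G) assignment matrix, the variable mapping \(x_{i,g}\rightarrow iG+g\), and a user-supplied solve_qubo callable standing in for the Ising-machine call.

```python
import numpy as np

def parametric_qubo(dist, G, alpha, lam):
    """QUBO matrix for the parametric Hamiltonian, Eq. (3).

    For binaries, (sum_i x_ig)(sum_i x_ig - 1) = 2 * sum_{i<j} x_ig x_jg,
    so the -lambda term shifts every same-group pair coefficient by -2*lam.
    """
    N = dist.shape[0]
    Q = np.zeros((N * G, N * G))
    for g in range(G):
        for i in range(N):
            for j in range(i + 1, N):
                Q[i * G + g, j * G + g] += dist[i, j] - 2.0 * lam
    for i in range(N):
        for g in range(G):
            Q[i * G + g, i * G + g] += -alpha            # penalty, linear part
            for h in range(g + 1, G):
                Q[i * G + g, i * G + h] += 2.0 * alpha   # penalty, quadratic part
    return Q

def iterative_fractional_clustering(dist, G, alpha, solve_qubo,
                                    delta=1e-6, n_max=10):
    """Steps 1-4 of the iterative fractional-cost method.

    solve_qubo : callable taking a QUBO matrix and returning a flat binary
                 vector of length N*G (e.g., the best of several Ising-machine shots).
    Returns an (N, G) assignment matrix, or None if the method fails.
    """
    N = dist.shape[0]
    lam = 0.0                                         # step 1: lambda_0 = 0
    for n in range(n_max):
        Q = parametric_qubo(dist, G, alpha, lam)      # step 2
        x = np.asarray(solve_qubo(Q)).reshape(N, G)
        if not np.all(x.sum(axis=1) == 1):            # step 3: feasibility check
            return None
        lam_next = 0.0                                # Eq. (4)
        for g in range(G):
            members = np.flatnonzero(x[:, g])
            n_g = len(members)
            if n_g >= 2:
                pair_sum = dist[np.ix_(members, members)].sum() / 2.0
                lam_next += pair_sum / (n_g * (n_g - 1))
        if abs(lam_next - lam) <= delta:              # step 4: converged
            return x
        lam = lam_next
    return None                                       # n_max reached without convergence
```

In the experiments reported below, the role of solve_qubo is played by 100 shots of SB on the FPGA, from which the best feasible solution is selected (see "Results").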

If the algorithm successfully terminates, \(\lambda _{n+1}\) is expected to give the minimum cost. Because of the heuristic nature of the Ising machine, \(\lambda _{n+1}\) does not always coincide with the global minimum. However, it has been proved that the inexact parametric method converges to a global optimum if the solution obtained at the discrete-optimization step (step 2) is a good approximation39.

The key point of the iterative fractional-cost method is the use of an Ising machine to minimize Eq. (3), which is a QUBO formulation, instead of the fractional-cost Hamiltonian, Eq. (2). In the simulation below, we take the error parameter as \(\delta =10^{-6}\) and the maximum iteration number as \(n_\mathrm{max}=10\).

Results

We apply the two methods to two kinds of data sets. One is the set of points distributed uniformly in a square region. The other is the set of unevenly distributed points in the same size region. Each set comprises 200 points (\(N=200\)), and we take the number of groups as \(G=10\) here, so that there are 2,000 binary variables in total. The data sets are provided as Supplementary Information.

Simple-cost method

In the simple-cost method, we minimize Eq. (1) using SA or SB. Figure 1 exhibits clustering results, where points with the same color belong to the same group. Each panel shows a result selected to demonstrate the best performance of the method. We obtained Fig. 1a,c using SA, and Fig. 1b,d using SB. We find almost no difference between Fig. 1a and Fig. 1b, and little difference between Fig. 1c and Fig. 1d. For the unevenly distributed data, an apparent cluster is divided into two groups, and some points in apparent clusters are assigned to other groups.

Figure 1

Clustering results for (a,b) uniformly and (c,d) unevenly distributed data. The color of each point represents its group. (a,c) were obtained using SA, whereas (b,d) were obtained using SB.

Figure 2

Feasible solution rates in (a) SA and (b) SB. The horizontal axis is the parameter representing the number of time steps: (a) the number of annealing steps, (b) the number of SB time steps. \(\circ\) and \(\times\) are for the data sets with uniform and uneven distributions, respectively.

Solutions provided by SA or SB are not always valid because they sometimes violate the constraint that each point should belong to exactly one group. We call the solution satisfying this constraint a feasible solution. Here, we perform the simulation 100 times for each data set, using either SA or SB for the given time steps. The ratio of feasible solutions among the solutions obtained in the 100 trials is the feasible solution rate shown in Fig. 2. The horizontal axis in Fig. 2 represents the number of time steps. The larger the number of steps, the slower the SA or SB process. Here, the hyperparameter values are \(\alpha =5.5\) and \(\alpha =6.0\) for SA and SB, respectively, for the data set with a uniform distribution; \(\alpha =5\) for SA and SB for that with an uneven distribution. In Fig. 2, the feasible solution rate is almost 100%, except for the region where the number of steps is relatively small. However, a low feasible solution rate does not always result in an inferior result. By changing the value of \(\alpha\), we obtained similar clustering results even though feasible solution rates were 40–80%.
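
The feasible solution rate can be obtained from the stored solutions of the repeated trials, for example as in the following sketch (ours; it assumes each solution is stored as a flat binary array of length N·G).

```python
import numpy as np

def feasible_solution_rate(solutions, N, G):
    """Fraction of solutions in which every point belongs to exactly one group.

    solutions : list of flat binary arrays of length N * G
    """
    n_feasible = sum(
        np.all(np.asarray(sol).reshape(N, G).sum(axis=1) == 1)
        for sol in solutions
    )
    return n_feasible / len(solutions)
```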

Figure 3

Box plots of the average silhouette coefficient for (a,b) the uniformly distributed data set and (c,d) the unevenly distributed one. (a,c) were obtained using SA, whereas (b,d) were obtained using SB. The horizontal axis is execution time.

To evaluate the clustering results, we introduce the silhouette coefficient40. Because the silhouette coefficient is defined for each data point, we take the average of every point’s silhouette coefficient in a feasible solution. The better the clustering result, the higher the average silhouette coefficient. Figure 3 demonstrates how clustering results depend on the execution time. The box plots show the distribution of average silhouette coefficients for feasible solutions. The execution time is averaged over the feasible solutions for each number of steps. The average silhouette coefficient is about 0.35–0.39 or lower for the uniformly distributed data set [Fig. 3a,b], while it is approximately 0.45–0.6 or higher for the unevenly distributed one [Fig. 3c,d]. The difference reflects the difference in data distribution. On the other hand, the difference between SA and SB is significant. Although the execution time is much shorter for SB than for SA, the average silhouette coefficient for SB is almost equal to or even higher than that for SA. Moreover, the variance of the average silhouette coefficient is small for SB in the long-time region.

Iterative fractional-cost method

Figure 4

Clustering results obtained using SB for the (a) uniformly and (b) unevenly distributed data.

We use only SB to perform discrete optimization in the iterative fractional-cost method. Because the simulation using SA had a longer run time and provided results similar to or worse than those of SB in the simple-cost method, we do not expect SA to provide better results than SB in the iterative fractional-cost method. Figure 4 exhibits examples of clustering results in the iterative fractional-cost method. Compared with Fig. 1, the unevenly distributed data points are well classified.

Figure 5

Final feasible solution rates for (a,c) the uniformly distributed data set and (b,d) the unevenly distributed one. The number in the legend is the number of iterations of steps 2–4 in the iterative fractional-cost method. The horizontal axis represents the number of SB time steps. The hyperparameter is \(\alpha =5.5\) in (a,b) and \(\alpha =6.0\) in (c,d).

Figure 5 illustrates the final feasible solution rates. Here, the simulation was run 50 times for each data set. In other words, the final feasible solution rate is the ratio of feasible solutions obtained at the end of the algorithm among the 50 trials. The number in the legend represents the number of iterations of steps 2–4 in the iterative fractional-cost method before the final solution is obtained. In this work, we perform the optimization sampling 100 times at the discrete optimization step (step 2) to obtain several feasible solutions for Eq. (3). The best solution among them is taken as \(\hat{\varvec{x}}\) and used to evaluate \(\lambda _{n+1}\) in Eq. (4). Even if \(\hat{\varvec{x}}\) is feasible at each iteration, this method sometimes fails to obtain a final solution within \(n_\mathrm{max}=10\).
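
A possible way to select \(\hat{\varvec{x}}\) from the sampled solutions is sketched below (our illustration; the array layout and names are assumptions).

```python
import numpy as np

def best_feasible(samples, energies, N, G):
    """Return the feasible sample with the lowest QUBO energy, or None.

    samples  : (num_shots, N * G) binary array of sampled solutions of Eq. (3)
    energies : (num_shots,) corresponding QUBO energies
    """
    best, best_energy = None, np.inf
    for sol, energy in zip(samples, energies):
        x = np.asarray(sol).reshape(N, G)
        if np.all(x.sum(axis=1) == 1) and energy < best_energy:
            best, best_energy = x, energy
    return best
```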

In most cases in Fig. 5, the final feasible solutions are obtained in three iterations. More iterations are required to reach a final feasible solution in the small-number-of-step region for the uniformly distributed data, as shown in Fig. 5a,c. The simulation for the unevenly distributed data requires more iterations for a large number of steps, as shown in Fig. 5b,d. The discrete optimization step works better when the number of steps is large than when it is small, but this can cause overfitting: if the solution is overfitted to the temporary parameter \(\lambda _n\) at each iteration, the algorithm may have difficulty converging. However, controlling the hyperparameter \(\alpha\) can result in the simulation requiring fewer iterations.

Figure 6

Box plots of execution time for (a,c) uniformly and (b,d) unevenly distributed data. The horizontal axis represents the number of SB time steps. The hyperparameter is \(\alpha =5.5\) in (a,b) and \(\alpha =6.0\) in (c,d).

Figure 6 exhibits the execution time taken for the discrete optimization in the iterative fractional-cost method. Each panel corresponds to the respective panel of Fig. 5. The execution time depends on the number of iterations, the number of optimization samplings at the discrete optimization step, and the number of SB time steps.

The average silhouette coefficient is almost independent of \(\alpha\) and the number of SB time steps in the region corresponding to Figs. 5 and 6. For the uniformly distributed data, the average silhouette coefficient is 0.392–0.394, which is comparable to that of the simple-cost method. However, for the unevenly distributed data, the average silhouette coefficient is 0.709–0.716, approximately 18% higher than that of the simple-cost method.

Discussion

In this work, we have compared two clustering methods based on QUBO formulations. One is the simple-cost method, and the other is the iterative fractional-cost method. We have applied each method to data sets with uniform and uneven distributions. The simulation of the simple-cost method is performed using SA and SB, whereas that of the iterative fractional-cost method is performed using only SB.

Clustering results highly depend on data distribution. For an uneven distribution, the iterative fractional-cost method works better than the simple-cost method. However, the iterative fractional-cost method requires more time to obtain a result because it requires several iterations. If the variation in cluster size is significant, the iterative fractional-cost method is a better choice. Otherwise, the simple-cost method is acceptable.

The difference between SA and SB is significant, especially in execution time. The difference in execution time arises mainly from the degree to which each algorithm can be parallelized. In this work, SA is based on the single-spin-flip Monte Carlo method and is not parallelized, whereas SB is executed through massively parallel processing. Parallelization is the key to finding reasonable solutions quickly.

Another reason why SB outperformed SA in execution time is that the FPGA-based accelerator is attached to the local server. Even if a remote Ising machine or a quantum annealer solves QUBO problems faster than SA, the communication time between the machine and the local server may cancel out the acceleration. The results shown above imply that the acceleration by the SB-based machine overcame the communication overhead between the accelerator and the general-purpose processor (CPU) in the local server.

This work paves the way for solving practical optimization problems requiring iterative hybrid algorithms in a reasonable time by using an Ising machine. For most present Ising machines, including quantum annealers, iterative hybrid algorithms are relatively inefficient, mainly because of communication overheads. However, iterative hybrid methods using a high-speed Ising machine with low latency can provide high performance in solving practical optimization problems. The present work demonstrates such an example.

Methods

Execution time

In this section, we define one-shot execution time as the time required to solve a discrete optimization problem. The detailed definition differs between SA and SB because different architectures were used for each method. For SA, we used the dwave-neal package41, which is an SA software package, and measured the time from calling the SA sampler to obtaining solutions. When the SA sampler executes SA \(n_\mathrm{reads}\) times per call, the one-shot execution time is the measured time divided by \(n_\mathrm{reads}\). For SB, the one-shot execution time, which is measured for each call, includes the FPGA calculation time and the communication time between the CPU and FPGA.
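
A rough sketch (ours) of the SA timing procedure with the dwave-neal sampler is shown below; the exact measurement code and parameter values used in this work are not reproduced here.

```python
import time
import neal

def one_shot_time_sa(Q, num_reads=100, num_sweeps=10000):
    """Wall-clock time of one SA sampler call divided by the number of reads.

    Q : QUBO as a dict {(i, j): coefficient}
    """
    sampler = neal.SimulatedAnnealingSampler()
    start = time.perf_counter()
    sampleset = sampler.sample_qubo(Q, num_reads=num_reads, num_sweeps=num_sweeps)
    elapsed = time.perf_counter() - start
    return elapsed / num_reads, sampleset
```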

The execution time for the simple-cost method (shown in Fig. 3) is the same as the one-shot execution time. On the other hand, in the iterative fractional-cost method, the execution time shown in Fig. 6 is defined as the sum of the one-shot execution times. As shown in Fig. 7, only the discrete optimization step (step 2) is executed in the FPGA. The step is iterated several times, and 100 shots of SB are executed at each iteration. Thus, the execution time for the iterative fractional-cost method is several hundred times longer than that for the simple-cost method.

Figure 7

Schematics of the iterative fractional-cost method using SB. Only the discrete optimization step (step 2), in which the QUBO Hamiltonian in Eq. (3) is minimized, is executed in the FPGA.

The overhead time in SB is a minor component of execution time. For example, as shown in Fig. 3b, the execution time for 2000 SB time steps is approximately 14 ms. The communication time between the CPU and FPGA, which is independent of the number of SB time steps, is approximately 3 ms. Thus, on average, the overhead time is approximately 20% of the execution time for the problem considered in this work.

Ballistic simulated bifurcation

The SB algorithm, which is a quantum-inspired algorithm, was proposed to accelerate combinatorial optimization10,11,12. SB is based on adiabatic evolution in classical nonlinear systems that exhibit bifurcations. The SB algorithm finds a spin configuration minimizing the Ising model energy defined by

$$\begin{aligned} E = -\frac{1}{2}\sum _{i,j=1}^N J_{i,j} s_i s_j + \sum _{i=1}^N h_i s_i, \end{aligned}$$
(5)

where \(s_i=\pm 1\) is the ith spin, N is the number of spins, \(J_{i,j}\) is the coupling coefficient between the ith and jth spins, and \(h_i\) is the local field on the ith spin.
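
For concreteness, Eq. (5) can be evaluated in one line with NumPy (a sketch; s is a ±1 vector, J a symmetric coupling matrix with zero diagonal, h the local-field vector).

```python
import numpy as np

def ising_energy(s, J, h):
    """Ising energy of Eq. (5) for a spin configuration s_i = +/-1."""
    return -0.5 * s @ J @ s + h @ s
```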

SB has several variants, the first of which was adiabatic SB (aSB). The set of equations for aSB is given by

$$\begin{aligned} \frac{dx_i}{dt}&= a_0y_i, \end{aligned}$$
(6)
$$\begin{aligned} \frac{dy_i}{dt}&= -x_i^3 -\left[ a_0-a(t)\right] x_i -\eta h_i + c_0\sum _{j=1}^NJ_{i,j}x_j, \end{aligned}$$
(7)

where \(x_i\) and \(y_i\) are real numbers corresponding to the ith spin, \(a_0\), \(c_0\) and \(\eta\) are positive constants, and a(t) is a control parameter that increases from zero to \(a_0\). Equations (6) and (7) express the equations of motion of the classical particle corresponding to the ith spin: \(x_i\) and \(y_i\) represent the position and momentum of the ith particle, respectively. The classical particles interact in a potential whose shape gradually changes. The sign of \(x_i\) at the end of the time evolution gives \(s_i\) as the solution to the problem described by Eq. (5). Because \(x_i\) is a continuous variable while the spin \(s_i=\pm 1\) is discrete, analog errors arise in aSB.

The ballistic SB (bSB) used in this work was developed to suppress analog errors12. Instead of the nonlinear term \(x_i^3\) in Eq. (7), perfectly inelastic walls are introduced at \(x_i=\pm 1\). Using the symplectic Euler method, we numerically solve the set of equations of motion for bSB, whose updating rule is as follows12.

$$\begin{aligned} y_i(t_{k+1})&= y_i(t_k) + \left\{ -[a_0-a(t_k)]x_i(t_k) -\eta h_i + c_0\sum _{j=1}^NJ_{i,j}x_j(t_k)\right\} \Delta _t, \end{aligned}$$
(8)
$$\begin{aligned} x_i(t_{k+1})&= x_i(t_k) + a_0 y_i(t_{k+1})\Delta _t, \end{aligned}$$
(9)

where \(t_k\) is the kth time step, and \(\Delta _t\) is the time-step width. Namely, \(t_{k+1}=t_k+\Delta _t\). Here, we take \(t_0=0\). At each time step, after the update of \(x_i\), if \(|x_i|>1\), we replace \(x_i\) with \(\mathrm {sgn}(x_i)=\pm 1\) and set \(y_i=0\); this implements the perfectly inelastic walls.
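
A minimal NumPy sketch of the bSB update, Eqs. (8) and (9), with the inelastic walls is given below. The parameter choices follow the "Parameters" section; the initial conditions (small random positions, zero momenta) are our assumption and are not specified in the text.

```python
import numpy as np

def ballistic_sb(J, h, n_steps, a0=1.0, eta=1.0, dt=0.5, c0=None, rng=None):
    """One bSB run following Eqs. (8) and (9) with inelastic walls at x = +/-1.

    J : (N, N) symmetric coupling matrix (zero diagonal), h : (N,) local fields.
    Returns the spin configuration sign(x) after the time evolution.
    """
    rng = np.random.default_rng() if rng is None else rng
    N = len(h)
    if c0 is None:
        c0 = a0 / np.max(np.linalg.eigvalsh(J))   # c0 = a0 / lambda_max
    x = 0.01 * (rng.random(N) - 0.5)              # small random initial positions
    y = np.zeros(N)                               # zero initial momenta
    for k in range(n_steps):
        a = a0 * k / n_steps                      # a(t) increases from 0 to a0
        y += (-(a0 - a) * x - eta * h + c0 * J @ x) * dt   # Eq. (8)
        x += a0 * y * dt                                   # Eq. (9)
        hit = np.abs(x) > 1.0                     # perfectly inelastic walls
        x[hit] = np.sign(x[hit])
        y[hit] = 0.0
    return np.where(x >= 0, 1, -1)
```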

Parameters

The hyperparameter \(\alpha\) in Eqs. (1) and (3) is tuned such that the feasible solution rate and the average silhouette coefficient become large. Setting the number of time steps to 10,000 and 2,000 for SA and SB, respectively, we varied \(\alpha\) from 3 to 7 in increments of 0.5. The feasible solution rate (the final feasible solution rate in the iterative fractional-cost method) increases rapidly at a specific value of \(\alpha\). We selected one of a few \(\alpha\) values around this threshold according to the following criteria: (i) the mean of the average silhouette coefficients is higher than for any other \(\alpha\) over the entire time-step region of our experiments; (ii) if the mean values are similar among several \(\alpha\) values, the one with the higher (final) feasible solution rate is chosen.

In SA, the parameter controlling the annealing time is the number of sweeps or steps, called num_sweeps in the dwave-neal package. In SB, the number of steps \(n_\mathrm{steps}\) controls the increasing rate of a(t): a(t) increases linearly from zero to \(a_0\), that is, \(a(t_k)=(k/n_\mathrm{steps})a_0\). In this work, we set \(c_0=a_0/\lambda _\mathrm{max}\), where \(\lambda _\mathrm{max}\) is the maximum eigenvalue of the matrix \(J_{i,j}\), and \(a_0=\eta =1\) and \(\Delta _t=0.5\) in Eqs. (8) and (9).

Average silhouette coefficient

The silhouette coefficient is defined for each data point, and it is higher when the point belongs to a well-defined cluster. The silhouette coefficient for point i is defined by

$$\begin{aligned} s(i) = \frac{b(i)-a(i)}{\max (a(i), b(i))}, \end{aligned}$$
(10)

where a(i) is the mean distance between point i and the other points in the same group, and b(i) is the mean distance between point i and the points in the nearest neighboring group, i.e., the group other than its own with the smallest mean distance. However, if a point belongs to several groups or does not belong to any group, the silhouette coefficient cannot be properly calculated. Therefore, we calculate the silhouette coefficient for a feasible solution in which each point belongs to exactly one group and take the average over all the points in the feasible solution.
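
One possible implementation of this evaluation (ours; the text does not specify the software used) relies on scikit-learn's silhouette_score with a precomputed distance matrix.

```python
import numpy as np
from sklearn.metrics import silhouette_score

def average_silhouette(x, dist):
    """Average silhouette coefficient of a feasible solution.

    x    : (N, G) binary assignment matrix with exactly one 1 per row
    dist : (N, N) precomputed distance matrix
    """
    assert np.all(x.sum(axis=1) == 1), "defined only for feasible solutions"
    labels = np.argmax(x, axis=1)   # group index of each point
    return silhouette_score(dist, labels, metric="precomputed")
```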