CMOS-compatible ising machines built using bistable latches coupled through ferroelectric transistor arrays

Mallick, Antik; Zhao, Zijian; Bashar, Mohammad Khairul; Alam, Shamiul; Islam, Md Mazharul; Xiao, Yi; Xu, Yixin; Aziz, Ahmedullah; Narayanan, Vijaykrishnan; Ni, Kai; Shukla, Nikhil

doi:10.1038/s41598-023-28217-8

Download PDF

Article
Open access
Published: 27 January 2023

CMOS-compatible ising machines built using bistable latches coupled through ferroelectric transistor arrays

Antik Mallick¹,
Zijian Zhao²,
Mohammad Khairul Bashar¹,
Shamiul Alam³,
Md Mazharul Islam³,
Yi Xiao⁴,
Yixin Xu⁴,
Ahmedullah Aziz³,
Vijaykrishnan Narayanan⁴,
Kai Ni² &
…
Nikhil Shukla¹

Scientific Reports volume 13, Article number: 1515 (2023) Cite this article

3133 Accesses
5 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Realizing compact and scalable Ising machines that are compatible with CMOS-process technology is crucial to the effectiveness and practicality of using such hardware platforms for accelerating computationally intractable problems. Besides the need for realizing compact Ising spins, the implementation of the coupling network, which describes the spin interaction, is also a potential bottleneck in the scalability of such platforms. Therefore, in this work, we propose an Ising machine platform that exploits the novel behavior of compact bi-stable CMOS-latches (cross-coupled inverters) as classical Ising spins interacting through highly scalable and CMOS-process compatible ferroelectric-HfO₂-based Ferroelectric FETs (FeFETs) which act as coupling elements. We experimentally demonstrate the prototype building blocks of this system, and evaluate the scaling behavior of the system using simulations. Our work not only provides a pathway to realizing CMOS-compatible designs but also to overcoming their scaling challenges.

Large-scale coherent Ising machine based on optoelectronic parametric oscillator

Article Open access 25 November 2022

Antiferromagnetic spatial photonic Ising machine through optoelectronic correlation computing

Article Open access 12 November 2021

Scaling silicon-based quantum computing using CMOS technology

Article 20 December 2021

Introduction

Ising Machines, as dynamical systems, have recently shown promise for accelerating computationally challenging problems in combinatorial optimization. The intrinsic energy minimization in the highly interconnected system gives rise to rich spatio-temporal properties, which can subsequently be mapped to the solutions of many computationally intractable optimization problems^1,2. However, the highly interconnected nature of the system also poses a significant implementation and scalability challenge for Ising platforms. In fact, the number of coupling elements (representing edges) required for mapping an arbitrary graph scales up quadratically (~ N²) with the number of nodes in the graph. Consequently, scaling the system to large sizes continues to be a significant challenge for most Ising machine designs. Our approach to addressing this challenge relies on developing novel hardware components that are not only compact but can also leverage the maturity of CMOS-process technology and integration.

There is a full pallet of hardware technologies^{3,4,5,6,7,8,9,10,11,12,13,14,15,16}, each with their advantages and shortcomings, that have been considered for implementing Ising machines. Quantum annealing approaches^{17,18,19,20,21} using qubits offer the possibility of exponential speed up (by overcoming the fundamental hardness of the problem) but require cryogenic cooling. Besides cost, this requirement also restricts the type of applications where this approach would be practical. At the classical end, Ising machine implementations can be classified into the optical domain—using optoelectronic oscillators to design Coherent Ising Machines (CiM)^22,23,24,25, and the electronic domain using a variety of classical spin implementations. CiMs offer advantages such as speed as well as a relatively broad dynamic range for the implementation of weights²⁶. However, such implementations have traditionally been bulky, requiring long optical fibers, although there is some recent work on monolithic integration^27,28,29.

Electronic Ising machines have relied on the following approaches: iterative annealing in memory (AIM)^30,31, that digitally emulates the Ising model^32,33 and the actual implementation of the dynamical system^{34,35,36,37,38}. While the former approach essentially minimizes the Ising energy using a heuristic iterative approach, the latter method (relevant to this work) uses the hardware as classical spins and maps the energy minimization in the hardware directly for minimization of the Ising Hamiltonian. Various devices such as oscillators^{39,40,41,42,43,44,45,46} and ZIV diodes⁴⁷ have been experimentally demonstrated as classical Ising spins. More recently, CMOS-based (bi-stable) latches have also been theoretically shown to behave as Ising spins as well⁴⁸. Here, we experimentally demonstrate CMOS-latches as highly scalable and compact Ising spins. Additionally, in all of the electronic designs, the implementation of the coupling network continues to be a significant challenge for scaling. To address this, we propose to exploit the non-volatile behavior of CMOS-compatible FeFET memory arrays (in fact, the FeFETs used in this work are built in 28 nm high-κ metal gate technology platform) to implement the interaction among the spins (CMOS latches). Consequently, our work enables a pathway to a compact Ising platform (Fig. 1) that is positioned to exploit the maturity of CMOS process technology to realize a scalable solution.

Results

CMOS latch as a classical spin

We first focus on experimentally evaluating the behavior of CMOS latches as classical spins with simple resistive coupling. The theoretical foundation for the latch-based Ising machine was elegantly formulated by J. Roychowdhury⁴⁸ wherein the energy function for a resistively coupled system of latches was shown to map to the Ising Hamiltonian. The Ising Hamiltonian is given by \({\text{H}}=-\sum_{\mathrm{i},\mathrm{j}}^{\mathrm{N}}{\mathrm{J}}_{\mathrm{ij}}{\mathrm{s}}_{\mathrm{i}}{\mathrm{s}}_{\mathrm{j}}\), where \({\mathrm{s}}_{\mathrm{i}}\) ∈ {± 1} corresponds to the \(\mathrm{ith}\) spin, and \({\mathrm{J}}_{\mathrm{ij}}\) is the interaction coefficient between nodes \(\mathrm{i}\) and \(\mathrm{j}\). Figure 2 shows the experimentally observed behavior of a system of two latches with positive (J_ij = + 1) and negative (J_ij = -1) coupling over 100 runs; the details of the setup used in the experiments are discussed in supplementary note 1. It can be observed that the latches settle to the same voltage output level [i.e., the read terminals have the same outputs (0, 0) or (V_DD, V_DD)], when positively coupled, and opposite voltage levels [i.e., the read terminals have opposite outputs (V_DD, 0) or (0, V_DD)], when negatively coupled. We note, however, that the exact output (i.e., whether V_out1 or V_out2 settles to V_DD or 0) shows statistical behavior, as expected. Moreover, the skew in the statistical behavior likely arises from the mismatch in the coupling resistances; although the correct ground state is attained, one configuration of the ground state is preferred over the other one.

Building on this coupled two-latch spin system, we evaluate the dynamics of a system of four negatively coupled latches as shown in Fig. 3a–c. Our choice of negative coupling is motivated by the fact that the dynamics of such an Ising network can be directly mapped to computing the Maximum Cut (MaxCut) of a topologically equivalent graph-the MaxCut of a graph is defined as the challenge of dividing the nodes of a graph into two sets such that the number of common edges is maximized (unweighted graphs are considered here).

When powered up, the latch outputs [L₁ = L₃ = V_DD (+ 1); L₂ = L₄ = 0 V (− 1)] correspond to the two sets created by the MaxCut (Fig. 3b) and yield an optimal MaxCut solution of 4 (Fig. 3c). Furthermore, to rule out the possibility that the output states of the latches are resulting from any inherent asymmetry among them, we map the nodes of the graph in Fig. 3 to different physical latches. The results, discussed in supplementary note 2, show that optimal solutions are observed in all the cases irrespective of the mapping, implying that the observed relationships between the latch outputs arise from the interaction among the latches governed by the minimization of the energy of the system. We also evaluate the effect of the coupling strength on the system properties by tuning the coupling resistance (\({J}_{ij}\propto 1/R)\). It can be observed that the system exhibits the desired functionality as an Ising machine only in a limited coupling range (Fig. 3d). When the coupling is very weak (large R), the latch outputs are essentially decoupled (to a varying degree depending on the coupling strength) resulting in sub-optimal ‘solutions’. In contrast, when the coupling is very strong (small R), the latches settle to a nearly common state (~ V_DD/2). Next, we experimentally evaluate the (Max)Cut on graphs of up to 10 nodes. Figure 3f shows the stochastic behavior of the system (expected for the Ising machine) for a representative graph with 10 nodes (Fig. 3e) measured over 150 iterations. Optimal solutions are observed in 86 out of the 150 measurements and the sub-optimal solutions in the remaining measurements result from the system getting trapped in local minima in the high dimensional phase space (Fig. 3g). Further, we also experimentally compute the cut value on multiple graph configurations (= 20) up to 10 nodes (Fig. 3h); each graph is measured 10 times. Optimal MaxCut solutions are obtained in 17 of the 20 graphs.

Coupling spins using FeFETs

While the above experiments showcase the functional behavior of CMOS latches as classical Ising spins, the implementation of programmable monolithic resistors as coupling elements can be challenging and area inefficient, particularly in scaled systems. We therefore evaluate, first at a singular device level, the possibility of using non-volatile FeFETs (Fig. 4a) as programmable coupling elements between the CMOS latches. Our choice of using the FeFET as the coupling element is motivated by the fact that FeFETs are compatible with CMOS process technology, provide a wide dynamic range for the resistance (coupling strength), and can be efficiently integrated and programmed in a scalable array that is required to map the spin interactions in the entire network. We envision that the tunable threshold voltage of the FeFET would allow us to program the interaction between the latches; the low V_T (high conductance) state would correspond to J_ij = ± 1 whereas the high V_T (low conductance) state would correspond to J_ij = \(0\). Figure 4b shows the experimentally measured transfer characteristics of the ferroelectric-HfO₂-based FeFETs used in this work; the devices are fabricated (see “Methods”) on 28 nm high-κ metal gate technology platform, as shown in the cross-sectional TEM image^49,50. It features a doped HfO₂ layer as the ferroelectric and SiO₂ as the interlayer. Detailed processing information is described elsewhere⁵⁰. Figure 4c shows the memory window vs. programming voltage characteristics for the FeFET. When a programming voltage of ± 4 V is used to program the FeFET state, a 100 × modulation in the current is obtained for V_GS = 1 V.

We subsequently characterize the behavior of the FeFET as a programmable coupling element in a two-latch system. To evaluate this, the FeFETs are first programmed into the low V_T/high conductance state (J_ij = ± 1) using a programming pulse of magnitude + 4 V and a period of 1 µs. We test the interaction induced by the FeFETs among the latches by intentionally programming them into the ‘incorrect’ state, in order to observe the system evolve into the correct state i.e., when J_ij = − 1 (+ 1), the latches are initialized into the same (opposite) states (0/V_DD), and subsequently, it is observed whether the system evolves to the correct state. During the initialization of the latches, the FeFETs are maintained at V_GS = − 0.5 V. This reduces the conductance of the FeFETs without affecting the threshold voltage. After the latches are initialized, the gate voltage is increased to V_GS = 1.5 V, and the corresponding dynamics of the FeFET coupled CMOS latches are evaluated. We note that V_GS = 1.5 V was required to achieve the desired level of conductance from the FeFET-based coupling element (coupling strength) since the threshold voltage of the device has not been optimized for this application. The gate voltage can be reduced to zero by appropriately adjusting the threshold as considered in the following simulations and a detailed study of the optimization and the effect of the finite FeFET output conductance will be undertaken in future work. Figure 4d,e shows the observed behavior of the coupled system. Similar to the resistive coupling above, the coupled two-latch system settles in-phase (out-of-phase) when the coupling is positive J_ij = + 1 (negative, J_ij = − 1), respectively.

Using the above building blocks, we now explore a pathway to design a scalable Ising machine using latches as classical Ising spins and the FeFETs as programmable coupling elements. We propose to use a FeFET-based array to realize the coupling network among the latches (Fig. 5a). In this architecture, the rows (bit lines; BL) and the columns (source lines; SL) are connected to the drain and the source of the FeFET, respectively. Each row and column are driven by the two complementary outputs of a latch, effectively realizing the negative coupling (J_ij = − 1), required to solve the MaxCut problem using the Ising model (Fig. 5a). We also note that the latches can be decoupled from the rows and columns using the transmission gate-based switches. This is required for the two-stage operation of the array as illustrated further. The word-line (WL), connected to the gates of all the FeFETs in a row, is used to program the FeFET state according to the adjacency matrix of the input graph.

The above array is simulated using the HSPICE platform⁵¹ which is interfaced with MATLAB in order to input the circuit simulation parameters required as well as to evaluate the output obtained from the circuit simulation. The CMOS latches are designed using the 10 nm PTM model⁵²; a 5-fin FinFET design is used in order to achieve the desired drive strength for each inverter. The FeFETs are implemented using a circuit-compatible SPICE model which tracks the history-dependent switching behavior of the ferroelectric and the details of the model have been presented in our prior work⁵⁰. A nominal variation of 10 mV in threshold voltage (per fin) is also considered; furthermore, the impact of the increased threshold voltage is detailed in supplementary note 3. Additionally, for the interconnect routing in the FeFET array, a line-to-ground capacitance of 0.122 fF/µm and 0.109 fF/µm, and a line-to-line capacitance of 0.0229 fF/µm and 0.0217 fF/µm are considered for the two metal layers- M1 and M2, respectively⁴⁴. A random noise source, available in the spice stimulus⁵³, of a maximum amplitude of 50 mV is added to the supply voltage.

Computing the MaxCut using the above array is a two-step process, and is illustrated here with the aid of a small 4-node graph as shown in Fig. 5c: (a) Programming the FeFET array to represent the input graph: During this phase, the latches are decoupled from the FeFET array by turning OFF the interfacing switches. The FeFET array can now be programmed as a standard memory array, and we employ the half-select (V_W/2) bias scheme (Fig. 5b) to write the desired state to the FeFETs. First, a negative write voltage V_W (− 3 V considered here) pulse is applied to all the WLs ensuring that all the FeFETs are initialized to the high-V_T (low conductance) state. Subsequently, a column-wise write scheme is used wherein all the FeFET cells corresponding to A_ij = 1 (in the respective column) are programmed into the low-V_T (high conductance) state; no programming is required for the FeFETs that represent to A_ij = 0 since the whole FeFET array was initialized into the high V_T (low conductance) state at the start, as discussed earlier. Programming the FeFETs into the low-V_T (high conductance) state is achieved by asserting the word line to + 3 V and the corresponding BL and SL to 0 V. The column-wise programming scheme as well as the applied voltages are selected such that the half-selected cells in the corresponding column and row experience minimal program disturbs (see Supplementary Note 4).

(b) Execution phase: Once the FeFET array has been programmed to represent the input graph, the latches are powered on, and connected to the rows and columns of the FeFET array. The FeFETs now act as coupling elements among the latches. Subsequently, the coupled system evolves towards the ground-state energy- manifested as certain latches changing their state, and in the process, computing the solution to the MaxCut problem. Considering the statistical nature of the computation, wherein the system can get trapped in local minima (resulting in a sub-optimal solutions), the power supply to the latches and the enable signal to the switches interfacing the latches and the FeFET array are cycled five times. Such periodic cycling is done to facilitate the system’s evolution through multiple trajectories through the phase space, and consequently, increase the probability of finding the global minima. However, we note that the size of the graph and the resulting complexity of the phase space will impact the number of cycles required to achieve a high-quality solution, which will be a critical factor for evaluating the scalability of the proposed implementation.

Figure 5d shows the programming and execution of a 4-node representative problem using the array proposed above. Once the FeFET array has been programmed according to the adjacency matrix of the graph, the interface between the latch and the array is turned ON and OFF periodically five times. It can be observed that each time the latches are connected to the array, the latch outputs evolve toward the ground-state energy, and the final state (V_out1 = V_out3 = V_DD; V_out2 = V_out4 = 0) represents the two sets created by the MaxCut of the input graph (Fig. 5d). We note that optimal solutions are observed in all five evaluations here. However, this may not be the case in all cases, particularly, as the graph size increases.

Subsequently, we evaluate the dynamics of the proposed array using numerous (30) graph instances of varying sizes up to 50 nodes; 10 randomly instantiated graphs are considered for each graph size. The entire two-stage process including the programming of the FeFETs is simulated in each case, and hence the simulations are computationally intensive. We also observe that in all the evaluated instances, the latches settle to a steady-state within 100 ns after the transmission gates are turned ON. Figure 5e shows the Cut computed by the proposed array. For 10-node problem instances, the solutions are normalized to the MaxCut solution computed by the BiqMac solver⁵⁴ which guarantees an optimal solution if the algorithm converges (within the maximum run-time of 3 h). For the larger 25 and 50-node graphs, since the convergence time exceeded the BiqMac solver’s run-time limit, we compare our results with the heuristic manifold optimization⁵⁵ algorithm. We observe that for smaller graphs (10 nodes) optimal solutions are obtained in 9 out of the 10 cases. However, the solution quality degrades (mean accuracy for the 50 node graphs is 94%) as the graph size is increased owing to the increasing complexity of the solution space- a feature observed in all Ising machine implementations.

Discussion

Figure 6 compares the proposed approach with other approaches used to implement Ising machines. The implementation provides a pathway to realizing a compact Ising machine to solve computationally challenging problems such as MaxCut using CMOS-compatible components. We also note that the FeFET array, in general, provides a coupling framework among the Ising spins, that in principle, could also be used for other realizations of Ising spins such as oscillators^{39,40,41,42,43}. We also acknowledge that the present design considers only a subset of graphs i.e., unweighted graphs, and has been evaluated here for negative J_ij coefficients only (since they are relevant to solving the MaxCut problem). Nevertheless, these results motivate future work into the feasibility of this approach for solving weighted graphs (preliminary simulations shown in supplementary note 5) that can potentially exploit the multi-bit operation of the FeFET—a task that requires co-optimization of the FeFET array at the materials, device, and circuit level^56,57,58,59.

In summary, this work marks the first step towards proposing an Ising machine implementation that is well-positioned to take advantage of the maturity of the CMOS process technology making it a promising design approach for realizing high-performance application-specific accelerators for solving combinatorial optimization problems.

Methods:

Device fabrication

FeFETs employed in this work have a poly-crystalline Si/TiN/doped HfO₂/SiO₂/p-Si gate stack, which is integrated on the 28 nm node high-κ metal gate CMOS technology platform on 300 mm silicon wafers. Detailed information is described elsewhere⁵⁰. For the ferroelectric gate stack, a thin SiO₂ interfacial layer is grown first, followed by the deposition of the doped HfO₂ film. Then a TiN metal gate electrode was deposited using physical vapor deposition and following that the poly-Si gate electrode is deposited. The source and drain n + regions were formed by phosphorous ion implantation and then rapid thermal annealing at ~ 1000 °C. This step also results in the formation of the ferroelectric orthorhombic phase within the doped HfO₂.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Code availability

All codes used in this work are either publicly available or available from the authors upon reasonable request.

References

Lucas, A. Ising formulations of many NP problems. Interdiscip. Phys. 2, 5 (2014).
Google Scholar
Wang, T. & Roychowdhury, J. OIM: Oscillator-based ising machines for solving combinatorial optimisation problems. In International Conference on Unconventional Computation and Natural Computation, 232–256 (Springer, 2019).
Borders, W. A. et al. Integer factorization using stochastic magnetic tunnel junctions. Nature 573, 390–393 (2019).
Article ADS CAS Google Scholar
Camsari, K. Y., Faria, R., Sutton, B. M. & Datta, S. Stochastic p-bits for invertible logic. Phys. Rev. X 7, 031014 (2017).
Google Scholar
Parihar, A., Shukla, N., Jerry, M., Datta, S. & Raychowdhury, A. Vertex coloring of graphs via phase dynamics of coupled oscillatory networks. Sci. Rep. 7, 911 (2017).
Article ADS Google Scholar
Csaba, G., Raychowdhury, A., Datta, S. & Porod, W. Computing with coupled oscillators: Theory, devices, and applications. In 2018 IEEE International Symposium on Circuits and Systems (ISCAS), 1–5 (IEEE, 2018).
Pierangeli, D., Marcucci, G. & Conti, C. Large-scale photonic Ising machine by spatial light modulation. Phys. Rev. Lett. 122, 213902 (2019).
Article ADS CAS Google Scholar
Andrawis, R. & Roy, K. Antiferroelectric tunnel junctions as energy-efficient coupled oscillators: Modeling, analysis, and application to solving combinatorial optimization problems. IEEE Trans. Electron Devices 67(7), 2974–2980 (2020).
Article ADS CAS Google Scholar
Roques-Carmes, C. et al. Heuristic recurrent algorithms for photonic Ising machines. Nat. Commun. 11, 1–8 (2020).
Article Google Scholar
Cai, F. et al. Power-efficient combinatorial optimization using intrinsic noise in memristor hopfield neural networks. Nat. Electron. 3, 409–418 (2020).
Article Google Scholar
Bojnordi, M. N. & Ipek, E. Memristive boltzmann machine: A hardware accelerator for combinatorial optimization and deep learning. In 2016 IEEE International Symposium on High Performance Computer Architecture (HPCA), 1–13 (IEEE, 2016).
Kiraly, B., Knol, E. J., van Weerdenburg, W. M., Kappen, H. J. & Khajetoorians, A. A. An atomic Boltzmann machine capable of self-adaption. Nat. Nanotechnol. 16, 1–7 (2021).
Article Google Scholar
Byrnes, T., Koyama, S., Yan, K. & Yamamoto, Y. Neural networks using two-component Bose–Einstein condensates. Sci. Rep. 3, 1–7 (2013).
Article Google Scholar
Mizushima, K., Goto, H. & Sato, R. Large-scale Ising-machines composed of magnetic neurons. Appl. Phys. Lett. 111, 172406 (2017).
Article ADS Google Scholar
Reis, D., Laguna, A. F., Niemier, M. & Hu, X. S. Attention-in-memory for few-shot learning with configurable ferroelectric FET arrays. In Proceedings of the 26th Asia and South Pacific Design Automation Conference, 49–54 (2021).
Yin, X., Chen, X., Niemier, M. & Hu, X. S. Ferroelectric FETs-based nonvolatile logic-in-memory circuits. In IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 27, 159–172 (2018).
Albash, T. & Lidar, D. A. Adiabatic quantum computation. Rev. Mod. Phys. 90, 015002 (2018).
Article ADS Google Scholar
Das, A. & Chakrabarti, B. K. Colloquium: Quantum annealing and analog quantum computation. Rev. Mod. Phys. 80, 1061–1081 (2008).
Article ADS MATH Google Scholar
Johnson, M. et al. Quantum annealing with manufactured spins. Nature 473, 194–198 (2011).
Article ADS CAS Google Scholar
Albash, T. & Lidar, D. A. Demonstration of a scaling advantage for a quantum annealer over simulated annealing. Phys. Rev. X 8, 031016 (2018).
CAS Google Scholar
Hauke, P., Katzgraber, H. G., Lechner, W., Nishimori, H. & Oliver, W. D. Perspectives of quantum annealing: Methods and implementations. Rep. Prog. Phys. 83, 054401 (2020).
Article ADS CAS Google Scholar
Utsunomiya, S., Takata, K. & Yamamoto, Y. Mapping of Ising models onto injection-locked laser systems. Opt. express 19, 18091–18108 (2011).
Article ADS Google Scholar
Marandi, A., Wang, Z., Takata, K., Byer, R. L. & Yamamoto, Y. Network of time-multiplexed optical parametric oscillators as a coherent Ising machine. Nat. Photonics 8, 937–942 (2014).
Article ADS CAS Google Scholar
Inagaki, T. et al. A coherent Ising machine for 2000-node optimization problems. Science 354, 603–606 (2016).
Article ADS CAS Google Scholar
Hamerly, R. et al. Experimental investigation of performance differences between coherent Ising machines and a quantum annealer. Sci. Adv. 5, eaau0823 (2019).
Article ADS Google Scholar
McMahon, P. L. et al. A fully programmable 100-spin coherent Ising machine with all-to-all connections. Science 354, 614–617 (2016).
Article ADS CAS Google Scholar
Nabors, C. D., Yang, S. T., Day, T. & Byer, R. L. Coherence properties of a doubly-resonant monolithic optical parametric oscillator. J. Opt. Soc. Am. 7, 815–820 (1990).
Article ADS CAS Google Scholar
Marandi, A., Leindecker, N. C., Pervak, V., Byer, R. L. & Vodopyanov, K. L. Coherence properties of a broadband femtosecond mid-IR optical parametric oscillator operating at degeneracy. Opt. Express 20, 7255–7262 (2012).
Article ADS Google Scholar
Serkland, D. K., Bartolini, G. D., Agarwal, A., Kumar, P. & Kath, W. L. Pulsed degenerate optical parametric oscillator based on a nonlinear-fiber Sagnac interferometer. Opt. Lett. 23, 795–797 (1998).
Article ADS CAS Google Scholar
Yamaoka, M. et al. A 20k-spin Ising chip to solve combinatorial optimization problems with CMOS annealing. IEEE J. Solid-State Circuits 51, 303–309 (2015).
Google Scholar
Su, Y., Kim, H. & Kim, B. Cim-spin: A 0.5-to-1.2 V scalable annealing processor using digital compute-in-memory spin operators and register-based spins for combinatorial optimization problems. In 2020 IEEE International Solid-State Circuits Conference (ISSCC), 480–482 (IEEE, 2020).
Yamamoto, K. et al. A time-division multiplexing Ising machine on FPGAs. In Proceedings of the 8th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, 1–6 (2017).
Aadit, N. A. et al. Massively parallel probabilistic computing with sparse Ising machines. arXiv:2110.02481 (2021).
Raychowdhury, A. et al. Computing with networks of oscillatory dynamical systems. Proc. IEEE 107, 73–89 (2018).
Article Google Scholar
Mallick, A. et al. Using synchronized oscillators to compute the maximum independent set. Nat. Commun. 11, 1–7 (2020).
Article Google Scholar
Mallick, A., Bashar, M. K., Truesdell, D. S., Calhoun, B. H., Joshi, S. & Shukla, N. Graph coloring using coupled oscillator-based dynamical systems. In 2021 IEEE International Symposium on Circuits and Systems (ISCAS), 1–5 (2021).
Traversa, F. L. & Di Ventra, M. Universal memcomputing machines. IEEE Trans. Neural Netw. Learn. Syst. 26, 2702–2715 (2015).
Article Google Scholar
Yu, E. et al. Ferroelectric FET based coupled-oscillatory network for edge detection. IEEE Electron Device Lett. 42(11), 1670–1673 (2021).
Article ADS CAS Google Scholar
Chou, J. et al. Analog coupled oscillator based weighted ising machine. Sci. Rep. 9, 14786 (2019).
Article ADS Google Scholar
Dutta, S., Khanna, A., Gomez, J., Ni, K., Toroczkai, Z. & Datta, S. Experimental demonstration of phase transition nano-oscillator based Ising machine. In 2019 IEEE International Electron Devices Meeting (IEDM), 37–8 (IEEE, 2019).
Ahmed, I., Chiu, P.-W. and Kim, C. H. A probabilistic self-annealing compute fabric based on 560 hexagonally coupled ring oscillators for solving combinatorial optimization problems. In Proc. IEEE Symp. VLSI Circuits 1–2 (2020).
Bashar, M. K. et al. Experimental demonstration of a reconfigurable coupled oscillator platform to solve the max-cut problem. IEEE. J. Explor. Solid State Comput. Devices Circuits. 6(2), 116–121 (2020).
Article ADS Google Scholar
Mallick, A., Bashar, M. K., Truesdell, D. S, Calhoun, B. H. & Shukla, N. Overcoming the accuracy vs. performance trade-off in oscillator ising machines. In 2021 IEEE International Electron Devices Meeting (IEDM), 40.2.1–40.2.4 (2021).
Dutta, S. et al. An Ising Hamiltonian solver based on coupled stochastic phase-transition nano-oscillators. Nat. Electron. 4, 502–512 (2021).
Article Google Scholar
Wang, Z., Khandelwal, S. & Khan, A. I. Ferroelectric oscillators and their coupled networks. IEEE Electron. Device Lett. 38(11), 1614–1617 (2017).
Article ADS CAS Google Scholar
Wang, Z. & Khan, A. I. Ferroelectric relaxation oscillators and spiking neurons. IEEE J. Explor. Solid State Comput. Devices Circuits 5(2), 151–157 (2019).
Article ADS Google Scholar
Afoakwa, R., Zhang, Y., Vengalam, U. K. R., Ignjatovic, Z. & Huang, M. BRIM: Bistable resistively-coupled ising machine. In 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA). 749–760 (IEEE, 2021).
Roychowdhury, J. Bistable latch ising machines. In International Conference on Unconventional Computation and Natural Computation 131–148. (Springer, 2021).
Trentzsch, M. et al. A 28 nm HKMG super low power embedded NVM technology based on ferroelectric FETs. In 2016 IEEE International Electron Devices Meeting (IEDM) 11–15 (IEEE, 2016).
Deng, S. et al. A comprehensive model for ferroelectric FET capturing the key behaviors: Scalability, variation, stochasticity, and accumulation. In 2020 IEEE Symposium on VLSI Technology, 1–2 (2020).
https://www.synopsys.com/content/dam/synopsys/verification/datasheets/hspice-ds.pdf.
https://ptm.asu.edu/.
https://cseweb.ucsd.edu/classes/wi10/cse241a/assign/hspice_sa.pdf.
Biq Mac Solver—Binary quadratic and max cut solver. Accessed 1 Aug 2020. [Online]. http://biqmac.uni-klu.ac.at.
Boumal, N., Mishra, B., Absil, P. A., & Sepulchre, R. Manopt, a Matlab toolbox for optimization on manifolds. J. Mach. Learn. Res. 15 (2014).
Abrar, K. A. et al. BEOL compatible superlattice FerroFET-based high precision analog weight cell with superior linearity and symmetry. In 2021 IEEE International Electron Devices Meeting (IEDM), 19–6. (IEEE, 2021).
Jerry, M. et al. Ferroelectric FET analog synapse for acceleration of deep neural network training. In 2017 IEEE International Electron Devices Meeting (IEDM), 6–12. (IEEE, 2017).
Shelby, F. S. et al. Wake-up and fatigue mechanisms in ferroelectric Hf0. 5Zr0. 5O2 films with symmetric RuO2 electrodes. J. Appl. Phys. 130(13), 134101 (2021).
Article ADS Google Scholar
Shelby, F. S. et al. Phase-exchange-driven wake-up and fatigue in ferroelectric hafnium zirconium oxide films. ACS Appl. Mater. Interfaces. 12(23), 26577–26585 (2020).
Article Google Scholar

Download references

Acknowledgements

This work was supported in part by NSF ASCENT grant (No. 2132918). It is also partially supported in part by the Army Research Office under Grant Number W911NF-21-1-0341. We also acknowledge GlobalFoundries, Dresden Germany for providing the testing devices.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of Virginia, Charlottesville, VA, 22904, USA
Antik Mallick, Mohammad Khairul Bashar & Nikhil Shukla
Department of Electrical and Microelectronic Engineering, Rochester Institute of Technology, Rochester, NY, 14623, USA
Zijian Zhao & Kai Ni
Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, 37996, USA
Shamiul Alam, Md Mazharul Islam & Ahmedullah Aziz
Department of Computer Science and Engineering, Pennsylvania State University, State College, PA, 16801, USA
Yi Xiao, Yixin Xu & Vijaykrishnan Narayanan

Authors

Antik Mallick
View author publications
You can also search for this author in PubMed Google Scholar
Zijian Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Khairul Bashar
View author publications
You can also search for this author in PubMed Google Scholar
Shamiul Alam
View author publications
You can also search for this author in PubMed Google Scholar
Md Mazharul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Yi Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Yixin Xu
View author publications
You can also search for this author in PubMed Google Scholar
Ahmedullah Aziz
View author publications
You can also search for this author in PubMed Google Scholar
Vijaykrishnan Narayanan
View author publications
You can also search for this author in PubMed Google Scholar
Kai Ni
View author publications
You can also search for this author in PubMed Google Scholar
Nikhil Shukla
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.M. and N.S. developed the main idea. K.N., A.A. and V.N. participated in discussions and further development of the idea. A.M. and Z.Z. performed the experiments. Y.X. and Y.X. developed the FeFET model. A.M., M.K.B., S.A., M.M.I. performed the simulations. A.M., K.N. and N.S. wrote the manuscript. All authors discussed the results and commented on the manuscript.

Corresponding author

Correspondence to Nikhil Shukla.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mallick, A., Zhao, Z., Bashar, M.K. et al. CMOS-compatible ising machines built using bistable latches coupled through ferroelectric transistor arrays. Sci Rep 13, 1515 (2023). https://doi.org/10.1038/s41598-023-28217-8

Download citation

Received: 12 August 2022
Accepted: 16 January 2023
Published: 27 January 2023
DOI: https://doi.org/10.1038/s41598-023-28217-8

This article is cited by

Ferroelectric compute-in-memory annealer for combinatorial optimization problems
- Xunzhao Yin
- Yu Qian
- Kai Ni
Nature Communications (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.