Highly parallel and ultra-low-power probabilistic reasoning with programmable gaussian-like memory transistors

Lee, Changhyeon; Rahimifard, Leila; Choi, Junhwan; Park, Jeong-ik; Lee, Chungryeol; Kumar, Divake; Shukla, Priyesh; Lee, Seung Min; Trivedi, Amit Ranjan; Yoo, Hocheon; Im, Sung Gap

doi:10.1038/s41467-024-46681-2

Download PDF

Article
Open access
Published: 18 March 2024

Highly parallel and ultra-low-power probabilistic reasoning with programmable gaussian-like memory transistors

Nature Communications volume 15, Article number: 2439 (2024) Cite this article

1480 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Probabilistic inference in data-driven models is promising for predicting outputs and associated confidence levels, alleviating risks arising from overconfidence. However, implementing complex computations with minimal devices still remains challenging. Here, utilizing a heterojunction of p- and n-type semiconductors coupled with separate floating-gate configuration, a Gaussian-like memory transistor is proposed, where a programmable Gaussian-like current-voltage response is achieved within a single device. A separate floating-gate structure allows for exquisite control of the Gaussian-like current output to a significant extent through simple programming, with an over 10000 s retention performance and mechanical flexibility. This enables physical evaluation of complex distribution functions with the simplified circuit design and higher parallelism. Successful implementation for localization and obstacle avoidance tasks is demonstrated using Gaussian-like curves produced from Gaussian-like memory transistor. With its ultralow-power consumption, simplified design, and programmable Gaussian-like outputs, our 3-terminal Gaussian-like memory transistor holds potential as a hardware platform for probabilistic inference computing.

Bringing uncertainty quantification to the extreme-edge with memristor-based Bayesian neural networks

Article Open access 20 November 2023

Hardware implementation of Bayesian network based on two-dimensional memtransistors

Article Open access 23 September 2022

Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning

Article 22 June 2023

Introduction

With the remarkable strides of AlphaGo, the integration of deep learning into artificial intelligence (AI) has assumed a pivotal role across diverse industrial domains, encompassing autonomous driving^1,2,3, image recognition^4,5,6, and translation^7,8. However, the intricate interplay of environmental factors such as variability, occlusion, and fluctuations in lighting, coupled with sensor constraints like noise, limited resolution, and range, collectively employ a considerable multitude of uncertainties in perception^9,10,11. Regrettably, conventional AI systems, which generally disregard such kinds of uncertainty, often yield inaccurate predictions and compromise the dependability of inferences from deep learning models. To combat this overconfidence quandary, a critical imperative emerges: adopting a calculation methodology that can faithfully embrace uncertainty^12,13. Probabilistic inference procedures, rooted in probability theory and statistical techniques, rise to the challenge, skillfully navigating the complexities of diverse uncertainties in perception and learning. By adeptly representing and reasoning unfamiliar predictive models or ambiguous data patterns that breed uncertainty, probabilistic inference models empower decision-makers to exercise heightened discernment and adaptability in the face of evolving circumstances^14,15,16.

However, the decision-making with probabilistic models also necessitates substantial computational resources^17,18,19. Gaussian distribution and its mixtures are typically employed in probabilistic inference procedures²⁰. The central limit theorem states that the sum of many independent and identifiably distributed random variables, regardless of their underlying distribution, can be approximately normally distributed²¹. By combining multiple Gaussian functions through a mixture, multi-modal statistical data can be efficiently modeled. Nevertheless, evaluating functions of Gaussian mixture model (GMM) using a digital data-path often results in excessive computational workload^22,23. Notably, the computation of a GMM’s output entails a multitude of multiplications, additions, and look-up operations. Such workload in computation grows with the signal dimension and the complexity of the model, determined by the number of mixture functions in the GMM²⁴. Given the importance and versatility of GMM, previous studies have investigated GMM computation hardware intensively utilizing dynamic random-access memory (DRAM)²⁵, field-programmable gate array (FPGA)^26,27,28, or analog circuits. However, usually a large number of transistors was required, which inevitably resulted in a high level of complexity²⁹. Consequently, without innovative technologies that can dramatically reduce resource demands when dealing with high-dimensional complex distribution functions like GMMs, the advantages of probabilistic inference procedures remain out of reach for numerous applications.

Pursuing a probabilistic inference, significant strides have been made in recent years to develop devices capable of physically emulating the characteristics of a Gaussian function^30,31,32,33. This physics-based computing approach, using the Gaussian devices, holds the potential for a revolutionary enhancement in energy efficiency, thereby enabling efficient operation of GMMs with highly constrained resources. For instance, analog circuits based on silicon-based complementary metal–oxide–semiconductor (CMOS) technology have been used to emulate Gaussian functions. However, in conventional Si metal-oxide-semiconductor field-effect transistor (MOSFET)-based circuits, at least four transistors are required to represent a Gaussian distribution^34,35. Michail et al. presented a GMM-based classifier by connecting NMOS and PMOS bump circuits with a winner-takes-all (WTA) circuit³⁶. Another study, Vassilis et al. proposed low-power bell-shaped analog classifiers (CLFs) by implementing GMM (machine learning) circuits with fewer number of transistors and three input values, and tested on real-world biomedical classification problems³⁷. Nevertheless, to tune the characteristics of the distribution function, the number of transistors increases often to more than 10, rendering the circuit quite complex and inevitably increasing power consumption^31,32,38,39.

Here, we show a device, called Gaussian-like memory transistor (GMT), yielding a Gaussian-like current (I)−voltage (V) relationship in a single heterojunction transistor device. Moreover, each semiconductor has a separate floating gate (FG), allowing independent adjustment of their channel conductivity, thereby enabling systematic control of Gaussian-like expression through the FG. The GMT demonstrates controllable Gaussian-like I − V characteristics based on programming states of the semiconductors, exhibiting over 10000 s retention, operational stability against repeated cycles. Leveraging GMT’s Gaussian-like behavior and controllability, we pioneer a component architecture that allows for the implementation of inference capabilities for probabilistic robotics applications through a remarkably simple design. The GMT devices possess the remarkable capacity to adjust Gaussian-like shapes extensively, all the while maintaining an impressively low power consumption profile. This attribute empowers us to offer practical calculations that effectively mitigate the challenge of overconfidence.

Results

Design of the Gaussian-like memory transistor

A schematic diagram of our approach is shown in Fig. 1a–c. We adopt a Bayesian filtering approach for different probabilistic inference tasks, and construct the operational framework using GMM-based probability distribution functions within GMT arrays^40,41,42,43. By modeling the likelihood function as a mixture of Gaussian functions, the posterior weight density storage can be computed. Although leveraging distribution functions enables energy-efficient, low-latency, and on-board processing, minimizing evaluation workload, representing a single probability distribution function necessitates numerous devices to perform multiplication and accumulation operations (Fig. 1d, e)^{19,44,45,46,47}. To effectively handle diverse situations through precise control over the output format of Gaussian distributions, this work proposes a simple 3-terminal GMT device capable of representing precisely tunable Gaussian-like distributions (Fig. 1e). The GMT utilizes an organic semiconductor heterojunction, where a p-type semiconductor is connected to an n-type counterpart at the midpoint of the channel layer, inducing anti-ambipolar characteristics shaped by each semiconductor’s subthreshold region. To achieve Gaussian distribution-like symmetric outputs, N,N’-ditridecylperylene-3,4,9,10-tetracarboxylic diimide (PTCDI-C13) and pentacene were utilized as n-type and p-type semiconductors, respectively, owing to their comparable charge transport performance^48,49 (Supplementary Fig. 1).

**Fig. 1: Design of Gaussian-like memory transistor.**

For systematic controllability of Gaussian-like output characteristics by independent modulation of p- and n-type channel conductance, separate FG structures were introduced beneath both p- and n-type semiconductors. Gold nanoparticles (AuNPs) served as separate FGs due to their high charge-loss tolerance resulting from discrete spatial distribution^50,51,52 (Supplementary Fig. 2). The independent control of channel conductivity of each semiconductor through FGs allows for versatile and precise control over the characteristics of the Gaussian-like I − V characteristics, enabling efficient modeling of complex probability distribution functions as well as representing the target distribution by connecting GMT arrays (Fig. 1f, g). To achieve memory performance, careful consideration of both the floating gate layer and several other insulating layers are important. The blocking dielectric layer (BDL) must possess exceptional insulating characteristics to impede charge leakage and current flow to the gate electrode. Additionally, the tunneling dielectric layer (TDL) should not only facilitate tunneling during programming voltage application but also ensure the non-leakage of stored charge. Therefore, the TDL requires appropriate insulating properties, especially under low-thickness conditions. By strategically placing the vacuum-deposited polymer dielectric films with different electrical properties at appropriate locations within the device^{50,53,54,55,56}. A significant reduction in operating voltage and programming/erasing voltage of the GMT device was achieved, which is crucial in the realization of low-power, energy-efficient GMT arrays (Supplementary Fig. 3).

The high-resolution transmission electron microscope (HRTEM) images, optical microscope (OM) images and corresponding energy dispersive spectroscopy (EDS) mapping clearly confirm that the flash memory structure was successfully implemented (Fig. 2a–c and Supplementary Fig. 4). As shown in the optical microscope (OM) image, the GMT was operated by connecting pentacene and PTCDI-C13 into drain (D) and source (S) electrodes, respectively. As shown in Fig. 2d, a Gaussian distribution-like current output was indeed implemented in the GMT device, regardless of different drain voltages (V_D). The operating voltage of the GMT devices was reduced to be less than 3 V with further scaling down the thickness of dielectric layers (Supplementary Fig. 17). With the variation in V_D, only off voltage (V_off) was affected, while on voltage (V_on) remained constant_, causing the variation of V_off with the change in V_D^57,58 (Supplementary Fig. 6). The Gaussian distribution-like I − V transfer characteristics were modeled through the following equations below⁵⁹:

$${I}_{D}=A\,\exp \left[-\frac{{({V}_{{BG}}-\mu )}^{2}}{2{\sigma }^{2}}\right]{{{{{\rm{;}}}}}}$$

(1)

where the A, μ, and σ represent the amplitude, mean, and standard deviation of the gaussian function, respectively. A strong correlation between the experimentally measured data and fitted curves was observed (Supplementary Fig. 6). The operational stability of our GMT device was also monitored where Gaussian distribution-like transfer curve was fully preserved in the repetitive operation (Fig. 2e). Over 100 consecutive operations, only marginal changes in μ (less than 100 mV) and σ (less than 30 mV) was observed (Fig. 2f). Furthermore, uniform operation was demonstrated across 8 devices, securing the potential for large-scale circuits (Supplementary Fig. 7).

**Fig. 2: The structure and basic operation of GMT device.**

Individual programming of channel conductance

For the hardware implementation of diverse probabilistic model within a single device, it is highly desirable to retain precise control of the Gaussian distribution characteristics^38,60. To validate the controllability of the channel conductivity through the separate FGs, we analyzed the change in I − V characteristics by the programming operation, as shown in Fig. 3a. It should be noted that we applied reverse bias to D (or S) electrode for programming p-type (or n-type) semiconductor, instead of applying V_G directly into the global gate electrode^53,55,61. We refer to this inverse bias for p- and n-type semiconductors as programming voltage of p-type (V_prg,P) and n-type semiconductor (V_prg,N), respectively.

**Fig. 3: GMT Device programming method.**

The changes in transfer curves depending on programming operation of n-type depicted in Fig. 3b. These programming operations were investigated by using an incremental step pulse programming (ISPP) technique^55,62, with a common pulse width set to 1 s. The shape of the transfer curve of the GMT device is determined by the conductivity of the p- and n-channel. In other words, the current switching behavior of the GMT device is closely dependent on the V_T of p-channel (V_T,P) and n-channel (V_T,N). When a programming voltage is applied, electrons (or holes) trapping occurs through tunneling, similar to conventional floating gate memory^53,55,61. When the electrons are stored in the FG, they induce an additional negative voltage, causing the V_T shift toward positive direction. Analogously, stored holes force the V_T shift toward negative direction. The V_T,P and V_T,N were gradually changed along with the applied V_prg,P and V_prg,N, respectively. Initially, the V_T values of each channel layer were measured to be V_T,N = 2.01 V and V_T,P = 4.68 V. As shown in Fig. 3c, with the increasing absolute value of V_prg,N, the amount of holes (or electrons) in the n-type FG increased. Therefore, the change of V_T,N (ΔV_T,N) reached +1.20 V and −1.35 V at the V_prg,N = +22 V and V_prg,N = − 22 V, respectively.

Similarly, the transfer curve changes depending on programming operation p-type semiconductors depicted in Fig. 3d, and by p-type programming operation, V_T,P also experienced a shift; the change of V_T,P (ΔV_T,P) reached −1.09 V and +2.12 V at the each point of V_prg,P = − 20 V and V_prg,P = +24 V (Fig. 3e). These observations confirm that the programming on each FG beneath the corresponding channel layer can efficiently suppress unintended programming operation of FG on the other side, and enables the independent control of the n- and p-channels through the programming operation. Thanks to these advantageous attributes, while one side’s threshold voltage (V_T) shifts, the other remains unchanged and consistently maintained.

Gaussian factor control

Beyond the accomplished control of Gaussian-like distribution through individual FG programming, the concurrent programming through the desirable ways allows for the meticulous manipulation of μ or σ, while ensuring minimal interference with other parameters (Fig. 4a, b). For probabilistic inference operations applying Bayesian filtering, it is essential to adjust μ and σ in a way that avoids interference between them. Therefore, the programming process is designed to minimize variations in σ when μ changes, and vice versa, ensuring that changes in σ result in minimal alterations to μ. Based on the programming method suggested above, we devise two distinctive programming methods to control the Gaussian-like distribution of our GMT device. Case 1 is the injected charges have the same polarity and quantity, an equal amount of V_T shift arises in each channel layer (ΔV_T,P = ΔV_T,N). This leads to a parallel shift in the transfer curve (μ regulation) without perturbing σ, and the direction of the V_T shift corresponds to the polarity of charges injected through FGs (Fig. 4a). Case 2 is when the injected charges have opposite polarity and the same quantity, and an opposite direction of V_T shift occurs with the same magnitude (ΔV_T,P = −ΔV_T,N), This results in a shape change of the transfer curve, leading to σ regulation while retaining the constant μ (Fig. 4b).

Figure 4c, d demonstrates the change in the Gaussian distribution-like transfer curve when the injected charge carriers have the same polarity and quantity (ΔV_T,P = ΔV_T,N, Case 1). When holes were trapped on both n- and p-type FGs, the V_T of both channel layers shifted in the negative direction to the same extent, resulting in a parallel movement of the Gaussian-like curve towards the negative direction (Fig. 4c). Likewise, when electrons were trapped in both FGs, the V_T shifted towards the positive V_G direction for both channel layers, leading to a parallel movement of the Gaussian-like curve towards the positive direction without changing the Gaussian-like curve shape (Fig. 4d). As shown in Fig. 4e, due to the V_T shift induced by the amount of trapped charges, a near-linear relationship between μ and the programming voltage was achieved⁶¹. The initial μ value was observed to be 4 V and it gradually shifted after programming. The μ value reached 2.1 V with a negative movement (V_prg,P = − 25 V and V_prg,N = − 24 V) and reached 6.4 V with a positive movement (V_prg,P = +23 V and V_prg,N = +22 V), which is substantial variation considering the width value of 1.92 V set as 6σ following the 3σ rule in Supplementary Note 4. In the parallel shift of the transfer curve, only practically negligible variation was detected in σ (Fig. 4e).

Under the condition where the injected charge carriers have different polarity (ΔV_T,P = − ΔV_T,N, Case 2), the σ regulation could be accomplished (Fig. 4f, g). When holes were trapped on the n-type FG and electrons were trapped on the p-type FG, V_T,N and V_T,P moved in opposite directions, leading to the widening of the Gaussian distribution-like output (Fig. 4f). Conversely, when the opposite charges are trapped, V_T,N and V_T,P moved closer to each other, resulting in the narrowing the distribution (Fig. 4g). The changes in the σ value decreased to 0.23 V at the V_prg,P = − 22 V and V_prg,N = +20.5 V from the initial value of 0.32 V, and increased to 0.54 V at the V_prg,P = +21.5 V and V_prg,N = − 21 V as shown in Fig. 4h. While σ undergoes systematic change, the μ value remain virtually constant (Fig. 4h). Detailed programming conditions of μ and σ regulations are described in Supplementary Fig. 13. These results clearly demonstrate that the adjustment of Gaussian distribution-like I − V characteristics including μ and σ are successfully achieved by simple programming operation by a single 3-terminal heterojunction device. These results demonstrate that GMT device characteristics can be altered with fewer transistors compared to previous studies (Table 1).

Table 1 Table of the performance of the reported CMOS-based Gaussian I-V characteristics system and GMT device

Full size table

Electrical and mechanical stability characteristics

To ensure reliable and versatile operation, we examined the retention performance of our GMT device (Fig. 5a, b). For the retention performance in μ regulation, we applied V_prg,P = +21 V and V_prg,N = +20 V with 1 s pulse width to induce a sufficient change in μ value (1.19 V). To assess the retention capability in σ regulation, V_prg,P = + 16.5 V and V_prg,N = − 17 V with 1 s pulse width was applied to double the maximum current value. The changes in μ and σ values over time were extracted from the transfer curves (Fig. 5c). The μ value at 1000 s, 3000 s, and 10,000 s after programming showed 5.14 V, 5.11 V, and 5.05 V, respectively, which indicates the change in μ was less than 0.14 V from the initial μ value (5.19 V) even after 10,000 s. The change in σ was also minimal, remaining within 0.016 V of the initial value of 0.440 V (0.433 V, 0.420 V, and 0.418 V at 1000 s, 3000 s, and 10000 s, respectively). These retention characteristics are attributed to the trapped charge on the isolated AuNP and the minimized leakage path in the dielectric layers^50,51,52. For μ and σ cycle endurance measurement, appropriate signs of V_prg,P and V_prg,N were repeatedly cycled. Remarkably, the GMT device also exhibited cyclic endurance (Supplementary Fig. 16). These retention and endurance characteristics of our GMT device are essential for reliable inference computations^63,64,65. The separate FG structure, introduced to control Gaussian-like I-V characteristics, leads to a remarkable reduction of over 50 times in power consumption and latency than other conventional CMOS techniques, when we calculate GMT characteristics emulated with a 45 nm CMOS circuit. Furthermore, the device configuration with just three terminals single transistor contributes to its inherent simplicity, rendering it highly advantageous for future high-density integration.

**Fig. 5: Stability of the GMT device.**

The proposed GMT devices also offer a promising solution for probabilistic inference in wearable devices and robotics⁶⁶, where mechanical stability is crucial to meet various form factors^67,68,69. To validate the reliable operation of our GMT device in such applications, we fabricated it on a flexible polyethylene naphthalate (PEN) substrate (Fig. 5d). Figure 5e demonstrates that the Gaussian distribution-like current output remained intact even under the applied tensile strain as high as 2.0%. Furthermore, the GMT device exhibited remarkable electrical stability even after 1000 bending cycles at a fixed applied tensile strain of 1.2% (Fig. 5f). Although marginal current level decrease was observed and the voltage with maximum current point (V_max) also shifted slightly due to the small dimensional variation by the applied strain, the I − V characteristics fully recovered after releasing from the tensile strain, supporting that these changes are not caused by device degradation^{56,70,71,72,73} (Fig. 5g, h). These results confirm the GMT device shows full compatibility with flexible substrates.

Simulation for insect size-drones mapping model

Next, we evaluate the performance of GMT-based architectures for probabilistic reasoning for key autonomous navigation objectives, namely, drone localization and obstacle-free path planning⁷⁴. Figure 6a, b show crossbars of GMTs that can physically evaluate likelihood functions for probabilistic localization and path planning, respectively, which use a co-designed map function discussed below, to significantly accelerate their processing.

**Fig. 6: Robotic drone applications based on simulated GMT array.**

Compared to CMOS-based digital accelerator, that sequentially computes a likelihood function over hundreds of clock cycles, the architecture of GMT-based crossbar is significantly simplified in the circuit and has increased energy efficiency (Fig. 6a–c). The currently demonstrated GMT devices are sizable and therefore operate at elevated voltages, approximately 10 V. To evaluate the advantages of the scaled GMT-based architectures for probabilistic inference against digital CMOS, we emulated GMT characteristics with a 45 nm CMOS circuit (Supplementary Fig. 18). It is worthwhile to note that increasing the mixture functions in GMT network in Fig. 6a only increases the number of GMT devices but still requires three logarithmic analog-to-digital converters (log-ADCs) and three digital-to-analog converters (DACs). The energy dissipation of the proposed structure for computing the likelihood from a 100-mixture component GMM, in comparison to that of digital data-path is presented in Table 2 and Supplementary Tables 1 and 2. For computing the likelihood, the digital data-path requires thousands of addition/subtraction and comparison, and hundreds of multiplications and read operations. Such extensive workload results into ~941 pJ energy dissipation. Meanwhile, only a handful of operations are needed for the proposed GMT-based architecture since the Gaussian-like computations can be directly mapped to the characteristics of the device, only requiring ~18.33 pJ energy (Supplementary Table 2). Thus, the proposed scheme requires about 50 times lower energy than the digital data-path. Furthermore, when compared to the state-of-the-art 14 nm and 7 nm digital CMOS, it also exhibits slightly improved performance (Supplementary Table 3).

Table 2 Table of improvement in key performance metrics for likelihood computation using 100-mixture, 3-dimensional density function at 45 nm technology node

Full size table

By leveraging Bayes’ rule, the particle filtering method seamlessly integrates single-shot maximum a posteriori (MAP) estimates of robot positions with sequential localization stages (Supplementary Note 7). In particular, we co-designed mapping function for the flying domain from the traditionally used GMM-based map models to better suit the characteristics of GMTs and adapted expectation-maximization procedures to learn the co-designed map models. To better match the measured characteristics of GMT, an accurate fitting was attempted using the following equation (Supplementary Fig. 6):

$${I}_{D}={I}_{0}\exp \left(-{abs}{\left(\frac{{V}_{G}-{V}_{\mu }}{{\sigma }^{\pm }\left({V}_{\mu }\right)}\right)}^{\eta }\right)$$

(2)

where I₀ is the peak current magnitude which is invariant to V_µ and only depends on the transistor size and V_D. σ⁺ is a fitting parameter modeling I_D roll-off when V_G > V_µ. σ⁻ models I_DS roll-off when V_GS < Vµ. η is a fitting parameter modeling the power law index of log-I_D and V_G dependence. At varying V_µ, I_D − V_G characteristics resemble a skewed Gaussian-like function where the skewness itself is controlled by the peak gate voltage. We modeled such V_µ-dependent, skewed Gaussian-like I_D−V_G of the GMT transistor.

Even though each GMT only emulates one-dimensional (1D) Gaussian-like function, the higher-dimensional maps can be efficiently implemented through an array of GMT devices, without requiring complex analog multiplications, by co-designing the map function to GMT characteristics. A series connection of GMTs is considered (Fig. 6a, b). Since the gate-controlled conductance of each GMT follows a Gaussian-like function, resultantly, their column current follows a harmonic mean of Gaussian-like (HMGL) function. HMGL is a multidimensional function, matching the dimension of map model, and is given by

$${{{{{\rm{HMGL}}}}}}\left({V}_{X},{V}_{X},{V}_{X}\right)=\frac{1}{\frac{1}{G\left({\mu }_{X},{\sigma }_{X},{V}_{X}\right)}+\frac{1}{G\left({\mu }_{Y},{\sigma }_{Y},{V}_{Y}\right)}+\frac{1}{G\left({\mu }_{Y},{\sigma }_{Y},{V}_{Y}\right)}}$$

(3)

Here, for Gaussian-like function along the x-axis, µ_x and σ_x are the programmed parameters of GMT that are learned using expectation-maximization procedure⁷⁵. V_X is the applied gate voltage. Likewise, Gaussian-like functions along y-axis and z-axis are defined. Utilizing a mixture of 3D HMGL functions, the 3D map of flying domain is modeled as

$${{{{{\rm{M}}}}}}\left(x,y,z\right)=\mathop{\sum }\limits_{i=1}^{N}\frac{1}{\frac{1}{G\left({\mu }_{{Xi}},{\sigma }_{{Xi}},{V}_{X}\right)}+\frac{1}{G\left({\mu }_{{Yi}},{\sigma }_{{Yi}},{V}_{Y}\right)}+\frac{1}{G\left({\mu }_{{Yi}},{\sigma }_{{Yi}},{V}_{Y}\right)}}$$

(4)

where arbitrary complex maps can be modeled with a sufficient number of HMGL functions.

To compute the log-likelihood of a predictive hypothesis using the mapping model in Eq. (4), the pixel outputs of depth measurements are projected to 3D based on a position and orientation (i.e., pose) hypothesis for the localizing drone. The transformed outputs are converted to analog domain and applied to the crossbar using digital-to-analog converters (DAC). The output current induced by the crossbar follows the log-likelihood of pose hypothesis as defined by Eq. (4). The log-likelihood output of multiple pixels of a depth map are summed to compute the net likelihood of a hypothesis. The computations are iterated over all hypotheses to decimate the less likely ones.

The hypothesis was projected onto the GMM-modeled domain map and the pose hypotheses converge based on this projected map (Fig. 6d, e). Using this as a foundation, the weighted mean trajectory of the drone position was derived from the considered hypothesis set in comparison with the ground truth (Fig. 6f). Significantly, our GMT-based co-designed approach fulfils a comparable performance despite various non-idealities such as skewed mapping of a Gaussian-like function and limited programmability of only mean variables of the characteristics.

Likewise, the point cloud map of obstacles in a room was fitted with the co-designed map model. Utilizing the GMT array, a graph of obstacle-free paths is prepared by initially randomly sampling points within the map domain. To verify the validity of the travel path between two adjacent points, the sampled points were interpolated in 3D and applied to the GMT array, while the array’s response output current was characterized. If the response current from the array remains below a small threshold for all interpolating points between the sample points, this denotes a very low likelihood of an obstacle present between these two points and the edge connecting them represents a valid path. Figure 6g presents the final graph representation of the potential path through the 3D point cloud map of obstacles. To enhance comprehension, Fig. 6h demonstrates path planning in a 2D environment. Based on the generated graph, the optimal obstacle-free path with the least distance between two locations can be determined using Dijkstra’s algorithm^76,77.

The simulation results demonstrate that implemented GMT arrays to obtain probabilistic likelihood functions allow for the analysis of dense graphs using much fewer node devices and reduced computational workload, thus enabling result optimization with rapid operation. Consequently, this approach holds the potential for mitigating overconfidence issues, by operating on probabilistic reasoning models with significantly enhanced both area and energy efficiency.

Discussion

A device architecture, called the GMT, is proposed, incorporating a heterojunction of p-type and n-type organic semiconductors along with a non-volatile flash memory structure. This conjunct structure allows for the systematic programming of Gaussian distribution-like I − V characteristics from a single device, making it possible to implement probabilistic inference in ultralow-power hardware and highly simplified circuit design. The separate FG memory structure enables independent programmability of the p-type and n-type channel conductance of the GMT device. By programming and erasing each FG, meticulous control of the amplitude (A), mean (μ), and standard deviation (σ) of the transfer curve output of the GMT device is achieved. This controllability, enabled by the separate FGs, not only facilitates the hardware implementation of probabilistic inference but also enhances model circuit feasibility while reducing power consumption and latency. The GMT devices also exhibit remarkable retention performance and cyclic endurance. Moreover, the GMT devices maintain their performance even with 2.0% of the applied tensile strain. Using the GMT device array and by co-designing the probabilistic inference model to device characteristics, we demonstrate applications towards 3D probabilistic localization and path finding for drones. The GMT device, featured by a simple unit transistor design with a heterojunction channel, is capable of representing output values in the form of a probability distribution function. Moreover, the separate floating-gate structure allows for the programmable modification of the probability distribution function without the need for additional components. Furthermore, the reconfigurable device design in this study has a great potential for the application to various probabilistic inference computing fields with increased computational efficiency and integration density thereof. The successful implementation of inference operations is indeed demonstrated by using actual data obtained from GMT devices. Therefore, the proposed heterojunction-based, Gaussian-like transistor embedding a separate floating gate memory structure can serve as a powerful platform for Bayesian operations. For instance, they can improve the robustness and scalability of speech recognition models that use GMMs and HMMs by efficiently processing large-scale GMMs. In natural language processing, GMTs can accelerate tasks such as word embedding and semantic analysis, thanks to Gaussian function-based techniques. Additionally, GMTs can expedite variational inference, Gaussian processes, and other algorithms by leveraging Gaussian functions to model uncertainties, making them valuable for Bayesian analysis, regression, classification, and optimization in fields like robotics, time-series forecasting, and spatial data analysis.

Methods

Device fabrication and characterization

For the fabrication of the GMT device, a 25 mm × 25 mm glass substrate (Samsung Corning Co.) underwent a cleaning process involving ultrasonication in deionized (DI) water, acetone, and isopropyl alcohol for 20 minutes, followed by drying with dry N₂ gas. The metal electrode and organic semiconductor were deposited via thermal evaporation in a vacuum of 2 × 10⁻⁶ Torr, with the thickness monitored in real-time using a quartz crystal microbalance (QCM). The gate electrode, consisting of thermally evaporated Al with a deposition rate of 0.1 nm s⁻¹, reached a thickness of 50 nm. The dielectric layers were deposited using an initiated chemical vapor deposition (iCVD) process. A high-k poly(2-cyanoethyl acrylate-co-diethylene glycol divinyl ether) [p(CEA-co-DEGDVE)] with optimized chemical composition (referred to as pC1D1) served as the blocking dielectric layer (BDL)⁵⁶, while a low-k poly(1,3,5-trivinyl-1,3,5-trimethyl cyclotrisiloxane) (pV3D3) was used as the tunneling dielectric layer (TDL)⁵⁴. The thicknesses of pC1D1 (BDL) and pV3D3 (TDL) were 100 nm and 14 nm, respectively. Additionally, gold (Au) was thermally evaporated with a deposition rate of 0.01 nm s⁻¹ to a thickness of 3 nm for use as the floating gate (FG) between the two dielectric layers⁵⁰. Organic semiconductors, including pentacene and PTCDI-C13, were also thermally deposited at a rate of 0.03 nm s⁻¹. PTCDI-C13 underwent recrystallization through thermal annealing at 160 °C for 1 h. For the source and drain electrodes, 70 nm-thick Au was thermally evaporated at a rate of 0.1 nm s⁻¹. The channel dimensions were 500 μm for width and 600 μm for length. Flexible GMT devices were fabricated on a 100 μm-thick PEN substrate, and the applied tensile strain was calculated using the following equation⁷⁸:

$$S=\frac{{d}_{{{{{{\rm{sub}}}}}}}}{2R+{d}_{{{{{{\rm{sub}}}}}}}}$$

(5)

where S is the tensile strain, and R and d_sub are the bending radius and substrate thickness, respectively. To obtain cross-section image of GMT device, the device was sliced by a focused ion beam (Helios Nanolab 450) and cross-sectional HRTEM images were obtained by using Cs-corrected TEM (Titan cubed G2, FEI) with EDS mapping analysis. The scanning probe microscope (XE-100, Park Systems) was used to obtain AFM images to analyze the surface morphologies with the scan size of 5 μm × 5 μm. To analyze the insulating properties of polymer dielectric layers, metal-insulator-metal (MIM) devices were fabricated where the polymer dielectric layers were deposited between thermally evaporated Al electrodes. The electrical characteristics were measured by B1500A semiconductor analyzer (Agilent Technologies). All the device fabrication and characterization were performed in the N₂-filled glovebox.

Deposition of polymeric thin film

pV3D3 and pC1D1 dielectric layers were deposited by using a custom-built iCVD reactor. V3D3 (95%, Gelest, USA), CEA ( > 95%, Tokyo Chemical Industry (TCI), Japan), and DEGDVE (99%, Sigma-Aldrich, USA) acted as monomers, while tert-butyl peroxide (TBPO, 98%, Aldrich, USA) served as the initiator. All chemicals were utilized as received, without any additional purification steps. For the deposition of pV3D3, 40 °C heated V3D3 and 30 °C heated TBPO were introduced into the chamber at flow rates of 2.5 standard cubic centimeter per minute (sccm), and 1 sccm, respectively. The chamber pressure was maintained at 100 mTorr, and the substrate temperature was set to 50 °C⁵⁴. In the case of pC1D1, CEA, DEGDVE, and TBPO were heated to 50°C, 50°C and, 30°C, respectively, and they were injected into the chamber at flow rates of 0.28 sccm, 0.28 sccm, and 0.48 sccm, respectively. The chamber pressure and substrate temperature were maintained at 60 mTorr and 30 °C, respectively⁵⁶. Additionally, the filament was heated to 130 °C to initiate the decomposition of the initiator into radicals. The thickness of the resulting polymer films was determined using a spectroscopic ellipsometer (M2000, Woollam).

Data availability

The data within the article and its supplementary Information are available from the corresponding authors upon request.

Code availability

The Code within the article and its supplementary Information are available from the corresponding authors upon request.

References

Grigorescu, S., Trasnea, B., Cocias, T. & Macesanu, G. A survey of deep learning techniques for autonomous driving. J. Field Robot. 37, 362–386 (2020).
Article Google Scholar
Yurtsever, E., Lambert, J., Carballo, A. & Takeda, K. A survey of autonomous driving: Common practices and emerging technologies. IEEE access 8, 58443–58469 (2020).
Article Google Scholar
Muhammad, K., Ullah, A., Lloret, J., Del Ser, J. & de Albuquerque, V. H. C. Deep learning for safe autonomous driving: Current challenges and future directions. IEEE Trans. Intell. Transportation Syst. 22, 4316–4336 (2020).
Article Google Scholar
Fujiyoshi, H., Hirakawa, T. & Yamashita, T. Deep learning-based image recognition for autonomous driving. IATSS Res. 43, 244–252 (2019).
Article Google Scholar
Wu, M. & Chen, L. Image recognition based on deep learning. In: 2015 Chinese automation congress (CAC) 542–546 (IEEE, 2015).
Tian, Y. Artificial intelligence image recognition method based on convolutional neural network algorithm. IEEE Access 8, 125731–125744 (2020).
Article Google Scholar
Ananthanarayana, T. et al. Deep learning methods for sign language translation. ACM Trans. Accessible Comput. (TACCESS) 14, 1–30 (2021).
Article Google Scholar
Li, P., Ning, Y. & Fang, H. Artificial intelligence translation under the influence of multimedia teaching to study English learning mode. The International Journal of Electrical Engineering & Education 60, 325–338 (2021).
Burns, B. & Brock, O. Sampling-based motion planning with sensing uncertainty. In: Proceedings 2007 IEEE International Conference on Robotics and Automation) 3313–3318 (IEEE, 2007).
Liu, J., Zhang, J. & Barnes, N. Modeling aleatoric uncertainty for camouflaged object detection. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision) 1445–1454 (2022).
Einhorn, H. J. & Hogarth, R. M. Ambiguity and uncertainty in probabilistic inference. Psychological Rev. 92, 433 (1985).
Article Google Scholar
Guo, C., Pleiss, G., Sun, Y. & Weinberger, K. Q. On calibration of modern neural networks. In: International conference on machine learning) 1321–1330 (PMLR, 2017).
Kendall, A. & Gal, Y. What uncertainties do we need in bayesian deep learning for computer vision? Adv. Neural. Inf. Process. Syst. 30, 1–11 (2017).
Indyk, P. Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms. (SIAM, 2014).
Calliess, J.-P. Conservative decision-making and inference in uncertain dynamical systems.). University of Oxford Oxford (2014).
Thrun, S. Probabilistic robotics. Commun. ACM 45, 52–57 (2002).
Article Google Scholar
Wu, M. & Goodman, N. Foundation posteriors for approximate probabilistic inference. Adv. Neural Inf. Process. Syst. 35, 5335–5347 (2022).
Google Scholar
Dagum, P. & Chavez, R. M. Approximating probabilistic inference in bayesian belief networks. IEEE Trans. Pattern Anal. Mach. Intell. 15, 246–255 (1993).
Article Google Scholar
Shukla, P., Shylendra, A. & Tulabandhula, T. Trivedi AR. MC 2 RAM: Markov chain Monte Carlo sampling in SRAM for fast Bayesian inference. In: 2020 IEEE International Symposium on Circuits and Systems (ISCAS) 1–5 (IEEE, 2020).
Seeger, M. Gaussian processes for machine learning. Int J. Neural Syst. 14, 69–106 (2004).
Article PubMed Google Scholar
Stark, B. A. Studying “moments” of the Central Limit theorem. Math. Enthus. 14, 53–76 (2017).
Article Google Scholar
Shi, M. & Bermak, A. An efficient digital VLSI implementation of Gaussian mixture models-based classifier. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 14, 962–974 (2006).
Article Google Scholar
Price, M., Glass, J. & Chandrakasan, A. P. A 6 mW, 5,000-word real-time speech recognizer using WFST models. IEEE J. Solid-State Circuits 50, 102–112 (2014).
Article ADS Google Scholar
Keriven, N., Bourrier, A., Gribonval, R. & Pérez, P. Sketching for large-scale learning of mixture models. Inf. Inference.: A J. IMA 7, 447–508 (2018).
Article MathSciNet Google Scholar
Yang, W., Yu, R., Jiang, W. & Shu, H. Efficient implementation of Gaussian Mixture Models using vote count circuit. In: Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific) 1–5 (IEEE, 2014).
Shi, M. Bermak A., Chandrasekaran S., Amira A. An efficient FPGA implementation of Gaussian mixture models-based classifier using distributed arithmetic. In: 2006 13th IEEE International Conference on Electronics, Circuits and Systems) 1276–1279 (IEEE, 2006).
Genovese, M. & Napoli, E. A. S. I. C. and FPGA implementation of the Gaussian mixture model algorithm for real-time segmentation of high definition video. IEEE Trans. very large scale Integr. (VLSI) Syst. 22, 537–547 (2013).
Article Google Scholar
He, C., Fu, H., Guo, C., Luk, W. & Yang, G. A fully-pipelined hardware design for gaussian mixture models. IEEE Trans. Computers 66, 1837–1850 (2017).
Article MathSciNet Google Scholar
Alimisis, V. et al. Nanopower Integrated Gaussian Mixture Model Classifier for Epileptic Seizure Prediction. Bioengineering 9, 160 (2022).
Article PubMed PubMed Central Google Scholar
Vrtaric, D., Ceperic, V. & Baric, A. Area-efficient differential Gaussian circuit for dedicated hardware implementations of Gaussian function based machine learning algorithms. Neurocomputing 118, 329–333 (2013).
Article Google Scholar
Gourdouparis, M., Alimisis, V., Dimas, C. & Sotiriadis, P. P. An ultra-low power,±0.3 V supply, fully-tunable Gaussian function circuit architecture for radial-basis functions analog hardware implementation. AEU-Int. J. Electron. Commun. 136, 153755 (2021).
Article Google Scholar
Alimisis, V. et al. Gaussian Mixture Model classifier analog integrated low-power implementation with applications in fault management detection. Microelectron. J. 126, 105510 (2022).
Article Google Scholar
Alimisis, V., Gourdouparis, M., Gennis, G., Dimas, C. & Sotiriadis, P. P. Analog gaussian function circuit: Architectures, operating principles and applications. Electronics 10, 2530 (2021).
Article Google Scholar
Gilbert, B. Translinear circuits: A proposed classification. Electron. Lett. 1, 14–16 (1975).
Article ADS Google Scholar
Delbrueck, T. & Mead, C. Bump circuits. In: Proceedings of International Joint Conference on Neural Networks) 1, 475–479 (1993).
Gkaltemis, M.-A., Lymperakis, G., Gennis, G., Alimisis, V., Sotiriadis, P. P. A Hardware-Friendly Low-Power Area-Efficient GMM-Based Analog Classifier For Skin Detection. In: 2022 Panhellenic Conference on Electronics & Telecommunications (PACET) 1–4 (IEEE, 2022).
Alimisis, V. et al. General methodology for the design of bell-shaped analog-hardware classifiers. Electronics 12, 4211 (2023).
Article Google Scholar
Alimisis, V., Gourdouparis, M., Dimas, C. & Sotiriadis, P. P. A 0.6 v, 3.3 nw, adjustable gaussian circuit for tunable kernel functions. In: 2021 34th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI) 1–6 (IEEE, 2021).
Alimisis, V., Gourdouparis, M., Dimas, C. & Sotiriadis, P. P. Ultra-low power, low-voltage, fully-tunable, bulk-controlled bump circuit. In: 2021 10th International Conference on Modern Circuits and Systems Technologies (MOCAST) 1–4 (IEEE, 2021).
Kim, D., Park, M. & Park, Y.-L. Probabilistic modeling and bayesian filtering for improved state estimation for soft robots. IEEE Trans. Robot. 37, 1728–1741 (2021).
Article Google Scholar
Baltussen, M. G., van de Wiel, J., Fernandez Regueiro, C. L., Jakstaite, M. & Huck, W. T. S. A Bayesian Approach to Extracting Kinetic Information from Artificial Enzymatic Networks. Anal. Chem. 94, 7311–7318 (2022).
Article CAS PubMed PubMed Central Google Scholar
Wills, A. G. Hendriks J., Renton C., Ninness B. A Bayesian filtering algorithm for Gaussian mixture models. arXiv:1705.05495v2 (2017).
Wallén Kiessling, A. Monte Carlo Localization with Hilbert maps as Likelihood Fields) (2023).
Iliev, N., Gianelli, A. & Trivedi, A. R. Low power speaker identification by integrated clustering and gaussian mixture model scoring. IEEE Embedded Syst. Lett. 12, 9–12 (2019).
Article Google Scholar
Gianelli, A., Iliev, N., Nasrin, S., Graziano, M. & Trivedi, A. R. Low power speaker identification using look up-free Gaussian mixture model in CMOS. In: 2019 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS) 1–3 (IEEE, 2019).
Cai, R. et al. VIBNN: Hardware acceleration of Bayesian neural networks. ACM SIGPLAN Not. 53, 476–488 (2018).
Article Google Scholar
Fan, H. et al. High-performance FPGA-based accelerator for Bayesian neural networks. In: 2021 58th ACM/IEEE Design Automation Conference (DAC) 1063–1068 (IEEE, 2021).
Rullyani, C., Sung, C.-F., Lin, H.-C. & Chu, C.-W. Flexible organic thin film transistors incorporating a biodegradable CO2-based polymer as the substrate and dielectric material. Sci. Rep. 8, 8146 (2018).
Article ADS PubMed PubMed Central Google Scholar
Jang, J. et al. High T g cyclic olefin copolymer/Al 2 O 3 bilayer gate dielectrics for flexible organic complementary circuits with low-voltage and air-stable operation. J. Mater. Chem. 21, 12542–12546 (2011).
Article CAS Google Scholar
Yang, S. C. et al. Large‐scale, low‐power nonvolatile memory based on few‐layer MoS2 and ultrathin polymer dielectrics. Adv. Electron Mater. 5, 1800688 (2019).
Article Google Scholar
Woo, M. H. et al. Low‐power nonvolatile charge storage memory based on MoS2 and an ultrathin polymer tunneling dielectric. Adv. Funct. Mater. 27, 1703545 (2017).
Article Google Scholar
Kang, M., Baeg, K. J., Khim, D., Noh, Y. Y. & Kim, D. Y. Printed, flexible, organic nano‐floating‐gate memory: Effects of metal nanoparticles and blocking dielectrics on memory characteristics. Adv. Funct. Mater. 23, 3503–3512 (2013).
Article CAS Google Scholar
Lee, C. et al. Long‐term retention of low‐power, nonvolatile organic transistor memory based on ultrathin, trilayered dielectric containing charge trapping functionality. Adv. Funct. Mater. 30, 2004665 (2020).
Article ADS CAS Google Scholar
Moon, H. et al. Synthesis of ultrathin polymer insulating layers by initiated chemical vapour deposition for low-power soft electronics. Nat. Mater. 14, 628–635 (2015).
Article ADS CAS PubMed Google Scholar
Choi, J. et al. Vertically stacked, low-voltage organic ternary logic circuits including nonvolatile floating-gate memory transistors. Nat. Commun. 13, 2305 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Choi, J. et al. Flexible, low-power thin-film transistors made of vapor-phase synthesized high-k, ultrathin polymer gate dielectrics. ACS Appl Mater. Interfaces 9, 20808–20817 (2017).
Article CAS PubMed Google Scholar
Hayakawa, R., Honma, K., Nakaharai, S., Kanai, K. & Wakayama, Y. Electrically reconfigurable organic logic gates: a promising perspective on a dual‐gate antiambipolar transistor. Adv. Mater. 34, 2109491 (2022).
Article CAS Google Scholar
Yoo, H., On, S., Lee, S. B., Cho, K. & Kim, J. J. Negative transconductance heterojunction organic transistors and their application to full‐swing ternary circuits. Adv. Mater. 31, 1808265 (2019).
Article Google Scholar
Guo, H. A simple algorithm for fitting a Gaussian function [DSP tips and tricks]. IEEE Signal Process. Mag. 28, 134–137 (2011).
Article ADS Google Scholar
Swart, J., Gehm Moraes, F. & Sill Torres, F. 2021 34th SBC/SBMicro/IEEE/ACM Symposium on Integrated Circuits and Systems Design (SBCCI) (2021).
Lee, S., Seong, H., Im, S. G., Moon, H. & Yoo, S. Organic flash memory on various flexible substrates for foldable and disposable electronics. Nat. Commun. 8, 725 (2017).
Article ADS PubMed PubMed Central Google Scholar
Lue, H.-T. et al. Understanding STI edge fringing field effect on the scaling of charge-trapping (CT) NAND Flash and modeling of incremental step pulse programming (ISPP). In: 2009 IEEE International Electron Devices Meeting (IEDM) 1–4 (IEEE, 2009).
Banerjee, W. Challenges and applications of emerging nonvolatile memory devices. Electronics 9, 1029 (2020).
Article CAS Google Scholar
Chih, Y.-D. et al. Design challenges and solutions of emerging nonvolatile memory for embedded applications. In: 2021 IEEE International Electron Devices Meeting (IEDM) 2–4 (IEEE, 2021).
Yin, L., Cheng, R., Wen, Y., Liu, C. & He, J. Emerging 2D memory devices for in‐memory computing. Adv. Mater. 33, 2007081 (2021).
Article CAS Google Scholar
Trivedi, A. R. Shylendra A. Ultralow power acoustic feature-scoring using gaussian IV transistors. In: Proceedings of the 55th Annual Design Automation Conference) 1–6 (2018).
Cherenack, K. Van Pieterson L. Smart textiles: Challenges and opportunities. J. Appl. Phys. 112, (2012).
Wong, W. S. Salleo A. Flexible electronics: materials and applications. Springer Science & Business Media. 11 (2009).
Park, H. L. et al. Flexible neuromorphic electronics for computing, soft robotics, and neuroprosthetics. Adv. Mater. 32, 1903558 (2020).
Article CAS Google Scholar
Jang, J., et al. High Tg cyclic olefin copolymer/Al2O3 bilayer gate dielectrics for flexible organic complementary circuits with low-voltage and air-stable operation. J. Mater. Chem. 21, (2011).
Rullyani, C., Sung, C. F., Lin, H. C. & Chu, C. W. Flexible organic thin film transistors incorporating a biodegradable co(2)-based polymer as the substrate and dielectric material. Sci. Rep. 8, 8146 (2018).
Article ADS PubMed PubMed Central Google Scholar
Nam, S. H. et al. Highly sensitive non-classical strain gauge using organic heptazole thin-film transistor circuit on a flexible substrate. Adv. Funct. Mater. 24, 4413–4419 (2014).
Article CAS Google Scholar
Jeon, P. J., Lee, K., Park, E. Y., Im, S. & Bae, H. Ultrasensitive low power-consuming strain sensor based on complementary inverter composed of organic p- and n-channels. Org. Electron 32, 208–212 (2016).
Article CAS Google Scholar
Wang, H., Yu, K. & Mao, B. Self-localization and obstacle avoidance for a mobile robot. Neural Comput. Appl. 18, 495–506 (2009).
Article ADS Google Scholar
Moon, T. K. The expectation-maximization algorithm. IEEE Signal Process. Mag. 13, 47–60 (1996).
Article ADS Google Scholar
Makariye, N. Towards shortest path computation using Dijkstra algorithm. In: 2017 International Conference on IoT and Application (ICIOT) 1–3 (IEEE, 2017).
Chen, J.-C. Dijkstra’s shortest path algorithm. J. formalized Math. 15, 237–247 (2003).
CAS Google Scholar
Li, W. et al. Solution‐processed wide‐bandgap organic semiconductor nanostructures arrays for nonvolatile organic field‐effect transistor memory. Small 14, 1701437 (2018).
Article ADS Google Scholar

Download references

Acknowledgements

This work was supported by Samsung Advanced Institute of Technology (SAIT), SEC (S. G. I.). This work was supported by the National Research Foundation of Korea(NRF) grant funded by the Korea government(MSIT), No. 2021R1A2B5B03001416 (S. G. I.), NRF-2022R1C1C1004590 (H. Y.) and RS-2023-00210194 (H. Y.). This work was supported by the Wearable Platform Materials Technology Center (WMC) funded by the National Research Foundation of Korea(NRF) Grant by the Korean Government (MSIT), NRF-2022R1A5A6000846 (S. G. I.). This work was also supported by a grant from National Science Foundation (NSF), grant # 2106824.

Author information

These authors contributed equally: Changhyeon Lee, Leila Rahimifard, Junhwan Choi.

Authors and Affiliations

Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Korea
Changhyeon Lee, Jeong-ik Park, Chungryeol Lee, Seung Min Lee & Sung Gap Im
Department of Electrical and Computer Engineering, University of Illinois at Chicago, Chicago, IL, 60607, USA
Leila Rahimifard, Divake Kumar, Priyesh Shukla & Amit Ranjan Trivedi
Department of Chemical Engineering, Dankook University, 152 Jukjeon-ro, Suji-gu, Yongin, Gyeonggi-do, 16890, Korea
Junhwan Choi
Department of Electronic Engineering, Gachon University, 1342 Seongnam-daero, Sujeong-gu, Seongnam, Gyeonggi-do, 13120, Korea
Hocheon Yoo
KAIST Institute for NanoCentury (KINC), Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Korea
Sung Gap Im

Authors

Changhyeon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Leila Rahimifard
View author publications
You can also search for this author in PubMed Google Scholar
Junhwan Choi
View author publications
You can also search for this author in PubMed Google Scholar
Jeong-ik Park
View author publications
You can also search for this author in PubMed Google Scholar
Chungryeol Lee
View author publications
You can also search for this author in PubMed Google Scholar
Divake Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Priyesh Shukla
View author publications
You can also search for this author in PubMed Google Scholar
Seung Min Lee
View author publications
You can also search for this author in PubMed Google Scholar
Amit Ranjan Trivedi
View author publications
You can also search for this author in PubMed Google Scholar
Hocheon Yoo
View author publications
You can also search for this author in PubMed Google Scholar
Sung Gap Im
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.L., J.C., H.Y., and S.G.I. conceived the idea and designed the experiments. Chang.L. and J.C. designed, fabricated, and measured all the devices and circuits. J.P. assisted with device characterization, and C.L. helped with the circuit design. S.M.L. assisted device fabrication. J.C., C.L., H.Y., and S.G.I. wrote the manuscript. L.R., D.K., P.S., and A.R.T. conceived the application-level ideas. L.R., D.K., and P.S. simulated the design. A.R.T. prepared the description of the results. All authors reviewed the manuscript and discussed the results. C.L., J.C., and L.R. contributed equally to this work. A.R.T., H.Y. and S.G.I. contributed equally as corresponding authors.

Corresponding authors

Correspondence to Amit Ranjan Trivedi, Hocheon Yoo or Sung Gap Im.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, C., Rahimifard, L., Choi, J. et al. Highly parallel and ultra-low-power probabilistic reasoning with programmable gaussian-like memory transistors. Nat Commun 15, 2439 (2024). https://doi.org/10.1038/s41467-024-46681-2

Download citation

Received: 18 October 2023
Accepted: 06 March 2024
Published: 18 March 2024
DOI: https://doi.org/10.1038/s41467-024-46681-2

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.