Hardware implementation of Bayesian network based on two-dimensional memtransistors

Zheng, Yikai; Ravichandran, Harikrishnan; Schranghamer, Thomas F.; Trainor, Nicholas; Redwing, Joan M.; Das, Saptarshi

doi:10.1038/s41467-022-33053-x

Download PDF

Article
Open access
Published: 23 September 2022

Hardware implementation of Bayesian network based on two-dimensional memtransistors

Yikai Zheng¹,
Harikrishnan Ravichandran¹,
Thomas F. Schranghamer¹,
Nicholas Trainor^2,3,
Joan M. Redwing^2,3 &
…
Saptarshi Das ORCID: orcid.org/0000-0002-0188-945X^1,2,3,4

Nature Communications volume 13, Article number: 5578 (2022) Cite this article

5614 Accesses
24 Citations
31 Altmetric
Metrics details

Subjects

Abstract

Bayesian networks (BNs) find widespread application in many real-world probabilistic problems including diagnostics, forecasting, computer vision, etc. The basic computing primitive for BNs is a stochastic bit (s-bit) generator that can control the probability of obtaining ‘1’ in a binary bit-stream. While silicon-based complementary metal-oxide-semiconductor (CMOS) technology can be used for hardware implementation of BNs, the lack of inherent stochasticity makes it area and energy inefficient. On the other hand, memristors and spintronic devices offer inherent stochasticity but lack computing ability beyond simple vector matrix multiplication due to their two-terminal nature and rely on extensive CMOS peripherals for BN implementation, which limits area and energy efficiency. Here, we circumvent these challenges by introducing a hardware platform based on 2D memtransistors. First, we experimentally demonstrate a low-power and compact s-bit generator circuit that exploits cycle-to-cycle fluctuation in the post-programmed conductance state of 2D memtransistors. Next, the s-bit generators are monolithically integrated with 2D memtransistor-based logic gates to implement BNs. Our findings highlight the potential for 2D memtransistor-based integrated circuits for non-von Neumann computing applications.

Giant energy storage and power density negative capacitance superlattices

Article 09 April 2024

Suraj S. Cheema, Nirmaan Shanker, … Sayeef Salahuddin

Phase-change memory via a phase-changeable self-confined nano-filament

Article 03 April 2024

See-On Park, Seokman Hong, … Shinhyun Choi

High-speed and large-scale intrinsically stretchable integrated circuits

Article 13 March 2024

Donglai Zhong, Can Wu, … Zhenan Bao

Introduction

The concept of a Bayesian network (BN) is deep rooted within natural intelligence. Animals gather information from their surroundings with the help of their sensory organs and process this information using their brain to make decisions, enabling their survival. However, gathering accurate information is often very difficult in practice either due to the limitations of sensory organs or due to noisy environment. For example, visual cues are an unreliable source of information for freshwater fish like the rainbow trout to identify the presence of a predator. In contrast, chemical cues released into the water from an injured fish are more reliable indicators of a predatory event¹. The decision to invoke an alarm response, therefore, depends on how the brain processes the visual and chemical cues based on their relative probability of success from prior experiences. While the neural basis of such computations is relatively unknown, the mathematical construct is represented using a BN with theoretical foundation in Bayes’ theorem.

A BN is a probabilistic graphic network used to estimate and infer the probability of interdependent events². Figure 1a shows the basic building block of a BN, comprising a parent node, $A$, a child node, $B$, and an edge connecting the two. Each node represents an event, e.g., the presence of a chemical cue $(A)$ and the presence of a predator ($B$), and the connection represents how two events are mutually dependent. The dependence is provided in a conditional probability table (CPT) which contains the conditional probability (likelihood) values $P(B/A)$ and $P(B/{A}^{c})$, where ${A}^{c}$ is the complement of the event $A$. In the present example, these represent the likelihood of the presence of a predator when a chemical cue is present ($A$) or absent (${A}^{c}$), respectively. When the probability of occurrence for event $A$, i.e., $P(A)$, is known, the marginal probability of occurrence of event $B$, i.e., $P(B)$, can be evaluated using Bayes’ theorem following Eq. 1.

$$P\left(B\right)=P(B/A)P(A)+P(B/{A}^{c})P({A}^{c})=P(B/A)P(A)+P(B/{A}^{c})[1-P\left(A\right)]$$

(1)

$$P\left(A\right)+P\left({A}^{C}\right)=1$$

(2)

In a generic BN, a child node can have multiple parent nodes, and a parent node can have multiple children. For example, Supplementary Fig. S1a shows a BN where the child node, $B$, is connected to 2 parent nodes, ${A}_{1}$ and ${A}_{2}$. Note that the CPT in this instance contains $N$ = 4 entries, which are the conditional probability (likelihood) for the occurrence of event $B$ under all possible combinations of the occurrence of events ${A}_{1}$ and ${A}_{2}$. Similarly, Supplementary Fig. S1b shows a BN where the parent node, $A$, is connected to 2 children, ${B}_{1}$ and ${B}_{2}$. In this case, there are 2 CPTs with $N$ = 2 entries each.

Note that the probability estimation for a child node requires multiple arithmetic operations such as multiplication, subtraction, and addition. This makes hardware implementation of a BN using conventional silicon complementary metal-oxide-semiconductor (CMOS) technology^{3, 4} less attractive because 1) arithmetic operations require circuits consisting of hundreds of transistors, which have large footprints and consume a significant amount of energy, and 2) the von Neumann bottleneck necessitates storing of the CPT in the memory, which is physically separated from the arithmetic core and therefore requires frequent data shuttling between the two, further aggravating the energy burden. In contrast, even the tiniest brains with very limited numbers of neurons can perform such apparently complex computational tasks with miniscule energy expenditure. The success of biological brains in implementing BNs could lie in the inherently stochastic nature of neural computation.

Drawing inspiration from biology, stochastic computing (SC) has been explored for the hardware implementation of BNs⁵. The key difference from classical computing, where information in presented in the form of binary values (1’s and 0’s), is that SC encodes information using stochastic bits (s-bits) that are interpreted as probabilities that fall in the interval [0,1]. For instance, the bit-stream S = [1 0 0 1 0 1 0 0] encodes the value $P(S)$ = 3/8, i.e., the probability of finding ‘1’ in the bit-stream $S$. An attractive feature of SC is that arithmetic operations can be performed using simple logic gates^{6, 7}. For example, the 2-node BN in Fig. 1a can be realized using a multiplexer (${MUX}$) circuit as shown in Fig. 1b. The output, $B$, of a ${MUX}$ with two input variables, ${X}_{1}$ and ${X}_{2}$, and a select line, $A$, is given by Eq. (3).

$$B=A{X}_{1}+{A}^{c}{X}_{2}$$

(3)

If, instead of being digital variables, ${X}_{1}$, ${X}_{2}$, and $A$ represent stochastic variables with $P\left({X}_{1}\right)$, $P\left({X}_{2}\right)$, and $P\left(A\right)$ being the respective probability of obtaining ‘1’ in their bit-streams, then $B$ also transforms into a random variable whose probability is given by Eq. (4).

$$P\left(B\right)=P\left(A\right)P\left({X}_{1}\right)+P\left({A}^{c}\right)P\left({X}_{2}\right)$$

(4)

Note that, if $P\left({X}_{1}\right)$ = $P\left(B/A\right)$ and $P\left({X}_{2}\right)$ = $P\left(B/{A}^{C}\right)$, then Eq. (4) transforms into Eq. (1). Therefore, hardware implementation of a child node with a single parent can be accomplished by using 3 s-bit generators and a 2 × 1 MUX. Interestingly, the ${MUX}$ architecture can be scaled to implement any BN. For example, hardware implementation of the BN in Fig. 1a can be achieved by using 2 s-bit generators to obtain ${A}_{1}$ and ${A}_{2}$, another 4 s-bit generators to obtain the CPT, and one 4 × 1 MUX with 2 select lines as shown in Supplementary Fig. S1c. Similarly, Supplementary Fig. 1d shows the hardware architecture for the BN in Supplementary Fig. S1b, consisting of $1$ s-bit generator to obtain $A$, another 4 s-bit generators to obtain the 2 CPTs, and 2 2 × 1 MUXs.

Note that BN architecture can be used to represent many real-life situations, as shown in Fig. 1c. For example, in the case of the rainbow trout, events ${A}_{1}$ and ${A}_{2}$ represent the presence of independent visual and chemical cues and event $B$ represents the presence of a predator. Events ${C}_{1}$ and ${C}_{2}$, meanwhile, represent the decision taken by the rainbow trout to stop swimming and stop foraging, respectively, which are also independent of each other but depend on $B$. Similarly, in forecasting, events ${A}_{1}$ and ${A}_{2}$ represent the probability of a day being cloudy and windy, respectively, event $B$ represents the probability of rain, and events ${C}_{1}$ and ${C}_{2}$ may represent the decision to purchase an umbrella or drink coffee, respectively. Finally, a third example is derived from genetics and drug discovery, where events ${A}_{1}$ and ${A}_{2}$ may represent the probability of expressing gene 1 and gene 2 when intervening with a specific drug, respectively, event $B$ represents the activation of a critical signaling pathway, and events ${C}_{1}$ and ${C}_{2}$ represent production of specific hormones or antibodies, respectively. The above discussion exemplifies the usefulness of BNs in depicting causal relationships using acyclic graphs, which can subsequently be used to predict outcomes based on prior knowledge and likelihood. For example, to predict the relative effectiveness between drug-1 and drug-2 that influence expression for gene 1 and gene 2, respectively, the only experiments that one needs to do is to obtain respective prior results, i.e., $P\left({A}_{1}\right)$ and $P\left({A}_{2}\right)$. A BN can then be used to obtain marginal likelihoods, i.e., $P\left({C}_{1}\right)$ and/or $P\left({C}_{2}\right)$, to assess the relative effectiveness of the two drugs.

The fundamental computing primitive for the stochastic computing implementation of a BN is an s-bit generator, which allows control of the output probability of obtaining ‘1’ in a given bit-stream. So far, probabilistic CMOS⁸, field-programmable gate arrays (FPGAs)^9,10,11, memristors^12,13,14, and spintronic devices^{15,16,17,18,19,20,21} have been successfully used for BN implementation. However, CMOS- and FPGA-based BN architectures require hundreds of transistors to generate s-bits, which limits their area and energy efficiency^{22,23,24,25,26,27}. In contrast, memristors offer inherent stochasticity in their switching dynamics, which can be exploited to obtain random bits. However, memristor-based BN architectures heavily rely on CMOS peripherals to translate random bits into s-bits and for subsequent logic operations using those s-bits. Recently, spintronic devices such as magnetic random access memory (MRAM)²⁸ and magnetic tunnel junctions (MTJs)^29,30,31 have shown potential for BN implementation since s-bits can be obtained by controlling the probability of spin-flip through externally driven current. However, temperature and supply voltage fluctuations can impact the spin-flip probability, which necessitates additional CMOS-based peripheral circuits to remove the bit-bias. In addition, spin-based devices still require CMOS-based logic circuits for BN implementation.

In this work, we demonstrate hardware implementation of a BN using a monolithic memtransistor technology based on two-dimensional (2D) semiconductors such as monolayer MoS₂. Memtransistors are three-terminal devices in which the gate terminal allows non-volatile and analog programming of the conductance states, which can then be readout by applying a source-to-drain bias. Our main contributions in this work are 1) the design of an area and energy efficient s-bit generator circuit composed of six memtransistors, allowing it to achieve a tunable probability of obtaining ‘1’ in the bit-stream over the range [0,1], and 2) integration of s-bit generators with a 2D memtransistor-based 2×1 ${MUX}$ that consists of three ${NAND}$ gates and one ${NOT}$ gate for BN implementation. In brief, we exploit the inherent stochasticity of the charge trapping and detrapping processes in the gate dielectric of the memtransistor as the source of randomness. Our in-memory computing approach based on three-terminal 2D memtransistors not only overcomes the von Neumann limitations of conventional digital CMOS, but also eliminates the need for peripherals, which is inescapable for emerging memristor- and spin-based 2-terminal stochastic devices for BN implementation.

Our choice of monolayer MoS₂ is motivated by the fact that atomically thin 2D materials are being considered for advanced technology nodes³². It is widely accepted that scaling silicon thickness beyond ~3–4 nm is challenging. Yet, the gate electrostatics demand aggressive reduction in the channel thickness to preserve the desired device performance for sub-10 nm technology nodes³³. The ultimate channel thickness for a field-effect transistor (FET) would be in the sub-1 nm range, which is difficult to realize using bulk semiconductors³⁴, making 2D materials a natural choice for ultra-scaled FETs^{35,36,37,38,39,40,41}. In fact, recent years have witnessed many experimental breakthroughs in the development of high-performance 2D FETs^42,43,44,45, neurosynaptic devices^{46,47,48,49,50}, and very large scale integrated (VLSI) circuits^51,52,53,54. Similarly, theoretical calculations and quantum mechanical simulation have found that the 2D FETs can outperform CMOS HP (high performance) in both energy and delay^55,56,57,58.

Results

Fabrication and characterization of 2D memtransistors

Figure 2a, b, respectively, show the 2D schematic and optical image of a representative 2D memtransistor based on monolayer MoS₂, which is locally back-gated with sputter-deposited 40/30 nm Pt/TiN serving as the back-gate electrode with atomic layer deposition (ALD) grown 50 nm Al₂O₃ as the gate dielectric. All back-gate islands were placed on a commercially purchased SiO₂/p⁺⁺-Si substrate. As we will discuss later, the analog, non-volatile, and stochastic programming capability offered by the Al₂O₃/Pt/TiN gate stack is central to our BN architecture. The monolayer MoS₂ used in this work was grown using a metal-organic chemical vapor deposition (MOCVD) technique on a sapphire substrate at 950 °C^{45, 59}. Use of an epitaxial substrate and elevated growth temperature ensured a uniform and high quality 2D film, which is critical for the successful demonstration of our BN architecture that involves many 2D memtransistors. For subsequent 2D memtransistor fabrication, the monolayer MoS₂ film was transferred from the growth substrate to the SiO₂/p⁺⁺-Si substrate with predefined islands of Al₂O₃/Pt/TiN. Details on monolayer MoS₂ synthesis, film transfer, and fabrication of the local back-gate gate islands, MoS₂ memtransistors, and BN architecture can be found in the “Methods” section as well as in the Methods sections of our recent works ^{45, 60,61,62,63}.

The film quality and device performance were assessed using optical and electrical measurements. The Raman spectra (Supplementary Fig. S2a) obtained for a representative 2D memtransistor shows two characteristic monolayer MoS₂ peaks at 383 cm⁻¹ and 404 cm⁻¹ corresponding to the in-plane $E$_2g and out-of-plane $A$_1g modes, respectively, with the expected peak separation of ~20 cm⁻¹ for monolayer MoS₂⁶⁴. Similarly, the photoluminescence (PL) spectra (Supplementary Fig. S2b) shows a peak at 1.83 eV corresponding to the direct bandgap of monolayer MoS₂. The transfer characteristics, i.e., source-to-drain current (${I}_{{{{{{\rm{DS}}}}}}}$) versus local back-gate voltage (${V}_{{{{{{\rm{BG}}}}}}}$), measured using a source-to-drain bias (${V}_{{{{{{\rm{DS}}}}}}}$) of 1 V are shown in Fig. 2c in both linear and logarithmic scale for a representative MoS₂ memtransistor with a channel length ($L$) of 1 µm and a channel width ($W$) of 5 µm. As expected, n-type transport is observed in MoS₂, which is attributed to the pinning of the metal Fermi level near the conduction band^65,66,67. Nevertheless, the MoS₂ memtransistor exhibits excellent electrostatic gate control with a current on/off ratio (${r}_{{{{{{\rm{ON}}}}}}/{{{{{\rm{OFF}}}}}}}$) > 10⁵, a subthreshold slope (${SS}$) < 400 mV/decade averaged over 3 orders of magnitude change in ${I}_{{{{{{\rm{DS}}}}}}}$, minimal gate hysteresis when measured in air, and low gate leakage current. The threshold voltage (${V}_{{{{{{\rm{TH}}}}}}}$) was found to be ~1.75 V extracted at an iso-current of 10 nA/µm and the electron field effect mobility (${\mu }_{{{{{{\rm{FE}}}}}}}$) extracted from the peak trans-conductance was found to be 5 cm²/V-s. Figure 2d shows the output characteristics, i.e., ${I}_{{{{{{\rm{DS}}}}}}}$ versus ${V}_{{{{{{\rm{DS}}}}}}}$, at different ${V}_{{{{{{\rm{BG}}}}}}}$ for the same representative MoS₂ memtransistor. The on-current $({I}_{{{{{{\rm{ON}}}}}}})$ reached as high as ~ 11 µA/µm for an inversion carrier density of ~1 × 10¹²/cm² at ${V}_{{{{{{\rm{DS}}}}}}}$ = 5 V. These results suggest that the monolayer MoS₂ film grown using MOCVD is of reasonably good quality, and that the memtransistor fabrication processes including the film transfer are clean and damage-free.

The post-programmed and post-erased transfer characteristics of a representative 2D memtransistor after being subjected to negative “Write” (${V}_{P}$) and positive “Erase” (${V}_{E}$) voltage pulses applied to the local back-gate electrode of varying amplitudes, each for a duration of ${\tau }_{P/E}$ = 100 µs, are shown in Fig. 2e, f, respectively. The negative and positive shift in the respective transfer characteristics can be ascribed to electron trapping and detrapping at and near the MoS₂/Al₂O₃ interface, respectively. Note that trap states can originate from defects/imperfections in the dielectric and/or adsorbed species at the 2D/dielectric interface as reported in various earlier studies^68,69,70. These states can also be engineered at desired energetic locations by introducing intentional defects in the 2D channel material^{51, 71}. Carrier occupancy in these trap states follow Fermi-Dirac distribution. As illustrated using the energy band diagrams in Supplementary Fig. S3, at equilibrium, i.e., in the absence of any gate bias, the trap states with energy levels above the Fermi energy (${E}_{F}$) are empty, whereas the ones below ${E}_{F}$ are filled. When the memtransistor is subjected to a negative “Write” (${V}_{P}$) voltage pulse, electrons are released (detrapped) from these trap states leaving them positively charged. This leads to screening of the back-gate bias, which is reflected as shift in the threshold voltage (${\triangle V}_{{TH}}$). Similarly, when the memtransistor is subjected to a positive “Erase” (${V}_{E}$) voltage pulse, electrons are captured back (trapped) into the trap states, restoring the ${V}_{{TH}}$. Note that the number of electrons getting trapped/detrapped can be controlled by both the magnitude and duration of ${V}_{P}$ and ${V}_{E}$, which allow us to have an analog control of the ${\triangle V}_{{TH}}$ and of the conductance state of the memtransistor.

The minimum program/erase pulse width is determined by the trapping/detrapping time constants. Supplementary Fig. S4a–d show the post-programmed and post-erased transfer characteristics of a 2D memtransistor subjected to ${V}_{P}$ and ${V}_{E}$ voltage pulses of different amplitudes ranging from 8 V to 15 V applied to the local back-gate electrode, each for a duration of ${\tau }_{P/E}$ = 100 µs, 10 µs, 1 µs, and 100 ns, respectively. Clearly, the charge trapping and detrapping processes can occur as fast as 100 ns, which is the limit set by our measurement tools, allowing further improvement in the programming speed^{72, 73}. Supplementary Fig. S4e, f show the extracted shift in the threshold voltage (${\triangle V}_{{TH}}$) as a function of ${V}_{P/E}$ for ${\tau }_{P/E}$ = 100 µs and ${\tau }_{P/E}$ = 100 ns, respectively. From these results, we can conclude that, for any given pulse magnitude ${V}_{P/E}$, ${\triangle V}_{{TH}}$ becomes smaller as ${\tau }_{P/E}$ becomes shorter. To retain similar ${\triangle V}_{{TH}}$ for smaller ${\tau }_{P/E}$, larger ${V}_{P/E}$ is required, which will increase the energy expenditure. Therefore, one needs to strike a balance between fast programmability and energy consumption based on the application needs.

The trapping and detrapping processes were found to be non-volatile, as shown in Fig. 2g for 4 representative post-programmed and post-erased conductance states (${G}_{{MT}}$) over 100 s. We also examined long-term memory retention for the 2D memtransistors and found that states remain distinguishable even after 3 hrs. Memory retention is important to store the CPT and the memtransistors demonstrate adequate memory performance for the hardware implementation of BNs using SC. The program/erase endurance is also important for the 2D memtransistor. Supplementary Fig. S5 shows the post-programmed and post-erased conductance states of a representative memtransistor, achieved with ${V}_{P}$ = −7 V and ${V}_{E}$ = 10 V using ${\tau }_{P/E}$ = 100 ns and measured at ${V}_{{{{{{\rm{BG}}}}}}}$ = 0 V for up to 10⁹ endurance cycles. Clearly, there is no significant change in the two states. While it is desirable to demonstrate endurance for an even higher number of cycles, note that, for the many edge applications, the current endurance results can be sufficient. For example, in weather forecasting, the BN will be used every minute rather than every microsecond; similarly, in medical diagnostics, the BN will be only used several thousand times a day to assess patients.

Programming stochasticity in 2D memtransistors and design of s-bit generator

Design of hardware for high-quality random bit generation is central to the hardware implementation of BNs. Here, we exploit the cycle-to-cycle variation in the post-programmed and post-erased conductance states (${G}_{{MT}}$) of 2D memtransistors as a source of true randomness. Figure 3a shows the transfer characteristics of a representative MoS₂ memtransistor, which is measured each time after applying ${V}_{P}$ = −10 V and ${V}_{E}$ = 10 V for ${\tau }_{s}$ = 100 µs, for a total of 100 cycles and Fig. 3b, c, respectively, show the histograms of post-programmed and post-erased ${G}_{{MT}}$ values extracted at ${V}_{{BG}}$ = 0 V. Clearly, the ${G}_{{MT}}$ values follow Gaussian random distributions. The cycle-to-cycle variation in program/erase processes is a direct consequence of the stochastic nature of charge trapping and detrapping observed in most semiconductor/dielectric interfaces^74,75,76. In the simple two-state model, a trap state can be electrically neutral or charged, and it can transition between the two states even under equilibrium condition with transition times exponentially distributed. In other words, the state transition dynamics for traps follows the classic Markovian process^{77, 78}. In ultra-scaled metal-oxide-semiconductor field effect transistors (MOSFETs) such stochastic state transitions lead to random telegraph noise (RTN). Metastable states are also often involved in the trapping/detrapping processes, making the transition dynamic more complex, rich, and, at the same time, introducing an additional source of randomness⁷⁹. While RTN is not observed in our relatively large area memtransistors, the stochasticity of trapping/detrapping processes manifest during the program/erase operations, thus leading to the cycle-to-cycle variation in ${\triangle V}_{{TH}}$.

**Fig. 3: 2D memtransistor-based s-bit generator.**

To translate the stochastic conductance fluctuation into s-bits, we deploy a circuit consisting of six memtransistors (${MT}1$, $MT2$, $MT3$, $MT4$, $MT5$, and $MT6$), as shown using the circuit diagram and corresponding optical image in Fig. 3d, e, respectively. The voltage waveforms applied to the nodes $N1$ and $N2$, i.e., ${V}_{N1}$ and ${V}_{N2}$, respectively, are shown in Fig. 3f. Note that during each clock cycle (${\tau }_{{clk}}$), ${V}_{N1}$ switches between 0 V, 0 V, and 2 V and ${V}_{N2}$ switches between ${V}_{P}$ = − 7 V, ${V}_{E}$ = 10 V, and ${V}_{R}$ = 1 V. Voltages applied to nodes $N3$ and $N4$, i.e., ${V}_{N3}$ and ${V}_{N4}$, are held constant at 1 V and 0 V, respectively. This allows programming and erasing of ${MT}1$ during each ${\tau }_{{clk}}$. The voltage readout at node $N5$, i.e., ${V}_{N5}$, is shown in Fig. 3g and exhibits stochastic fluctuation. Note that the series connection of memtransistors ${MT}1$ and $MT2$ represents a voltage divider circuit, and hence ${V}_{N5}$ is determined by their respective conductance values, i.e., ${G}_{{MT}1}$ and ${G}_{{MT}2}$. Since ${G}_{{MT}1}$ fluctuates from cycle-to-cycle owing to the programming and erasing voltages applied to its local back-gate terminal, i.e., $N2$, so does ${V}_{N5}$. In other words, the voltage divider translates conductance fluctuations into voltage fluctuations. Figure 3h shows the histogram of ${V}_{N5}$, which, as expected, follows a random Gaussian distribution with a mean (${\mu }_{{{{{{\rm{VN}}}}}}5}$) of 0.40 V and standard deviation (${\sigma }_{{{{{{\rm{VN}}}}}}5}$) of 0.02 V.

Next, the Gaussian distribution is broadened by using an inverting amplifier constructed using $MT3$ and $MT4$. Note that the local back-gate of ${MT}3$ is shorted to its source at node ${N}_{6}$. This ensures that $MT3$ operates as a depletion mode (normally on) transistor or as a load resistor. Figure 3i shows the output, ${V}_{{{{{{\rm{N}}}}}}6}$, as a function of the input, ${V}_{{{{{{\rm{N}}}}}}5}$. The slope of the curve is referred to as the gain of the amplifier, and the higher the gain, the wider the broadening of the Gaussian. We achieved a gain of ~24. The gain can be increased further by cascading multiple amplifiers; however, this adds area and energy overhead. Figure 3j shows ${V}_{{{{{{\rm{N}}}}}}6}$ corresponding to ${V}_{N5}$ obtained in Fig. 3g. Clearly, the histogram of ${V}_{N6}$ shown in Fig. 3k exhibits a Gaussian distribution with a mean (${\mu }_{{{{{{\rm{VN}}}}}}6}$) of 0.99 V and an increased standard deviation $({\sigma }_{{{{{{\rm{VN}}}}}}6})$ of 0.41 V.

Finally, to transform the analog fluctuations seen in ${V}_{N6}$ into s-bits, a thresholding inverter with a programmable inversion threshold, ${V}_{{{{{{\rm{IT}}}}}}}$, is constructed using $MT5$ and $MT6$. Figure 3l shows the output, ${V}_{{{{{{\rm{N}}}}}}7}$, as a function of the input, ${V}_{{{{{{\rm{N}}}}}}6}$, for different ${V}_{{{{{{\rm{IT}}}}}}}$. Note that ${V}_{{{{{{\rm{IT}}}}}}}$ is the magnitude of ${V}_{{{{{{\rm{N}}}}}}6}$ for which ${V}_{{{{{{\rm{N}}}}}}7}$ reaches ${V}_{{{{{{\rm{DD}}}}}}}$/2, i.e., 1 V in the present case. The programmability of ${V}_{{{{{{\rm{IT}}}}}}}$ is a critical feature that distinguishes 2D memtransistor-based inverters from conventional CMOS-based inverters and allows us to seamlessly obtain the s-bits. Figure 3m shows ${V}_{{{{{{\rm{N}}}}}}7}$ corresponding to ${V}_{N6}$ obtained in Fig. 3j for different ${V}_{{{{{{\rm{IT}}}}}}}$ and Fig. 3n shows the corresponding probability of obtaining ‘1’ in the bit-stream, i.e., ${p}_{s}$ as a function of ${V}_{{{{{{\rm{IT}}}}}}}$. As expected, if ${V}_{{{{{{\rm{IT}}}}}}}$ is too low, then almost all ${V}_{N6}$ values translate into ${V}_{{{{{{\rm{N}}}}}}7}$ ≈ 0 V, which is reflected as near zero ${p}_{s}$. Similarly, if ${V}_{{{{{{\rm{IT}}}}}}}$ is too high, then almost all ${V}_{N6}$ values translate into ${V}_{{{{{{\rm{N}}}}}}7}$ ≈ 2 V, leading to ${p}_{s}$ = 1. Between these two extremes, ${p}_{s}$ increases monotonically with ${V}_{{{{{{\rm{IT}}}}}}}$. This clearly shows that we are able to convert the cycle-to-cycle random conductance fluctuations in 2D memtransistor into s-bits with reconfigurable ${p}_{s}$ values that lie between [0,1] using the described circuit.

Note that the cycle-to-cycle variation in the programming of 2D memtransistors will lead to fluctuations in the threshold voltage (${V}_{{TH}}$) of ${{MT}}_{6}$ and hence in ${V}_{{IT}}$ of the thresholding inverter and ${p}_{s}$ for the s-bit-stream. Supplementary Fig. S6a-b, respectively, show the distribution of ${V}_{{TH}}$ and ${V}_{{IT}}$ when ${{MT}}_{6}$ is subjected to 50 program/erase/read cycles with ${V}_{P}$ = −7 V, ${V}_{E}$ = 10 V, and ${\tau }_{P/E}$ = 100 µs. The means and standard deviations were found to be −0.04 V and 0.08 V for ${V}_{{TH}}$, respectively, and 0.14 V and 0.08 V for ${V}_{{IT}}$, respectively. Therefore, ${p}_{s}$ will not be perfectly deterministic; instead there will be a small uncertainty in its value, which is represented using the uncertainty band in Fig. 3n. Next, to assess randomness, we utilized the s-bit generator to generate 10⁴ random bits using the same programming and erasing voltage pulses of ${V}_{E}$= 10 V and ${V}_{P}$= −7 V, respectively, at ${\tau }_{P/E}$ = 100 µs. Supplementary Fig. S7 shows the results of eight of the statistical tests developed by the National Institute of Standards and Technology (NIST) performed on these 10⁴ bits. According to the test protocol, the bit-streams are considered random only if the p-value is greater than 0.01 with the null hypothesis that the sequence is random with 99% confidence level. The NIST test results confirm that the s-bits generated are truly random.

The rough estimate of the energy expenditure for s-bit generation (${E}_{s-{bit}}$) was calculated using Eq. (5).

$${E}_{s-{bit}}={C}_{G}\left({V}_{P}^{2}+{V}_{E}^{2}+{V}_{R}^{2}+{V}_{{DD}}^{2}\right)+\left\langle {I}_{N1N4}\right\rangle {V}_{{DD}}{\tau }_{{clk}}$$

(5)

$$\left\langle {I}_{N1N4}\right\rangle=\frac{1}{n}\mathop{\sum}\limits_{i=1}^{n}{I}_{N1N4,i}$$

(6)

$${C}_{G}={\varepsilon }_{0}{\varepsilon }_{{ox}}{WL}/{t}_{{ox}}$$

(7)

In Eq. (5), ${V}_{P}$, ${V}_{E}$, ${V}_{R}$, and ${V}_{{DD}}$ are the program, erase, read, and supply voltages, respectively. ${C}_{G}$ ≈ 10^-14F is the gate capacitance, ${\varepsilon }_{0}=8.85\times {10}^{-12}F/m$ is the vacuum permittivity, and ${\varepsilon }_{{ox}}=10$ and ${t}_{{ox}}=50 \,{nm}$ are, respectively, the relative permittivity and thickness of Al₂O₃; $W$ = 5 µm and $L$ = 1 µm are, respectively, the channel width and length of the 2D-memtransistor. $\left\langle {I}_{N1N4}\right\rangle$ is the average current flowing through the s-bit generator circuit, i.e., the total current through the voltage divider, inverting amplifier, and threshold inverter during each ${\tau }_{{clk}}$. We have used $n=200$ to calculate the average current per ${\tau }_{{clk}}$ = 100 µs based on the experimental measurements. Since most of the memtransistors operate in their respective subthreshold regimes, the extracted $\left\langle {I}_{N1N4}\right\rangle$ is ~1.5 nA as shown in Supplementary Fig. S8. As such, the second term in Eq. (5) accounts for ~0.3 pJ, whereas the first term in Eq. (5) accounts for ~2 pJ. This results in ${E}_{s-{bit}}\approx$ 2 pJ/clock-cycle, which supports our claim of energy efficient s-bit generation. Also note that since each memtransistor has an active device area of ~5 µm², excluding the large contact pads used for probing, the active footprint for the s-bit generator is ~30 µm². Since monolayer 2D materials offer aggressive dimensional scalability, it is possible to reduce the footprint of s-bit generators even further. Nevertheless, the use of only 6 memtransistors is the key towards the realization of area and energy efficient s-bit generator circuits.

2D memtransistor-based digital circuits and BN implementation

As described earlier, stochastic multiplexers (${MUX}$s) can be used for computing the marginal probability values at any BN node. Figure 4a shows the circuit configuration of a 2×1 MUX which consists of one inverter and three 2-input ${NAND}$ gates. Figure 4b shows the optical image and corresponding circuit configuration of a 2-input ${NAND}$ gate comprising 3 memtransistors (${MT}1$, $MT2$, and $MT3$) connected in series, with ${MT}1$ serving as the depletion load. The supply voltage, ${V}_{{DD}}$ = 2 V, is applied to the drain terminal of ${MT}1$ at node ${N}_{1}$, whereas the source terminal of ${MT}3$, i.e., node ${N}_{5}$, is kept grounded. Figure 4c shows the input waveforms, ${V}_{N3}$ and ${V}_{N4}$, which are applied to the local back-gate terminals of $MT2$ and $MT3$ at nodes ${N}_{3}$ and ${N}_{4}$, respectively, and the corresponding output waveform, ${V}_{N2}$, which is obtained at node ${N}_{2}$. Clearly, the circuit operates as a ${NAND}$ gate.

**Fig. 4: Hardware implementation of BN.**

Figure 4d, e, respectively, show the optical image and corresponding circuit configuration for hardware implementation of a 2-node BN consisting of 3 s-bit generators and a 2 × 1 ${MUX}$ for a total of 29 memtransistors. The ${V}_{{{{{{\rm{IT}}}}}}}$ values for the s-bit generators generating ${X}_{1}$ and ${X}_{2}$ can be pre-programmed corresponding to the CPT for the nodes $A$ and $B$ of the 2-node BN. Figure 4f shows the representative stochastic bit-streams for the random variables $A$, ${X}_{1}$, and ${X}_{2}$ with $P\left(A\right)$ = 0.28, $P\left({X}_{1}\right)$ = $P\left(B/A\right)$ = 0.50, and $P\left({X}_{2}\right)$ = $P\left(B/{A}^{C}\right)$ = 0.56. Note that accurate estimation of $P(B)$ requires that the stochastic input variables to the ${MUX}$, i.e., $A$, ${X}_{1}$, and ${X}_{2}$, must be mutually independent. Figure 4g shows the correlation coefficient (${CC}$) between these three variables. The ${CC}$ values were found to be close to zero, which confirms mutual independence of the s-bit generator modules. Figure 4h shows the stochastic bit-streams obtained at the output node, $B$. The measured and expected values for $P\left(B\right)$ are 0.56 and 0.54, respectively. Supplementary Fig. S9 shows the results for three more sets of measurements. In all instances, we found that our 29 memtransistor module is able to demonstrate a 2-node BN with relatively high accuracy. The rough estimate of the energy expenditure for our hardware BN implementation is miniscule at ~1.2 nJ when 200 ${\tau }_{{clk}}$ are used. Certainly, the energy expense can be further reduced by shortening the length of the s-bit streams at the cost of reduced precision. Supplementary Fig. S10 shows the numerical simulation of the error in expected values for $P\left(B\right)$ as a function of the bit-length of the s-bit stream for the inputs $P\left(A\right)$, $P\left(B/A\right)$, and $P(B/{A}^{C})$. The percentage error increases significantly with the reduction in bit-length of the s-bit streams.

While we have experimentally demonstrated that the distribution of the output voltage $({V}_{N6})$ from the inverting amplifier follows a Gaussian profile, it is possible that the distribution may deviate from a perfect Gaussian distribution due to many operational reasons. This will definitely lead to computation error. To assess the impact of a skewed distribution on the precision of the BN, we have performed numerical simulations assuming that ${V}_{N6}$ follows the Pearson random distribution function. Supplementary Fig. S11a shows the distribution of ${V}_{N6}$ for different values of skewness from −1 to 1 in steps of 0.5. Note that a skewness of −1 or 1 will be a rare occurrence under most practical circumstances. Supplementary Fig. S11b shows the corresponding ${p}_{s}$ as a function of ${V}_{{IT}}$. As the skewness increases, the deviation of ${p}_{s}$ from its expected value also increases. Supplementary Fig. S11c shows the colormap of the percentage error in estimating $P\left(B\right)$ using the BN hardware for different skewness in the stochastic input variables ${X}_{1}$ and ${X}_{2}$ that represent $P\left(B/A\right)$ and $P\left(B/{A}^{C}\right)$, respectively. As expected, the percentage error increases with increasing skewness. Furthermore, we have experimentally demonstrated that the distribution of the inverting threshold voltage (${V}_{{IT}}$) exhibits a Gaussian distribution after ${MT}6$ is subjected to 50 program/erase/read cycles with ${V}_{P}$ = −7 V, ${V}_{E}$ = 10 V, and ${\tau }_{P/E}$ = 100 µs. This ${V}_{{IT}}$ distribution leads to a small uncertainty ($\triangle P$) in probability of output voltages (${V}_{N7}$), as shown in Fig. 3n. We have used numerical simulations to assess the impact of uncertainty in obtained probabilities on the precision of the BN, where the probability of the select line, $A$, remains as a constant while the probability of both ${X}_{1}$ and ${X}_{2}$ are inflicted with $\triangle P$ due to cycle-to-cycle variation in the programmed probability. Supplementary Fig. S12 shows the colormap of the percentage error in estimating $P\left(B\right)$ using the BN hardware for uncertainty in the stochastic input variables ${X}_{1}$ and ${X}_{2}$ that represent $P\left(B/A\right)=0.50$ and $P\left(B/{A}^{C}\right)=0.56$, respectively, while $P\left(A\right)=0.28$ and $\triangle P \, \approx \, 0.065$ . From this colormap, we can conclude that even if the ${V}_{{IT}}$ of the thresholding inverter (${MT}6$) is inflicted with cycle-to-cycle variation from device programming, the inaccuracy of the 2-node Bayesian network $(B=A{X}_{1}+{A}^{C}{X}_{2})$ is less than $15\%$. This simulation result shows decent accuracy in hardware implementation of the BN.

Finally, the impact of device-to-device variation on the operation of BN is examined. Supplementary Fig. S13a shows the transfer characteristics of 10 MoS₂ memtransistors and Supplementary Fig. S13b shows the transfer characteristics for these 10 devices after one programming/erasing clock cycle (${V}_{P}$ = −7 V, ${V}_{E}$ = 10 V, and ${\tau }_{P/E}$ = 100 µs$.$). The device-to-device variation translates into error in $\triangle P$ and impacts the accuracy at the output of the BN. Supplementary Fig. S14 shows the colormap of error in $P(B)$ for $P({X}_{1})=0.5$, $P({X}_{2})=0.56$, and $P(A)=0.28$. We have used $\triangle P$ =0.046 for both ${X}_{1}$ and ${X}_{2}$ inferred from Supplementary Fig. S13b. From the error map, it is evident that the variation in the programmed probability inflicted by the device-to-device programming variation of the memtransistors resulted in a maximum error of 8% at the output of the BN.

Discussion

In conclusion, we have exploited cycle-to-cycle variability in the programmed conductance of 2D memtransistors and transcribed the same into s-bits with reconfigurable probability of obtaining ‘1’ in the bit-stream using a circuit that comprises only 6 memtransistors and spends < 2 pJ per s-bit. We subsequently combined the s-bit generator with a 2D memtransistor-based 2 × 1 ${MUX}$ to demonstrate hardware implementation of a BN. The BN architecture comprises 29 memtransistors and requires ~ 1.2 nJ of energy for precise computation. Our demonstration of a memtransistor-based standalone in-memory compute fabric shows the potential for emerging 2D materials and devices.

Methods

Fabrication of local back-gate islands

To define the back-gate island regions, a commercially-purchased substrate (285 nm SiO₂ on p⁺⁺-Si) was spin coated (4000 RPM for 45 s) with bilayer photoresist consisting of Lift-Off-Resist (LOR 5 A) and Series Photoresist (SPR 3012) and baked at 185 °C for 120 s and 95 °C for 60 s, respectively. The bilayer photoresist was then exposed using a Heidelburg Maskless Aligner (MLA 150) to define the island and developed using MF CD26 microposit, followed by a de-ionized (DI) water rinse. The back gate electrode of 20/50 nm TiN/Pt was deposited using reactive sputtering. The photoresist was removed using acetone and Photo Resist Stripper (PRS 3000) and cleaned using 2-propanol (IPA) and DI water. An atomic layer deposition (ALD) process was then implemented to grow 50 nm Al₂O₃ across the entire substrate, including the island regions. To access the individual Pt back-gate electrodes, etch patterns were defined using the same bilayer photoresist consisting of LOR 5 A and SPR 3012. The bilayer photoresist was then exposed to MLA 150 and developed using MF CD26 microposit. The 50 nm Al₂O₃ was subsequently dry etched using a BCl₃ reactive ion etch (RIE) chemistry at 5 °C for 20 s, which was repeated four times to minimize heating in the substrate. Finally, the photoresist was removed to give access to the individual Pt electrodes.

Large-area monolayer MoS₂ film growth

Monolayer MoS₂ was deposited on epi-ready 2” c-sapphire substrate by metalorganic chemical vapor deposition (MOCVD). An inductively heated graphite susceptor equipped with wafer rotation in a cold-wall horizontal reactor was used to achieve uniform monolayer deposition as previously described⁸⁰. Molybdenum hexacarbonyl (Mo(CO)₆) and hydrogen sulfide (H₂S) were used as precursors. Mo(CO)₆ maintained at 10 °C and 650 Torr in a stainless-steel bubbler was used to deliver 1.1 × 10⁻³ sccm of the metal precursor for the growth, while 400 sccm of H₂S was used for the process. MoS₂ deposition was carried out at 1000 °C and 50 Torr in H₂ ambient, where monolayer growth was achieved in 18 min. The substrate was first heated to 1000 °C in H₂ and maintained for 10 min before the growth was initiated. After growth, the substrate was cooled in H₂S to 300 °C to inhibit decomposition of the MoS₂ films. More details can be found in our earlier work^{45, 48, 81}.

MoS₂ film transfer to local back-gate islands

To fabricate the 2D memtransistors, the MOCVD-grown monolayer MoS₂ film was transferred from the sapphire growth substrate to the SiO₂/p⁺⁺-Si substrate with local back-gate islands using a PMMA (polymethyl-methacrylate) assisted wet transfer process. First, growth substrate was spin coated with PMMA and left to sit for 24 h to promote PMMA/MoS₂ adhesion. The corners of the spin-coated film were scratched using a razor blade and immersed inside 1 M NaOH solution kept at 90 °C. Capillary action caused the NaOH to be drawn into the substrate/film interface, separating the PMMA/MoS₂ film from the sapphire substrate. The separated film was rinsed three times inside separate water baths and fished-out using the SiO₂/p⁺⁺-Si substrate with local back-gate islands. The substrate was then baked at 50 °C and 70 °C for 10 min each to remove moisture and promote adhesion. An acetone bath was usd to remove the PMMA supporting layer, with a subsequent IPA bath to remove residue.

Fabrication of 2D memtransistors

To define the channel regions for the memtransistors, the substrate was spin-coated with PMMA and baked at 180 °C for 90 s. The resist was then patterned using electron beam (e-beam) lithography and developed using a 1:1 mixture of 4-methyl-2-pentanone (MIBK) and 2 propanol (IPA), with a subsequent IPA rinse. The monolayer MoS₂ film was then etched using a sulfur hexafluoride (SF₆) RIE chemistry at 5 °C for 30 s. Next, the sample was rinsed in acetone and IPA to remove PMMA. To define the source and drain contacts, sample was then spin coated with methyl methacrylate (MMA) followed by PMMA. E-beam lithography was used to pattern the source and drain contacts and 1:1 MIBK/ IPA was again used for development. 40 nm of nickel (Ni) and 30 nm of gold (Au) were deposited using e-beam evaporation. Finally, a lift-off process was performed to remove the excess Ni/Au and resist by immersing the sample in acetone for 30 min followed by IPA for another 30 mins. Each island contains one memtransistor to allow for individual gate control.

Monolithic integration

To define the connections between respective memtransistors, the substrate was spin coated with MMA and PMMA, followed by e-beam lithography and development using a 1:1 mixture of MIBK/IPA. E-beam evaporation of was used to deposit 60 nm of Ni and 30 nm of Au to form the connections. Finally, the e-beam resist was rinsed away by the same acetone and IPA lift-off process used previously.

Electrical characterization

Electrical characterization of the fabricated devices was performed using a Lake Shore CRX-VF probe station under atmospheric conditions and with Keysight B1500A parameter analyzer.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Code availability

The codes used for plotting the data are available from the corresponding authors on reasonable request.

References

Brown, G. E. & Smith, R. J. F. Conspecific skin extracts elicit antipredator responses in juvenile rainbow trout (Oncorhynchus mykiss). Can. J. Zool. 75, 1916–1922 (1997).
Article Google Scholar
Puga, J. L., Krzywinski, M. & Altman, N. Bayesian networks. Nat. Methods. 12, 799–800 (2015).
Article CAS PubMed Google Scholar
Smithson, S. C., Onizawa, N., Meyer, B. H., Gross, W. J. & Hanyu, T. Efficient CMOS Invertible Logic Using Stochastic Computing. IEEE Trans. Circuits Syst. I: Regul. Pap. 66, 2263–2274 (2019).
Article MathSciNet MATH Google Scholar
Ardakani, A., Leduc-Primeau, F., Onizawa, N., Hanyu, T. & Gross, W. J. VLSI Implementation of Deep Neural Network Using Integral Stochastic Computing. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 25, 2688–2699 (2017).
Article Google Scholar
Alaghi, A. & Hayes, J. P. Survey of stochastic computing. ACM Trans. Embedded Comput. Syst. (TECS) 12, 1–19 (2013).
Article Google Scholar
Gaines, B. R. In Proceedings of the April 18–20, 1967, spring joint computer conference. 149–156.
Poppelbaum, W., Afuso, C. & Esch, J. In Proceedings of the November 14–16, 1967, fall joint computer conference. 635–644.
Weijia, Z., Ling, G. W. & Seng, Y. K. In 2007 IEEE Conference on Electron Devices and Solid-State Circuits. 337–340.
Cai, R. et al. in Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems 476–488 (Association for Computing Machinery, Williamsburg, VA, USA, 2018).
Kulesza, Z. & Tylman, W. In Proceedings of the International Conference Mixed Design of Integrated Circuits and System, 2006. MIXDES 2006. 711-715.
Zermani, S., Dezan, C., Chenini, H., Diguet, J. & Euler, R. In 2015 IEEE Conference on Prognostics and Health Management (PHM). 1-10.
Knag, P., Lu, W. & Zhang, Z. A native stochastic computing architecture enabled by memristors. IEEE Trans. Nanotechnol. 13, 283–293 (2014).
Article ADS CAS Google Scholar
Gaba, S., Knag, P., Zhang, Z. & Lu, W. In 2014 IEEE International Symposium on Circuits and Systems (ISCAS). 2592-2595 (IEEE).
Gaba, S., Sheridan, P., Zhou, J., Choi, S. & Lu, W. Stochastic memristive devices for computing and neuromorphic applications. Nanoscale 5, 5872–5878 (2013).
Article ADS CAS PubMed Google Scholar
Debashis, P. et al. Hardware implementation of Bayesian network building blocks with stochastic spintronic devices. Sci. Reports. 10, https://doi.org/10.1038/s41598-020-72842-6 (2020).
Faria, R., Kaiser, J., Camsari, K. Y. & Datta, S. Hardware Design for Autonomous Bayesian Networks. Front. Computational Neurosci. 15, https://doi.org/10.3389/fncom.2021.584797 (2021).
Shim, Y., Chen, S., Sengupta, A. & Roy, K. Stochastic Spin-Orbit Torque Devices as Elements for Bayesian Inference. Sci. Rep. 7, 14101 (2017).
Article ADS PubMed PubMed Central Google Scholar
Faria, R., Camsari, K. Y. & Datta, S. Implementing Bayesian networks with embedded stochastic MRAM. AIP Adv. 8, 045101 (2018).
Article ADS Google Scholar
Venkatesan, R., Venkataramani, S., Fong, X., Roy, K. & Raghunathan, A. in 2015 Design, Automation & Test in Europe Conference & Exhibition (DATE). 1575–1578 (IEEE).
Finocchio, G. et al. The promise of spintronics for unconventional computing. J. Magn. Magn. Mater. 521, 167506 (2021).
Article CAS Google Scholar
Hu, J., Li, B., Ma, C., Lilja, D. & Koester, S. J. Spin-Hall-Effect-Based Stochastic Number Generator for Parallel Stochastic Computing. IEEE Trans. Electron Devices. 66, 3620–3627 (2019).
Article ADS CAS Google Scholar
Yang, K. et al. In 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC). 280-281 (IEEE).
Pamula, V. R. et al. In 2018 IEEE Symposium on VLSI Circuits. 1–2.
Satpathy, S. et al. In 2018 IEEE Symposium on VLSI Circuits. 169–170.
Jayaraj, A., Gujarathi, N. N., Venkatesh, I. & Sanyal, A. 0.6–1.2 V, 0.22 pJ/bit True Random Number Generator Based on SAR ADC. IEEE Trans. Circuits Syst. II: Express Briefs. 67, 1765–1769 (2020).
Article Google Scholar
Bae, S., Kim, Y., Park, Y. & Kim, C. 3-Gb/s High-Speed True Random Number Generator Using Common-Mode Operating Comparator and Sampling Uncertainty of D Flip-Flop. IEEE J. Solid-State Circuits. 52, 605–610 (2017).
Article ADS Google Scholar
Cao, Y., Zhao, X., Zheng, W., Zheng, Y. & Chang, C. H. A New Energy-Efficient and High Throughput Two-Phase Multi-Bit per Cycle Ring Oscillator-Based True Random Number Generator. IEEE Trans. Circuits Syst. I: Regul. Pap. 69, 272–283 (2022).
Article Google Scholar
Jaiswal, A., Fong, X. & Roy, K. Comprehensive scaling analysis of current induced switching in magnetic memories based on in-plane and perpendicular anisotropies. IEEE J. Emerg. Sel. Top. Circuits Syst. 6, 120–133 (2016).
Article ADS Google Scholar
Sengupta, A., Panda, P., Wijesinghe, P., Kim, Y. & Roy, K. Magnetic tunnel junction mimics stochastic cortical spiking neurons. Sci. Rep. 6, 1–8 (2016).
Article Google Scholar
Daniels, M. W., Madhavan, A., Talatchian, P., Mizrahi, A. & Stiles, M. D. Energy-Efficient Stochastic Computing with Superparamagnetic Tunnel Junctions. Phys. Rev. Appl. 13, 034016 (2020).
Article ADS CAS Google Scholar
Vodenicarevic, D. et al. Low-Energy Truly Random Number Generation with Superparamagnetic Tunnel Junctions for Unconventional Computing. Phys. Rev. Appl. 8, 054045 (2017).
Article ADS Google Scholar
Li, M.-Y., Su, S.-K., Wong, H.-S. P. & Li, L.-J. (Nature Publishing Group, 2019).
Jacob, A. P. et al. Scaling challenges for advanced CMOS devices. Int. J. High. Speed Electron. Syst. 26, 1740001 (2017).
Article CAS Google Scholar
Uchida, K. et al. In Digest. International Electron Devices Meeting. 47–50 (IEEE).
Manzeli, S., Ovchinnikov, D., Pasquier, D., Yazyev, O. V. & Kis, A. 2D transition metal dichalcogenides. Nat. Rev. Mater. 2, 17033 (2017).
Article ADS CAS Google Scholar
Akinwande, D. et al. Graphene and two-dimensional materials for silicon technology. Nature 573, 507–518 (2019).
Article ADS CAS PubMed Google Scholar
Chhowalla, M., Jena, D. & Zhang, H. Two-dimensional semiconductors for transistors. Nat. Rev. Mater. 1, 1–15 (2016).
Article Google Scholar
Schwierz, F., Pezoldt, J. & Granzner, R. Two-dimensional materials and their prospects in transistor electronics. Nanoscale 7, 8261–8283 (2015).
Article ADS CAS PubMed Google Scholar
Liu, C. et al. Two-dimensional materials for next-generation computing technologies. Nat. Nanotechnol. 15, 545–557 (2020).
Article ADS CAS PubMed Google Scholar
Iannaccone, G., Bonaccorso, F., Colombo, L. & Fiori, G. Quantum engineering of transistors based on 2D materials heterostructures. Nat. Nanotechnol. 13, 183–191 (2018).
Article ADS CAS PubMed Google Scholar
Liu, Y. et al. Promises and prospects of two-dimensional transistors. Nature 591, 43–53 (2021).
Article ADS CAS PubMed Google Scholar
Shen, P.-C. et al. Ultralow contact resistance between semimetal and monolayer semiconductors. Nature 593, 211–217 (2021).
Article ADS CAS PubMed Google Scholar
English, C. D., Smithe, K. K. H., Xu, R. L. & Pop, E. In 2016 IEEE International Electron Devices Meeting (IEDM). 5.6.1-5.6.4.
Price, K. M., Schauble, K. E., McGuire, F. A., Farmer, D. B. & Franklin, A. D. Uniform Growth of Sub-5-Nanometer High-κ Dielectrics on MoS2 Using Plasma-Enhanced Atomic Layer Deposition. ACS Appl. Mater. Interfaces. 9, 23072–23080 (2017).
Article CAS PubMed Google Scholar
Sebastian, A., Pendurthi, R., Choudhury, T. H., Redwing, J. M. & Das, S. Benchmarking monolayer MoS2 and WS2 field-effect transistors. Nat. Commun. 12, 693 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Sebastian, A., Pannone, A., Subbulakshmi Radhakrishnan, S. & Das, S. Gaussian synapses for probabilistic neural networks. Nat. Commun. 10, 4199 (2019).
Article ADS PubMed PubMed Central Google Scholar
Subbulakshmi Radhakrishnan, S., Sebastian, A., Oberoi, A., Das, S. & Das, S. A biomimetic neural encoder for spiking neural network. Nat. Commun. 12, 2143 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Jayachandran, D. et al. A low-power biomimetic collision detector based on an in-memory molybdenum disulfide photodetector. Nat. Electron. 3, 646–655 (2020).
Article Google Scholar
Schranghamer, T. F., Oberoi, A. & Das, S. Graphene memristive synapses for high precision neuromorphic computing. Nat. Commun. 11, 5474 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Das, S., Dodda, A. & Das, S. A biomimetic 2D transistor for audiomorphic computing. Nat. Commun. 10, 3450 (2019).
Article ADS PubMed PubMed Central Google Scholar
Das, S. et al. Transistors based on two-dimensional materials for future integrated circuits. Nat. Electron. 4, 786–799 (2021).
Article CAS Google Scholar
Zhu, K. et al. The development of integrated circuits based on two-dimensional materials. Nat. Electron. 4, 775–785 (2021).
Article CAS Google Scholar
Wachter, S., Polyushkin, D. K., Bethge, O. & Mueller, T. A microprocessor based on a two-dimensional semiconductor. Nat. Commun. 8, 14948 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Polyushkin, D. K. et al. Analogue two-dimensional semiconductor electronics. Nat. Electron. 3, 486–491 (2020).
Article CAS Google Scholar
Nikonov, D. E. & Young, I. A. Benchmarking of beyond-CMOS exploratory devices for logic integrated circuits. IEEE J. Exploratory Solid-State Computational Devices Circuits 1, 3–11 (2015).
Article ADS Google Scholar
Sylvia, S. S., Alam, K. & Lake, R. K. Uniform benchmarking of low-voltage van der Waals FETs. IEEE J. Exploratory Solid-State Computational Devices Circuits 2, 28–35 (2016).
Article ADS Google Scholar
Lee, C.-S., Cline, B., Sinha, S., Yeric, G. & Wong, H. S. P. 32-bit Processor core at 5-nm technology: Analysis of transistor and interconnect impact on VLSI system performance. 28.23.21-28.23.24, https://doi.org/10.1109/iedm.2016.7838498 (2016).
Agarwal, T. et al. Benchmarking of monolithic 3D integrated MX₂ FETs with Si FinFETs. 5.7.1–5.7.4, https://doi.org/10.1109/iedm.2017.8268336 (2017).
2DCC. 2d-crystal-consortium, <https://www.mri.psu.edu/2d-crystal-consortium/user-facilities/thin-films/list-thin-film-samples-available>
Pendurthi, R. et al. Heterogeneous Integration of Atomically Thin Semiconductors for Non‐von Neumann CMOS. Small 18, 2202590 (2022).
Radhakrishnan, S. S. et al. A Sparse and Spike‐timing‐based Adaptive Photo Encoder for Augmenting Machine Vision for Spiking Neural Networks. Adv. Materials. 2202535 (2022).
Dodda, A., Trainor, N., Redwing, J. & Das, S. All-in-one, bio-inspired, and low-power crypto engines for near-sensor security based on two-dimensional memtransistors. Nat. Commun. 13, 1–12 (2022).
Article Google Scholar
Oberoi, A., Dodda, A., Liu, H., Terrones, M. & Das, S. Secure Electronics Enabled by Atomically Thin and Photosensitive Two-Dimensional Memtransistors. ACS Nano. 15, 19815–19827 (2021).
Article CAS PubMed Google Scholar
Li, H. et al. From bulk to monolayer MoS2: evolution of Raman scattering. Adv. Funct. Mater. 22, 1385–1390 (2012).
Article ADS CAS Google Scholar
Das, S., Chen, H.-Y., Penumatcha, A. V. & Appenzeller, J. High performance multilayer MoS2 transistors with scandium contacts. Nano Lett. 13, 100–105 (2013).
Article ADS CAS PubMed Google Scholar
Schulman, D. S., Arnold, A. J. & Das, S. Contact engineering for 2D materials and devices. Chem. Soc. Rev. 47, 3037–3058 (2018).
Article CAS PubMed Google Scholar
Chuang, S. et al. MoS2 p-type transistors and diodes enabled by high work function MoO x contacts. Nano Lett. 14, 1337–1342 (2014).
Article ADS CAS PubMed Google Scholar
Arnold, A. J. et al. Mimicking Neurotransmitter Release in Chemical Synapses via Hysteresis Engineering in MoS2 Transistors. ACS Nano. 11, 3110–3118 (2017).
Article CAS PubMed Google Scholar
Illarionov, Y. Y. et al. The role of charge trapping in MoS₂/SiO₂and MoS₂/hBN field-effect transistors. 2D Materials 3, https://doi.org/10.1088/2053−1583/3/3/035004 (2016).
Illarionov, Y. Y. et al. Energetic mapping of oxide traps in MoS2 field-effect transistors. 2D Mater. 4, 025108 (2017).
Article Google Scholar
Jiang, J. et al. Defect engineering for modulating the trap states in 2D photoconductors. Adv. Mater. 30, 1804332 (2018).
Article Google Scholar
Tsai, H.-S. et al. Ultrafast exciton dynamics in scalable monolayer MoS2 synthesized by metal sulfurization. ACS Omega. 5, 10725–10730 (2020).
Article CAS PubMed PubMed Central Google Scholar
Docherty, C. J. et al. Ultrafast transient terahertz conductivity of monolayer MoS2 and WSe2 grown by chemical vapor deposition. ACS Nano. 8, 11147–11153 (2014).
Article CAS PubMed Google Scholar
Bhagdikar, S. & Mahapatra, S. In 2019 International Conference on Simulation of Semiconductor Processes and Devices (SISPAD). 1-4.
Grasser, T. Stochastic charge trapping in oxides: From random telegraph noise to bias temperature instabilities. Microelectron. Reliab. 52, 39–70 (2012).
Article CAS Google Scholar
Waltl, M. In 2019 IEEE International Integrated Reliability Workshop (IIRW). 1-9.
Kirton, M. & Uren, M. Noise in solid-state microstructures: A new perspective on individual defects, interface states and low-frequency (1/ƒ) noise. Adv. Phys. 38, 367–468 (1989).
Article ADS CAS Google Scholar
Ibe, O. Markov processes for stochastic modeling. (Newnes, 2013).
Uren, M., Kirton, M. & Collins, S. Anomalous telegraph noise in small-area silicon metal-oxide-semiconductor field-effect transistors. Phys. Rev. B 37, 8346 (1988).
Article ADS CAS Google Scholar
Xuan, Y. et al. Multi-scale modeling of gas-phase reactions in metal-organic chemical vapor deposition growth of WSe2. J. Crys.Growth 527, https://doi.org/10.1016/j.jcrysgro.2019.125247 (2019).
Dodda, A. et al. Stochastic resonance in MoS2 photodetector. Nat. Commun. 11, 4406 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The work was supported by Army Research Office (ARO) through Contract Number W911NF1920338 and National Science Foundation (NSF) through a CAREER Award under grant no. ECCS-2042154. Authors also acknowledge the materials support from the National Science Foundation (NSF) through the Pennsylvania State University 2D Crystal Consortium–Materials Innovation Platform (2DCCMIP) under NSF cooperative agreement DMR-2039351.

Author information

Authors and Affiliations

Engineering Science and Mechanics, Penn State University, University Park, 16802, PA, USA
Yikai Zheng, Harikrishnan Ravichandran, Thomas F. Schranghamer & Saptarshi Das
Materials Science and Engineering, Penn State University, University Park, 16802, PA, USA
Nicholas Trainor, Joan M. Redwing & Saptarshi Das
Materials Research Institute, Penn State University, University Park, 16802, PA, USA
Nicholas Trainor, Joan M. Redwing & Saptarshi Das
Electrical Engineering and Computer Science, Penn State University, University Park, 16802, PA, USA
Saptarshi Das

Authors

Yikai Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Harikrishnan Ravichandran
View author publications
You can also search for this author in PubMed Google Scholar
Thomas F. Schranghamer
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas Trainor
View author publications
You can also search for this author in PubMed Google Scholar
Joan M. Redwing
View author publications
You can also search for this author in PubMed Google Scholar
Saptarshi Das
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.D. conceived the idea and designed the experiments. Y.Z., H.R., and T.F.S. fabricated the memtransistors. Y.Z., H.R., and S.D. performed the measurements, analyzed the data, discussed the results, and agreed on their implications. N.T. grew MOCVD MoS₂ under the supervision of J.M. R. All authors contributed to the preparation of the manuscript.

Corresponding author

Correspondence to Saptarshi Das.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Punyashloka Debashis, Hyungjin Kim and the other, anonymous, reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zheng, Y., Ravichandran, H., Schranghamer, T.F. et al. Hardware implementation of Bayesian network based on two-dimensional memtransistors. Nat Commun 13, 5578 (2022). https://doi.org/10.1038/s41467-022-33053-x

Download citation

Received: 13 January 2022
Accepted: 31 August 2022
Published: 23 September 2022
DOI: https://doi.org/10.1038/s41467-022-33053-x

This article is cited by

The Roadmap of 2D Materials and Devices Toward Chips
- Anhan Liu
- Xiaowei Zhang
- Tian-Ling Ren
Nano-Micro Letters (2024)
Recent Advances in In-Memory Computing: Exploring Memristor and Memtransistor Arrays with 2D Materials
- Hangbo Zhou
- Sifan Li
- Yong-Wei Zhang
Nano-Micro Letters (2024)
An all 2D bio-inspired gustatory circuit for mimicking physiology and psychology of feeding behavior
- Subir Ghosh
- Andrew Pannone
- Saptarshi Das
Nature Communications (2023)
Bringing uncertainty quantification to the extreme-edge with memristor-based Bayesian neural networks
- Djohan Bonnet
- Tifenn Hirtzlin
- Elisa Vianello
Nature Communications (2023)
A bio-inspired visuotactile neuron for multisensory integration
- Muhtasim Ul Karim Sadaf
- Najam U Sakib
- Saptarshi Das
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.