The gate injection-based field-effect synapse transistor with linear conductance update for online training

Seo, Seokho; Kim, Beomjin; Kim, Donghoon; Park, Seungwoo; Kim, Tae Ryong; Park, Junkyu; Jeong, Hakcheon; Park, See-On; Park, Taehoon; Shin, Hyeok; Kim, Myung-Su; Choi, Yang-Kyu; Choi, Shinhyun

doi:10.1038/s41467-022-34178-9

Article
Open access
Published: 28 October 2022

Nature Communications volume 13, Article number: 6431 (2022)

Abstract

Neuromorphic computing, an alternative to the von Neumann architecture, requires synapse devices in which data can be stored and computed in the same place. The three-terminal synapse device is attractive for neuromorphic computing because of its high stability and controllability. However, highly nonlinear weight updates, low dynamic range, and incompatibility with conventional CMOS systems have been reported as obstacles to large-scale crossbar arrays. Here, we propose a CMOS-compatible gate injection-based field-effect transistor that employs thermionic emission to make the conductance update more linear. The dependence of the linearity on the conduction mechanism is examined by inserting an interfacial layer in the gate stack. To identify the conduction mechanism, the gate current is measured at varying temperatures. The device based on thermionic emission achieves superior synaptic characteristics, leading to an accuracy of 93.17% on the MNIST dataset in artificial neural network simulation.

Introduction

With the advent of the big data era, machine learning and artificial intelligence have advanced dramatically, demanding computing capability that can handle data-intensive tasks¹. However, the conventional von Neumann architecture has become a bottleneck because of its limited parallel computing ability and the high power consumption of processing big data, both caused by the obligatory data transfer over the data bus between the physically separated processing unit and memory^2,3,4. New computing architectures have therefore been developed for big data analysis. Their key idea is to compute the data in memory with little or no data transfer, which reduces power consumption and suppresses latency through parallel data processing^4,5,6,7.

Neuromorphic computing is one of the candidates for post-von Neumann architecture. By mimicking the synaptic behavior of the biological neural network, the big data can be processed by parallel computing in an energy-efficient way in real-time^5,8,9. For accelerating the artificial neural network (ANN) with this new architecture, the neuromorphic device, which can memorize and compute the data on the same device, is required. Recently, several studies utilizing conventional memory, such as DRAM and charge trap flash memory (CTF), and emerging memory devices such as PRAM and ReRAM have been reported in neuromorphic applications^{10,11,12,13,14,15,16,17}.

Among conventional memories, the well-established DRAM offers fast write speed and a linear conductance update¹⁰. Capacitor-based synaptic devices with a DRAM-like structure are also well suited to the repeated updates of online training because of their high endurance¹⁸. However, because of their poor retention, the weight values must be transferred to nonvolatile memories very frequently during training, resulting in high power consumption. Moreover, it is difficult to create and retain analog conductance states with a single such device; a capacitor (storing charge for the weight value) and several additional transistors are required to implement analog states^10,19,20. This is a drawback in terms of integration density compared with a single synaptic device.

On the other hand, nonvolatile memories such as CTF can distinguish between states of multi-level cells depending on how many charges are trapped in the charge trap layer, and have long retention¹². Additionally, several studies show that the endurance characteristics of CTF can be significantly improved by structural and material engineering of the device^21,22,23. Therefore, research using CTF devices is being actively conducted for applications in neuromorphic computing, as well as for the main memory for data storage. However, while data can be stored for long periods without data loss, CTF normally has a large operation voltage and slow speed, requiring more energy, especially for data erasing^24,25,26. In the case of online training, using devices with low update energy is advantageous because the training demands repeated writing and erasing operations more than millions of times.

Two-terminal emerging memories have been extensively studied as promising neuromorphic devices because of their simple structure and scalability. Furthermore, they can be integrated into large-scale crossbar arrays for vector-matrix multiplication, the basic operation of neuromorphic computing^8,27,28,29. However, they suffer from device variation caused by randomly formed filaments during set/reset. This stochastic ion movement leads to unpredictable variation, which has been a significant bottleneck for their successful application as computing devices^4,28. The three-terminal synaptic device, on the other hand, offers better control of the synaptic weight and allows data to be read and written simultaneously^8,9,30,31,32.

Beyond suppressing the problems mentioned above, synapse device characteristics such as a highly linear conductance update and compatibility with conventional complementary metal-oxide-semiconductor (CMOS) systems are commonly required for crossbar-array-based neuromorphic computing to match the performance of software-based ANNs^{32,33,34,35,36}. In particular, the linearity of long-term potentiation and long-term depression (LTP-LTD) is regarded as one of the most important characteristics for evaluating synapse devices³³. A linear conductance update under an identical consecutive pulse scheme is believed to enable multi-level operation while reducing the burden on the peripheral circuits that operate the crossbar array^12,37.

The conventional three-terminal floating-gate flash memory shows highly nonlinear weight updates^12,30,38,39 because Fowler-Nordheim (F-N) tunnelling is a strong function of the electric field, which changes with the charge already stored on the floating gate^38,40. Ion-conducting electrolyte-based three-terminal synapse devices show highly linear conductance updates^41,42,43,44. However, they suffer from a low on/off ratio, long programming pulses, and incompatibility with conventional CMOS devices.

In this paper, we propose a three-terminal Gate Injection-based Field-Effect Transistor (GIFET), which utilizes the CMOS compatible material and fabrication process. Through different operation mechanisms from conventional flash memory, we derive superior synapse device characteristics, such as high linearity and symmetry, high temporal and spatial uniformity (<1.64%, 9.76%), and low power consumption (50 fJ/SOP). These performances lead to a high accuracy of approximately 93.17% with the MNIST handwritten recognition dataset.

Results

Structure and operation principle of GIFET

In conventional flash memory, the primary update mechanism is F-N tunnelling, and the resulting dependence of the current density through the tunnelling oxide on the present floating-gate charge state causes the nonlinearity^38,39,40. To weaken this dependence, we program and erase the charge in the storage layer by thermionic emission of electrons to and from the gate metal (see Fig. 1a), which is a weaker function of the electric field than field emission^45,46.
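
For reference, the difference in field dependence can be made explicit with the standard textbook forms of the two mechanisms. These are generic expressions, not fitted to the GIFET; the symbols \(A^{*}\) (effective Richardson constant), \(\phi_B\) (barrier height), \(\varepsilon\) (permittivity), \(E\) (electric field) and the lumped constants \(C_1\), \(C_2\) are introduced here for illustration only:

\[
J_{\rm TE} = A^{*} T^{2} \exp\!\left[-\frac{q\left(\phi_B - \sqrt{qE/4\pi\varepsilon}\right)}{kT}\right],
\qquad
J_{\rm FN} = C_1 E^{2} \exp\!\left(-\frac{C_2}{E}\right)
\]

In the thermionic (Schottky) expression the field enters only through the square-root barrier-lowering term, whereas in the F-N expression it appears as \(1/E\) inside the exponent, so a change in field caused by stored charge shifts \(J_{\rm FN}\) far more strongly.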

Because the current density of thermionic emission depends on the barrier height between layers, the band diagram of the GIFET was designed as shown in Fig. 1b. The barrier height between the charge store layer (CSL) and the blocking layer should be moderately low: low enough to guarantee sufficient current density for a high on/off ratio, yet high enough to hold electrons outside write/erase operations for good retention. Therefore, we selected a CSL material (\(\chi_{\rm CSL} = 4.92\ {\rm eV}\))⁴⁷ with a greater electron affinity than the blocking layer (\(\chi_{\rm BL} = 3.93\ {\rm eV}\))⁴⁸.

Figure 1c shows a cross-sectional transmission electron microscope (TEM) image of the device. A 20 nm-thick n⁺-doped Si layer on a silicon dioxide layer was patterned into a mesa to define the device area. On top of that, silicon dioxide (SiO₂) was applied as the gate oxide. WO_x, which has a high electron affinity, was deposited as the CSL to form a shallow well in the energy band diagram where the electrons are stored. Subsequently, a-Si:H was used as the blocking layer to give a lower barrier height difference with the gate metal (see details in Methods and Fig. 1d for secondary ion mass spectrometry (SIMS) data of the gate stack of the GIFET). The amount of charge stored in the WO_x layer widens or narrows the depletion region of the Si channel by the field effect, and this change in the depletion region is the primary mechanism that controls the artificial synapse weight in terms of conductance.

Figure 1e–g present schematics of the basic write, erase, and read operations of the GIFET, respectively. When a positive bias is applied to the gate while the source and drain are grounded, electrons in the WO_x layer are extracted to the gate metal by the electric field between the gate and the channel (see Fig. 1e). The depletion region in the channel therefore shrinks, increasing the channel conductance (write process). Conversely, when a negative bias is applied to the gate with the source and drain grounded, electrons are injected from the gate metal into the WO_x layer, decreasing the channel conductance (erase process, see Fig. 1f). To read the stored weight, the gate is grounded and a read voltage is applied to the drain; the conductance measured through the cell gives its present weight state (see Fig. 1g). More detailed operating principles are described in Supplementary Fig. 1. In addition, because the GIFET is a transistor-based device, its electrical characteristics are measured and evaluated as a transistor (see Supplementary Fig. 2). As the number of electrons stored in the CSL varies with write/erase operations, the I_D-V_G and I_D-V_D characteristics change, which means that the weight of the synaptic device can be controlled.
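
The write, erase, and read operations described above can be summarized in a toy behavioral model. This is only a sketch under strong assumptions: the class name `ToyGIFET`, the conductance window (25 nS to 250 nS) and the perfectly equal step per pulse are hypothetical choices for illustration, and the pulse amplitude is used only for its sign, not its magnitude.

```python
class ToyGIFET:
    """Toy behavioral model: conductance moves by a fixed step per gate pulse."""

    def __init__(self, g_min=25e-9, g_max=250e-9, n_levels=1000):
        # Hypothetical values: a 10x conductance window split into 1000 equal steps.
        self.g_min = g_min
        self.g_max = g_max
        self.step = (g_max - g_min) / n_levels
        self.g = g_min

    def gate_pulse(self, v_gate):
        """Positive gate bias = write (G increases), negative = erase (G decreases)."""
        if v_gate > 0:
            self.g = min(self.g + self.step, self.g_max)
        elif v_gate < 0:
            self.g = max(self.g - self.step, self.g_min)
        # v_gate == 0 is a hold pulse: the stored charge is left unchanged.

    def read(self, v_drain=1.0):
        """Gate grounded; return the drain current for the given read voltage."""
        return self.g * v_drain


dev = ToyGIFET()
for _ in range(500):
    dev.gate_pulse(+2.5)  # write pulses
i_after_write = dev.read()
for _ in range(500):
    dev.gate_pulse(0.0)   # hold pulses
i_after_hold = dev.read()
for _ in range(500):
    dev.gate_pulse(-3.0)  # erase pulses
i_after_erase = dev.read()
```

In this idealized picture the read current is unchanged by hold pulses and returns to its starting value after an equal number of erase pulses; a real device deviates from this through nonlinearity, asymmetry, and retention loss.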

The relationship between linearity and conduction mechanism

To investigate the effect of the conduction mechanism for charge transport through the blocking layer on the linearity, we observed the dependence of the current density through the blocking layer at several temperatures. To focus on the current through the blocking layer and confirm the presence of the Schottky barrier, a gate stack of the device without the gate oxide was prepared as shown in Fig. 2a. I-V characteristic measurements were conducted in the temperature range 273 K–423 K.

**Fig. 2: The linearity of GIFET and its relationship with the conduction mechanism.**

Figure 2b presents the Arrhenius plot of ln(I/T²) versus q/kT. The linear relationship between ln(I/T²) and q/kT implies the existence of a Schottky barrier and indicates that the primary conduction mechanism through the blocking layer is thermionic emission⁴⁹. More details on the barrier height are given in Supplementary Fig. 3. For comparison, we also measured the temperature dependence of the current density of a gate stack with a SiO_x interfacial layer between the CSL and the blocking layer (see Supplementary Fig. 4a). During the write operation, the SiO_x interfacial layer is under a higher electric field than an a-Si:H layer of the same thickness because of its lower dielectric constant. Therefore, the voltage drop occurs across the interfacial layer, and the barrier height between WO_x and a-Si:H decreases. Accordingly, the conduction mechanism between the CSL and the blocking layer changes from thermionic emission to field emission or trap-assisted tunnelling through the SiO_x layer (see Supplementary Fig. 4b). Supplementary Fig. 4c shows the Arrhenius plot of ln(I/T²) versus q/kT for the device with the interfacial layer. The zero slope of the graph shows that the Schottky barrier has disappeared, and the linear relationship between ln(I/V²) and 1/V at room temperature for the device with the interfacial layer (Supplementary Fig. 4d) implies that the conduction mechanism has changed to F-N tunnelling⁵⁰.
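
As an illustration of how a barrier height can be read off such a plot, the sketch below fits ln(I/T²) against 1/(k_B T) and takes the negative slope as the barrier in eV. It is not the authors' analysis code: the data are synthetic, generated from an ideal thermionic emission law with an assumed 0.40 eV barrier and an arbitrary prefactor, and the function names are hypothetical.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def linear_fit(xs, ys):
    """Ordinary least-squares slope and intercept."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def barrier_from_richardson(temps_k, currents_a):
    """Fit ln(I/T^2) against 1/(k_B T) in 1/eV; the slope is -phi_B in eV."""
    xs = [1.0 / (K_B_EV * t) for t in temps_k]
    ys = [math.log(i / t ** 2) for t, i in zip(temps_k, currents_a)]
    slope, _ = linear_fit(xs, ys)
    return -slope


# Synthetic data only: ideal thermionic emission with an ASSUMED 0.40 eV barrier.
PHI_ASSUMED = 0.40
PREFACTOR = 1e-6  # arbitrary, in A/K^2; lumps area and Richardson constant
temps = [273, 298, 323, 348, 373, 398, 423]  # K, spanning the measured range
currents = [PREFACTOR * t ** 2 * math.exp(-PHI_ASSUMED / (K_B_EV * t)) for t in temps]

phi_fit = barrier_from_richardson(temps, currents)
print(f"extracted barrier height: {phi_fit:.3f} eV")
```

With the same routine, a temperature-independent current would give a slope near zero, which corresponds to the flat Arrhenius plot reported for the device with the interfacial layer.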

The LTP-LTD characteristics of GIFET devices with the SiO_x layer between the CSL and the blocking layer (field emission dominant) and without it (thermionic emission dominant) were then evaluated to examine the relation between linearity and the conduction mechanism for charge transfer through the blocking layer. As displayed in Supplementary Fig. 4e, the device with the SiO_x interfacial layer loses the linear conductance update seen in the device without the interfacial layer under the same pulse train (write: 2.5 V, 500 μs, erase: −3 V, 500 μs, read: 1 V, 500 μs), especially during LTP.

Figure 2c shows the LTP-LTD characteristic of the GIFET observed with 1000 potentiation (500 μs, 5 V)–1000 depression (500 μs, −3.3 V) gate pulse trains (see Supplementary Fig. 5 for pulse information). As shown in Fig. 2c, the device has high linearity with a low asymmetric ratio^32,51 (see Supplementary Fig. 6 and Note 1). A conductance ratio G_max/G_min of around 10 is achieved, a value that has been reported as the on/off ratio needed for high performance in ANN tasks³³. Furthermore, 1000 analog conductance levels are more than enough for high accuracy. Figure 2d enlarges parts of the data in Fig. 2c. Each panel illustrates potentiation or depression of the weight with 100 switching pulses. As shown in Fig. 2d, the device shows stable linear conductance updates over the entire conductance range, meaning that the same number of pulses produces a similar conductance change.
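
To make the notion of update linearity concrete, the sketch below uses an exponential-saturation behavioral model of the kind commonly used in device-to-algorithm simulators such as NeuroSim. Whether the Supplementary Notes use exactly this form is an assumption, and the conductance window and the shape parameter `a` are hypothetical values chosen for illustration.

```python
import math


def ltp_conductance(p, p_max, g_min, g_max, a):
    """Conductance after p potentiation pulses; a larger `a` gives a more linear curve."""
    b = (g_max - g_min) / (1.0 - math.exp(-p_max / a))
    return b * (1.0 - math.exp(-p / a)) + g_min


def max_deviation_from_line(p_max, g_min, g_max, a):
    """Largest gap between the curve and the ideal straight line, as a fraction of the window."""
    worst = 0.0
    for p in range(p_max + 1):
        ideal = g_min + (g_max - g_min) * p / p_max
        actual = ltp_conductance(p, p_max, g_min, g_max, a)
        worst = max(worst, abs(actual - ideal) / (g_max - g_min))
    return worst


# Hypothetical window with G_max/G_min = 10 and 1000 pulses, mirroring the text.
P_MAX, G_MIN, G_MAX = 1000, 25e-9, 250e-9
near_linear = max_deviation_from_line(P_MAX, G_MIN, G_MAX, a=1e5)
strongly_nonlinear = max_deviation_from_line(P_MAX, G_MIN, G_MAX, a=100.0)
print(f"a=1e5: {near_linear:.4f}, a=100: {strongly_nonlinear:.4f}")
```

In this model a nearly linear device stays within a fraction of a percent of the straight line, while a strongly saturating one uses up most of its conductance window in the first few hundred pulses.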

The program operation time of the GIFET can be reduced while keeping the conductance update linear by tuning the pulse conditions to optimize synaptic properties such as on/off ratio and linearity for neuromorphic applications. In practical machine learning applications, the program operation time of synaptic devices is considered the main parameter determining system speed. Minimizing the operation time is therefore important, and the shortest program operation time of the GIFET for linear weight update was determined to be 200 μs (see Supplementary Fig. 7).

Figure 2e presents the linear conductance update with arbitrary pulse trains. A sequence of 500 write pulses (1.4 V, 500 μs), 500 hold pulses (0 V, 500 μs) and 500 erase pulses (−2.5 V, 500 μs) was applied to the gate. For the read process, a read pulse (1 V, 200 μs) was applied to the drain terminal. As shown in Fig. 2e, no current change was observed during the hold process, which means that the data are well preserved. It is important for synaptic devices to maintain nonvolatile states at different intermediate conductance levels for neuromorphic computing applications. Moreover, the device shows a constant conductance change under repeated write/erase pulse trains, as magnified in Fig. 2f. These data imply that the weight stored in the device can be manipulated with an identical pulse scheme, which helps ease the burden on the peripheral circuit^12,37.

Next, the relationship between operation pulse amplitude/duration and conductance change per pulse was investigated, as shown in Supplementary Fig. 8, which plots the current change against pulse amplitude at different pulse durations. The GIFET maintains a linear conductance update with a small current change per pulse under various pulse schemes (see Supplementary Figs. 9–11). In other words, the GIFET behaves stably across pulse schemes for neuromorphic computing applications and can be customized to fit the needs of other applications.

Operational stability as a synaptic device

Besides linearity in LTP-LTD, high performance in a large-scale crossbar-array structure requires that characteristics such as uniformity, G_max/G_min ratio, low power consumption, endurance, and retention be satisfied simultaneously^{9,32,33,35,52}. For integration into a large-scale crossbar array, spatiotemporal uniformity is one of the essential properties^28,52.

First, to investigate the spatio-temporal uniformity of the GIFET, we assessed its I_D–V_G characteristics by sweeping the gate voltage with a constant read voltage on the drain. Figure 3a, b present the gate voltage at a specific drain current (I_D = 3 nA) from repeated cycles on a single device for cycle-to-cycle variation and from 15 different devices for device-to-device variation, respectively. The cycle-to-cycle variation (σ/μ) was 4.30% at HRS and 1.15% at LRS, while the device-to-device variation (σ/μ) was 5.16% at HRS and 3.67% at LRS (see Supplementary Fig. 12). In addition, the repeated LTP-LTD characteristics on a single device and on several different devices were observed with 1000 potentiation (5 V, 500 μs)–1000 depression (−3.3 V, 500 μs) gate pulse trains, as shown in Fig. 3c, d. The LTP-LTD characteristics of the device in Fig. 3c showed a low variation of 1.64% (σ/μ) over 100 repeated cycles. The device-to-device variation, measured on 15 devices, was 9.76% (see Fig. 3d). The standard deviation of the nonlinearity based on the LTP-LTD characteristics of the 15 devices is also calculated in Supplementary Fig. 13. The nonlinearity during potentiation/depression in Supplementary Fig. 13a, b was fitted using the method from Supplementary Note 2. In both the I_D-V_G and LTP-LTD results, the spatial variation was slightly higher than the temporal variation because of Si channel thickness variation during fabrication (see Methods section). The structure switches uniformly because it relies on a large population of electrons rather than on individual ion movement, which minimizes the effect of fluctuation or stochastic behavior of individual charged particles.
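
The σ/μ figures quoted above are coefficients of variation. A minimal sketch of the calculation follows; the sample values are hypothetical, not measured data, and the use of the population standard deviation (rather than the sample standard deviation) is an assumption, since the text does not say which was used.

```python
import statistics


def coefficient_of_variation(values):
    """Return sigma/mu as a percentage, using the population standard deviation."""
    mu = statistics.fmean(values)
    sigma = statistics.pstdev(values)
    return 100.0 * sigma / mu


# Hypothetical gate voltages (V) at I_D = 3 nA from five devices; not measured data.
v_gate_at_3na = [1.00, 1.05, 0.95, 1.02, 0.98]
cv_percent = coefficient_of_variation(v_gate_at_3na)
print(f"device-to-device variation: {cv_percent:.2f}%")
```

The same function applies to cycle-to-cycle variation by passing values from repeated cycles on one device instead of values from different devices.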

**Fig. 3: Measured GIFET data for high performance in a crossbar-array structure.**

Figure 3e shows the I_G-V_G characteristics of the GIFET. The gate current of the device was lower than 20 pA at 5 V gate bias, which means that the energy consumed by a write pulse with a 500 μs pulse width is lower than 50 fJ. In addition, the energy consumed during the read process in Fig. 3c is 5.54 pJ at the maximum conductance level. Notably, the read current level of the device can be modulated by controlling the Si channel doping concentration (see Fig. 3f and Supplementary Fig. 14), indicating that the device operation speed and power consumption can be tuned for specific applications such as edge computing processors and high-performance processors.
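
The 50 fJ bound follows from treating the write pulse as rectangular, so that the energy is the product of current, voltage, and pulse width. The short check below uses only the numbers quoted in the text; the function name is a hypothetical helper.

```python
def pulse_energy(current_a, voltage_v, width_s):
    """Energy of a rectangular pulse: E = I * V * t."""
    return current_a * voltage_v * width_s


# Upper bound on gate current (20 pA) at 5 V gate bias with a 500 us pulse width.
e_write = pulse_energy(20e-12, 5.0, 500e-6)
print(f"write energy bound: {e_write * 1e15:.1f} fJ")
```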

The endurance and retention of the device are also crucial for long-term and reliable neuromorphic computing applications⁵². To investigate the endurance of GIFET, we applied 500 consecutive potentiation pulses with an amplitude of 5 V and a width of 200 μs, followed by 500 consecutive depression pulses with an amplitude of −5 V and a width of 200 μs per switching cycle. We then read the change in state by drain voltage (1 V, 50 μs) at each switching cycle. As presented in Fig. 3g, the device achieves robust endurance (≥2 × 10⁸ pulses).

Figure 3h shows the data-holding ability of the GIFET. We observed a data loss of 5.45% (13.6 nS) of the updated conductance after 1000 s. Also, several intermediate conductance levels were maintained without severe degradation. It is essential for synaptic devices to maintain non-volatile states at various intermediate conductance levels in order to be used for neuromorphic computing applications. There is a tradeoff between retention, endurance, and linearity for weight update due to lowered barrier height of the blocking layer. The newly developed device improves endurance and linear conductance update while it loses data holding ability compared to conventional flash memory devices. Because the barrier height between the blocking layer and CSL can be controlled by the material stoichiometry and film quality^53,54, the device characteristic can be optimized for specific purposes through further engineering of processes and materials utilized for CSL and blocking layer.

The robustness of the GIFET to temperature variations

Figure 4a, b present the linear characteristics of the GIFET under varying temperatures. We confirmed the robustness of the linearity over a temperature change from 298 K to 393 K during 1000 potentiation (4 V, 500 μs) and 1000 depression (−4 V, 500 μs) operations. As shown in Supplementary Fig. 15, the linearity of conductance update is stable at all temperatures without severe degradation.

**Fig. 4: Robustness of the GIFET to temperature variations.**

To verify the reliability of multi-level conductance states at high temperatures, the retention characteristics were measured for two states at temperatures from 298 K to 393 K, as shown in Fig. 4c. Starting from the initial state, 100 (red sphere) and 1000 (blue sphere) erase pulses (−3.3 V, 500 μs) were applied, respectively, and read operations (1 V, 50 μs) were performed every 1 s. Only the conductance level (drain current level) increased as the temperature increased, and the conductance states remained unchanged for 200 s. This indicates long-term plasticity, and the proposed GIFET can hold data for online training even at high temperatures.

Figure 4d and Supplementary Fig. 16 show the endurance characteristics of the GIFET. To investigate the robustness of the hardware, the endurance was measured under the same pulse conditions at temperatures from 298 K to 393 K. The pulse train consists of ten consecutive potentiation pulses with an amplitude of 6 V and a width of 500 μs, followed by ten consecutive depression pulses with an amplitude of −6 V and a width of 500 μs. As presented in Fig. 4d and Supplementary Fig. 16, the device operates stably, holding its high-level and low-level states without severe degradation after 10⁵ switching cycles (2×10⁶ pulses in total). This indicates reliable endurance characteristics for highly frequent updates during online learning.
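
The pulse totals quoted for the endurance tests follow directly from the cycle definitions. The sketch below reproduces the 2×10⁶ figure for this temperature test; the second calculation, the number of cycles implied by the room-temperature endurance figure (≥2 × 10⁸ pulses at 500 + 500 pulses per cycle), is an inference from the stated protocol rather than a number given in the text.

```python
def total_pulses(cycles, potentiation_per_cycle, depression_per_cycle):
    """Total number of gate pulses applied over a number of switching cycles."""
    return cycles * (potentiation_per_cycle + depression_per_cycle)


# Temperature test: 10^5 cycles of 10 potentiation + 10 depression pulses.
temp_test_pulses = total_pulses(10**5, 10, 10)

# Inferred, not stated in the text: cycles needed to reach 2e8 pulses
# when each cycle has 500 potentiation + 500 depression pulses.
implied_room_temp_cycles = (2 * 10**8) // (500 + 500)

print(temp_test_pulses, implied_room_temp_cycles)
```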

ANN simulation with the performance of GIFET

Figure 5a presents the read operation of the GIFET, in which 1 V, 30 μs pulses read the conductance changed by 1.8 V/−2.5 V, 300 μs update pulses. As the graph shows, the GIFET can be read without gate bias using 30 μs read pulses, which is comparable with conventional NAND flash⁵⁵. It therefore has an advantage in power consumption for dense array applications, whereas NAND flash must repeatedly apply specific biases and determine whether the cell is on or off from its threshold voltage in order to read the present state.

**Fig. 5: GIFET simulation with MNIST dataset.**

To examine the performance of the GIFET for neuromorphic computing in a large-scale crossbar array structure, we simulated a multi-layer artificial neural network using long-term plasticity characteristics directly extracted from measured data³³ (see Supplementary Fig. 17 and Note 4). The multi-layer perceptron ANN consists of an input layer with 400 nodes, a hidden layer with 100 nodes, and an output layer with 10 nodes, as shown in Fig. 5b. Each of the 400 input nodes represents one pixel of the 20 × 20 MNIST handwritten data, and this input produces 10 outputs through two layers of vector-matrix multiplication and activation functions. The ten output nodes give the classification result among the digits 0 to 9. Each synaptic weight was updated by stochastic gradient descent with the parameters of the GIFET, such as nonlinearity, G_max/G_min ratio, cycle-to-cycle variation, device-to-device variation, and applied pulse scheme, to account for device non-ideality. In each epoch, 8000 images randomly chosen from the 60,000-image training set were used for training. After training, the system accuracy was evaluated with the 10,000 MNIST images of the testing set. To inspect the influence of linearity on ANN performance, we conducted the simulations with varying linearity. Figure 5c presents the resulting MNIST classification accuracy per epoch for the GIFET, the software baseline with an ideal device, the device with ideal linearity and symmetry, and the devices with deteriorated linearity (see Supplementary Fig. 18). The ideal device shows an accuracy of 96.78%, and the GIFET-based artificial neural network reached an accuracy of approximately 93.17% over 300 epochs, which is almost equivalent to the 93.45% of the ideal-linearity device. Furthermore, the devices with lightly and highly deteriorated linearity showed degraded training results, with maximum accuracies of 85.13% and 70.56%, respectively. Compared with the synaptic parameters of other devices under the same algorithm, the GIFET shows high accuracy in a fair comparison that includes the number of conductance states, nonlinearity, on/off ratio and spatio-temporal variation (see Supplementary Table 1). These results demonstrate the importance of the linear conductance update for neuromorphic computing on a large-scale crossbar array and suggest that the GIFET has sufficient linearity.

Discussion

In summary, we developed a three-terminal synapse device with enhanced LTP-LTD linearity. The device uses the field effect to control the channel conductance through charge stored in the CSL, which is injected from or extracted to the gate metal by thermionic emission. The effect of the conduction mechanism between the CSL and the gate metal through the a-Si:H blocking layer on the linear conductance update was investigated by comparing devices with and without an interfacial layer and by observing the gate current through the blocking layer of each device at varying temperatures. The thermionic emission-based GIFET showed a linear conductance update, whereas the device with the interfacial layer was nonlinear. Furthermore, because its electron-based mechanism is built on the flash memory structure, the GIFET compares favorably in number of conductance states, area, power consumption, on/off ratio, operating voltage, programming time, spatio-temporal variation, linearity, retention, endurance, simulation accuracy and CMOS compatibility (see Table 1). In addition, the low spatio-temporal variation, reliable endurance and retention, and low power consumption of the GIFET indicate that the device is suitable for large-scale crossbar arrays for neuromorphic computing. Moreover, all the processes and materials used to fabricate the GIFET are CMOS compatible, which suggests low-cost and fast integration with conventional systems. Artificial neural network simulation on the MNIST dataset with parameters extracted from GIFET measurements shows a high accuracy of 93.17%, which implies the possibility of AI acceleration with GIFET-based large-scale crossbar arrays.

Table 1 Comparison with the various synaptic transistors for neuromorphic computing

Methods

Device Fabrication

The Si top layer of an SOI wafer, 145 nm thick, was oxidized in a thermal furnace, and the oxide was removed with hydrofluoric acid to thin the Si layer to 20 nm. Ion implantation (phosphorus, dose 2 × 10¹³ cm⁻², 7.5 keV) and annealing (1273 K, N₂ atmosphere, 1 min) were then carried out. Each cell on the buried oxide was defined with a 5 μm channel width by mesa pattern lithography and reactive ion etching (RIE) with SF₆ and Ar gas. A 25 nm silicon dioxide layer was deposited as the gate oxide using plasma-enhanced chemical vapor deposition (PECVD). WO_x was deposited by RF sputtering using a WO₃ sputtering target (Kurt J. Lesker, USA). The sample was annealed in a rapid thermal annealing system in an O₂ atmosphere at 573 K to improve the stoichiometry of the WO_x layer (see Supplementary Fig. 19 and Note 5). The WO_x layer was patterned with a 20 μm gate length by lithography and RIE with SF₆ and Ar gas. A 50 nm hydrogenated amorphous silicon layer was deposited using PECVD and patterned by lithography and RIE. Silicon dioxide on the source and drain was removed by wet etching with BOE. Lastly, Ti/Au (10 nm/50 nm) was deposited as the metal pad for the source, drain, and gate.

Electrical Measurement

A parameter analyzer (Keithley 4200A-SCS) with a conventional probe station was used to measure the I_D-V_G characteristics of the GIFET by sweeping the gate voltage while applying a read voltage to the drain. The gate bias sweep resolution was 0.05 V. The analog conductance update under successive identical pulse trains was measured using the parameter analyzer and a pulse measurement unit (PMU), which allows the pulse width and amplitude on the gate and drain to be set as desired. Read pulses were applied to the drain after every write or erase process to read the conductance state.

A data acquisition system (USB-6363, National Instruments) and a current preamplifier (DL Instruments, Model 1211) were used to measure the endurance of the GIFET. The pulse magnitude and width were modulated through MATLAB® code. The repeated switching cycles were applied to the GIFET through the DAQ, and the output current flowed to the preamplifier under the drain read voltage. The current was measured by averaging the output current over a specified duration. Each switching cycle was similar to that used in the analog conductance update measurement with the parameter analyzer and PMU. The LRS and HRS of each switching cycle were extracted.

Conduction mechanism analysis

To investigate the conduction mechanism through the gate stack, I-V characteristics were measured at various temperatures under vacuum (~10⁻² Torr). A Keithley 236 source measurement unit (SMU) driven by LabVIEW was used to apply voltage and measure current with a cryogenic probe station (ModuSystems, Inc).

MNIST simulation based on GIFET array

The simulation of the GIFET-based crossbar array was conducted with “NeuroSim+”. The neural network was composed of three layers and trained by supervised learning with backpropagation. The input layer had 400 nodes for the 20 × 20 pixels of a binary MNIST image, the hidden layer had 100 nodes, and the output layer had 10 nodes for the classification results, representing the digits 0 to 9. Stochastic gradient descent was used for the weight update. It randomly samples examples from the training dataset in each epoch to compute the gradient of the cost function with respect to the network parameters. Therefore, it is usually much faster and is widely used for training⁵⁶.
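
To illustrate how a gradient-descent update can be restricted to what a pulse-programmed device can realize, the sketch below rounds each desired weight change to a whole number of identical pulses on an ideal linear device. It is not NeuroSim+ code: the function names, the normalized weight range [0, 1], and the learning rate are assumptions, and nonlinearity and device variation are deliberately left out.

```python
def weight_to_conductance(w, g_min, g_max):
    """Map a normalized weight in [0, 1] linearly onto the conductance window."""
    return g_min + w * (g_max - g_min)


def pulses_for_update(delta_w, n_levels):
    """Round a desired weight change to a whole number of identical pulses."""
    return round(delta_w * n_levels)


def apply_sgd_step(w, grad, lr, n_levels):
    """One SGD step limited to integer pulse counts on an ideal linear device."""
    n_pulses = pulses_for_update(-lr * grad, n_levels)
    w_new = w + n_pulses / n_levels
    return min(max(w_new, 0.0), 1.0), n_pulses


N_LEVELS = 1000  # number of analog conductance states reported for the GIFET
# Synapse count of the 400-100-10 network, ignoring any bias terms (assumption).
N_SYNAPSES = 400 * 100 + 100 * 10

# Hypothetical single update: gradient 0.26 with learning rate 0.1.
w, n = apply_sgd_step(w=0.500, grad=0.26, lr=0.1, n_levels=N_LEVELS)
print(N_SYNAPSES, n, round(w, 3))
```

A positive pulse count corresponds to potentiation pulses and a negative count to depression pulses; with more conductance levels, the rounding error of each update shrinks.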

The simulation consists of two parts: the synaptic array and peripheral circuitry. The peripheral circuit includes a switch matrix, crossbar WL decoder, MUX decoder, analog-to-digital read circuit, adder, and shift register. The device parameters of GIFET such as set voltage, pulse width, min/max conductance, nonlinearity, cycle-to-cycle variation, and device-to-device variation are utilized to perform the simulation.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Code availability

The codes used for the simulations are available from the corresponding author upon reasonable request.

References

Lecun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Fuller, E. J. et al. Parallel programming of an ionic floating-gate memory array for scalable neuromorphic computing. Science 364, 570–574 (2019).
Lee, S. H., Zhu, X. & Lu, W. D. Nanoscale resistive switching devices for memory and computing applications. Nano Res. 13, 1228–1243 (2020).
Dai, S. et al. Recent advances in transistor-based artificial synapses. Adv. Funct. Mater. 29, 1–22 (2019).
Van De Burgt, Y. et al. A non-volatile organic electrochemical device as a low-voltage artificial synapse for neuromorphic computing. Nat. Mater. 16, 414–418 (2017).
Choi, S., Sheridan, P. & Lu, W. D. Data clustering using memristor networks. Sci. Rep. 5, 1–10 (2015).
Ielmini, D. & Wong, H. S. P. In-memory computing with resistive switching devices. Nat. Electron. 1, 333–343 (2018).
Han, H., Yu, H., Wei, H., Gong, J. & Xu, W. Recent progress in three-terminal artificial synapses: from device to system. Small 15, 1–17 (2019).
Choi, Y., Oh, S., Qian, C., Park, J. H. & Cho, J. H. Vertical organic synapse expandable to 3D crossbar array. Nat. Commun. 11, 1–9 (2020).
Li, Y. et al. Capacitor-based cross-point array for analog neural network with record symmetry and linearity. In 2018 IEEE Symposium on VLSI Technology 25–26 (IEEE, 2018).
Painkras, E. et al. SpiNNaker: A 1-W 18-core system-on-chip for massively-parallel neural network simulation. IEEE J. Solid-State Circuits 48, 1943–1953 (2013).
Choi, H. S., Park, Y. J., Lee, J. H. & Kim, Y. 3-D synapse array architecture based on charge-trap flash memory for neuromorphic application. Electron 9, 1–10 (2020).
Wright, C. D., Hosseini, P. & Diosdado, J. A. V. Beyond von-neumann computing with nanoscale phase-change memory devices. Adv. Funct. Mater. 23, 2248–2254 (2013).
Wang, Z. et al. Reinforcement learning with analogue memristor arrays. Nat. Electron. 2, 115–124 (2019).
Yao, P. et al. Fully hardware-implemented memristor convolutional neural network. Nature 577, 641–646 (2020).
Li, C. et al. Analogue signal and image processing with large memristor crossbars. Nat. Electron. 1, 52–59 (2018).
Merolla, P. A. et al. A million spiking-neuron integrated circuit with a scalable communication network and interface. Science 345, 668–673 (2014).
Yu, S. Neuro-inspired computing with emerging nonvolatile memorys. Proc. IEEE 106, 260–285 (2018).
Ambrogio, S. et al. Equivalent-accuracy accelerated neural-network training using analogue memory. Nature 558, 60–67 (2018).
Chen, Z., Chen, X. & Gu, J. 15.3 a 65nm 3T dynamic analog RAM-based computing-in-memory macro and CNN accelerator with retention enhancement, adaptive analog sparsity and 44TOPS/W system energy efficiency. In 2021 IEEE International Solid-State Circuits Conference (ISSCC) 240–242 (IEEE, 2021).
Park, G. H. & Cho, W. J. Reliability of modified tunneling barriers for high performance nonvolatile charge trap flash memory application. Appl. Phys. Lett. 96, 1–4 (2010).
Park, G. H., Jung, M. H., Kim, K. S., Chung, H. B. & Cho, W. J. Tunneling barrier engineered charge trap flash memory with ONO and NON tunneling dielectric layers. Curr. Appl. Phys. 10, e13–e17 (2010).
Zhu, H. et al. Discrete charge states in nanowire flash memory with multiple Ta₂O₅ charge-trapping stacks. Appl. Phys. Lett. 104, 1–6 (2014).
Zhang, W. et al. Neuro-inspired computing chips. Nat. Electron. 3, 371–382 (2020).
Zhao, M., Gao, B., Tang, J., Qian, H. & Wu, H. Reliability of analog resistive switching memory for neuromorphic computing. Appl. Phys. Rev. 7, 011301 (2020).
Park, Y. J. et al. 3-D stacked synapse array based on charge-trap flash memory for implementation of deep neural networks. IEEE Trans. Electron Devices 66, 420–427 (2019).
Wang, Z. et al. Resistive switching materials for information processing. Nat. Rev. Mater. 5, 173–195 (2020).
Yang, Y. & Lu, W. Nanoscale resistive switching devices: Mechanisms and modeling. Nanoscale 5, 10076–10092 (2013).
Choi, S. et al. SiGe epitaxial memory for neuromorphic computing with reproducible high performance based on engineered dislocations. Nat. Mater. 17, 335–340 (2018).
Diorio, C., Hasler, P. & Minch, B. A. A single-transistor silicon synapse. IEEE Trans. Electron Devices 43, 1972–1980 (1996).
Sun, J. et al. Optoelectronic synapse based on IGZO-alkylated graphene oxide hybrid structure. Adv. Funct. Mater. 28, 1804397 (2018).
Yu, J. M. et al. All-solid-state ion synaptic transistor for wafer-scale integration with electrolyte of a nanoscale thickness. Adv. Funct. Mater. 2010971, 1–10 (2021).
Chen, P. Y., Peng, X. & Yu, S. NeuroSim: A circuit-level macro model for benchmarking neuro-inspired architectures in online learning. IEEE Trans. Comput. Des. Integr. Circuits Syst. 37, 3067–3080 (2018).
Wu, H. et al. Device and circuit optimization of RRAM for neuromorphic computing. In 2017 IEEE Int. Electron Devices Meeting (IEDM) 11.5.1–11.5.4 (IEEE, 2017).
Kim, M. K. & Lee, J. S. Ferroelectric analog synaptic transistors. Nano Lett. 19, 2044–2050 (2019).
Burr, G. W. et al. Experimental demonstration and tolerancing of a large-scale neural network (165,000 synapses) using phase-change memory as the synaptic weight element. IEEE Trans. Electron Dev. 62, 3498–3507 (2015).
Moon, K. et al. RRAM-based synapse devices for neuromorphic systems. Faraday Discuss 213, 421–451 (2019).
Shrivastava, S., Chavan, T. & Ganguly, U. Ultra-low Energy charge trap flash based synapse enabled by parasitic leakage mitigation. Preprint at https://arxiv.org/abs/1902.09417 (2019).
Choi, H. S. et al. 3-D floating-gate synapse array with spike-time-dependent plasticity. IEEE Trans. Electron Devices 65, 101–107 (2018).
Fowler, R. H. & Nordheim, L. Electron emission in intense electric fields. Proc. R. Soc. Lond. A 119, 173–181 (1928).
Yang, C. S. et al. All-solid-state synaptic transistor with ultralow conductance for neuromorphic computing. Adv. Funct. Mater. 28, 1–10 (2018).
Yang, C. S. et al. A synaptic transistor based on quasi-2D molybdenum oxide. Adv. Mater. 29, 1–10 (2017).
Fuller, E. J. et al. Li-ion synaptic transistor for low power analog computing. Adv. Mater. 29, 1–8 (2017).
Zhu, J. et al. Ion-gated synaptic transistors based on 2D van der Waals crystals with tunable diffusive dynamics. Adv. Mater. 30, 1800195 (2018).
Simmons, J. G. Richardson-Schottky effect in solids. Phys. Rev. Lett. 15, 967–968 (1965).
Kiziroglou, M. E. et al. Thermionic field emission at electrodeposited Ni-Si Schottky barriers. Solid. State Electron. 52, 1032–1038 (2008).
Liu, X., Zheng, H., Li, Y. & Zhang, W. Factors on the separation of photogenerated charges and the charge dynamics in oxide/ZnFe₂O₄ composites. J. Mater. Chem. C 1, 329–337 (2013).
Matsuura, H., Okuno, T., Okushi, H. & Tanaka, K. Electrical properties of n-amorphous/p-crystalline silicon heterojunctions. J. Appl. Phys. 55, 1012–1019 (1984).
Ang, K.-W. et al. Novel silicon-carbon (Si:C) Schottky barrier enhancement layer for dark-current suppression in Ge-on-SOI MSM photodetectors. IEEE Electron Device Lett. 29, 704–707 (2008).
Li, H., Zhang, Q., Yap, C. C. & Tay, B. K. Electrical transport in carbon nanotube intermolecular p-n junctions. In The 4th IEEE International NanoElectronics Conference 1–2 (IEEE, 2011).
Jang, J.-W., Park, S., Jeong, Y.-H. & Hwang, H. ReRAM-based synaptic device for neuromorphic computing. In 2014 IEEE International Symposium on Circuits and Systems (ISCAS) 1054–1057 (IEEE, 2014).
Lanza, M. et al. Recommended methods to study resistive switching devices. Adv. Electron. Mater. 5, 1–28 (2019).
Bivour, M., Zähringer, F., Ndione, P. & Hermle, M. Sputter-deposited WOₓ and MoOₓ for hole selective contacts. Energy Procedia 124, 400–405 (2017).
Mews, M., Korte, L. & Rech, B. Oxygen vacancies in tungsten oxide and their influence on tungsten oxide/silicon heterojunction solar cells. Sol. Energy Mater. Sol. Cells 158, 77–83 (2016).
Cheong, W. et al. A flash memory controller for 15μs ultra-low-latency SSD using high-speed 3D NAND flash with 3μs read time. In 2018 IEEE International Solid-State Circuits Conference (ISSCC) 338–340 (IEEE, 2018).
Ruder, S. An overview of gradient descent optimization algorithms. Preprint at https://arxiv.org/abs/1609.04747 (2016).
Li, X. et al. Multi-terminal ionic-gated low-power silicon nanowire synaptic transistors with dendritic functions for neuromorphic systems. Nanoscale 12, 16348–16358 (2020).
Go, J. et al. W/WO₃₋ₓ based three-terminal synapse device with linear conductance change and high on/off ratio for neuromorphic application. Appl. Phys. Express 12, 26503 (2019).
Nikam, R. D., Kwak, M., Lee, J., Rajput, K. G. & Hwang, H. Controlled ionic tunneling in lithium nanoionic synaptic transistor through atomically thin graphene layer for neuromorphic computing. Adv. Electron. Mater. 6, 1901100 (2020).

Acknowledgements

This work was partially supported by the R&D program of the Korea Evaluation Institute of Industrial Technology (KEIT) grant funded by the Korea government (Ministry of Trade, Industry, and Energy) (20003789), by the R&D programs of the National Research Foundation of Korea (NRF) grant funded by the Korea government (Ministry of Science and ICT) (2018R1A2A3075302, 2019M3F3A1A02072336, 2020M3F3A2A01085755, 2020M3F3A2A01082592, 2021M3F3A2A01037858, 2022M3F3A2A01072851, and 2022M3I7A2078273), and by the Nanomedical Devices Development Project of the National Nano Fab Center (CMS2103M001).

Author information

These authors contributed equally: Seokho Seo, Beomjin Kim, Donghoon Kim, Seungwoo Park.

Authors and Affiliations

The School of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea
Seokho Seo, Beomjin Kim, Donghoon Kim, Seungwoo Park, Tae Ryong Kim, Junkyu Park, Hakcheon Jeong, See-On Park, Taehoon Park, Hyeok Shin, Myung-Su Kim, Yang-Kyu Choi & Shinhyun Choi

Contributions

S.S., B.K., D.K., and S.P. contributed equally to this work, and S.C. directed the team. T.R.K., S.O.P., and S.C. designed the basic concept of the idea. S.S., B.K., D.K., and S.P. planned and performed the experiments. S.S., B.K., S.P., D.K., T.P., H.S., M.S.K., and Y.K.C. manufactured the device. S.S., B.K., D.K., S.P., H.J., M.S.K., and Y.K.C. measured the device and analyzed the data. J.P. conducted the array simulation. S.S., B.K., D.K., and S.C. wrote the manuscript.

Corresponding author

Correspondence to Shinhyun Choi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Su-Ting Han and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Seo, S., Kim, B., Kim, D. et al. The gate injection-based field-effect synapse transistor with linear conductance update for online training. Nat Commun 13, 6431 (2022). https://doi.org/10.1038/s41467-022-34178-9

Received: 07 July 2021
Accepted: 13 October 2022
Published: 28 October 2022
DOI: https://doi.org/10.1038/s41467-022-34178-9
