Abstract
We present a study of operating region-dependent weight updates in a synaptic thin-film transistor (Syn-TFT) with an amorphous In–Ga–Zn–O (IGZO) channel layer. To obtain synaptic behavior (e.g. a memory phenomenon) in the IGZO TFT, a defective gate oxide (e.g. SiO2) is intentionally used so that charges are trapped when programming pulses are applied to the gate terminal. Through this charge trapping, the conductance of the Syn-TFT is modulated by the programming pulses, which constitutes the weight update. The weight update characteristics of the Syn-TFT are analyzed in terms of a dynamic ratio (drw) for two operating regions (i.e. the above-threshold and sub-threshold regimes). Here, the operating region is selected by the level of the gate read-voltage relative to the threshold voltage of the Syn-TFT. To verify this analysis, the static and pulsed characteristics of the fabricated Syn-TFT are measured. The experimental results show that the drw is larger in the sub-threshold regime than in the above-threshold regime. In addition, the weight linearity is better in the sub-threshold regime than in the above-threshold regime. Since both the drw and the weight linearity are expected to affect the performance (e.g. the classification accuracy) of an analog accelerator (AA) built from Syn-TFTs, an AA simulation is performed with a crossbar simulator to check this.
Introduction
Emerging technologies based on non-silicon materials have been intensively studied for future applications such as brain-inspired neuromorphic systems1,2,3,4,5. As a fundamental building block of such systems, synaptic devices based on these emerging materials have been reported6,7,8,9,10. For example, thin-film transistors (TFTs) based on high-mobility amorphous oxide semiconductors (AOSs), such as In–Ga–Zn–O (IGZO), have been considered as synaptic devices because their optical and electrical instabilities can mimic the properties of a biological synapse (e.g. a memory phenomenon)11,12,13,14,15. Note that these optical and electrical instabilities have been regarded as undesirable properties of IGZO TFTs for displays16,17. For the synaptic properties of the IGZO TFTs, however, the electrical instabilities (e.g. those caused by defects in the gate oxide) are intentionally exploited. IGZO TFTs with these synaptic properties are conventionally operated in the above-threshold regime because of its relatively high current driving capability18,19,20,21. Accordingly, the weight update is expected to be faster in the above-threshold regime than in the sub-threshold regime. However, the above-threshold regime can have a narrow synaptic dynamic range because the drain current in this regime has only a linear (or parabolic) dependence22,23. In addition, the power dissipation in the above-threshold regime is difficult to reduce because of the relatively high current level24,25,26. On the other hand, the sub-threshold regime of the IGZO TFT can achieve ultra-low power consumption27,28.
However, the weight updates in the sub-threshold regime can be relatively slow because of the lower current level compared with the above-threshold regime, although their speed can still be reasonable for the neuronal time scale of the human brain, which is long and ranges from milliseconds to hundreds of milliseconds29,30. Besides power consumption and speed, the two operating regions are also expected to differ in the characteristics of the weight updates, such as the dynamic ratio and weight linearity, which therefore need to be studied.
In this paper, we study the operating region-dependent weight updates of a synaptic thin-film transistor (Syn-TFT) based on an amorphous IGZO channel layer. To give the IGZO TFT a memory function, a defective oxide (e.g. SiO2) is intentionally used so that charges are trapped when programming pulses are applied to the gate terminal (VGS). Through this memory behavior, the conductance of the Syn-TFT varies with the programming pulses, which constitutes the weight update. The characteristics of the weight updates are analyzed in terms of a dynamic ratio (drw) for two operating regions, namely the above-threshold and sub-threshold regimes. Here, the operating regime is determined by the level of the gate read-voltage (\({\text{V}}_{\text{GS}}^{\text{ read}}\)) relative to the threshold voltage (VT) of the Syn-TFT. To check this, the static and pulsed characteristics of the fabricated Syn-TFT are measured using a standard semiconductor analyzer. The experimental results show that the drw is larger in the sub-threshold regime than in the above-threshold regime. Moreover, the linear range of the weight, averaged over the synaptic depression and facilitation, is wider in the sub-threshold regime than in the above-threshold regime. Both the drw and the weight linearity are expected to affect the performance (e.g. the classification accuracy (CA)) of an analog accelerator (AA) in which Syn-TFTs are arrayed. To verify this, an AA simulation is performed with a crossbar simulator.
Syn-TFT and related theories
Synaptic behavior of the IGZO thin-film transistor
Figure 1a,b show a cross-sectional view of the Syn-TFT (i.e. the IGZO TFT) and its equivalent circuit, respectively. As can be seen, defects exist in the gate oxide (e.g. SiO2) of the Syn-TFT because of the disorder of an oxide deposited with a low-temperature TFT process31. This defective oxide contains multiple trap states, in which charges (e.g. electrons) from the IGZO channel layer are trapped or de-trapped when programming pulses are applied to the gate terminal (VGS), so that a memory function is achieved. This memory function is depicted in detail in Fig. 1c,d with the band diagrams for electron trapping and de-trapping, respectively. Here, the electron trapping or de-trapping depends on the polarity of the programming voltage (\({\text{V}}_{\text{GS}}^{ \, {\text{prog}}}\)) and on the number of programming pulses applied to the gate for a fixed input voltage (VI) at the drain. These trapped or de-trapped electrons change the threshold voltage shift (\(\Delta{\text{V}}_{\text{T}}\)), which is defined as \(\, \Delta{\text{Q}}_{\text{e}}^{\text{trap}}{ /} {\text{C}}_{\text{i}}\). Here, \(\Delta{\text{Q}}_{\text{e}}^{\text{trap}}\) is the variation of the negative charge density per unit area and Ci is the gate insulator capacitance per unit area. The VT therefore also varies and is expressed as follows,
\[{\text{V}}_{\text{T}} = {\text{V}}_{\text{T0}} + \Delta{\text{V}}_{\text{T}} \quad (1)\]
where VT0 is the initial value of the VT. In the initial state, assuming that there are no trapped electrons in the gate insulator, \(\Delta{\text{V}}_{\text{T}}\) = 0 and thus VT = VT0. When multiple programming pulses (positive or negative) are applied, \(\Delta{\text{V}}_{\text{T}}\) varies because of the electron trapping or de-trapping, so that VT = VT0 + \(\Delta{\text{V}}_{\text{T}}\). With this VT, the \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) can be set as \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) > VT or \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) < VT to select the operating region (i.e. the above-threshold or sub-threshold region, respectively). Figure 2 shows the schematic transfer characteristics in the above-threshold and sub-threshold regions. Here, assuming a sufficiently low VI in both operating regions, the output current (IO) of each operating regime is approximated as follows32,
where Kabove is a constant for the above-threshold regime, Ksub is a constant for the sub-threshold regime, L is the channel length, W is the channel width, nt is the ideality factor related to the interface states, kT is the thermal energy, and q is the elementary charge. As seen in Fig. 2, the initial state of the Syn-TFT is the full facilitation (FF), which gives the maximum value of the IO (i.e. \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\)) with VT = VT0 (i.e. \(\Delta{\text{V}}_{\text{T}}\) = 0). The \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\) can therefore be written with Eqs. (2) and (3), as follows,
where the \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\left({\text{above}}\right)\) is the \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\) for the above-threshold regime and \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\left({\text{sub}}\right)\) is the \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\) for the sub-threshold regime.
Starting from the FF, applying multiple positive programming pulses to the gate gradually decreases the IO because the VT increases, which is called the synaptic depression process. During the synaptic depression, the increase of \(\Delta{\text{V}}_{\text{T}}\) becomes limited as more programming pulses are applied, so that \(\Delta{\text{V}}_{\text{T}}\) saturates. At that point, the state with the decreased IO is the full depression (FD), which defines the maximum shift \(\Delta{\text{V}}_{\text{T}} \, {= \Delta}{{\text{V}} \, }_{\text{T}}^{\text{fd}}\) > 0 (see Fig. 2). Here, \(\Delta{{\text{V}} \, }_{\text{T}}^{\text{fd}}\) is the \(\Delta{\text{V}}_{\text{T}}\) regarded as saturated with respect to time, although \(\Delta{\text{V}}_{\text{T}}\) can still increase further with more programming pulses. Similarly, the IO for this FD (i.e. \({\text{I}}_{\text{O}}^{ \, {\text{fd}}}\)) can be written with Eqs. (2) and (3), as follows,
Here, \({\text{I}}_{\text{O}}^{ \, {\text{f}}{\text{d}}}\left({\text{above}}\right)\) is the \({\text{I}}_{\text{O}}^{ \, {\text{fd}}}\) for the above-threshold regime and \({\text{I}}_{\text{O}}^{ \, {\text{fd}}}\left({\text{sub}}\right)\) is the \({\text{I}}_{\text{O}}^{ \, {\text{fd}}}\) for the sub-threshold regime. Note that, when multiple negative programming pulses are applied from the FD condition, the IO recovers to the initial state (i.e. the FF) because the VT decreases, which is called the synaptic facilitation process.
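For reference, the current expressions referred to above as Eqs. (2) to (7) can be sketched in the following form. This is a reconstruction under stated assumptions, not the typeset originals: it assumes the usual low-VI linear dependence on the gate overdrive above threshold and an exponential dependence below threshold, and it uses only the symbols defined in the text. The exact prefactors and grouping of constants in the original equations may differ.

```latex
% Assumed general forms of the output current (cf. Eqs. (2) and (3))
\begin{align}
I_O(\text{above}) &\approx K_{\text{above}} \frac{W}{L}
    \left( V_{GS}^{\text{read}} - V_{T0} - \Delta V_T \right) V_I \\
I_O(\text{sub}) &\approx K_{\text{sub}} \frac{W}{L}
    \exp\!\left( \frac{q \left( V_{GS}^{\text{read}} - V_{T0} - \Delta V_T \right)}{n_t kT} \right) V_I
\end{align}
% Full facilitation (cf. Eqs. (4) and (5)): set \Delta V_T = 0
% Full depression  (cf. Eqs. (6) and (7)): set \Delta V_T = \Delta V_T^{fd}
```

In this form, the FF and FD currents differ only through the value of \(\Delta{\text{V}}_{\text{T}}\) inserted, which is what allows the normalized weight and dynamic ratio in the next section to be written without Kabove, Ksub, W, or L.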
Synaptic weight and dynamic ratio
Based on the synaptic behavior explained in the previous section, the characteristics of the weight updates can be described theoretically for the two operating regions. As illustrated in the equivalent circuit of the Syn-TFT (see Fig. 1b), the conductance serving as the synaptic weight (Gw) is defined as the ratio of the IO to the VI, as follows,
\[{\text{G}}_{\text{w}} = \frac{{\text{I}}_{\text{O}}}{{\text{V}}_{\text{I}}} \quad (8)\]
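Based on Eq. (8), the Gw for the FF and FD at \({\text{V}}_{\text{GS}} \, = \, {\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) can be expressed with \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\) and \({\text{I}}_{\text{O}}^{ \, {\text{fd}}}\), respectively, as follows,
\[{\text{G}}_{\text{w}}^{ \, {\text{ff}}} = \frac{{\text{I}}_{\text{O}}^{ \, {\text{ff}}}}{{\text{V}}_{\text{I}}} \quad (9)\]
\[{\text{G}}_{\text{w}}^{ \, {\text{fd}}} = \frac{{\text{I}}_{\text{O}}^{ \, {\text{fd}}}}{{\text{V}}_{\text{I}}} \quad (10)\]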
where \({\text{G}}_{\text{w}}^{ \, {\text{ff}}}\) is the Gw for the FF and \({\text{G}}_{\text{w}}^{ \, {\text{fd}}}\) is the Gw for the FD. For a detailed comparison between the two operating regions, a normalized synaptic weight (\(\overline{{\text{w} }_{\text{G}}}\)) needs to be defined. Using the \({\text{G}}_{\text{w}}^{\text{ ff}}\) as the maximum value of the Gw, the \(\overline{{\text{w} }_{\text{G}}}\) can be written with Eqs. (8) and (9), as follows,
\[\overline{{\text{w} }_{\text{G}}} = \frac{{\text{G}}_{\text{w}}}{{\text{G}}_{\text{w}}^{ \, {\text{ff}}}} = \frac{{\text{I}}_{\text{O}}}{{\text{I}}_{\text{O}}^{ \, {\text{ff}}}} \quad (11)\]
With Eqs. (2) and (4), the \(\overline{{\text{w} }_{\text{G}}}\) in the above-threshold region (i.e. \(\overline{{\text{w} }_{\text{G}}}\left({\text{above}}\right)\)) can be represented based on Eq. (11), as follows,
Similarly, the \(\overline{{\text{w} }_{\text{G}}}\) in the sub-threshold region (i.e. \(\overline{{\text{w} }_{\text{G}}}\left({\text{sub}}\right)\)) can be represented with Eqs. (3) and (5), as follows,
Here, the \(\overline{{\text{w} }_{\text{G}}}\left({\text{above}}\right)\) is a linear function of the \(\Delta{\text{V}}_{\text{T}}\), whereas the \(\overline{{\text{w} }_{\text{G}}}\left({\text{sub}}\right)\) depends exponentially on the \(\Delta{\text{V}}_{\text{T}}\). For both operating regions, the two extreme cases of the \(\overline{{\text{w} }_{\text{G}}}\), namely the FF (i.e. \({\text{I}}_{{\text{O}} \, }{=} \, {\text{I}}_{\text{O}}^{ \, {\text{ff}}}\)) and the FD (i.e. \({\text{I}}_{\text{O}} \, {=} \, {\text{I}}_{\text{O}}^{ \, {\text{fd}}}\)), are expressed based on Eq. (11), respectively, as follows,
\[\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}} = \frac{{\text{I}}_{\text{O}}^{ \, {\text{ff}}}}{{\text{I}}_{\text{O}}^{ \, {\text{ff}}}} = 1 \quad (14)\]
\[\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}} = \frac{{\text{I}}_{\text{O}}^{ \, {\text{fd}}}}{{\text{I}}_{\text{O}}^{ \, {\text{ff}}}} \quad (15)\]
where \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}}\) is the \(\overline{{\text{w} }_{\text{G}}}\) for the FF and \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}\) is the \(\overline{{\text{w} }_{\text{G}}}\) for the FD. As can be seen in Eq. (15), the \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}\) is expected to approach zero when \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\) \(\gg\) \({\text{I}}_{\text{O}}^{ \, {\text{fd}}}\).
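As another synaptic characteristic, the drw can be defined as the ratio of the \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}}\) to the \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}\) with Eqs. (14) and (15), as follows,
\[{\text{d}}{\text{r}}_{\text{w}} = \frac{\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}}}{\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}} = \frac{{\text{I}}_{\text{O}}^{ \, {\text{ff}}}}{{\text{I}}_{\text{O}}^{ \, {\text{fd}}}} \quad (16)\]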
Note that, in the ideal case, the drw is infinite because the \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}}\) is unity and the \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}\) is zero, so a large drw is advantageous. With Eqs. (4) and (6), the drw in the above-threshold region (i.e. \({\text{d}}{\text{r}}_{\text{w}}({\text{above}})\)) is expressed based on Eq. (16), as follows,
Similarly, the drw in the sub-threshold region (i.e. \({\text{d}}{\text{r}}_{\text{w}}({\text{sub}})\)) is represented with Eqs. (5) and (7), as follows,
Here, the drw(sub) is expected to be larger than the drw(above) because of the exponential dependence in the sub-threshold regime, even for the same \(\Delta{\text{V}}_{\text{T}}\) = \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\) in the two operating regions, which implies an advantage of the sub-threshold regime.
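The contrast between the linear and exponential dependences can be illustrated numerically. The sketch below assumes the normalized weight takes the form 1 − ΔVT/(VGS_read − VT0) above threshold and exp(−qΔVT/(nt·kT)) below threshold, which follows from the assumed current forms rather than from equations shown in the text. The overdrive, ideality factor, and threshold shift are hypothetical illustration values, not measured ones.

```python
import math

KT_OVER_Q = 0.02585  # thermal voltage kT/q near 300 K, in volts


def w_above(delta_vt, overdrive):
    """Normalized weight above threshold: linear in delta_vt (assumed form)."""
    return 1.0 - delta_vt / overdrive


def w_sub(delta_vt, n_t):
    """Normalized weight below threshold: exponential in delta_vt (assumed form)."""
    return math.exp(-delta_vt / (n_t * KT_OVER_Q))


def dynamic_ratio(w_ff, w_fd):
    """dr_w as the ratio of the full-facilitation to the full-depression weight."""
    return w_ff / w_fd


# Hypothetical illustration values (not taken from the measurements):
OVERDRIVE = 1.0    # V_GS^read - V_T0, in volts
N_T = 2.0          # ideality factor
DELTA_VT_FD = 0.5  # saturated threshold-voltage shift, in volts

dr_above = dynamic_ratio(w_above(0.0, OVERDRIVE), w_above(DELTA_VT_FD, OVERDRIVE))
dr_sub = dynamic_ratio(w_sub(0.0, N_T), w_sub(DELTA_VT_FD, N_T))

print(f"dr_w(above) = {dr_above:.2f}")
print(f"dr_w(sub)   = {dr_sub:.3g}")
```

With the same threshold shift in both regimes, the above-threshold ratio stays a modest rational function of the shift, while the sub-threshold ratio grows exponentially with it.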
Note that, as another advantage of the sub-threshold regime, the static power consumption at \({\text{V}}_{\text{GS}} \, = \, {\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) (\({\text{P}}_{\text{static}}^{ \, {\text{read}}}\)) is expected to be relatively low because the current level in this region is lower than in the above-threshold regime. For a fixed VI, the \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\) can be written as follows33,
\[{\text{P}}_{\text{static}}^{ \, {\text{read}}} = {\text{I}}_{\text{O}} \, {\text{V}}_{\text{I}} \quad (19)\]
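The maximum value of the \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\), denoted \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\)(max), can be expressed as the product of \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\) and VI, as follows,
\[{\text{P}}_{\text{static}}^{ \, {\text{read}}}({\text{max}}) = {\text{I}}_{\text{O}}^{ \, {\text{ff}}} \, {\text{V}}_{\text{I}} \quad (20)\]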
Based on the theory with respect to the Syn-TFT, since the static and pulsed characteristics are expected to be different for two operating regions, these characteristics of the fabricated Syn-TFT need to be monitored experimentally.
Results and discussion
Based on the theoretical analysis in the previous section, the IGZO TFT is expected to show synaptic behavior in the two operating regions because of the trap states in its defective oxide (e.g. SiO2), and thus to work as the Syn-TFT. The fabrication process of the Syn-TFT is given in the following “Materials and fabrication process of the Syn-TFT”27. In addition, to verify the memory behavior, experimental results for the static and pulsed characteristics of the fabricated Syn-TFT with the IGZO channel layer are described in the following “Selection of the gate read-voltage to determine the operating region” and “Pulsed characteristics of the Syn-TFT”, respectively. For the static characteristics in “Selection of the gate read-voltage to determine the operating region”, the operating region is determined by the level of the \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) relative to the VT. In “Pulsed characteristics of the Syn-TFT”, the synaptic behavior, such as the weight updates, is analyzed in terms of the drw for the two operating regions. Here, the measurements are performed with a standard semiconductor analyzer (Keithley™ 4200A). As another aspect of the weight update characteristics, the weight linearity is analyzed for the two operating regimes. To check this, plots of \(\Delta\overline{{\text{w} }_{\text{G}}}\) versus \(\overline{{\text{w} }_{\text{G}}}\) are shown in the following “Weight linearity of the Syn-TFT”.
In addition, when an array of Syn-TFTs is used as an analog accelerator (AA), the performance (e.g. the classification accuracy (CA)) of the AA is expected to depend on the operating region. For this, a comparison of the CA obtained by training on a classification task is presented in “Analog accelerator based on the Syn-TFT array”. Here, the CA is evaluated with an AA simulation based on the weight updates measured with the fabricated Syn-TFT, using a resistive memory simulator called CrossSim™34,35,36,37.
Materials and fabrication process of the Syn-TFT
The fabrication process of the Syn-TFT is as follows. On a glass wafer, molybdenum (Mo) is deposited by RF sputtering for the gate electrode, assuming that the work function of Mo is 5 eV27. This is followed by wet etching to pattern the gate electrode. SiO2 is then deposited by plasma-enhanced chemical vapor deposition (PECVD). A total film thickness of 350 nm is used as the gate insulator, in which defects (e.g. trap states) arise because of the low-temperature process. The channel layer of the Syn-TFT is a 50 nm-thick IGZO film deposited by RF sputtering from an IGZO ceramic target and subsequently patterned by reactive ion etching (RIE). In the sputter deposition, the oxygen partial pressure ratio relative to Ar (i.e. O2/(O2 + Ar)) is 15% to realize the Syn-TFT structure, and the samples are annealed at 250 ℃. An etch-stop layer is then formed using SiOx to protect the IGZO channel layer. This is followed by dry etching to define the source and drain electrodes. For the final metallization, 150 nm-thick Mo is deposited by RF sputtering and patterned by wet etching. Finally, the device is passivated with 200 nm-thick SiNx/SiOx, followed by another thermal annealing at 250 ℃ for all the samples.
Selection of the gate read-voltage to determine the operating region
To examine the characteristics of the weight updates in each operating region (i.e. the above-threshold and sub-threshold regions), a proper \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) must be selected. For this, the static characteristics of the fabricated Syn-TFT are first measured under both conditions, \({\text{V}}_{\text{GS}}^{ \, {\text{read}}} > \, {\text{V}}_{\text{T}}\) and \({\text{V}}_{\text{GS}}^{ \, {\text{read}}} \, {<} \, {\text{V}}_{\text{T}}\). Figure 3a,b show the transfer characteristics of the fabricated Syn-TFT at the two extreme states (i.e. the FF and FD) for the above-threshold and sub-threshold regions, respectively. As can be seen, the \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) is set to 3.5 V in the above-threshold regime and to 1.8 V in the sub-threshold regime. Note that, in the above-threshold case, the device can enter the sub-threshold regime at a given \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) as the VT shifts under multiple positive programming pulses, so the \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) must be chosen to satisfy the condition \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) > VT0 + \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\). For the FF state (i.e. VT = VT0) with the \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) selected for each operating region, the \({\text{I}}_{\text{O}}^{\text{ ff}}({\text{above}})\) and \({\text{I}}_{\text{O}}^{\text{ ff}}({\text{sub}})\) at VGS = \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) are approximately 100 nA and 3.5 nA, respectively.
From this state, after 100 cycles of positive programming pulses are applied to the gate terminal, the IO at VGS = \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) in each operating region reaches the FD state with \(\Delta{\text{V}}_{\text{T}}\) = \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\) because of the accumulated electron trapping (see Fig. 1c), resulting in \({\text{I}}_{\text{O}}^{\text{ fd}}({\text{above}})\) = 51 nA and \({\text{I}}_{\text{O}}^{\text{ fd}}({\text{sub}})\) = 0.06 nA, respectively. Here, the ratio between the \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}({\text{sub}})\) and \({\text{I}}_{\text{O}}^{\text{ fd}}({\text{sub}})\) is found to be larger than the ratio between the \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}({\text{above}})\) and \({\text{I}}_{\text{O}}^{\text{ fd}}({\text{above}})\), as shown in Fig. 3a,b. This is because the sub-threshold current follows an exponential function, which can be explained with Eqs. (4) to (7). In the above-threshold regime, on the other hand, the current equation is a linear function (see Eq. 2), which gives a relatively small ratio between the \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}\) and \({\text{I}}_{\text{O}}^{ \, {\text{fd}}}\). Note that the \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\) is found to differ between the two operating regions: \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\)(above) = 0.46 V in the above-threshold regime and \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\)(sub) = 0.58 V in the sub-threshold regime. This can be explained with a sensitivity (s), which can be written as follows38,
where \(\Delta{\text{n}}_{\text{e}}\) is the negative charge density trapped in or de-trapped from the defective gate oxide and \({\text{n}}_{\text{e}}^{ \, {\text{read}}}\) is the negative charge density in the In–Ga–Zn–O film at \({\text{V}}_{\text{GS}} \, = \, {\text{V}}_{\text{GS}}^{ \, {\text{read}}}\). Here, since the \({\text{n}}_{\text{e}}^{ \, {\text{read}}}\) of the sub-threshold regime is smaller than that of the above-threshold regime and the \(\Delta{\text{n}}_{\text{e}}\) of the sub-threshold regime can be larger than that of the above-threshold regime, the s of the sub-threshold regime can be larger than the s of the above-threshold regime, and thus \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\)(sub) > \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\)(above).
Based on the selected \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) in respective operating regions, the detailed synaptic processes (i.e. the synaptic depression and facilitation) need to be monitored to verify the characteristics of the weight updates. Therefore, both the detailed pulse-specification and respective pulsed characteristics of the fabricated Syn-TFT are explained in the following section.
Pulsed characteristics of the Syn-TFT
With the appropriate \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) for each operating region selected from the static characteristics of the fabricated Syn-TFT, the pulsed characteristics of the device are measured next. Figure 4a,b show the pulse specification for the synaptic depression and facilitation in the respective operating regions. For the above-threshold regime, the \({\text{V}}_{\text{GS}}^{ \, {\text{prog}}}\) for the synaptic depression and facilitation is 8.5 V and −6.5 V, respectively, with \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) = 3.5 V, as seen in Fig. 4a. For the sub-threshold regime (see Fig. 4b), the \({\text{V}}_{\text{GS}}^{ \, {\text{prog}}}\) is set to 6.8 V and −8.2 V for the synaptic depression and facilitation, respectively, with \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\) = 1.8 V. Note that the pulse height (i.e. the difference between the \({\text{V}}_{\text{GS}}^{ \, {\text{prog}}}\) and \({\text{V}}_{\text{GS}}^{ \, {\text{read}}}\)) for each synaptic process is the same in both operating regions, namely 5 V for the synaptic depression and −10 V for the synaptic facilitation. In addition, the pulse width in both operating regions is set to 3.08 s for the synaptic depression and 37.76 s for the synaptic facilitation. The other specifications of the programming pulses are also common to the two operating regions: the number of cycles and the duty cycle for each synaptic process are 100 cycles and 50%, respectively.
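The statement that the pulse height is common to both operating regions can be checked directly from the listed voltages. The short sketch below uses only the programming and read voltages given above; the dictionary layout and function name are illustrative choices.

```python
import math

# Programming and read voltages as listed in the text, in volts.
PULSES = {
    "above": {"read": 3.5, "depression": 8.5, "facilitation": -6.5},
    "sub": {"read": 1.8, "depression": 6.8, "facilitation": -8.2},
}


def pulse_heights(spec):
    """Pulse height = programming voltage minus read voltage, per process."""
    return {
        process: spec[process] - spec["read"]
        for process in ("depression", "facilitation")
    }


heights = {regime: pulse_heights(spec) for regime, spec in PULSES.items()}

for regime, h in heights.items():
    print(f"{regime}: depression {h['depression']:+.1f} V, "
          f"facilitation {h['facilitation']:+.1f} V")
```

Both regimes give +5 V for the depression and −10 V for the facilitation, so any difference in the measured weight updates comes from the read condition rather than from the programming stress.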
With the waveforms of these programming pulses, as seen in Fig. 4a,b, the dynamics of the IO are first measured for the synaptic depression and facilitation. Figure 4c,d show how the IO varies during the synaptic processes in the two operating regimes. As can be seen, the IO gradually decreases with a series of positive programming pulses during the synaptic depression. This is because the \(\Delta{\text{V}}_{\text{T}}\) is increased by electrons trapped in the gate oxide (see Fig. 1c), which increases the VT. This can be explained with Eqs. (2) and (3). After 100 cycles of positive programming pulses for the synaptic depression, the IO reaches the FD with \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\)(above) = 0.46 V and \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\)(sub) = 0.58 V, respectively. As a result, the \({\text{I}}_{\text{O}}^{\text{ fd}}({\text{above}})\) and \({\text{I}}_{\text{O}}^{\text{ fd}}({\text{sub}})\) are approximately 41 nA and 0.06 nA, respectively, as indicated in Fig. 4c,d. During the synaptic facilitation, the IO gradually increases as negative programming pulses are applied because the \(\Delta{\text{V}}_{\text{T}}\) is reduced by electrons de-trapped from the gate oxide (see Fig. 1d), which results in the recovery of the VT. This can also be explained with Eqs. (2) and (3). The IO then returns almost to the \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}({\text{above}})\) of 91 nA and the \({\text{I}}_{\text{O}}^{ \, {\text{ff}}}({\text{sub}})\) of 2.8 nA, respectively, which corresponds to the FF (see Fig. 4c,d).
Based on the measured IO, the characteristics of the weight updates can be examined. Figure 4e,f show the behavior of the \(\overline{{\text{w} }_{\text{G}}}\) during the synaptic depression and facilitation in the two operating regions. During the synaptic depression, the \(\overline{{\text{w} }_{\text{G}}}\) decays as positive programming pulses are applied because the increased \(\Delta{\text{V}}_{\text{T}}\) decreases the IO. This decay can also be explained with Eqs. (12) and (13). At the FD, the \(\overline{{\text{w} }_{\text{G}}}\) in the two operating regimes finally approaches \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}\)(above) = 0.45 and \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}\)(sub) = 0.021, respectively. This is because the IO reaches the FD state, which is consistent with Eq. (15). After the 100 cycles of positive programming pulses, the \(\overline{{\text{w} }_{\text{G}}}\) is facilitated by a series of negative programming pulses and approaches \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}}\)(above) \(\cong\) 1 and \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}}\)(sub) \(\cong\) 1, respectively. This is because the \(\Delta{\text{V}}_{\text{T}}\) gradually decreases, which increases the IO and can also be explained with Eqs. (12) and (13). In addition, the \(\overline{{\text{w} }_{\text{G}}}\) is found to increase linearly during the synaptic facilitation. This can be explained with both the IO as a function of the ΔVT and the ΔVT as a function of the number of programming pulses, as described in Fig. S2 of the Supplementary Information.
With the extracted \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}\) and \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}}\), the drw is calculated using Eq. (16) as approximately 2.2 in the above-threshold regime and 47 in the sub-threshold regime. As expected, the drw(sub) is much larger than the drw(above). This is because the exponential function of the sub-threshold regime (see Eq. 18) is more sensitive to the threshold-voltage shift than the rational function of the above-threshold regime (see Eq. 17), while the \(\Delta{\text{V}}_{\text{T}}^{ \, {\text{fd}}}\) of each operating region is comparable (see Fig. 3). Accordingly, it is verified that the drw is much larger in the sub-threshold regime than in the above-threshold regime. Note that, since drw(sub) > drw(above) for the same pulse width in each synaptic process, the weight updates in the sub-threshold regime can be regarded as faster than those in the above-threshold regime in the sense that a larger relative weight change is obtained with the same pulses.
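The reported values can be reproduced from the pulsed currents with Eqs. (15) and (16). The sketch below takes the FF and FD currents quoted for Fig. 4c,d (91 nA and 41 nA above threshold, 2.8 nA and 0.06 nA below threshold) and computes the normalized FD weight and the dynamic ratio; the variable and function names are illustrative.

```python
# Pulsed output currents quoted in the text for Fig. 4c,d, in amperes.
I_FF = {"above": 91e-9, "sub": 2.8e-9}
I_FD = {"above": 41e-9, "sub": 0.06e-9}


def normalized_fd_weight(i_ff, i_fd):
    """Normalized weight at full depression: I_O^fd / I_O^ff (Eq. 15)."""
    return i_fd / i_ff


def dynamic_ratio(i_ff, i_fd):
    """Dynamic ratio: I_O^ff / I_O^fd, since the FF weight is unity (Eq. 16)."""
    return i_ff / i_fd


w_fd = {r: normalized_fd_weight(I_FF[r], I_FD[r]) for r in I_FF}
dr_w = {r: dynamic_ratio(I_FF[r], I_FD[r]) for r in I_FF}

for regime in ("above", "sub"):
    print(f"{regime}: w_fd = {w_fd[regime]:.3f}, dr_w = {dr_w[regime]:.1f}")
```

The results round to a normalized FD weight of 0.45 and 0.021 and to a dynamic ratio of about 2.2 and 47 for the above-threshold and sub-threshold regimes, respectively, in agreement with the values stated in the text.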
Additionally, as another aspect of the pulsed characteristics, the \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\) can be estimated during the synaptic processes for the two operating regimes, as seen in Fig. 4g,h. The \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\) in each operating region decays during the synaptic depression because the IO is reduced as positive programming pulses are applied, which can be explained with Eq. (19). After the synaptic depression, when a series of negative programming pulses is applied, the \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\) gradually increases during the synaptic facilitation because of the gradual increase of the IO, which is consistent with Eq. (19). At the FF, the \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\) finally reaches the \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\)(max) of 9.1 nW in the above-threshold regime and 0.28 nW in the sub-threshold regime, which can be calculated with Eq. (20). As can be seen, the \({\text{P}}_{\text{static}}^{ \, {\text{read}}}\)(max) in the sub-threshold regime is more than one order of magnitude (about 30 times) smaller than that in the above-threshold regime because the operating current in the sub-threshold regime is much lower, as indicated in Fig. 3a,b.
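The maximum static read power follows from Eq. (20) as the product of the FF current and the input voltage. The sketch below assumes VI = 0.1 V, a value that is not stated explicitly in the text but is inferred here from the reported 9.1 nW at 91 nA; the FF currents are the pulsed values quoted for Fig. 4c,d.

```python
# Input (drain) voltage in volts. ASSUMPTION: inferred from 9.1 nW / 91 nA,
# not stated explicitly in the text.
V_I = 0.1

# Full-facilitation output currents quoted for Fig. 4c,d, in amperes.
I_FF = {"above": 91e-9, "sub": 2.8e-9}


def max_static_read_power(i_ff, v_i):
    """Maximum static read power: I_O^ff * V_I (Eq. 20)."""
    return i_ff * v_i


p_max = {regime: max_static_read_power(i, V_I) for regime, i in I_FF.items()}
ratio = p_max["above"] / p_max["sub"]

for regime, p in p_max.items():
    print(f"{regime}: P_max = {p * 1e9:.2f} nW")
print(f"above/sub power ratio = {ratio:.1f}")
```

Under this assumption the sketch gives 9.1 nW and 0.28 nW, matching the reported maxima, and a power ratio of 32.5 between the two regimes.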
Weight linearity of the Syn-TFT
As another observation for the characteristics of weight updates (e.g. the drw), the weight linearity can be checked for two operating regions, using the plots of \(\Delta\overline{{\text{w} }_{\text{G}}}\) versus \(\overline{{\text{w} }_{\text{G}}}\) based on the extracted data in Fig. 4e,f. Here, the \(\Delta\overline{{\text{w} }_{\text{G}}}\) is represented as a difference between the \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{n}}}}\) and \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{n-1}}}}\) where the \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{n}}}}\) is the \(\overline{{\text{w} }_{\text{G}}}\) after applying the nth programming pulses and \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{n-1}}}}\) is the \(\overline{{\text{w} }_{\text{G}}}\) before applying the nth programming pulse. Figure 5 describes the weight updates of the Syn-TFT with a color map of the cumulative distribution function (CDF) for the synaptic depression and facilitation of those operating regions, respectively, applying the noise components (e.g. the noise power spectral density (NPSD)). Here, the CDF distribution presents a probabilistic variation of the \(\Delta\overline{{\text{w} }_{\text{G}}}\) with respect to a synaptic state. In addition, with the plots of \(\Delta\overline{{\text{w} }_{\text{G}}}\) versus \(\overline{{\text{w} }_{\text{G}}}\), the linear range in the weight update is estimated with 30% variation of the \(\Delta\overline{{\text{w} }_{\text{G}}}\). As indicated in Fig. 5a,b, it is shown that the linear range of the weight for the synaptic depression (i.e. \({\mathcal{L}}_{d}\)) in the above-threshold and sub-threshold regimes is 0.615 and 0.562, respectively. Moreover, the linear range of the weight for the synaptic facilitation (i.e. \({\mathcal{L}}_{f}\)) is found to be 0.637 in the above-threshold regime and 0.966 in the sub-threshold regime, respectively (see Fig. 5c,d). 
With both the extracted \({\mathcal{L}}_{d}\) and \({\mathcal{L}}_{f}\), the arithmetic mean of the linear range of the weight (i.e. \({\mathcal{L}}_{\text{avg}}=({\mathcal{L}}_{d}+{\mathcal{L}}_{f}) / 2\)) over the total synaptic processes is calculated as 0.626 in the above-threshold regime and 0.764 in the sub-threshold regime, indicating a better linearity with a wider linear range of the weight in the sub-threshold regime. This may be because a large drw, as the ratio between the \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{fd}}}}\) and \(\overline{{\text{w} }_{\text{G}}^{ \, {\text{ff}}}}\), leads to a wide linear range of the weight. In addition, this result is consistent with the linear increase of \(\overline{{\text{w} }_{\text{G}}}\) for the synaptic facilitation in the sub-threshold regime, as shown in Fig. 4f. Note that the physical interpretation of the weight linearity is explained in detail with the IO versus \(\Delta\) VT and \(\Delta\) VT versus time (t) plots for the two operating regions in Supplementary Information (see Fig. S1a–d in Section S1).
Therefore, it is verified that the weight linearity in the sub-threshold regime is better than that in the above-threshold regime over the overall synaptic processes (i.e. the synaptic depression and facilitation), achieving a wide linear range of the weight.
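The averaging step can be reproduced directly from the reported linear ranges. This is a small sketch in which the numerical values are those quoted in the text, and the function name `mean_linear_range` is a hypothetical helper.

```python
def mean_linear_range(l_d, l_f):
    """Arithmetic mean of the depression and facilitation linear ranges."""
    return (l_d + l_f) / 2.0

# Linear ranges reported in the text for each operating region.
l_avg_above = mean_linear_range(0.615, 0.637)  # above-threshold regime
l_avg_sub = mean_linear_range(0.562, 0.966)    # sub-threshold regime

print(f"above-threshold: {l_avg_above:.3f}")
print(f"sub-threshold:   {l_avg_sub:.3f}")
```

The sub-threshold average is larger mainly because of its much wider facilitation range, even though its depression range is slightly narrower than that of the above-threshold regime.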
Analog accelerator based on the Syn-TFT Array
As shown in the previous sections, at the device level, the drw(sub) is found to be larger than the drw(above), with a relatively lower power consumption in the sub-threshold regime. It is also found that the linear range of the weight in the sub-threshold regime is wider than that in the above-threshold regime. These results are summarized in Table 1 for the two operating regions.
Given the results in Table 1, the performance (e.g. the CA) of the AA based on the Syn-TFT array is expected to differ between the two operating regions at the array level. To check this, the CA obtained from a classification task is monitored. Note that handwritten digits in the Modified National Institute of Standards and Technology (MNIST) database are used for the classification task of the AA, as shown in Fig. 6a. To evaluate the performance of the AA based on the Syn-TFT array for an artificial neural network (ANN), 60,000 training examples and 10,000 test examples are used (see Fig. 6b)35. In addition, the network size, written as the size of each layer in the ANN (i.e. the input, hidden, and output layers), is 784 × 300 × 10 for the MNIST classification. Note that, in order to isolate the effects of the drw and linearity on the AA performance in the two operating regions, the signal-to-noise ratio (SNR) is intentionally kept the same for each operating region. Accordingly, the SNR is set equally in the simulation, with noise components of 10% of the signal considered in both cases39. As a result of the classification task with this AA, the CA over 40 epochs is shown in Fig. 7 for the two operating regions. As can be seen, the CA of the sub-threshold regime is higher than that of the above-threshold regime at each epoch, reaching a maximum value of 87.51%. This can be explained with the drw and \({\mathcal{L}}_{\text{avg}}\) of the two operating regions. The drw of the sub-threshold regime is larger than that of the above-threshold regime, as shown in Table 1. Moreover, the \({\mathcal{L}}_{\text{avg}}\) of the sub-threshold regime is larger than that of the above-threshold regime (see Table 1), indicating a relatively good weight linearity in the sub-threshold regime.
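For a sense of scale, the number of synaptic weights implied by a 784 × 300 × 10 network can be counted as follows. This is only a sketch assuming fully connected layers without bias terms, which the text does not state explicitly. The helper name `synapse_counts` is hypothetical.

```python
# Layer sizes quoted in the text: input, hidden, and output layers.
layer_sizes = [784, 300, 10]

def synapse_counts(sizes):
    """Number of weights between each pair of adjacent layers.

    ASSUMPTION: fully connected layers, no bias terms.
    """
    return [n_in * n_out for n_in, n_out in zip(sizes[:-1], sizes[1:])]

counts = synapse_counts(layer_sizes)
total = sum(counts)

print(counts)  # weights in the input-hidden and hidden-output arrays
print(total)   # total number of synaptic weights
```

Under these assumptions, the input-to-hidden array dominates the total, so the device-level weight-update behavior matters most in that array.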
Here, since the SNR, which is proportional to the CA, is intentionally set the same for both operating regions, other parameters, such as the dynamic ratio and weight linearity, are expected to mainly determine the CA. Note that a comparison of efficiency metrics between the proposed Syn-TFT and existing similar synaptic devices is given in Supplementary Information (see Section S3)40,41,42,43.
Consequently, these results indicate that the Syn-TFT operated in the sub-threshold regime can achieve a relatively large drw(sub) while showing a relatively wide linear range of the weight. Given these device-level results, when the Syn-TFT array is used as the AA, the CA of the sub-threshold regime is also found to be higher than that of the above-threshold regime, owing to the device-level advantages of the Syn-TFT operated in the sub-threshold region. Therefore, it is believed that selecting the sub-threshold regime of the Syn-TFT is essential for a high performance of the ANN, since it provides a large drw and good weight linearity.
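One way to see why the dynamic ratio matters at the array level is to look at how much of a normalized weight axis it leaves available. The sketch below assumes that the weight is normalized to its maximum, so that the smallest reachable weight is 1/drw. This normalization is an illustrative assumption rather than the mapping used in the simulation. The dynamic ratios (≈ 2.2 and ≈ 47) are the values reported in this article, and `usable_weight_fraction` is a hypothetical helper.

```python
def usable_weight_fraction(dynamic_ratio):
    """Reachable fraction of the normalized weight axis.

    ASSUMPTION: weights are normalized so that w_max = 1 and
    w_min = 1 / dynamic_ratio.
    """
    if dynamic_ratio <= 1.0:
        raise ValueError("dynamic ratio must be greater than 1")
    return 1.0 - 1.0 / dynamic_ratio

# Dynamic ratios reported in the article for each operating region.
for label, drw in (("above-threshold", 2.2), ("sub-threshold", 47.0)):
    fraction = usable_weight_fraction(drw)
    print(f"{label}: drw = {drw}, usable fraction = {fraction:.3f}")
```

With a dynamic ratio near 2, almost half of the normalized axis cannot be reached, whereas a ratio near 47 leaves nearly the full axis available.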
Conclusion
In this article, an experimental study on the characteristics of operating region-dependent weight updates in the Syn-TFT with an amorphous IGZO channel layer has been presented. To implement a memory function in the IGZO TFT, a defective oxide, such as SiO2, has been intentionally used for charge trapping and de-trapping in response to programming pulses applied to the gate terminal. Based on this memory function, the conductance of the Syn-TFT is changed depending on the programming pulses, which results in weight updates. The weight update characteristics have been analyzed in terms of the dynamic ratio for the above-threshold and sub-threshold regimes. Here, the operating region has been selected depending on the level of the gate read-voltage relative to the threshold voltage of the Syn-TFT. In order to verify these, the static and pulsed characteristics of the fabricated Syn-TFT have been experimentally monitored with a standard semiconductor analyzer. From the experimental results, it has been shown that the dynamic ratio of the sub-threshold regime (\(\approx\) 47) is larger than that of the above-threshold regime (\(\approx\) 2.2), and that the weight linearity in the sub-threshold regime is better than that in the above-threshold regime. Since either the dynamic ratio or the weight linearity is expected to affect the performance (e.g. the CA) of the AA based on the Syn-TFT array, an AA simulation has been performed with a crossbar simulator to check this. From the simulation result, it has been found that the CA of the sub-threshold regime is higher than that of the above-threshold regime at every epoch, reaching a maximum value of 87.51% owing to the device-level advantages of the Syn-TFT operated in the sub-threshold region (e.g. a relatively large dynamic ratio and linear range).
Consequently, the proposed Syn-TFT device, which shows both a large dynamic ratio and good linearity in the sub-threshold regime, is expected to be essential for neuromorphic systems.
Data availability
All data generated or analyzed during this study are included in this published article.
References
Indiveri, G. & Liu, S. C. Memory and information processing in neuromorphic systems. Proc. IEEE 103(8), 1379–1397 (2015).
Upadhyay, N. K. et al. Emerging memory devices for neuromorphic computing. Adv. Mater. Technol. 4(4), 1800589 (2019).
Li, H. et al. A light-stimulated synaptic transistor with synaptic plasticity and memory functions based on InGaZnOx–Al2O3 thin film structure. J. Appl. Phys. 119(24), 244505 (2016).
Kang, Y., Jang, J., Cha, D. & Lee, S. Synaptic weight evolution and charge trapping mechanisms in a synaptic pass-transistor operation with a direct potential output. IEEE Trans. Neural Netw. Learn. Syst. 32(10), 4728–4741 (2021).
Yu, S. et al. An electronic synapse device based on metal oxide resistive switching memory for neuromorphic computation. IEEE Trans. Electron Dev. 58(8), 2729–2737 (2011).
Dai, S. et al. Light-stimulated synaptic devices utilizing interfacial effect of organic field-effect transistors. ACS Appl. Mater. Interfaces. 10(25), 21472–21480 (2018).
Park, Y. & Lee, J. Artificial synapses with short- and long-term memory for spiking neural networks based on renewable materials. ACS Nano 11(9), 8962–8969 (2017).
Yang, R. et al. Synaptic plasticity and memory functions achieved in a WO3−x-based nanoionics device by using the principle of atomic switch operation. Nanotechnology 24(38), 384003 (2013).
Xu, W. et al. Organic core-sheath nanowire artificial synapses with femtojoule energy consumption. Sci. Adv. 2(6), e1501326 (2016).
Eryilmaz, S. et al. Brain-like associative learning using a nanoscale non-volatile phase change synaptic device array. Front. Neurosci. 8, 205 (2014).
Park, S. et al. Effect of the gate dielectric layer of flexible InGaZnO synaptic thin-film transistors on learning behavior. ACS Appl. Electron. Mater. 3(9), 3972–3979 (2021).
Peng, C. et al. Photoelectric IGZO electric-double-layer transparent artificial synapses for emotional state simulation. ACS Appl. Electron. Mater. 1(11), 2406–2414 (2019).
Zhu, L. et al. Synergistic modulation of synaptic plasticity in IGZO-based photoelectric neuromorphic TFTs. IEEE Trans. Electron Dev. 68(4), 1659–1663 (2021).
Duan, N. et al. An electro-photo-sensitive synaptic transistor for edge neuromorphic visual systems. Nanoscale 11(38), 17590–17599 (2019).
Wu, Q. et al. Photoelectric plasticity in oxide thin film transistors with tunable synaptic functions. Adv. Electron. Mater. 4(12), 1800556 (2018).
Jang, J. et al. Thin-film optical devices based on transparent conducting oxides: Physical mechanisms and applications. Crystals 9(4), 192 (2019).
Bae, J., Jeong, I. & Lee, S. Wavelength-dependent optical instability mechanisms and decay kinetics in amorphous oxide thin-film devices. Sci. Rep. 9(1), 1–6 (2019).
Yang, P. et al. Synaptic transistor with a reversible and analog conductance modulation using a Pt/HfOx/n-IGZO memcapacitor. Nanotechnology 28(22), 225201 (2017).
Huang, W. et al. Memristive artificial synapses for neuromorphic computing. Nano-Micro Lett. 13(1), 1–28 (2021).
Kwon, S. et al. Environment-adaptable artificial visual perception behaviors using a light-adjustable optoelectronic neuromorphic device array. Adv. Mater. 31(52), 1906433 (2019).
Jang, Y. et al. Amorphous InGaZnO (a-IGZO) synaptic transistor for neuromorphic computing. ACS Appl. Electron. Mater. 4(4), 1427–1448 (2022).
Subramanian Periyal, S. et al. Halide perovskite quantum dots photosensitized-amorphous oxide transistors for multimodal synapses. Adv. Mater. Technol. 5(11), 2000514 (2020).
Park, Y., Kim, M. & Lee, J.-S. Artificial synaptic transistors based on Schottky barrier height modulation using reduced graphene oxides. Carbon 165, 455–460 (2020).
Beom, K. et al. Single- and double-gate synaptic transistor with TaOx gate insulator and IGZO channel layer. Nanotechnology 30(2), 025203 (2018).
Daus, A. et al. Ferroelectric-like charge trapping thin-film transistors and their evaluation as memories and synaptic devices. Adv. Electron. Mater. 3(12), 1700309 (2017).
Yang, P. et al. Synaptic behaviors of thin-film transistor with a Pt/HfOx/n-type indium–gallium–zinc oxide gate stack. Nanotechnology 29(29), 295201 (2018).
Lee, S. & Nathan, A. Subthreshold Schottky-barrier thin-film transistors with ultralow power and high intrinsic gain. Science 354(6310), 302–304 (2016).
Duan, H. et al. IGZO/CsPbBr3-nanoparticles/IGZO neuromorphic phototransistors and their optoelectronic coupling applications. ACS Appl. Mater. Interfaces. 13(25), 30165–30173 (2021).
Kiebel, S. J., Daunizeau, J. & Friston, K. J. A hierarchy of time-scales and the brain. PLoS Comput. Biol. 4(11), e1000209 (2008).
Kuzum, D., Yu, S. & Wong, H. P. Synaptic electronics: Materials, devices and applications. Nanotechnology 24(38), 382001 (2013).
Hoshino, K. et al. Constant-voltage-bias stress testing of a-IGZO thin-film transistors. IEEE Trans. Electron Dev. 56(7), 1365–1370 (2009).
Lee, S. Bias-dependent subthreshold characteristics and interface states in disordered semiconductor thin-film transistors. Semicond. Sci. Technol. 34(11), 1101 (2019).
Hayt, W., Kemmerly, J. & Durbin, S. Engineering Circuit Analysis (McGraw-Hill, 1978).
Agarwal, S. et al. CrossSim. http://crosssim.sandia.gov (2018).
Agarwal, S. et al. Resistive memory device requirements for a neural algorithm accelerator. in 2016 International Joint Conference on Neural Networks (IJCNN) (2016).
Cox, J., Conrad, D. J. & James, B. A. A signal processing approach for cyber data classification with deep neural networks. Proc. Comput. Sci. 61, 349–354 (2015).
Cha, D., Kang, Y., Lee, S. & Lee, S. A geometrical optimization rule of the synaptic pass-transistor for a low power analog accelerator. IEEE Access 10, 35120–35130 (2022).
Lee, S. A gate bias and temperature dependencies of contact resistances in amorphous oxide semiconductor thin-film transistors. IEEE Access 9, 165085–165089 (2021).
Razavi, B. Design of Analog CMOS Integrated Circuits (Tata McGraw-Hill Education, 2002).
Rao, J. et al. An electroforming-free, analog interface-type memristor based on a SrFeOx epitaxial heterojunction for neuromorphic computing. Mater. Today Phys. 18, 100392 (2021).
Tang, J. et al. ECRAM as scalable synaptic cell for high-speed, low-power neuromorphic computing. in 2018 IEEE International Electron Devices Meeting (IEDM) (2018).
Mohta, N. et al. An artificial synaptic transistor using an α-In2Se3 van der Waals ferroelectric channel for pattern recognition. RSC Adv. 11(58), 36901–36912 (2021).
Lee, Y. et al. IGZO synaptic thin-film transistors with embedded AlOx charge-trapping layers. Appl. Phys. Exp. 15(6), 061005 (2022).
Acknowledgements
We thank Donghyeok Park, Hyunjoon Jeon, Chanwoo Kim, Hyunjin Choi, Juhyun Hong and Shin Yerin for help with device testing. This research was supported by the National Research Foundation of Korea (NRF) grants funded by the Korea government (MSIT) (No. 2018R1C1B6001688, No. 2021R1A4A1027087).
Author information
Contributions
D.C. and Y.K. performed the measurement and characterization of the fabricated devices. D.C. carried out the synaptic analysis with Y.K. Y.K. drew the figures with D.C. All the authors discussed the results and wrote the manuscript together.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cha, D., Kang, Y. & Lee, S. Operating region-dependent characteristics of weight updates in synaptic In–Ga–Zn–O thin-film transistors. Sci Rep 12, 21441 (2022). https://doi.org/10.1038/s41598-022-26123-z