Introduction

In this era of big data, the amount of data to be processed has increased significantly. Data processing through the conventional von Neumann computing architecture, where the processor and memory units are physically separated, has caused significant latency bottlenecks and power consumption1. To address this challenge, brain-inspired neuromorphic systems using parallel-connected biological synapses have been proposed as new computing architectures to enhance computation efficiency2,3,4,5. To implement this architecture in hardware systems, it is important to emulate the functions of biological synapses using analog memory devices. Among the various candidates for synaptic devices, resistive switching random-access memory (RRAM) has mainly been explored because of its scalability6,7,8. The multiple resistance states of RRAM can be demonstrated by the precise control of the formation and rupture of local conductive filaments through materials engineering and novel programming schemes9. However, stochastic ion motion in the switching layer interferes with the filament, causing a nonuniform synaptic response with respect to the programming pulses10,11,12,13,14.

For uniform analog synaptic devices, ion-actuated transistors, which have also been referred to as electrochemical random-access memory or redox transistors and were inspired by the field-driven Li ion motion of rechargeable batteries, have recently been suggested15. Instead of the gate dielectric of a conventional transistor, an electrolyte that not only contains mobile ion sources but also allows ionic motion driven by the field can be utilized. As the number of ions attracted (or repulsed) to the channel layer increases (or decreases), the channel current (ID) between the source (S) and drain (D) is gradually tuned towards higher (or lower) levels as a function of the positive (or negative) programing pulses applied to the gate (G). Thus far, this analog switching behavior has been demonstrated in synaptic transistor unit cells for various combinations of electrolyte/channel material stacks and mobile ion species such as Li, H, O, and Cu ions15,16,17,18. Recent advances in synaptic transistors have been extended to array-level analyses19,20,21. Analog switching of some or all cells in arrays that can be updated simultaneously has been experimentally achieved using a half-bias scheme, where the drain voltage (VD) and gate voltage (VG) are used during programming. It is worth noting that the ID of the synaptic transistor should be as low as possible considering the energy consumption at the array level21. This is because, unlike usual transistors serving as switch elements, analog synaptic transistors are always on-channel without a threshold voltage required to allow current to flow in the channel. The magnitude of the ID of the synaptic transistor is tuned only by the VG. Assuming that only the farthest synaptic transistor in the array is programmed, unselected synaptic transistors located on a horizontal (or vertical) line applied to VG (or VD) generate unwanted gate (or drain) leakage currents, as shown in Fig. 1a. In particular, although extremely low gate currents (IGs) of several tens of picoamperes have been reported to date, the ID has mainly been several microamperes15,16,17,18,22. This implies that the magnitude of the ID is primarily related to the energy consumption in the synaptic array and its impact becomes significant as the array size increases, as shown in Fig. 1b.

Figure 1
figure 1

(a) Schematic illustration of leakage current sources in synaptic arrays. The half-bias scheme was used to program the farthest cell for simplicity. (b) Numerical calculations show the impact of leakage currents on energy consumption with respect to array size. Energy was calculated considering the leakage currents originated from unselected synaptic transistors, where either VG or VD was applied.

Therefore, we fabricated a CuOx/HfOx/WOx synaptic transistor array that exhibited analog switching in the range of tens of microamperes. Next, the effect of the WOx channel layer was investigated using the sputtering method and annealing conditions related to the stoichiometry and crystallinity of the material.

The Cu-ion-actuated synaptic transistor structure developed in our previous study was used in this study18. For the synaptic transistor array, S and D lines consisting of W were patterned on an Si/SiO2 wafer, as shown in Fig. 2a. Via-holes for the S/D contacts were defined after Si3N4 interlayer deposition. A WOx channel with a length (or width) of 20 (or 5) μm was deposited by sputtering with a WO3 single target. For comparison, a different channel layer was formed by reactive sputtering with a W metal target under ambient Ar and O plasma conditions. Additional annealing was performed at 450 °C using a rapid thermal annealing system to lower ID in the range of nanoamperes. Subsequently, an HfOx electrolyte and CuOx G with a length of 10 μm were deposited by sputtering using HfOx and Cu targets, respectively. Finally, a W capping layer was formed on top of the G and S/D contacts by sputtering. The thickness of each layer was evaluated by alpha step at the 3-D convergence center of Inha University. The fabricated CuOx/HfOx/WOx synaptic transistor was analyzed using transmission electron microscopy (TEM) and X-ray photoelectron spectroscopy (XPS) depth profiling, and each layer was clearly distinguished, as shown in Fig. 2b and c.

Figure 2
figure 2

(a) Fabricated CuOx/HfOx/WOx synaptic transistor arrays. ID of the synaptic transistor was updated by applying VG and grounding S. Updated ID was read by addressing voltage to D. (b) Cross-sectional TEM image and (c) XPS depth profiling of the synaptic transistor.

The electrical characteristics of the CuOx/HfOx/WOx unit synaptic transistor were evaluated by applying a VG of + 6 V (or − 5 V) with a pulse width of 100 ms to increase (or decrease) the ID, as shown in Fig. 3a. The tuned ID was sensed by applying a VD of 0.5 V. The observed switching behavior was attributed to the reversible driving of Cu ions back and forth through the HfOx layer. The effective thickness of HfOx was thinned or thickened, and the electric field-adjusted channel conductivity was enhanced or weakened23. As reported in a previous study, a limited number of Cu ions was provided by the CuOx G electrode, which could be effectively and precisely controlled by successive VG pulses to allow linear and symmetric synaptic behavior in the range of tens of microamperes18. Under the given programming conditions, we first analyzed the effect of the stoichiometry of the channel layer. A stoichiometric WO3 layer was formed by reactive sputtering with O gas injection, as shown in Fig. 3b. Although an extremely low ID at a level below tens of picoamperes was measured, no change in ID was observed as a function of VG pulses. When annealing was conducted at 450 °C and atmospheric pressure for 30 min, ID became higher and began to be analogously controlled by the VG pulses, as shown in Fig. 3c. The WO3 layer crystallizes at temperatures above 400 °C24. The polycrystalline WO3 layer, which was verified by the fast Fourier transform in the TEM analysis (inset of Fig. 3c), boosted the conductivity. Thus, most of the applied VG was transferred to the HfOx electrolyte to drive the Cu ions, leading to a progressive increase and decrease in ID for 200 consecutive VG pulses. However, when ambient N gas was applied during annealing, ID was increased to hundreds of microamperes, making the ID modulation negligible, as shown in Fig. 3d. To identify the impact of the annealing gas on analog switching, the W 4f peak at the WO3 channel interface of the two annealed devices, obtained through XPS analysis, was investigated, as shown in Fig. 3e. Two peaks at binding energies of approximately 31 and 34 eV, indicating metallic states related to nonstoichiometric WOx were clearly observed for both samples. However, two peaks corresponding to WO3 observed at 36.5 and 38 eV were noticeable for the annealed sample without N gas. This implies that N preferentially bonds to W. W–N bonding is usually represented by metallic peaks (approximately 31 and 34 eV); therefore, the enhanced channel conductivity can be attributed to the conductive WN formed.

Figure 3
figure 3

VG of + 6 V (or − 5 V) with a pulse width of 100 ms was used for all devices to evaluate device characteristics. (a) Analog synaptic behavior observed in the synaptic transistor employing amorphous WOx channel. (b) No switching was observed when relatively stoichiometric WO3 channel was used. (c) Crystallized WO3 channel, which was verified by fast Fourier transformation image, allowed the ID to be adjusted by pulse number. (d) Increased ID level due to N annealing hid the switching behavior. (e) W 4f peak obtained through XPS analysis.

Next, we studied the temperature dependence of the IG to identify the mechanisms of analog switching in the optimized gate stack. IG, which was obtained by applying voltage to G and grounding S, was measured by raising the temperature to 140 °C. As a result, the transport characteristics were suitably fitted using the Poole–Frenkel emission, as shown in Fig. 4a. This indicates that bulk switching across the entire electrolyte area was involved, rather than metallic conduction, owing to the locally clustered Cu ions. Specifically, by comparing simultaneously obtained IG and ID, we found that the IG response to VG pulses was related to ID modulation, as shown in Fig. 4b. That is, the gradual transition of IG by a single VG pulse during initial programming was typically projected onto the lateral ID, resulting in a linear ID response in working cell. However, at the given programming condition, nonlinear ID behavior was often observed in non-optimized stacks. When we examined the IG response of these failed cells, the abrupt increase and saturation of IG was observed. These results imply that the extent to which Cu ions move steadily through the electrolyte per VG pulse is related to the linearity and symmetry of ID.

Figure 4
figure 4

(a) Poole–Frenkel conduction of the gate stack. (b) Link between the IG response, which can be described by Cu ion motion over the electrolyte, and ID modulation.

Finally, we evaluated synaptic arrays employing the engineered WO3 channel. As the channel resistance was reduced, a larger electric field across the HfOx layer was used to drive the Cu ions; therefore, analog switching could be achieved by lowering VG to + 3 V. To facilitate fully parallel programming, all the G, S, and D pads of the 7 × 7 array were contacted by a probe card, as shown in Fig. 5a. In the half-bias scheme, the + 3 V (or − 3 V) required to program the unit cell was halved, and + 1.5 V (or − 1.5 V) was simultaneously applied to VG and VD. Consequently, a full voltage of approximately ± 3 V was applied to the selected cells. Reliable analog switching behavior was obtained for all synaptic transistors. The cell-to-cell update curves expressed as the median repeatedly increase and decrease, as the polarity of the pulse changes every 50 times for a total of 200 pulses, as shown in Fig. 5b. Specifically, the ID states updated by the first 50 potentiation pulses reached its maximum value and returned to the initial state by the next 50 depression pulses. As a result, the ID states applied at the first, 100th, and 200th pulses (or 50th and 150th) showed uniform cycle-to-cycle distribution, as shown in Fig. 5c. Note that although the reversible analog ID modulation from all cells was achieved, cell-to-cell uniformity in the array needed to be improved. We recently revealed that the effect of Cu ion distribution (or concentration) in the stack on the synaptic behavior through physics-based simulation25. Based on the clue, further study is currently underway on process development and strategies to ensure reliability as well as uniformity by controlling the adequate amount of Cu ions in the device.

Figure 5
figure 5

(a) Optical microscope image of fabricated synaptic transistor array contacted by probe card. To update all synaptic transistor cells simultaneously, 50 VG (or VD) pulses of + 1.5 V (or − 1.5 V) were consecutively addressed to all G (or D) lines to increase ID. By changing the polarity of each pulse, ID can be steadily lowered. (b) Reliable synaptic behaviors expressed as median were obtained from all cells. (c) Distributions of IDs when pulses were applied every 50 cycles from the initial state. (d) Pattern recognition accuracy of multilayer perceptron evaluated using IBM analog hardware acceleration kit simulator. The parameters used in this simulation were listed in the table.

Based on these results, we built multilayer neural networks with size 784-256-128-10 (from input to output neurons) designed for image recognition, as shown in Fig. 5d. The input signals were transferred from input neurons to output neurons through synaptic weights, which served as the optimized synaptic transistor in this study. The degree of linearity of synaptic characteristics is expressed by a linearity factor, α, which is calculated by the following equation:26

$${\text{G}} = { }\left\{ {\begin{array}{*{20}l} {\left( {\left( {G_{MAX}^{{\upalpha }} - G_{MIN}^{{\upalpha }} } \right) \times \omega + G_{MIN}^{{\upalpha }} } \right)^{{\frac{1}{{\upalpha }}}} } \hfill & { \left( {if \alpha \ne 0} \right)} \hfill \\ {G_{MIN} \times \left( {G_{MAX} /G_{MIN} } \right)^{\omega } } \hfill & {\left( {if \alpha = 0} \right)} \hfill \\ \end{array} } \right.,$$

where, GMAX and GMIN are conductance at the maximum and minimum ID state, respectively, and \(\omega\) is an internal variable which ranges from 0 to 1. Moreover, α is equal to 1 in the case of the ideal synaptic behavior. Based on these equations, linearity of potentiation (or depression) of 1.47 (or − 1.61) was achieved for the developed synaptic transistor. The difference between the maximum and minimum ID states of ~ 4 was also used in the simulations. Subsequently, the recognition accuracy on the Fashion-MNIST dataset was evaluated with a learning rate of 0.01 using the IBM analog hardware acceleration kit simulator by assuming that the achieved synaptic transistors served as weight elements26. Consequently, our device exhibited a recognition accuracy of approximately 90%, which is comparable to the baseline, when the multilevel states were almost linearly adjusted15,27.

Finally, through simple numerical analysis, we showed that update energy consumption is reduced by lowering ID of the synaptic transistors, which are assumed to be configured in 1 K array, as shown in Fig. 6. Compared to reported synaptic transistors that utilize the Li or O ions, the VG to drive the Cu ions seemed to be larger. Current has a greater impact on power than voltage because its magnitude can be changed significantly. To date, the energy consumed to change the ID state by a single VG pulse has been mainly discussed considering unit cell operation28. However, to sequentially update all cells in the array, the VG and VD signals were applied to the array via all interconnect lines. Thus, synaptic transistors with tunable ID in the low current regime enable low-power operation because the ID is typically higher than the IG. In addition to energy consumption, recent study showed that recognition accuracy is degraded due to interference from high IDs of neighboring cells29. These results mean that the range of ID, which has often been ignored, should be considered when designing a three-terminal synaptic transistor.

Figure 6
figure 6

From a unit cell perspective, synaptic transistors that exhibit low IG and are driven by low VG seem to be preferred for low-power application. However, considering update operation in the array, the ID range of the synaptic transistors contributes more to energy consumption.

In this work, we achieved analog switching in a low ID regime (< 100 nA) of a 7 × 7 CuOx/HfOx/WO3 synaptic transistor array using WO3 channel engineering. Specifically, we revealed that the ID range could be lowered or increased by utilizing a stoichiometric WO3 layer or by strengthening the crystallinity. In addition, the steady and gradual motion of the Cu ions across the gate stack enabled a linearly tuned ID response. The obtained synaptic behavior not only accurately inferred the Fashion-MNIST dataset but also performed recognition with low energy consumption.