Artificial van der Waals hybrid synapse and its application to acoustic pattern recognition

Seo, Seunghwan; Kang, Beom-Seok; Lee, Je-Jun; Ryu, Hyo-Jun; Kim, Sungjun; Kim, Hyeongjun; Oh, Seyong; Shim, Jaewoo; Heo, Keun; Oh, Saeroonter; Park, Jin-Hong

doi:10.1038/s41467-020-17849-3

Article
Open access
Published: 07 August 2020

Seunghwan Seo¹,
Beom-Seok Kang¹,
Je-Jun Lee¹,
Hyo-Jun Ryu^1,2,
Sungjun Kim^1,3,
Hyeongjun Kim¹,
Seyong Oh¹,
Jaewoo Shim ORCID: orcid.org/0000-0003-3496-5583⁴,
Keun Heo¹,
Saeroonter Oh ORCID: orcid.org/0000-0003-4281-6879⁵ &
…
Jin-Hong Park ORCID: orcid.org/0000-0001-8401-6920^1,6

Nature Communications volume 11, Article number: 3936 (2020)

Subjects

Electronic devices

Abstract

Brain-inspired parallel computing, which is typically performed using a hardware neural-network platform consisting of numerous artificial synapses, is a promising technology for effectively handling large amounts of informational data. However, the reported nonlinear and asymmetric conductance-update characteristics of artificial synapses prevent a hardware neural-network from delivering the same high-level training and inference accuracies as those delivered by a software neural-network. Here, we developed an artificial van-der-Waals hybrid synapse that features linear and symmetric conductance-update characteristics. Tungsten diselenide and molybdenum disulfide channels were used selectively to potentiate and depress conductance. Subsequently, via training and inference simulation, we demonstrated the feasibility of our hybrid synapse toward a hardware neural-network and also delivered high recognition rates that were comparable to those delivered using a software neural-network. This simulation involving the use of acoustic patterns was performed with a neural network that was theoretically formed with the characteristics of the hybrid synapses.

Introduction

It is predicted that a large amount of unstructured data in the upcoming Big Data era will not be processed efficiently via conventional serial computing technology based on the von Neumann architecture¹. Thus, a brain-inspired parallel computing technology suitable for dealing with such unstructured data was recently proposed^2,3. Brain-inspired computing is generally performed using a hardware neural-network (HW-NN) platform consisting of numerous artificial synapses^4,5. Therefore, considerable effort has been directed toward the implementation of artificial synapses mimicking the behavior of biological synapses, such as short-term plasticity and long-term plasticity^6,7. Synaptic devices based on various operating mechanisms and materials have been reported, including resistive random-access memory (RRAM), phase-change memory (PCM), field-effect transistors (FETs) with a ferroelectric or charge-trapping layer, electrochemical memory, and optoelectronic memory^{8,9,10,11,12,13,14,15,16,17,18,19,20}. However, it has not been demonstrated that an HW-NN composed of such synaptic devices can perform training and inference tasks with the same level of accuracy as a software-based neural-network (SW-NN). This is because such devices do not sufficiently satisfy the requirements for synaptic characteristics such as the cycle-to-cycle variation (CCV), device-to-device variation (DDV), retention time, endurance, number of conductance states, dynamic range, and linear/symmetrical conductance change^20,21,22,23. In particular, the linearity and symmetricity of the conductance change are known to significantly affect the inference accuracy after the training process of HW-NNs^24,25.

J. Woo et al. and S. Park et al. reported HfO_x RRAM and AlO_x/TiN PCMO-based synaptic devices, respectively, where the nonlinear and asymmetrical conductance change resulted in a low inference accuracy of <40% for the Modified National Institute of Standards and Technology (MNIST) dataset^26,27. For the HfO_x RRAM, an AlO_x barrier layer was introduced, which made its conductance change more linear, but this device inherently presented a low dynamic range and high CCV^18,26. For the hafnium-zirconium oxide FeFET-type synapse reported by M. Jerry et al., a highly symmetric conductance change was achieved, leading to a high inference accuracy of 90% for the MNIST dataset²⁸. However, in the FeFET synapse, non-identical spikes were applied for controlling the conductance state. This is because most of the reported synaptic devices operate on the basis of a physical mechanism that cannot change the conductance linearly with respect to the applied voltage. Meanwhile, E. J. Fuller et al. reported an ionic floating gate (IFG)-based synaptic device featuring a very linear conductance change that was achieved by a gradual composition modulation in the IFG²⁹. Recently, various studies have attempted to approach the training/inference accuracy of an SW-NN by designing synaptic unit cells with highly linear and symmetric conductance change characteristics^24,30,31. S. Kim et al. reported a very linear conductance change in their synaptic unit cell consisting of three transistors and one capacitor³⁰. Although the excellent linearity allowed an accurate training process, the volatility of the cell made inference based on the trained information difficult. S. Ambrogio et al. and X. Sun et al. applied nonvolatile memory elements, such as PCM and ferroelectric capacitors, to their synapse cells, yielding high training and inference accuracies simultaneously^24,31. However, such synapse cells require highly complex peripheral circuits for operation, as well as a large number of devices. Therefore, additional studies on artificial synapses should be performed to achieve the desirable synaptic characteristics required for high-performance HW-NNs.

Herein, we report an artificial vdW-hybrid synaptic device that features linear and symmetric conductance change characteristics. The excellent conductance controllability is accomplished by using tungsten diselenide (WSe₂) and molybdenum disulfide (MoS₂) hybrid channels, which are specialized for linear conductance potentiation and depression, respectively. We also discuss the CCV & DDV, relative standard deviation (RSD), endurance, symmetricity, and dynamic range for the long-term potentiation (LTP)/long-term depression (LTD) characteristic curves with respect to the conditions of weight control spikes. In particular, our synaptic device is investigated and compared with other devices reported heretofore, in terms of various synaptic characteristics mentioned above, weight updating energy, and active area (see Supplementary Table 1). Finally, we demonstrate the feasibility of the vdW-hybrid synaptic device for an HW-NN and present high recognition rates close to those for an SW-NN via training and inference simulation, where both our designed acoustic patterns and existing MNIST digit patterns are used.

Results

Artificial van der Waals hybrid synapse

Biological synapses are known to transmit spike signals from the presynaptic terminal to the postsynaptic terminal using neurotransmitters and to adjust their synaptic weights on the basis of the timing of the spike signals³². In this study, as shown in Fig. 1a, we implemented a vdW-hybrid synaptic device that successfully mimics the operation of biological synapses and presents excellent synaptic characteristics. This vdW-hybrid synaptic device features two signal paths for potentiation and depression operations, which are formed on WSe₂ (for hole transport)/hexagonal-boron nitride (h-BN) and MoS₂ (for electron transport)/h-BN heterostructures, respectively. Here, such vdW heterostructures are free from concerns about lattice mismatching owing to the dangling-bond-free surface nature of the vdW materials^33,34,35, thereby allowing the formation of an interfacial-defect-free floating-gate structure^36,37 or the modulation of the number of interfacial traps/dipoles for achieving the synaptic functionalities^12,18,38. The potentiation and depression channels are tied by two electrodes, which are defined as the presynaptic and postsynaptic terminals, and each of the two channels has an individual gate electrode functioning as a weight control terminal (WCT). Additional information regarding the vdW-hybrid synaptic device, such as an optical microscopy image, thickness profiles of the vdW materials confirmed via atomic force microscopy (AFM), and the Raman spectra of each vdW material, is provided in Supplementary Fig. 1. When a voltage spike (V_pre) is applied to the presynaptic terminal, a postsynaptic current (PSC) appears at the postsynaptic terminal, which is the sum of the PSCs of the potentiation (PSC_P) and depression (PSC_D) channels (PSC = PSC_P + PSC_D). This indicates that the conductance of the vdW-hybrid synaptic device (G) is identical to the sum of the conductance values of the potentiation (G_P) and depression (G_D) channels (G = G_P + G_D).
Therefore, as shown in Fig. 1b, the synaptic conductance of this device can be potentiated (G↑ = G_P↑ + G_D) or depressed (G↓ = G_P + G_D↓) by applying only a positive voltage spike (+V_WCT) to the corresponding WCT. The conductance of the WSe₂ and MoS₂ channels is modulated on the basis of the phenomenon that electrons are only trapped in the weight control layer (WCL) formed on h-BN under the positive voltage spike condition. This trapping-only scheme differs from the conventional transistor-type synapse, where both trapping and detrapping of electrons are used for potentiation and depression, and it allows highly symmetric operation of the synapse^12,39. For accurate operation of the vdW-hybrid synaptic device, two FET devices of opposite polarity with similar current levels need to be integrated. When voltage spikes with an amplitude of 1 V, a duration of 20 ms, and a frequency of 2 Hz were applied to the WCT for the potentiation channel four times consecutively, the conductance of the vdW-hybrid synaptic device increased in steps from 159 to 257 nS (potentiation operation). When the same voltage spikes were applied to the WCT for the depression channel, the conductance decreased in steps to 163 nS, which was similar to the initial conductance value (depression operation).
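The additive relation G = G_P + G_D and the spike sequence described above can be illustrated with a minimal sketch. The class `HybridSynapse`, the fixed step of 24.5 nS per spike, and the 30/129 nS split of the initial 159 nS between the two channels are all hypothetical choices made for illustration; the text reports only the total conductance, and the measured device returned to 163 nS rather than exactly to its starting value.

```python
from dataclasses import dataclass


@dataclass
class HybridSynapse:
    """Toy model of two parallel channels sharing the pre/postsynaptic terminals."""
    g_p: float          # potentiation (WSe2) channel conductance, nS
    g_d: float          # depression (MoS2) channel conductance, nS
    step: float = 24.5  # hypothetical conductance change per spike, nS

    @property
    def g(self) -> float:
        # Parallel channels add: G = G_P + G_D
        return self.g_p + self.g_d

    def potentiate(self) -> None:
        # Positive spike on the WCT of the p-type channel raises G_P
        self.g_p += self.step

    def depress(self) -> None:
        # Positive spike on the WCT of the n-type channel lowers G_D
        self.g_d -= self.step

    def psc(self, v_pre: float) -> float:
        # PSC = PSC_P + PSC_D = (G_P + G_D) * V_pre; nS times V gives nA
        return self.g * v_pre


synapse = HybridSynapse(g_p=30.0, g_d=129.0)  # hypothetical split of the 159 nS start value
for _ in range(4):
    synapse.potentiate()
g_after_potentiation = synapse.g              # 257.0 nS in this idealized model
for _ in range(4):
    synapse.depress()
g_after_depression = synapse.g                # back to 159.0 nS in this idealized model
```

In this idealized picture both operations use spikes of the same polarity, and the symmetry of the weight update depends only on how well the per-spike steps of the two channels match.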

**Fig. 1: vdW-hybrid synaptic device with excellent controllability of the conductance.**

Following the demonstration of the selective controllability of the conductance in the vdW-hybrid synaptic device, we compared its synaptic characteristics with those of a control synaptic device with a WSe₂ or MoS₂ channel. As shown in Fig. 1c, we confirmed the LTP/LTD characteristics in each synaptic device, where 128 excitatory and inhibitory voltage spikes were applied in a row to the WCTs of the devices. For the WSe₂ synaptic device, the conductance increased linearly and decreased nonlinearly (red curve). The conductance change of the MoS₂ synaptic device (blue curve) exhibited the opposite behavior. According to these results, the channels composed of WSe₂ and MoS₂ were specialized for potentiation and depression, respectively. Therefore, in the vdW-hybrid synaptic device that selectively exploits the specialized channels for the LTP/LTD, the conductance states are uniformly distributed (green curve). The measurement setup for the synaptic devices is described in detail in Supplementary Fig. 2. To evaluate the LTP/LTD characteristics quantitatively, we extracted the nonlinearity from the characteristic curves (β_P from the LTP curve and β_D from the LTD curve). The nonlinearity values (β_P/β_D) of the WSe₂ and MoS₂ synaptic devices were 2.3/21 and 8.6/2, respectively. For the vdW-hybrid synaptic device, the nonlinearity values were 1.9/1.9. The dynamic ranges of the WSe₂, MoS₂, and vdW-hybrid synaptic devices, which were defined as the difference between G_max and G_min (G_max – G_min), were 237, 210, and 191 nS, respectively. These nonlinearity values were analyzed and compared with the values of previously reported artificial synapses, as shown in Supplementary Fig. 3. We then calculated the symmetricity indicating the degree of symmetry between the LTP and LTD characteristic curves in Fig. 1d (top). Details regarding the calculation are provided in Supplementary Fig. 4. 
The symmetricity values of the WSe₂ and MoS₂ synaptic devices were approximately 2.13 and 3.82, respectively, and the symmetricity was improved to 13.26 for the vdW-hybrid channel. Here, there are flake-to-flake variations in the vdW channels in terms of defect density and doping concentration, which consequently affect the synaptic characteristics including the linearity and symmetricity of the LTP and LTD curves. A detailed analysis of this issue is provided in Supplementary Fig. 5. In addition, the nonlinearity and symmetricity values were investigated in multiple samples to confirm the DDV in the nonlinearity and symmetricity, as shown in Supplementary Fig. 6. Furthermore, we determined and compared the effective conductance-state ratios, as shown in Fig. 1d (bottom), because an insufficient conductance change (ΔG) in the LTP and LTD curves has no effect on the recognition rate in the HW-NN. The effective conductance-state ratio was defined as the ratio of the number of conductance states in which ΔG exceeded a certain percentage of G_max/G_min (threshold_ΔG) to the total number of conductance states. When threshold_ΔG was set as 0.3%, the WSe₂ and MoS₂ synaptic devices exhibited effective conductance-state ratios of 43.75% and 69.14%, respectively, and the vdW-hybrid synaptic device exhibited a relatively high ratio of 85.94%. As shown in Fig. 1e, we applied eight spikes in a row for potentiation and depression (four excitatory spikes and four inhibitory spikes) to the WCTs of the three devices. Consequently, the conductance of the WSe₂ and MoS₂ devices decreased (conductance variation σ = –32.9%) and increased (σ = +25.1%), respectively, compared with the initial values. For the hybrid synaptic device, a conductance similar to the initial value was observed under the same spike conditions (σ = +1.16%). Here, the conductance variation σ was calculated using the equation shown in Fig. 1e (bottom). Additional results measured under different combinations of eight spikes (e.g., two excitatory spikes, two inhibitory spikes, two excitatory spikes, and two inhibitory spikes) are provided in Supplementary Fig. 7.
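The effective conductance-state ratio defined above can be sketched as follows. The reference quantity for threshold_ΔG is not fully specified in the text, so this sketch assumes that the threshold is a percentage of the dynamic range (G_max − G_min); the function name and both test curves are hypothetical and are not fits to the measured devices.

```python
import math


def effective_state_ratio(conductances, threshold_pct=0.3):
    """Fraction of update steps whose |dG| exceeds threshold_pct percent of (G_max - G_min).

    Assumption: the threshold is referenced to the dynamic range of the curve.
    """
    g_max, g_min = max(conductances), min(conductances)
    threshold = (threshold_pct / 100.0) * (g_max - g_min)
    steps = [abs(b - a) for a, b in zip(conductances, conductances[1:])]
    effective = sum(1 for dg in steps if dg > threshold)
    return effective / len(steps)


# Hypothetical curves with 128 update steps each
linear_curve = [float(i) for i in range(129)]                              # uniform steps
saturating_curve = [1.0 - math.exp(-21.0 * i / 128) for i in range(129)]   # early saturation

ratio_linear = effective_state_ratio(linear_curve)
ratio_saturating = effective_state_ratio(saturating_curve)
```

Under this assumption a perfectly linear curve keeps every state effective, whereas a curve that saturates after a few spikes wastes most of its states on changes below the threshold.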

Analysis of vdW-hybrid synaptic device

The excellent conductance controllability of the vdW-hybrid synaptic device was due to the electron-trapping phenomenon in the WCL, which was formed by exposing the top surface of the h-BN to CF₄ plasma (details are presented in the Methods section). As shown in Fig. 2a, we examined the WCL via cross-sectional transmission electron microscopy (X-TEM). By performing CF₄ plasma treatment with a reactive ion etcher power of 10 W and a process time of 10 s, the WCL was created at depths of 11.1 and 10.5 nm from the surface underneath the potentiation and depression channels, respectively. The regions inside the yellow dotted line in Fig. 2a were analyzed via electron energy-loss spectroscopy (EELS) mapping, yielding atomic-composition information for each region. As shown in Fig. 2b and c, the WCL regions mainly exhibited signals corresponding to C (green), F (yellow), and B (weak signal, blue). As expected, signals corresponding to W (bright red), Se (bright green), Mo (purple), and S (cyan) were obtained in the regions of WSe₂ and MoS₂. The B (blue) and N (red) signals also appeared in the region of h-BN. As shown in Supplementary Figs. 8, 9, and 10, we analyzed the energy distribution of the WCL/h-BN via micro photoluminescence and micro X-ray photoelectron spectroscopy measurements^40,41,42. Additionally, we estimated the trap density in the WCL region containing C and F atoms, obtaining density values of 5.2 × 10¹⁷ and 5.8 × 10¹⁷ cm^–3 for the potentiation and depression channels, respectively. These trap-density values were on the same order as the number of electrons stored in the floating gate of a current flash memory cell, as discussed in Supplementary Fig. 11 (http://www.itrs2.net/). As depicted in Fig. 2d, we investigated carrier injection barrier heights and work functions for the WSe₂ (potentiation) and MoS₂ (depression) channels via the modified Richardson plotting method, based on a thermionic emission-diffusion model, and Kelvin probe force microscopy (KPFM) analysis, respectively (see also Supplementary Fig. 12 and Note 8)^43,44. A hole barrier height of 0.31 eV and a work function of 4.86 eV were estimated for the WSe₂ channel, where a platinum contact is formed between the pre/postsynaptic terminals and the channel. Meanwhile, an electron barrier height of 0.19 eV and a work function of 4.75 eV were obtained for the MoS₂ channel, where a titanium contact is formed between the pre/postsynaptic terminals and the channel. Therefore, the WSe₂ (potentiation) and MoS₂ (depression) channels were confirmed as p- and n-type channels, respectively. We then investigated the spike responses for the current flow through the potentiation and depression channels in detail. When an excitatory voltage spike was applied to the WCTs for the WSe₂ and MoS₂ channels, the PSCs flowing through the potentiation and depression channels increased from 20.2 to 25.6 nA and decreased from 21.1 to 16.1 nA, respectively, and the changed levels persisted for 10⁴ s, indicating that the trap states of the WCL have a high confinement energy (see Fig. 2d). This is because the electrons trapped in the WCL increased and decreased the number of holes and electrons in the WSe₂ and MoS₂ channels, respectively, decreasing and increasing the width of the tunneling barrier (W_TN) from the presynaptic terminal (T_pre) metal to the vdW channels, as shown in Fig. 2d (bottom). When an inhibitory spike was applied, as shown in Supplementary Fig. 13, the PSC decreased and increased in the potentiation and depression channels, respectively.
The operating energies for reading and updating a weight were approximately from 0.12 to 0.71 nJ (reading energy) and 0.79 pJ (updating energy in the potentiation channel)/0.93 pJ (updating energy in the depression channel), respectively, where a spike with an amplitude of 1 V and a duration of 10 ms was applied⁴⁵. The dissipated energy per event was determined by P = I × V × t_duration^12,16,18, and relevant details are provided in Supplementary Fig. 14. The PSC responses with respect to the amplitude and duration of the spike and with respect to the conditions of CF₄ plasma treatment were also investigated, as shown in Supplementary Fig. 15. As shown in Fig. 2e, when excitatory spikes with a 1-V amplitude, 20-ms duration, and 2-Hz frequency were applied consecutively to the WCTs underneath the WSe₂ and MoS₂ channels, the conductance linearly increased (Case 1) and decreased (Case 2), respectively. Under the excitatory-spike condition, the energy band of the semiconductor near the WCL was instantly bent downward, generating an electric field (E) that attracted electrons toward the WCL (see Fig. 2f). Simultaneously, the probability of trap states being filled increased because the Fermi level of the semiconductor was close to its conduction band edge. Therefore, the trap sites at the WCL were partially filled with electrons during each excitatory spike, which gradually changed the conductance of the potentiation and depression channels. In contrast, as shown in Fig. 2g, when the same inhibitory voltage spikes were applied in a row, the conductance nonlinearly decreased and increased at the WSe₂ and MoS₂ channels, respectively. The energy band of the semiconductor near the WCL was instantly bent upward when the inhibitory spike was applied, generating an electric field for electron detrapping (see Fig. 2h).
Additionally, because there were many empty states within the conduction band of the semiconductor, which were well aligned with the filled trap sites in the energy level, most of the trapped electrons were released from the WCL during the initial few inhibitory spikes. Consequently, when the inhibitory spikes were applied in a row after the excitatory spikes, the conductance rapidly changed at the beginning stages and then gradually became saturated. In the proposed hybrid synapse device, the linearity of PSC was determined by (i) the PSC updating origin (electron trapping into the WCL or detrapping from the WCL), and (ii) the polarity of the channel (p- or n-type channel) (see Supplementary Fig. 16). We further investigated the spike response for the synaptic device using an MoSe₂ that normally functions as an n-type channel and subsequently confirmed the similar LTP/LTD characteristics of the MoSe₂ and MoS₂ synaptic devices (see Supplementary Fig. 17). Meanwhile, the nonlinearity for the potentiation and depression channels was investigated with respect to (i) the amplitudes and durations of the spikes, and (ii) the conditions of CF₄ plasma treatment, as shown in Supplementary Fig. 18. The information on the spike responses for the potentiation and depression channels without WCL is provided in Supplementary Fig. 19.
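As a rough consistency check on the energy figures quoted in this section, the I × V × t_duration product can be evaluated directly. The sketch below assumes a 1 V, 10 ms spike for both reading and updating and uses a round PSC of 20 nA, which is close to the channel currents reported above; the function name and the chosen current are illustrative assumptions, and the detailed calculation is in Supplementary Fig. 14.

```python
def event_energy(current_a, voltage_v, duration_s):
    """Energy dissipated by one spike event, using the I x V x t_duration product (in J)."""
    return current_a * voltage_v * duration_s


# Reading: hypothetical PSC of 20 nA under an assumed 1 V, 10 ms spike
e_read = event_energy(20e-9, 1.0, 10e-3)       # about 0.2 nJ

# Updating: average WCT current implied by the reported 0.79 pJ for the same spike
i_update = 0.79e-12 / (1.0 * 10e-3)            # about 79 pA
e_update = event_energy(i_update, 1.0, 10e-3)  # recovers 0.79 pJ
```

With these assumptions the read energy falls inside the reported range of 0.12 to 0.71 nJ, and the picojoule-level update energy corresponds to a current of tens of picoamperes during the weight control spike.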

**Fig. 2: Study on the weight control mechanism of the vdW-hybrid synaptic device.**

Synaptic characteristics of the vdW-hybrid synaptic device with respect to various spike conditions

Next, we examined the CCV and RSD for the LTP/LTD curves and the symmetricity and dynamic range of the vdW-hybrid device under various voltage spike conditions. These indices significantly affect the performance of an HW-NN composed of artificial synapses^20,23,46. Figure 3a explains the two voltage spike conditions—one for potentiation and the other for depression—where the number, amplitude, duration, and frequency of the pulses were varied. As shown in Fig. 3b, we measured the LTP/LTD characteristics 15 times and then evaluated the CCV and RSD^21,46. The CCV was estimated as <1% in the LTP/LTD curves, as shown in Supplementary Fig. 20. Here, the nonlinearity ranging from 1.75 to 2.2 for the potentiation channel and from 1.8 to 2.35 for the depression channel was confirmed. For the RSD, which represents the ratio of the standard deviation (σ) to the mean (μ), values of 0.05 and 0.03 were obtained in the LTP and LTD curves, respectively. Also, as shown in Supplementary Fig. 21, we investigated the endurance (>10⁵ weight updating, 500 cycles of LTP/LTD) of the vdW-hybrid device. We then extracted the symmetricity and dynamic range from the LTP/LTD curves obtained when 32, 64, and 128 voltage spikes were applied, as shown in Fig. 3c. While the symmetricity was not significantly affected by the number of spikes, the dynamic range increased rapidly as the number of spikes increased (symmetricity/dynamic range: 7.95/96.12 nS for 32 states, 8.05/124.6 nS for 64 states, and 7.61/178 nS for 128 states). In addition to the effects of the number of spikes, we investigated the symmetricity and the dynamic range under different spike voltages. As the amplitude of the spikes increased from 1 to 5 V, the symmetricity decreased from 5.11 to 3.39, and the dynamic range increased from 95 to 326 nS (see Fig. 3d). As the duration of the spikes increased from 10 to 50 ms, the symmetricity decreased from 11.65 to 6.11, and the dynamic range increased from 91 to 270 nS (see Fig. 3e). 
As the frequency of the spikes increased from 2 to 8 Hz, the symmetricity decreased from 8.79 to 5.11, and the dynamic range increased from 71 to 95 nS (see Fig. 3f). As the amplitude, duration, and frequency of the spikes increased, the symmetricity was degraded and the dynamic range value increased, indicating that these performance indices had a tradeoff relationship with each other. A detailed analysis of the results for the symmetricity and the dynamic range is provided in Supplementary Fig. 22.
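The RSD used above is simply the ratio σ/μ over repeated measurements. A minimal sketch is given below; it assumes the population standard deviation (the text does not state whether the population or sample form was used), and the three conductance readings are hypothetical values chosen only to exercise the function.

```python
import statistics


def relative_standard_deviation(values):
    """RSD = sigma / mu for repeated measurements of the same quantity."""
    mu = statistics.mean(values)
    sigma = statistics.pstdev(values)  # population form; an assumption of this sketch
    return sigma / mu


# Hypothetical repeated readings of one conductance state, in nS
readings = [100.0, 105.0, 95.0]
rsd = relative_standard_deviation(readings)
```

Applied to each conductance state across the 15 repeated LTP/LTD measurements, a calculation of this kind yields one RSD value per state, which can then be summarized per curve.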

**Fig. 3: Characteristics of the vdW-hybrid synaptic device with respect to various spike conditions.**

Acoustic pattern recognition task

Finally, we demonstrated the feasibility of the vdW-hybrid synaptic device for an HW-NN via a training and inference simulation. For this simulation task, we defined a method to convert vocal signals into acoustic patterns and then prepared training and inference datasets, as shown in Fig. 4a and b. The first step was to express vocal signals as a function of time or frequency. We recorded the sound wave of a spoken word (“strawberry”) and obtained the sound information as a function of time. The sound amplitude vs. time information was transformed to the frequency domain via a Fourier transform. The second step was sampling the sound signals. In this step, we divided the sound signals into 200 time or frequency points. Finally, in the third step, the discrete signal information was transformed into an acoustic image with a 20 × 20 array size, as shown in Fig. 4b. For example, the 109th data point in the sound amplitude vs. time graph and the 61st data point in the sound magnitude vs. frequency graph were transferred into the pixels located at (6,11) and (14,1) in the acoustic pattern, respectively (see dotted red line). Here, each pixel had a grayscale value in the range of 0–255. Datasets with 3000 training and 400 inference acoustic pattern images were prepared similarly for five distinct words: “apple,” “orange,” “kiwi,” “banana,” and “strawberry” (see Fig. 4c). Additional information about the datasets is presented in Supplementary Fig. 23. We also prepared two types of spoken digit datasets consisting of cochleagram patterns or our acoustic patterns, where the Lyon’s auditory model was applied to create the cochleagram patterns (see Supplementary Fig. 24)⁴⁷. Then, as shown in Fig. 4d, we theoretically designed a single-layer artificial neural network (ANN) consisting of 400 input neurons, five output neurons, and 400 × 5 artificial synapses connecting the neurons. 
The voltage signals (V_n) corresponding to each pixel of the acoustic pattern were assumed to be applied to the input neuron layer. They were multiplied by the synapse weight (W_n,m) and then summed at the output neurons. Consequently, output currents $(I_{\mathrm{m}} = \sum_{n=1}^{400} W_{\mathrm{n,m}} V_{\mathrm{n}})$ were obtained at the output neuron layer. The synapse weight was defined as the conductance values of the synaptic device (W = G). Next, the output value (f_m) obtained via the sigmoid activation function $(f(I_{\mathrm{m}}) = \frac{1}{1 + e^{-I_{\mathrm{m}}}})$ was compared with each label value (k_m). Finally, the synapse weights were updated via the backpropagation algorithm (details are presented in the Methods section). Figure 4e shows the hardware neural network (HW-NN) comprising the vdW-hybrid synaptic devices, which is applicable to the implementation of the conceptual neural networks for acoustic and MNIST digit pattern recognition tasks. Details on this HW-NN are described in Supplementary Fig. 25.
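The three conversion steps and the forward pass of the single-layer network can be sketched in plain Python. Several details here are assumptions rather than the procedure used in this work: the row-major pixel layout (time-domain points in the first ten rows, frequency-domain points in the last ten), the per-domain rescaling to 0–255, the pixel-to-voltage scaling, the naive discrete Fourier transform, and the synthetic decaying tone that stands in for a recorded word. All function names are hypothetical.

```python
import cmath
import math


def resample(values, n_points):
    """Pick n_points evenly spaced samples from a sequence."""
    last = len(values) - 1
    return [values[round(i * last / (n_points - 1))] for i in range(n_points)]


def dft_magnitudes(signal, n_bins):
    """Naive discrete Fourier transform; magnitude of the first n_bins frequency bins."""
    n = len(signal)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal)))
            for k in range(n_bins)]


def to_grayscale(values):
    """Linearly rescale values to integers in the range 0..255."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0
    return [round(255 * (v - lo) / span) for v in values]


def acoustic_pattern(waveform):
    """Build a 20 x 20 image from 200 time-domain and 200 frequency-domain points."""
    time_pts = resample(waveform, 200)
    freq_pts = resample(dft_magnitudes(waveform, len(waveform) // 2), 200)
    pixels = to_grayscale(time_pts) + to_grayscale(freq_pts)  # assumed layout
    return [pixels[r * 20:(r + 1) * 20] for r in range(20)]


def forward(pattern, weights):
    """I_m = sum_n W[n][m] * V[n], followed by the sigmoid activation."""
    v = [p / 255 for row in pattern for p in row]  # pixel -> input voltage (assumed scaling)
    currents = [sum(w_n[m] * v_n for w_n, v_n in zip(weights, v))
                for m in range(len(weights[0]))]
    return [1 / (1 + math.exp(-i)) for i in currents]


# Synthetic stand-in for a spoken word: decaying 440 Hz tone, 400 samples at 4 kHz
wave = [math.exp(-t / 150) * math.sin(2 * math.pi * 440 * t / 4000) for t in range(400)]
image = acoustic_pattern(wave)
outputs = forward(image, [[0.0] * 5 for _ in range(400)])  # zero weights give 0.5 per output
```

The weight matrix has 400 rows and five columns, matching the 400 × 5 synapse array; in a hardware implementation each entry would be the conductance of one hybrid device, suitably normalized before the sigmoid is applied.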

The conductance of the vdW-hybrid synaptic device was updated by only trapping electrons in the WCL, which caused the conductance state to no longer potentiate or depress when G_P or G_D reached G_max or G_min. Therefore, for training the ANN composed of the hybrid devices, we employed a conductance updating method based on the operations of “refresh” and “reprogram” for updating G_P and G_D. As shown in Fig. 4f (top), when G_P reached G_max (G_{P_128}), both G_P and G_D were refreshed to G_min (G_{P_128→0}) and G_max (G_{D_16→128}). For the operation of “refresh” in terms of implementation in hardware, (i) the peripheral circuits to read the G_P and G_D separately and (ii) the physical separation of the channels are required simultaneously. Subsequently, G_P was reprogrammed to the value of G_D before the refreshing step (G_{P_0→16}), maintaining its conductance value (G = G_{P_128} + G_{D_16} = G_{P_16} + G_{D_128}). For the operation of “reprogram” in terms of implementation in hardware, the peripheral circuits and memory are required additionally, which will store the conductance value and write it back to the device. Similarly, as shown in Fig. 4f (bottom), when G_D reached G_min (G_{D_0}), G_P and G_D were refreshed, and then G_D was reprogrammed to the value of G_P before the refreshing step (G_{D_128→112}). We also employed a conductance updating method without the operations of “refresh” and “reprogram” as shown in Supplementary Fig. 26. The training process was conducted for three types of ANNs composed of hybrid (green curve), WSe₂ (red curve), and MoS₂ (blue curve) devices, and we calculated the recognition rate every 100 training steps, as shown in Fig. 4g. The same training process was performed with an SW-NN (purple curve), for which the synaptic weights were updated using the Widrow–Hoff learning rule⁴⁸. Also, the recognition rates for the acoustic patterns formed with frequency- and/or time-domain data are provided in Supplementary Fig. 27. 
After the training and inference tasks, as shown in Fig. 4h, the maximum recognition rates and corresponding variation values, which denote the degree of fluctuation in the learning curves, were examined. The maximum recognition rate/variation values were 73.6%/12.5%, 78.5%/7.9%, and 94.2%/4.9% for the WSe₂, MoS₂, and vdW-hybrid synaptic devices, respectively. The values for the hybrid device were closest to those for the SW-NN (95.3%/6.1%). Similar training and inference analyses were performed for (i) various spike conditions (number, amplitude, duration, and frequency of spikes) with the designed acoustic patterns, (ii) different layer numbers of the ANN (single- and multi-layer) using the MNIST datasets, and (iii) the two types of spoken digit datasets consisting of cochleagram patterns or acoustic patterns, as shown in Supplementary Figs. 28, 29, 30, and Supplementary Table 2, respectively^21,49.
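The "refresh" and "reprogram" bookkeeping described above can be made concrete with a small state-index model. The sketch assumes that both channels have identical, perfectly linear ladders of 128 states, so that the total weight is proportional to the sum of the two indices; the class and attribute names are hypothetical, and the peripheral read and write circuitry is not modeled.

```python
N_STATES = 128  # conductance states per channel, matching the 128-spike LTP/LTD curves


class HybridWeight:
    """State indices of the two channels; the weight is proportional to p + d."""

    def __init__(self, p=0, d=N_STATES):
        self.p = p  # G_P state index: trapping-only spikes can only raise it
        self.d = d  # G_D state index: trapping-only spikes can only lower it

    @property
    def total(self):
        return self.p + self.d

    def potentiate(self):
        if self.p < N_STATES:
            self.p += 1
        if self.p == N_STATES:
            # G_P reached G_max: refresh (G_P -> min, G_D -> max),
            # then reprogram G_P to the value G_D held before the refresh
            old_d = self.d
            self.p, self.d = 0, N_STATES
            self.p = old_d

    def depress(self):
        if self.d > 0:
            self.d -= 1
        if self.d == 0:
            # G_D reached G_min: refresh (G_P -> min, G_D -> max),
            # then reprogram G_D to the value G_P held before the refresh
            old_p = self.p
            self.p, self.d = 0, N_STATES
            self.d = old_p


w_up = HybridWeight(p=127, d=16)
w_up.potentiate()      # reaches G_P_128, then becomes (G_P_16, G_D_128)

w_down = HybridWeight(p=112, d=1)
w_down.depress()       # reaches G_D_0, then becomes (G_P_0, G_D_112)
```

Both examples reproduce the index pairs given in the text, and the sum of the indices is unchanged by the refresh and reprogram steps, which is why the overall conductance G is maintained.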

Discussion

We developed a vdW-hybrid synaptic device featuring linear and symmetric update characteristics by utilizing WSe₂ and MoS₂ hybrid channels that are specialized for linear conductance potentiation and depression, respectively. Excellent conductance controllability of the vdW-hybrid synapse was achieved by utilizing only the electron-trapping phenomenon in the WCL. The vdW-hybrid synaptic device exhibited nonlinearity and symmetricity of 1.9/1.9 (β_P/β_D) and 13.26, respectively, an effective conductance-state ratio of 85.94% for threshold_ΔG = 0.3%, a very small variation (~1%) after state changes by excitatory and inhibitory spikes, a CCV of <1%, and an RSD of 0.05/0.03 (weight potentiation/depression). Such synaptic characteristics are highlighted in Supplementary Table 1, where our synaptic device is compared with other devices reported to date. Through in-depth analysis and characterization of the vdW-hybrid synaptic device, we demonstrated the feasibility of the device for an HW-NN. It exhibited high recognition rates close to those for an SW-NN via training and inference simulation, in which our designed acoustic patterns were employed. Using this hybrid synaptic device, we achieved a recognition rate of 93.8% for an acoustic pattern recognition task, which was close to that for the SW-NN (95.3%). This work indicates the potential for building HW-NNs for highly accurate brain-inspired computing.

Methods

Fabrication of the synaptic devices

The individual electrodes for the WCT with a width of 20 μm were patterned on a 90-nm-thick SiO₂ layer on a heavily B-doped Si substrate using an optical lithography process, followed by the deposition of 10-nm-thick Ti and 30-nm-thick Au using an electron-beam evaporator. h-BN flakes were mechanically transferred onto the WCTs via a residue-free transfer method based on adhesion energy engineering¹². Then, CF₄ plasma treatment was conducted on the h-BN flakes using a plasma machine (Miniplasma Cube, PLASMART). To stabilize the chamber conditions, CF₄ gas was flowed for 1 min before the CF₄ plasma treatment. The CF₄ plasma treatment conditions were as follows: reactive ion etcher powers of 5, 10, and 20 W; a plasma pressure of 500 mTorr; a CF₄ flow rate of 5 sccm; and treatment times of 10, 20, and 90 s. The WSe₂ and MoS₂ flakes were then transferred onto the WCL/h-BN via the same transfer method. The postsynaptic and presynaptic electrodes (the distance between the two electrodes and the width of the electrodes were both 5 μm) were patterned on the WSe₂/WCL/h-BN (potentiation channel) and MoS₂/WCL/h-BN (depression channel) structures, followed by the deposition of a 10-nm-thick Pt contact for the potentiation channel, a 10-nm-thick Ti contact for the depression channel, and a 50-nm-thick Au pad.

Characterization of the synaptic devices

For structural and elemental analyses of the WSe₂/WCL/h-BN and MoS₂/WCL/h-BN regions, X-TEM (JEM ARM 200 F) and EELS (GIF Quantum ER, 200 keV) measurements were performed. Raman analysis was performed at various positions on the WSe₂/WCL/h-BN and MoS₂/WCL/h-BN samples using a WITec micro-Raman spectrometer system with a frequency-doubled Nd-doped yttrium aluminum garnet (Nd-YAG) laser beam (532-nm laser excitation). AFM was performed using an NX10 system (Park Systems Corp.). Electrical measurements of the synaptic devices were performed using an HP-4155A semiconductor parameter analyzer connected to a voltage spike generator (Keysight, 33500B). The aforementioned measurement setup for the synaptic devices is described in detail in Supplementary Fig. 2.

Weight update for synaptic devices

Currents at the output neurons were transformed by a sigmoid activation function to give the output neuron signals (f). The delta value (δ), defined as the difference between the label value (k) for an input pattern and the output neuron signal (δ = k − f), determined whether a synaptic weight was potentiated or depressed. If δ > 0 (potentiation phase), G is increased; if δ < 0 (depression phase), G is decreased. The conductance changes (ΔG) were determined by the following equations:

$$G_{n+1} = G_{n} + \Delta G_{\mathrm{P}} = G_{n} + \alpha_{\mathrm{P}}\, e^{-\beta_{\mathrm{P}} \frac{G_{n} - G_{\mathrm{min}}}{G_{\mathrm{max}} - G_{\mathrm{min}}}} \quad \left( \Delta G > 0,\ G \uparrow \right),$$

$$G_{n+1} = G_{n} + \Delta G_{\mathrm{D}} = G_{n} - \alpha_{\mathrm{D}}\, e^{-\beta_{\mathrm{D}} \frac{G_{\mathrm{max}} - G_{n}}{G_{\mathrm{max}} - G_{\mathrm{min}}}} \quad \left( \Delta G < 0,\ G \downarrow \right).$$

In these equations, Gₙ₊₁ and Gₙ denote the synaptic conductance when the (n + 1)th and nth pulses are applied, G_max and G_min are the maximum and minimum conductance, and the parameters α and β are the amount of conductance change and the nonlinearity, respectively. Fitting results are provided in Supplementary Table 3. The above pattern recognition processing was implemented in MATLAB.
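As an illustration only, the update rule above can be sketched in Python (the original simulation used MATLAB scripts, which are not reproduced here). The names `SynapseParams`, `potentiate`, `depress`, and `update_weight` are hypothetical, as are the normalized conductance range and the step sizes α; only β = 1.9 is taken from the reported device. Clipping the result to [G_min, G_max] and feeding a dimensionless value to the sigmoid are added assumptions, not steps stated in the text.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class SynapseParams:
    """Parameters of the update model; all values except beta are illustrative."""
    g_min: float = 0.0     # hypothetical normalized minimum conductance
    g_max: float = 1.0     # hypothetical normalized maximum conductance
    alpha_p: float = 0.05  # hypothetical potentiation step size
    alpha_d: float = 0.05  # hypothetical depression step size
    beta_p: float = 1.9    # potentiation nonlinearity reported for the device
    beta_d: float = 1.9    # depression nonlinearity reported for the device


def sigmoid(x: float) -> float:
    """Logistic activation applied to the output-neuron input."""
    return 1.0 / (1.0 + math.exp(-x))


def potentiate(g: float, p: SynapseParams) -> float:
    """One potentiation pulse: G + alpha_P * exp(-beta_P * (G - G_min) / (G_max - G_min))."""
    x = (g - p.g_min) / (p.g_max - p.g_min)
    g_next = g + p.alpha_p * math.exp(-p.beta_p * x)
    return min(g_next, p.g_max)  # assumption: keep G inside the device range


def depress(g: float, p: SynapseParams) -> float:
    """One depression pulse: G - alpha_D * exp(-beta_D * (G_max - G) / (G_max - G_min))."""
    x = (p.g_max - g) / (p.g_max - p.g_min)
    g_next = g - p.alpha_d * math.exp(-p.beta_d * x)
    return max(g_next, p.g_min)  # assumption: keep G inside the device range


def update_weight(g: float, neuron_input: float, label: float, p: SynapseParams) -> float:
    """Pick potentiation or depression from the sign of delta = k - f."""
    f = sigmoid(neuron_input)  # output neuron signal
    delta = label - f          # delta value
    if delta > 0:
        return potentiate(g, p)
    if delta < 0:
        return depress(g, p)
    return g


if __name__ == "__main__":
    params = SynapseParams()
    g = params.g_min
    # Apply five potentiating updates and print the conductance after each.
    for n in range(5):
        g = update_weight(g, neuron_input=0.0, label=1.0, p=params)
        print(f"pulse {n + 1}: G = {g:.4f}")
```

With β close to zero the exponential factor approaches 1 and each pulse changes G by a constant α, which is the linear-update limit that the device characteristics aim for.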

Data availability

The data that support the findings of this study are available from the corresponding author upon request.

Code availability

Code from this study (MATLAB scripts) is available from the corresponding author upon request.

References

1. Backus, J. Can programming be liberated from the von Neumann style?: a functional style and its algebra of programs. Commun. ACM 21, 613–641 (1978).
2. Mead, C. Neuromorphic electronic systems. Proc. IEEE 78, 1629–1636 (1990).
3. Churchland, P. S. & Sejnowski, T. J. The Computational Brain (MIT Press, Cambridge, 1992).
4. Merolla, P. A. et al. A million spiking-neuron integrated circuit with a scalable communication network and interface. Science 345, 668–673 (2014).
5. Burr, G. W. et al. Experimental demonstration and tolerancing of a large-scale neural network (165 000 synapses) using phase-change memory as the synaptic weight element. IEEE Trans. Electron Devices 62, 3498–3507 (2015).
6. Ohno, T. et al. Short-term plasticity and long-term potentiation mimicked in single inorganic synapses. Nat. Mater. 10, 591–595 (2011).
7. Shi, Y. et al. Electronic synapses made of layered two-dimensional materials. Nat. Elect. 1, 458–465 (2018).
8. Yu, S. et al. An electronic synapse device based on metal oxide resistive switching memory for neuromorphic computation. IEEE Trans. Electron Devices 58, 2729–2737 (2011).
9. Prezioso, M. et al. Training and operation of an integrated neuromorphic network based on metal-oxide memristors. Nature 521, 61–64 (2015).
10. Zhang, W. et al. Designing crystallization in phase-change materials for universal memory and neuro-inspired computing. Nat. Rev. Mater. 4, 150–168 (2019).
11. Wong, H.-S. P. et al. Phase change memory. Proc. IEEE 98, 2201–2227 (2010).
12. Seo, S. et al. Artificial optic-neural synapse for colored and color-mixed pattern recognition. Nat. Commun. 9, 5106 (2018).
13. Shi, J. et al. A correlated nickelate synaptic transistor. Nat. Commun. 4, 2676 (2013).
14. Kim, M.-K. & Lee, J.-S. Ferroelectric analog synaptic transistors. Nano Lett. 19, 2044–2050 (2019).
15. Wang, H. et al. A ferroelectric/electrochemical modulated organic synapse for ultraflexible, artificial visual-perception system. Adv. Mater. 30, e1803961 (2018).
16. van de Burgt, Y. et al. A non-volatile organic electrochemical device as a low-voltage artificial synapse for neuromorphic computing. Nat. Mater. 16, 414–418 (2017).
17. Qian, C. et al. Artificial synapses based on in-plane gate organic electrochemical transistors. ACS Appl. Mater. Interface 8, 26169–26175 (2016).
18. Seo, S. et al. Recent progress in artificial synapses based on two-dimensional van der Waals materials for brain-inspired computing. ACS Appl. Electron Mater. 2, 371–388 (2020).
19. Kang, D.-H. et al. A neuromorphic device implemented on a Salmon-DNA electrolyte and its application to artificial neural networks. Adv. Sci. 6, 1901265 (2019).
20. Sun, J. et al. Optoelectronic synapse based on IGZO-Alkylated graphene oxide hybrid structure. Adv. Funct. Mater. 28, 1804397 (2018).
21. Chen, P.-Y. et al. NeuroSim: a circuit-level macro model for benchmarking neuro-inspired architectures in online learning. IEEE Trans. Comput. Aid. Design Integ. Circuits Syst. 37, 3067–3080 (2018).
22. Lim, S. et al. Adaptive learning rule for hardware-based deep neural networks using electronic synapse devices. Neural Comput. Appl. 31, 8101–8116 (2019).
23. Yu, S. Neuro-inspired computing with emerging nonvolatile memorys. Proc. IEEE 106, 260–285 (2018).
24. Ambrogio, S. et al. Equivalent-accuracy accelerated neural-network training using analogue memory. Nature 558, 60–67 (2018).
25. Burr, G. W. et al. Neuromorphic computing using non-volatile memory. Adv. Phys. X 2, 89–124 (2017).
26. Woo, J. et al. Improved synaptic behavior under identical pulses using AlO_X/HfO₂ bilayer RRAM array for neuromorphic systems. IEEE Electron Device Lett. 37, 994–997 (2016).
27. Park, S. et al. Neuromorphic speech systems using advanced ReRAM-based synapse. IEEE Int. Electron Devices Meeting (IEDM). https://ieeexplore.ieee.org/document/6724692 (2013).
28. Jerry, M. et al. Ferroelectric FET analog synapse for acceleration of deep neural network training. IEEE Int. Electron Devices Meeting (IEDM). https://ieeexplore.ieee.org/document/8268338 (2017).
29. Fuller, E. J. et al. Parallel programming of an ionic floating-gate memory array for scalable neuromorphic computing. Science 364, 570–574 (2019).
30. Kim, S. et al. Analog CMOS-based resistive processing unit for deep neural network training. IEEE 60th MWSCAS. https://ieeexplore.ieee.org/document/8052950 (2017).
31. Sun, X. et al. Exploiting hybrid precision for training and inference: A 2T-1FeFET based analog synaptic weight cell. IEEE Int. Electron Devices Meeting (IEDM). https://ieeexplore.ieee.org/document/8614611 (2018).
32. Foster, M. & Sherrington, C. S. Textbook of Physiology. (Macmillan, 1897).
33. Geim, A. K. & Grigorieva, I. V. Van der Waals heterostructures. Nature 499, 419–425 (2013).
34. Novoselov, K. S. et al. 2D materials and van der Waals heterostructures. Science 353, aac9439 (2016).
35. Shim, J. et al. Electronic and optoelectronic devices based on two-dimensional materials: from fabrication to application. Adv. Elect. Mater. 3, 1600364 (2017).
36. Paul, T. et al. A high-performance MoS₂ synaptic device with floating gate engineering for neuromorphic computing. 2D Mater. 6, 045008 (2019).
37. Choi, M. S. et al. Controlled charge trapping by molybdenum disulphide and graphene in ultrathin heterostructured memory devices. Nat. Commun. 4, 1624 (2013).
38. Liu, B. et al. A Fluorographene-based synaptic transistor. Adv. Mater. Tech. 4, 1900422 (2019).
39. Arnold, A. J. et al. Mimicking neurotransmitter release in chemical synapses via hysteresis engineering in MoS₂ transistors. ACS Nano 11, 3110–3118 (2017).
40. Tran, T. T. et al. Quantum emission from hexagonal boron nitride monolayers. Nat. Nanotech. 11, 37–41 (2016).
41. Luo, X. et al. Reversible photo-induced doping in WSe₂ field effect transistors. Nanoscale 11, 7358–7363 (2019).
42. Museur, L. et al. Defect-related photoluminescence of hexagonal boron nitride. Phys. Rev. B 78, 155204 (2008).
43. Hastas, N. A. et al. Electrical transport and low frequency noise characteristics of Au/n-GaAs Schottky diodes containing InAs quantum dots. Semicon. Sci. Tech. 19, 461–467 (2004).
44. Lin, Y.-F. et al. Barrier inhomogeneities at vertically stacked graphene-based heterostructures. Nanoscale 6, 795–799 (2014).
45. Yang, C.-S. et al. All-solid-state synaptic transistor with ultralow conductance for neuromorphic computing. Adv. Funct. Mater. 27, 1804170 (2018).
46. Kim, S. et al. Impact of synaptic device variations on pattern recognition accuracy in a hardware neural network. Sci. Rep. 8, 2638 (2018).
47. Lyon, R. F. et al. A computational model of filtering, detection, and compression in the cochlea. Proceed of IEEE-ICASSP-82. 1282–1285 (1982).
48. Widrow, B. et al. Stationary and nonstationary learning characteristics of the LMS adaptive filters. Proc. IEEE 64, 1151–1162 (1976).
49. LeCun, Y. et al. Deep learning. Nature 521, 436–444 (2015).

Acknowledgements

This research was supported by the Basic Science Research Program, Basic Research Lab Program, and Nano-Material Technology Development Program through National Research Foundation of Korea (NRF) grants funded by the Korean government (MSIP) (2020R1A4A2002806, 2019M3F3A1A01074451, 2018R1A2A2A05020475, and 2016M3A7B4910426).

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Korea
Seunghwan Seo, Beom-Seok Kang, Je-Jun Lee, Hyo-Jun Ryu, Sungjun Kim, Hyeongjun Kim, Seyong Oh, Keun Heo & Jin-Hong Park
Semiconductor R&D Center, Samsung Electronics Co. Ltd, Hwasung, 18448, Korea
Hyo-Jun Ryu
Foundry Division, Samsung Electronics Co. Ltd., Youngin, 17113, Korea
Sungjun Kim
Department of Mechanical Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA, 02139, USA
Jaewoo Shim
Division of Electrical Engineering, Hanyang University, Ansan, 15588, Korea
Saeroonter Oh
Sungkyunkwan Advanced Institute of Nanotechnology (SAINT), Sungkyunkwan University, Suwon, 16417, Korea
Jin-Hong Park

Contributions

S.S. and P.J.-H. designed the experiments and analyzed the data. K.B.-S. and R.H.-J. contributed to the device fabrication. L.J.-J., K.H., and O.S. performed the X-TEM, EELS, XPS, and PL measurements. K.S., S.J., and H.K. performed the pattern recognition simulation. P.J.-H. supervised the research. All authors have discussed the results and commented on the manuscript.

Corresponding author

Correspondence to Jin-Hong Park.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Christopher H Bennett and the other, anonymous, reviewers for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Seo, S., Kang, BS., Lee, JJ. et al. Artificial van der Waals hybrid synapse and its application to acoustic pattern recognition. Nat Commun 11, 3936 (2020). https://doi.org/10.1038/s41467-020-17849-3

Received: 23 November 2019
Accepted: 20 July 2020
Published: 07 August 2020
DOI: https://doi.org/10.1038/s41467-020-17849-3
