Introduction

The development of artificial neural networks (ANNs)1 capable of matching the information-processing efficiency of their biological counterparts remains a major challenge in computing, and a solution would undoubtedly be very rewarding. For example, the mammalian brain outperforms computers in many computational tasks: it can recognize a complex image faster and with better fidelity, yet it consumes only a fraction of the energy that computers require for the same task. Today, the need for efficient ANNs is as great as ever, given the increased awareness of energy consumption and the growing demands for information processing in the coming era of robotics, bioinformatics and distributed sensor networks.

One of the major challenges in this pursuit is the lack of suitable hardware to implement synapses, which are the most numerous processing elements in ANNs. In this context, memristors2,3,4,5,6,7,8 are very promising candidates owing to their analogue memory functionality, which is similar to that of biological synapses8,9. In its simplest form, a memristor is a two-terminal device whose conductance can be changed continuously by applying a relatively large voltage bias but is retained when a smaller bias or no bias is applied2,8. In accordance with ref. 2, in this paper, the term ‘memristor’ is used in a significantly broader sense and implies various types of memristive devices, characterized by pinched hysteresis loops. A memristor’s simple structure allows for crossbar circuit integration and aggressive size scaling, which are necessary for the practical implementation of large-scale ANNs9,10. Another prominent artificial synapse candidate with analogue memory functionality is based on a floating gate transistor and was pioneered in the 1980s by Carver Mead and his students11,12. However, floating gate synapses do not scale as well as memristors because of the increasing difficulty of confining electrons at the nanoscale and their inherent three-terminal structure.

The first centimetre-scale, three-terminal memristor, called the ‘memistor’, was demonstrated more than half a century ago13, but that work was largely overlooked owing to the rapid progress of silicon technology at the time. Hence, memristor research did not lead to the development of any major hardware, except for a few notable projects in the 1990s14,15. The situation has changed dramatically in recent years8, however, and memristor technology is rapidly advancing owing to the semiconductor industry’s large-scale effort to develop digital memristor-based crossbar memories. Recent milestones for metal-oxide memristors, which are compatible with existing silicon technology, include demonstrations of sub-10-nm devices16, 10^12-cycle endurance17, picojoule switching energies18, sub-ns switching speeds19 and the monolithic integration of several memristive crossbar layers20. These milestones, in turn, revived interest in the development of memristor-based ANNs and led to numerous few- or single-memristor demonstrations of synaptic functionality and simple associative memory21,22,23,24,25,26,27,28, as well as the theoretical modelling of large-scale networks10,29,30,31,32,33. Despite significant progress in memristor crossbar memories20,34,35,36,37, memristor-based ANNs have proven to be significantly more challenging and have yet to be demonstrated. This paper reports the first successful experimental demonstration of pattern classification by a single-layer perceptron network implemented with a memristive crossbar circuit, in which synaptic weights are realized as conductances of titanium dioxide memristors with nanoscale active regions. In particular, we first demonstrate an ex situ training method, for which the synaptic weights are calculated in a software-based precursor network and then imported sequentially into the crossbar circuit. We then present in situ training, which is more practical but also significantly more demanding because it requires adjusting the synaptic weights in parallel according to the learning rule. For the latter case, the demonstrated circuit functionality relies on the collective operation of 20 variation-prone memristors, which is at least an order of magnitude more complex than previously reported memristor-based ANNs23,24,25.

Results

Pattern classifier implementation

A single-layer perceptron (Fig. 1a) consists of nine binary inputs xi corresponding to a 3 × 3 pixel array (Fig. 1b), one bias input x0 and one binary output y. The bias is fixed at the logical value x0=+1, whereas the inputs and output take the values +1 or −1 (ref. 1). The task of the binary perceptron is to classify the input patterns (Fig. 1d) into two groups by performing the operation y=sgn(Σi wixi), where wi is an analogue weight representing the synaptic strength between the i-th input and the output. The considered sets of patterns (Fig. 1d) represent the letters ‘X’ and ‘T’ and their noisy versions, and are adopted from the original work on memistors13. As these sets are linearly separable, there exist weights w that permit successful classification1. Such weights cannot be calculated analytically for the considered patterns but are instead found via an optimization procedure, that is, the training process in the context of ANNs. During one training epoch, randomly ordered patterns from the training set are individually fed to the perceptron, and the weights are updated according to a training rule every time a new pattern is processed. For the ex situ method, training is performed in a software-implemented precursor network, and the final set of weights is then imported into the hardware. For the in situ method, training is implemented directly in the hardware, and the weights are continuously updated in situ for every new pattern applied. Owing to its simplicity, we have chosen the perceptron learning rule38, which requires a weight update according to the formula Δwi=αxi(p)(d(p)−y(p)), where α is the learning rate constant and y(p) and d(p) are the actual and desired outputs, respectively, for a particular pattern p from the training set. Similar to the original work on memistors13, the training and test sets are the same and consist of the 50 patterns shown in Fig. 1d, with some patterns appearing several times (Supplementary Table S1).
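For concreteness, a minimal software sketch of such a precursor network is given below; the pixel encodings, learning rate, epoch count and weight initialization are illustrative assumptions, and only the update rule Δwi=αxi(d−y) follows the description above.

```python
import numpy as np

# Minimal sketch of the software precursor perceptron used for ex situ training.
# Pattern encodings, learning rate and epoch count are illustrative assumptions;
# only the update rule dw_i = alpha * x_i * (d - y) follows the text.

def train_perceptron(patterns, labels, alpha=0.1, epochs=20, rng=None):
    rng = rng or np.random.default_rng(0)
    n_inputs = patterns.shape[1]               # 9 pixels
    w = rng.uniform(-0.1, 0.1, n_inputs + 1)   # +1 for the bias weight w0
    for _ in range(epochs):
        for p in rng.permutation(len(patterns)):
            x = np.concatenate(([1.0], patterns[p]))   # bias input x0 = +1
            y = 1 if np.dot(w, x) >= 0 else -1         # y = sgn(sum_i w_i x_i)
            w += alpha * x * (labels[p] - y)           # perceptron learning rule
    return w

# Example: two illustrative noise-free patterns, 'X' labelled +1 and 'T' labelled -1
X = np.array([ 1, -1,  1, -1,  1, -1,  1, -1,  1])  # pixels encoded as +/-1
T = np.array([ 1,  1,  1, -1,  1, -1, -1,  1, -1])
weights = train_perceptron(np.stack([X, T]), np.array([1, -1]))
```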

Figure 1: The pattern classification setup.

(a) A mathematical abstraction of a single-layer perceptron implementing a pattern classification for (b) 3 × 3 binary images. (c) A memristive crossbar circuit implementation of the most critical part of the perceptron, the weighted sum. (d) The two classes of patterns13 considered in the classification experiment.

Figure 1c shows the perceptron implementation with 20 Pt/TiO2−x/Pt memristors connected into a 2 × 10 crossbar circuit. The physical variables corresponding to x and w are the input voltage V=±0.2 V and the memristor conductance G≡I(+0.2 V)/0.2 V measured at the input voltage, respectively. For the considered devices, |I(+0.2 V)|≈|I(−0.2 V)| owing to the symmetric I–V curve. More specifically, each synaptic weight wi∝Gi=Gi+−Gi− is represented with a pair of memristors with conductances Gi+ and Gi−, which allows for the implementation of negative weight values. By grounding the output (horizontal) wires, the crossbar circuit efficiently implements the most critical operation in the perceptron, namely, the multiplication of the input voltages by the conductances of the memristors and the summation of the resulting currents on each wire, that is, I+=ΣiGi+Vi and I−=ΣiGi−Vi, all in analogue fashion. In principle, the output y=sgn[I+−I−] could be obtained from I+ and I− by first converting the currents to voltages using virtual-ground-mode operational amplifiers and then using a differential amplifier for thresholding1. As such circuitry could be readily implemented with conventional silicon technology39 and is not critical for our demonstration, y is obtained by directly measuring I+ and I− with a parameter analyser (Fig. 1c and Supplementary Figs S1 and S2).
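The weighted-sum operation can be summarized by the following sketch, in which the conductance values and the pixel-to-voltage mapping are illustrative assumptions; only the differential readout y=sgn[I+−I−] follows the circuit described above.

```python
import numpy as np

# Sketch of the analogue dot product performed by the crossbar. Each weight is a
# differential pair (G_plus, G_minus); inputs are applied as V = +/-0.2 V and the
# output is y = sgn(I_plus - I_minus). Conductance values are illustrative.

V_READ = 0.2  # V

def crossbar_output(x, G_plus, G_minus):
    """x: array of +/-1 inputs (bias included); G_*: conductances in siemens."""
    V = V_READ * np.asarray(x, dtype=float)     # pixel -> voltage mapping
    I_plus = np.dot(G_plus, V)                  # current summed on the top wire
    I_minus = np.dot(G_minus, V)                # current summed on the bottom wire
    return 1 if (I_plus - I_minus) >= 0 else -1

# Example with 10 inputs (bias + 9 pixels) and random conductances in [50, 500] uS
rng = np.random.default_rng(1)
Gp = rng.uniform(5e-5, 5e-4, 10)
Gm = rng.uniform(5e-5, 5e-4, 10)
x = np.array([1, 1, -1, 1, -1, 1, -1, 1, -1, 1])   # bias first, then 9 pixels
print(crossbar_output(x, Gp, Gm))
```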

Memristors (Fig. 2a) are fabricated with nanoscale e-beam-defined protrusions to localize the active area during the forming process to a (20 nm)^3 volume (Fig. 2c). Although this technique significantly decreased the variations and improved the yield (Supplementary Fig. S3c), significant dispersion in the switching dynamics still exists (Fig. 2b, Supplementary Fig. S4). Another feature evident from Fig. 2b is the strongly nonlinear switching dynamics. Such dynamics are typical for nonvolatile anion memristors, in which the ionic mobility depends super-exponentially on the applied voltage owing to Joule heating40,41,42. Because of this strong nonlinearity, no significant changes in the conductance state occur for applied voltages in the range |V|<Vswitch≈0.7 V (shown in grey in Fig. 2a) over a large time scale. For all memristors considered in this work, Gmax=5 × 10^−4 S and Gmin=5 × 10^−5 S, so the dynamic range is Gmax/Gmin≈10 and the effective synaptic strength can be set to any value from −(Gmax−Gmin) to +(Gmax−Gmin) owing to the differential representation.
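As a rough illustration of this behaviour (and not a model of the measured devices), the toy update below uses an assumed exponential voltage dependence with purely illustrative parameters, so that read-level voltages leave the state essentially unchanged while write-level voltages move it appreciably.

```python
import numpy as np

# Toy illustration (not the authors' device model) of strongly nonlinear switching:
# the conductance change per pulse grows very steeply with voltage, so pulses well
# below the switching threshold leave the state essentially untouched.

def conductance_update(G, V, G_min=5e-5, G_max=5e-4, k=1e-9, V0=0.07):
    """Assumed exponential voltage dependence; k and V0 are purely illustrative."""
    dG = np.sign(V) * k * np.exp(abs(V) / V0)        # super-steep voltage dependence
    return float(np.clip(G + dG, G_min, G_max))      # state stays within dynamic range

G = 1e-4
print(conductance_update(G, 0.2))   # read-level voltage: change is negligible
print(conductance_update(G, 0.9))   # write-level voltage: state moves appreciably
```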

Figure 2: The titanium dioxide memristor.

(a) A typical TiO2−x device I–V curve with multiple set and reset switching cycles; (b) the switching dynamics characterized by fixed-duration pulses on a batch of ten devices; (c) a micrograph of the device (the scale bar is 200 nm); and (d) a magnified image of the active part (the scale bar is 20 nm). The bottom inset of panel a shows the device structure schematically, whereas the top inset of this panel and the inset of panel b present the voltage stimulus for the particular measurement. In particular, panel b shows the conductance state G, normalized to the corresponding initial state, after the application of a single voltage pulse of varying amplitude. In all cases, before the voltage pulse is applied, the device is set to one of the three initial states shown in panel b with high precision using a specific tuning algorithm39 (Supplementary Fig. S5). The Supplementary Information provides additional detail on the device structure (Supplementary Fig. S3) and the characterization of the switching dynamics (Supplementary Fig. S4).

Ex situ and in situ training

Both ex situ and in situ training methods are experimentally demonstrated. For the first method, the calculated weights (Supplementary Table S2) are imported individually using a programming algorithm39, which consists of a sequence of alternating write and read pulses (Fig. 3a and Supplementary Fig. S5). Similar to crossbar memories8,43, a half-biasing technique is employed (Fig. 3a) to ensure that the write pulses modify only the memristor in question and do not disturb the half-selected memristors (which, in this case, are all other memristors). As is evident from Fig. 3b (and Supplementary Fig. S6), during programming, the changes in the half-selected memristors in the crossbar circuit are insignificant owing to the strongly nonlinear switching dynamics. Figure 3c illustrates that all patterns are clearly separated for 10%- and 2%-accurate (on average; see Supplementary Table S2 and its discussion for further detail) weight imports. The programming accuracy β is defined as the relative difference between the desired and programmed (actual) conductance values, that is, β=|Gdesired−Gactual|/Gactual, and is roughly equivalent to having log2[ln(Gmax/Gmin)/ln(1+2β)] bits of precision for logarithmic quantization. However, even for the latter case, one pattern from the ‘T’ subset results in a positive differential output current and is thus misclassified (Supplementary Table S1). This misclassification is due to the slight asymmetry in the I–V curve that is not accounted for in the software-based precursor network. The saturation of classifier performance with weight import accuracy is consistent with software quantization experiments and confirms that a minimum of 3-bit weight precision is required to classify all patterns successfully. This requirement is equivalent to ~15% accuracy for memristive devices with Gmax/Gmin≈10. The shift of the histogram to the right for the 40%-accurate weight import experiment is due to the asymmetry in the switching dynamics, which leads to systematically larger conductance values during programming. A higher-precision weight import would still be required for more complex networks and/or to ensure more rapid convergence for the in situ training approach considered below. The actual requirement is still under debate44 and will depend on the particular ANN implementation and training method, but it is unlikely to exceed 6-bit accuracy based on simulation results45,46.
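The precision estimate can be checked with a few lines of code; the β values below simply reproduce the accuracies quoted above for a dynamic range Gmax/Gmin≈10.

```python
import math

# Quick check of the precision estimate in the text: a relative programming accuracy
# beta over a dynamic range Gmax/Gmin corresponds to roughly
# log2( log_(1+2*beta)(Gmax/Gmin) ) bits under logarithmic quantization.

def equivalent_bits(beta, g_ratio=10.0):
    levels = math.log(g_ratio) / math.log(1.0 + 2.0 * beta)   # distinguishable levels
    return math.log2(levels)

for beta in (0.40, 0.15, 0.10, 0.02):
    print(f"beta = {beta:4.0%}  ->  ~{equivalent_bits(beta):.1f} bits")
# beta = 15% gives ~3.1 bits, consistent with the 3-bit requirement quoted above.
```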

Figure 3: Pattern classification with ex situ training.

(a) An equivalent circuit schematically illustrating an example of the applied stimulus and memristor voltage biases for the import of weight G8. (b) An example of the evolution of memristor conductances in the crossbar circuit upon sequential programming with 10%-accurate weight import (see Supplementary Fig. S6 for 2% accuracy). For clarity, only the tuning of the G+ states is shown. (c) The measured current-difference histogram for the 50 input patterns (shown in Fig. 1d) with initially random values of G (top panel), which are then tuned with a certain accuracy. The solid lines are visual guides illustrating the spread and average value of the histogram.

In the in situ approach, the training process is implemented in hardware, and the conductances of all memristors are modified, upon processing each training pattern, according to the perceptron learning rule, which becomes ΔGi±=±αxi(p)(d(p)−y(p)) for the case of differential weight representation. When a particular pattern is applied to the perceptron network, the output y is obtained by computing sgn[I+−I−] and is then compared with the desired output d. According to the perceptron rule, if y≠d, then each weight must be either strengthened or weakened by some amount. Such strengthening or weakening is implemented by simultaneously applying four specific pulse sequences (Fig. 4b) to the inputs and outputs of the crossbar circuit. The circuitry that generates the proper pulse sequences is not critical for this demonstration and is thus not implemented in hardware. Instead, this part of the training algorithm is executed by software running on a parameter analyser. In particular, the pulse sequences PS(x=+1) and PS(x=−1) (Fig. 4b) are applied to the inputs of the crossbar according to the specific x values of a given pattern. Simultaneously, the pulse sequence PS+(d=+1) or PS+(d=−1) is applied to the positive (that is, top) output of the crossbar, and PS−(d=+1)=PS+(d=−1) or PS−(d=−1)=PS+(d=+1) to the negative (that is, bottom) output, according to the value of d (see the example given in Fig. 4a–c). Each pulse sequence consists of four pulses because the topology of the crossbar circuit requires the increasing and decreasing of the memristor conductances to be implemented in four steps rather than simultaneously. For the considered pulse sequences, the memristors whose conductance should be weakened in the positive (negative) wire of the crossbar circuit are modified in parallel during the first (second) pulse, whereas strengthening in the positive (negative) wire occurs during the fourth (third) pulse (Fig. 4b). Following the half-bias method, the magnitude of the pulses in each pulse sequence is either zero or Vtrain/2, which results in five possible voltage biases across the memristors, namely, zero, ±Vtrain/2 or ±Vtrain. Because Vtrain is specifically chosen such that Vtrain>Vswitch and Vtrain/2<Vswitch, weight modification occurs only for the latter two biases.
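The net effect of the four-step sequence on the conductance pairs can be sketched as follows; the fixed step size and the simplified sign bookkeeping are assumptions for illustration, not a reproduction of the measured waveforms.

```python
import numpy as np

# Sketch of the parallel update implied by the four-step pulse sequences in Fig. 4b
# (a simplified reading of the text, not the measured waveforms). For a pattern x with
# desired output d and actual output y, every G+ of an input with x_i*(d - y) > 0 is
# strengthened and the paired G- weakened, and vice versa; the half-bias scheme ensures
# only the intended devices see the full +/-V_train during each of the four steps.

def in_situ_update(G_plus, G_minus, x, d, y, dG=2e-5, G_min=5e-5, G_max=5e-4):
    if d == y:                                      # correct classification: no change
        return G_plus, G_minus
    direction = np.sign(np.asarray(x) * (d - y))    # +1: strengthen w_i, -1: weaken
    # conceptually: step 1 weakens selected G+, step 2 weakens selected G-,
    # step 3 strengthens selected G-, step 4 strengthens selected G+
    G_plus = np.clip(G_plus + direction * dG, G_min, G_max)
    G_minus = np.clip(G_minus - direction * dG, G_min, G_max)
    return G_plus, G_minus
```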

Figure 4: Pattern classification with in situ training.

(a) A schematic of the classifier circuit showing (b) an example of the applied stimulus and (c) the resulting voltages across four particular memristors upon training the specific pattern ‘X.’ (d) The evolution of G values and (e) the measured current difference histogram for 50 input patterns applied to the perceptron circuit with initially random values of memristor conductances (top panel), which are then modified upon training.

In general, the perceptron rule for in situ training cannot be followed precisely because of two issues. The first issue is the dispersion in the switching behaviour of the memristors (Supplementary Fig. S4), which effectively means that the learning rate constant α in the perceptron rule varies between devices. The weight modification is therefore step-like for memristors with lower switching voltage thresholds, and thus larger effective values of α (for example, weight G5 in Fig. 4d), in comparison with the smoother behaviour observed for devices with higher thresholds and smaller α (for example, weight G9 in Fig. 4d). Such behaviour is in qualitative agreement with software simulations (Supplementary Fig. S7). The second issue is that the rate of change in conductance depends not only on the voltage (that is, Vtrain) but also on the current state G of the memristor (Fig. 2b and Supplementary Fig. S4). In particular, the switching dynamics eventually saturate for a given Vtrain (Supplementary Fig. S4), which conveniently keeps G bounded; however, this saturation also means that Vtrain must be increased during training if the full dynamic range of G is required. Using an ad hoc method for choosing the appropriate Vtrain, the in situ method is demonstrated to achieve 100% classification performance after 17 epochs (10 epochs at Vtrain=0.9 V and seven epochs at Vtrain=1 V).
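These two non-idealities can be mimicked with a simple model such as the one below, in which the learning-rate spread and the saturating update are illustrative assumptions rather than fits to the measured devices.

```python
import numpy as np

# Illustrative model (assumed parameters, not fitted to the measured devices) of the
# two non-idealities above: a per-device effective learning rate alpha_i to mimic the
# threshold dispersion, and an update that shrinks as the conductance approaches its
# bound to mimic the saturating switching dynamics.

rng = np.random.default_rng(2)
G_MIN, G_MAX = 5e-5, 5e-4                                      # S
alpha = rng.lognormal(mean=np.log(2e-5), sigma=0.7, size=10)   # device-to-device spread

def nonideal_update(G, direction):
    """direction: array of +1 (strengthen) / -1 (weaken) / 0 (no change) per device."""
    headroom = np.where(direction > 0, G_MAX - G, G - G_MIN) / (G_MAX - G_MIN)
    return np.clip(G + direction * alpha * headroom, G_MIN, G_MAX)

G = np.full(10, 1e-4)
print(nonideal_update(G, np.ones(10)))   # devices with larger alpha move in bigger steps
```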

Discussion

In summary, a perceptron classifier implemented with a memristive crossbar circuit and trained using both in situ and ex situ methods has been experimentally demonstrated for the first time. Both approaches perform satisfactorily despite significant variations in the memristor switching behaviour. The ex situ approach is easier to implement, although the tradeoff is a potentially longer training time (for large network sizes) in comparison with the in situ method, which allows for a more efficient parallel synaptic weight update. Another advantage of in situ training is the ability to adapt to circuit variations and to support in situ reconfiguration (see, for example, Supplementary Fig. S8), which is partially supported by its superior classification performance in our experiments.

Although the considered circuit is simple and hardly practical by itself, the presented work provides a proof-of-concept demonstration for memristor-based ANNs. Extending this work to large-scale circuits with more complex training rules that are capable of solving practical problems, such as the CrossNets variety of hybrid ANN circuits10, convolutional neural networks45 and recurrent Hopfield networks46, appears to be relatively straightforward (Supplementary Fig. S9). Owing to the excellent scaling properties of memristors, the performance of memristor-based ANNs could be very impressive. For example, crude estimates indicate that an ANN implemented with 5-nm-scale memristors might be as dense as the mammalian cerebral cortex while operating three orders of magnitude faster at a practical power consumption density10. Achieving such performance would certainly require the development of crossbar-integrated memristors monolithically stacked on conventional silicon circuits43, which presents the next important goal for memristor-based ANNs.

Methods

Device structure and fabrication

The memristor consists of a TiO2 blanket layer sandwiched between two 1-μm-wide metallic wires that serve as the top and bottom electrodes. Supplementary Fig. S3a schematically illustrates the crosspoint region between the two wires. The bottom electrodes are patterned using conventional optical lithography on SiO2/Si (200 nm/500 μm) substrates. An anchoring layer of Ti (5 nm) and a Pt layer (25 nm) are deposited by e-beam evaporation before lift-off of the patterned resist47. Then, a 30-nm TiO2 switching layer is fabricated by atomic layer deposition at 200 °C using titanium isopropoxide (C12H28O4Ti) and water as the precursor and reactant, respectively. Before the top electrode (TE) is deposited, a nanohole measuring 40 nm in diameter and 15 nm in depth is patterned into the TiO2 blanket layer by a combination of e-beam lithography and dry etching. The unexposed resist is used as a protective layer during dry etching, while the exposed TiO2 region is etched with CHF3. After the residual resist and any possible polymer contamination in the nanohole are removed using an O2 plasma treatment, a TE of Pt/Au (15 nm/25 nm) is patterned by optical lithography and evaporated on top of the TiO2 blanket layer. The nanoscale protrusion is then obtained by filling the nanohole with Pt during the TE deposition process. The protrusion results in a significant yield improvement (Supplementary Fig. S3c). Finally, multiple individual memristors are wire-bonded via gold pads to a standard chip package.

Switching dynamics characterization

To implement analogue state tuning of the memristive devices, a specific characterization protocol based on the original work in ref. 48 was developed for the SET and RESET switching transitions. As the I–V is nonlinear for TiO2−x devices, the state G of a memristive device is defined as the conductance measured at V=200 mV. The SET and RESET transitions are then described by the change of state of the memristive device after the application of fixed-duration, fixed-amplitude voltage pulses. As the change of state also depends on the initial state of the memristor, it is convenient to characterize the SET and RESET transitions starting from precisely defined initial states. In particular, the characterization protocol is completed in two phases: (i) the device is first tuned to some initial state using a high-precision feedback algorithm, which is described in ref. 39 (see also Supplementary Fig. S5 and its discussion); specifically, three initial states, namely, 50, 150 and 300 μS, are investigated; (ii) starting from these initial states, the change in conductance induced by the application of a sequence of ten write pulses of fixed amplitude and duration is measured. Each write pulse is followed by a read pulse of 200 mV in amplitude and 1 ms in duration to examine the current state of the device. These two phases are repeated for various initial states and write pulse amplitudes.
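The structure of this protocol is sketched below; the helper functions are trivial stand-ins for the actual instrument-control routines (the tuning algorithm of ref. 39, pulse generation and current readout), and only the loop structure follows the description above.

```python
# Sketch of the two-phase SET characterization protocol. The helpers are toy stand-ins
# for the instrument-control routines; only the loop structure follows the text.

INITIAL_STATES = (50e-6, 150e-6, 300e-6)                     # target states, in S
SET_AMPLITUDES = (-0.7, -0.8, -0.9, -1.0, -1.1, -1.2, -1.3)  # V, 200-ns pulses

def tune_to_state(device, g0):             # stand-in for the feedback tuning algorithm
    device["G"] = g0

def apply_write_pulse(device, v, width):   # stand-in for the write pulse; toy physics
    device["G"] *= 1.05 if v < 0 else 0.95

def read_conductance(device):              # stand-in for the 200-mV, 1-ms read pulse
    return device["G"]

def characterize_set(device):
    results = []
    for g0 in INITIAL_STATES:
        for v in SET_AMPLITUDES:
            tune_to_state(device, g0)                    # phase (i)
            trace = []
            for _ in range(10):                          # phase (ii): ten write pulses
                apply_write_pulse(device, v, width=200e-9)
                g = read_conductance(device)
                trace.append(g)
                if g > 350e-6:                           # stop to avoid over-stressing
                    break
            results.append((g0, v, trace))
    return results

print(len(characterize_set({"G": 150e-6})))   # 3 initial states x 7 amplitudes = 21 runs
```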

The SET transition is characterized by the application of ten 200-ns-long negative pulses with amplitudes ranging from −0.7 to −1.3 V, whereas the RESET transition is obtained with 1-μs-long positive voltage pulses with amplitudes ranging from 0.7 to 1.2 V. The difference in duration between the SET and RESET pulses was chosen experimentally to obtain a comparable change in state for both switching transitions after the application of ten pulses. In addition, at all times, the switching is constrained to a limited dynamic range to avoid over-stressing, because driving the device to very conductive or very resistive states can induce endurance issues or permanent failure. More specifically, if G becomes larger than 350 μS (lower than 25 μS) during the SET (RESET) transition, the measurement is stopped and the next sequence is performed.

Supplementary Fig. S4 presents the RESET (panel a) and SET (panel b) characterization steps for an initial state of 150 μS. For the largest voltages, only partial SET and RESET transitions are obtained for some devices owing to the constraint on the maximum (minimum) conductance state. The cumulative time corresponds to the total stress duration accumulated over the pulse sequence. Equivalent characterizations are obtained when the initial state is set to 50 and 300 μS. A restricted analysis of these measurements is presented in Fig. 2b of the main text. In this figure, the analysis is limited to the effect of the very first pulse of the SET and RESET transitions, starting from the three different initial states. For all devices, when the amplitude of the voltage pulses is in the range |V|<Vswitch≈0.7 V, the changes to the state are negligible. Such a ‘safe’ operating voltage range is conveniently larger than that of our earlier reported TiO2−x devices39 owing to the effect of the protrusion.

Experimental measurement setup

The experimental setup is composed of an Agilent B1500 semiconductor parameter analyser with the B1530 option (Supplementary Fig. S1). The B1530 is an arbitrary waveform generator and fast current measurement unit that allows for the application of pulse signals as short as 100 ns and for precise current measurement with a 50-ns sampling interval. The waveform generator and fast current measurement units are connected to the inputs of an E5250A switching matrix. All equipment in the setup is controlled by the Agilent EasyEXPERT software, which is used to implement the different write protocols presented in this paper.

The switching matrix is connected to the memristive crossbar, which is obtained by wiring 20 individual crosspoint devices in a configuration corresponding to a crossbar architecture (that is, the top electrodes are shared by devices belonging to the same row and the bottom electrodes by devices belonging to the same column). Before wiring, each device is formed by the application of a voltage sweep between 0 and 10 V (with a forming voltage of ~6 V), with an active current compliance of 100 μA set by a 2N2369A transistor in series. Twenty devices are then wired into a 10 × 2 crossbar circuit. The crossbar circuit is not modified in any of the measurements presented in this paper.

In the ex situ training method, each weight is imported sequentially by applying the developed algorithm (Supplementary Fig. S5). Supplementary Fig. S2a,b provide a schematic of the ex situ training protocol. (For clarity, the arbitrary waveform generator and current measurement unit are separated in these figures. In reality, these operations are performed simultaneously by the B1530 unit.)

The write pulse amplitude Vwrite in the tuning algorithm under consideration is always applied using the half-select technique, that is, by applying +Vwrite/2 and –Vwrite/2 to the corresponding row and column, respectively, of the crossbar. Simultaneously, all other rows and columns are kept grounded. With this scheme, all half-selected devices are biased with the voltage ±Vwrite/2. During the application of the read pulse, 200 mV is applied to the crossbar column line and the current is sensed from the corresponding row line. All other rows and columns are kept grounded. In this situation, sneak currents are avoided, and the sensed current is an accurate measure of the device’s state under training.
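A minimal write-verify loop along these lines is sketched below; the helper functions are toy stand-ins for the half-selected write pulse and the grounded-line read, and the voltage-ramp details are assumptions for illustration rather than the actual algorithm of ref. 39.

```python
# Minimal write-verify loop sketching the sequential weight import. The tuning
# algorithm of ref. 39 is more elaborate; the two helpers below are toy stand-ins for
# the half-selected write pulse and the grounded-line read described above.

def read_state(device):                                   # 200-mV read pulse
    return device["G"]

def apply_half_selected_write(device, v_write):           # +/-V_write/2 on row/column
    device["G"] *= 1.08 if v_write < 0 else 0.93          # toy SET/RESET response

def tune_device(device, g_target, beta=0.10, v_start=0.8, v_step=0.02, max_pulses=200):
    """Alternate write and read pulses until the state is within +/-beta of g_target."""
    v_write = v_start
    for _ in range(max_pulses):
        g = read_state(device)
        error = (g - g_target) / g_target
        if abs(error) <= beta:
            return g                                       # converged within import accuracy
        polarity = -1.0 if error < 0 else +1.0             # SET (negative) or RESET (positive)
        apply_half_selected_write(device, polarity * v_write)
        v_write = min(v_write + v_step, 1.3)               # gently ramp the write amplitude
    return read_state(device)

print(tune_device({"G": 100e-6}, g_target=300e-6))
```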

In situ training is realized by applying the signals described in Fig. 4 of the main text. The specific pulse sequences corresponding to inputs x=±1 are applied to the columns of the crossbar. Simultaneously, the crossbar rows receive a specific pulse sequence corresponding to d=+1, if the pattern belongs to class X, and to d=−1, if the pattern belongs to class T. The example in Supplementary Fig. S10a corresponds to the presentation of a specific pattern, ‘X.’

For both ex situ and in situ training experiments, the testing set of 50 patterns is measured as illustrated in Supplementary Fig. S10b. The input lines receive a 1-ms-long positive or negative 200-mV read pulse depending on the pattern presented at the input of the pattern classifier circuit. A black pixel corresponds to +200 mV, and a white pixel to −200 mV. The output lines of the crossbar are connected to two ammeters, which sense the currents I+ and I−. The final classification is performed by post-processing I+−I−.

Additional information

How to cite this article: Alibart, F. et al. Pattern classification by memristive crossbar circuits using ex situ and in situ training. Nat. Commun. 4:2072 doi: 10.1038/ncomms3072 (2013).