Introduction

Artificial intelligence, namely the ability to reproduce brain-like reasoning in a silicon chip, has been the objective of scientific research for the last 60 years1. Computers able to learn from sensory stimuli from the external world, to infer abstract concepts and to make decisions will spur the next technological revolution, reshaping all aspects of our life and society. Recently, neural networks empowered with deep learning algorithms have shown the capability of playing games2,3, providing accurate translation of sentences4, and passing visual Turing tests5. These achievements were all demonstrated via software implementations on high-performance digital computers with conventional complementary metal-oxide-semiconductor (CMOS) technology. However, upscaling of these software approaches is frustrated by the von Neumann architecture of conventional computing machines, where the processor and memory units are physically separate, resulting in large area, long latency, and multichip system complexity. Also, there are fundamental power-density constraints affecting Moore’s law in the medium to long term which prevent future scaling of von Neumann computers to the complexity level required to emulate the brain6. Increasing research efforts are thus being directed at developing neural-network accelerators with massive parallelism, low power consumption, and a non-von Neumann, computing-in-memory architecture suitable for performing brain-like tasks. For instance, a CMOS-based neuromorphic multi-core processor with one million neurons and 256 million synapses showed a reduction of power consumption by a factor of 10⁴ with respect to the conventional CMOS architecture7. Low-power operation was also demonstrated in analog circuits with leaky integrate-and-fire (LIF) neurons and silicon synapses capable of spike-based visual pattern learning8 and solving complex constraint-satisfaction problems9. All these neuromorphic implementations rely on silicon CMOS synapses, which are inherently volatile, binary, and poorly scalable. In fact, a CMOS-based static random access memory (SRAM) cell occupies a relatively large area of more than 100 F², where F is the lithographic feature size of the technology10. The logic state in the SRAM can be either 0 or 1, and is immediately lost upon turning off the power supply. A truly bio-realistic technology for neuromorphic systems requires a change of paradigm toward nonvolatile, multilevel, and scalable synapses consistent with the ultra-high density of connections (about 10⁴ synapses per neuron on average) in the human cortex11. In addition, the artificial synapses should display brain-inspired time-dependent weight update, such as spike-timing dependent plasticity (STDP)12,13, which is an essential feature of event-driven learning in biological neural networks.

Resistive/memristive devices, whose resistance changes in response to the application of an electrical stimulus, represent an ideal solution for electronic synapses in future neuromorphic systems14,15. At least 3 main categories of memristive devices have been described with reference to synaptic applications, namely resistive switching memory (RRAM) devices16, phase change memory (PCM) devices17, and magneto-resistive memory (MRAM) devices18. All types of memristive devices share the multilevel capability of changing their conductance to any arbitrary value within a possible range. The conductance is dictated by a nanoscale material modification, e.g., a structural phase distribution in PCM19 or a magnetic domain orientation in MRAM20, thus the multivalued conductance state can be retained even without any power supply. In addition, memristive devices show outstanding area efficiency thanks to their 2-terminal structure, which allows a minimum device size in the range of only a few square nanometers21, and stacking capability thanks to 3D integration22,23. Due to these beneficial properties, memristive devices have attracted strong interest as artificial electronic synapses in the last decade. In particular, the ability to update the synaptic weight by STDP has been verified in stand-alone synapses, such as RRAM24,25,26 and PCM27,28. Visual pattern training and recognition have been demonstrated by simulations of neuromorphic networks with memristive synapses28,29,30,31. Neuromorphic circuits with memristive synaptic arrays were experimentally evaluated by using recurrent Hopfield networks32,33,34 and perceptron networks, showing pattern classification35 and supervised weight update via backpropagation36 or winner-take-all algorithms37. Bio-inspired unsupervised learning was only demonstrated in simulations31 or with a mixed set of hardware and software synapses38. All attempts were aimed at learning static patterns with a limited number of pixels, although time evolution is an essential characteristic of sensory information and enables object tracking in brain-inspired machine vision39,40. In this work, we demonstrate unsupervised learning of a static pattern and adaptation to a dynamic pattern within a perceptron-like network of memristive synapses where the weights are updated via local STDP26,28,31. Functional networks with up to 2 post-synaptic neurons are shown, supporting parallel neuromorphic computing and enabling future vision machines such as artificial retinas.

Results

Synaptic STDP characteristics

Figure 1a shows the individual building block at the basis of any neural network, namely a synapse connecting a pre-synaptic neuron (PRE) and a post-synaptic neuron (POST). The synapse is responsible for the learning function in a neural network, since the synaptic weight dictates the amount of signal that effectively reaches the POST upon PRE spiking. In our artificial neural network, the POST is represented by a LIF circuit while the synapse consists of a hybrid one-transistor/one-resistor (1T1R) structure26,28,31, as illustrated in the conceptual scheme of Fig. 1b. In this artificial synapse, the resistor is an RRAM device with a 10-nm thick switching layer of HfO2 (ref.41 and Fig. S1 of the Supplementary Information). As shown in Fig. 1c, the application of a positive voltage causes a transition to the low resistance state (LRS), called the set process, as a result of the formation of a conductive filament (CF) containing oxygen vacancies between the 2 electrodes. The field-effect transistor (FET) in the 1T1R structure limits the maximum current to a compliance current IC during the set transition, thus providing control of the CF conductivity and avoiding irreversible breakdown42. The application of a negative voltage causes the retraction of the CF and the consequent transition to the high resistance state (HRS), called the reset process.

Figure 1

Synaptic device and characteristics. (a) Schematic structure of biological PRE, POST and synaptic connection between axon terminal and dendrite. (b) Schematic structure of the hardware PRE-synapse-POST unit: a PRE controls the FET gate of a 1T1R RRAM, while the POST receives the input current from the synaptic source and controls the synapse top electrode to induce the synaptic current and stimulate potentiation/depression during the fire event. (c) Measured I–V curve for a 1T1R RRAM synapse, showing set and reset transitions at positive and negative voltages, respectively, due to the bipolar operation of the HfO2 RRAM. (d) Schematic voltage traces of the PRE spike (top) and POST fire (bottom) signals for the case of potentiation (0 < Δt < 10 ms). (e) Experimental STDP characteristics, namely the measured conductance change η = R0/R as a function of the time delay between the PRE and POST spikes, for various initial synapse resistance values R0. Data indicate depression (η < 1) for −10 ms < Δt < 0 and potentiation (η > 1) for 0 < Δt < 10 ms. An initial LRS involves only depression, while an initial HRS shows only potentiation.

The 1T1R synapse allows spike communication and STDP as detailed in Fig. 1b: when a PRE spike is applied to the gate terminal of the transistor, a positive current flows into the input terminal of the POST due to a positive static voltage VTE at the top electrode, and is then integrated by the integrating stage of the LIF neuron. The result of the current integration is stored as an internal potential Vint (Fig. S2): as Vint exceeds a certain threshold Vth, the neuron generates a forward spike, delivered to the next neuron, and a feedback spike, consisting of a sequence of positive and negative pulses, which is back-propagated to the synapse to allow for STDP43. As illustrated in Fig. 1d, if the POST-spike event follows the PRE-spike event, i.e., if the spike delay Δt = tPOST − tPRE is positive, then the transistor is enabled by the PRE spike during the positive pulse of the POST, which results in a set transition, or synaptic potentiation. On the other hand, if Δt < 0, then the transistor is enabled during the negative pulse of the POST, thus causing a reset transition, or synaptic depression. Synaptic potentiation and depression controlled by the spike timing delay Δt result in STDP, which was experimentally demonstrated by applying independent voltage pulses to the transistor gate and the top electrode of the synapse of Fig. 1b with variable timing delay Δt. After the application of the voltage spikes, the resistance R of the 1T1R synapse was measured, allowing the conductance change η to be determined as the ratio of the initial resistance R0 to R, namely η = R0/R. Figure 1e shows the measured η as a function of Δt and of the initial RRAM state R0. Potentiation (η > 1) occurs for 0 < Δt < 10 ms, except for relatively low R0, comparable to the target LRS resistance dictated by the gate voltage amplitude VG. On the other hand, depression (η < 1) takes place at −10 ms < Δt < 0, except for relatively high R0, comparable to the HRS resistance dictated by the top-electrode voltage VTE (ref.31). When the delay time is larger than the gate pulse width, namely for |Δt| > 10 ms in this experiment, there is no overlap between pulses, thus the RRAM conductance is left unchanged. The time- and state-dependent plasticity in Fig. 1e is consistent with multiplicative STDP, which is at the basis of self-adaptation44. The STDP response of the 1T1R synapse was also simulated by a physics-based analytical model for RRAM, showing good agreement with the experimental characteristics of Fig. 1e (see Fig. S3) and further supporting the STDP functionality of the 1T1R artificial synapse.
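The state- and time-dependent rule of Fig. 1e can be summarized in a few lines of code. The following Python sketch encodes only the qualitative behavior described above (set to the LRS for 0 < Δt < 10 ms, reset to the HRS for −10 ms < Δt < 0, no change otherwise); the resistance levels are illustrative placeholders, not measured device parameters.

```python
# Minimal sketch of the binary, state-dependent STDP rule described in the text.
# The 10 ms overlap window follows the experiment; the LRS/HRS targets are
# illustrative placeholders rather than measured device values.

R_LRS = 1e4      # assumed low-resistance state, ohm (illustrative)
R_HRS = 1e5      # assumed high-resistance state, ohm (illustrative)
T_PULSE = 10e-3  # PRE gate pulse width, s (10 ms, as in Fig. 1d)

def stdp_update(r0, dt):
    """Return the synapse resistance after one PRE/POST pair with delay dt = t_POST - t_PRE."""
    if abs(dt) > T_PULSE:          # no overlap between PRE and POST pulses: no change
        return r0
    if dt > 0:                     # POST fire follows PRE spike: set transition (potentiation)
        return min(r0, R_LRS)
    return max(r0, R_HRS)          # POST fire precedes PRE spike: reset transition (depression)

# Conductance change eta = R0 / R, as plotted in Fig. 1e
r0 = R_HRS
r1 = stdp_update(r0, 3e-3)         # positive delay -> potentiation
print(r0 / r1)                     # eta > 1
```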

Pattern learning in a neural network

Learning of a visual pattern was experimentally demonstrated using the 2-layer perceptron network in Fig. 2a. The perceptron includes a first layer of 4 × 4 = 16 PREs, representing simplified retina neurons spiking in response to visual stimuli, and a single POST responsible for recognition and classification. Each PRE is connected to the POST by an artificial hybrid synapse capable of STDP. The neural network is operated in 2 phases: the first phase consists of training the network by stochastically submitting a visual pattern to the PREs to induce proper synaptic potentiation/depression by STDP, while the second phase consists of the recognition of patterns, where various patterns are submitted to the network to test the quality of learning. Learning is considered successful if the POST fires only in response to the same pattern used during training, whereas other patterns do not induce any fire, i.e., there are no false positives. Training relies on STDP occurring at any individual synapse in response to PRE stimulation and consequent POST fire events. STDP is usually disabled during recognition to avoid unwanted learning of false patterns.

Figure 2

Synaptic network and operation. (a) Schematic illustration of the perceptron-like synaptic network including 16 PREs (first layer) and 1 POST (second layer), with each PRE connected to the POST by one of the 16 synapses. (b) Set/reset characteristics, namely synaptic R measured after the application of a 1 ms-long pulse, as a function of the pulse voltage. Set and reset characteristics were collected after preparing the synapse in the HRS and LRS, respectively. (c) Measured gate voltage VG and top electrode voltage VTE, indicating the PRE spike and the POST backward spike, respectively, and (d) measured R before and after each pair of pulses, indicating potentiation for 0 < Δt < 10 ms and depression for −10 ms < Δt < 0. (e) Correlation plot showing the resistance R(ti+1) measured after the fire event as a function of R(ti) measured before the fire event. Potentiation, depression and no change are evidenced for 0 < Δt < 10 ms, −10 ms < Δt < 0, and |Δt| > 10 ms, respectively.

The perceptron network was physically implemented by connecting PRE/POST neurons and synapses on a printed circuit board (PCB, see Fig. S4). To find the most appropriate voltages of the POST spike to induce potentiation or depression, pulses with increasing voltage and 1 ms width were applied and the resulting resistance change was collected. Figure 2b shows the measured R as a function of the absolute value of the pulse voltage |VTE|, indicating that the RRAM synapse completes the transition from high to low resistance at VTE = 1 V, and from low to high resistance at VTE = −1.5 V. In view of these set/reset characteristics and to take into account possible fluctuations of the set voltage Vset due to statistical variations of HRS45, the POST spike included a positive pulse of 2 V and a negative pulse of −1.6 V. Figure 2c shows examples of PRE and POST voltage spikes with a positive delay Δt = 3 ms, causing synaptic potentiation (Fig. 2d), followed by another pair of PRE and POST spikes with a negative delay Δt = −7 ms, causing synaptic depression. Figure 2e summarizes the effects of STDP by showing the correlation of resistance R(ti+1) measured after a spike as a function of R(ti) before the spike for cases of potentiation (0 < Δt < 10 ms), depression (−10 ms < Δt < 0), and no overlap between the PRE and POST spikes (|Δt| > 10 ms). The resistance decreases [R(ti+1) < R(ti)] for potentiation events and increases [R(ti+1) > R(ti)] for depression events, while all other cases show no change in the synaptic resistance [R(ti+1) ≈ R(ti)]. Note that the resistance after a single STDP event is either equal to the LRS or the HRS level, thus evidencing binary set/reset operations in the STDP characteristics.

After verifying STDP at the level of a single synapse, we tested learning of predefined images of 4 × 4 pixels. The synaptic network was first trained with a first image, namely the diagonal pattern #1 in Fig. 3a, to test the learning of a static image; patterns #2 (Fig. 3b) and #3 (Fig. 3c) were then subsequently submitted to demonstrate dynamic learning. A stochastic training approach was adopted, where the PRE spikes alternately present either the image or a random noise pattern (e.g., see Fig. 3d) in which only 3% of the pixels on average are randomly activated31. Either image or noise was submitted at each epoch, consisting of an individual time slot of 10 ms width, with the probabilities of presenting pattern and noise both set to 50%. The synaptic weights were initially prepared in a high resistance state, as indicated in Fig. 3e. The threshold voltage was set to Vth = 0.72 V.
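The stochastic presentation protocol can be sketched as follows; the pattern layouts, random seed and helper names are illustrative stand-ins, while the 50% pattern probability, the ~3% noise density and the 300/300/400-epoch schedule follow the experiment.

```python
import numpy as np

# Sketch of the stochastic stimulus protocol: at each 10 ms epoch, either the current
# training pattern (50% probability) or a sparse random noise frame (~3% of pixels
# active on average) is presented to the 16 PREs. Pattern layouts are illustrative
# stand-ins for patterns #1-#3 of Fig. 3a-c.

rng = np.random.default_rng(0)
NOISE_DENSITY = 0.03                                    # average fraction of noise pixels

PATTERN_1 = np.eye(4, dtype=bool).flatten()             # diagonal (stand-in for pattern #1)
PATTERN_2 = np.fliplr(np.eye(4, dtype=bool)).flatten()  # anti-diagonal (stand-in for pattern #2)
PATTERN_3 = np.zeros(16, dtype=bool)
PATTERN_3[0:4] = True                                   # top row (stand-in for pattern #3)

def epoch_stimulus(pattern, p_pattern=0.5):
    """Boolean PRE activation vector (16 pixels) for one 10 ms epoch."""
    if rng.random() < p_pattern:
        return pattern.copy()
    return rng.random(pattern.size) < NOISE_DENSITY     # sparse random noise frame

# Training schedule of Fig. 3i: pattern #1 for 300 epochs, #2 for 300, #3 for 400
schedule = [PATTERN_1] * 300 + [PATTERN_2] * 300 + [PATTERN_3] * 400
stimuli = [epoch_stimulus(p) for p in schedule]
```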

Figure 3

Static and dynamic learning within a 1-POST network. (a,b,c,d) Illustration of pattern #1, pattern #2, pattern #3, and a typical random noise image submitted to the PREs during learning. (e) Initial configuration of synaptic weights, where all RRAM devices were prepared in the HRS. (f) Configuration of synaptic weights after 300 epochs (3 s), indicating adherence to pattern #1, which was stochastically submitted during the first 300 epochs. (g) Configuration of synaptic weights after 600 epochs (6 s), indicating adherence to pattern #2, which was stochastically submitted during the previous 300 epochs. (h) Configuration of synaptic weights after 1000 epochs (10 s), indicating adherence to pattern #3, which was stochastically submitted during the previous 400 epochs. (i) Address of the PRE stimulation as a function of time, indicating the submission of the 3 patterns alternated with random noise. (j) Measured synaptic weights 1/R as a function of time: pattern weights (red) and background weights (blue) tend to high and low conductance, respectively, thus demonstrating pattern learning.

During the first 300 epochs of training with pattern #1, the image was readily learnt because STDP causes potentiation of the pattern synapses and depression of the background synapses. Static learning of pattern #1 is evidenced in Fig. 3f, where all pattern synapses show LRS conductance, while background synapses show HRS conductance. As the submitted image is changed from pattern #1 to pattern #2, learning of pattern #2 is demonstrated, as evidenced by the final synaptic weights after 600 epochs in Fig. 3g. This supports ‘dynamic’ learning, i.e., the adaptation of the synaptic weights to the presented image in real time by our neuromorphic system. Similarly, pattern #3 is learnt during the third training phase between epoch 600 and epoch 1000, as shown by the final synaptic weights in Fig. 3h. Figure 3i shows the PRE spikes as a function of epochs, indicating the 3 sequential training phases, while Fig. 3j shows the corresponding time evolution of the synaptic weights 1/R for synapses stimulated by the pattern, referred to as pattern synapses in the following (red), and synapses located outside the pattern, referred to as background synapses in the following (blue). A movie showing the evolution of the synaptic weights in a color plot similar to Fig. 3a–h is available as Supplementary Movie 1.

Pattern and background weights show STDP-induced potentiation and depression, respectively, in each of the 3 training stages. Selective synapse potentiation/depression can be understood as follows: as the pattern is submitted at epoch i, Vint increases significantly, thus potentially inducing a fire event. This causes STDP with Δt > 0, hence potentiation (Fig. 2c–e). On the other hand, if a noise pattern is submitted at epoch i + 1, after the fire, then STDP with Δt < 0 takes place, thus causing depression of the corresponding synapses. As a result, selective potentiation takes place at pattern synapses, while unselective depression takes place throughout the whole synaptic network. By properly adjusting the noise percentage within a range between 2% and 7% of randomly activated pixels, stable learning can be achieved. A larger percentage of noise would cause fire in response to the submission of noise, which induces potentiation of random synapses and depression of pattern synapses, and is thus to be avoided. Static learning similar to that of the first 300 epochs in Fig. 3 can be demonstrated irrespective of the initial configuration of synaptic weights, which can be prepared in either HRS (Fig. 3), LRS (Fig. S5), or random states (Fig. S6). The independence from the initial configuration of weights is due to STDP inducing selective potentiation and unselective depression of synapses, and is essential for the dynamic learning in Fig. 3, where a new image must overwrite the previous one by potentiating weak synapses and depressing strong synapses where needed.
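The selective-potentiation/unselective-depression mechanism described above can be condensed into the following Python sketch. It encodes only the ordering of events (synapses active at the fire epoch are potentiated, synapses active at the epoch right after a fire are depressed); the resistance levels, the leak-free integration and the fire threshold, set here to the charge injected by roughly 2 full-LRS synapses in one epoch, are simplifying assumptions not calibrated to the hardware, and the stimulus generator from the previous sketch is restated for self-containment.

```python
import numpy as np

# Logic sketch of the unsupervised learning loop for the 1-POST network:
# depression of the synapses active right after a fire (dt < 0), leaky-free LIF
# integration, fire check, and potentiation of the synapses active at the fire
# epoch (dt > 0). All numerical values are illustrative assumptions.

rng = np.random.default_rng(0)
R_LRS, R_HRS = 1e4, 1e5                    # assumed LRS/HRS resistances, ohm
V_TE, T_EPOCH = 0.2, 10e-3                 # read voltage (V) and epoch duration (s)
V_TH = V_TE * T_EPOCH * 2 / R_LRS          # assumed threshold: ~2 full-LRS synapses in one epoch
PATTERN = np.eye(4, dtype=bool).flatten()  # stand-in for the diagonal pattern #1

def stimulus(p_pattern=0.5, noise=0.03):
    """Pattern with 50% probability, otherwise a ~3%-dense random noise frame."""
    return PATTERN.copy() if rng.random() < p_pattern else rng.random(16) < noise

def process_epoch(stim, R, v_int, fired_prev):
    """One epoch: depression after a fire, LIF integration (no leak), fire check, potentiation."""
    if fired_prev:                                   # PRE spikes right after a fire see dt < 0
        R[stim] = np.maximum(R[stim], R_HRS)         # depression (unselective over time)
    v_int += V_TE * T_EPOCH * np.sum(stim / R)       # only activated pixels conduct (1T1R)
    fired = v_int >= V_TH
    if fired:                                        # PRE spikes at the fire epoch see dt > 0
        R[stim] = np.minimum(R[stim], R_LRS)         # potentiation (selective on the pattern)
        v_int = 0.0
    return v_int, fired

R, v_int, fired = np.full(16, R_HRS), 0.0, False     # all synapses start in HRS (Fig. 3e)
for _ in range(300):
    v_int, fired = process_epoch(stimulus(), R, v_int, fired)
```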

While neuromorphic systems are generally expected to operate on the same timescale (10 to 100 ms) as their biological counterparts, e.g., to enable gesture and speech recognition, our RRAM devices are also capable of much faster learning and recognition via STDP. High-speed learning can be achieved by using the same hardware operated with 100 times shorter pulses, i.e., PRE spikes of 100 μs width and POST positive/negative pulses of 10 μs width (Fig. S7). A higher feedback voltage VTE+ = 3.3 V was used to enable the set transition on the 10 μs timescale. Given the proportionality between energy and time, accelerated STDP can also be used to reduce the energy consumption during learning28. The time flexibility of RRAM devices thus makes it possible to match various time/energy requirements depending on the specific application scenario.

STDP in our approach is implemented as a deterministic binary plasticity rule, i.e., a positive delay results in a full set transition to the LRS, while a negative spike delay causes a full reset transition to the HRS. This is also dictated by the binary switching characteristics of our device in Fig. 1c, where both set and reset transitions appear quite abruptly as the voltage exceeds the set or reset threshold. However, for certain applications, analog weight variation may be useful, e.g., vision does not only imply recognition of object shapes, but also of textures and colors, which can be represented by analog weights. Analog STDP with inherently digital RRAM devices was previously obtained by probabilistic potentiation/depression, where the application of a voltage close to the threshold results in set/reset only in a random subset of cases46. Here, we adopted a different approach to achieve analog weight potentiation. To demonstrate learning of gray-scale images, we represented different gray tones through variable PRE spike voltage amplitudes (Fig. S8 and Supplementary Movie 2). An increasing value of VG results in an increasing transistor current IC during the set operation, which controls the LRS resistance25,26. As a result, synapses stimulated by a light gray intensity (high VG) are potentiated to a high conductance, while a dark gray intensity yields a low conductance. Similarly, color-scale images can be represented by multiple synapses per pixel, where each synapse represents the intensity of a color component, e.g., adopting an RGB representation43.
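A hypothetical mapping from gray level to programmed conductance is sketched below. The linear dependence of the compliance current on the gate overdrive, the relation R_LRS ≈ V_C/I_C and all numerical values are assumptions introduced only for illustration; the actual dependence is device-specific (refs 25, 26).

```python
# Hypothetical mapping from gray level to programmed LRS conductance, illustrating
# the analog-potentiation scheme described in the text. All parameters are assumed
# values for illustration, not measured device characteristics.

V_G_MIN, V_G_MAX = 0.9, 1.5      # assumed gate-voltage range spanned by the PRE spike, V

def gray_to_conductance(gray, v_t=0.7, k=1e-3, v_c=0.4):
    """
    Map a gray level in [0, 1] to the programmed LRS conductance (S).

    Assumptions (illustrative only): the PRE gate voltage scales linearly with the
    gray level, the FET sets a compliance current I_C ~ k*(V_G - v_t), and the LRS
    resistance scales roughly as R_LRS ~ v_c / I_C.
    """
    v_g = V_G_MIN + gray * (V_G_MAX - V_G_MIN)   # brighter pixel -> larger gate voltage
    i_c = k * max(v_g - v_t, 0.0)                # compliance current set by the FET
    return i_c / v_c                             # conductance 1/R_LRS ~ I_C / V_C

# Light gray (high V_G) -> high conductance; dark gray -> low conductance
print(gray_to_conductance(0.9), gray_to_conductance(0.1))
```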

Image recognition

The second key function of a perceptron network is pattern recognition, that is, the capability to discriminate the pattern previously submitted during the learning phase from other patterns. In the recognition phase, an image is presented to the network while monitoring the response of the POST. The POST should fire in response to an image which is similar (or identical) to the one submitted during training, i.e., the training pattern. In addition, recognition should result in no false positives, namely, the POST should not fire in response to patterns which are significantly different from the training pattern. To test the recognition capability, we statically trained our network with the training pattern of Fig. 4a, resulting in the final synaptic weights of Fig. 4b after 300 epochs. Then we submitted a sequence of all 1820 test patterns (see, e.g., Fig. 4c) with 4 activated pixels out of 16, i.e., the same number of activated pixels as in the training pattern. After submitting each test pattern, we checked for a possible POST fire event and discharged the internal potential Vint for a new test. Figure 4d shows the cumulative distribution of the 1820 calculated values of Vint, obtained after integrating the total current Ipost given by:

$$I_{post}=V_{TE}\sum_{n=1}^{16}R_{n}^{-1}$$
(1)

where Rn is the resistance of the n-th synapse. Note that each synapse consists of a 1T1R structure, thus Rn includes contributions from both the transistor, which is conductive only for the activated pixels of the test pattern, and the memristor, which is conductive (LRS) only within the pattern that was submitted in the training phase. The distribution shows five sub-distributions, corresponding to patterns sharing no pixels with the training pattern (Vint ≈ 0), and patterns sharing 1, 2, 3 or 4 pixels with the training pattern, with increasing values of Vint. In this recognition experiment, the threshold voltage Vrec for fire was set to 1.7 V, which led to a fire event only upon presentation of the training pattern, i.e., no false positives were recorded. These results support the pattern recognition capability of our synaptic network.
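The recognition test can be reproduced schematically as follows: all C(16, 4) = 1820 test patterns are enumerated, Vint is computed from Eq. (1) restricted to the activated pixels (the series FET blocks the others), and a threshold placed above the 3-shared-pixel sub-distribution fires only on the trained pattern. Resistance values and the threshold are in arbitrary, illustrative units rather than the measured 1.7 V.

```python
from itertools import combinations
import numpy as np

# Sketch of the recognition phase: enumerate all 1820 patterns with 4 active pixels,
# compute Vint from Eq. (1) over the activated pixels only, and check for false positives.
# Resistances, read voltage and threshold are illustrative, not hardware values.

R_LRS, R_HRS, V_TE = 1e4, 1e5, 0.2
TRAINED = {0, 5, 10, 15}                              # pixels of the trained diagonal pattern

R = np.where(np.isin(np.arange(16), list(TRAINED)), R_LRS, R_HRS)  # weights after training

def v_int(active_pixels):
    """Internal potential (arbitrary units) after presenting one test pattern."""
    return V_TE * sum(1.0 / R[p] for p in active_pixels)

values = {frozenset(c): v_int(c) for c in combinations(range(16), 4)}
print(len(values))                                    # 1820 test patterns

# A threshold above the '3 shared pixels' sub-distribution fires only on the trained pattern
V_REC = V_TE * (3.5 / R_LRS)
fires = [p for p, v in values.items() if v >= V_REC]
print(fires == [frozenset(TRAINED)])                  # True: no false positives
```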

Figure 4

Pattern recognition. (a,b,c) The pattern that was submitted during the training phase, the corresponding synaptic weights after 300 training epochs, and one of the possible 1820 test patterns that were submitted during the recognition phase. (d) Cumulative distribution of the internal potential values measured after the presentation of the recognition patterns. Each sub-distribution corresponds to a given number of pixels shared with the training pattern. By adopting a threshold voltage Vrec = 1.7 V during recognition, only one pattern, i.e., the training pattern, could trigger fire in the POST, thus preventing any false positive.

Multiple pattern learning and tracking

Unsupervised learning in the brain usually proceeds by the simultaneous specialization of distinct neurons in response to sensory stimuli47. To enable multiple image learning, we extended our network to include one additional POST, as shown in Fig. 5a. POST1 and POST2 are both fully connected by separate synapses to the first layer of PREs31. The operation of the 2-neuron network is the same as that of the 1-neuron network of Figs 1–4, except for the presence of lateral inhibitory synapses between the 2 POSTs. When POST1 fires, a spike is sent through the inhibitory synapse to POST2 to reduce its internal potential Vint,2 by a fixed amount (40% in our experiment). Similarly, when POST2 fires, a spike through the inhibitory synapse to POST1 forces Vint,1 to decrease by the same amount. This winner-take-all approach prevents the 2 neurons from specializing on the same image, thus maximizing the learning and recognition functionality of the network48. Complex neural networks with inhibitory synapses have also been shown to enable parallel computing tasks, including the approximate solution of NP-hard problems, Sudoku puzzles, and similar constraint-satisfaction problems49.
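The lateral inhibition rule can be sketched as follows, with the 40% reduction of the internal potential taken from the experiment and the rest (array layout, reset on fire, handling of simultaneous threshold crossings) as simplifying assumptions.

```python
import numpy as np

# Sketch of the soft winner-take-all coupling between the POSTs: when one POST fires,
# the internal potential of each other POST is reduced by a fixed fraction (40% in the
# experiment). Simultaneous threshold crossings are not disambiguated in this sketch.

INHIBITION = 0.40          # fractional reduction of the other neuron's Vint on fire

def apply_fire_and_inhibition(v_int, v_th):
    """Check all POSTs for fire and apply lateral inhibition; returns (v_int, fired)."""
    fired = v_int >= v_th                              # element-wise fire check
    for i in np.flatnonzero(fired):
        others = np.arange(v_int.size) != i
        v_int[others] *= (1.0 - INHIBITION)            # inhibitory spike to the other POST(s)
        v_int[i] = 0.0                                 # reset the firing neuron
    return v_int, fired

v_int = np.array([0.8, 0.7])
v_int, fired = apply_fire_and_inhibition(v_int, v_th=0.72)
print(fired, v_int)        # POST1 fires; POST2 is pushed below threshold (0.7 -> 0.42)
```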

Figure 5

Static and dynamic learning within a 2-POST network. (a) Schematic illustration of a perceptron network with a 3 × 3 PRE layer and 2 POSTs, with 18 synapses between the PRE and POST layers. Inhibitory synapses connect the 2 POSTs to reduce the internal potential of one POST when the other POST fires. (b) Patterns submitted during the first phase (top and bottom bars for static learning) and sequence of 4 pattern shifts for the dynamic learning phase. (c,d) Synaptic weights for POST1 and POST2 at the end of each sequence of learning indicated in (b). (e,f) Time evolution of the synaptic weights 1/R for POST1 and POST2 during the 5 phases of the dynamic learning. Pattern weights (red) and background weights (blue) tend to LRS and HRS, respectively.

We first tested static training in the 2-neuron network by submitting the 2 images of 3 × 3 size in Fig. 5b. Static training was continued for 1000 epochs using the usual stochastic approach with alternated patterns (Fig. 5b) and random noise. After the initial training, the 2 images were shifted counter-clockwise along the perimeter of the 3 × 3 square, as indicated by the arrows in Fig. 5b. The images were moved by a total of 4 steps, and after each step the image was submitted for 1000 epochs to verify the ability of our network to track the moving image. Results are shown in Fig. 5c for POST1 and Fig. 5d for POST2, reporting the final synaptic weights at the end of each training phase. Not only is the static learning of the patterns in Fig. 5b demonstrated by the 2 neurons after 1000 epochs, but each shifted pattern is also correctly learnt at the end of each phase of the dynamic learning. Note that each neuron remains locked to one specific image during its movement, since this minimizes the number of synapses (2 for each POST) that must change their weights. The synaptic weights 1/R are shown as a function of time in Fig. 5e for POST1 and Fig. 5f for POST2, while the Supplementary Movie 3 shows an overview of the time evolution of the synaptic weights during learning and tracking of the moving image. The results confirm that synaptic weights can track dynamic patterns as a result of on-line unsupervised learning.

Discussion

Our results support object learning, recognition and adaptation in synaptic networks by unsupervised Hebbian learning, which is believed to be a fundamental synaptic plasticity principle within the human brain. Hebb’s rule generally describes a reward scheme where neurons firing in a causal sequence are rewarded with an incremented synaptic connection, while neurons firing with apparently uncorrelated timing are penalized with a decremented synaptic connection50. In machine learning, unsupervised techniques find application in data clustering and anomaly detection, which is the standard methodology to monitor intrusion hazards, bank frauds, medical errors, and similar threats51. In biological systems, reward schemes have been evidenced in several sensory functions, such as vision52, the olfactory system53, and the sensory-motor system54,55. Even the ability to recognize and anticipate the direction of moving objects, which is fundamental for the control of autonomous robots and vehicles, has been modeled by burst-mode STDP in the visual cortex56. The ubiquitous character of STDP suggests that physical hardware capable of STDP might have a key role in the development of humanoid robots and other artificial systems aiming at mimicking human perception and cognition. Thanks to the bio-mimetic nature of STDP, unsupervised synaptic networks might enable neuro-prosthetic technologies, where implanted hardware interconnected with biological neurons can supply and complement various brain functionalities to correct disabilities and heal injuries. Similarly, hardware systems based on STDP or other bio-realistic plasticity rules might be designed to replicate, or at least imitate, certain areas of the human brain in silico, thus helping to understand human cognition and perception.

A key obstacle to meeting these challenges is the difficulty of understanding and recreating the architecture of biological neural networks. For instance, the visual cortex is organized into 8–10 functional layers, with various types of neurons and a complex arrangement of synaptic connections within the axon arbor39,57. Replication and unsupervised training of such deep networks with STDP and other spike time-dependent rules have not yet been understood or achieved in hardware. In addition, the response of a neural network can be extremely complicated, including short-term and long-term plasticity, excitatory and inhibitory synaptic responses, and various types of network-level behavior, such as feedforward or recurrent spike propagation. Various forms of plasticity rules have been proposed, including not only STDP but also rate-based and triplet-based learning58. Recreating this deep architecture and complex phenomenology in hardware requires a detailed understanding of the structure and operation of the brain. In this scenario, our STDP-based memristive synaptic network offers a flexible building block for up-scaled spiking networks that mimic learning and processing in the human brain.

In summary, we presented a neural network with memristive synapses capable of STDP. Stochastic learning relies on the alternated presentation of pattern images and random noise, to enable potentiation and depression, respectively. As a result, unsupervised learning of static and dynamic images, and recognition of the same patterns were demonstrated. The demonstrated concept might provide a fundamental building block for scalable, low-power, brain-inspired computing hardware based on memristive devices.

Methods

RRAM synapses

The RRAM devices used in this study consist of a 10-nm thick switching layer of HfO2, deposited by atomic layer deposition (ALD) on top of a lithographically-confined bottom electrode made of TiN. A cross-sectional TEM image of the device is shown in Fig. S1. The HfO2 layer was doped with silicon and deposited in the amorphous phase, as confirmed by diffraction studies41. A reactive Ti top electrode was deposited on top of the HfO2 dielectric layer to act as an oxygen scavenger, leading to an oxygen exchange layer (OEL) of TiOx between Ti and HfO2. The OEL was instrumental in increasing the concentration of oxygen vacancies in HfO2, thus enhancing the leakage current in the pristine state and reducing the forming voltage. Forming was performed by the application of 100 ms-long pulses of 3 V amplitude, to initiate the CF creation and the related resistive switching process by a controlled soft breakdown of the dielectric layer. The RRAM was connected to a FET, which was integrated in the front end of the same silicon chip by a conventional CMOS process. The resulting 1T1R structure was controlled during forming, set, and reset through its 3 terminals, namely the FET gate, the FET source and the top electrode of the RRAM. The dc conduction and bipolar switching characteristics of the RRAM (Fig. 1c) were collected with an HP4155B Semiconductor Parameter Analyzer connected to the device in a conventional probe station for electrical characterization.

Synaptic network

The 1T1R synapses were connected to an Arduino Due microcontroller (μC) on a PCB for the neural network experiments. The PCB hosted up to 18 RRAM chips, each containing a 1T1R synapse, all connected through their 3 terminals according to the schematic of Fig. S4. In the network, each PRE represented an axon terminal, controlled by the μC and connected to a synapse gate. All synaptic top electrodes were driven by the μC and normally biased at Vbias = −0.2 V to induce a small current through the 1T1R synapses upon a PRE spike. All source terminals were connected to the POST input, consisting of a transimpedance amplifier (TIA) enabling current-to-voltage conversion. The output voltage of the TIA was fed into an input terminal of the μC’s ADC for digital integration, implementing the first stage of the POST. The internal threshold potential was tuned to enable firing when 2 PRE spikes activate full-LRS synapses. At the fire event, the voltage controlling the synaptic top electrodes was switched from Vbias to VTE+ and VTE− according to the pulse trace in Fig. 2c, to induce time-dependent potentiation or depression. To operate the network, the PRE spike sequence was first stored in the internal memory of the μC; the sequence was then launched while monitoring the synaptic weights 1/R and the internal potential Vint at each epoch. The spike and fire voltages and the input currents were also monitored by a LeCroy WaveRunner oscilloscope with 600 MHz bandwidth and a maximum sampling rate of 4 GSample/s.
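The POST signal chain described above can be summarized by the following Python-style sketch of the firmware loop; the hardware-access functions are placeholders standing in for the Arduino Due peripherals, the voltages follow the text, and the sampling and pulse timings are illustrative.

```python
import time

# Sketch of the POST firmware loop: digital integration of the TIA output sampled by
# the ADC, and, on fire, switching of the top-electrode drive from the read bias to the
# positive/negative feedback pulses that implement STDP. adc_read and set_top_electrode
# are placeholders for the microcontroller peripherals; timings are illustrative.

V_BIAS, V_TE_PLUS, V_TE_MINUS = -0.2, 2.0, -1.6   # volts, as quoted in the text
V_TH = 0.72                                       # fire threshold on the integrated potential

def adc_read():
    """Placeholder for sampling the TIA output voltage through the ADC."""
    return 0.0

def set_top_electrode(v):
    """Placeholder for driving all synaptic top electrodes to voltage v."""
    pass

def post_epoch(v_int, dt_sample=1e-3, epoch=10e-3):
    """One 10 ms epoch: digital integration of the TIA output and fire handling."""
    t = 0.0
    while t < epoch:
        v_int += adc_read() * dt_sample           # digital integration (first POST stage)
        t += dt_sample
    if v_int >= V_TH:                              # fire: send the feedback spike for STDP
        set_top_electrode(V_TE_PLUS)               # positive pulse (potentiation window)
        time.sleep(1e-3)                           # pulse width, illustrative
        set_top_electrode(V_TE_MINUS)              # negative pulse (depression window)
        time.sleep(1e-3)
        set_top_electrode(V_BIAS)                  # back to the read bias
        v_int = 0.0
    return v_int
```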