A neuromorphic physiological signal processing system based on VO2 memristor for next-generation human-machine interface

Yuan, Rui; Tiw, Pek Jun; Cai, Lei; Yang, Zhiyu; Liu, Chang; Zhang, Teng; Ge, Chen; Huang, Ru; Yang, Yuchao

doi:10.1038/s41467-023-39430-4

Download PDF

Article
Open access
Published: 21 June 2023

A neuromorphic physiological signal processing system based on VO₂ memristor for next-generation human-machine interface

Rui Yuan¹^na1,
Pek Jun Tiw¹^na1,
Lei Cai¹,
Zhiyu Yang²,
Chang Liu¹,
Teng Zhang¹,
Chen Ge ORCID: orcid.org/0000-0002-8093-940X³,
Ru Huang ORCID: orcid.org/0000-0002-8146-4821¹ &
…
Yuchao Yang ORCID: orcid.org/0000-0003-4674-4059^1,2,4,5

Nature Communications volume 14, Article number: 3695 (2023) Cite this article

9874 Accesses
18 Citations
2 Altmetric
Metrics details

Subjects

Electronic devices

Abstract

Physiological signal processing plays a key role in next-generation human-machine interfaces as physiological signals provide rich cognition- and health-related information. However, the explosion of physiological signal data presents challenges for traditional systems. Here, we propose a highly efficient neuromorphic physiological signal processing system based on VO₂ memristors. The volatile and positive/negative symmetric threshold switching characteristics of VO₂ memristors are leveraged to construct a sparse-spiking yet high-fidelity asynchronous spike encoder for physiological signals. Besides, the dynamical behavior of VO₂ memristors is utilized in compact Leaky Integrate and Fire (LIF) and Adaptive-LIF (ALIF) neurons, which are incorporated into a decision-making Long short-term memory Spiking Neural Network. The system demonstrates superior computing capabilities, needing only small-sized LSNNs to attain high accuracies of 95.83% and 99.79% in arrhythmia classification and epileptic seizure detection, respectively. This work highlights the potential of memristors in constructing efficient neuromorphic physiological signal processing systems and promoting next-generation human-machine interfaces.

Control of working memory by phase–amplitude coupling of human hippocampal neurons

Article Open access 17 April 2024

Flexible quasi-2D perovskite solar cells with high specific power and improved stability for energy-autonomous drones

Article 17 April 2024

Understanding asymmetric switching times in accumulation mode organic electrochemical transistors

Article 17 April 2024

Introduction

Physiological signals reflect the electrical activity of a specific body part¹ and provide valuable information about mood, cognition, and many other health issues², thus any deviation from the norm in patterns may indicate an underlying health problem. For instance, arrhythmias can be picked up by electrocardiogram (ECG) signals³ while epilepsy, which is a common neurological disorder, manifests itself as abnormalities in electroencephalogram (EEG) signals during epileptic seizure⁴. Monitoring and analyzing these physiological signals form the basis of biomedical devices used for the diagnosis, detection, and treatment of various diseases². While anomaly detection and analysis can be done manually, a physiological signal processing system that is capable of providing diagnosis without human intervention can be useful in providing a second opinion or even picking up subtle and easily overlooked patterns.

In a traditional physiological signal processing system, the analog physiological signals are first converted into digital signals by analog-to-digital converters (ADC) and then stored in memory before being further processed in digital computing units^5,6,7. However, the frequent movement of a massive amount of data between memory and computing units heavily affects the speed and power consumption^8,9. The parallel and event-driven¹⁰ neuromorphic computing system, which is inspired by the human brain, is a promising alternative approach for breaking the von Neumann bottleneck¹¹. It is much more energy efficient and suited for processing physiological signals, as they contain spatiotemporal information, thus motivating the design of a brain-like physiological signal processing system. Although some neuromorphic physiological signal processing systems based on complementary metal-oxide semiconductor (CMOS) technology have been demonstrated, most of them suffered from area and energy inefficiencies, due to the incorporation of complex auxiliary circuits and bulky capacitors for the implementation of bio-dynamics^{11,12,13,14,15}.

To achieve an efficient neuromorphic physiological signal processing system, memristors provide an appealing platform due to their abundant ion dynamics^{16,17,18,19,20,21,22,23,24,25,26,27,28} and electrical behaviors akin to those found in biological neurons and synapses, hence lending themselves well to realizing compact neuromorphic architectures. While the hardware implementations of Leaky Integrate and Fire (LIF) neurons have been reported widely in literature^27,28,29, few studies demonstrated hardware implementations of Adaptive-Leaky Integrate and Fire (ALIF) neurons^30,31, which have been shown to improve the computational capabilities of neuromorphic systems. Nevertheless, these ALIF hardware implementations still have room for optimization. More importantly, a complete memristor-based neuromorphic physiological signal processing system that features a highly efficient spike encode scheme and a more biologically plausible neural network with ALIF neurons has not yet been reported.

In this work, a complete neuromorphic physiological signal processing hardware system for the next-generation human-machine interface based on VO₂ memristors is demonstrated. Specifically, a platform that can convert analog physiological signals into a stream of asynchronous spike events is proposed, which fully utilizes the positive and negative symmetric thresholds and fast volatile characteristics of VO₂ memristors so as to simplify the circuit. Different from the frequency-encoding mode of traditional neurons, the spikes from each channel of the encoding platform mark the time at which the input signal has changed beyond a fixed threshold, which can preserve the original input information content to the greatest extent while keeping a low spiking rate to reduce energy consumption. Besides, a memristor-based decision system that features a Long short-term memory Spiking Neural Network (LSNN) with powerful computational capabilities³² is provided, wherein ALIF neurons were designed efficiently using VO₂ memristors. The performance of this system was evaluated via arrhythmia classification and epileptic seizure detection tasks, achieving accuracies of 95.83% and 99.79%, respectively. This system with a small LSNN has implied immense potential in processing various physiological signals and can hold great prospect in dealing with other temporal signals in general.

Results

Design of VO₂ memristor-based neuromorphic physiological signal processing system

Figure 1 schematically illustrates the proposed VO₂ memristor-based neuromorphic processing system, which integrates an asynchronous spike encoder and an LSNN-based decision system. In the data compression and encoding stage, the memristor-based asynchronous spike encoder converts each channel of the collected physiological signals, such as ECG and EEG, into two-channel spike trains (UP/DOWN channel), which represent the rise or fall of the original signal, respectively. The asynchronous spike encoder based on memristor was inspired by LC-ADC^{14,33,34,35,36} and delta modulator circuits^37,38,39, wherein spikes from each channel mark the time at which the input signal changes beyond a fixed threshold. The speed of spikes emission is determined by the variation rate of the input signal, thus realizing non-uniform and sparse spike encoding, which can reduce the amount of data and energy consumption. Compared with the frequency encoding of traditional neurons, this method contains temporal information and is thus more friendly to neuromorphic systems. Since the information of the original signal is preserved to the greatest extent, the encoded asynchronous spike trains can reconstruct the original signal accurately, which is hard for frequency coding. In the section regarding the asynchronous spike encoder based on VO₂ memristor, we introduced in detail how to use memristors to implement asynchronous spike encoding without ADC/DAC and special control circuits. Another core of the system is the decision network. Here, a memristor-based decision system that features an LSNN is utilized in which the memristor also plays a central role. The LSNN-based decision system contains two kinds of neurons, the LIF neuron and ALIF neuron. Among them, the hardware implementation of the ALIF neuron requires a feedback mechanism and hence is relatively difficult. In this system, we utilized VO₂ memristors to construct not only LIF neurons but also ALIF neurons efficiently. By incorporating these key characteristics, the neuromorphic system can exhibit high accuracy with few weights and used for physiological signal processing in human-machine interfaces.

**Fig. 1: The neuromorphic physiological signal processing system based on VO₂ memristor for the next-generation human-machine interface.**

VO₂ memristor-based artificial LIF neuron

Neurons are the building blocks of brain-like systems. To construct the artificial neuron efficiently, memristors with highly uniform threshold switching (TS)^27,40,41 and volatile characteristics are required. The memristor used in this work is based on VO₂ and is designed as a planar device as shown in Fig. 2a. Supplementary Fig. 1a shows a scanning electron microscopy (SEM) image of the device, where the channel length is 400 nm and the electrode width is 2 μm. Details of fabrication processes are shown in Methods. Supplementary Fig. 1b shows the transmission electron microscopy (TEM) image of the device, and a zoom-in view of the VO₂ film and corresponding fast Fourier transformation is shown in Supplementary Fig. 1c, d, where well-ordered lattice fringes are evident, verifying the high crystalline quality of VO₂ film which is important for achieving high uniformity in our devices. The cross-sectional scanning transmission electron microscopy (STEM) image and corresponding energy dispersive X-ray spectroscopy (EDS) mapping of O, Al, Si, V, Ti, and Au elements in the device can be seen in Supplementary Fig. 2a, along with EDS elemental line profile in the same region (Supplementary Fig. 2b). Stable volatile resistive switching is indicated by the I–V characteristics of the VO₂ memristor (Fig. 2b), where 100 cycles were performed. The device changes from a high resistance state (HRS) to a low resistance state (LRS) once the applied voltage exceeds a threshold voltage (V_th) of around ±3.4 V and immediately returns to HRS when the applied voltage falls below the holding voltage (V_hold) of around ±1.45 V. This resistive switching phenomenon arises from the metal-insulator transition of VO₂, which is a result of the intertwined structural and electronic phase changes^42,43,44. The transition between the low-temperature semiconducting phase and the high-temperature metallic phase occurs at around ~340 K, and can be triggered by Joule heating⁴⁵. To illustrate this point, we simulated the thermodynamic resistive switching process using COMSOL Multiphysics. As shown in Supplementary Fig. 3, the switching of the VO₂ memristor between HRS and LRS is accompanied by the formation or disappearance of a high-temperature filament, which has also been previously observed^46,47. To be specific, heat is generated in the VO₂ memristor as the applied voltage increases (state (1) to (2)). Once the phase transition is triggered, a filament forms through the VO₂ gap, switching the device from HRS to LRS. Then, the filament expands as the voltage is increased (state (2) to state (3)). When the voltage is reduced, the heat dissipates, and the filament size decreases (state (3) to state (4)). Once the applied voltage is below V_hold, the filament breaks down and the device eventually returns to HRS (state (4) to state (1)). The simulated I-V curve agrees well with the experimentally measured curve, further verifying the Joule heating-induced phase transition and the filament formation picture. Figure 2c displays the cumulative plots of positive and negative threshold/holding voltages, including V_{th_pos}, V_{hold_pos}, V_{th_neg}, and V_{hold_neg} in 100 repeated cycles. The coefficient of variation (C_v) defined by the ratio of the standard deviation (σ) to the mean value (μ) of V_{th_pos}, V_{hold_pos}, V_{th_neg}, and V_{hold_neg} were 0.65%, 0.86%, 0.31%, and 1.68%, respectively, showing very low cycle-to-cycle (C2C) variations. The superior uniformity can be attributed to the high crystallinity epitaxial VO₂ thanks to the matching lattice planes across the film-substrate interface⁴⁸, as well as the preservation of such desirable qualities in a planar device structure (Supplementary Note 1). In addition to the uniformity observed under steady state, the VO₂ memristor also displayed very small variations in V_th and V_hold when it was connected to an external circuit and was operating in a dynamical state (Supplementary Fig. 4). The C_v of V_th and V_hold during ~1000 periods of transient oscillations were 0.73% and 0.48%, respectively. Moreover, when the planar VO₂ memristor was operated in air under normal atmospheric pressure, under different ambient pressures ranging from 3.5 × 10⁻³ mbar down to 5.0 × 10⁻⁴ mbar and in an N₂ environment, it also exhibited stable threshold switching behavior with no appreciable difference in its I-V characteristics (Supplementary Fig. 5). This implies that such devices are not affected by various atmospheric content such as moisture. Crucially, the VO₂ memristor demonstrated a high endurance of >6.5 × 10⁶ switching cycles (Supplementary Fig. 6), which ensures the reliability of encoders and neurons that incorporate such devices. Supplementary Fig. 7 displays the transient electrical measurements, where the switching speed of the VO₂ memristor in this work is <70 ns from off-state to on-state and <60 ns from on-state to off-state, exhibiting a high-speed threshold switching characteristic.

**Fig. 2: The implementation of memristor-based artificial LIF neuron.**

The circuit configuration of artificial neuron based on VO₂ memristor is shown in Fig. 2d. The VO₂ memristor is connected in parallel with a capacitor and in series with a load resistor R_L. Besides, a 50 Ω resistor R₀ is used to convert the current into a voltage output. The dynamics of an ion channel located near the soma of a neuron can be mimicked by the threshold switching (TS) behavior of VO₂ while the membrane capacitance is represented by C_p. The oscilloscope is used to measure electrical waveforms across the C_p, the input waveforms, and the output of the artificial neuron (see Methods and Supplementary Fig. 8). When a voltage is applied to the artificial neuron, the capacitor begins to charge. Once the voltage on the capacitor exceeds V_th, the VO₂ memristor switches to LRS. As a result, a spike is generated, which will be transmitted to the next neuron. Besides, the capacitor will be discharged through the on-state memristor. When the voltage on the capacitor drops below V_hold, the device will return to HRS. The spiking rate of the artificial neuron strongly depends on the series resistance, applied voltage, and parallel capacitance. Figure 2e, f shows the response of the artificial neuron under different series resistance R_L (18 kΩ, 10 kΩ) when fixing a constant input voltage of 10 V without an external parallel capacitor (More results can be found in Supplementary Fig. 9). A larger R_L will reduce the input current, thus slowing down the charging process, thereby reducing the firing frequency (Fig. 2i). On the other hand, a larger input voltage will increase the charging current, thereby speeding up the charging process, thus increasing the frequency (Fig. 2g, h, j and Supplementary Fig. 10). Supplementary Fig. 11 shows the experimental response of the artificial neuron under different parallel capacitors. As the parallel capacitance increases, the integration process becomes slower, thus reducing the firing frequency. These firing behaviors can also be deduced from the RC circuit analysis detailed in Supplementary Note 2.

To gain insights into the neuron circuit behavior and to assist in designing the ALIF neuron and the spike encoder, we developed a SPICE model using LTSPICE for our VO₂ memristor (Supplementary Fig. 12 and “Methods”) based on the one proposed in ref. ⁴⁹. Our improved model has no polarity, thus allowing symmetrical static I–V characteristics and switching thresholds under positive and negative biases, which is in accordance with practical planar VO₂ memristors. In essence, the model consists of a comparator, which compares the terminal voltage of the device to V_th/V_hold when it is in HRS/LRS and flips the state if the terminal voltage increases/decreases beyond the thresholds. The inclusion of R₀ and C₀ is to suppress instantaneous state transitions of the comparator, which models the finite switching time of the real-world VO₂ memristor. The voltage across C₀ is then used to determine the resistance of the VO₂ memristor model. The simulation results were in good agreement with the experimental results as shown in Fig. 2k–l, where V_in and R_L were set as 10 V and 18 kΩ, respectively (Supplementary Table 1 lists the parameters of the device).

VO₂ memristor-based adaptive LIF neuron

Building upon the compact LIF neuron presented in the previous section, we designed a highly efficient VO₂ memristor-based ALIF neuron by adding an adaptive control circuit, which requires only a few extra components and a feedback connection (Fig. 3a). The adaptive property stems from the increased membrane leakage current after the neuron fires, which renders subsequent input integration harder and has its analogous process found in biological neurons⁵⁰. The key processes involved in this ALIF neuron are summarized in Fig. 3b. The workings of the LIF part are similar to that of the LIF neuron, but with an additional membrane leakage path via M₃. To achieve adaptation, the spike output is amplified by the M₁ common-emitter amplifier to drive M₂, which charges C₂ and increases V_g. Consequently, M₃ turns on more and increases the leakage current with each spike. By Kirchoff’s Current Law, the increased leakage current is subtracted from the input charging current, resulting in a slower charging of C₁ during the integration phase and a reduced spiking frequency. In biological neurons, the adaptive effect diminishes and the spiking frequency returns to the initial level when the neuron is rested. In our ALIF neuron, this feature is enabled by R₃, which provides C₂ with a discharging path and turns off M₃ slowly. It is important to note that V_g has to be a slow-changing variable relative to V_m, that is to say, the adaptive time constant (τ_a = R₃C₂) needs to be sufficiently large compared to the membrane time constant (τ_m = (R_VO2 + R₁)C₁). To effectively utilize the temporal processing capability of ALIF neurons, which stems from their adaptive property, the choice of τ_a should roughly be on the same time scale as the total input duration^31,51. Another point to note is that although not demonstrated in this work, the simple common-emitter amplifier introduces signal gain and allows the neuron to drive subsequent stages²³, which could be beneficial in realizing future compact multilayer networks.

To further understand the workings of this ALIF neuron, we simulated the circuit in LTSPICE with the aforementioned VO₂ model. The circuit parameters used under controlled conditions are listed in Supplementary Table 2. To illustrate the effect of V_g on the spiking frequency, we directly varied the voltage applied on the gate of M₃, and calculated the spiking frequency and its reciprocal, the inter-spike interval (ISI), from the resulting output voltage spikes (Fig. 3c). When V_g is lower than the turn-on threshold voltage of M₃ (V_t, _M3 ~ 0.7 V), M₃ is off and the spiking frequency remains constant in this range. As V_g is increased further beyond V_t, _M3, the spiking frequency decreases monotonically. Beyond a V_g of ~1.65 V, the membrane leakage current is so large that the reduced input current cannot charge C₁ sufficiently to raise V_m to V_th, therefore the neuron stops firing. Thus, is it evident that the spiking frequency of the proposed ALIF neuron can be modulated by V_g. The effect of the width-to-length (W/L) ratio of M₃ on the spiking frequency was also simulated and the results are plotted in Supplementary Fig. 13, showing that, for a given V_g, a larger W/L ratio results in a lower spiking frequency due to an increased leakage current.

Next, we simulated the dynamical adaptation of the circuit by applying a constant step input current while allowing V_g to dynamically evolve. The resulting waveforms of V_g, V_m, and V_spike are illustrated in Fig. 3d. The evolution of the ISI is illustrated in the curve corresponding to the controlled condition in Supplementary Fig. 15. As V_g increases with each spike, the spiking frequency remains constantly high initially, before decreasing at an increasing rate, which is a similar trend as that in Fig. 3c. Eventually, V_g is high enough such that V_m cannot reach V_th unless V_g has decreased sufficiently. This is evident from the gradual plateau feature during the late charging phase in the V_m waveform. As a result, V_g simply oscillates around a fixed value with a prolonged period and the spiking frequency saturates at its lowest level. When the input signal is removed, V_g decays at a rate much lower than that of V_m, illustrating the difference in their time constants. The adaptive property of the circuit can be tuned by adopting different values for W/L (M₂, M₃), C₂, and R₃, as elucidated in Supplementary Fig. 14-15. It can be seen that the onset of adaptation is later if either C₂ is large or the W/L of M₂ is small, as a larger capacitor and a smaller current result in a slower charging process. On the other hand, the saturation frequency can be tuned by M₂, M₃, and R₃. A larger W/L of M₂ induces a larger step increase in V_g, which dwells on its raised value longer if R₃ is larger, hence contributing to a larger ISI. Besides, M₃ with a larger W/L requires a lower V_g to achieve the same leakage current, and a lower V_g decays at a slower rate, which increases the ISI. It is worth noting that the initial high frequency cannot be adjusted as it is solely determined by the LIF part. Thus, we presented the various tuning knobs to obtain different adaptive properties, which can be useful in optimizing the performance of the ALIF neurons. The benchmark of the ALIF neuron in this work against previous implementations is shown in Supplementary Table 3, highlighting the simplicity of our circuit.

The asynchronous spike encoder based on VO₂ memristor

Another key role in neuromorphic physiological signal processing systems is the spike encoder. An ideal encoder should provide a compressed representation of the data while preserving as much information as possible⁵². In ordinary neurons, analog signals are encoded as spike frequencies which do not contain accurate timing information, making it difficult to reconstruct the original analog signals. The asynchronous spike encoder based on VO₂ memristor converts the input analog signal into two spike trains, a positive and a negative (UP/DOWN channel). The positive spike represents the moment when the input signal increases beyond a threshold, while the negative spike represents the moment when the input signal decreases beyond a threshold. The spike trains can accurately reconstruct the original input analog signal due to the inclusion of precise time information. The schematic of memristor-based asynchronous spike encoder is depicted in Fig. 4a, including input amplifier with a capacitive-divider gain stage, intermediate amplifier, VO₂ memristor and feedback reset branch. According to the law of conservation of charge at the negative input terminal node of the input stage op-amp, the voltage at gout changes only when the input voltage changes. Then the voltage of gout is amplified by the intermediate stage op-amp and applied to the VO₂ memristor. When the voltage exceeds the positive/negative threshold V_th of the memristor, the memristor will become LRS, thus issuing a positive/negative high voltage on R₃. Then, the high voltage will turn on the NMOS/PMOS through the feedback path, and reset the voltage of gout. At this moment, the voltage on the VO₂ memristor will be lower than positive/negative V_hold, thus the memristor will automatically return to HRS, the voltage on R₃ falls, thereby turning off the PMOS/NMOS on the feedback path. On the other hand, the positive and negative spikes are separated by two diodes to UP/DOWN channels as outputs. The threshold δ represents the incremental or decremental change of the input signal that causes a single spike, which can be described by Eq. 1 when C₁ = C₂:

$$\delta=\frac{{V}_{{{{{{\rm{th}}}}}}}}{\alpha \,\frac{{R}_{{{{{{\rm{off}}}}}}}}{{R}_{{{{{{\rm{off}}}}}}}+{R}_{3}}}$$

(1)

where V_th is the threshold voltage of the VO₂ memristor, while R_off is the resistance of the HRS. α is the absolute amplification factor of the intermediate stage op-amp, which can be described by Eq. 2:

$$\alpha=\frac{{R}_{2}}{{R}_{1}}$$

(2)

**Fig. 4: Proposed memristor-based asynchronous spike encoder.**

It can be seen from the above formula that δ can be adjusted by the amplification factor α of the intermediate stage op-amp.

We first use a sine wave to verify the functionality of the memristor-based asynchronous spike encoder. Figure 4b exhibits the simulation results in LTSPICE, where the blue curve represents the original input, and the pink curve represents the reconstructed result, in the first row. It can be seen that the signal is well reconstructed. The middle row shows the node gout of the asynchronous spike encoder (green curve), which is next amplified by intermediate stage op-amp and applied to the VO₂ memristor. When the amplified voltage reaches the symmetrical positive/negative threshold voltage of the VO₂ memristor, the memristor switches to low resistance and emits a positive/negative spike, which is divided into two channels by two parallel reverse diodes as shown in the last row of Fig. 4b, where the red and blue curve represent the UP and DOWN channels, respectively. When a spike appears in the UP channel, it means that the original signal has increased by a δ. On the contrary, when a spike appears in the DOWN channel, the original signal has decreased by a δ. The frequency of spikes emission depends on the rate at which the original input signal changes. The faster the original input signal changes, the higher the intensity of the spikes. This type of encoding has the advantage of the sparsity of the spikes, and the on-demand nature of the encoding (when the input signal is not changing, no output spikes are produced)¹¹. Supplementary Fig. 16 exhibits the influence of amplification factor α of the intermediate stage op-amp on the delta, the larger the α, the smaller the δ. Increasing the amplification factor improves the accuracy, but also increases the spike emission rate, thereby increasing energy consumption. We tested the encoder with two typical ECG signal waveforms, both of them can be well encoded and reconstructed as shown in Fig. 4c, d, demonstrating its potential as a next-generation neuromorphic spike encoder for physiological signals. Due to the full use of the positive and negative thresholds and volatile characteristics of the VO₂ memristor, our circuit has been greatly simplified without using complex control circuits and ADC/DAC, compared with previous work (Supplementary Table 4). The simulation parameters of the asynchronous spike encoder are provided in Supplementary Table 5.

A key aspect that needs to be considered when using a VO₂ memristor in the asynchronous spike encoder is its reliability in encoding physiological signals. This can be assessed based on the lifespan of the encoder and the signal encoding quality. As aforementioned, the VO₂ memristor has a high endurance (Supplementary Fig. 6), which will ensure the durability of the encoder. On the other hand, the quality of signal encoding is affected by V_th fluctuations. We introduced varying degrees of V_th fluctuations in the SPICE model and performed multiple noisy encoding processes (Supplementary Note 3). The encoding quality was quantified by the mean squared error (MSE) between the original and the reconstructed signals. The results are plotted in Supplementary Fig. 17, along with examples of signal reconstruction under zero, moderate and high degrees of V_th fluctuations. As the MSE and C_v correlate positively, our VO₂ memristor, which has a remarkably low C_v, will yield accurately encoded spike outputs (Supplementary Note 3). Moreover, the tight MSE distribution at such low C_v will also enable superior repeatability in spike encoding. Therefore, these results on the endurance and signal encoding quality attest to the reliability of our VO₂ memristor-based spike encoding architecture.

VO₂ memristor-based LSNN and arrhythmia classification

Using the key modules presented above, we designed a robust and efficient physiological signal processing system with great temporal processing capacity. As shown in Fig. 5a, the system consists of two stages, namely the VO₂ memristor-based encoder followed by the decision-making VO₂-based LSNN³². The encoder converts analog physiological signals, for instance an ECG signal of a heartbeat, into UP and DOWN spike trains on a per input channel basis. These spike trains, which faithfully represent the original signal, are then relayed to the LSNN. Spatially, the LSNN is a 3-layer network comprising an input spiking layer, a hidden recurrent spiking layer with a low-pass filter, and an output classification layer. Each synaptic weight in all connections is assigned a random synaptic delay at network initialization. The core of the LSNN is the hidden recurrent layer, which consists of LIF neurons and ALIF neurons. It is the adaptive property of the ALIF neurons that endows LSNN with its temporal computing capability⁵¹. Here, we utilized the VO₂ memristor-based LIF neurons and ALIF neurons from the previous sections. During network simulations, the dynamics of these neurons were modeled according to their corresponding circuit designs and have the following discretized membrane dynamics equations (Eqs. 3–4):

$$V_{{{{{{\rm{m}}}}}}\_{{{{{\rm{LIF}}}}}}}\left(t+\Delta t\right)={\alpha V}_{{{{{{\rm{m}}}}}}\_{{{{{\rm{LIF}}}}}}}\left(t\right)+\left(1-\alpha \right){R}_{{{{{{\rm{eff}}}}}}}x$$

(3)

$$V_{{{{{{\rm{m}}}}}}\_\; {{{{{\rm{ALIF}}}}}}}\left(t+\Delta t\right)={\alpha V}_{{{{{{\rm{m}}}}}}\_{{{{{\rm{ALIF}}}}}}}\left(t\right)+\left(1-\alpha \right){R}_{{{{{{\rm{eff}}}}}}}\left(x-{I}_{{{{{{\rm{leak}}}}}}}\right)$$

(4)

where $\alpha={{\exp }}\left(-\Delta t/{R}_{{{{{{\rm{eff}}}}}}}{C}_{1}\right)$. R_eff, x, and I_leak are the effective resistance of the VO₂ memristor in HRS in series with the readout resistor, the input current scaled by a factor, and the leakage current via transistor M₃ due to adaptation, respectively. When V_m exceeds V_{th_eff}, it is reset to V_{hold_eff}, where V_{th_eff} and V_{hold_eff} are the effective threshold and holding voltages of VO₂ memristor considering the readout resistor, respectively. I_leak depends on V_g, which evolves according to the discretized dynamics equation (Eq. 5):

$${V}_{{{{{{\rm{g}}}}}}}\left(t+\Delta t\right)={\beta V}_{{{{{{\rm{g}}}}}}}\left(t\right)+\left(1-\beta \right){R}_{3}{I}_{{{{{{\rm{a}}}}}}}z$$

(5)

where $\beta={{\exp }}\left(-\Delta t/{R}_{3}{C}_{2}\right)$, z is 1 if a spike was fired and 0 otherwise, and I_a is the adaptive charging current via M₂. The forward pass during both the training and testing phases, as well as the backward pass during only the training phase, is shown in the flow chart in Fig. 5b. In the forward pass, the input spike vector of the current timestep and the hidden spike vector of the previous timestep are linearly transformed by the forward weights and the recurrent weights, respectively. The resulting vectors are added together and then integrated in the hidden layer to produce a hidden spike vector, which is subsequently low-pass filtered before being linearly transformed by output weights into an output vector. The output node with the highest value in the last timestep corresponds to the classification result. In the backward pass, the total error consisting of the classification cross-entropy loss and a spike regularization term, which promotes sparse firing of the spiking neurons³², is back-propagated to train the fully-connected weights. As each spiking neuron is effectively a non-differentiable step activation function, we used a surrogate derivative for gradient calculations^32,53 (see “Methods”).

**Fig. 5: Illustration of the VO₂ memristor-based LSNN in the physiological processing system for ECG heartbeat classification.**

Next, we investigated the performance of the proposed physiological signal processing system on classifying heartbeats from the MIT-BIH arrhythmia database⁵⁴. The ECG recordings were preprocessed and categorized according to the AAMI recommended classes⁵⁵ (see “Methods”). 2000 heartbeat samples were used as our dataset, which were randomly split into a training set of 1664 samples and a testing set of 336 samples.

For this 4-class heartbeat classification task, we used the proposed system with an LSNN of size 3 × 100 × 4, in which out of the 100 hidden neurons, 60 were LIF neurons while the other 40 were ALIF neurons. The 3 input nodes correspond to UP, DOWN, and CUE channels, while the 4 output nodes correspond to the 4 classes of heartbeats. Other LSNN parameters are listed in Supplementary Table 6, wherein the parameters describing the VO₂ memristor were extracted from experimental data. The single-channel analog heartbeat is encoded into an UP spike train and a DOWN spike train using the VO₂ memristor-based encoder as shown in Fig. 5c. Also shown is the CUE channel, which fires constantly at the end of the heartbeat, prompting the LSNN to generate a valid classification output. We trained the LSNN for 150 training epochs. Figure 5d, e plot the spike raster of the LIF neurons and ALIF neurons, respectively, when the trained system was classifying a normal heartbeat (Fig. 5a inset, Fig. 5c). Figure 5f illustrates the V_g evolution of five ALIF neurons. Each step increase in V_g implies a spike being fired by that neuron in the previous timestep, hence the spiking activities and the patterns within can be easily observed. ALIF neurons that fire at a high rate during the heartbeat (timestep ~500) could not fire easily during the output period (timestep >1000) by virtue of their adaptive property manifested here as a high V_g, while those initially inactive neurons fired at a high rate during the output period. This exemplifies the negative imprinting principle⁵¹, which equips the ALIF neurons with remarkable temporal processing capabilities. As can be seen from the output probabilities in Fig. 5g, the system correctly classified this heartbeat. The evolution of the test accuracy of the system is shown in Fig. 5h, indicating a maximum accuracy of 95.83%. The confusion matrix in Fig. 5i further illustrates the classification results in detail.

To further investigate the computational advantage of the VO₂ memristor-based ALIF neurons, we trained two other same-sized LSNNs but with different configurations, one with only hidden LIF neurons (LIF-only LSNN) while the other with only hidden ALIF (ALIF-only LSNN) neurons, on the same task. The spike raster plots, V_g evolution, and output probabilities for an instance of a normal heartbeat are shown in Supplementary Figs. 18–19. The accuracy evolutions, loss evolutions, and confusion matrices are compared to the LSNN with both types of spiking neurons (mixed LSNN) as shown in Supplementary Fig. 20. We also trained the three configurations of LSNNs in classifying heartbeats as Normal (N) or not Normal (not N). The LSNNs were of size 3 × 20 × 2, wherein 8 out of the 20 hidden neurons in the mixed LSNN were ALIF neurons. The spike raster plots, V_g evolution, and output probabilities for an instance of a normal heartbeat are shown in Supplementary Figs. 21–23, while the accuracy evolutions, loss evolutions, and confusion matrices are shown in Supplementary Fig. 24. The best test accuracy statistics for 18 training trials are shown in Supplementary Table 7. By comparing the maximum test accuracies attained by the three LSNN configurations as shown in Fig. 5j, two important observations can be made. Firstly, the LIF-only LSNN performed worse than the mixed LSNN by approximately 15% and 18% in terms of accuracy in the 2-class and the 4-class task, respectively, thereby highlighting the importance of ALIF neurons in processing the temporally-structured information within physiological signals. Secondly, the ALIF-only LSNN performed only marginally better than the mixed LSNN in general, thereby signifying the necessity of setting only a fraction of the hidden nodes as ALIF neurons to achieve superior performance. These findings elucidate the immense temporal computing capability of these neurons. Moreover, the design choice of using the mixed LSNN configuration is further justified by the potential reduction of area costs over the ALIF-only LSNN, especially when scaling up the system and considering its major use case in compact wearable medical devices.

Epileptic seizure detection

To further verify the ability of the proposed VO₂ memristor-based physiological signal processing system in dealing with other complex signals, we demonstrated epileptic seizure detection on EEG signals from the CHB-MIT scalp EEG database⁵⁶. The EEG recordings were preprocessed as detailed in the “Methods” section. 2530 EEG clips were used for training, which is comprised of 1265 Normal (N, negative class) and 1265 Epileptic (E, positive class) independent non-contiguous EEG clips. The testing set is comprised of 2878 contiguous EEG clips amounting to a one-hour period and is a highly imbalanced dataset with only 31 contiguous epileptic clips. The choice of a contiguous imbalanced testing set is to closely simulate a real-world scenario during epileptic seizures, wherein the seizure episodes are often sparse with each lasting for a short period of time. This is to ensure that our trained system can be deployed in real-time epileptic seizure detection in the future.

A schematic illustrating the training, testing, and post-processing steps of the epileptic detection system is shown in Fig. 6a. The 18-channel EEG clip (inset is an example of an epileptic clip) is first encoded by the VO₂ memristor-based encoder into 18 pairs of UP and DOWN spike trains, before being input into the LSNN. The training and testing phases of the LSNN are similar to that of the ECG task, wherein the encoded signal is classified by the LSNN in the forward pass during both phases, and the error, which includes cross-entropy loss and spike regularization, is back-propagated through time to update the weights during training only. Due to the highly imbalanced nature of the testing set, the performance of the LSNN during testing was evaluated using the G-mean metric⁵⁷, which is the geometric mean of the sensitivity and the specificity of the classification system (Eqs. 6–8):

$${{{{{\rm{Sensitivity}}}}}}=\frac{{{{{{\rm{TP}}}}}}}{{{{{{\rm{TP}}}}}}+{{{{{\rm{FN}}}}}}}$$

(6)

$${{{{{\rm{Specificity}}}}}}=\frac{{{{{{\rm{TN}}}}}}}{{{{{{\rm{TN}}}}}}+{{{{{\rm{FP}}}}}}}$$

(7)

$${{{{{\rm{G}}}}}}-{{{{{\rm{mean}}}}}}=\sqrt{{{{{{\rm{Sensitivity}}}}}}\cdot {{{{{\rm{Specificity}}}}}}}$$

(8)

where TP, FP, TN, and FN denote true positives, false positives, true negatives, and false negatives, respectively. To improve the system performance, especially in terms of specificity, a post-processing step was performed on the contiguous LSNN classification results to obtain the final classification results^58,59 (light purple box in Fig. 6a). It consists of a moving average operation, followed by a thresholding operation at each timestep to output a binary sequence, which is also contiguous in time. Note that the post-processing step is decoupled from the LSNN, and is not involved in loss calculation during training, or in model evaluation during testing.

**Fig. 6: Illustration of the physiological processing system for EEG epileptic seizure detection.**

The LSNN employed for this task was of size 37 × 40 × 2, with 16 out of the 40 hidden spiking neurons being ALIF neurons. The 37 input nodes include 18 UP channels, 18 DOWN channels, and a CUE channel, while the 2 output nodes correspond to N and E. Other LSNN parameters are listed in Supplementary Table 6. We trained the LSNN for 150 epochs. The CUE signal and the encoded spike trains that represent the epileptic EEG clip in Fig. 6a are shown in Fig. 6b. The spike raster plots of the LIF and ALIF neurons when the trained LSNN was classifying this clip are shown in Fig. 6c, d, respectively. The V_g evolution of five ALIF neurons is shown in Fig. 6e, while the output probabilities are shown in Fig. 6f. As can be seen, the system correctly classified this EEG clip. As shown in the confusion matrix in Fig. 6n, the accuracy, sensitivity, and specificity of the LSNN were 82.70%, 100%, and 82.51%, respectively. All of the positive epileptic EEG clips were accurately identified. We also trained a LIF-only LSNN and an ALIF-only LSNN on the same task (Supplementary Figs. 25–27), again corroborating the superior temporal processing capability of our ALIF neurons. The test G-mean statistics for 18 training trials are shown in Supplementary Table 8.

From these results, we can see that the specificity of the LSNN indicates a rather high number of false positives, possibly due to insufficient training data. The nature of the detected positives is revealed by visualizing the contiguous LSNN classification results (Fig. 6h) and comparing them against the target labels (Fig. 6g). While the accurately identified true positives spanned several contiguous EEG clips (Fig. 6k), the predicted false positives were randomly distributed throughout the one-hour period (Fig. 6l). This observation motivated the inclusion of the aforementioned post-processing step. As the moving average and the thresholding depends on the width of the averaging window and the threshold value, respectively, we optimized these parameters by enumerating their possible combinations and comparing the post-processing accuracies, sensitivities, and specificities (Supplementary Fig. 28). A window width of 9 and a threshold of 0.8 were selected for the best accuracy and sensitivity. As shown in Fig. 6i, the moving average smoothed out the random false positives while preserving the clustered true positives. Upon thresholding, the smoothened false positives were effectively removed (Fig. 6j), while the true positives were retained (Fig. 6m). As shown in the confusion matrix in Fig. 6o, the accuracy, sensitivity, and specificity after post-processing were 99.79%, 90.32%, and 99.89%, respectively. Owing to the efficient spike encoding scheme and the LSNN with high temporal processing capability, our system achieved state-of-the-art performance in various metrics while only needing 1–3 orders of magnitude fewer weights (Supplementary Table 9). The small network coupled with compact memristive circuit design for the encoder and the spiking neurons will benefit future hardware integration in biomedical devices and next-generation human-machine interfaces⁶⁰.

Discussion

The proposed VO₂ memristor-based physiological signal processing system has a high area efficiency. To illustrate this, we compared each VO₂ circuit module with existing CMOS or memristor implementations (Supplementary Note 4). With proper device and circuit optimizations (Supplementary Table 10), the LIF and ALIF neuron can achieve a small area of ~41.3 μm² and ~53.4 μm², respectively. Besides achieving the smallest area overhead, it is worth noting that the optimized VO₂ memristor-based ALIF neuron is also superior in terms of the combined aspects of area, speed and energy consumption (Supplementary Fig. 29, Supplementary Table 11). Furthermore, the proposed VO₂ memristor-based encoder can achieve an area of ~2231 μm², which is almost an order of magnitude smaller than other similar encoders (Supplementary Tables 12–13). Thus, VO₂ memristor-based encoder and neurons can provide substantial benefits over other CMOS or memristor implementations in realizing physiological signal processing systems. Further shrinking of VO₂ memristors is desirable in realizing hardware-based neural networks with an even higher integration density, especially in neuron circuits when capacitors, which are the dominant area-consuming components, are reduced or even replaced by the intrinsic parasitic capacitance for faster computations. Planar devices with gap sizes of 100 nm or less have been reported previously^40,61, and aggressive scaling down to the limits of lithography is possible given that the metal-insulator transition and, subsequently, the threshold switching behavior still exists at the nanoscale^62,63. Apart from illustrating the benefits of our proposed physiological signal processing system, another takeaway from this discussion is the need for meticulous co-optimizations between various circuit components. The demonstrated co-optimizations, although simple, represent the first of many steps that need to be emphasized. Lastly, we further envision the merging of the VO₂ memristor-based encoders and neurons with non-volatile crossbar arrays of emerging memories²⁷ via proper interfacing (Supplementary Fig. 30) to ultimately realize an extremely compact physiological signal processing architecture.

In summary, a highly efficient neuromorphic physiological signal processing hardware system for the next-generation human-machine interface based on VO₂ memristors was proposed for the first time. This system contains a memristor-based asynchronous spike encoder and a decision network that features a long short-term memory spiking neural network which analyzes the physiological signals encoded in spikes. The spikes from memristor-based encoder mark the time at which the input signal has changed beyond a fixed threshold, which can preserve the original input information content to the greatest extent, so the encoded spikes can reconstruct the original signal accurately. The accuracy of signal encoding and reconstruction can be adjusted by the amplification factor of the intermediate stage op-amp. This spike encoding type has the advantage of sparse spikes and on-demand nature (no output spikes if input signal does not change). The asynchronous spike encoder was achieved efficiently without ADC/DAC and special control circuits due to the positive/negative symmetric threshold and volatile characteristics of the VO₂ memristor. In the decision-making LSNN, the ALIF neuron plays a key role which was achieved efficiently with VO₂ memristor. The release of each spike will change the current in the discharge path by feedback, achieving self-adaptation. The incorporation of ALIF neurons significantly improved the accuracy of the LSNN. The neuromorphic physiological signal processing system based on memristor achieved high accuracies of 95.83% and 99.79% with a very small LSNN on arrhythmia classification and epileptic seizure detection tasks, respectively. Our work demonstrated the potential and high efficiency of memristor-based neuromorphic systems for physiological signal processing, facilitating the construction of next-generation human-machine interfaces.

Methods

Fabrication of VO₂ memristor devices

The 20 nm VO₂ films were epitaxially grown on c-Al₂O₃ substrates by pulsed-laser deposition (PLD) technique using a 308-nm XeCl excimer laser operated at an energy density of about 1 J cm⁻² and a repetition rate of 3 Hz. The VO₂ films were deposited at 530 °C in a flowing oxygen atmosphere at the oxygen pressure of 2.0 Pa. Then, the films were cooled down to room temperature at the speed of 20 °C min⁻¹. The deposition rate of VO₂ thin films was calibrated by X-ray Reflection (XRR).

The VO₂ memristor was designed as a planar structure with a channel length of 400 nm and a width of 2 μm. The electrodes, which are composed of Au (40 nm) and Ti (5 nm) with a distance of 400 nm, were patterned with electron beam lithography (EBL) along with electron beam evaporation and lift-off.

Electrical measurements

The VO₂ memristor was placed in a Signatone probe station to facilitate connections to the external circuit, source measurement unit and oscilloscope. As for measurements under various ambient pressures and in an N₂ environment in Supplementary Fig. 5, the VO₂ memristor was placed in a LakeShore cryogenic probe station. Electrical measurements were performed using an Agilent B1500A semiconductor parameter analyzer and the RIGOL MSO8104 digital storage oscilloscope. We used an Agilent B1500A semiconductor parameter analyzer to perform electrical measurements of a single VO₂ device in Fig. 2b and Supplementary Fig. 5. In Supplementary Figs. 4, 6 and 7, Agilent B1500A was applied to create the pulse signal, and the oscilloscope was used to measure either the voltage across the device or the current on the device. The experimental setup depicted in Supplementary Fig. 8 was used to connect the VO₂ device to the external LIF circuit for electrical measurements. In Fig. 2e–h and Supplementary Figs. 9–11, Agilent B1500A was applied to create the input signal, and the oscilloscope was used to measure the output of Agilent B1500A, the voltage on the capacitor and the output of the LIF neuron circuit.

The physiological signal dataset

The MIT-BIH heart arrhythmia database⁵⁴ contains 30 min ECG recordings from 48 subjects. In order to improve the simulation accuracy, the original ECG waveforms were resampled at a frequency of 1800 Hz and split into single heartbeats of ~556 ms (1000 timesteps). Then, the heartbeats were normalized to 0–0.6 V. In total, 2000 different heartbeats were used as the dataset for this task, wherein the Normal (N), Ventricular ectopic beat (VEB), Supraventricular ectopic beat (SVEB), and Fusion beat (F) classes consisted of 1000, 500, 250, and 250 heartbeats, respectively.

The seizure data was obtained from CHB-MIT Scalp EEG Database⁵⁶. The CHB-MIT database contains scalp EEG recordings from 22 patients at the Children’s Hospital Boston. The original sampling rate of the database is 256 Hz. The EEG data in this work was from patient 1 where the data were resampled to 800 Hz. We selected 2530 data clips from the database for training. Each data clip contained 1000 time-step data with 18 channels (~1.25 s). The test sets were arranged in a one-hour-long segment to simulate the real-world situation. The EEG waveforms were all normalized to 0–0.6 V.

The SPICE model of VO₂ memristor

The schematic diagram of the planar VO₂ memristor model is shown in Supplementary Fig. 12. First, the biasing polarity is determined. If ${V}_{{{{{{\rm{top}}}}}}}\ge {V}_{{{{{{\rm{bot}}}}}}}$, then the model compares ${V}_{{{{{{\rm{top}}}}}}}-{V}_{{{{{{\rm{bot}}}}}}}$ to the thresholds: in HRS, the model checks if ${V}_{{{{{{\rm{top}}}}}}}-{V}_{{{{{{\rm{bot}}}}}}}\ge {V}_{{{{{{\rm{th}}}}}}}$ (equivalently ${V}_{{{{{{\rm{top}}}}}}}\ge {V}_{{{{{{\rm{bot}}}}}}}+{V}_{{{{{{\rm{th}}}}}}}$), and in LRS, the model checks if ${V}_{{{{{{\rm{top}}}}}}}-{V}_{{{{{{\rm{bot}}}}}}}\le {V}_{{{{{{\rm{hold}}}}}}}$ (equivalently ${V}_{{{{{{\rm{top}}}}}}}\le {V}_{{{{{{\rm{bot}}}}}}}+{V}_{{{{{{\rm{hold}}}}}}}$). If ${V}_{{{{{{\rm{top}}}}}}} < {V}_{{{{{{\rm{bot}}}}}}}$, the model compares ${V}_{{{{{{\rm{bot}}}}}}}-{V}_{{{{{{\rm{top}}}}}}}$ to the thresholds, that is, checking ${V}_{{{{{{\rm{bot}}}}}}}\ge {V}_{{{{{{\rm{top}}}}}}}+{V}_{{{{{{\rm{th}}}}}}}$ and ${V}_{{{{{{\rm{bot}}}}}}}\le {V}_{{{{{{\rm{top}}}}}}}+{V}_{{{{{{\rm{hold}}}}}}}$ in HRS and LRS, respectively. The right-hand sides of these four inequalities are constructed using the four voltage sources on the left to give ${V}_{{{{{{\rm{top}}}}}}}^{+}$ and ${V}_{{{{{{\rm{bot}}}}}}}^{+}$, taking into account the state of the device given by V_o (1 in HRS, 0 in LRS)_. These comparisons are done by a comparator, which is modeled here by the behavioral voltage source V_o according to Eq. 9:

$${V}_{{{{{{\rm{o}}}}}}}=\frac{1}{2}\left[1+{{\tanh }}\left(2\alpha \Delta V\right)\right]$$

(9)

where ΔV is described by Eq. 10:

$$\Delta V=\left\{\begin{array}{cc}{V}_{{{{{{\rm{bot}}}}}}}^{+}-{V}_{{{{{{\rm{top}}}}}}},& {V}_{{{{{{\rm{top}}}}}}}\ge {V}_{{{{{{\rm{bot}}}}}}}\\ {V}_{{{{{{\rm{top}}}}}}}^{+}-{V}_{{{{{{\rm{bot}}}}}}},& {V}_{{{{{{\rm{top}}}}}}} < {V}_{{{{{{\rm{bot}}}}}}}\end{array}\right.$$

(10)

This will result in a hysteretic I-V behavior typical of such devices. To model the finite switching time, R₀ and C₀ are introduced to suppress instantaneous change in V_o. The resistance of the device, R_VO2, which is determined by V_c, is then given by Eq. 11:

$$\frac{1}{{R}_{{{{{{\rm{V}}}}}}{{{{{{\rm{O}}}}}}}_{2}}}=\frac{1-{V_{{{{{\rm{c}}}}}}}}{{R}_{{{{{{\rm{off}}}}}}}}+\frac{{V_{{{{{\rm{c}}}}}}}}{{R}_{{{{{{\rm{on}}}}}}}}$$

(11)

Simulation of the LSNN

The decision-making LSNN for physiological signal processing was implemented using the PyTorch-based SpikingJelly module⁶⁴. BPTT was employed to train the network. The total loss L is given by Eq. 12:

$$L=\frac{1}{B}\mathop{\sum }\limits_{i=1}^{B}\mathop{\sum }\limits_{j=1}^{C}-{t}_{{ij}}{{\log }}{{{{{\rm{\sigma }}}}}}\left({y}_{{ij}}\right)+\frac{{\lambda }_{{{{{{\rm{f}}}}}}}}{N}\mathop{\sum }\limits_{n=1}^{N}{\left(\bar{{f_n}}-{f_0}\right)}^{2}$$

(12)

The first term on the right-hand side is the categorical cross-entropy loss considering C number of categories and a training batch size of B. y_ij and t_ij are the raw output (logits) and the target output, respectively, of the j-th output neuron for the i-th input sample. σ(·) is the softmax function. The second term describes the spike regularization for sparse firing³², which is the mean squared difference between the average firing rate of each hidden neuron and the target frequency f₀. N is the total number of hidden neurons and λ_f is the regularization coefficient. The average firing rate of the n-th neuron is given by Eq. 13:

$$\bar{{f}_{n}}=\frac{1}{\Delta t}\cdot \frac{1}{B}\mathop{\sum }\limits_{i=1}^{B}\left(\frac{1}{T}\mathop{\sum }\limits_{j=1}^{T}{z}_{{ijn}}\right)$$

(13)

where T is the total number of timesteps, Δt is the length of each timestep, and z_ijn is the presence of a spike at the j-th timestep during the i-th input sample. Spiking neurons can be regarded as having a non-differentiable step activation function, thus a surrogate derivative described by Eq. 14 was used for gradient calculations^32,53.

$$f\left(x\right)={{\max }}\left[0,\,\gamma \left(1-\left|x\right|\right)\right]$$

(14)

The values for all relevant parameters are listed in Supplementary Table 6.

Data availability

All data supporting this study and its findings are available within the article, its Supplementary Information and associated files. The source data underlying Figs. 2b, c, e–l, 3c, d, 4b–d, 5a, c–j, 6a–o have been deposited in [https://zenodo.org/record/7888750] or are available from the corresponding author upon reasonable request.

Code availability

The codes used for the simulations are described in [https://github.com/pekjuntiw/NCOMMS-23-03137] or are available from the corresponding author upon reasonable request.

References

Faust, O. et al. Deep learning for healthcare applications based on physiological signals: a review. Comput. Methods Prog. Biomed. 161, 1–13 (2018).
Article Google Scholar
Zhao, S., Fang, C., Yang, J. & Sawan, M. Emerging energy-efficient biosignal-dedicated circuit techniques: a tutorial brief. IEEE Trans. Circuits Syst. II Express Briefs 69, 2592–2597 (2022).
Google Scholar
Chazal, P. D., O’Dwyer, M. & Reilly, R. B. Automatic classification of heartbeats using ECG morphology and heartbeat interval features. IEEE Trans. Biomed. Eng. 51, 1196–1206 (2004).
Article PubMed Google Scholar
Feigin, V. L. et al. Global, regional, and national burden of neurological disorders during 1990–2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet Neurol. 16, 877–897 (2017).
Article Google Scholar
Satti, A. T. et al. Microneedle array electrode-based wearable EMG system for detection of driver drowsiness through steering wheel grip. Sensors 21, 5091 (2021).
Fan, Y. et al. SafeDriving: an effective abnormal driving behavior detection system based on EMG signals. IEEE Internet Things J. 9, 12338–12350 (2022).
Article Google Scholar
Jung, J. et al. Development of wearable wireless electrocardiogram detection system using bluetooth low energy. Electronics 10, 608 (2021).
Article Google Scholar
Zidan, M. A., Strachan, J. P. & Lu, W. D. The future of electronics based on memristive systems. Nat. Electron. 1, 22–29 (2018).
Article Google Scholar
Zhang, W. et al. Neuro-inspired computing chips. Nat. Electron. 3, 371–382 (2020).
Article ADS Google Scholar
Kim, Y. et al. A bioinspired flexible organic artificial afferent nerve. Science 360, 998–1003 (2018).
Article ADS CAS PubMed Google Scholar
Corradi, F. et al. ECG-based heartbeat classification in neuromorphic hardware. in 2019 International Joint Conference on Neural Networks (IJCNN) 1–8 (2019).
Bauer, F. C., Muir, D. R. & Indiveri, G. Real-time ultra-low power ECG anomaly detection using an event-driven neuromorphic processor. IEEE Trans. Biomed. Circuits Syst. 13, 1575–1582 (2019).
Article PubMed Google Scholar
He, Y. et al. A 28.2 μC Neuromorphic sensing system featuring SNN-based near-sensor computation and event-driven body-channel communication for insertable cardiac monitoring. in 2021 IEEE Asian Solid-State Circuits Conference (A-SSCC) (2021).
Chu, H. et al. A neuromorphic processing system for low-power wearable ECG classification. in 2021 IEEE Biomedical Circuits and Systems Conference (BioCAS) (2021).
Sharifshazileh, M., Burelo, K., Sarnthein, J. & Indiveri, G. An electronic neuromorphic system for real-time detection of high frequency oscillations (HFO) in intracranial EEG. Nat. Commun. 12, 3095 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Yang, Y. et al. Electrochemical dynamics of nanoscale metallic inclusions in dielectrics. Nat. Commun. 5, 4232 (2014).
Article ADS CAS PubMed Google Scholar
Yang, Y. & Huang, R. Probing memristive switching in nanoionic devices. Nat. Electron. 1, 274–287 (2018).
Article Google Scholar
Yang, Y. et al. Probing nanoscale oxygen ion motion in memristive systems. Nat. Commun. 8, 15173 (2017).
Article ADS PubMed PubMed Central Google Scholar
Wang, Z. et al. Memristors with diffusive dynamics as synaptic emulators for neuromorphic computing. Nat. Mater. 16, 101–108 (2016).
Article ADS PubMed Google Scholar
Tuma, T. et al. Stochastic phase-change neurons. Nat. Nanotechnol. 11, 693–699 (2016).
Article ADS CAS PubMed Google Scholar
Yi, W. et al. Biological plausibility and stochasticity in scalable VO2 active memristor neurons. Nat. Commun. 9, 4661 (2018).
Article ADS PubMed PubMed Central Google Scholar
Wang, W. et al. Learning of spatiotemporal patterns in a spiking neural network with resistive switching synapses. Sci. Adv. 4, eaat4752 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Pickett, M. D., Medeiros-Ribeiro, G. & Williams, R. S. A scalable neuristor built with Mott memristors. Nat. Mater. 12, 114–117 (2012).
Article ADS PubMed Google Scholar
Wu, Q. et al. Full imitation of synaptic metaplasticity based on memristor devices. Nanoscale 10, 5875–5881 (2018).
Article CAS PubMed Google Scholar
Xia, Q. & Yang, J. J. Memristive crossbar arrays for brain-inspired computing. Nat. Mater. 18, 309–323 (2019).
Article ADS CAS PubMed Google Scholar
Ohno, T. et al. Short-term plasticity and long-term potentiation mimicked in single inorganic synapses. Nat. Mater. 10, 591–595 (2011).
Article ADS CAS PubMed Google Scholar
Duan, Q. et al. Spiking neurons with spatiotemporal dynamics and gain modulation for monolithically integrated memristive neural networks. Nat. Commun. 11, 3399 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Yuan, R. et al. A calibratable sensory neuron based on epitaxial VO2 for spike-based neuromorphic multisensory system. Nat. Commun. 13, 3973 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Dang, B. et al. Stochastic neuron based on IGZO Schottky diodes for neuromorphic computing. APL Mater. 7, 071114 (2019).
Article ADS Google Scholar
Wang, X. et al. A novel RRAM-based adaptive-threshold LIF neuron circuit for high recognition accuracy. in 2018 International Symposium on VLSI Technology, Systems and Application (VLSI-TSA) (2018).
Shaban, A., Bezugam, S. S. & Suri, M. An adaptive threshold neuron for recurrent spiking neural networks with nanodevice hardware implementation. Nat. Commun. 12, 4234 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Bellec, G. et al. Long short-term memory and learning-to-learn in networks of spiking neurons. in Advances in Neural Information Processing Systems 31 (2018).
Mark, J. W. & Todd, T. D. A nonuniform sampling approach to data compression. IEEE Trans. Commun. 29, 24–32 (1981).
Article Google Scholar
Hou, Y. et al. A 1-to-1-kHz, 4.2-to-544-nW, multi-level comparator based level-crossing ADC for IoT applications. IEEE Trans. Circuits Syst. II Express Briefs 65, 1390–1394 (2018).
Google Scholar
Liu, Y. et al. An 82nW 0.53pJ/SOP clock-free spiking neural network with 40µs latency for AloT wake-up functions using ultimate-event-driven bionic architecture and computing-in-memory technique. in 2022 IEEE International Solid-State Circuits Conference (ISSCC) (2022).
Hou, Y. et al. A 61-nW level-crossing ADC with adaptive sampling for biomedical applications. IEEE Trans. Circuits Syst. II Express Briefs 66, 56–60 (2019).
Google Scholar
Corradi, F. & Indiveri, G. A neuromorphic event-based neural recording system for smart brain-machine-interfaces. IEEE Trans. Biomed. Circuits Syst. 9, 699–709 (2015).
Article PubMed Google Scholar
Yang, M., Liu, S.-C. & Delbruck, T. A dynamic vision sensor with 1% temporal contrast sensitivity and in-pixel asynchronous delta modulator for event encoding. IEEE J. Solid-State Circuits 50, 2149–2160 (2015).
Article ADS Google Scholar
Lichtsteiner, P., Posch, C. & Delbruck, T. A 128x128 120 dB 15 μs latency asynchronous temporal contrast vision sensor. IEEE J. Solid-State Circuits 43, 566–576 (2008).
Article ADS Google Scholar
Dutta, S. et al. Programmable coupled oscillators for synchronized locomotion. Nat. Commun. 10, 3299 (2019).
Article ADS PubMed PubMed Central Google Scholar
Lappalainen, J., Mizsei, J. & Huotari, M. Neuromorphic thermal-electric circuits based on phase-change VO2 thin-film memristor elements. J. Appl. Phys. 125, 044501 (2019).
Article ADS Google Scholar
Kumar, S. et al. Sequential electronic and structural transitions in VO2 observed using X-ray absorption spectromicroscopy. Adv. Mater. 26, 7505–7509 (2014).
Article CAS PubMed Google Scholar
Zhou, Y. & Ramanathan, S. Mott memory and neuromorphic devices. Proc. IEEE 103, 1289–1310 (2015).
Article CAS Google Scholar
Shao, Z., Cao, X., Luo, H. & Jin, P. Recent progress in the phase-transition mechanism and modulation of vanadium dioxide materials. NPG Asia Mater. 10, 581–605 (2018).
Article Google Scholar
Morin, F. J. Oxides which show a metal-to-insulator transition at the neel temperature. Phys. Rev. Lett. 3, 34–36 (1959).
Article ADS CAS Google Scholar
Lee, S. B. et al. Origin of variation in switching voltages in threshold-switching phenomena of VO2 thin films. Appl. Phys. Lett. 102, 063501 (2013).
Article ADS Google Scholar
Kumar, S. et al. Local temperature redistribution and structural transition during Joule-Heating-driven conductance switching in VO2. Adv. Mater. 25, 6128–6132 (2013).
Article CAS PubMed Google Scholar
Narayan, J. & Bhosle, V. M. Phase transition and critical issues in structure-property correlations of vanadium oxide. J. Appl. Phys. 100, 103524 (2006).
Article ADS Google Scholar
Maffezzoni, P. et al. Modeling and simulation of vanadium dioxide relaxation oscillators. IEEE Trans. Circuits Syst. I Regul. Pap. 62, 2207–2215 (2015).
Article MathSciNet MATH Google Scholar
Indiveri, G. et al. Neuromorphic silicon neuron circuits. Front. Neurosci. 5, 73 (2011).
Article PubMed PubMed Central Google Scholar
Salaj, D. et al. Spike frequency adaptation supports network computations on temporally dispersed information. eLife 10, e65459 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zamani, M. et al. Flexible energy-efficient implementation of adaptive spiking encoder for neuromorphic processors. in 2021 IEEE International Symposium on Circuits and Systems (ISCAS) (2021).
Neftci, E. O., Mostafa, H. & Zenke, F. Surrogate gradient learning in spiking neural networks: bringing the power of gradient-based optimization to spiking neural networks. IEEE Signal Process. Mag. 36, 51–63 (2019).
Article Google Scholar
Moody, G. B. & Mark, R. G. The impact of the MIT-BIH arrhythmia database. IEEE Eng. Med. Biol. Mag. 20, 45–50 (2001).
Article CAS PubMed Google Scholar
Luz, E. Jd. S., Schwartz, W. R., Cámara-Chávez, G. & Menotti, D. ECG-based heartbeat classification for arrhythmia detection: a survey. Comput. Methods Prog. Biomed. 127, 144–164 (2016).
Article Google Scholar
Shoeb, A. H. Application of Machine Learning to Epileptic Seizure Onset Detection and Treatment. Massachusetts Institute of Technology (Harvard-MIT Division of Health Sciences and Technology, 2009).
Gu, Q., Zhu, L. & Cai, Z. Evaluation measures of the classification performance of imbalanced data sets. Part Commun. Comput. Inf. Sci. book Ser. 51, 461–471 (2009).
MATH Google Scholar
Liu, X. et al. Epileptic seizure detection based on variational mode decomposition and deep forest using EEG signals. Brain Sci. 12, 1275 (2022).
Article PubMed PubMed Central Google Scholar
O’Leary, G. et al. NURIP: neural interface processor for brain-state classification and programmable-waveform neurostimulation. IEEE J. Solid-State Circuits 53, 3150–3162 (2018).
Article ADS Google Scholar
Zhu, M., He, T. & Lee, C. Technologies toward next generation human machine interfaces: from machine learning enhanced tactile sensing to neuromorphic sensory systems. Appl. Phys. Rev. 7, 031305 (2020).
Article ADS CAS Google Scholar
Aetukuri, N. P. B., Harris, J. S., McIntyre, P. C. & Parkin, S. S. P. The Control of Metal-insulator Transition in Vanadium Dioxide. Stanford University (Department of Materials Science and Engineering, 2013).
Bohaichuk, S. M. et al. Fast spiking of a Mott VO2–carbon nanotube composite device. Nano Lett. 19, 6751–6755 (2019).
Article ADS CAS PubMed Google Scholar
Bohaichuk, S. M. et al. Localized triggering of the insulator-metal transition in VO2 using a single carbon nanotube. ACS Nano 13, 11070–11077 (2019).
Article CAS PubMed Google Scholar
Fang, W. et al. SpikingJelly. https://github.com/fangwei123456/spikingjelly (2020).

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (61925401, 92064004, 61927901, 92164302) and the 111 Project (B18001). Y.Y. acknowledges support from the Fok Ying-Tong Education Foundation and the Tencent Foundation through the XPLORER PRIZE.

Author information

These authors contributed equally: Rui Yuan, Pek Jun Tiw.

Authors and Affiliations

Beijing Advanced Innovation Center for Integrated Circuits, School of Integrated Circuits, Peking University, Beijing, 100871, China
Rui Yuan, Pek Jun Tiw, Lei Cai, Chang Liu, Teng Zhang, Ru Huang & Yuchao Yang
School of Electronic and Computer Engineering, Peking University, Shenzhen, 518055, China
Zhiyu Yang & Yuchao Yang
Beijing National Laboratory for Condensed Matter Physics, Institute of Physics, Chinese Academy of Sciences, Beijing, 100190, China
Chen Ge
Center for Brain Inspired Chips, Institute for Artificial Intelligence, Frontiers Science Center for Nano-optoelectronics, Peking University, Beijing, 100871, China
Yuchao Yang
Center for Brain Inspired Intelligence, Chinese Institute for Brain Research (CIBR), Beijing, Beijing, 102206, China
Yuchao Yang

Authors

Rui Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Pek Jun Tiw
View author publications
You can also search for this author in PubMed Google Scholar
Lei Cai
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyu Yang
View author publications
You can also search for this author in PubMed Google Scholar
Chang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Teng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chen Ge
View author publications
You can also search for this author in PubMed Google Scholar
Ru Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yuchao Yang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.Y. and P.J.T. contributed equally to this work. R.Y., C.L., and C.G. fabricated the VO₂ devices. R.Y., T.Z., and C.L. performed electrical measurements. R.Y., P. J.T., L.C., and Z.Y. performed the simulations. R.Y., P.J.T., and Y.Y. prepared the manuscript. Y.Y. and R.H. directed all the research. All authors analyzed the results and implications and commented on the manuscript at all stages.

Corresponding author

Correspondence to Yuchao Yang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks, Viswanath Balakrishnan and Mario Lanza for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yuan, R., Tiw, P.J., Cai, L. et al. A neuromorphic physiological signal processing system based on VO₂ memristor for next-generation human-machine interface. Nat Commun 14, 3695 (2023). https://doi.org/10.1038/s41467-023-39430-4

Download citation

Received: 21 January 2023
Accepted: 08 June 2023
Published: 21 June 2023
DOI: https://doi.org/10.1038/s41467-023-39430-4

This article is cited by

VO2 memristor-based frequency converter with in-situ synthesize and mix for wireless internet-of-things
- Chang Liu
- Pek Jun Tiw
- Yuchao Yang
Nature Communications (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.