Novel nondelay-based reservoir computing with a single micromechanical nonlinear resonator for high-efficiency information processing

Sun, Jie; Yang, Wuhao; Zheng, Tianyi; Xiong, Xingyin; Liu, Yunfei; Wang, Zheng; Li, Zhitian; Zou, Xudong

doi:10.1038/s41378-021-00313-7

Download PDF

Article
Open access
Published: 20 October 2021

Novel nondelay-based reservoir computing with a single micromechanical nonlinear resonator for high-efficiency information processing

Jie Sun ORCID: orcid.org/0000-0002-2560-106X^1,2,
Wuhao Yang¹,
Tianyi Zheng^1,2,
Xingyin Xiong¹,
Yunfei Liu^1,2,
Zheng Wang³,
Zhitian Li¹ &
…
Xudong Zou^1,2,3

Microsystems & Nanoengineering volume 7, Article number: 83 (2021) Cite this article

3894 Accesses
14 Citations
Metrics details

Subjects

Abstract

Reservoir computing is a potential neuromorphic paradigm for promoting future disruptive applications in the era of the Internet of Things, owing to its well-known low training cost and compatibility with hardware. It has been successfully implemented by injecting an input signal into a spatially extended reservoir of nonlinear nodes or a temporally extended reservoir of a delayed feedback system to perform temporal information processing. Here we propose a novel nondelay-based reservoir computer using only a single micromechanical resonator with hybrid nonlinear dynamics that removes the usually required delayed feedback loop. The hybrid nonlinear dynamics of the resonator comprise a transient nonlinear response, and a Duffing nonlinear response is first used for reservoir computing. Due to the richness of this nonlinearity, the usually required delayed feedback loop can be omitted. To further simplify and improve the efficiency of reservoir computing, a self-masking process is utilized in our novel reservoir computer. Specifically, we numerically and experimentally demonstrate its excellent performance, and our system achieves a high recognition accuracy of 93% on a handwritten digit recognition benchmark and a normalized mean square error of 0.051 in a nonlinear autoregressive moving average task, which reveals its memory capacity. Furthermore, it also achieves 97.17 ± 1% accuracy on an actual human motion gesture classification task constructed from a six-axis IMU sensor. These remarkable results verify the feasibility of our system and open up a new pathway for the hardware implementation of reservoir computing.

A memristor-based analogue reservoir computing system for real-time and power-efficient signal processing

Article 26 September 2022

Rotating neurons for all-analog implementation of cyclic reservoir computing

Article Open access 23 March 2022

Dynamic memristor-based reservoir computing for high-efficiency temporal signal processing

Article Open access 18 January 2021

Introduction

Recently, emerging sensor applications, such as the Internet of Things (IoT)¹ and ubiquitous sensing, require sensors with smaller size and lower power consumption, as well as “edge computing”² capabilities, to process a deluge of data locally. These expanding computing requirements have motivated the creation of new and specialized computing paradigms to break through the “von Neumann bottleneck”. Among them, neuromorphic computing that mimics a biological neural network has been advocated as a candidate in recent years because of its high energy efficiency³. As a neuromorphic computing paradigm, reservoir computing (RC)^4,5,6 was originally a recurrent neural network (RNN)^7,8,9 framework and is therefore suitable for temporal information processing. RC is different from conventional RNNs in that the weights on the recurrent connections in the reservoir are not trained; only the output connection weights in the readout are trained, which makes it possible to drastically reduce the computational cost of learning. More importantly, the hardware implementation of RC can be achieved using a variety of nonlinear dynamic systems with nonlinearity and fading memory (or short-term memory). Because a mechanism for adaptive changes is not necessary for training, the main two implementation structures are RC based on numerous randomly interacting nonlinear nodes and a time-delayed nonlinear system¹⁰.

RC based on spatially extended nodes provides efficient parallel information processing^11,12. However, it suffers from the complexity of hardware implementation. Simple RC based on a time-delayed nonlinear system possessing only a single nonlinear node has been proposed. It can emulate the spatially extended nodes of RC using virtual nodes temporally extended along with a delayed feedback. This superiority has recently motivated the search for hardware implementation using emerging devices, such as electronic devices¹³, optical systems^{14,15,16,17,18,19}, spintronic devices²⁰, dynamic memristors^{21,22,23,24,25}, and mechanical resonators^26,27. However, the delayed feedback and the additional masking procedure reduce processing efficiency. Recent studies have aimed to solve this problem by optimizing the system parameters^13,28, such as mask length and feedback strength, or using different feedback structures, such as double feedback loops¹⁶ and parallel multiple feedback loops^29,30. However, the effect of the nonlinear characteristics of the device on the performance of the RC system has rarely been studied. Moreover, an initial report demonstrated the feasibility of RC with a single “delay-coupled” nonlinear microelectromechanical system (MEMS) resonator, and its best classification accuracy was only 78+2% for the TI-46 recognition benchmark.

In this work, we propose a novel reservoir computer structure using a single micromechanical resonator with hybrid nonlinear dynamics and omitting time-delayed feedback. Moreover, we focus on the well-known nonlinear dynamics of the micromechanical resonator and first propose a hybrid nonlinear response (HNL), which comprises the transient nonlinear response (TNL) and the Duffing nonlinear response (DuNL). Due to the dynamic richness of the HNL, time-delayed feedback can be removed to achieve high-efficiency RC. Furthermore, we define a self-masking process to replace the traditional masking procedure to simplify and improve the efficiency of RC. The self-masking process directly feeds serialized input data into the reservoir, reshaped by the e-exponential characteristics of TNL with a certain temporal solution, and then picks up the nonlinear cumulative response at the separation time. Since our mask procedure utilizes the self-nonlinear characteristics of the reservoir, the masking procedure and RC are simultaneously completed, which is why we call it a self-masking process.

This allows us to achieve a novel RC architecture using the HNL with the self-masking process for high-efficiency temporal pattern classification, such as the Mixed National Institute of Standards and Technology (MNIST) handwritten digit task, TI-46 spoken word recognition benchmark, and human motion gesture recognition task sensing, from a six-axis inertial measurement unit (IMU) sensor. To evaluate the memory capacity of our system, we perform a nonlinear autoregressive moving average (NARMA) task. The results show that this novel structure can effectively adjust the nonlinear richness of the system to adapt to the specific pattern classification task¹³; it also reduces system multiparameter optimization difficulties and simplifies control loop complexities. More importantly, the simple structure and device compatibility with MEMS can facilitate the hardware implementation of RC and promote the emergence of disruptive applications using MEMS technology in the future IoT era.

Results

Hybrid nonlinear resonator-based RC system

In the time-delayed RC, structural parameters such as the mask function, the number of virtual nodes, and feedback strength should be optimized to generate a sufficiently rich reservoir state, which reduces processing efficiency. In particular, to obtain a large number of different transient responses to the input, the input signal is time-multiplexed by a mask function that serves the dual purpose of serializing the input and maximizing the effectively used dimensionality of the system. Therefore, we propose hybrid nonlinear RC with a self-masking process. The basic principle of our scheme is shown in Fig. 1a. It is composed of three distinct parts: an input layer, a reservoir, and an output layer. The serialized input signals are fed to the reservoir after preprocessing. Then the self-masking process and nonlinear transformation are simultaneously realized in the hybrid nonlinear reservoir. Thus, the reservoir states are sampled through postprocessing, and the training and test procedures are implemented using a linear regression algorithm. Compared with time-delayed feedback RC¹³, we directly serialize the input stream U(t) and feed it into the reservoir, and different degrees of nonlinear cumulative effects can be obtained by the self-masking process, which simplifies the masking procedures and improves the information processing efficiency.

**Fig. 1: RC system architecture and hybrid nonlinear phenomena.**

The concept of HNL we introduce ensures the rich nonlinear dynamics of the reservoir. Figure 1b shows the envelope detection result of the HNL oscillation response of the resonator, which can be roughly divided into four stages: T1 represents the oscillation starting stage, T2 the steady-state oscillation stage, T3 the oscillation attenuation stage, and T4 the oscillation resting stage. For T1–T4, all operate in the Duffing nonlinear oscillation state of the resonator, but the T1 and T3 stages also operate in the TNL. We select θ < T (T = T1 = T3) for better state richness due to the HNL we propose. Thus, the masking procedure in time-delayed RC can be replaced with the self-masking process, and the feedback loop is not necessary. As a result of this simple structure, the system is capable of generating a sufficiently rich reservoir state for high-efficiency information processing.

Model analysis of hybrid nonlinear RC

In this hybrid nonlinear RC system, a microelectromechanical clamped–clamped (C-C) silicon beam resonator is used as the reservoir to nonlinearly map the input data into a higher-dimensional state space, which can also be seen as a typical underdamped second-order oscillation system. This hybrid nonlinear reservoir combines the DuNL characteristics of the resonator^31,32 and its transient exponential nonlinear response characteristics as a typical second-order oscillation system, which guarantees the rich nonlinear dynamics of single resonator RC to process the pattern classification tasks. The resonator is driven and detected by the parallel plate electrostatic force. A scanning electron microscopic (SEM) image of the resonator is shown in Fig. 3, and its displacement can be approximated by the Duffing nonlinear equation:

$$\ddot x + 2\xi w_n\dot x + w_n^2x + \beta x^3 = F_{\rm{d}}(t),$$

(1)

where x, $\dot x$, and $\ddot x$ are the displacement, velocity, and acceleration of the resonator, respectively, $w_{\rm{n}} = 2\pi f_{\rm{n}}$ is the natural angular frequency of the resonator in its linear regime, $\xi = \frac{1}{{2Q}}$ is the damping ratio, Q is the quality factor, and F_d is the force per unit mass driving the beam. Note that β is the coefficient controlling the amount of nonlinearity in the restoring force and introduces the Duffing nonlinearity to the equation. In the case of the C-C beam (Figs. 2e and 3), the value of β can be estimated by³²

$$\beta = \frac{{32E}}{{\surd 2\rho L^4{{{\mathrm{A}}}}}}$$

(2)

where E is the silicon Young’s modulus, L is the beam length, ρ is its density, and A is a constant term. Equation (2) further indicates that the geometric nature of the nonlinearity of the resonator depends on the beam length. Short beams can cause a larger nonlinear restoring force term than long beams but need a larger excitation amplitude.

**Fig. 2: Nonlinear resonator and RC models.**

**Fig. 3: Schematic of the experimental reservoir computer.**

The size information of the (C-C) silicon beam resonator is described in the “Device fabrication” part, which is designed to satisfy the demands for the appropriate value of β and Q. To drive the C-C beam to the sufficient nonlinear region with less energy consumption, an expected β value of 2.4 × 10²² Hz² m⁻² with an effective beam length of 500 µm should be determined. If β = 0, the beam will oscillate periodically in a linear region, and we can obtain the analytical solution $x\left( t \right) = 1 \pm \frac{{e^{ - \frac{{w_{\rm{n}}}}{{2Q}}t}}}{{\sqrt {1 - \left( {\frac{1}{{2Q}}} \right)^2} }}$. When assuming the initial condition F_d = 0, this e-exponential term is introduced as the TNL. The HNL provides sufficiently rich computing dynamics when mixing the β and e-exponential terms. In addition, the natural frequency and quality factor also influence the performance of RC. To maintain a certain memory capacity, a high quality factor Q >1000 should be considered, which determines the decay time for the TNL. To further reduce the “anchor loss”, a unique “cross” anchor design is adopted.

In the hardware implementation of time-delayed RC by a mechanical oscillator^26,27, a high natural frequency and relatively low value quality factor must be combined for a higher processing speed ${{t}}_0^{ - 1} = \frac{{\pi f_{\rm{n}}}}{{MQ}}$, where M is the number of virtual nodes. Generally, several hundred virtual nodes²⁸ (M ~ 400) are employed to obtain good performance; thus, the system needs a larger driving voltage to operate in suitable nonlinear states because of the low quality factor (Q ~ 100). While the number of virtual nodes M is not needed in the self-masking process, a high quality factor (Q ~ 4300) can be chosen, which not only ensures the processing speed but also improves the nonlinear effect of the TNL.

As mentioned above, the characterization of the nonlinear dynamic response of the beam is crucial since it is the source of reservoir nonlinearity. A “frequency sweep” and “amplitude sweep” are common characterization methods used to analyze the nonlinear response of a beam; therefore, we construct a numerical simulation of Eq. (1) to determine several main parameters, which are used to drive the resonator into an appropriate nonlinear state for the realization of reservoir state richness, such as the driving frequency f_d and the excitation amplitude.

The Duffing nonlinear frequency/amplitude response can be observed in Fig. 2a, b, and the nonlinear solutions to (1) equation of motion have been well studied previously³². Finite element analysis is performed through the solid mechanics interface of COMSOL Multiphysics 5.3a to further simulate the dynamic vibration modes of the beam. Figure 2e shows the first mode shape. Moreover, to ensure that the beam works in a higher signal-to-noise ratio and stability amplitude output at the specified drive frequency, the value at the front bifurcation point (g point) of the frequency hysteresis loop is selected³³, as shown in Fig. 2c. The sweep results of the simulation are almost the same as the experimental results (Fig. 2c, d), verifying the feasibility of the constructed model. Therefore, we can choose suitable parameters and states to verify our HNL-RC concept.

After determining the key parameters of the beam, we choose the handwritten digit recognition dataset to compare the classification performance of the TNL- and HNL-RC models. We choose a subdataset to test or optimize the system, which contains 1000 samples with 10 classes randomly selected from the MNIST dataset³⁴: 100 samples for the test set, and 900 samples for the training set. Preprocessing is performed before the samples are input to the reservoir to reduce redundant information of the input signal, as shown in Fig. 4a. The details are shown in the “Methods” section.

**Fig. 4: Handwritten digit recognition using an HNL resonator-based RC system.**

Figure 2k shows the reservoir states corresponding to the four test samples shown in Fig. 2j for the two different models. The reservoir states of the two models are significantly different, preliminarily verifying the rationality of the above analysis. The reservoir state is then used as input to the readout function via ridge regression for training and classification. The better temporal information processing ability of the HNL reservoir is clearly revealed by recognizing the test dataset; the classification accuracy rate of the TNL model is 88% and that of the HNL model is 91%. We thus demonstrate that the novel RC structure possesses highly efficient information processing capabilities and good classification accuracy.

Experimental set-up for single-resonator RC

When verifying the concept of the hybrid nonlinear RC structure by the simulation, hardware implementation experiments should be further established to verify the feasibility of the system. We select the widely studied MEMS C-C beam resonator, and its nonlinear oscillation characteristic is verified through numerical simulation and yields similar results. The SEM image and the experimental set-up are shown in Fig. 3.

In the experimental set-up of our system, the hardware implementation of the time-delayed RC²⁶ is simplified because of the self-masking process, including the multiplier applied to multiply input digital data U(t) with a mask. The adder is used to add the feedback signal, the shift register, and amplifier for the precise time delay, which are no longer required in our system. When the sensing electrode obtains the state response, an envelope filter (ENV) and a downsampling module (D-Samples) are set behind the analog-to-digital converter (ADC) because it is convenient to adjust the response waveform under different nonlinear response conditions using the LabVIEW program. Therefore, the experimental set-up of the single resonator RC better simplifies the complexity compared with the time-delayed feedback RC and improves the flexibility and information processing efficiency.

MNIST handwritten digit recognition

The nonlinear cumulative effect during the self-masking process can be adjusted by varying the parameter θ, ${\theta} = n \cdot {{T}}\left( {n \in \left[ {0,1} \right]} \right)$, where ${{T}} = \frac{{2Q}}{{w_{\rm{n}}}}$ is the decay time of the resonator. Here we ignore the influence of the decay time itself because T is not changed under the specific experimental conditions. According to previous research^13,26 and our simulation analysis, the separation time value is set at $\theta \sim \frac{1}{2}T = 0.008\,{\rm{s}}$ to offer optimal performance. A parameter optimization trial is designed by changing the parameter θ with the subdataset, and the results are shown in Fig. 4b, c. We can obtain a classification accuracy of 67% when $\theta = \frac{1}{{80}}{{T}}$; increasing the separation time to $\frac{1}{2}{{T}}$ can potentially achieve 91% accuracy, and the accuracy can be lowered to 88% as θ continues to increase because the steady-state oscillation stages only have one nonlinear response. Consequently, we experimentally verify that the richness of the nonlinear dynamics of the reservoir can be effectively optimized by varying the separation time θ so that it can be more widely and effectively applied to different types of classification tasks with this hybrid nonlinear RC system.

In addition, after parameter optimization and selection of specific nonlinear vibration states, it is vital to further verify that the system is suitable for MNIST tasks with a large amount of data. We randomly select 25,000 samples from the MNIST dataset, of which 2500 samples are used as the test set and 22,500 samples are used as the training set. When we set the separation time $\theta = \frac{1}{2}T$, the final classification accuracy obtained from the RC system is 93%. Figure 4d shows a false-color confusion matrix highlighting the experimentally obtained classification results from the RC system versus the desired outputs.

NARMA task to assess memory capacity

Different tasks require different key properties to make a correct estimation of the target function. While classification tasks require a strong nonlinear transformation¹³, forecasting tasks are strongly dependent on good linear memory. In the experiments assessing the handwritten digit recognition task, we demonstrate the high-efficiency classification performance of the HNL reservoir. Here we can also verify the memory capacity of this reservoir through the NARMA benchmark.

NARMA is an acronym for the nonlinear autoregressive moving average. It is one of the most widely used benchmarks for measuring memory ability. The parameter n represents the correlation between the current and the previous n data. We choose the task parameter n = 1, as the nearest-neighbor correlation in this reservoir framework only exists between the virtual nodes (the details are presented in “Methods”). Based on the research results¹³, we should choose a weak nonlinearity condition to obtain better forecasting precision by selecting a suitable θ, which is not smaller than T. Another option is reducing the parameter Q to weaken the Duffing nonlinear effect. Formula (3) is the transfer function of the NARMA1 task, described as follows:

$$y_k = 0.3y_{k - 1} + 0.05y_{k - 1}^2 + 1.5u_k^2 + 0.1.$$

(3)

where k is the length of the training and test sequences, and the input u(k) is generated from a uniform density in [0,0.5].

To quantify the performance of the reservoir, the normalized mean square error (NMSE) of the predicted value versus the value obtained from the NARMA model is used. The details are described in the “Methods” section. In Fig. 5, we depict the predicted results versus the target value. The training set result is NMSE = 5.5e-3, and the test set result is NMSE = 0.051. Therefore, we achieve memory capacity in the HNL reservoir but have the potential to realize a longer memory capacity with the novel architecture.

Motion gesture recognition of six-axis IMU sensor

To further verify the high-efficiency information processing ability of the proposed HNL-RC system for the real sensing of temporal signals and demonstrate its application potential in real-world scenarios, we design an application scenario to recognize the different human motion gestures by our proposed HNL-RC system. Signal data acquisition from a homemade six-axis IMU sensor, which integrates commercial three-axis accelerometers and three-axis gyroscopes, is performed using the functional integrated circuits made by our research group.

Figure 6a, b shows the optical image of the six-axis IMU sensor and wearing effect and the samples of four out of eight different motion gestures, which include jumping jacks, jogging, walking, squatting, stretching, chest expansion, arm circling, and body circling. Figure 6c shows the response of the sensor when the subject performs the eight gestures. The preprocessing involves only smooth filtering with a 30-point window length to reduce the noise, sampling, and normalization of each waveform. The final signal contains 600 feature points. We train and test the system with the motion gesture sample set, which consists of 8 actions, each repeated 20 times, for a total of 3 subjects. After obtaining the response of the HNL-RC, as shown in Fig. 6d, e, tenfold cross-validation is used to obtain the optimal weight matrix to prevent the system from overfitting to specific training and test data. The experimental conditions are the same as above, and the input voltage streams with 10 different time intervals θ (0.05T, 0.1T, 0.2T, 0.3T, 0.4T, 0.5T, 0.7T, 0.9T, T, 2T) and the optimal θ = 0.2T.

**Fig. 6: Motion gesture recognition of the six-axis IMU sensor using the HNL-RC system.**

Figure 6g shows the best recognition results obtained by the HNL-RC system; it can achieve (97.17 ± 1)% recognition accuracy for the real sensing signal from the six-axis IMU sensor. Moreover, if the application scenario is changed to only the motion gesture recognition of a certain person, a classification accuracy rate of (99.29 ± 0.5)% can be obtained by using this system. Therefore, the single resonator-based HNL-RC system is verified as a new architecture with high efficiency information processing ability.

Discussion

The remarkable performance demonstrates the high-efficiency information processing ability for pattern recognition tasks and short-term memory capacity for simple forecasting tasks, which shows the feasibility of our system and opens up a new pathway for the hardware implementation of RC. The use of a hybrid nonlinear system can simplify the hardware reservoir implementation of RC, and it can improve the computation rate compared with traditional time-delayed architectures.

To further illustrate the excellent performance and reliability of the new architecture proposed here, we test the MNIST handwritten digit recognition benchmark by our HNL-RC system based on a single resonator, and the classification accuracy is better than that of memristor-based RC, which uses 88 memristors²⁴. Furthermore, a “similar” preprocessing procedure is performed in the TI-46 spoken word classification task. We construct a time-delayed feedback reservoir system using the same resonator to compare the HNL-RC system, and the experimental results show that the classification accuracy of the latter (87.4%) is superior to that of the former (78%). All of these results verify the high efficiency and accuracy pattern recognition ability of this novel HNL-RC architecture. The NARMA benchmark verifies the regression forecasting ability when n = 1 for the memory capacity, which we will improve in future work. Then we design a motion gesture recognition experiment to test the feasibility of the architecture. The sample set is composed of a real signal sensed from a six-axis IMU sensor or three accelerometers. The high classification accuracy in the “Results” section proves the high-efficiency signal classification capability, and it also provides basic performance verification for future “sensing + computing” integrated device applications in this novel hybrid nonlinear RC hardware system.

Conclusion

In summary, we propose a novel RC architecture using a single micromechanical resonator with hybrid nonlinear dynamics while omitting time-delayed feedback. Based on this approach, we numerically and experimentally analyze the nonlinear response of the resonator and first propose that the hybrid nonlinear dynamics of the resonator comprise hybrid types of nonlinear responses, transient responses, and Duffing responses. Moreover, a self-masking process is defined based on the approach. We also perform two typical tasks and one real signal sample-set task sensed by a self-assembled six-axis IMU sensor to verify its classification capability and memory capacity. Experimental tests on the MNIST dataset show a high accuracy of 93% for handwritten digit classification, the motion gesture classification accuracy of the sample set composed of three subjects is 98.17%, and the accuracy can reach 99. 79% when the sample set is composed of only one subject. For the NARMA task, the results show that the NMSE is 0.051 when the correlation parameter n equals 1, which is consistent with the situation wherein only the nearest-neighbor input is correlated in the reservoir.

Considering the simple structure of our system and the device compatibility with MEMS, we expect that the proposed novel structure can facilitate the hardware implementation of RC and inspire emerging applications using MEMS technology in the future IoT era.

Methods

Device fabrication

The C-C beam resonator is microfabricated on (100) p-doped silicon on a glass substrate by the standard silicon-on-glass process. A device layer thickness of 40 µm defines the width of the beam; the length, in-plane thickness, and the gap between the beam and the drive/sense electrode are chosen to be 500, 6.5, and 3 µm, respectively, and the electrode length is 360 µm. For more complex nonlinearity of the resonator, we select the parallel plate drive and detection mode instead of the comb drive mode. For the COMSOL simulation diagram and the actual device diagram of the designed resonator, please refer to Figs. 2e and 3, respectively.

Device characterizations

The experimental single resonator RC system is realized with a personal computer (PC), an NI 6366 X Series Data Acquisition (SDA), and a resonator device with a functional interface circuit (IC). The PC is used to run the loop of the control algorithm, which is programmed by LabVIEW 17.0; the SDA is used to realize the function of an ADC and a digital-to-analog converter, which are 12 bits, and the functional IC contains a trans-impedance amplifier module, a second amplifier module, and a bandpass filter module for transforming, amplifying, and filtering the response signal, respectively.

Suitable driving parameters should be chosen before the final test with special tasks. A Zurich lock-phase amplifier is the most commonly used basic performance measurement instrument for MEMS devices. It is used to perform the open-loop frequency scanning test to determine the required driving frequency and the effective quality factor.

Mixed National Institute of Standards and Technology

The MNIST database³⁴ is a large dataset that is commonly used for training and testing classification capacity. The database was created by “remixing” the digit samples written by high school students and employees of the United States Census Bureau and consists of 60,000 training samples and 10,000 test samples. Each sample in the dataset is composed of a 28 × 28 gray value matrix. Preprocessing was performed before the images were fed into the reservoir, as shown in Fig. 4a. Taking the image of 6 as an example, the original grayscale image of 28 × 28 pixels was trimmed to a 22 × 20 pixel image to reduce redundant information. Then the 22 × 20 pixel matrix was transformed into 1 × 440 temporal sequences of input pulse streams with separation time θ, serializing the N = 440 input signal as the “neural” nodes of the HNL reservoir.

Finally, we obtained a 440 × 10 readout network that was used for classification after training. To perform the MNIST classification function, we need to construct ten appropriate target functions as ten linear classifiers, each of which is a polynomial function composed of the optimal weight coefficient vector, ${{{\mathrm{y}}}}_i({{{\mathrm{t}}}}) = {{{\boldsymbol{w}}}}_i^T{{{\boldsymbol{x}}}}(t)$ with i = 10. For every test sample, the function is applied to select the actual digit through a winner-takes-all approach. The target function is +0 if the handwritten digit does not correspond to the sought digit and +1 if it does. We called this postprocessing.

NARMA task

The NARMA task is one of the most widely used benchmarks for measuring RC memory capacity. The full name is the nonlinear autoregressive moving average¹³. It is used in many other publications in the context of RC, such as refs. ^5,35. For the NARMA task, the input u(k) is generated from a uniform density in [0,0.5]. Then, after being normalized, the variable n is a positive integer value of [1,∞], where a larger n represents a stronger correlation between the generated data contexts, which means longer memory length. The target y_k is given by the following recursive formula:

$$\begin{array}{*{20}{ll}} {u\left( k \right)} \, = \, {rand\left[ {0,0.5} \right],} \\ {U\left( k \right)} \, = \, {2 \ast u\left( k \right),} \\ \quad\,\,{y_k} \, = \, {0.3y_{k - 1} + 0.05y_{k - 1}\left( {\mathop {\sum}\nolimits_i^{n - 1} {y_{k - i - 1}} } \right) + 1.5u_{k - n + 1}u_k + 0.1} \end{array}$$

(4)

Different n values represent different correlations. Our new model is designed for pattern classification tasks that need strong nonlinear mapping ability; therefore, it sacrifices a certain memory capacity in this special RC framework. We choose n = 1, which indicates that the current input is only associated with the last previous response. For the regularization, training, and testing of the dynamic system modeling task, we used two samples with a length of 300 points as the dataset, one for the training and one for the testing. To calculate the memory capacity of the RC system, we calculated our output signal error using the NMSE, which is defined as follows:

$${\rm{NMSE}} = \frac{{\mathop {\sum}\nolimits_{k = 1}^m {\mathop {\sum}\nolimits_{i \in {{{\mathrm{O}}}}} {\left( {p_i\left( k \right) - y_i\left( k \right)} \right)^2} } }}{{\mathop {\sum}\nolimits_{k = 1}^m {\mathop {\sum}\nolimits_{i \in {{{\mathrm{O}}}}} {y_i^2\left( k \right)} } }}$$

(5)

where p(k) is the predicted signal, y(k) is the original signal, and m is the number of time steps in the target function.

Readout function training via ridge regression

The reservoir readout layer was constructed by a linear regression algorithm. We chose ridge regression with Tikhonov regularization to prevent data from overflowing during training and adjusted the weights to minimize the mean squared error between y and y_t.

$$\begin{array}{*{20}{ll}} {y\left( t \right)} \, = \, {{{{\boldsymbol{w}}}}^{{{\boldsymbol{T}}}}{{{\boldsymbol{x}}}}\left( t \right),} \\ \quad{{{\boldsymbol{w}}}} \, = \, {{{{\boldsymbol{y}}}}_{{{\boldsymbol{t}}}}X^T(XX^T + \lambda I)^{ - 1}} \end{array},$$

(6)

where w is a vector of weights, y_t is the target vector, X is the data matrix that contains y(t) and x(t), and λ is the regularization coefficient.

References

Al-Fuqaha, A., Guizani, M., Mohammadi, M., Aledhari, M. & Ayyash, M. Internet of Things: a survey on enabling technologies, protocols, and applications. IEEE Commun. Surv. Tutor. 17, 2347–2376 (2015).
Article Google Scholar
Shi, W., Cao, J., Zhang, Q., Li, Y. & Xu, L. Edge computing: vision and challenges. IEEE Internet Things J. 3, 637–646 (2016).
Article Google Scholar
Merolla, P. A. et al. A million spiking-neuron integrated circuit with a scalable communication network and interface. Science 345, 668–673 (2014).
Article Google Scholar
Lukosevicius, M. & Jaeger, H. Overview of Reservoir Recipes (Jacobs University Bremen, 2007).
Jaeger, H. Adaptive nonlinear system identification with echo state networks. in Advances in Neural Information Processing Systems 609–616 (NIPS, 2003).
Maass, W., Natschläger, T. & Markram, H. Real-time computing without stable states: a new framework for neural computation based on perturbations. Neural Comput. 14, 2531–2560 (2002).
Article Google Scholar
Graves, A. Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850 (2013).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Article Google Scholar
Werbos, P. J. Backpropagation through time: what it does and how to do it. Proc. IEEE 78, 1550–1560 (1990).
Article Google Scholar
Tanaka, G. et al. Recent advances in physical reservoir computing: a review. Neural Netw. 115, 100–123 (2019).
Liu, Z. et al. Multichannel parallel processing of neural signals in memristor arrays. Sci. Adv. 6, eabc4797 (2020).
Article Google Scholar
Vandoorne, K. et al. Experimental demonstration of reservoir computing on a silicon photonics chip. Nat. Commun. 5, 1–6 (2014).
Article Google Scholar
Appeltant, L. et al. Information processing using a single dynamical node as complex system. Nat. Commun. 2, 1–6 (2011).
Article Google Scholar
Larger, L. et al. High-speed photonic reservoir computing using a time-delay-based architecture: million words per second classification. Phys. Rev. X 7, 011015 (2017).
Google Scholar
Duport, F., Schneider, B., Smerieri, A., Haelterman, M. & Massar, S. All-optical reservoir computing. Opt. Express 20, 22783 (2012).
Article Google Scholar
Chen, Y. et al. Reservoir computing system with double optoelectronic feedback loops. Opt. Express 27, 27431–27440 (2019).
Article Google Scholar
Larger, L. et al. Photonic information processing beyond Turing: an optoelectronic implementation of reservoir computing. Opt. Express 20, 3241–3249 (2012).
Article Google Scholar
Paquot, Y. et al. Optoelectronic reservoir computing. Sci. Rep. https://doi.org/10.1038/srep00287 (2012).
Van Der Sande, G., Brunner, D. & Soriano, M. C. Advances in photonic reservoir computing. Nanophotonics 6, 561–576 (2017).
Article Google Scholar
Torrejon, J. et al. Neuromorphic computing with nanoscale spintronic oscillators. Nature 547, 428–431 (2017).
Article Google Scholar
Du, C., Ma, W., Chang, T., Sheridan, P. & Lu, W. D. Biorealistic implementation of synaptic functions with oxide memristors through internal ionic dynamics. Adv. Funct. Mater. 25, 4290–4299 (2015).
Article Google Scholar
Marinella, M. J. & Agarwal, S. Efficient reservoir computing with memristors. Nat. Electron. 2, 437–438 (2019).
Moon, J. et al. Temporal data classification and forecasting using a memristor-based reservoir computing system. Nat. Electron. 2, 480–487 (2019).
Article Google Scholar
Du, C. et al. Reservoir computing using dynamic memristors for temporal information processing. Nat. Commun. 8, 1–10 (2017).
Article Google Scholar
Yao, P. et al. Fully hardware-implemented memristor convolutional neural network. Nature 577, 641–646 (2020).
Article Google Scholar
Dion, G., Mejaouri, S. & Sylvestre, J. Reservoir computing with a single delay-coupled non-linear mechanical oscillator. J. Appl. Phys. 124, 152132 (2018).
Article Google Scholar
Barazani, B., Dion, G., Morissette, J.-F., Beaudoin, L. & Sylvestre, J. Microfabricated neuroaccelerometer: integrating sensing and reservoir computing in MEMS. J. Microelectromech. Syst. 29, 338–347 (2020).
Article Google Scholar
Appeltant, L., Van der Sande, G., Danckaert, J. & Fischer, I. Constructing optimized binary masks for reservoir computing with delay systems. Sci. Rep. 4, 3629 (2014).
Article Google Scholar
Zhong, Y. et al. Dynamic memristor-based reservoir computing for high-efficiency temporal signal processing. Nat. Commun. 12, 408 (2021).
Zheng, T. et al. Enhancing performance of reservoir computing system based on coupled MEMS resonators. Sensors 21, 2961 (2021).
Article Google Scholar
Kovacic, I., Brennan, M. J. & Lineton, B. On the resonance response of an asymmetric Duffing oscillator. Int. J. Non-Linear Mech. 43, 858–867 (2008).
Article Google Scholar
Kaajakari, V., Mattila, T., Oja, A. & Seppa, H. Nonlinear limits for single-crystal silicon microresonators. J. Microelectromech. Syst. 13, 715–724 (2004).
Article Google Scholar
Zhao, C. et al. Experimental observation of noise reduction in weakly coupled nonlinear MEMS resonators. J. Microelectromech. Syst. 26, 1196–1203 (2017).
Article Google Scholar
LeCun, Y. & Cortes, C. The MNIST database of handwritten digits. https://www.lri.fr/~marc/Master2/MNIST_doc.pdf (1998).
Tran, Q. H. & Nakajima, K. Higher-order quantum reservoir computing. arXiv preprint arXiv:2006.08999 (2020).

Download references

Acknowledgements

The authors would like to thank the members of the State Key Laboratory of Transducer Technology for helpful discussions. This research was partially supported by the National Key Research and Development Program of China (Grant No. 2018YFB2002300), the National Natural Science Foundation of China (Grant No. 61971399), and the Key Research Program of Frontier Science (CAS, Grant No. ZDBS-LY-JSC028).

Author information

Authors and Affiliations

The State Key Laboratory of Transducer Technology, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China
Jie Sun, Wuhao Yang, Tianyi Zheng, Xingyin Xiong, Yunfei Liu, Zhitian Li & Xudong Zou
School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing, China
Jie Sun, Tianyi Zheng, Yunfei Liu & Xudong Zou
QILU Aerospace Information Research Institute, Jinan, China
Zheng Wang & Xudong Zou

Authors

Jie Sun
View author publications
You can also search for this author in PubMed Google Scholar
Wuhao Yang
View author publications
You can also search for this author in PubMed Google Scholar
Tianyi Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Xingyin Xiong
View author publications
You can also search for this author in PubMed Google Scholar
Yunfei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Zheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhitian Li
View author publications
You can also search for this author in PubMed Google Scholar
Xudong Zou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.Z. supported and conceived the experiments and correspondence to this article; J.S. performed the numerical simulations and the experiments, as well as most of the analysis, data acquisition and processing, and writing; W.Y. contributed to parts of the analysis, discussion, writing, and polishing of the article as well as correspondence to this article together with X.Z.; X.X. contributed to parts of the analysis, discussion, and polishing of the article; T.Z. contributed some ideas on numerical simulations and discussion; Y.L. contributed to parts of the numerical simulations and data processing; Z.W. contributed to some ideas on the device design and parts of the discussion; Z.L. contributed to parts of the discussion. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Wuhao Yang or Xudong Zou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sun, J., Yang, W., Zheng, T. et al. Novel nondelay-based reservoir computing with a single micromechanical nonlinear resonator for high-efficiency information processing. Microsyst Nanoeng 7, 83 (2021). https://doi.org/10.1038/s41378-021-00313-7

Download citation

Received: 20 May 2021
Revised: 14 September 2021
Accepted: 16 September 2021
Published: 20 October 2021
DOI: https://doi.org/10.1038/s41378-021-00313-7

This article is cited by

In-sensor human gait analysis with machine learning in a wearable microfabricated accelerometer
- Guillaume Dion
- Albert Tessier-Poirier
- Julien Sylvestre
Communications Engineering (2024)
Physical reservoir computing with emerging electronics
- Xiangpeng Liang
- Jianshi Tang
- Huaqiang Wu
Nature Electronics (2024)
Non-linear processing with a surface acoustic wave reservoir computer
- Claude Meffan
- Taiki Ijima
- Toshiyuki Tsuchiya
Microsystem Technologies (2023)
A novel progressive wave gyroscope based on acousto-optic effects
- Lu Tian
- Qiang Shen
- Honglong Chang
Microsystems & Nanoengineering (2022)
Nonlinearity-mediated digitization and amplification in electromechanical phonon-cavity systems
- Tongqiao Miao
- Xin Zhou
- Dingbang Xiao
Nature Communications (2022)