Sensory information processing in robot skins currently rely on a centralized approach where signal transduction (on the body) is separated from centralized computation and decision-making, requiring the transfer of large amounts of data from periphery to central processors, at the cost of wiring, latency, fault tolerance and robustness. We envision a decentralized approach where intelligence is embedded in the sensing nodes, using a unique neuromorphic methodology to extract relevant information in robotic skins. Here we specifically address pain perception and the association of nociception with tactile perception to trigger the escape reflex in a sensorized robotic arm. The proposed system comprises self-healable materials and memtransistors as enabling technologies for the implementation of neuromorphic nociceptors, spiking local associative learning and communication. Configuring memtransistors as gated-threshold and -memristive switches, the demonstrated system features in-memory edge computing with minimal hardware circuitry and wiring, and enhanced fault tolerance and robustness.
Conventional robots carry out tasks in a structured programmable manner in controlled environments. The next generation of robots will work in unconstrained dynamic environments and in close proximity with humans. In this scenario, robots must be able to correctly perceive the external world and adapt their behaviour accordingly. Tactile and somatosensory perception are crucial to handle physical contact and develop spatial perception, to plan movements that avoid contact with obstacles1. However, few robotic platforms are equipped with tactile sensors to match the scale of the human skin (the majority only in their end-effectors and a small percentage over their full body2-HEX-O-SKIN3 and RoboSkin4), as covering robots with a myriad of small sensing sites brings about a number of technological challenges ranging from wiring to sensitivity, bandwidth, fault tolerance and robustness5,6. The current approach in robotics is based on the use of remote embedded sensors (on the robot body) and central processing. These implementations do not match the scale, connectivity and energy efficiency of their biological counterpart and often place little emphasis on fault tolerance against mechanical damage, which is essential for robots working in uncertain environments.
Implementations relying on the serial transmission of sensory information result in latency bottlenecks directly proportional to the number of sensors6. Time-divisional multiple access7,8 protocols can reduce the wiring complexity but require sequential and periodic sampling of sensors to map the data distribution, hence falling behind on latency and scalability. Inspired from biology, there have been efforts to mimic the signal processing and data handling architectures of the human nervous system, relying on neuromorphic, event-driven sampling of the tactile signal and asynchronous communication (e.g. address event representation6,9 and spread spectrum techniques10). So far, such systems are limited to encoding sensor stimuli with trains of digital pulses for simple signal transduction11 and high compression of the tactile signals, tackling the issues of wiring, communication bandwidth and efficiency to some extent. However, they still rely on the transfer of low-level sensory information to centralized neuromorphic spike-based processing units12,13,14 to use biologically inspired learning methodologies to extract relevant information from the tactile signals, hence becoming vulnerable to the above-mentioned issues. Additionally the robustness of such demonstrations to inadvertent mechanical damage and harsh operating conditions is hitherto unaddressed. Most critically, from the device perspective, current neuromorphic implementations are based on complementary metal-oxide-semiconductor (CMOS) technology that does not match the scale and connectivity of the human nervous system. For example, the implementation of a single CMOS-based synapse with spike-timing-dependent plasticity (STDP)-based learning rule requires multiple transistors, resistors and capacitors, which limits its scalability15. Memristive devices, instead, enable power-efficient in-memory computations via neurons and synapses with much simpler circuitry16,17,18,19.
We present a unique, decentralized neuromorphic decision-making concept to lower the temporal redundancy of event-based sensory signals and vastly reduce the amount of data shuttled to the central processing system, hence lowering the latency and wiring demands6. We enable this based on the unique sliding threshold behaviour of biological nociceptors and associative learning in synapses to shift intelligence to the location of the sensor nodes. We demonstrate a three-tier decision-making process flow-nociceptors identify and filter noxious information based on short-term temporal correlations, synapses associatively learn patterns in sensory signals with noxious information and neurons integrate synaptic weights. In this work, we propose and demonstrate the first comprehensive memristive implementation of neuromorphic tactile receptors and their fusion with memristive synapses and CMOS neurons. We present three-terminal indium–tungsten oxide (IWO) memtransistors with ionic dielectrics as peripheral signal processors which when fashioned as volatile gated-threshold elements emulate artificial nociceptors and when fashioned as non-volatile gated-memristive switches emulate artificial synapses. Enabled by the computational and circuit efficiency of these memtransistors, we optimize wiring, data transfer and decision-making latency by decentralizing tactile signal processing to the transduction sites. While the proposed method is different from biology where millions of nerve bundles connect the peripheral nervous system to the central nervous system, it is a necessary solution for artificial robotic nervous systems where it is impossible to replicate the wiring density seen in biology. To demonstrate the feasibility of the proposed approach, we develop a proof-of-concept system comprising artificial nociceptors that respond to pain produced by touching a sharp tip and receptors that respond to pressure and temperature, coupled with spiking memristive learning synapses and CMOS neurons to associate pressure and pain perception. In comparison to the artificial afferent nerve implementations utilizing oscillators, Mott memristors20 and organic synaptic transistors21 with physically separate signal transduction and processing, we propose a unique, decentralized scheme and demonstrate decision-making at the sensor node as a viable solution to address the peripheral sensory signal processing in robotics.
Figure 1 illustrates the concept of decentralized intelligence for robotics. In the conventional centralized approach, signal transduction is separated from centralized computation and all the learning happens at a powerful central processor. In comparison, in the proposed decentralized approach, learning is embedded into the sensor nodes, reducing the wiring complexity at the same time improving latency and fault tolerance. Nociceptors identify and filter noxious information based on short-term temporal correlations, synapses associatively learn patterns in sensory signals with noxious information and neurons integrate synaptic weights. Nociception is implemented by satellite threshold adjusting receptors (STARs). Built with memtransistors that act as gated-threshold switches, STARs possess unique features of no adaptation, relaxation and sensitization, and differs from other common sensory receptors in recalibrating their threshold and response only upon injury22. Associative learning is implemented in satellite learning modules (SLMs) near the sensing nodes, composed of satellite weight adjusting resistive memories (SWARMs) and CMOS satellite spiking neurons (SSNs). Configuring memtransistors as gated-memristive switches, learning in the SLMs occur via strengthening and weakening of the memristive connections between spiking neurons using STDP23. This increases the tolerance to nociceptor damage while reducing the number of wires to be connected to upstream processors. Robustness to inadvertent mechanical damage and harsh operating conditions is further enhanced by the self-healing capabilities of the active switching devices (STARs and SWARMs), going beyond healing at only the substrate level10. We report the first healable neuromorphic memristive devices—artificial nociceptors and synapses, to the best of our knowledge. The resulting system is integrated on a robotic hand, the detection of pain signals and the perception of the corresponding pressure by mechanoreceptors triggers an avoidance reflex by the robotic arm. Our results illustrate the viability of implementing the reflex locally with improved latency and fault tolerance rather than in a centralized processing unit.
Satellite threshold adjusting receptors
With a highly distributed set of receptors, sensory/afferent and motor/efferent nerves, the peripheral nervous system in humans processes lower-order information, signals and relays information between the central nervous system and other areas of the body, which allows us to react to our environment24. Correspondingly, a robotic nervous system should offer a rich peripheral interface to accommodate various sensory receptors, locally process lower-order sensory information and relay necessary higher-order signals to the main learning modules to accelerate inference and decision-making.
Modelled on the working of biological nociceptors, STARs act as peripheral lower-order signal processing units that detect noxious stimuli. Located at the end of sensory neuron’s axon, biological nociceptors transmit warning action potentials to the central nervous system upon arrival of noxious stimuli such as mechanical stress and temperatures above the pain threshold. Unlike other sensory receptors that adapt their sensitivity upon continuous exposure to stimuli, nociceptors operate uniquely in two modes, exhibiting an intensity-dependent sliding threshold behaviour. In their normal operation mode (Mode-1: defined as the absence of noxious stimuli), the nociceptors maintain a constant threshold and do not adapt to stimuli. Upon injury, they enter an emergency mode (Mode-2) and recalibrate their threshold and response, exhibiting features of relaxation and sensitization to overprotect the injured site. In Mode-2, the sensitivity threshold to stimuli decreases, enhancing the response to innocuous stimuli immediately following noxious stimuli and decreasing latency. Switching between these modes, the sliding threshold function allows to filter significant noxious information from other sensory information, while the short-term memory features of relaxation and sensitization fuses temporal correlations with noxious information to enable lower-order processing of sensory signals. Figure 2a illustrates the working principle of biological nociceptors and the analogy to STARs. In the system we propose, pressure stimuli from mechanoreceptors or thermal stimuli from thermal receptors represent the signal of interest analogous to noxious inputs in biology. As a first demonstration of this concept, we implement STARs by configuring IWO thin film transistors (Supplementary Note 1, Supplementary Fig. 1) to operate as gated-threshold switches. In the gated-threshold a.k.a. diffusive mode, migration and relaxation of ions in the ionic dielectric temporarily strengthens and weakens the charge carrier accumulation in the semiconducting channel, resulting in a volatile hysteresis, as detailed in Supplementary Note 2. This ion migration-relaxation dynamics at the semiconductor–dielectric interface defines the volatile short-term memory/plasticity behaviour in our memtransistors and is harnessed to present the temporal dynamics of artificial nociceptors or STARs.
To demonstrate the unique sliding threshold feature, the STARs are pulsed with voltage triggers representing external stimuli and the corresponding output current responses are recorded as a function of time. While voltage triggers of weak amplitude (Vgs = 1 V) are used to generate normal-state responses (no pain), intense voltage shocks of high amplitude (Vgs ≥ 2 V) representing pain/injury (noxious inputs) generate sensitized responses from STARs. In their normal state (Mode-1), the STARs generate output responses but is unable to reach the pain threshold (Inox = 3.3 mA) even when stimulated by a train of voltage triggers (number ~50). Increased amplitude, pulse width and number of triggers enhances the current response and reduces the incubation time (time required to reach the pain threshold) as shown in Supplementary Note 2 and Supplementary Fig. 2, akin to biological nociceptors. Saturation of current responses upon persistent activation and volatility of these states helps maintain a constant pain threshold throughout the normal-state operation. Supplementary Note 2, Supplementary Figs. 3 and 5 show the relaxation effect of STAR during a stimulation protocol with paired pulses at different interval times (5–32 ms): the priori triggers (V1 = 1.2 V) are of noxious type, generating pain response, while the successive triggers (V2 = 0.75 V) are innocuous stimuli that would not elicit a response. As expected, innocuous stimuli that arrived more than 10 ms after the V1 stimuli, i.e. after the device fully relaxed to its original state, do not change the state of the device, but elicit significant response when the pulse intervals are kept shorter than 10 ms. The response amplitude increases for shorter pulse intervals, reminiscent of the enhanced sensitivity of biological nociceptors within the relaxation process, and could be further tuned as a function of the stimulus amplitude (Supplementary Note 2, Supplementary Figs. 3 and 5).
Application of voltage shocks (representing injuries of increasing severity) drive STARs to their emergency mode or injured or sensitized state (Mode-2), characterized by reduction of the activation threshold (allodynia) and enhanced current responses (hyperalgesia)25,26. To demonstrate the sensitization characteristics of STARs, we recorded current responses after application of high amplitude (2 V, 3 s pulse width) voltage shocks (representing injuries). As indicated in Fig. 2b–e, Supplementary Note 2 and Supplementary Fig. 4, the normal-state responses are initially measured at a Vgs = 1 V. Noxious stimuli or Injury pulses (Vgs ≥ 2 V) are next applied on the gate terminal of the STARs, after which the sensitized current response curves are once again measured at Vgs = 1 V. Injured or sensitized STARs exhibit higher output currents (hyperalgesia) and reduced current response thresholds (allodynia), when compared to their normal-state operation as shown in Fig. 2b. Injuries of increased severity (amplitude > 2 V: Fig. 2b, Supplementary Note 2, Supplementary Fig. 4b or pulse width (pw)>3 s: Fig. 2c, Supplementary Note 2, Supplementary Fig. 4c) reduce the pain threshold and elevate the response further, demonstrating the unique sliding threshold behaviour of STARs. For example, when sensitized with +2.25 V noxious stimuli, the STAR’s response reaches the noxious threshold (Inox = 3.3 mA) within 20 pulses and outputs a maximum value of 4.3 mA at the end of 46 pulses; while +2.5 V noxious stimuli induces activation within 19 pulses and shows higher current responses of up to 5.2 mA. The threshold switching behaviour also enables passive healing with time and active healing with curing pulses of opposite polarity as shown in Fig. 2d, e, Supplementary Note 2 and Supplementary Fig. 4d, e. Figure 2 depicts the peak points of current response of the STAR. The corresponding raw output current spikes are shown in Supplementary Note 2 and Supplementary Fig. 4.
Hence, STARs comprehensively emulate all the signatures of their biological counterparts, thanks to the fatigue-less ion migration-relaxation effects at the electrical double layer interface of the three-terminal memristive device used. Moreover, they do not require precise fabrication of ion reservoirs and shallow defects to ensure good cyclability like their two-terminal diffusive memristor counterparts27,28,29. In comparison, CMOS-based implementation of a single nociceptor would require multiple (at least six) transistors and one capacitor wired together to implement its adaptability to repeated exposure to noxious stimuli (Supplementary Note 3, Supplementary Fig. 6, Supplementary Table 1).
Satellite learning modules
While delocalized STARs pre-process the sensory signal via volatile mathematical threshold functions, learning of more complex phenomena like association and conditioning entails non-volatile synaptic weight updates. Plasticity mediated by activity-dependent strengthening and weakening of synaptic connections forms the basis of learning and memory in the human brain30. Analogously in neuromorphic architectures, artificial synapses act as weighted connections between layers of the neural network enabling energy-efficient in-memory computations31. In this work, SLMs comprise SWARMs that alter their weights upon arrival of temporally causal and acausal stimuli using STDP; and CMOS spiking neurons (SSNs) that modulate their firing rate as a function of synaptic weight and input stimuli.
Satellite weight adjusting resistive memories
We configure thin film memtransistors to operate as gated-memristive switches a.k.a. drift mode to functionally emulate the signal processing of a biological synapse. On persistent application of positive voltage pulses with higher amplitude, additional oxygen vacancies are created in the ultra-thin IWO semiconducting channel, modulating its local electronic structure, and resulting in a non-volatile memory. The accompanying stochiometric transformations monitored through X-ray photoelectron spectroscopy provide critical insights into the underlying oxygen-vacancy generation mechanism and corroborates the comprehensive weight update analyses of the SWARMs (Supplementary Note 4, Supplementary Fig. 7)32. Figure 3 shows the characterization of SWARMs, the implemented STDP learning rule compared to biological synapses and higher-order associative learning fusing the information from STARs. We characterized STDP as a function of the temporal window between pre and postsynaptic stimuli. Asynchronous electrical spikes of identical amplitude and duration (representing information like texture of surfaces and objects) induce asymmetric STDP functions in SWARMs (Fig. 3b; Supplementary Note 4, Supplementary Fig. 8). The SWARMs implement an anti-Hebbian33 form of the STDP where a temporal order of first presynaptic activity followed by postsynaptic activity leads to long-term depression (LTD) while the reverse order leads to long-term potentiation (LTP). This weight modulation in turn modulates the firing rates of neurons, triggering motor responses to avoid potential physical damage, when the texture associated with the noxious stimulus is detected.
To demonstrate the higher-order associative learning in SLMs, we train four of our SWARMs with spikes as illustrated in Fig. 3c. The spikes could be generated using rigid CMOS SSN circuits such as in Supplementary Note 5, Supplementary Figs. 9–11 and Supplementary Table 2 that implement an integrate and fire neuron or could be generated from Mott memristors17 and flexible ring oscillators that encode stimulus intensity in firing rates. The pattern “1010” corresponds to the signal of interest (texture information corresponding to a very sharp pencil tip) while all other patterns namely “1001”, “1100”, “0110” and “0011” represent information not correlated with a noxious stimulus (texture information of pencil tips of decreasing sharpness). After three training repetitions, a strong association is developed between the texture of the surface of the object (represented by the patterns) and pain (relayed from STARs), thanks to a significant total weight change (>15 μS) corresponding to the sharp pencil tip (“1010”), while all other inputs fail to cause significant weight changes. The high conductance readouts corresponding to the pattern “1010” during the inference stage even in the absence of the nociceptive signal from STARs demonstrate the fault tolerance enabled by the higher-order associative learning ability of SLMs. Implementation of this associative learning near the sensor node highlights the utility of this decentralized neuromorphic approach in increasing fault tolerance and reducing the wiring complexity for large-area sensing. Implementation of a similar associative logic on a traditional CMOS platform would require multiple interconnected elements with multi-state non-volatile storage capabilities. However, the gated-memristive configuration of SWARMs ensures that one single device can be trained to display this complex association/conditioning rule, minimizing the footprint of the involved circuitry34.
Self-healing peripheral sensing and computation
Self-repair of the components in direct contact with the environment is a crucial capability of biological tissues35,36 that would support reliable operation of robots in unconstrained environments. To this aim, STARs and SWARMs are designed with self-healable ionic gels/dielectrics that heal themselves when subjected to damage (Fig. 4a). The basic concept in the design of this ion gel is to combine a polar, stretchable polymer with mobile species of the ionic liquid. The ion–dipole interactions—forces between charged ions and polar groups on the polymer increases as the ion charge or molecular polarity increases. Upon addition of high-ionic-strength ionic liquids into the polymer, there are two effects: first, the ionic liquid will plasticize the polymer chains to a much lower glass transition temperature below room temperature; and second the polymer chain diffusion is facilitated by ion–dipole interactions. This allows the polymer to autonomously repair themselves at room temperature. In contrast to majority of the healable systems that depend only on strong intermolecular interactions to glue them back together, requiring manual rejointing of the two cut ends of the polymer pieces37,38, our physical cut creates a 10 μm gap between the two parts of the polymer film. The reduced glass transition temperature of the ionic liquid–polymer combination allows the polymer to flow back across the 10 μm gap and stitch itself back together with the intermolecular interactions. Thus, we utilize both the reduced glass transition temperature and high intermolecular interactions for healing. Since all electronic devices are typically in a thin film format, this design choice of a material with both the mechanisms of healing is significant for such applications.
Here, the ion gels are composed of a highly polar fluoro-elastomer-poly(vinylidene fluoride-co-hexafluoropropylene) P(VDF-HFP) with very high dipole moment, together with a stable low vapour pressure ionic liquid-1-ethyl-3-methylimidazolium bis(trifluoromethylsulfonyl) imide ([EMI] + [TFSI]− or EMITFSI). The high electronegativity of fluorine, the strong electrostatic nature of the carbon–fluorine (C–F) bonds in the fluoro-elastomer and the fluorine-rich ionic liquid makes the ion gel hydrophobic. Previously, density functional theory calculations have estimated attractive binding energy of ion–dipole interaction, i.e. between a single oligomer of PVDF-HFP and an imidazolium cation to be ~22.4 kcal mol−1, nearly twice the binding energy between oligomers (11.3 kcal mol−1)39. In addition to strong ion–dipole interactions, the CF3 pendant group’s steric hindrance provides higher free volume for mobile ions, resulting in higher ionic conductivity and self-healing capability40. Additionally, the ionic liquid’s high miscibility with the polymer makes the ion gel highly transparent, with an average transmittance of over 87% under visible light. Upon injury, the ionic liquid inclusions trigger the healing process by improving the thermal mobility of the polymer housing via a plasticizing mechanism41. The self-healing nature of the proposed ion gels via spectroscopic and mechanical analysis is shown in Supplementary Note 6 and Supplementary Figs. 11–15. Fourier transform infrared spectroscopy (FTIR) studies point to ion–dipole interactions between the polymer and the imidazolium-based ionic liquid (Supplementary Note 6, Supplementary Fig. 12). Differential scanning calorimetry (DSC) reflects the lowering of the glass transition temperature and provides direct evidence of the plasticizing effect (Supplementary Note 6, Supplementary Fig. 13), while Thermogravimetric analysis (TGA) reveals thermal stability of the ion gel above 350 °C (Supplementary Note 6, Supplementary Fig. 14). From the stress–strain curves, softening of the matrix and a decrease in the Young’s modulus of the ion gels is observed with higher EMITFSI content. The healed sample depicts a similar slope for the stress–strain curve as the pristine sample, indicating no change in the Young’s modulus and mechanical stiffness of the material during the healing process. We describe the mechanical self-healing efficiency as the proportion of restored toughness relative to the original toughness (the area under the stress–strain curve), since this approach considers both stress and strain restoration40. The sample shows a healing efficiency of 27% in terms of maximum strain at break (ultimate strain) and an impressive 67% in terms of peak load (Supplementary Note 6, Supplementary Figs. 15 and 16). Figure 4b and Supplementary Note 6 and Supplementary Fig. 17 show the optical and scanning electron microscopy images of the drop casted films at various stages of the damage-heal process at room temperature across 24 h after a 10-μm wide knife cut. The ion–dipole interaction and the plasticizing effect both contributes to the self-healing property of the materials. Figures 4c and 5, Supplementary Note 6 and Supplementary Figs. 18 and 19 show the functional electrical recovery of STARs and SWARMs at various stages of the damage and healing process.
Figure 5a depicts the healing behaviour of short-term plasticity of SWARMs. Paired-pulse facilitation (PPF) refers to a short-term homosynaptic facilitation in which the postsynaptic response to the second action potential is much larger relative to the first due to the accumulation of residual Ca2+ in the presynaptic terminal. The degree of facilitation is greatest when the pulse interval is kept shortest, i.e., when the Ca2+ ions are not allowed to return to the baseline concentration prior to the second stimulus42. Analogous to this, action potentials (+1.5 V, pw = 20 ms) separated by minute pulse intervals (<50 ms) trigger higher excitatory postsynaptic currents in the second presynaptic spike, resulting in PPF indices well above 100%. The SWARMs exhibit the highest PPF index ~181% for a pulse interval of 10 ms. Increasing intervals result in an exponential reduction of the facilitation indices in accordance with the Ca2+ residual hypothesis (Fig. 5a, Supplementary Note 6, Supplementary Fig. 18). The exponential decrease of the facilitation indices indicates the temporal dynamics of the ion relaxation mechanism. Similarly, application of presynaptic pulses of opposite polarity results in a decrease in short-term conductance or paired-pulse depression (PPD). The decay of the depression curves is again exponential, similar to PPF. The devices are then damaged with a knife cut that creates a 10 μm gap. Analysis of the PPF and PPD indices after the healing process indicate good recovery of the conductance levels and restoration of the ion accumulation-relaxation mechanism (Fig. 5a, Supplementary Note 6, Supplementary Fig. 18).
Since SWARMs are utilized to implement associative learning via non-volatile weight changes, we next focus on measurements of their long-term plasticity behaviour. Supplementary Note 6 and Supplementary Fig. 19 show the representative LTP and LTD curves before damage and after the healing process, and Fig. 5b, c depicts the STDP and LTP-LTD behaviour of our SWARMs as a function of the number of training cycles, before damage and after the healing process, respectively. The weight update trace follows a similar trend before damage and after healing, indicating complete functional recovery and healing of the ion gel dielectric after mechanical damage. The device-to-device variations are captured by the error plots in Fig. 5c. In general, the LTP and LTD weight updates do indicate higher variations after damage, but the trend of the overall weight update traces remain consistent even after severe mechanical damage to the ion gel dielectric. From a yield perspective, 19 out of the 20 damaged samples recover functionally after 24 h of healing time. In comparison, conventional CMOS-compatible dielectrics like SiO2 fail upon mechanical damage as shown in Supplementary Note 6 and Supplementary Fig. 20. The healing behaviour of our devices is further validated in real time by the demonstrations shown in Supplementary Movies 1–3, Fig. 6, Supplementary Note 6 and Supplementary Fig. 21. The devices heal back after damage and are able to trigger motor responses in the robot.
As a validation step of the proposed sensory signal processing artificial skin, we implemented pain-reflex movements in a robotic arm upon sensing of noxious mechanical and thermal stimuli (Fig. 6, Supplementary Note 6, Supplementary Fig. 21) that after a training phase are associated to the noxious stimuli. The output pain signals from an array of 16 STARs are sent to 16 SWARMs via an Arduino controller, which acts as the SSN in the experimental setup. On encountering noxious stimuli, the warning outputs from STARs act as punishment signals to SWARMs, modulating their response as shown in Fig. 3c. The internal re-adjustment of the weights in turn modulates the firing rates of SSNs, triggering escape motor responses of a robotic arm to avoid potential physical damage. Supplementary Movie 1 illustrates how associative learning evolves over time between the STARs and SWARMs and how this association enables fault tolerance (identification of noxious signal even after nociceptor damage—in alignment with the findings shown in Fig. 3). Supplementary Movie 2 illustrates how the system as a whole survives and responds to unintentional mechanical damage. Self-healability of the ion gels helps restore functionality of the STARs and SWARMs upon inadvertent mechanical damage. Finally, Supplementary Movie 3 shows the potential of scaling up this concept to fuse more number of sensory inputs and learning modules for better decision-making.
The unique sliding threshold feature of STARs improves signal integrity by filtering lower-order sensory information like pain signals at the location of the sensing nodes. Equipping the sensor nodes with associative learning capabilities addresses the scalability and data transfer, and helps to reduce the complexity of wiring. While synaptic learning is typically relegated to the central brain in other implementations, we show for the first time how incorporating associative learning via plasticity of synapses (SWARMs) in the robotic equivalent of peripheral nervous system enables fault tolerance (identification of noxious signal even after nociceptor damage). In contrast to very recent artificial afferent nerve implementations with organic synaptic transistor21 and Mott memristors20, the proposed system adds learning and decision-making for the first time at the sensor node.
We report the first three-terminal artificial nociceptor (STAR) fully integrated with artificial synapses and neurons, to the best of our knowledge. With respect to existing two-terminal memristive implementations of artificial nociceptors27,28,29 which require precise fabrication of ion reservoirs and shallow defects to ensure good cyclability, our STARs seamlessly work on the basis of the ion migration-relaxation effects at the electrical double layer interface without fatigue. From a device perspective, the thin film transistor-based configuration of STARs, SWARMs and SSNs enables emulation of nociceptors, synapses and neurons with the minimum possible number of devices, reducing circuit and wiring complexity. Compared to CMOS implementations, many of the required layers can be facilely and cheaply printed and support bending and stretching, paving the way for the implementation of flexible neuromorphic robotic skin. Moreover, since the switching mechanism between the high- and low-resistance states depends only on the accumulation and depletion of carriers in the semiconducting channel, both STARs and SWARMs do not require a forming step nor a compliance current control to avoid damage, enhancing their cyclability and simplifying the peripheral circuitry design. In addition, the gating strategy also allows access to a large number of states when compared to traditional two-terminal memristors34. Since the memtransistor can be configured to operate as both a nociceptor and synapse, this allows us to facilely build a platform with a single core device configuration unlike conventional two-terminal memristors that require special design of diffusive and drift configurations.
This is the first report of healable neuromorphic memristive devices—artificial nociceptors and synapses to the best of our knowledge. Although sensorized artificial synapses approaching human skin-like performance in terms of mechanical sensing and form factor have been very recently demonstrated21,43, the ability to repeatedly self-heal neuromorphic circuit elements has not been demonstrated yet. Such repeatable electrical and mechanical healing at room temperature (even at the same damage location) is a breakthrough towards the deployment of robots and prostheses in real-world applications. Compared to the very recent report on electronic skin where healing is limited to only the substrate10, we demonstrate complete functional and mechanical self-healing of all devices and the associative learning within the learning modules enables good signal integrity even if the nociceptor is damaged after learning, enhancing fault tolerance.
In summary, the proposed concept of decentralized intelligence finds close correlations to very recent biological investigations that prove important functions like pain reflex and motor control to be implemented at the level of the spinal cord44. This approach can be readily extended to other sensing modalities and material platforms. While still at a prototypical stage, this works lays down a novel framework for building a memristive robotic nervous system with direct implications for intelligent robotics and prostheses45. The self-healing capability of these intelligent devices opens up the possibility that robots may one day have an artificial nervous system that can repair itself. This ability is hitherto not demonstrated for hardware neuromorphic circuits and is timely especially with the future of electronics and robotics going soft.
Solid-state ionic dielectric
The ionic liquid [EMI][TFSI] was initially dried in vacuum for 24 h at a temperature of 70 °C. Next, P(VDF-HFP) and [EMI][TFSI] were co-dissolved in acetone with a weight ratio of 1:4:7. The ion gels were further dried in vacuum at 70 °C for 24 h to remove the residual solvent, after which it was cut with a razor blade, and then laminated onto the substrate of choice. To determine the nature of interaction between the ionic liquid and polymer matrix, FTIR spectroscopy was performed using FTIR spectrum GX, PerkinElmer. The plasticizing effect was investigated using DSC (DSC TA Instruments 2010) at a ramping rate of 10 °C min−1. The samples were tightly sealed in aluminium pans, and the measurements were carried out while heating up the sample to 200 °C, followed by cooling down to –80 °C, at a heating and cooling rate of 10 °C min−1. The degradation (working) temperature of the ion gel was measured by TGA (TGA-Q500). The self-healing nature of the ion gels was observed and captured under a polarizing optical microscope (Olympus, CX31-P).
Device fabrication and characterization
IWO thin films (thickness ∼7 nm) were deposited on SiO2/Si wafers at room temperature using an RF magnetron sputtering technique with an In2O3:WO3 (a-IWO) (98:2 wt%) target at a gas mixing ratio of Ar:O2 (20:1), total chamber pressure of 5 mtorr and RF power of 50 W. ITO source and drain contacts (thickness ∼100 nm) were then sputter deposited through a shadow mask using an In2O3:SnO2 (90:10 wt%) target. The devices were then annealed at 200 °C for 30 min in ambient environment for optimized transistor performance. Ion gels were next laminated on to these devices. Contacts to the ion gel were made directly via a side metal gate (Ag) or directly using the probe station tip. Electrical measurements were carried out in a Desert Cryogenics (Lakeshore) probe station using Keithley 4200-SCS semiconductor characterization system. Capacitance measurements were carried out using an Alpha A Analyzer, Novocontrol analyser, over a frequency range of 1 Hz to 10 kHz.
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Roncone, A., Hoffmann, M., Pattacini, U., Fadiga, L. & Metta, G. Peripersonal space and margin of safety around the body: learning visuo-tactile associations in a humanoid robot with artificial skin. PLoS ONE11, e0163713 (2016).
Dahiya, R. S., Mittendorfer, P., Valle, M., Cheng, G. & Lumelsky, V. J. Directions toward effective utilization of tactile skin: a review. IEEE Sens. J.13, 4121–4138 (2013).
Mittendorfer, P. & Cheng, G. Humanoid multimodal tactile-sensing modules. IEEE Trans. Robot.27, 401–410 (2011).
Maiolino, P., Maggiali, M., Cannata, G., Metta, G. & Natale, L. A flexible and robust large scale capacitive tactile system for robots. IEEE Sens. J.13, 3910–3917 (2013).
Bartolozzi, C., Natale, L., Nori, F. & Metta, G. Robots with a sense of touch. Nat. Mater.15, 921 (2016).
Cheng, G. et al. A comprehensive realization of robot skin: sensors, sensing, control, and applications. Proc. IEEE107, 2034–2051 (2019).
Schürmann, C., Schöpfer, M., Haschke, R. & Ritter, H. in Towards Service Robots for Everyday Environments (eds. Prassler, E. et al.) 403–415 (Springer, 2012).
Lee, W. W., Kukreja, S. L. & Thakor, N. V. A kilohertz kilotaxel tactile sensor array for investigating spatiotemporal features in neuromorphic touch. In 2015 IEEE Biomedical Circuits and Systems Conference (BioCAS) 1–4 https://doi.org/10.1109/BioCAS.2015.7348412 (Atlanta, GA, 2015).
Bartolozzi, C. et al. Event-driven encoding of off-the-shelf tactile sensors for compression and latency optimisation for robotic skin. In 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 166–173 https://doi.org/10.1109/IROS.2017.8202153 (Vancouver, BC, 2017).
Lee, W. W. et al. A neuro-inspired artificial peripheral nervous system for scalable electronic skins. Sci. Robot.4, eaax2198 (2019).
Caviglia, S., Valle, M. & Bartolozzi, C. Asynchronous, event-driven readout of POSFET devices for tactile sensing. In 2014 IEEE International Symposium on Circuits and Systems (ISCAS) 2648–2651 https://doi.org/10.1109/ISCAS.2014.6865717 (Melbourne VIC, 2014).
Davies, M. et al. Loihi: A neuromorphic manycore processor with on-chip learning. IEEE Micro38, 82–99 (2018).
Moradi, S., Qiao, N., Stefanini, F. & Indiveri, G. A scalable multicore architecture with heterogeneous memory structures for dynamic neuromorphic asynchronous processors (DYNAPs). IEEE Trans. Biomed. Circuits Syst.12, 106–122 (2017).
Rasouli, M., Chen, Y., Basu, A., Kukreja, S. L. & Thakor, N. V. An extreme learning machine-based neuromorphic tactile sensing system for texture recognition. IEEE Trans. Biomed. Circuits Syst.12, 313–325 (2018).
Bartolozzi, C. & Indiveri, G. Synaptic dynamics in analog VLSI. Neural Comput.19, 2581–2603 (2007).
Wang, Z. et al. Capacitive neural network with neuro-transistors. Nat. Commun.9, 1–10 (2018).
Yi, W. et al. Biological plausibility and stochasticity in scalable VO2 active memristor neurons. Nat. Commun.9, 1–10 (2018).
Fuller, E. J. et al. Parallel programming of an ionic floating-gate memory array for scalable neuromorphic computing. Science364, 570–574 (2019).
Cai, F. et al. A fully integrated reprogrammable memristor–CMOS system for efficient multiply–accumulate operations. Nat. Electron.2, 290–299 (2019).
Zhang, X. et al. An artificial spiking afferent nerve based on Mott memristors for neurorobotics. Nat. Commun.11, 1–9 (2020).
Kim, Y. et al. A bioinspired flexible organic artificial afferent nerve. Science360, 998–1003 (2018).
Tracey, I. Nociceptive processing in the human brain. Curr. Opin. Neurobiol.15, 478–487 (2005).
Abbott, L. F. & Nelson, S. B. Synaptic plasticity: taming the beast. Nat. Neurosci.3, 1178–1183 (2000).
Hall, J. E. Guyton and Hall Textbook of Medical Physiology e-Book (Elsevier Health Sciences, 2015).
Jensen, T. S. & Finnerup, N. B. Allodynia and hyperalgesia in neuropathic pain: clinical manifestations and mechanisms. Lancet Neurol.13, 924–935 (2014).
Sandkuhler, J. Models and mechanisms of hyperalgesia and allodynia. Physiol. Rev.89, 707–758 (2009).
Wang, Z. et al. Memristors with diffusive dynamics as synaptic emulators for neuromorphic computing. Nat. Mater.16, 101–108 (2017).
Kim, Y. et al. Nociceptive memristor. Adv. Mater.30, 1704320 (2018).
Yoon, J. H. et al. An artificial nociceptor based on a diffusive memristor. Nat. Commun.9, 417 (2018).
Abbott, L. F. & Regehr, W. G. Synaptic computation. Nature431, 796–803 (2004).
Merolla, P. A. et al. A million spiking-neuron integrated circuit with a scalable communication network and interface. Science345, 668–673 (2014).
Kulkarni, M. R. et al. Field‐driven athermal activation of amorphous metal oxide semiconductors for flexible programmable logic circuits and neuromorphic electronics. Small15, 1901457 (2019).
Caporale, N. & Dan, Y. Spike timing–dependent plasticity: a Hebbian learning rule. Annu. Rev. Neurosci.31, 25–46 (2008).
John, R. A. et al. Ultralow power dual-gated subthreshold oxide neuristors: an enabler for higher order neuronal temporal correlations. ACS Nano12, 11263–11273 (2018).
Tee, B. C. K., Wang, C., Allen, R. & Bao, Z. An electrically and mechanically self-healing composite with pressure-and flexion-sensitive properties for electronic skin applications. Nat. Nanotechnol.7, 825 (2012).
Li, C.-H. et al. A highly stretchable autonomous self-healing elastomer. Nat. Chem.8, 618 (2016).
Colquhoun, H. M. Materials that heal themselves. Nat. Chem.4, 435–436 (2012).
Ahn, B. K., Lee, D. W., Israelachvili, J. N. & Waite, J. H. Surface-initiated self-healing of polymers in aqueous media. Nat. Mater.13, 867–872 (2014).
Cao, Y. et al. A transparent, self‐healing, highly stretchable ionic conductor. Adv. Mater.29, 1605099 (2017).
Cao, Y. et al. Self-healing electronic skins for aquatic environments. Nat. Electron. 2, 75–82 (2019).
Tiwari, N., Ho, F. & Mathews, N. A rapid low temperature self-healable polymeric composite for flexible electronic devices. J. Mater. Chem. A6, 21428–21434 (2018).
Katz, B. & Miledi, R. The role of calcium in neuromuscular facilitation. J. Physiol.195, 481–492 (1968).
Lee, Y. et al. Stretchable organic optoelectronic sensorimotor synapse. Sci. Adv.4, eaat7387 (2018).
Weiler, J., Gribble, P. L. & Pruszynski, J. A. Spinal stretch reflexes support efficient hand control. Nat. Neurosci.22, 529–533 (2019).
Osborn, L. E. et al. Prosthesis with neuromorphic multilayered e-dermis perceives touch and pain. Sci. Robot.3, eaat3818 (2018).
The authors would like to acknowledge the funding from MOE Tier 1 grants: RG87/16, RG 166/16 and MOE Tier 2 grants MOE2016-T2-1-100 and MOE2018-T2-2-083. We thank Prof. Koh Soo Jin Adrian (Department of Mechanical Engineering, NUS) for access to dielectric spectroscopy measurements.
The authors declare no competing interests.
Peer review informationNature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
John, R.A., Tiwari, N., Patdillah, M.I.B. et al. Self healable neuromorphic memtransistor elements for decentralized sensory signal processing in robotics. Nat Commun 11, 4030 (2020). https://doi.org/10.1038/s41467-020-17870-6