In contrast to AI hardware, neuromorphic hardware is based on neuroscience, wherein constructing both spiking neurons and their dense and complex networks is essential to obtain intelligent abilities. However, the integration density of present neuromorphic devices is much less than that of human brains. In this report, we present molecular neuromorphic devices, composed of a dynamic and extremely dense network of single-walled carbon nanotubes (SWNTs) complexed with polyoxometalate (POM). We show experimentally that the SWNT/POM network generates spontaneous spikes and noise. We propose electron-cascading models of the network consisting of heterogeneous molecular junctions that yields results in good agreement with the experimental results. Rudimentary learning ability of the network is illustrated by introducing reservoir computing, which utilises spiking dynamics and a certain degree of network complexity. These results indicate the possibility that complex functional networks can be constructed using molecular devices, and contribute to the development of neuromorphic devices.
Brain-inspired computing has attracted considerable attention in recent years for its potential to perform intelligent, robust and low-power computations in situations in which conventional algorithm-based computing on Neumann-based computers may falter1. Neumann- and silicon-based AI accelerators, such as MIT and NVIDIA’s Eyeriss processors2, a machine-learning supercomputer (DaDianNao)3 and commercial general-purpose graphic processing units (GP-GPUs), are used to create intelligent artefacts with learning and cognitive abilities. The silicon-based neural accelerators mentioned above provide such intelligent functions; however, they are based on advanced computer science and engineering, and are not based on contemporary neuroscience, which causes their applications to be limited to those such as pattern classification and inference. Neuromorphic hardware, on the other hand, is based on neuroscience, and provides excellent opportunities to replicate higher-level brain functions. In contemporary neuromorphic hardware (e.g. IBM’s neurosynaptic chip (TrueNorth)4, analogue or digital neuromorphic integrated circuits5,6, etc.), artificial spiking neurons that mimic nerve impulse (spike) generation and the construction of their dense and complex networks are essential. Coding neuronal information using spikes is functionally important upon transmitting actions on neuronal membranes (active transmission lines) in noisy and unreliable environments7. The usefulness of spiking neural networks in practical applications has not become completely clear; however, it has recently been demonstrated that complex and spontaneous dynamics generated by large-scale spiking neural networks are useful for blind source separation8, reservoir computing9 and so on.
In present neuromorphic systems, both the integration density and wiring complexity, which directly represent the potential intelligent information processing ability, are much lower than those of human brains10, because present major neuromorphic hardware is only composed of silicon complementary metal–oxide semiconductor (CMOS) devices. In this report, we present an extremely dense, molecular neuromorphic network device, composed of a network of single-walled carbon nanotubes (SWNTs) complexed with polyoxometalate (POM)11 as an alternative to the present silicon CMOS analogue and digital neural processors.
At least two types of devices are required to construct analogue neuromorphic hardware: synaptic devices and neuronal membranes. The synaptic device lies at the intersection between the axonal and dendritic wires of the neuron devices and acts as a memristive junction whose coupling strength is stored. A synaptic device consisting of a network of carbon nanotubes (CNTs) has been proposed12. The neuronal membrane (neuron) device emits spikes (nerve impulses) and transmits the impulses to other neurons via axonal and dendritic wires13. Such neural membrane devices have not been further explored in spite of recent significant advances in materials science. SWNTs are good candidate materials for neural membrane devices because metallic CNT-based conductors generate large electrical noise with rich dynamics14,15,16. Moreover, it has been observed that the electronic state and conduction mode of an SWNT vary enormously with the type of molecular species adsorbed17,18. Phosphododecamolybdic acid, (H3PMo12O40; PMo12 hereafter), is one of the POMs exhibiting reversible multi-electron redox properties19,20,21, electronic versatility22,23 and negative differential resistance (NDR) on highly ordered pyrolytic graphite (HOPG)24.
In this report, we present a complexed SWNT/POM network molecular neuromorphic device consisting of a dense and complex network of spiking molecules (PMo12 particles25) that imitates a large-scale spiking neural network. Based on experimental studies of the fabricated SWNT/POM network, we first discuss its NDR and noise properties and then demonstrate its collective impulse generation. To reveal the possible impulse generation mechanism, we propose an abstract model of the network by assuming a two-dimensional (2D) structure of molecular junctions and demonstrate that the model yields results in good agreement with the experimental results. In Supplementary Note 2, we illustrate the potential of the SWNT/POM network model for neuromorphic reservoir computing26,27 by demonstrating basic learning ability of the network.
Neuromorphic SWNT/POM device and experiments
The typical structure of an individual SWNT/POM complex prepared on a Si substrate observed by atomic force microscopy (AFM) is shown in Fig. 1a. Both the bundled SWNTs and adsorbed POM particles have diameters of a few nanometres. The total thickness of the complex is around 10 nm. The current-voltage (I–V) characteristics measured on the individual complexes by point-contacted current-imaging (PCI-) AFM18,28,29, are presented in Fig. 1b. Because the voltage sweeping speed was fixed and rapid in the AFM controller, this measurement was performed purely as a preliminary check. Several peaks are apparent; i.e. the current increases non-monotonically as the bias voltage increases due to the NDR characteristics of the SWNT/POM device. The NDR observed for the POM particle is considered to be a result of multi-redox activities within the charge container. The NDR peak position is closely related to the reduction potential as well as the electronegativity of the counter-action23. However, the NDR positions in the I–V plots shown in Fig. 1b are not unique. Some reactions with surrounding counter-ions might have substantial influence on the NDR characteristics; however, the physics of the NDR phenomena of POM particles has not yet been fully understood.
We studied the electrical properties of the SWNT/POM network. Figure 2a illustrates the assumed network structure where the yellow cuboids, black tubes and purple spheres represent the terminal electrodes, SWNTs and POM particles, respectively. Figure 2b represents a micrograph of the fabricated SWNT/POM network device having multiple terminal electrodes, denoted (1–6), with different gap distances between the electrodes. Two kinds of samples, namely ‘sample A’ rinsed by ethanol and ‘sample B’ rinsed by DI water, were evaluated.
Electrical measurement results for sample A are shown in Figs. 2c and 3a, b, e, f. Figure 2c presents sampled current plots for sample A over time, representing the distribution of the current magnitudes with stepwise-increasing VB (from 0 to 125 V). Gaussian-like noises were generated, and the mean and variance of each distribution were increased as the bias voltage VB was increased. Figure 3a shows the I–V characteristics of sample A, lying between electrodes (1) and (2) (Fig. 2b), which exhibit an NDR peak (marked with a red arrow) between VB of 125 and 150 V across the terminal electrodes. The I–V characteristics were measured in air at room temperature with averaging over 100 power-line cycles (PLCs) with a 60-Hz power line in the environment. When VB was further increased to 150 V, the current became unstable and non-Gaussian distributions appeared, as shown in the inset of Fig. 2c, resulting in periodic/aperiodic current impulse generation. By applying a voltage higher than 150 V, the system became unstable, indicating current impulse generation, as shown in Fig. 3b. This condition allowed the generation of periodic current impulses, as shown in the inset of Fig. 3b.
The I–V characteristics of sample B, lying between electrodes (1) and (2), are shown in Fig. 3c. When the absolute bias voltage across the terminal electrodes |VB| is larger than approximately 80 V, large hysteresis is observable between the forward and backward sweeps, exhibiting NDR characteristics, as well as unstable currents. The generation of unstable currents at lower |VB| (<80 V) was suppressed by accelerated ion transfer in the water-treated samples, as compared to that in the ethanol-treated samples such as sample A. The NDR peaks of the SWNT/POM network appeared for both negative and positive bias in Fig. 3c, whereas the peaks in Fig. 1b appear only for negative polarity. The polarity of the NDR appears to depend on the voltage sweep direction in Fig. 1b, because the sweeping speed is too fast to cause redox in the same polarity sweep. Even the NDR originating from the redox of PMo12 should be irrelevant to the polarity. Figure 3d shows plots of the current versus time for sample B when VB = 80 V, where periodic/aperiodic current oscillations (at approximately 25 Hz) and random current impulses are observable. Note that such impulses can also be observed in a POM/SWNT complex device having a microscale channel under VB < 1 V. See Supplementary Note 1 for the details.
Temporal sequences of inter-spike intervals (ISIs) of the impulse trains were measured to quantify the self-similarity in the impulse generation process. Figure 3e, f show the Poincaré plots30 against the applied bias voltages VB and concentration ratio of POMs to SWNTs, respectively. As an example, short sequences of impulses and their ISIs are shown in the inset of Fig. 3e, where tn represents the n-th ISI (n is the natural index number, and the maximum value is equal to the number of measured impulses minus 1). Each Poincaré plot was then created by plotting a dot at (tn, tn+1) for all n, to distinguish chaos from randomness visually. As shown in Fig. 3e, the Poincaré plots of the ISI sequences obtained from sample A for various VB do not exhibit any self-similarity, indicating that the generated impulse train was a random (not chaotic) sequence. This phenomenon was reproduced even when the concentration ratio of POMs to SWNTs RP2S was varied.
Furthermore, we found that the ISI distribution can be modified by changing RP2S. In Fig. 3f obtained from sample A, ISI distributions for specific RP2S are represented. With increasing RP2S, these interval distributions decrease in size and shift to the upper right, which represents the tendency for longer ISI impulse generation with higher RP2S. In fact, increasing the amount of POM also increased the size of the POM nanoparticles, which directly affected the electrical properties of the POM junctions by increasing the capacitance, which explains why the impulse generation period also increased.
The experimental results indicate that impulse generation in the POM/SWNT network is closely related to the existence of NDRs in the I–V characteristic of the device. Since similar NDR characteristics are also observable in single POM junctions, we hypothesise that conductance switching at a POM junction having multiple-redox ability is the origin of the impulse generation (the term ‘multiple-redox’ reflects the fact that a POM molecule is capable of having multiple charges). Electronic chemical measurements have proven that PMo12 can store up to 24 electrons entailing the molecular structural change; hence, the PMo12 is called an ‘electron sponge’31. On the other hand, it is known that the conductance of a single molecular junction is changed by electronic and structural state fluctuations, resulting in observations of noisy current across such junctions. Conductance switching by oxidation and reduction of molecules has recently been reported32, where the transient charging effect yields low-to-high current ratios exceeding 1000 at bias voltages of less than 1 V.
Based on the observations above, we built a POM/SWNT network model. In our model, the conductance of a junction between POMs and SWNTs changes from low to high, when the accumulated charges at a POM exceed its electron storage limit. To preserve multiple charges at a POM, the conductance of the junction must be low. Preserving a large number of charges at a POM results in a large potential difference across the junction, which induces low-to-high conductance switching.
When the electrons accumulated at a POM in the POM/SWNT network are discharged through a highly conductive junction, they are transferred to the neighbouring POM with the largest potential gradient. If the charging limit of the neighbouring POM is also exceeded, the discharge causes a chain reaction to take place in the network. A cascade of charges will occur in the network at that time and it will be detected as a charge impulse at the electrode.
Cellular automata model of SWNT/POM network device
A cellular automata (CA) model on a 2D regular grid, representing a random network of POM particles as cells, was constructed to understand the qualitative electrical behaviours of the SWNT/POM network device. To represent the random network on a regular grid, we introduced a defect density Df. Among all of the intersection points of the regular grid, Df percent were randomly chosen and regarded as defects, while the others were regarded as POM cells. Figure 4a illustrates the CA device model represented by a 2D network of POM cells (blue spheres) with counter-electrodes (yellow bars) and SWNTs bridging the neighbouring POMs (black lines), where Df = 45%. In the network, each POM is connected to from zero (minimum, meaning it is an isolated POM) to eight (maximum) neighbouring POMs, because vacant cells (defects) are excluded. For a POM cell located at (i,j) representing the discrete position in a 2D space, let mi,j be the number of the neighbouring POM cells. The number of charges (being proportional to the potential) of a POM particle at time t is defined by ai,j(t). Hence, when mi,j > 0, the potential gradients between a POM cell at (i,j) and its neighbouring POMs are represented by (k = 1,…,mi,j), as shown in Fig. 4b (mi,j = 5 in this example). The cells in the source electrode have constant charges of VB, which are supplied into the network, whereas the drain electrode is kept neutral.
In the CA model, a limited number of charges is ‘stochastically’ transferred from a cell to one of the neighbouring cell with the largest . The probability of the charge transfer, which we call the state transition probability, is assumed by:
where aTH represents the threshold charge, and p and q are the parameters of the probability, as plotted in Fig. 4c, where p = 1 and q = 0.95. Low Pc suggests low conductivity charge transfer between cells and the high capacitive nature of the POM. Charge accumulation at each cell will occur representing the multi-redox nature of POM. Our particular proposition of the charge transition rule is a definition of the threshold charge aTH for the discharging operation. In other words, passing all of the charges of a POM cell to a neighbouring cell means that the POM junction becomes highly conductive because of the state change of the molecule. When ai,j < aTH, a small number of charges is transferred to one of the neighbours, as shown in Fig. 4d, whereas when ai,j > aTH, all of the charges should be discharged and transferred to its neighbours, as shown in Fig. 4e. Due to the discharging transfer, neighbouring cells generally have an excess number of charges (>aTH) at the next update cycle. The subsequent discharging results in a chain reaction (discharging-event propagation) in the 2D network, providing a charge cascade in the CA system. The current is defined as the accumulated number of charges Nc transferred into the drain electrode at a given time t. Here, the time is divided by the calculation cycle.
We performed the 2D CA calculations under various conditions. The computational results well reproduced the experimental results, showing random spike generation, NDR properties and noise generation. Figure 4f shows the current with respect to time for a 30×30 array with (p, q) = (0.6, 0.3) ≡ , VB = 30, aTH = 5 and Df = 45% where VB is a dimensionless value that is proportional to the bias voltage. Random impulse generations is exhibited in the posterior half of Fig. 4f. The inset images depict the gradient of the number of charges in each cell at a given time. The right and left sides are the source and drain electrodes, respectively. During the first 100 cycles, a continuous charge gradient appears, although no current is evident. At 500 cycles, a small current is observable, as are many bright cells (having large ai,j) on the right side. Before the first impulse, a bright cell, which exceeds aTH, is evident at the left edge of the 2D array. After around 1000 cycles, large impulses appear with accompanying adjunctive medium and small pulses. After 2000 cycles, similar random impulse generation continues as long as VB is applied.
Figure 4g shows the current versus VB, i.e. the I–V characteristics, in a 30×30 array with and Df = 20%. The current increases monotonically as VB increases, although no large impulses are observable. Figure 4h shows the I–V characteristics when (p, q) = (1, 0.95) ≡ (Fig. 4c). The current in Fig. 4h exhibits a lower increase at lower VB and distinct peaks at higher VB. The I–V characteristics shown in Fig. 4g, h are analogous with the experimental I–V characteristics shown in Fig. 3a, c, respectively. Correspondingly, the lower (higher) Pc values applied for primary charge transfer indicate the high (low) charge capability of each cell. The SWNT/POM networks under different treatments should provide different POM chemical conditions. The deionized water-treated POM must maintain higher multi-redox activity by providing the system with low conductance at lower VB. The number of impulses and the threshold VB are affected by Pc. From the viewpoint of the calculated number of impulses against VB for different Pc and array sizes, the higher threshold VB and lower number of impulses generated for than agrees with the threshold bias voltages trends observed for samples A and B.
The noise was also reproduced by this model. Figure 4i depicts the current at each cycle for stepwise increments of VB, under the subthreshold condition of impulse generation. The amplitude of the current noise increases as VB increases. To generate large noise in the base current, large primary charge transport and aTH are needed to avoid impulse generation. The use of a threshold and sufficiently high potential are required to generate impulses. If VB is lower than the threshold, impulses never appear, no matter how much time passes. With sufficient VB, even in larger arrays, impulses appear after a long time. In a 300×300 array, the first impulse appears after 160,000 cycles, and the impulses have large magnitudes.
In the model, we focused on conductance switching between POMs and SWNTs induced by potential differences across their junctions. In a real POM/SWNT network, the occurrence and disappearance of highly conductive passes should be caused by complex chemical reaction phenomena. The reaction centre is electro-chemical reduction33, i.e. multiple electron charging entailing structural change of POMs31. The discharging is caused by electro-chemical oxidation. Aggregation and dissociation of counter-cations, which are generally water ions, at the POM junction should play a significant role in the conductance switching between POMs and SWNTs. The chemical reaction rate should be much slower than that of charge transfer. After the total discharge, the interface becomes normal again via a reversible reaction. The POM returns to a neutral state and charge accumulation in the network starts again. The amount of charge accumulated in real POM particles, however, remains unknown. The threshold charge number, aTH = 5, used in this study was an appropriately small for reproducing distinct impulse generation. Smaller aTH increases the appearance of impulses and decreases their magnitude.
Based on the POM/SWNT device model above, we succeeded to demonstrate basic functionality of reservoir computing. Reservoir computing is a computation framework that can be found in certain dynamical recurrent neural networks27, as well as in classical liquid state machines34 consisting of a network of randomly connected spiking neurons. Many types of physical media have been introduced as reservoirs, such as spiking neural networks9, photonic medium26 and soft matter35. Supplementary Figure 2a depicts the typical structure of a reservoir system. The details of the simulation using the POM/SWNT model and their possible experimental setups are described in Supplementary Note 2 and 3, respectively. Although the present reservoir computer is built up on a simulated model of POM/SWNT, we are now planning to fabricate the system on real POM/SWNT devices.
The SWNTs were purchased from Carbon Nanotechnologies, Inc. (Houston, TX) and annealed at 200 °C for 20 h to remove the amorphous carbon. The SWNTs were washed with 12 M HCl to remove metallic catalysts and were subsequently rinsed with DI water. To prepare the SWNT/POM complex, 0.8 mg of PMo12 was dissolved in 10 mL of ethanol, and 1.6 mg of the purified SWNTs was added to the solution. The solution was sonicated for 16 h and centrifuged at 4000 rpm for 3 h. The supernatant (1 mL) was vacuum-filtered using 1-μm-mesh nitrocellulose filter paper (MCE, Millipore) to fabricate the SWNT/POM network structure on the filter paper. The network structure was rinsed with ethanol or pure DI water. The wet filter paper was adhered to a Si substrate with a 30 or 100 nm oxidised layer with the network side facing the substrate surface. The substrate was then placed upside-down on an open-mouth bottle filled with acetone, and the bottle was heated to 80 °C. The evaporating acetone melted the filter paper, and the SWNT network was transferred to the SiO2 surface of the substrate. Thermal deposition with a patterned metal mask was conducted to fabricate 80-nm-thick Au electrodes as shown in Fig. 2b.
The electrical measurements were performed using a four-probe system with a data acquisition (DAQ) module (National Instruments, NI USB-9234), which was controlled using the LabVIEW 2014 software. The bias voltage was generated using a function generator (NF Corporation, Wavefactory WF1974) and amplified by 30 times with a voltage amplifier. The output current was transduced by a preamplifier to an output voltage, which was measured using the DAQ system. The currents at each point was integrated over 10 PLCs at 60 Hz and then averaged.
Simulation of POM/SWNT CA model
A N × N rectangular grid was created, and Df percent of the intersection points were randomly set as vacant (defect) cells, while the remainder were set as POM cells. As in conventional CA, the state of a POM cell at time t, represented by ai,j (t), was determined by its previous state at time t–1, i.e. ai,j(t–1), and the previous states of its neighbouring POM cells, i.e. ai–1,j–1(t–1), ai–1,j(t–1), ai–1,j+1(t–1), ai,j–1(t–1), ai,j+1(t–1), ai+1,j–1(t–1), ai+1,j(t–1) and ai+1,j+1(t–1). Our state transition rule was as follows: step 1, when mi,j > 0, calculate the potential gradients between a POM cell located at (i,j) and its neighbouring POM cells at time t–1, represented by , where k represents the index number of the neighbours and ranges from 1 to mi,j; step 2, find the largest potential gradient among the candidates ; step 3, when ai,j(t–1) is smaller than aTH, a limited number of charge  is stochastically transferred, obeying the state transition probability function , to the neighbouring cell with , as shown in Fig. 4d. The charge limit  has an exponential relation based on Marcus theory36, , where ⌈∙⌉ represents the floor function, ε is the transfer coefficient and γ is the sensitivity constant. When ai,j(t–1) is larger than aTH, all of the charge moves to the two neighbours with the largest and second-largest potential gradients. The charges are moved to the neighbours on a ratio of 9:1, as shown in Fig. 4e. This charge split ratio is quite sensitive to the density and magnitude of the impulse. The array has two additional columns at each side as the source and drain electrodes. VB, the number of charges in the cells of the source column, is a constant. The number of charges in the cells of the drain column is zero. The current is the accumulated number of charges transferred into the cells of the drain column at one time. In the I–V calculations, the current was accumulated every 5000 cycles. VB was derived using ⌈αVB⌉; here, the coefficient α = 0.5 was used, where α is necessary to control the resolution step and the floor function is used to make VB an integer.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Hi.T. thanks Prof. T. Morie of Kyushu Institute of Technology for the fruitful discussions. This work was supported in part by JSPS KAKENHI Grant Numbers JP21710107, JP20111001, JP15K12109, JP20111012, JP25110002, JP16H00968 and JP25110015, and by PRESTO JST JPMJPR1521-15655977. Hi.T. also thanks the Inamori Foundation and Iketani Foundation for their financial support.