Toward Fast Neural Computing using All-Photonic Phase Change Spiking Neurons

Chakraborty, Indranil; Saha, Gobinda; Sengupta, Abhronil; Roy, Kaushik

doi:10.1038/s41598-018-31365-x

Download PDF

Article
Open access
Published: 28 August 2018

Toward Fast Neural Computing using All-Photonic Phase Change Spiking Neurons

Indranil Chakraborty ORCID: orcid.org/0000-0003-4829-3706¹,
Gobinda Saha¹,
Abhronil Sengupta¹ &
…
Kaushik Roy¹

Scientific Reports volume 8, Article number: 12980 (2018) Cite this article

8743 Accesses
123 Citations
4 Altmetric
Metrics details

Subjects

Abstract

The rapid growth of brain-inspired computing coupled with the inefficiencies in the CMOS implementations of neuromrphic systems has led to intense exploration of efficient hardware implementations of the functional units of the brain, namely, neurons and synapses. However, efforts have largely been invested in implementations in the electrical domain with potential limitations of switching speed, packing density of large integrated systems and interconnect losses. As an alternative, neuromorphic engineering in the photonic domain has recently gained attention. In this work, we propose a purely photonic operation of an Integrate-and-Fire Spiking neuron, based on the phase change dynamics of Ge₂Sb₂Te₅ (GST) embedded on top of a microring resonator, which alleviates the energy constraints of PCMs in electrical domain. We also show that such a neuron can be potentially integrated with on-chip synapses into an all-Photonic Spiking Neural network inferencing framework which promises to be ultrafast and can potentially offer a large operating bandwidth.

Ultrafast optical integration and pattern classification for neuromorphic photonics based on spiking VCSEL neurons

Article Open access 08 April 2020

All-optical spiking neurosynaptic networks with self-learning capabilities

Article 08 May 2019

Ultrafast neuromorphic photonic image processing with a VCSEL neuron

Article Open access 22 March 2022

Introduction

The recent advances in the field of neuromorphic computing largely rest on our understanding of the human brain as researchers strive to comprehend the intricacies of its complex functionalities and emulate its unparalleled energy efficiency. Despite the obvious elusivenss of the brain, neuroscientific experiments have unravelled various underlying mechanisms behind our behavorial patterns. To that effect, various studies have been performed exploring phenomena concerning the basic functional units, namely neurons and synapses, that knit the neural network in the human brain. The need to incorporate these neuroscientific findings in computing models and consequently in building bio-plausible hardware has led to extensive investigations in recent years.

Most of the available computing models that encode the information processing in a neural network are based on mathematical optimization techniques. More recently, with growing evidence of spike-based processing in the biological neural network, its event-driven nature has led researchers to explore bio-plausible hardware implementations in an effort to achieve higher energy efficiency. Spiking neural networks (SNN) comprise the third generation of neural networks and the basic principle relies on how the membrane potential of a spiking neuron rises and eventually cause the neuron to spike under the action of incident spikes. Hardware implementations of various spiking neuron models such as Hodgkin-Huxley¹ and Leaky-Integrate-Fire (LIF)² on CMOS platforms not only fail to match the energy efficiency of the human brain but is also area-inefficient.

To address these shortcomings, novel material systems and technologies^3,4 have been proposed to mimic the behavior of a spiking neuron thus providing direct mapping between a single device behaving as a functional neural element. However, each technology suffers from different drawbacks, such as energy-efficiency, speed, cross-talk, fabrication difficulties, etc. Phase change materials (PCM), in particular, have been demonstrated⁵ to have significant energy restrictions due to their high ‘write’ times in the electrical domain. It has been shown that either the exciting current or ‘write’ pulse duration has to be reduced by 10× for PCM to perform better than CMOS. However, recently PCMs, e.g. GST, have been demonstrated⁶ to achieve sub-ns ‘write’ speeds when excited by photonic laser pulses. Due to highly contrasting optical and electrical properties in their amorphous (a-GST) and crystalline (c-GST) states, PCMs have thus offered avenues to implement all-photonic memories^7,8, switches⁶ and have been even used for mixed-mode electro-optical operations⁹. The promise of fast information processing with PCMs in the photonic domain has thus encouraged the possibilities of PCMs as a viable material for photonic neuromorphic systems. Recently, device¹⁰ based on GST deposited on waveguides was proposed to emulate the synaptic weight update mechanism in synapses in SNN framework. Previous works on such based spike-based neuromorphic processing in the photonic domain have been dependent on electro-optic conversions^11,12 where lasers have been used to emulate the behavior of spiking neurons. In this paper, we propose an all-photonic operation of an Integrate-and-Fire spiking neuron. We show that the proposed neuron mimics the behavior of the biological neuron and can be seamlessly integrated in an all-photonic SNN framework. Other works in the photonic neuromorphic domain includes applications such as deep neural networks¹³ and recurrent neural networks¹⁴ which are complementary to the processing framework discussed in this paper.

GST embedded Ring Resonator as a Integrate-Fire Neuron

The basic working principle of a ring resonator is necessary to be illustrated at first. A ring resonator is a structure with two rectangular waveguides and a ring waveguide (as shown in Fig. 1(a)). Wave entering through the ‘INPUT’ port gets partially coupled to the ring waveguide and interferes constructively inside the ring when the following condition, called resonant condition, is met:

$$2\pi R{n}_{eff,wg}=m{\lambda }_{m}$$

(1)

Eq. 1 provides the resonant condition (at wavelengths λ_m) for the ring resonator of radius R where the effective refractive index of the waveguide-substrate material system is n_eff,wg. By controlling the coupling and attenuation parameters, t₁, t₂ and k₁, k₂, as shown in Fig. 1(b), light can be conditionally guided through the ‘THROUGH’ and ‘DROP’ ports.

Introducing a GST element (shown in red in Fig. 1(a)) on top of the ring waveguide in the ring resonator described above allows us to control light propagation through the ports by merely changing the state of the GST. Light passing through the waveguide get evanescently coupled to the GST element and gets differentially absorbed by the GST in its low-loss amorphous state and high-absortion crystalline state⁷. The difference in attenuation arises due to the contrasting imaginary refractive index (κ_GST) of GST in its two states. Theoretically, the transmission of the ‘THROUGH’ and ‘DROP’ ports can be expressed as:

$${T}_{t}=\frac{{t}_{2}^{2}{\alpha }^{2}-2{t}_{1}{t}_{2}\alpha cos(\theta )+{t}_{1}^{2}}{1-2{t}_{1}{t}_{2}\alpha cos(\theta )+{({t}_{1}{t}_{2}\alpha )}^{2}}\,\,\,{T}_{d}=\frac{\mathrm{(1}-{t}_{1}^{2}\mathrm{)(1}-{t}_{2}^{2})\alpha }{1-2{t}_{1}{t}_{2}\alpha cos(\theta )+{({t}_{1}{t}_{2}\alpha )}^{2}}$$

(2)

where α is the attenuation factor, θ is the phase factor, t₁ and t₂ are coupling parameters. α and θ can be expressed as:

$$\alpha =exp(-\frac{2\pi }{\lambda }[{\kappa }_{eff,wg}\mathrm{(2}\pi R-{L}_{GST})+{\kappa }_{eff,GST}{L}_{GST}])\approx exp(-\frac{2\pi }{\lambda }{\kappa }_{eff,GST}{L}_{GST})$$

(3)

$$\theta =\frac{2\pi }{\lambda }[{n}_{eff,wg}\mathrm{(2}\pi R-{L}_{GST})+{n}_{eff,GST}{L}_{GST}].$$

(4)

Here κ_eff,GST (κ_eff,wg) and n_eff,GST(n_eff,wg) are effective imaginary and real parts of the refractive index of the waveguide material with (without) GST. R is the radius of the ring waveguide and L_GST is the length of the GST element. The refractive indices of partially crystallized GST are estimated from effective permitivities approximated by an effective-medium theory^15,16:

$$\frac{{\varepsilon }_{eff}(p)-1}{{\varepsilon }_{eff}(p)+2}=p\times \frac{{\varepsilon }_{c}-1}{{\varepsilon }_{c}+2}+\mathrm{(1}-p)\times \frac{{\varepsilon }_{a}-1}{{\varepsilon }_{a}+2}$$

(5)

where ε_c and ε_a are the permittivities in the crystalline and amorphous states respectively calculated from the refractive indices of GST⁷ by $\sqrt{\varepsilon (\lambda )}=n+i\kappa $. p is the degree of crystallization. The effective refractive indices of the Si waveguide- SiO2 substrate system with and without GST was calculated using COMSOL Multiphysics simulations, shown in the inset of Fig. 1(c). These equations depict the theoretical backdrop of a ring resonator system with GST. As the GST element crystallizes (amorphizes), κ_eff,GST and hence its absorption increases (decreases) and as a result the transmission at the ‘THROUGH’ (‘DROP’) port increases. Figure 1(c,d) shows that the theoretically calculated transmission at the THROUGH and ‘DROP’ ports in a ring resonator increases with p. We propose an integrate-fire spiking neuron leveraging these characteristics of the GST-ring resonator system.

Information processing in neural networks usually involve multiplication of inputs with the significance metric of the synapses, namely ‘weight’ and feeding the corresponding output to a neuron. For most neural network applications, weights can assume negative values. It is thus necessary to realize a bipolar neuron which can receive inputs of both polarity for all practical purposes. Let us now consider a GST embedded ring resonator described above. The GST initially is in crystalline state, denoting the highest (lowest) transmission level through ‘THROUGH’ (‘DROP’) port. During the ‘write’ phase, an off-resonance pulse is input which writes into the GST element, thereby reducing (increasing) its degree of crystallization p (amorphization (1 − p)). During the ‘read’ phase, as p reduces, ‘THROUGH’ port transmission T_t decreases and ‘DROP’ port transmission (T_d) increases. Thus, with incoming pulses, the transmission through the ‘DROP’ and ‘THROUGH’ ports get positively and negatively integrated respectively. We combine these properties of the device to propose a bipolar integrate and fire neuron. The integration unit of the neuron body consists of two ring resonators as shown in Fig. 2(a) and pulses of amplitudes proportional to the positive (${O}_{j}^{+}$) and negative (${O}_{j}^{-}$) weighted sums, received from the synapses, are fed to the positive and negative ring resonators respectively. The details of entire network framework is discussed later. Note, the resultant amplitude of the incident pulse to the neuron is the difference of the positive and negative inputs fed to the two devices: ${O}_{j}={O}_{j}^{+}-{O}_{j}^{-}$. Thus, the two ring resonators integrate in opposite direction to emulate the resultant integration which should ideally be proportional to O_j. The output from the ‘DROP’ and ‘THROUGH’ ports of the positive and negative devices respectively are passed to an interferometer. We place a phase modulator (ϕ) in the path of the positive ring resonator and the interferometer to tune the output of the interferometer to produce the sum of the two incoming pulses. As the two ports integrate in the opposite direction, the output of the interferometer is the resultant integration based on both the positive and negative inputs to the neuron body and can be treated as membrane potential of the integrate and fire neuron. Thus at every time-step, the membrane potential of the j^th neuron can be represented by:

$${V}_{j}[t]={V}_{j}[t-\mathrm{1]}+{O}_{j}[t]$$

(6)

Figure 2(b) shows the operation of the proposed neuron such that the membrane potential integration is proportional to the amplitude of the resultant incident spike to the neuron. Once the GST reaches full amorphization, the membrane potential crosses its threshold (P_thresh). The ‘firing’ action of the neuron involves the generation of a spike which is implemented by an additional photonic circuit as shown in Fig. 2(a). This circuit consists of an photonic amplifier, a circulator and a rectangular waveguide with a GST element on top initially in crystalline state. For a rectangular waveuguide with GST, the transmission is low (high) in crystalline (amorphous) state. The ‘read’ and ‘write’ phases for the ‘integration unit’ and the ‘firing unit’ alternate in successive cycles. This essentially means that during the ‘write’ cycle of the integration unit of the neuron, a read pulse is passed through the firing unit. On the other hand, during the ‘read’ cycle of the integration unit, the ‘read’ pulse is passed through the ring resonators and based on the output of inteferometer and subsequent amplification, the resulting pulse attempts to write into the GST of the rectangular waveguide in the firing unit. A circulator C directs the incoming and outgoing pulses into the rectangular waveguide. When the GST elements in the integration unit are initially in crystalline state, the output of the amplifier A (P_amp) is not sufficient to amorphize the GST on rectangular waveguide and hence, a spike is not transmitted through the rectangular waveguide. However, when the membrane potential integrates, on incidence of several ‘write’ pulse, enough to the cross the threshold, P_amp is ensured to be high enough to amorphize the GST on the rectangular waveguide and a spike is transmitted. Once the neuron fires, a ‘RESET’ pulse is passed to reset the states of the devices to their initial states and the membrane potential drops to the resting potential (P_rest) as shown in Fig. 2(b). Thus, the operation of a bipolar integrate and fire neuron can be achieved using the setup described in Fig. 2.

The dynamics of the spiking neuron is primarily governed by the phase-change dynamics of GST. GST partially absorbs the wave passing through the ring waveguide and its low thermal conductivity¹⁷causes a considerable increase in temperature. The growth of the amorphization region in the material occurs when the concerned region is above the melting temperature, which is around 877 K¹⁸. For a particular incident pulse, the amorphous region heats up less than the crystalline region. Thus the change in amorphous thickness will decrease as the amorphous thickness increases. Thus, change in amorphization thickness is a function of the current state of the GST and the amplitude of the incident pulse.

Results

The ‘write’ operation of the spiking neuron is investigated using the modal profiles of the incident EM waves and the resulting temperature profiles in the GST-Si-SiO2 stack. The ‘read’ operation, on the other hand, is explored from the point of view of the entire GST-ring resonator system. The modal profile of input EM wave and subsequent heat dissipation framework was implemented in COMSOL¹⁹. The temperature profiles were used to simulate the phase change characteristics of GST in MATLAB. The optical response of a ring resonator was obtained using a commercial-grade simulator Lumerical FDTD Solutions based on the finite-difference time-domain (FDTD) method²⁰. Table 1 lists the parameters used for each simulation.

Table 1 Dimensions and Material parameters.

Full size table

Phase change dynamics of GST

The electromagnetic power absorption and subsequent temperature rise in GST is analyzed in detail using Finite Element Method (FEM) simulations in COMSOL Multiphysics. Firstly, to validate our simulation framework we simulated a GST embedded Si₃N₄-SiO₂ ridge-waveguide system and compared its transient response of temperature in GST with experimental data⁸ under same excitation conditions. Figure 3(a) shows good agreement between the results from our simulation and corresponding experimental data, thus validating our simulation setup. Next, we built a 3D model of a section of the ring resonator with GST as shown in Fig. 3(b) and studied the electromagnetic characteristics and subsequent temperature profiles using the validated simulation setup. The dimensions of the waveguide were fixed to ensure single fundamental mode propagation for a input optical wave of 1550 nm length. The electric field distribution at the surface of the waveguide embedded with c-GST and a-GST are shown in Fig. 3(c,d) respectively. We observe optical attenuation of −3.71 dB in the waveguide for c-GST of 0.3 μm length and 20 nm thickness while similar dimensions of a-GST give us negligible (−0.26 dB) attenuation. This implies strong optical absorption in c-GST and also validates the fact that it is an order of magnitude higher than that of amorphous state⁸. This property allows us to progressively amorphize our device while keeping the state of the already amorphized volume undisturbed for our chosen range of input optical power.

Next, we analyze the thermal response of the GST upon optical excitation using finite element simulation. We incorporate optical heating by modeling GST as local heat source. An optical pulse of amplitude 26 mW and duration 200 ps is injected from the front facet of the waveguide. The GST is initially considered to be in crystalline state and absorbed energy in GST is taken as the heat energy for that local heat source. However, as heat is not generated uniformly within the GST volume, we designed the heat source to decrease exponentially⁸ with a factor, A = exp(−|α_x|·x·ln(10)/10) along the length of the GST (0 ≤ x ≤ L_GST) where α_x is the optical attenuation per unit length of GST. Resulting temperature distribution at the end of the pulse is shown in Fig. 3(e). From inspection of this profile, an exponential temperature distribution along the GST length becomes evident. We also observe that there exists a significant portion of GST whose temperature is above the melting temperature (877 K) and hence will become amorphized (e.g. 57% amorphization for given conditions) after removal of optical pulse. This simulation was performed multiple times keeping the pulse width same but varying the pulse power (amplitude) and initial level of amorphization and results are plotted in Fig. 3(f). We find that below 12 mW (200 ps) input pulse, irrespective of initial amorphization state, no further amorphization happens. Thus, we choose a input power range (26 mW to 12 mW) for the operation of the proposed all-photonic spiking neuron.

Optical response of ring resonator

The ‘read’ operation of the spiking neuron concerns with the optical response of the ring resonator or more precisely, the transmission characteristics at the ‘THROUGH’ and ‘DROP’ ports of the device. FDTD simulations were performed in Lumerical. Inc on a ring resonator with Si waveguides and SiO2 substrate with a patch of GST on top of the ring waveguide as illustrated in Fig. 1(a). Figure 4(a,b) shows the normalized transmission at the ‘THROUGH’ and ‘DROP’ ports for different amorphization levels of GST. The insets of Fig. 4(a,b) show the variation of transmission at a resonant wavelength λ_read = 1529 nm with increasing degree of amorphization for the two ports respectively and results show consistency with our theoretical discussions above. The variation in transmission results from the decreasing absorption co-efficient (α) as the GST amorphizes. We observe a FWHM of 1.68 (2.23) nm for a-GST and 2.97 (2.97) nm for c-GST and an extinction ratio contrast of 7.5 (6.03) dB between the fully amorphous and fully crystalline states in the ‘THROUGH’ (‘DROP’) port. Figure 4(c,d) shows the visible contrast in electric field absorption by the GST element in the ring resonator for the amorphous and crystalline states of GST for an on-resonance incident wave. The slight shift in the resonance peaks can be attributed to the minor variations in the real part of the effective refractive indices of the GST at different states, which can be expressed as⁶:

$${\rm{\Delta }}{\lambda }_{read}\approx \frac{{\rm{\Delta }}{n}_{eff,GST}}{{n}_{eff,wg}}.\frac{{L}_{GST}}{2\pi R}$$

(7)

These characteristics show that the outputs at the ‘THROUGH’ and ‘DROP’ ports decrease and increase respectively with increasing degree of amorphization which is a desirable characteristic for integration in both the positive and negative direction. We leverage this characteristic by connecting the outputs from the ‘THROUGH’ and ‘DROP’ ports of two devices to an interferometer, as shown in Fig. 2(a) to obtain the resultant integration of the membrane potential as described earlier. Thus, the progressive optical responses of the ring resonator for various percentage amorphization are in agreement with the desired characteristics for the neuronal system to show integrating action. Finally, the contrast between transmission of a-GST and c-GST for a rectangular waveguide is shown in Fig. 4(e).

Spiking Neural network inferencing framework

A neural network is comprised of multiple layers of neurons connected through synapses. The operation of any layer in a neural network involves computing the dot-product of the inputs and weights of the synapses, which gets transferred through the neuron to the next layer. To that effect, the synaptic network can be represented as a dot-product engine that multiplies the inputs with the corresponding synaptic weights and computes a weighted sum which is received by the neuron. Such a dot-product framework can be potentially implemented by GST-based photonic synapses. Such a synapse can draw its inspiration from a GST-based on-chip photonic synapse¹⁰ recently proposed. The proposed integrate-and-fire spiking neuron can be integrated with these photonic synapses in an all-photonic implementation of a spiking neural network. To analyze the performance of such an all-photonic neural network, we built a device to algorithm framework by mapping the device characteristics to implement the proposed neuron in an algorithm level neural network inferencing setup. Such a system-level simulation is quintessential to validate the operation of the proposed integrate-and-fire neuron. For the current analysis, we assume ideal operation of the dot-product engine. We consider a fully connected network consisting of 3 layers, the input layer, the hidden layer and the output layer as shown in Fig. 5(a). In such a network, each neuron receives inputs from all the neurons of the previous layer. We study the performance of the aforementioned fully connected neural network in a standard handwritten digit recognition task based on the MNIST dataset²¹. The MNIST dataset consists of 60000 training images and 10000 testing images. The weights of the synapses are trained using the Backpropagation algorithm²² as in case of traditional Artificial Neural networks (ANN). During inferencing, we use a conversion mechanism²³ from ANN to SNN where the neurons with ‘ReLU’²⁴ activation functions in the ANN are replaced by the proposed integrate-and-fire neurons. The dependence of final state of the device on the input and initial state of the device as shown in Fig. 3(f) was used to determine the state of each neuron after each time-step. Then, the transmission characteristics of the ports of the ring resonators in the proposed neuron as shown in Fig. 4(a,b) was used to determine the final membrane potential of each neuron. Each pixel of a 28 × 28 input image is divided into a stream of spikes whose frequency is proportional to the pixel intensity. The proposed integrate-and-fire neurons receive the dot product of the input spikes in a certain time-step t and the corresponding weights of synapses connecting the neuron and the inputs as shown in Fig. 5(b). Upon receiving the dot product stimulus, the neurons integrate its membrane potential at that time-step. Mathematically, for j^th neuron, this can be represented similar to Eqn. 6:

$${V}_{j}[t]={V}_{j}[t-\mathrm{1]}+\sum _{i}\,{I}_{i}\,[t]{w}_{ij}$$

(8)

where V_j[t] is the internal state or the membrane potential of the j^th neuron at time t, I_i[t] is the i^th input at time t, w_ij is the weight of the synapse connecting the i^th input to the j^th neuron. The details of the synaptic network implementation in the photonic domian will be a future course of study, however, similar concepts have been well-explored in the electrical domain⁴. Any synaptic network is essentially a dot-product engine performing element-wise multiplication of the inputs and the synaptic weights. Such a dot-product engine receives an N-dimensional input vector and provides an M-dimensional output vector which can be mathematically represented as:

$$[\begin{array}{c}{O}_{1}\\ {O}_{2}\\ \vdots \\ {O}_{M}\end{array}]=[\begin{array}{cccc}{I}_{1} & {I}_{2} & \ldots & {I}_{N}\end{array}][\begin{array}{cccc}{w}_{11} & {w}_{12} & \ldots & {w}_{1M}\\ {w}_{21} & {w}_{22} & \ldots & {w}_{2M}\\ \vdots & \vdots & \ddots & \vdots \\ {w}_{N1} & {w}_{N2} & \ldots & {w}_{NM}\end{array}]$$

(9)

where [w_ij] is a N × M weight matrix.

To account for weights of either polarity, we represent the weights in two different dot-product engines as shown in Fig. 5(b). We can interpret the weight w_ij to possess a positive and negative component:

$${w}_{ij}={w}_{ij}^{+}-{w}_{ij}^{-}$$

(10)

$${w}_{ij}^{-}=|{w}_{ij}|,{w}_{ij}^{+}=\mathrm{0,}\,{\rm{when}}\,{w}_{ij} < 0$$

(11)

$${w}_{ij}^{+}={w}_{ij},{w}_{ij}^{-}=0,\,{\rm{when}}\,{w}_{ij} > 0$$

(12)

This gives us two matrices ${W}^{+}=[{w}_{ij}^{+}]$ and ${W}^{-}=[{w}_{ij}^{-}]$. These matrices are represented in the dot-product engines such that they return the corresponding dot products:

$${O}_{j}^{+}=\sum _{i}{I}_{i}{w}_{ij}^{+}$$

(13)

$${O}_{j}^{-}=\sum _{i}{I}_{i}{w}_{ij}^{-}$$

(14)

The positive and negative integrating ring resonators in the proposed neuron take these inputs separately and integrate in opposite direction such that the resulting integration mimics the desired integration that a biological neuron performs, given by Eqn. 7 because ${\sum }_{i}\,{I}_{i}{w}_{ij}=\sum _{i}{I}_{i}{w}_{ij}^{+}-\sum _{i}{I}_{i}{w}_{ij}^{-}$. The resulting membrane potential is fed to a Firing Unit as described in Fig. 2(a). A behavorial model of the SNN inferencing framework described above was simulated using the MATLAB Deep Learning Toolbox²⁵ using a well-explored network topology²³. Figure 5(c) shows the progression of the membrane potential of the proposed integrate-and-fire neuron in the hidden layer of the simulated SNN under the action of weighted incident spikes with time. The magnitude of the weighted incident spikes is essentially equal to ${\sum }_{i}\,{I}_{i}\,[t]\,{w}_{ij}$ for the j^th neuron at time-step t. It can be observed that once the membrane potential of the neuron reaches its threshold, it goes back to its rest potential. In the process, it generates a spike that gets fed to the next layer. The same integration process happens in case of the output layer neurons as well and the spike activities of all the neurons are monitored. The 10 output layer neurons correspond to the 10 classes of image being classified. The neuron with the highest spiking activity over a number of time-steps is compared with the test image label and if it matches with the neuron number, the image is classified correctly. This device to system level analysis helps us validate the operation of the proposed integrate-and-fire neuron. The accuracy of recognition was calculated to be 98.06% after 25 time-steps on the testing set. The accuracy suffers a 0.24% degradation with respect to the testing accuracy (98.3%) of a SNN based on an ideal integrate-and-fire neurons. This can be attributed to the non-linear transmission characteristics shown in Fig. 4 and the dependence of the final state on the initial state of the device. Such device inaccuracies can be accounted for by modifying the training algorithm²⁶.

The important metrics for performance evaluation on a neuromorphic hardware system are energy efficiency and speed. To that effect, the energy and delay performance of the proposed neuron merits discussion. Each ‘write’ cycle is considered to be 1.5 ns and each ‘read’ cycle for the proposed neuron was considered to be 500 ps. The durations of the ‘read’ and ‘write’ pulses were 200 ps. The additional times in the ‘write’ and ‘read’ cycles is to ensure that the GST temperature settles to its initial value after the excitation. The ‘write’ times are constrained by the transient response of GST to an amorphization pulse, which is shown to achieve times as low as 200 ps, experimentally⁶ when excited with 1 ps pulses. The average energy of a ‘write’ step considered for the simulation of the neural network was 4 pJ per neuron per time-step whereas the average ‘read’ energy was 1 pJ per neuron per time-step. The energy consumption in the ‘write’ cycles of the neuron can be further reduced by optimizing the feature size of the GST element. PCM devices of similar feature sizes^27,28 in the electrical domain can consume upto 14–19 pJ of ‘write’ energy while operating at speeds of 40–100 ns. Writing into the GST through evanscent coupling with photonic waveguides thus achieves a higher energy efficiency and speed, thus promising to rekindle the viability of PCMs for fast neuromorphic processing.

Discussion

Neuromorphic engineering has evolved heavily from its dawn as researchers have explored various kinds of technologies to mimic the functionality of the brain on an energy-efficient hardware platform. In the electrical domain, such technologies have been demonstrated to possess limitations such as speed, energy, process integration etc. Phase change materials, in particular, have hit the scaling bottleneck where further improvements in energy-efficiency would require reducing ‘write’ speeds significantly. To beat CMOS in terms of energy-efficiency a 10× reduction⁵ in current pulse amplitude or increase in pulse duration is necessary. As a solution, we propose an all-photonic integrate-and-fire neuron based on the phase change dynamics of GST which promises to achieve ‘write’ speeds of sub-ns orders. To the best of our knowledge, this is the first demonstration of a biologically plausible spiking neuron in the photonic domain involving phase change materials. We also showed that the proposed neuron can be potentially integrated with synapses in an all-photonic spiking neural network inferencing framework without any significant drop in classification performance. The proposed design opens up a host of possibilities for future implementations of all-photonic SNNs. By modulating the resonant wavelength by varying dimensions offers us the opportunity of wavelength multiplexing in an all-photonic spiking neural network. This offers substantial benefits such as elimination of cross-talk between neighboring neural elements thus allowing the provision of a denser network and in addition, could possibly allow us to implement larger networks on the same chip. With the recent advances in Photonic Neuromorphic, the proposed integrate-and-fire neuron fills the void of an all-photonic neuron that can be interfaced with photonic synapses¹⁰ to build a truly integrated all-photonic neuromorphic system that leverages the aforementioned advantages of photonic devices to perform ultrafast neuromorphic computation.

References

Hodgkin, A. L. & Huxley, A. F. A quantitative description of membrane current and its application to conduction and excitation in nerve. J. Physiol. 117, 500–544, https://doi.org/10.1113/jphysiol.1952.sp004764 (1952).
Article PubMed PubMed Central CAS Google Scholar
Stein, R. Some models of neuronal variability. Biophys. J. 7, 37–68, https://doi.org/10.1016/s0006-3495(67)86574-3 (1967).
Article ADS PubMed PubMed Central CAS Google Scholar
Tuma, T., Pantazi, A., Gallo, M. L., Sebastian, A. & Eleftheriou, E. Stochastic phase-change neurons. Nat. Nanotechnol. 11, 693–699, https://doi.org/10.1038/nnano.2016.70 (2016).
Article ADS PubMed CAS Google Scholar
Sengupta, A. & Roy, K. Encoding neural and synaptic functionalities in electron spin: A pathway to efficient neuromorphic computing. Appl. Phys. Rev. 4, 041105 (2017).
Article ADS CAS Google Scholar
Rajendran, B. et al. Specifications of nanoscale devices and circuits for neuromorphic computational systems. IEEE Transactions on Electron Devices 60, 246–253 (2013).
Article ADS Google Scholar
Stegmaier, M., Rios, C., Bhaskaran, H., Wright, C. D. & Pernice, W. H. P. Nonvolatile all-optical 1 × 2 switch for chipscale photonic networks. Adv. Opt. Mater. 5, 1600346, https://doi.org/10.1002/adom.201600346 (2016).
Article CAS Google Scholar
Pernice, W. H. & Bhaskaran, H. Photonic non-volatile memories using phase change materials. Appl. Phys. Lett. 101, 171101 (2012).
Article ADS CAS Google Scholar
Rios, C. et al. Integrated all-photonic non-volatile multi-level memory. Nat. Photonics 9, 725–732, https://doi.org/10.1038/nphoton.2015.182 (2015).
Article ADS CAS Google Scholar
Rodriguez-Hernandez, G., Hosseini, P., Ros, C., Wright, C. D. & Bhaskaran, H. Mixed-mode electro-optical operation of ge2sb2te5 nanoscale crossbar devices. Adv. Electron. Mater. 3, 1700079, https://doi.org/10.1002/aelm.201700079 (2017).
Article CAS Google Scholar
Cheng, Z., Ros, C., Pernice, W. H., Wright, C. D. & Bhaskaran, H. On-chip photonic synapse. Sci. adv. 3, e1700160 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Tait, A. N., Nahmias, M. A., Shastri, B. J. & Prucnal, P. R. Broadcast and weight: an integrated network for scalable photonic spike processing. J. Light. Technol. 32, 3427–3439 (2014).
Article ADS Google Scholar
Fok, M. P., Tian, Y., Rosenbluth, D. & Prucnal, P. R. Asynchronous spiking photonic neuron for lightwave neuromorphic signal processing. Opt. Lett. 37, 3309–3311 (2012).
Article ADS PubMed CAS Google Scholar
Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441–446, https://doi.org/10.1038/nphoton.2017.93 (2017).
Article ADS CAS Google Scholar
Tait, A. N. et al. Neuromorphic photonic networks using silicon photonic weight banks. Sci. Rep. 7, https://doi.org/10.1038/s41598-017-07754-z (2017).
Chen, Y. et al. Engineering the phase front of light with phase-change material based planar lenses. Sci. Rep. 5, https://doi.org/10.1038/srep08660 (2015).
Voshchinnikov, N. V., Videen, G. & Henning, T. Effective medium theories for irregular fluffy structures: aggregation of small particles. Appl. Opt. 46, 4065, https://doi.org/10.1364/ao.46.004065 (2007).
Article ADS PubMed Google Scholar
Lyeo, H.-K. et al. Thermal conductivity of phase-change material Ge2Sb2Te5. Appl. Phys. Lett. 89, 151904 (2006).
Article ADS CAS Google Scholar
Sebastian, A., Gallo, M. L. & Krebs, D. Crystal growth within a phase change memory cell. Nat. Commun. 5, https://doi.org/10.1038/ncomms5314 (2014).
Comsol. Multiphysics Reference Guide for COMSOL 4.2 www.comsol.com (2011).
Lumerical. Lumerical Inc. http://www.lumerical.com/tcad-products/fdtd/ (2017).
MNIST handwritten digit database. http://yann.lecun.com/exdb/mnist/.
Hecht-Nielsen, R. Theory of the backpropagation neural network. In Neural networks for perception, 65–93 (Elsevier, 1992).
Diehl, P. U. et al. Fast-classifying, high-accuracy spiking deep networks through weight and threshold balancing. In Neural Networks (IJCNN), 2015 International Joint Conference on, 1–8 (IEEE, 2015).
Nair, V. & Hinton, G. E. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th international conference on machine learning (ICML-10), 807–814 (2010).
Palm, R. B. Prediction as a candidate for learning deep hierarchical models of data. Tech. Univ. Denmark 5 (2012).
Chakraborty, I., Roy, D. & Roy, K. Technology aware training in memristive neuromorphic systems based on non-ideal synaptic crossbars. arXiv preprint arXiv:1711.08889 (2017).
Lee, B. C., Ipek, E., Mutlu, O. & Burger, D. Architecting phase change memory as a scalable dram alternative. In ACM SIGARCH Computer Architecture News, 37, 2–13 (ACM, 2009).
Wong, H.-S. P. et al. Phase change memory. Proc. IEEE 98, 2201–2227 (2010).
Article Google Scholar
Aspnes, D. E. & Studna, A. Dielectric functions and optical parameters of Si, Ge, GaP, GaAs, GaSb, InP, InAs and InSb from 1.5 to 6.0 ev. Physical review B 27, 985 (1983).
Article ADS CAS Google Scholar
Malitson, I. Interspecimen comparison of the refractive index of fused silica. Josa 55, 1205–1209 (1965).
Article ADS CAS Google Scholar
Kim, S.-Y., Kim, S. J., Seo, H. & Kim, M. R. Variation of the complex refractive indices with sb-addition in ge-sb-te alloy and their wavelength dependence. In Optical Data Storage'98, 3401, 112–116 (International Society for Optics and Photonics, 1998).
Gallo, M. L., Athmanathan, A., Krebs, D. & Sebastian, A. Evidence for thermally assisted threshold switching behavior in nanoscale phase-change memory cells. J. Appl. Phys. 119, 025704, https://doi.org/10.1063/1.4938532 (2016).
Article ADS CAS Google Scholar
Njoroge, W. K., Wöltgens, H.-W. & Wuttig, M. Density changes upon crystallization of Ge2 Sb2.04Te4.74 films. J. Vac Sci. & Technol. A: Vacuum, Surfaces and Films 20, 230–233, https://doi.org/10.1116/1.1430249 (2002).
Article ADS CAS Google Scholar

Download references

Acknowledgements

The work was supported in part by, ONR-MURI program, the National Science Foundation, Intel Corporation and by the DoD Vannevar Bush Fellowship.

Author information

Authors and Affiliations

Purdue University, School of Electrical and Computer Engineering, West Lafayette, IN, 47907, USA
Indranil Chakraborty, Gobinda Saha, Abhronil Sengupta & Kaushik Roy

Authors

Indranil Chakraborty
View author publications
You can also search for this author in PubMed Google Scholar
Gobinda Saha
View author publications
You can also search for this author in PubMed Google Scholar
Abhronil Sengupta
View author publications
You can also search for this author in PubMed Google Scholar
Kaushik Roy
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.C. and K.R. conceived the study. I.C. conceived the necessary simulations, I.C. and G.S. conducted the simulations, I.C., G.S. and A.S. analyzed the results. All authors reviewed the manuscript.

Corresponding author

Correspondence to Indranil Chakraborty.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chakraborty, I., Saha, G., Sengupta, A. et al. Toward Fast Neural Computing using All-Photonic Phase Change Spiking Neurons. Sci Rep 8, 12980 (2018). https://doi.org/10.1038/s41598-018-31365-x

Download citation

Received: 05 April 2018
Accepted: 17 August 2018
Published: 28 August 2018
DOI: https://doi.org/10.1038/s41598-018-31365-x

This article is cited by

Reconfigurable optical logic in silicon platform
- M. A. Ruhul Fatin
- Dusan Gostimirovic
- Winnie N. Ye
Scientific Reports (2024)
Conversion of a single-layer ANN to photonic SNN for pattern recognition
- Yanan Han
- Shuiying Xiang
- Yuechun Shi
Science China Information Sciences (2024)
Photonic matrix multiplication lights up photonic accelerator and beyond
- Hailong Zhou
- Jianji Dong
- Xinliang Zhang
Light: Science & Applications (2022)
Ultracompact photonic integrated content addressable memory using phase change materials
- Md. Ajwaad Zaman Quashef
- Md. Kawsar Alam
Optical and Quantum Electronics (2022)
Research progress in optical neural networks: theory, applications and developments
- Jia Liu
- Qiuhao Wu
- Shengcai Li
PhotoniX (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.