Article

# High-speed and energy-efficient non-volatile silicon photonic memory based on heterogeneously integrated memresonator

Received: 27 May 2023

Accepted: 3 January 2024

Published online: 16 January 2024

Check for updates

Bassem Tossoun  $\mathbb{O}^1 \boxtimes$ , Di Liang  $\mathbb{O}^{1,2}$ , Stanley Cheung  $\mathbb{O}^1$ , Zhuoran Fang<sup>1</sup>, Xia Sheng<sup>1</sup>, John Paul Strachan  $\mathbb{O}^{1,3}$  & Raymond G. Beausoleil<sup>1</sup>

Recently, interest in programmable photonics integrated circuits has grown as a potential hardware framework for deep neural networks, quantum computing, and field programmable arrays (FPGAs). However, these circuits are constrained by the limited tuning speed and large power consumption of the phase shifters used. In this paper, we introduce the memresonator, a metaloxide memristor heterogeneously integrated with a microring resonator, as a non-volatile silicon photonic phase shifter. These devices are capable of retention times of 12 hours, switching voltages lower than 5 V, and an endurance of 1000 switching cycles. Also, these memresonators have been switched using 300 ps long voltage pulses with a record low switching energy of 0.15 pJ. Furthermore, these memresonators are fabricated on a heterogeneous III-Von-Si platform capable of integrating a rich family of active and passive optoelectronic devices directly on-chip to enable in-memory photonic computing and further advance the scalability of integrated photonic processors.

Over recent years, the demand for high-performance computers (HPCs) capable of efficiently running artificial intelligence applications has grown dramatically. The number of programs which use deep learning training has doubled every 3.5 months, which is much faster than the rate of performance doubling predicted by Moore's law<sup>1</sup>. In addition, learning algorithms are required to be executed in real-time on a massive amount of data produced by the plethora of interconnected smart devices within the Internet of Things (IoT) and edge computing.

Today, AI algorithms utilized by applications such as autonomous driving vehicles and Amazon's Alexa, are implemented using neural networks (NNs), a model inspired by the neuro-synaptic network within the human brain, which is the most energy-efficient computer-to-human knowledge (able to process 10 petaflops of data with only 20 W of power)<sup>2</sup>. The most commonly used hardware for running NNs includes application-specific integrated circuits (ASICs), graphics processing units (GPUs), and field-programmable gate arrays (FPGAs). Current state-of-the-art electronic accelerators consume about 0.5 pJ in processing a single multiply-accumulate (MAC) operation, the most fundamental neural network calculation<sup>3</sup>.

While conventional microelectronic processor performance progressed in line with Moore's Law as transistor density increased and multi-core processors developed, they are still fundamentally limited in both speed and power. Joule heating and the charging of metal wires involved in the movement of data constrain the operating speed and dominate the power consumption within electronic neural network hardware<sup>4</sup>. To exacerbate this issue even further, the von Neumann bottleneck and the "memory wall" restrict the bandwidth of data communications between the processor and the memory. Furthermore, digital processing units are also bottlenecked by the clock rate of the processor when computing multiply-accumulate (MAC) operations, the most fundamental neural network calculation<sup>5</sup>.

Fortunately, several technological breakthroughs over the last few decades have opened novel opportunities to battle these challenges. Silicon photonics offers a promising solution to dramatically improve the bandwidth and energy efficiency of interconnects for data communications applications including data centers and HPCs<sup>6</sup>. Most recently, silicon photonics has not only been used for data communications, but for non-von Neumann accelerators used for applications

<sup>1</sup>Hewlett Packard Labs, Hewlett Packard Enterprise, Santa Barbara, CA, USA. <sup>2</sup>Present address: University of Michigan, Department of Electrical and Computer Engineering, Ann Arbor, MI, USA. <sup>3</sup>Present address: PGI-14, Forschungszentrum Jülich GmbH, Aachen, Germany. 🖾 e-mail: <u>bassem.tossoun@hpe.com</u>

such as deep learning<sup>7-12</sup>. Some of the inherent properties of photonics make it a suitable platform for neuromorphic computing such as its high bandwidth of data transmission and parallel operation enabled by unique multiplexing schemes like wavelength division multiplexing (WDM). Furthermore, the processing time scale of a photonic neuron is within picoseconds, which is orders of magnitude higher than that of its electronic counterparts<sup>13</sup>.

Because running a task on a deep neural network often can take a significant amount of time, there is a significant benefit to having nonvolatile memory on-chip as it eliminates the static power consumption in holding weight values throughout an inference task. Onchip memory also prevents the need to retrieve results stored on a separate memory chip in between epochs or training steps. In addition, nonvolatile photonic memory is not only useful for data storage but also as part of the computational algorithms running on photonic neuromorphic computers<sup>4</sup>. More specifically, high-speed and lowpower nonvolatile photonic phase shifters are essential in enabling a larger variety of machine-learning methods to be executed on integrated optical neural networks. For example, deep neural networks utilizing online training with algorithms such as backpropagation require synaptic weights to be updated frequently. These on-the-fly learning algorithms are scalable, memory-efficient, and can even be used to circumvent the losses compounded by the device imperfections within photonic neural networks as they scale in size and complexity<sup>14,15</sup>.

One viable solution to supplying a fast, low-power, nonvolatile memory is the memristor (also commonly referred to as resistive random-access memory or RRAM) which was theoretically proposed by Leon Chua and experimentally demonstrated by HP Labs<sup>16,17</sup>. Memristors (also commonly referred to as resistive random-access memory or RRAM) have proven to be excellent nonvolatile electronic memory devices with high switching speed (~100 ps), low energy switching (~100 fJ), endurance ( $10^6-10^8$  cycles), and high density<sup>5,18-22</sup>.

In this work, we integrated metal-oxide-based memristive devices within III-V/Si microring resonators to produce memresonators, an energy-efficient analogue nonvolatile memory on a highly scalable and versatile heterogeneous silicon photonic platform well-suited for integrated photonic information processing circuits. By changing the resistance state of the memristor, we can subsequently tune the optical phase within the waveguide and alter the resonant wavelength of the device. Analogue device operation was shown through the measurement of multiple optical states. Performance records, including retention times of 12 h, an endurance of 1000 switching cycles, switching times as low as 300 ps for SET and 900 ps for RESET, and switching energies of 0.15 pJ for SET and 0.36 pJ for RESET, are demonstrated.

By integrating these memristors on the same chip as photonic neural networks, for example, significant amounts of energy and latency can be saved by avoiding energy lost in the transfer of data from the processor to an external memory chip. Moreover, using these memresonators, weights within photonic neural networks can be stored and updated at high speeds and low energy, enabling the use of the back-propagation algorithm and the ability to train the network onchip. Finally, this III–V-on-silicon photonic memristive device is based on the same technology developed for a fully active (including optical gain) and passive integrated photonic platform on silicon for largebandwidth, energy-efficient optical interconnect applications<sup>23</sup>. In fact, the first generation of a heterogeneous III–V-on-silicon technology has been successfully commercialized by Intel in their 300 mm CMOS production line to enable on-chip lasers for over 2 million optical transceiver units each year<sup>24,25</sup>.

#### Results

#### Device design and fabrication

As schematically shown in Fig. 1a, heterogeneous III–V/Si microring resonators (MRRs) of varying radii between  $10 \,\mu\text{m}$  and  $25 \,\mu\text{m}$  were fabricated on a silicon-on-insulator (SOI) substrate with a  $2 \,\mu\text{m}$ -thick buried oxide layer and a 300 nm-thick top silicon layer. GaAs epitaxial device layers are transferred to a 100 mm Si-on-insulator (SOI) substrate by an O<sub>2</sub> plasma-assisted direct wafer bonding process<sup>26</sup>. About



Fig. 1 | Device schematics and images. a 3D-view and cross-schematic view of a memristive III–V-on-Silicon microring resonator. b Simulated fundamental TE mode field intensity within the microring waveguide at 1310 nm. c Scanning

electron microscopy (SEM) cross-sectional image of memresonator. **d** Transmission electron microscopy (TEM) cross-sectional image of memresonator. **e** TEM image of a bonded GaAs-Al<sub>2</sub>O<sub>3</sub>-Si memristor. 10 nm of Al<sub>2</sub>O<sub>3</sub> was grown on both the GaAs and Si substrates using atomic layer deposition (ALD) before they were bonded together to form the resistive-switching oxide. A n-GaAs/Al<sub>2</sub>O<sub>3</sub>/p-Si semiconductor-insulator-semiconductor (SIS) stack is embedded within the microring resonator for high-speed optical signal modulation through carrier accumulation and the plasma dispersion effect<sup>27</sup>. This device can then be resistively switched like a memristor, thereby producing a memresonator, or a memristor integrated with a microring resonator, as will be discussed in further detail in the next section.

An air trench is then formed on the Si device layer with a ~170 nm waveguide rib etch depth, prior to wafer bonding in order to confine the memristor device area only to the fundamental TE mode and to minimize the area for high-speed and energy-efficient charging and discharging. The bus and ring waveguides within the microring resonators are 500 nm wide each and are separated by 200 nm at the coupling section. Figure 1b is an optical simulation showing the fundamental TE mode within the memresonator waveguide. Transmission electronic microscopy (TEM) images of the fully fabricated memresonator cross section and memristor material stack are shown in Fig. 1d, e, respectively. As seen in Fig. 1a, electrodes are placed on the 150-nm-thick n-type GaAs contact layer and the 300 nm p-type Si contact layer to apply an electrical field across the oxide material. Since semiconductor materials are sandwiching the resistive-switching oxide, these memristors can be integrated within optical waveguides while adding only about 0.05 dB of insertion loss (see Supplementary Note 2), achieving much lower optical loss than with purely metal electrodes typically used in electronic memristors.

#### Working mechanism

As mentioned in the previous section, a memristor is formed using n-type GaAs and p-type Si sandwiching a thin resistive-switching Al<sub>2</sub>O<sub>3</sub> layer. In order to resistively switch the memristor, a process creating an interchange of oxygen and semiconductor atoms, called "electroforming," must be induced by applying a high enough positive bias voltage across the memristor. The high electric field breaks some of the Al-O bonds causing oxygen atoms to migrate towards the semiconductor regions and leave behind negatively ionized vacancies within the Al<sub>2</sub>O<sub>3</sub> layer. The oxygen vacancies form localized aluminumrich channels, namely conductive filaments (CFs), that allow current to flow and effectively increase the conductivity of the oxide material, setting the device to a low resistance state (LRS)<sup>28</sup>. When a large enough electric field is applied in the opposite direction, it causes a reduction of oxygen vacancies as well as sufficient current flow to catalyze localized Joule heating, rupturing the CFs previously formed and resetting the memristor back to a high resistance state (HRS)<sup>29,30</sup>. Prior studies suggest that a combination of electric field and Joule heating induces the resistive-switching mechanism of Al<sub>2</sub>O<sub>3</sub>-based memristors<sup>31</sup>. When a positive bias (typically lower than the voltage needed for electroforming) is applied again, the CF reforms, and the device switches back to a lower resistance.

As can be seen in Fig. 2a, a schematic of the resistive-switching mechanism within the memristor is shown. Oxygen vacancies are formed after electroforming, and they can be ruptured and reconnected through subsequent set and reset cycles. Figure 2b, c visually portrays the carrier dynamics within the III–V/Si memristor-integrated waveguide when the memristor is in the LRS and the HRS. Figure 2d shows the current–voltage characteristics of the device which shows a hysteresis-type curve confirming its operation as a memristor. The voltage was swept from 0 to 10 V and back down to 0 V, and then from 0 V to -5 V and back to 0 V to observe the hysteresis effect in the I–V characteristics. The compliance current,  $I_{CC}$ , was initially set to  $50 \,\mu$ A in the forward direction and 1 mA in the reverse direction in order to prevent the device from permanent breakdown and physical damage. Typically, the device had less than 10 nA of DC leakage current in the HRS state due to the high quality of the Al<sub>2</sub>O<sub>3</sub> (Supplementary Note 3).

The leakage current is mostly due to trap-assisted tunneling through deep-level traps within the  $Al_2O_3$  layer. The electroforming step in the memristor occurs at 9 V, the set voltage occurs at around 5 V, and the reset voltage occurs around -4 V.

As can be seen in the current-voltage characteristics in Fig. 2d, the device can also be switched to an intermediate resistance state (IRS) with a resistance between the LRS and HRS by adjusting the current compliance of the measurement equipment to a value between the compliance used for the HRS and LRS. The device can also be set to multiple intermediate in this way, displaying the possibility of using these devices for analogue computing. For example, while the device is in the HRS, it can be switched to the IRS by applying a current compliance and can be switched to a LRS by applying a higher current compliance. Since a lower current compliance is applied, it physically limits the growth of the conductive filament, thereby also limiting the device resistance<sup>32</sup>. Moreover, when the memresonator is set to a low or intermediate resistance state, we found that the conduction in the memristor is observed to be diode-like, which resembles the ideal diode equation,  $I \propto [e^{qV} - 1]$ . Since the resistive-switching oxide acts as a non-degenerate semiconductor material, and each semiconductor contact layer is p- and n-doped, the device essentially acts like a p-i-n diode in which excess electrons flow from the n-type GaAs to the p-type Si and excess holes flow in the opposite direction<sup>33</sup>. The device begins behaving similarly to a carrier injection type modulator in which majority carriers are injected into the CF and drift from one contact region to the other through the CF. In Fig. 2c, a schematic diagram of this process is shown, displaying electrons being injected from the n-GaAs to the p-Si and holes being injected from the p-Si to the n-GaAs through the CF while the memristor is in the LRS.

#### **Device characteristics**

As shown in Fig. 2e, switching the memresonator between the LRS, IRS, and the HRS subsequently switches its resonance wavelength. The insertion loss was measured to be about 0.047 dB in the HRS and 0.048 dB in the LRS (Supplementary Note 2). The 20-um diameter memresonator achieves about a 0.08 nm or about a  $0.18\pi$  phase shift (see Supplementary Note 5) in the LRS, leading to an estimated  $V_{\pi}L$  of  $2 V \times 0.35 \text{ mm} = 0.7 \text{ mm}$ . The effective refractive index and phase shift as a function of voltage is plotted in Supplementary Fig. 4. After setting the memresonator to the IRS or LRS, the device resonates at the same wavelength until it is reset back to the HRS. In Fig. 3b, e, the resistance in the HRS, IRS, and LRS and the optical power being transmitted through the memresonator at  $\lambda_{HRS}$ ,  $\lambda_{IRS}$ , and  $\lambda_{LRS}$  was measured for 12 h. This measurement demonstrates the non-volatility of this optoelectronic memory device. Drifting in the temperature stability of the setup was observed, which can be mitigated using a temperaturecontrolled stage. The device also demonstrated repeatability and excellent endurance as it was cycled 1000 times between states using voltage pulses (Fig. 3c). Figure 3f shows the resistance of the HRS, IRS, and LRS, and demonstrates a stable HRS/LRS resistance ratio of about 10<sup>3</sup> through 1000 switching cycles.

To test the switching speed and energy of these devices, an arbitrary waveform generator was used to generate voltage pulses used for reading and writing the memresonator (see "Methods" for experiment details). The output optical power at the resonant wavelength of the memresonator was monitored as the device was being switched (Fig. 4b, c). These measurements demonstrate the ability to quickly write and read data from the device with ultralow energy. The switching energy is as low as 0.15 pJ, which is more than 30× smaller than the record switching energy for photonic nonvolatile memory devices<sup>34</sup>. After the device was SET using a write voltage pulse, a read voltage pulse was applied to read the optical power transmitted through the memresonator at the resonant wavelength as well as the read current of the memristor, which was  $2.5 \,\mu$ A. The normalized transmitted power after the device was SET was about 0.27. The energy



Fig. 2 | Illustration of device working mechanism and fundamental device characteristics. a Schematic diagram of the process of forming and rupturing conductive filaments (CFs) within the memristor.  $V_{SET}$  is the voltage applied to set the memristor to the IRS or LRS.  $V_{RESET}$  is the voltage applied to reset the memristor to the HRS. **b** Schematic diagram of the carrier distribution within the waveguide while a read voltage,  $V_{READ}$ , is applied to the memristor in the HRS. **c** Schematic

consumed to read the memresonator after the write cycle was about 5 fJ.

Afterward, the device was RESET using an erase pulse, with a switching energy of 0.36 pJ. Then a read voltage pulse was applied to read the transmission power of the memresonator and the read current of the memristor, which was around 1 nA. The normalized optical power transmitted through the memresonator at the resonant wavelength after the device was SET was about 0.1. There is a small blueshift in the resonant wavelength in the HRS when the read voltage is applied, explaining why there is a small amount of power being transmitted read power is nearly 3× times smaller than when the device has been SET. Also, the energy consumption of reading the memresonator after the erase cycle was about 2 aJ. Most importantly, zero static power is consumed in between read and write cycles as energy is only spent during the read and write operations.

The measured switching speed of these devices is over two orders of magnitude faster than the fastest nonvolatile photonic phase shifters and is comparable to all-electronic metal-insulator-metal (MIM) memristor devices<sup>35,36</sup>. Furthermore, electronic memristors made from a similar material stack (Si/SiO<sub>2</sub>/Si) have previously shown a SET speed of 7.6  $\mu$ s and a RESET speed of 490  $\mu$ s<sup>33</sup>. Ultimately, the switching speed

diagram of the carrier distribution within the waveguide while a read voltage is applied to the memristor in the IRS or LRS. **d** Current–voltage characteristic of the device displaying the hysteresis signature of a memristor. Current compliances of 1  $\mu$ A, 50  $\mu$ A, and 100  $\mu$ A were used in the forward bias voltage direction in order to set the device into different resistance states. **e** Optical spectrum of the memre-sonator while a 2 V read voltage is applied in different states.

is limited by a few factors: the atomistic processes within the memristor stack, the time constant associated with Joule heating causing the conductive filament to rupture, and parasitic capacitances<sup>37</sup>.

#### Discussion

Resistive-switching elements such as memristors have been used for analogue computing for several years. While these electronic memristors can be integrated at high densities within crossbar arrays and switched at high speeds, there exists a trade-off between bandwidth and the total size of the crossbar array. For example, the bandwidth scales inverse proportionally to the size of the crossbar array, meaning that as the size reaches greater than 1 mm<sup>2</sup>, the bandwidth becomes constrained, and the energy cost for off-chip communications can also become problematic. Whereas on a photonic platform, signals can be supported with much greater bandwidth and consume less energy for longer distances than the electrical counterparts. For instance, optical waveguides can be designed with low signal attenuation (< 0.1 dB/cm) and are able to propagate high-power signals without the issues of thermal runaway such as that seen in the Joule heating of electrical wires<sup>38</sup>. Hence, microring-based weight banks and crossbar arrays which perform matrix-vector multiplication operations used for photonic neural networks, as well as for optical content-addressable



**Fig. 3** | **Optical and electrical characteristics of memresonator. a** Normalized transmitted power through the memresonator in the HRS and read current as a function of the read voltage. **b** Wavelength shift of the memresonator in multiple states measured every 5 min over the span of 12 h with a read voltage of 2 V. **c** Wavelength shift of the memresonator in multiple after 1000 set/reset cycles with

a read voltage of 3 V. **d** Normalized transmitted power through the memresonator in the HRS and read current as a function of read voltage. **e** Resistance measurements of the memristor in multiple states monitored for 12 h. **f** Resistance measurements of the memristor in multiple states.



**Fig. 4** | **Switching scheme and measured switching high-speed temporal response of memresonator. a** The typical voltage pulse sequence used to write and erase data from a memristor. **b** Plotted is the normalized transmitted power (left *y* axis) at the resonant wavelength of the memresonator as a function of time during a voltage pulse sequence. The voltage of the input pulse sequence is

memory, can be fashioned out of this platform using these nonvolatile photonic phase shifters<sup>9,39,40</sup>. Furthermore, these types of circuits can potentially achieve larger scales with much higher efficiency than their electronic counterparts.

Table 1 compares the characteristics for different implementations of phase shifters on silicon photonic platforms used for neural



networks and optical FPGAs. Typically, thermo-optic phase shifters are used as weights within photonic neural networks, however, since they lack high-speed programming capabilities and nonvolatile memory capabilities, they waste tens of mWs per unit of static power over the span of an inference task and with each weight update cycle. They can also easily cause thermal crosstalk which limits integration density and

|                  | Thermo- optic <sup>41,42,54-56</sup> | Charge- trapping <sup>57,58</sup> | MEMS <sup>43,59</sup> | PCM <sup>11,34,47,48,60-63</sup> | BaTiO <sub>3</sub> (BTO) <sup>35,50</sup> | Memresonator (this work) |
|------------------|--------------------------------------|-----------------------------------|-----------------------|----------------------------------|-------------------------------------------|--------------------------|
| Switching speed  | 2.4 µs                               | > 350 ms                          | ~1 µs                 | <100 ns                          | <1 ms                                     | <1 ns                    |
| Switching energy | 30.5 nJ                              | 11.4–17.2 pJ                      | 0.2 nJ                | 180 pJ–17 nJ                     | 4.6–26.7 pJ                               | 0.15–0.36 pJ             |
| Retention time   | N/A                                  | 10 years                          | N/A                   | 10 years                         | 10 h                                      | 12 h                     |
| L <sub>n</sub> * | ~10 µm                               | 865 µm                            | ~1 cm                 | 11 µm                            | 1mm                                       | ~350 µm                  |
| Insertion loss   | 0.23 dB                              | ~1 dB                             | 3.5 dB                | ~0.3 dB                          | ~0.1dB                                    | 0.27 dB**                |
| Non-volatility   | No                                   | Yes                               | No                    | Yes                              | Yes                                       | Yes                      |

| Table 1   Implementations of programmable phase shifters on a silicon photonic plat |
|-------------------------------------------------------------------------------------|
|-------------------------------------------------------------------------------------|

 ${}^{*}L_{\pi}$  refers to the length of the phase shifter required to achieve a  $\pi$  phase shift.

"The total insertion loss is measured to be 4 dB at the operating point of interest when including coupling losses.

scale, and control complexity<sup>8,41,42</sup>. More recent demonstrations showed integrated nano-opto-electro-mechanical phase shifters with improved energy efficiency, but were still limited in write speeds (~1  $\mu$ s), require large switching voltages (> 20 V), and have high mechanical failure rates<sup>43-45</sup>.

On the other hand, phase-change materials (PCM) such as Ge<sub>2</sub>Sb<sub>2</sub>Te<sub>5</sub> (GST) and, more recently, Sb<sub>2</sub>Se<sub>3</sub> have been explored extensively as a candidate for nonvolatile memory within silicon PICs with encouraging results<sup>34,46-49</sup>. Recently, they have shown the ability to achieve a  $\pi$  phase shift with less than 12  $\mu$ m of length and reasonably low insertion losses (< 0.3 dB). However, these materials are also limited in writing speed and typically require high input powers (~mW) to heat them long enough to change the phase from amorphous to crystalline. Most recently, BaTiO<sub>3</sub> (BTO) nonvolatile phase shifters have also been demonstrated with multi-level states with a switching energy as low as 4.6 pJ and excellent controllability<sup>35</sup>. While a promising advancement, they require a reset sequence consisting of 10,000 pulses with a duration totaling hundreds of microseconds before switching states. Phase shifters based on BTO also require about a ~1 mm phase shifter length to achieve a  $\pi$  phase shift, which is challenging to scale and achieve high-speed operation.

In this work, we have demonstrated a nonvolatile III–V-on-silicon memresonator used for programmable photonic memory operating at record low switching energy (0.15–0.36 pJ), sub-nanosecond switching times enabling high-speed, energy-efficient in-memory computing within silicon photonic neural networks. These nonvolatile optoelectronic memory devices save a great deal of energy by reducing the power consumption involved in programming phase shifters within photonic integrated circuits. By using short voltage pulses to permanently switch the state of this device, the energy typically lost in idle power consumption is saved throughout the duration of an inference task. For example, after a write pulse is applied to the memresonator, the device will retain its state until another voltage pulse is used to write a different weight value. In this way, it is worth reiterating that no idle power consumption is wasted in between reading or writing the weight value stored within the memresonator.

In addition, these nonvolatile photonic phase shifters can act as weights within silicon photonic neural networks which can be updated in real-time, enabling algorithms such as error backpropagation to be executed directly on-chip, greatly optimizing the acceleration of silicon photonic neural networks. Another significant distinction is that the memory is directly on the same chip as the phase shifter, enabling the capability to perform in-memory photonic computing. This avoids the optical-to-electrical conversion losses involved in going to off-chip memory in between each data set used to train a neural network. For instance, in the gradient descent algorithm, predicted values are subtracted from the actual values of the neural network in between training iterations to calculate the cost function This device can save a substantial amount of energy involved in fetching data typically stored in an external memory chip like static random-access memory (SRAM) or dynamic random-access memory (DRAM) to calculate the cost function in between training iterations. Another area in machinelearning this device may apply to is transfer learning, described as the practice of re-using a pre-trained neural network instead of training one from scratch to reduce latency and save computational resources<sup>50,51</sup>. Given that the weights in the backbone layer are fixed, they would benefit from being stored in on-chip memory such as with these memresonators. Also, these memresonators can simultaneously be used for the trainable portion of the neural network since they are also capable of being switched at high speeds and with low energy.

Lastly, these devices were developed on a heterogeneous III–V-onsilicon platform, which allows for the co-integration of non-linear active optoelectronic devices, such as lasers and modulators, directly on the same chip as a silicon photonic neural network or an optical FPGA<sup>52</sup>. Since these types of photonic integrated circuits do not inherently need to transmit optical signals off-chip, we gain a significant advantage by integrating the light source directly on-chip. This technology can immensely improve the energy efficiency, stability, and scalability of integrated photonic processors, advancing their potential for use in next-generation HPCs and edge computing.

Future designs will feature device and structural design improvements in order to reduce switching voltages and improve the extinction ratio. The total voltage applied across the device distributes over the Al<sub>2</sub>O<sub>3</sub> laver, semiconductor lavers (n-GaAs, p-Si), and metal/ semiconductor contact layers. By reducing the active area of the device and thickness of the Al<sub>2</sub>O<sub>3</sub> layer, optimizing the semiconductor lavers' doping concentrations and thicknesses, and improving the quality of the metal/semiconductor contact interface and the Al<sub>2</sub>O<sub>3</sub> layer, we can reduce the switching voltage<sup>53</sup>. Also, by increasing the critical coupling within the microring resonator, we can achieve a better extinction ratio and a smaller FWHM. We are also working on improving the waveguide losses on our platform, which will also improve the Q and extinction ratio. Also, TEM images will be taken to investigate the conductive filament formation within these devices and study the physical processes behind the switching mechanisms in these devices. These studies will aid in the design of future devices such as the selection of the resistive-switching oxide material.

Another design change will be to integrate a field-effect transistor in series with the memristor to be able to apply voltage pulses on the device with control of the device current. Integrating these devices with a MOS field-effect transistor (MOSFET) in a one-transistor oneresistor (1T1R) configuration to reliably control the current flow in the device without external circuitry. Within a 1T1R configuration, a MOSFET is connected in series with a memristor and is used to limit the current in the memristor by applying a gate voltage on the MOSFET to modulate the channel length and allow only a certain amount of current to flow through the MOSFET channel. This will play a significant role in improving the control of switching by controlling the amount of current able to conduct in the device. It will also enable multiple intermediate resistance states by using different voltage pulse parameters to select different states while the MOSFET protects the device from permanent breakdown with different voltage pulses. Lastly, memristors can also be integrated within Mach-Zehnder interferometers as an alternative form of a nonvolatile phase shifter also

commonly used within silicon photonic neural networks, quantum computing circuits, and FPGAs<sup>53</sup>.

### Methods

Microring resonators with a diameter of  $20 \,\mu$ m were measured on a copper stage with III–V side up. The experimental set-ups for the measurements taken are shown in Supplementary Fig. 1. Electrical measurements were taken with an Agilent B1500A semiconductor device analyzer, including a B1525A HV-SPGU high-voltage pulse generator. GSG RF probes (Cascade Microtech ACP–40) were used to probe the devices and measure the high-speed response. Optical power measurements were taken using a Newport 2936-R optical power meter. The device was designed with input and output grating couplers, which had about 6 dB of loss each at peak transmission. A Santec TSL-510 tunable laser is used to illuminate the input grating coupler with a cleaved fiber. The laser wavelength is swept and the output of the device is measured through the output grating coupler which is coupled to an optical power meter.

To measure the switching speed of the memresonator, we couple light coming from a tunable laser at the resonant wavelength of the memresonator into the input grating coupler. We then apply voltage pulses from a Keysight M8195A Arbitrary Waveform Generator to read and write the memristor, and couple light coming from the output grating coupler into a high-speed photodiode which is then connected to a Tektronix 8 GHz real-time oscilloscope. A 100 ns wide, 2 V amplitude pulse was used to read the memresonator in the retention time and endurance measurements.

# Data availability

All data are available in the main text or the supplementary materials.

#### References

- 1. Amodei, D. & Hernandez, D. Al and Compute, OpenAl, May 16, 2018. https://openai.com/research/ai-and-compute (2023).
- 2. Hsu, J. IBM's new brain [news]. *IEEE Spectr.* **51**, 17–19 (2014).
- 3. Mythic. https://mythic.ai/technology/ (2023).
- Shastri, B. J. et al. Photonics for artificial intelligence and neuromorphic computing. *Nat. Photonics* 15, 102–114 (2021).
- 5. Lanza, M. et al. Standards for the characterization of endurance in resistive switching devices. ACS Nano **15**, 17214–17231 (2021).
- Taubenblatt, M. A. Optical interconnects for high performance computing. In *IEEE Photonic Society 24th Annual Meeting* (IEEE, 2011).
- Ashtiani, F., Geers, A. J. & Aflatouni, F. An on-chip photonic deep neural network for image classification. *Nature* 606, 501–506 (2022).
- Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441–446 (2017).
- 9. Tait, A. N. et al. Neuromorphic photonic networks using silicon photonic weight banks. *Sci. Rep.* **7**, 7430 (2017).
- Shi, B., Calabretta, N. & Stabile, R. InP photonic integrated multilayer neural networks: architecture and performance analysis. *APL Photonics* 7, 010801 (2022).
- Feldmann, J., Youngblood, N., Wright, C. D., Bhaskaran, H. & Pernice, W. H. P. All-optical spiking neurosynaptic networks with selflearning capabilities. *Nature* 569, 208–214 (2019).
- Vlieg, E. A., Talandier, L., Dangel, R., Horst, F. & Offrein, B. J. An integrated photorefractive analog matrix-vector multiplier for machine learning. *Appl. Sci.* 12, 4226 (2022).
- Nahmias, M. A., Tait, A. N., Shastri, B. J., de Lima, T. F. & Prucnal, P. R. Excitable laser processing network node in hybrid silicon: analysis and simulation. *Opt. Express* 23, 26800 (2015).
- Pai, S., Bartlett, B., Solgaard, O. & Miller, D. A. B. Matrix optimization on universal unitary photonic devices. *Phys. Rev. Appl.* 11, 064044 (2019).

- Hughes, T. W., Minkov, M., Shi, Y. & Fan, S. Training of photonic neural networks through in situ backpropagation and gradient measurement. *Optica* 5, 864–871 (2018).
- 16. Chua, L. Memristor—the missing circuit element. *IEEE Trans. Circuit Theory* **18**, 507–519 (1971).
- 17. Strukov, D. B., Snider, G. S., Stewart, D. R. & Williams, R. S. The missing memristor found. *Nature* **453**, 80–83 (2008).
- Torrezan, A. C., Strachan, J. P., Medeiros-Ribeiro, G. & Williams, R. S. Sub-nanosecond switching of a tantalum oxide memristor. *Nanotechnology* 22, 485203 (2011).
- 19. Yang, J. J. et al. High switching endurance in TaOx memristive devices. *Appl. Phys. Lett.* **97**, 232102 (2010).
- Lee, M.-J. et al. A fast, high-endurance and scalable non-volatile memory device made from asymmetric Ta<sub>2</sub>O<sub>5-x</sub>/TaO<sub>2-x</sub> bilayer structures. *Nat. Mater.* **10**, 625–630 (2011).
- Govoreanu, B. et al. 10x10nm2 Hf/HfOx crossbar resistive RAM with excellent performance, reliability and low-energy operation. In 2011 International Electron Devices Meeting, 31.6.1–31.6.4 (IEEE, Washington, DC, USA, 2011).
- 22. Strukov, D. B. & Williams, R. S. Four-dimensional address topology for circuits with stacked multilayer crossbar arrays. *Proc. Natl. Acad. Sci. USA* **106**, 20155–20158 (2009).
- Liang, D. et al. An energy-efficient and bandwidth-scalable DWDM heterogeneous silicon photonics integration platform. *IEEE J. Sel. Top. Quantum Electron.* 28, 1–19 (2022).
- Jones, R. et al. Heterogeneously integrated InP/silicon photonics: fabricating fully functional transceivers. *IEEE Nanotechnol. Mag.* 13, 17–26 (2019).
- 25. Margalit, N. et al. Perspective on the future of silicon photonics and electronics. *Appl. Phys. Lett.* **118**, 220501 (2021).
- Liang, D. & Bowers, J. E. Highly efficient vertical outgassing channels for low-temperature InP-to-silicon direct wafer bonding on the silicon-on-insulator substrate. J. Vac. Sci. Technol. B 26, 9 (2008).
- Srinivasan, S., Liang, D. & Beausoleil, R. G. Heterogeneous SISCAP microring modulator for high-speed optical communication. In 2020 European Conference on Optical Communications (ECOC), 1–3 (IEEE, 2020).
- Waser, R., Dittmann, R., Staikov, G. & Szot, K. Redox-based resistive switching memories—nanoionic mechanisms, prospects, and challenges. *Adv. Mater.* 21, 2632–2663 (2009).
- 29. Sun, W. Understanding memristive switching via in situ characterization and device modeling. *Nat. Commun.* **10**, 13 (2019).
- Zhang, Y. Evolution of the conductive filament system in HfO<sub>2</sub>based memristors observed by direct atomic-scale imaging. *Nat. Commun.* 12, 10 (2021).
- Zhang, X. et al. Effect of Joule heating on resistive switching characteristic in AlOx cells made by thermal oxidation formation. *Nanoscale Res. Lett.* 15, 11 (2020).
- 32. Yang, Y. et al. Observation of conducting filament growth in nanoscale resistive memories. *Nat. Commun.* **3**, 732 (2012).
- Li, C. et al. Three-dimensional crossbar arrays of self-rectifying Si/ SiO<sub>2</sub>/Si memristors. *Nat. Commun.* 8, 15666 (2017).
- 34. Ríos, C. et al. Integrated all-photonic non-volatile multi-level memory. *Nat. Photonics* **9**, 725–732 (2015).
- 35. Geler-Kremer, J. et al. A ferroelectric multilevel non-volatile photonic phase shifter. *Nat. Photonics* **16**, 491–497 (2022).
- Lee, H. Y. et al. Low power and high speed bipolar switching with a thin reactive Ti buffer layer in robust HfO<sub>2</sub> based RRAM. In 2008 IEEE International Electron Devices Meeting (IEEE, 2008).
- Menzel, S., Witzleben, M. V., Havel, V. & Böttger, U. The ultimate switching speed limit of redox-based resistive switching devices. *Faraday Discuss.* 213, 197–213 (2019).
- Nahmias, M. A. et al. Photonic multiply-accumulate operations for neural networks. *IEEE J. Sel. Top. Quantum Electron.* 26, 1 (2020).

- Xiao, X. et al. Wavelength-parallel photonic tensor core based on multi-FSR microring resonator crossbar array. In Optical Fiber Communication Conference (OFC) 2023, San Diego (Optica Publishing Group, 2023).
- London, Y. et al. WDM ternary content-addressable memory for optical links. In 2023 IEEE 18th International Conference on Group IV Photonics (GFP), Arlington, Virginia (IEEE, 2023).
- Heck, M. J. R. Highly integrated optical phased arrays: photonic integrated circuits for optical beam shaping and beam steering. *Nanophotonics* 6, 93–107 (2017).
- Zhong, C. et al. Fast thermo-optical modulators with doped-silicon heaters operating at 2 μm. Opt. Express 29, 23508–23516 (2021).
- Dong, M. et al. High-speed programmable photonic circuits in a cryogenically compatible, visible-near-infrared 200 mm CMOS architecture. *Nat. Photonics* 16, 59–65 (2022).
- Baghdadi, R. et al. Dual slot-mode NOEM phase shifter. Opt. Express 29, 19113–19119 (2021).
- Lee, C. Reliability and failure analysis of MEMS/NEMS switches. In 2016 IEEE 23rd International Symposium on the Physical and Failure Analysis of Integrated Circuits (IPFA) (IEEE, 2016).
- Fang, Z., Chen, R., Zheng, J. & Majumdar, A. Non-volatile reconfigurable silicon photonics based on phase-change materials. *IEEE J. Sel. Top. Quantum Electron.* 28, 1–17 (2021).
- Wuttig, M., Bhaskaran, H. & Taubner, T. Phase-change materials for non-volatile photonic applications. *Nat. Photonics* **11**, 465–476 (2017).
- Wang, J., Wang, L. & Liu, J. Overview of phase-change materials based photonic devices. *IEEE Access* 8, 121211–121245 (2020).
- Fang, Z. et al. Ultra-low-energy programmable non-volatile silicon photonics based on phase-change materials with graphene heaters. Nat. Nanotechnol. 17, 842–848 (2022).
- 50. Bozinovski, S. Reminder of the first paper on transfer learning in neural networks, 1976. *Informatica* **44**, 3 (2020).
- 51. Xiao, X. et al. Large-scale and energy-efficient tensorized optical neural networks on III–V-on-silicon MOSCAP platform. *APL Photonics* **6**, 12 (2021).
- Tossoun, B., Sheng, X., Strachan, J. P., Liang, D. & Beausoleil, R. G. Hybrid silicon MOS optoelectronic memristor with non-volatile memory. In 2020 IEEE Photonics Conference (IPC) (IEEE, 2020).
- 53. Cheung, S. et al. Heterogeneous III-V/Si non-volatile optical memory: a Mach-Zehnder memristor. In *Conference on Lasers and Electro-Optics,* San Jose (Optica Publishing Group, 2022).
- 54. Liu, S. et al. Thermo-optic phase shifters based on silicon-oninsulator platform: state-of-the-art and a review. *Front. Optoelectron.* **15**, 9 (2022).
- 55. Watts, M. R. et al. Adiabatic thermo-optic Mach–Zehnder switch. *Opt. Lett.* **38**, 733–735 (2013).
- 56. Harris, N. C. et al. Efficient, compact and low loss thermo-optic phase shifter in silicon. *Opt. Express* **22**, 10487–10493 (2014).
- Olivares, I., Parra, J. & Sanchis, P. Non-volatile photonic memory based on a SAHAS configuration. *IEEE Photonics J.* 13, 1–8 (2021).
- 58. Song, J.-F. et al. Integrated photonics with programmable nonvolatile memory. Sci. Rep. **6**, 22616 (2016).
- Quack, N. et al. MEMS-enabled silicon photonic integrated devices and circuits. *IEEE J. Quantum Electron.* 56, 1–10 (2020).
- Cheng, Z., Ríos, C., Pernice, W. H. P., Wright, C. D. & Bhaskaran, H. On-chip photonic synapse. Sci. Adv. 3, e1700160 (2017).
- Stegmaier, M., Ríos, C., Bhaskaran, H., Wright, C. D. & Pernice, W. H. P. Nonvolatile all-optical 1 × 2 switch for chipscale photonic networks. *Adv. Opt. Mater.* 5, 1600346 (2017).

- 62. Ríos, C. et al. Ultra-compact nonvolatile phase shifter based on electrically reprogrammable transparent phase change materials. *PhotoniX* **3**, 26 (2022).
- 63. Capmany, J. & Pérez-López, D. A new change of phase. *Nat. Photonics* **16**, 479–480 (2022).

# Acknowledgements

The authors want to thank Thomas Van Vaerenbergh, Marco Fiorentino, Sagi Mathai, and Sri Priya Sundararajan for the insightful discussions and input towards this paper. The authors acknowledge that they received no funding in support for this research.

# **Author contributions**

B.T. conceived the project and D.L. and R.G.B. supervised the project. B.T., D.L. and Z.F. designed the experiments. B.T. performed the experiments and wrote the manuscript. D.L., S.C., Z.F., X.S., J.P.S. and R.G.B. provided helpful suggestions. S.C. and Z.F. provided theoretical analysis. B.T. and Z.F. provided graphics and plots. All authors contributed to the preparation of the manuscript.

### **Competing interests**

The authors declare no competing interests.

# **Additional information**

**Supplementary information** The online version contains supplementary material available at https://doi.org/10.1038/s41467-024-44773-7.

**Correspondence** and requests for materials should be addressed to Bassem Tossoun.

**Peer review information** *Nature Communications* thanks Ryan Hamerly and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

# **Reprints and permissions information** is available at http://www.nature.com/reprints

**Publisher's note** Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

**Open Access** This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/ licenses/by/4.0/.

© The Author(s) 2024