Electronic-photonic arithmetic logic unit for high-speed computing

Ying, Zhoufeng; Feng, Chenghao; Zhao, Zheng; Dhar, Shounak; Dalir, Hamed; Gu, Jiaqi; Cheng, Yue; Soref, Richard; Pan, David Z.; Chen, Ray T.

doi:10.1038/s41467-020-16057-3

Download PDF

Article
Open access
Published: 01 May 2020

Electronic-photonic arithmetic logic unit for high-speed computing

Nature Communications volume 11, Article number: 2154 (2020) Cite this article

11k Accesses
94 Citations
4 Altmetric
Metrics details

Subjects

Abstract

The past two decades have witnessed the stagnation of the clock speed of microprocessors followed by the recent faltering of Moore’s law as nanofabrication technology approaches its unavoidable physical limit. Vigorous efforts from various research areas have been made to develop power-efficient and ultrafast computing machines in this post-Moore’s law era. With its unique capacity to integrate complex electro-optic circuits on a single chip, integrated photonics has revolutionized the interconnects and has shown its striking potential in optical computing. Here, we propose an electronic-photonic computing architecture for a wavelength division multiplexing-based electronic-photonic arithmetic logic unit, which disentangles the exponential relationship between power and clock rate, leading to an enhancement in computation speed and power efficiency as compared to the state-of-the-art transistors-based circuits. We experimentally demonstrate its practicality by implementing a 4-bit arithmetic logic unit consisting of 8 high-speed microdisk modulators and operating at 20 GHz. This approach paves the way to future power-saving and high-speed electronic-photonic computing circuits.

Realization of optical logic gates using on-chip diffractive optical neural networks

Article Open access 21 September 2022

On-chip optoelectronic logic gates operating in the telecom band

Article 30 October 2023

Programmable photonic circuits

Article 07 October 2020

Introduction

The breakdown of Dennard scaling in the early 2000s, when the features began to shrink below around 90 nm, led to the stagnation of the maximum clock frequency up to a few GHz due to the overwhelming heat^1,2. Since then, the industry has started to focus on multicore processors as an alternative way to improve performance which continues the Moore’s law, a rule of thumb that dominates computing^3,4. Unfortunately, nowadays Moore’s law has been facing fatal challenges once again as the nanofabrication technique goes to a several-nanometer limit where quantum uncertainties will govern the electron behavior and make transistors unreliable^2,5. This saturation has forced massive researches in industry and academia on various aspects ranging from nanofabrication technology⁶, material science⁷, computing architecture⁸, as well as new computing mechanisms such as quantum computing⁹.

Integrated photonics is poised to revolutionize traditional electrical interconnects with the trend from long-haul communication links down to inter- and intra-chip connections^{10,11,12,13,14}. It is even possible nowadays to achieve a high-performance optical memory-processor link on the same chip with the newly demonstrated “Zero-change” fabrication technique that tailors photonics devices to be integrated directly with transistors using the mature complementary metal–oxide–semiconductor (CMOS) fabrication line^15,16. As abundant passive and active components mature in integrated photonics^17,18,19,20, electronic-photonic computing (EPC) that uses photons to transport and process information instead of electrons has attracted increasing attention. It includes two directions, analogy computing^21,22,23 and digital computing²⁴ for various applications. Decades of successful practical experience of very-large-scale-integration (VLSI) have proven the indubitable significance of digital circuits for computing due to the advantages of much better noise tolerance as well as power saving.

It is worth mentioning several facts and limits of transistors in VLSI circuits. Firstly, although the power consumption and latency within the logic gates have been reduced over time, those for transporting signals between gates do not scale down in the same way, and that dominates the performance of the entire system^25,26. Secondly, the power consumption of a certain transistor-based circuit has a positive correlation with f³ (f is the clock frequency)²⁷. In other words, the power grows exponentially as the system clock frequency increases. Fortunately, using light to process information enables us to avoid these obstacles potentially. First, light travels much faster on a chip and normally it only takes a subpicosecond to go through a micron-scale gate which is 1–2 orders of magnitude faster than it takes electrons to go through a transistor with several fanouts²⁸. Second, not like the transistors where the voltage required for efficient switching is related to the frequency, optical circuits only consume the power proportional to f. Third, Bosons, photons in this case, have the unique property of not abiding by Pauli exclusion principle, creating multiplexing techniques unique to light such as wavelength division multiplexing (WDM) and polarization division multiplexing, which offer the ability to further scale the computing capacity substantially²⁹.

Here, we propose a WDM-based electronic-photonic arithmetic logic unit (EPALU) for computing at higher speed and with lower power consumption. We begin with a theoretical proposal of a general EPALU architecture, which makes full use of the advantages of electronics and photonics. We experimentally demonstrate the essential part of our proposed architecture by implementing a 4-bit optical carry propagation network (OCPN) operating at 20 GHz. A thorough analysis of its performances, including power efficiency, computation speed, and insertion loss, shows that such an EPC circuit is capable of running at tens of GHz while only consuming 1–2 orders of magnitude less power than state-of-the-art electronic circuits at a high clock rate due to the non-exponential relationship between power and frequency. Lastly, various scaling methods are discussed to enable optical Moore’s law for EPC.

Results

Architecture

An ALU that performs arithmetic and bitwise operations such as add, subtract, increment, compare, and logical operations on integer binary numbers is a fundamental building block of various significant computing modules including the central processing units (CPUs), floating-point units (FPUs), and graphics processing units (GPUs). As a key part in an ALU, a full adder adds two addends and one carry signal travels from the first bit to the last along the critical path that finally determines the total latency and thus the operating speed. The expression for a full adder can be summarized by

$${\it{C}}_{\it{n}} = \left( {{\it{a}}_{\it{n}} \oplus {\it{b}}_{\it{n}}} \right) \cdot {\it{C}}_{{\it{n}} - 1} + {\it{a}}_{\it{n}} \cdot {\it{b}}_ {\it{n}} = {\it{p}}_{\it{n}} \cdot {\it{C}}_{{\it{n}} - 1} + {\it{g}}_{\it{n}}$$

(1)

$${\it{S}}_{\it{n}} = {\it{C}}_{{\it{n}} - {\mathrm{1}}} \oplus \left( {{\it{a}}_{\it{n}} \oplus {\it{b}}_{\it{n}}} \right) = {\it{C}}_{{\it{n}} - {\mathrm{1}}} \oplus {\it{p}}_{\it{n}}$$

(2)

where ${\it{p}}_{\it{n}} = {\it{a}}_{\it{n}} \oplus {\it{b}}_{\it{n}}$ (propagate) and ${\it{g}}_{\it{n}} = {\it{a}}_{\it{n}} \cdot {\it{b}}_{\it{n}}$ (generate). Figure 1a shows the schematic of an n-bit ripple carry adder, where the carry signal propagates through the critical path. Significant effort has been made towards optimizing the performance of a full adder by changing the architectures for the past half century and examples include carry-lookahead adders, carry-save adders, and conditional sum adders (CSAs)³⁰. The basic idea of CSAs, one type of carry-select adders, is to split the N bit full adder (Fig. 1c, upper inset) into n sets of m-bit full adders (Fig. 1c, lower inset) (N = m × n) and thus all these sets are able to perform the calculation simultaneously to reduce the latency. However, the problem is that the carry signals between adjacent sets are unknown before the computing. Therefore, two sets of hardware are adopted here to generate two sets of outputs with the assumption that the incoming carry signal for each set is zero and one, respectively, since these are all possibilities in a binary system, as shown in Fig. 1c (also see Supplementary Note 1). Once the incoming carry is known, we only need to select the correct set of outputs using multiplexers (MUXs) based on the output carry signal from the previous set without waiting for the carry to further propagate through the entire full adder. The disadvantage is that it needs an extra set of full adder circuits, which consumes more power and more precious space on a chip.

**Fig. 1: General architecture of the EPALU.**

Figure 1b shows a general architecture of the electronic-photonic full adder in which the critical path is replaced by an optical route. Light has 1–2 orders of magnitude less latency per gate than the transistors so that it makes ultrahigh-speed computing possible. Furthermore, light beams with different wavelengths or other properties such as polarization can go through the same structure simultaneously and independently. Therefore, the extra identical set of full adder circuits can be eliminated by using two wavelengths in an optical CSA circuit. Incoming carry signals of one and zero can be encoded in two wavelengths and two sets of results are calculated independently.

As shown in Fig. 1e, an N-bit (N normally is 8, 16, 32, 64…) EPALU can be decomposed to n sets of m-bit optical carry propagation network (OCPN) (N = m × n) along with an array of integrated photodetectors and electrical circuits. Its relationship with the general architecture in Fig. 1b regarding each functional block is depicted in Fig. 1d. It is similar to the architecture of electrical CSA shown in Fig. 1c, but here the two sets of circuits with different inputs (0/1) now can be realized by one set of optical routes with two wavelengths and different inputs (0/1) are encoded into these two wavelengths. This decomposition offers two advantages. First, it reduces the total latency, which is based on the performance of optical and electrical gates. Second, it offers a solution to surviving from the loss since there are few efficient and compact integrated-optical amplifiers up till now on some monolithically integration platforms such as the silicon platform. The schematic of the m-bit OCPN is depicted in Fig. 1f, which is generated by an automated logic design algorithm (see Supplementary Note 10) and consists of a network of optical modulators and couplers. Two electro-optic modulators along with a 2-by-2 coupler compose one bit. Beams of continuous wave (CW) with different wavelengths are injected into the circuit. Note that they have two different combinations (the input of the first port, Cin port, is different), which represent the carry signals of one and zero. Within each clock cycle, all the electrical signals will be injected into the modulators simultaneously. The coupling efficiency for the couplers is set at 50% for this purpose and could be adjusted to optimize the performance, which will be discussed hereinafter. The generated carry signals emerge at the output ports after filters and will be received by an array of fast photodetectors followed by a network of electronic multiplexer units (MUXU) and a sum generation unit (SGU) (see Supplementary Notes 1 and 2). The overflow of this architecture is discussed in Supplementary Note 3. The entire EPALU including the electronic and photonic parts can be fabricated monolithically on a single chip using modern nanofabrication technology¹⁵. This EPALU is capable of performing addition, subtract, increment, decrement, compare, bit operation and so forth.

Experiment

We demonstrated the practicality of the EPALU in integrated silicon photonics by experimentally implementing a 4-bit OCPN. The chip was fabricated by AIM Photonics with over 20 fabrication masks in nanolithography³¹. It is composed of eight high-speed microdisk modulators, with several thermo-optic phase shifters and attenuators located along the paths to maintain the power and phase balance, which are not required in the future fine-tuned system. Note that p and g signals will not be logic one simultaneously by nature so that no interference between two strong light beams needs to be taken into consideration in an ideal system. While in a practical system where modulators have limited extinction ratio, the phases at each bit can be adjusted to optimize the output. The details and algorithms are discussed in ref. ³² (see Supplementary Note 9). The microdisk modulator is chosen as the primary active component due to its compact size and low power consumption^20,33. Figure 2a shows the micrograph of the fabricated chip with the dimension of 2 mm × 4 mm. The micrograph of the wire-bonded chip and close-ups of the components are also listed in Fig. 2b.

CW light is coupled into the chip through grating couplers and then be split into several portions to be the light inputs for different bits as shown in Fig. 1f. Pseudorandom non-return-to-zero (NRZ) signals are injected through GSG probes after all the static conditions of all the components have been fine-tuned including the accurate wavelength tuning of the microdisk modulators, which needs to align to the operating wavelength. One part of the result signals will be received by integrated photodetectors and the other part will be coupled out of the chip for testing. It should be noted that with the zero-change fabrication technique, it is possible to integrate all the electrical and photonic components onto a single chip^15,16.

Figure 3a shows several functions the EPALU can achieve, including addition, subtract, increment, decrement, compare, and bit operation such as AND with the help of electrical circuits. Different input CW combinations are able to generate different functions, which provides another degree of freedom to manipulate the circuit versatility of function realizations. Several examples of the realized functions in our experiments are shown in Fig. 3b, including addition, subtract, compare, and increment. Take the addition (time slot X) as an example. The operands A = 0011 and B = 1101 will first go through the PGU, marked as cell 1 in Fig. 3a, to generate the P and G signals. Then they will be fed into OCPN to generate carry signals (1111) as shown in Fig. 3f with the assumption that Cin is 0. Finally, the SGU will convert the carry signals into the sum signals (0000) with the Cout of 1. Assisted by the WDM, two sets of outputs at two different carry input states can be obtained at the same time. According to Fig. 1e, larger bit size EPALU can be realized with the help of extra electrical circuits and an 8-bit case is shown in Fig. 3c, d. The waveforms of the carry output C1–C4 at two different operating wavelengths (~1540 and 1565 nm) are depicted in Fig. 3e, f with the operating speed of 20 Gb/s, which are consistent with the truth tables (see Supplementary Note 11).

Note that this architecture of EPALU is suitable for any kind of modulators such as electro-absorption modulator³⁴, MZI modulators¹⁷, microring modulators³⁵, plasmonics modulators^36,37, graphene modulators³⁸, and so on. Therefore, the following discussion will be carried out more generally and broadly.

Computation speed

The maximum clock rate of the commercial CPUs has stagnated at a few GHz for decades. The proposed EPALU that disentangles the exponential relationship between the frequency and power is promising to escape from the heat death² and breakthrough the computation speed limit. On the other hand, the highly scaled micro-size optical components also enable the light to process information in sub-picoseconds which is 1–2 orders of magnitude faster than electrical gates, leading to a much higher operating speed of the entire circuits. Specifically, as shown in Fig. 1e, the total latency of the EPALU consists of the electro-optic transition time of the modulators ${\it{\uptau }}_{{\it{eo}}}$, the optical propagation latency per gate ${\it{\uptau }}_{\it{o}}$, the optoelectronic transition time of the PDs ${\it{\uptau }}_{{\it{oe}}}$, the electrical latency in the MUXU per stage ${\it{\uptau }}_{\it{e}}$, and the delay for the other electrical parts ${\it{\uptau }}_{\it{g}}$. For an N = m × n bit circuit, the total latency can be expressed as

$${\it{\uptau }} = {\it{\uptau }}_{\it{c}} + {\it{m}} \times {\it{\uptau }}_{\it{o}} + {\mathrm{log}}_2{\it{n}} \times \tau _{\it{e}}$$

(3)

where ${\it{\uptau }}_{\it{c}} = {\it{\uptau }}_{\it{g}} + {\it{\uptau }}_{{\it{eo}}} + {\it{\uptau }}_{{\it{oe}}}$ is the constant part. Then one can easily optimize the latency based on the values of each parameter in a real platform. Figure 4a shows the entire delay of the EPALU with respect to the bit size with assumptions and details discussed (see Supplementary Note 4). The results indicate that even the 32-bit and 64-bit circuits are capable of operating over 20 GHz under the assumption discussed. It can certainly go faster with the improvement of performances of the active and passive components.

Energy efficiency

Energy has constrained the design of computing devices in terms of power delivery, battery life, power dissipation, and heat removal¹³. To process data with low power has become a significant challenge. Thanks to the achievement of integrated photonics, various energy-efficient components have been developed and even commercialized in many foundries. For instance, the electro-optic modulator, which is one of the key active components in many applications as well as in our EPALU, now is able to support data rates up to 100 Gbit/s while at the same time consume only femtojoule or even sub-femtojoule power^17,20,36. Figure 4c shows the curves of power consumption of various circuits as a function of frequency. The power consumption of the EPALU includes that of the optical parts as well as the electrical part (see Supplementary Notes 6 and 7). As a comparison, we also calculated the power consumption of conventional electrical full adder using two methods. First, we use commercial Synopsys Design Compiler to perform the simulation based on the Synopsys Generic 32 nm design library (SAED 32 EDK) which is the most advanced library in the academic area. Second, we use published experimental data based on the 90 nm CMOS technology node from Intel²⁷ and estimate the power consumption in 32 and 7 nm technology node using scaling equations³⁹. As expected, the power of electrical circuits increases exponentially with respect to frequency. The merits of EPALU start to emerge at a higher frequency and its dynamic power consumption could be 1–2 orders of magnitude smaller than the electrical counterpart when the clock rate exceeds 20 GHz. It should be noted that the total power consumption also includes the static power consumption such as the power of the laser and the thermal tuning. As a result, the total power consumption will be greater than the state-of-the-art 7 nm transistor-based circuit for all frequencies. Fortunately, this part of power can be further reduced or eliminated in the future with the development of photonic components. See detailed discussion of the system power consumption in Supplementary Note 7.

Another important power-related parameter is the power density, the ratio of the power to the area (see Supplementary Note 9), which prevents transistors from going faster owing to overwhelming heat. The comparison of power density is shown in Fig. 4d (also see Supplementary Note 8). It is not surprising to see that the EPALU outperforms the electrical transistors-based circuits here even with one of the most advanced technology nodes since optical components by nature are still larger than transistors.

Loss

Unlike the transistor-based circuits where signals are automatically normalized to the source voltage or ground, optical signals will also experience optical loss along the paths owing to splitting, insertion loss, and waveguide loss. For example, the carry input from Cin port will encounter 3 dB loss per bit when it propagates to the last Cout port, which is unacceptable for a large-scale circuit. Though an integrated compact amplifier is available in some platforms (e.g., InP platform), it consumes extra power to boost the signals and is not preferred by an energy-efficient computing system. Therefore, the optimization of loss of the EPALU is conducted here by changing the coupling efficiency of the directional coupler. Note that the structure also needs to be revised slightly (see Supplementary Note 5). Figure 4b represents the relationship between the entire loss and the coupling efficiency for OCPNs with 8, 16, 32, and 64-bit size. It indicates that it becomes possible for this large-scale computing circuit to operate freely without the assist of amplifiers. In addition, the choice of m and n is another dimension we can manipulate to fulfill the loss requirements.

Scaling and outlook

It is foreseeable to further scale the capacity of the proposed EPALU as well as other photonics-assisted computing circuits at the rate comparable to Moore’s law through at least four directions. First, the performance of the circuits is determined by the components especially the modulators. The progress of these passive and active modules in terms of the size, power consumption, insertion loss, and so forth could directly contribute to the entire system. For example, plasmon-assisted electro-optic modulator³⁶ with a dimension of ~4 µm² enables about 90% reduction of the optical propagation delay and 99% reduction of the area compared to the calculation above. Second, the massive multiplexing technologies of light are the most natural and suitable methods for parallel computing. Two wavelengths are adopted in our proposal to dramatically improve the entire performance. More multiplexing technologies can be explored in other applications. Third, special logic gates such as multi-operand logic gates have the potential to further shrink the circuit size and save much power^28,40. Fourth, though a single optical gate may not beat the transistor in terms of the size and the power consumption, dedicated architectures provide photonics-assisted computing circuits the ability to have a simpler design and win by the number of components. It is because the optimization of electrical circuits for higher performance will bring numerous redundant transistors.

A kind of photonics-assisted computing architecture is proposed with an experimental demonstration at 20 GHz. Computing at a higher speed while with lower power consumption has proved to be possible with the help of light. Advanced fabrication techniques could assist to further integrate electronics and photonics onto a single chip. In addition, equivalent Moore’s law in integrated optical computing is expected to scale the circuit with several directions provided. Emerging optical–electrical–optical devices have the potential to be integrated with the proposed architecture to realize more complex computing functions⁴¹. Further integration of optical computing units, optical inter/intra-chip optical interconnect⁴², and optical clock distribution⁴³ can be explored to realize an entire all-optical computing system.

Methods

Electrical simulation

We wrote behavioral Verilog code for adders parameterized by number of bits. Each adder had a single flip-flop stage at the input and a single flip-flop stage at the output. We used Synopsys Design Compiler to perform synthesis and technology mapping on the adders using the Synopsys Generic 32 nm design library (SAED 32 EDK). The tool performed gate sizing and netlist optimizations. We used high and regular threshold voltage cells to optimize for power. The optimization passes of Design Compiler included all three metrics of power, timing, and area. Subsequent optimization steps involved delay reduction through gate sizing and selection of cells with lower threshold voltage.

Chip implementation and testing

The photonic chip layout was developed and drawn in Cadence Virtuoso and verified using Mentor Graphics Calibre. The chip was a fabrication by AIM Photonics³¹. The 52 pads sitting at two sides are designed for power supply, bias signals, and thermal tuning, which were wire-bonded to a printed circuit board. Amplified spontaneous emission lasers from Thorlabs and optical spectrum analyzers from AssetRelay were used for optical characterization and wavelength alignment. Standard single mode fibers (SMF28) were used to couple light into and out of the photonic chip through grating couplers. The minimal coupling loss (~5 dB per facet) was achieved at 8° off-normal from the surface of the chip. The Q factors of the microdisk modulators were estimated to be ~7000 by applying a Lorentzian fitting to the transmission spectrum. Peaks of the microdisk modulator are lying at ~1540 and ~1565 nm so that the operating wavelengths are also set around them, respectively. Thermal tuning is performed to all the microdisk modulators to make sure all the wavelengths are well aligned to the laser wavelength. A tunable laser from ID Photonics was used after the static characterization and alignment. The output light was coupled out to an Agilent DCA 86100C with a 30 GHz optical module (Agilent 86109A). Erbium-doped fiber amplifiers (JDSU Oprel EDFAs) were also used to boost the signals followed by an optical tunable filter (Santec, OFT-920). The Agilent E8257D PSG signal generator was used to generate the clock and feed it to the Agilent E8404A VXI mainframe. Non-return-to-zero (NRZ) signals were generated by the two independent N4872A slots with internal delay controls. We used high-bandwidth radio-frequency probes (GGB, 100 µm pitch) to inject the high-speed signals to the chip through GSG pads with the size of 60 × 60 µm² and the pitch of 100 µm.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Dennard, R. H. et al. Design of ion-implanted MOSFET’s with very small physical dimensions. IEEE Solid-State Circuits Newsl. 87, 668–678 (1999).
Google Scholar
Waldrop, M. More than Moore. Nature 530, 145 (2016).
Article ADS Google Scholar
Esmaeilzadeh, H., Blem, E., St. Amant, R., Karthikeyan, A. & Doug, S. Dark silicon and the end of multicore scaling. In Proc. International Symposium on Computer Architecture, ISCA (2011).
Thompson, S. E. & Parthasarathy, S. Moore’s law: the future of Si microelectronics. Mater. Today 9, 20–25 (2006).
Article CAS Google Scholar
Theis, T. N. & Wong, H. P. The end of Moore’s law: a new beginning for information technology. Comput. Sci. Eng. 19, 41–50 (2017).
Article Google Scholar
Cavin, R. K., Lugli, P. & Zhirnov, V. V. Science and engineering beyond moore’s law. Proc. IEEE 100, 1720–1749 (2012).
Article Google Scholar
Kuhn, K. J. Moore’s law past 32 nm: future challenges in device scaling. In Proc. 13th International Workshop on Computational Electronics (IWCE) 1–6 (2009).
Lecun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article ADS CAS Google Scholar
Barends, R. et al. Digitized adiabatic quantum computing with a superconducting circuit. Nature 534, 222–226 (2016).
Article ADS CAS Google Scholar
Soref, R. Silicon photonics: a review of recent literature. Silicon 2, 1–6 (2010).
Article CAS Google Scholar
Soref, R. Tutorial: integrated-photonic switching structures. APL Photonics 3, 021101 (2018).
Article ADS Google Scholar
Miller, D. A. B. Device requirement for optical interconnects to silicon chips. Proc. IEEE 97, 1166–1185 (2009).
Article CAS Google Scholar
Miller, D. A. B. Attojoule optoelectronics for low-energy information processing and communications. J. Lightwave Technol. 35, 346–396 (2017).
Article ADS CAS Google Scholar
Miller, D. Optical interconnects to electronic chips. Appl. Opt. 49, 70 (2010).
Article Google Scholar
Atabaki, A. H. et al. Integrating photonics with silicon nanoelectronics for the next generation of systems on a chip. Nature 556, 349–354 (2018).
Article ADS CAS Google Scholar
Sun, C. et al. Single-chip microprocessor that communicates directly using light. Nature 528, 534–538 (2015).
Article ADS CAS Google Scholar
Wang, C. et al. Integrated lithium niobate electro-optic modulators operating at CMOS-compatible voltages. Nature https://doi.org/10.1038/s41586-018-0551-y (2018).
Article PubMed PubMed Central Google Scholar
Xu, Q., Schmidt, B., Pradhan, S. & Lipson, M. Micrometre-scale silicon electro-optic modulator. Nature 435, 325–327 (2005).
Article ADS CAS Google Scholar
Michel, J., Liu, J. & Kimerling, L. C. High-performance Ge-on-Si photodetectors. Nat. Photonics 4, 527–534 (2010).
Article ADS CAS Google Scholar
Timurdogan, E. et al. An ultralow power athermal silicon modulator. Nat. Commun. 5, 4008 (2014).
Article ADS CAS Google Scholar
Vandoorne, K. et al. Experimental demonstration of reservoir computing on a silicon photonics chip. Nat. Commun. 5, 1–6 (2014).
Article Google Scholar
Shen, Y., Harris, N. C., Skirlo, S., Englund, D. & Soljačić, M. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 189–190 (2017).
Article Google Scholar
Liu, W. et al. A fully reconfigurable photonic integrated signal processor. Nat. Photonics 10, 190–195 (2016).
Article ADS CAS Google Scholar
Hardy, J. & Shamir, J. Optics inspired logic architecture. Opt. Express 15, 150–165 (2007).
Article ADS Google Scholar
Meindl, J. D. Beyond Moore’s law: the interconnect era. Comput. Sci. Eng. 5, 20–24 (2003).
Article Google Scholar
Miller, D. A. B. Device requirements for optical interconnects to silicon chips. Proc. IEEE 97, 1166–1185 (2009).
Article CAS Google Scholar
Mathew, S. K. et al. A 4-GHz 300-mW 64-bit integer execution ALU with dual supply voltages in 90-nm CMOS. IEEE J. Solid-State Circuits 40, 44–50 (2005).
Article ADS Google Scholar
Gostimirovic, D. & Ye, W. N. Ultracompact CMOS-compatible optical logic using carrier depletion in microdisk resonators. Sci. Rep. 7, 12603 (2017).
Article ADS Google Scholar
Winzer, P. J. Modulation and multiplexing in optical communications. CTuL3. https://doi.org/10.1364/cleo.2009.ctul3 (2013).
Taub, A. H. & Sklanskyt, J. Conditional-Sum Addition Logic 226–231 (1957).
Timurdogan, E. et al. APSUNY Process Design Kit (PDKv3.0): O, C and L band silicon photonics component libraries on 300 mm wafers. In Proc. Optical Fiber Communication Conference Exhibition Tu2A.1 (2019).
Feng, C., Pan, D. Z. & Chen, R. T. Power and accuracy co-optimization of an optical full adder via optimization algorithms. In Proc. IEEE Photonics Conference 1–2 (2019).
Ying, Z. et al. Comparison of microrings and microdisks for high-speed optical modulation in silicon photonics. Appl. Phys. Lett. 112, 111108 (2018).
Article ADS Google Scholar
Srinivasan, S. A. et al. 56 Gb/s germanium waveguide electro-absorption modulator. J. Lightwave Technol. 34, 419–424 (2016).
Article ADS CAS Google Scholar
Pantouvaki, M. et al. Active components for 50 Gb/s NRZ-OOK optical interconnects in a silicon photonics platform. J. Lightwave Technol. 35, 631–638 (2017).
Article ADS CAS Google Scholar
Haffner, C. et al. Low-loss plasmon-assisted electro-optic modulator. Nature 556, 483–486 (2018).
Article ADS CAS Google Scholar
Heni, W. et al. Plasmonic IQ modulators with attojoule per bit electrical energy consumption. Nat. Commun. 10, 1694 (2019).
Article ADS Google Scholar
Phare, C. T., Daniel Lee, Y.-H., Cardenas, J. & Lipson, M. Graphene electro-optic modulator with 30 GHz bandwidth. Nat. Photonics 9, 511–514 (2015).
Article ADS CAS Google Scholar
Stillmaker, A. & Baas, B. Scaling equations for the accurate prediction of CMOS device performance from 180 nm to 7 nm. Integr. VLSI J. 58, 74–81 (2017).
Article Google Scholar
Ying, Z. et al. Integrated multi-operand electro-optic logic gates for optical computing. Appl. Phys. Lett. 115, 171104 (2019).
Article ADS Google Scholar
Nozaki, K. et al. Femtofarad optoelectronic integration demonstrating energy-saving signal conversion and nonlinear functions. Nat. Photonics 13, 454–459 (2019).
Article ADS CAS Google Scholar
Haurylau, M. et al. On-chip optical interconnect roadmap: challenges and critical directions. IEEE J. Sel. Top. Quantum Electron. 12, 1699–1705 (2006).
Article ADS CAS Google Scholar
Bihari, B. et al. Optical clock distribution in supercomputers using polyimide-based waveguides. In Proc. SPIE 3632, Optoelectronic Interconnects VI 123–133 (2003).

Download references

Acknowledgements

The authors would like to thank Xiaochuan Xu, Zeyu Pan, and Chi-Jui Chung for assisting in the RF testing. The authors acknowledge support from the Multidisciplinary University Research Initiative (MURI) program (FA 9550-17-1-0071) through the Air Force Office of Scientific Research (AFOSR), monitored by Dr. Gernot S. Pomrenke.

Author information

Authors and Affiliations

Microelectronics Research Center, The University of Texas at Austin, Austin, TX, 78758, USA
Zhoufeng Ying, Chenghao Feng, Yue Cheng & Ray T. Chen
Computer Engineering Research Center, The University of Texas at Austin, Austin, TX, 78705, USA
Zheng Zhao, Shounak Dhar, Jiaqi Gu & David Z. Pan
Omega Optics, Inc., 8500 Shoal Creek Boulevard, Building 4, Suite 200, Austin, TX, 78757, USA
Hamed Dalir & Ray T. Chen
Department of Engineering, University of Massachusetts Boston, Boston, MA, 02125, USA
Richard Soref

Authors

Zhoufeng Ying
View author publications
You can also search for this author in PubMed Google Scholar
Chenghao Feng
View author publications
You can also search for this author in PubMed Google Scholar
Zheng Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Shounak Dhar
View author publications
You can also search for this author in PubMed Google Scholar
Hamed Dalir
View author publications
You can also search for this author in PubMed Google Scholar
Jiaqi Gu
View author publications
You can also search for this author in PubMed Google Scholar
Yue Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Richard Soref
View author publications
You can also search for this author in PubMed Google Scholar
David Z. Pan
View author publications
You can also search for this author in PubMed Google Scholar
Ray T. Chen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.Y. and Z.Z. designed the floorplan and drew the layout. Z.Y. and C.F. performed the measurements and analyzed the data. S.D. performed the VLSI simulation. H.D. assisted in high-speed testing. Z.Y. conceived the architecture and wrote the manuscript with contributions from R.T.C., D.Z.P., R.S., J.G., and Y.C. The project was led by R.T.C. and D.Z.P.

Corresponding author

Correspondence to Ray T. Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Zhixin Liu and the other anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ying, Z., Feng, C., Zhao, Z. et al. Electronic-photonic arithmetic logic unit for high-speed computing. Nat Commun 11, 2154 (2020). https://doi.org/10.1038/s41467-020-16057-3

Download citation

Received: 01 August 2019
Accepted: 06 April 2020
Published: 01 May 2020
DOI: https://doi.org/10.1038/s41467-020-16057-3

This article is cited by

Reconfigurable optical logic in silicon platform
- M. A. Ruhul Fatin
- Dusan Gostimirovic
- Winnie N. Ye
Scientific Reports (2024)
Correlated optical convolutional neural network with “quantum speedup”
- Yifan Sun
- Qian Li
- Xiangdong Zhang
Light: Science & Applications (2024)
Reconfigurable nonlinear photonic activation function for photonic neural network based on non-volatile opto-resistive RAM switch
- Zefeng Xu
- Baoshan Tang
- Aaron Voon-Yew Thean
Light: Science & Applications (2022)
Integrated photonic metasystem for image classifications at telecommunication wavelength
- Zi Wang
- Lorry Chang
- Tingyi Gu
Nature Communications (2022)
On-chip bacterial foraging training in silicon photonic circuits for projection-enabled nonlinear classification
- Guangwei Cong
- Noritsugu Yamamoto
- Koji Yamada
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.