Realization of optical logic gates using on-chip diffractive optical neural networks

Zarei, Sanaz; Khavasi, Amin

doi:10.1038/s41598-022-19973-0

Download PDF

Article
Open access
Published: 21 September 2022

Realization of optical logic gates using on-chip diffractive optical neural networks

Sanaz Zarei¹ &
Amin Khavasi¹

Scientific Reports volume 12, Article number: 15747 (2022) Cite this article

3568 Accesses
10 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Optical computing is highly desired as a potential strategy for circumventing the performance limitations of semiconductor-based electronic devices and circuits. Optical logic gates are considered as fundamental building blocks for optical computation and they enable logic functions to be performed extremely quickly without the generation of heat and crosstalk. Here, we discuss the design of a multi-functional optical logic gate based on an on-chip diffractive optical neural network that can perform AND, NOT and OR logic operations at the wavelength of 1.55 µm. The wavelength-independent operation of the multi-functional logic gate at seven wavelengths (over a bandwidth of 60 nm) is also studied which paves the way for wavelength division multiplexed parallel computation. This simple, highly-integrable, low-loss, energy-efficient and broadband optical logic gate provides a path for the development of high-speed on-chip nanophotonic processors for future optical computing applications.

Giant energy storage and power density negative capacitance superlattices

Article 09 April 2024

High-speed and large-scale intrinsically stretchable integrated circuits

Article 13 March 2024

Logical quantum processor based on reconfigurable atom arrays

Article Open access 06 December 2023

Introduction

Modern semiconductor-based electronics is rapidly approaching fundamental limits caused by interconnect delays and large heat generation¹. Because photons have intrinsically higher information-carrying capacity and produce low heat loads, photonic devices and circuits can potentially surmount these obstacles, but the goal of all-optical computing has not been achieved yet¹. Optical logic gates are crucial building blocks for all-optical computing and they enable many applications like ultrahigh-speed information processing and all-optical networks. There are two major approaches toward all optical logic gates; one is based on the nonlinear optical effects^{2,3,4,5,6,7,8,9,10}, especially the third-order nonlinear susceptibility, while another approach is based on the linear optical effects^{11,12,13,14,15,16,17,18,19} such as multi-beam interference^{12,13,14,15,16,17,18}. However, the inherent instability of the interference-type optical logic circuits (including linear and nonlinear interference) hindered their application. Because they are heavily dependent on the precise control of the basic properties of the two input light signals, the control light and/or the pump light and precise controlling of the basic properties including phase difference, polarization, intensity and size of the input light beams (in the case that the two nanowires are close to each other, such as for the plasmonic logic gate) is difficult to implement²⁰. Also, many of the reported works that are based on nonlinear optics, are severely limited by small nonlinear susceptibility of conventional materials which bounds their operation speed, consumption energy and device size. On the other hand, they have difficulty to be integrated with silicon-based optical devices. Furthermore, most of the reported works suffer from some certain fundamental limitations including big size, very few logic functions and even single function, having a big loss which makes them hard for integration, and more importantly narrow operating bandwidth. Therefore, it is desirable to achieve full logic functionality in a simple and compact photonic system which is stable, robust and suitable for monolithic integration, while at the same time is capable for wavelength multiplexed parallel computations (which helps to realize the full potential of optical computing).

Optical Neural Networks (ONNs) uses photons instead of electrons for computation, which enables surmounting the inherent limitations of electronics and improves the energy efficiency, processing speed and computational throughput. In ONNs, the neuron functionality and interconnectivity can be implemented with optical and photonic devices and the nature of light propagation. Here an on-chip diffractive optical neural network (DONN) is utilized to perform optical logic operations. In this configuration, the encoded light at the input layer is decoded through the hidden layers (1D-metasurfaces). The 1D-metasurfaces, named as metalines, are trained to scatter the encoded light into one of two small specified areas at the output layer, one of which represents logic state "1" and the other stands for "0". It is possible to train a single diffractive optical neural network to realize all seven basic logic operations. As a proof of principle, three logic operations (NOT, AND, and OR) are demonstrated in a single DONN at the wavelength of 1.55 µm. Furthermore, wavelength-independent operation by seven wavelengths (over 60 nm bandwidth in the silicon-on-insulator platform) is also demonstrated, which leads to wavelength division multiplexed parallel computation.

Design and modeling

This section starts with the design principle of the multifunctional optical logic gate (“Design principle” section). Thereafter in “Architecture design” section, the optical architecture utilized for implementing the on-chip diffractive optical neural network is introduced. Subsequently, the theoretical model used to train the diffractive optical neural network on the computer is described in “Modeling” section. “Design considerations” section discusses the reasons of existing errors in the theoretical modeling of the system and possible solutions to mitigate the errors.

Design principle

Figure 1 summarizes the principle of our optical logic operations. An optical deep learning framework in which the neural network is physically composed of multiple layers of diffractive 1D-metasurfaces (metalines) is utilized to optically perform basic logic operations through the interference of transmitted light. Analogous to artificial neural network, a diffractive neural network composed of one input layer, at least one hidden layer and one output layer. The input signals will be loaded onto the corresponding input waveguides and propagated through the tapers into the slab waveguide. The input layer is regarded as the plane at which the tapers meet the slab waveguide. The hidden layers (metalines) are designed to decode the encoded input light and form the image of the calculated output at the output layer²⁰. Metalines are composed of arrays of subwavelength meta-atoms, which can be independently parameterized by an arbitrary number of variables ($w^{m}$). Each meta-atom behaves like an independent neuron in the neural network and interconnects to other meta-atoms of the adjacent layers through the diffraction of light²⁰. After light exits the final metaline (the last hidden layer), it propagates a definite distance until it reaches the output layer of the network. The distance between two successive metalines and the distance between the last metaline and the output layer are also the design parameters. The output layer of the network consists of two detector regions (corresponding to the computing results "0" and "1") arranged in a linear configuration. For binary optical logic operations, the output gain only two values, "0" and "1", which can be considered as a classification task in machine learning²⁰.

For our architecture, the physical structural parameters that accomplish the diffraction and prediction are designed in advanced. Initially, the parameters of the neural network are trained on the computer and then these parameters can be transferred to the physical structure.

As a proof of principle, we train the network such that it can perform NOT, AND and OR operations, but the network can be trained such that it is able to perform all seven logic operations. For such a network, seven input waveguides are required. Two waveguides for input A (one waveguide corresponding to "0" and one waveguide corresponding to "1"), two waveguides for input B, and three waveguides for control signals indicating NOT, AND, and OR (see Table 1). If it was desired that the network to be trained and perform as all seven logic operators, then seven waveguides should have been devised for control signals (one for each logic operation). For simplicity, it is assumed that when an input is applied to one of the waveguides, its value at the exit of the taper corresponding to that waveguide (or equivalently at the input layer) is "1".

Table 1 The optical I/O table for optical logic gate operations performed by diffractive optical neural network (DONN).

Full size table

Architecture design

The optical architecture used in this work is based on an SOI (silicon-on-insulator) platform which consists of several metalines; each metaline consists of a series of meta-atoms²¹. Each meta-atom in the metalines (hidden layers) is represented by a subwavelength slot (Fig. 2a) and the geometrical parameters of the slot are design parameters (some of them can be set as learnable parameters in training process). Therefore, each slot can be regarded as a weight element that connects to its adjacent layers through diffraction and interference of the in-plane waves. By adjusting the width and length (and height) of each slot, the fully control over transmitted amplitude and phase delay of a meta-atom can be achieved.

A silicon-on-insulator platform with a 250 nm silicon top layer and a 2 µm buried oxide layer is chosen. The lattice constant of the metalines is fixed to 500 nm (approximately one third of the wavelength) and the width and height of slots are constantly chosen as 140 nm and 250 nm, respectively. By changing the slot length between 100 nm and 2.3 µm, the transmission phase can be continuously tuned from 0-to-2π, while the transmission amplitude is higher than 0.96 (Fig. 2b). The design wavelength is 1.55 µm. The results presented in Fig. 2b are achieved using the commercial software package Lumerical FDTD, while FDTD ports with a fixed distance of 10 µm along x-direction are utilized to calculate the S-parameters of a meta-atom. The injection axis of the ports is x-axis and the fundamental TE mode is selected for excitation. The x–y view of the electric field intensity profile across the middle plane of the silicon slab (z = 0) and x–z view of the electric field intensity profile across the middle plane of the slot (y = 0) for TE wave tunneling through a 2 µm-length slot in a meta-atom are illustrated in Fig. 2c,d, respectively.

During the training process, the lengths of slots are considered as learnable parameters and the training can proceed based on both transmission phase and amplitude of meta-atoms. However, for simplifying the design process in this article, the simultaneous adjustment of amplitude and phase of meta-atoms is avoided and only phase modulation is taken into account with transmission efficiency being set to 1, which doesn’t appreciably affect the accuracy of the resulting design due to near-1 transmission amplitude of meta-atoms.

Modeling

Forward propagation

The multiscale nature of metasurfaces, which the whole metasurface has macroscale dimensions and its subwavelength scattering elements have nanoscale length scales, make using full-wave simulation tools to be computationally expensive or even prohibitive due to memory requirements. Locally periodic approximation that assumes the metasurface is locally periodic (i.e., is periodic over any small region) can be a high-speed alternative that calculate the field across the plane right after the metasurface, by using small full-wave simulations to compute the field transmission phase and amplitude for each scattering element (meta-atom). Thereafter, by using near-to-far field transformation^22,23, or scalar-wave approximation^24,25, or spatial domain electromagnetic propagation²⁶, the fields can be propagated between metasurface layers. In this article, near-to-far field transformation^22,23 is utilized to propagate the fields.

As our optical neural network is physically composed of multiple layers of diffractive 1D-metasurfaces (metalines) in the SOI platform, it can be assumed as a two-dimensional problem.

The wave-number that light travels in the silicon slab waveguide is $k = n_{eff} .\frac{2\pi }{\lambda }$, where $n_{eff}$ is the effective refractive index of the silicon slab waveguide associated with the fundamental TE mode. It is assumed that each metaline is composed of N meta-atoms and the input electric field ($E^{{in,\lambda_{j} }}$) to the meta-system is pixelated to N pixels. Thus, $E^{{in,\lambda_{j} }}$ is an N × 1-dimensional vector. Each meta-atom on a single metaline layer introduces a phase and amplitude modulation on the input electric field at that meta-atom. By using full-wave simulations to obtain the phase and amplitude modulation of each meta-atom (as is done in “Architecture design” section), the electric field along the line right after each metaline can be achieved as:

$$E_{m}^{{out,\lambda_{j} }} = T_{m}^{{\lambda_{j} }} \exp (i\phi_{m}^{{\lambda_{j} }} )E_{m}^{{in,\lambda_{j} }}$$

(1)

where $E_{m}^{{in,\lambda_{j} }}$ is the input electric field to the m'th metaline, and $E_{m}^{{out,\lambda_{j} }}$ is the electric field along the line right after the m'th metaline, and $T_{m}^{{\lambda_{j} }}$ and $\phi_{m}^{{\lambda_{j} }}$ are diagonal matrices containing the transmission amplitudes and phases of meta-atoms on layer m at the wavelength $\lambda_{j}$. Now that the near-field output of each layer ($E_{m}^{{out,\lambda_{j} }}$) is known, the far-field output can be calculated through a near-to-far field transformation:

$$E_{m + 1}^{{in,\lambda_{j} }} (\vec{y}) = - \int_{line} {2G(\vec{y},\vec{y^{\prime}})E_{m}^{{out,\lambda_{j} }} } (\vec{y^{\prime}})d\vec{y^{\prime}}$$

(2)

where $E_{m + 1}^{{in,\lambda_{j} }}$ is the far-field output of m'th metaline that is the input electric field of (m + 1)'th metaline, $G(\vec{y},\vec{y^{\prime}}) = - \frac{ik}{4}H^{(1)} (kr)\hat{n} \cdot \frac{{\vec{r}}}{r}$ is the Maxwell Green's function (where $H^{(1)}$ is the Hankel function of the first kind, $k = n_{eff} .\frac{2\pi }{\lambda }$, $\vec{r} = \vec{y} - \vec{y^{\prime}}$, $r = \left| {\vec{r}} \right|$ and $\hat{n} = \hat{x}$ is the surface unit-normal vector). This integral can be changed to summation and then converted to matrix multiplication as:

$$E_{m + 1}^{{in,\lambda_{j} }} = GE_{m}^{{out,\lambda_{j} }}$$

(3)

where $G_{ij} = G(\vec{y}_{i} ,\vec{y^{\prime}}_{j} )$, $E_{m}^{{out,\lambda_{j} }} (j) = E_{m}^{{out,\lambda_{j} }} (\vec{y^{\prime}}_{j} )$ and $E_{m + 1}^{{in,\lambda_{j} }} (i) = E_{m + 1}^{{in,\lambda_{j} }} (\vec{y}_{i} )$. Multiple Fresnel-like reflections of the intermediate metaline layers are neglected due to the near-1 transmission amplitude of our meta-atoms. At the output layer of the network, the output electric field ($E_{{}}^{{out,\lambda_{j} }}$) can be calculated following a series of matrix–vector multiplications as described above. $E_{{}}^{{out,\lambda_{j} }}$ is the function of the learnable parameters $\left\{ {w^{1} ,w^{2} ,...,w^{M} } \right\}$ of the M metalines. Once $E_{{}}^{{out,\lambda_{j} }}$ is computed, the output intensity can be calculated as $I^{{out,\lambda_{j} }} = E^{{out,\lambda_{j} *}} E^{{out,\lambda_{j} }}$.

Backpropagation

For backpropagation, a cost function is specified in terms of the squared errors between a desired set of output intensity distributions and realized set of intensity distributions at a given iteration:

$$C = \sum\limits_{j = 1}^{J} {\sum\limits_{k = 1}^{K} {\sum\limits_{s = 1}^{N} {\left( {I_{s}^{{\lambda_{j} ,f_{k} }} - I_{s}^{{des,\lambda_{j} ,f_{k} }} } \right)^{2} } } }$$

(4)

where the squared error is summed for J input wavelengths, K input field distributions at a given wavelength, and N sample points along the output line. The cost function is iteratively minimized by adjusting the learnable parameters $\left\{ {w^{1} ,w^{2} ,...,w^{M} } \right\}$. For back-propagating the errors in the network to update these learnable parameters, the adjoint method, in which the gradient of the cost function with respect to all learnable parameters can be computed using only two full-field simulations²⁴, is utilized, the details of which is given in supplementary note 1.

Design considerations

One important issue in an optical neural network design is its final experimental inference capability. Most of the diffractive optical neural networks proposed up to now, show high percentage of consistency between numerical predictions and experimental verifications^{26,27,28,29,30}. These DONNs are mainly designed for classification^26,27,28,30. In the tasks of classification and regression, some amount of error is tolerable. However, logic gates should be highly immune from errors. So, for our design, the matching between numerical testing results and experimental testing results should be 100% for accurate and precise performance of multi-functional logic gate.

In²⁶, two main problems which result in the difference between numerical predictions and full-wave electromagnetic simulations, are recognized. One problem is that the effective refractive index of an identical slot at different positions along the metaline is different, because the light inputs the slot at different locations with different angles. The other problem is the existence of mutual interference between adjacent slots of different lengths when the light inputs them at the same angle. The solution to these problems are also suggested in²⁶. For the first problem, if the incident light enters the slots at a smaller angle, then a more stable effective refractive index can be achieved for all the slots along the metaline. For the second problem, in order to reduce the influence of mutual interference between adjacent slots, a slot group composed of multiple identical slots is used to approximate a neuron value.

For our architecture with lattice constant of 500 nm, width and height of the slots as 140 nm and 250 nm and an arbitrary-chosen value of 2 µm for slot length (within the range of 100 nm–2.3 µm), we calculate $n_{eff - slot}$ (effective refractive index of the slot) for a slot, when the incident light enters the slot from diverse angles (see Fig. 3). The effective refractive index of the slot can be calculated from²⁶:

$$n_{eff - slot} = \frac{{\Delta \phi_{i} }}{{L_{slot} \cdot k_{0} }} + n_{eff - slab}$$

(5)

where $L_{slot}$ is the length of the slot, $n_{eff - slab}$ is the effective refractive index of the slab waveguide, $k_{0}$ is the wavenumber of light in the slots and $\Delta \phi_{i}$ is the phase delay generated by the slot. As is seen in Fig. 3, for all the incident angles of light to the slot which are less than 20˚, the effective refractive index of the slot remains nearly constant. Therefore, the distance between the metalines should be high enough, such that the incident light angle to the meta-atoms is less than 20˚.

Also for our architecture, the $n_{eff - slot}$ calculated by the phase delay generated by a slot group with a different number of slots is shown in Fig. 4. It can be seen that when the number of slots increases in a slot-group, the effective refractive index of the slot calculated by the phase delay generated by the slot groups tends to be more stable. Also, it can be inferred from Fig. 4 that when the slot group includes two identical slots, the calculated effective refractive index is very close to the final stable value. It should be mentioned that for generating Fig. 4, a slot array with 20 randomly-generated slot lengths is chosen. The effective refractive index is calculated for a slot with 1.964 µm slot length (marked in red in Fig. 4).

Results

In this part, on-chip optical Boolean logic operations (AND, OR, and NOT) are first numerically demonstrated at the wavelength of 1.55 µm and thereafter the achieved results are verified using 2D FDTD and 2.5D FDTD variational solver of Lumerical Mode Solution. The meta-system consists of five metalines (hidden layers), with each metaline containing 100 meta-atoms (neurons). While the length of each metaline is 100 µm, the distance between two successive layers is 300 µm, and after light exits the fifth metaline, it propagates 300 µm until it reaches the output line of the network with two linearly-arranged detection regions which are representative of logic states "0" and "1". The length of each detection region is 4 µm and the center-to-center distance between the two regions is 12 µm. Also, it is assumed that the distance between the input layer and the first metaline is zero, while the thickness of each metaline is 10 µm.

As was stated in “Design considerations” section, for a design to perform accurately as a multi-functional logic gate, it is necessary that the full-wave simulation results provide 100% fit to the numerical testing results. Therefore, according to Fig. 3, the distance between the metalines should be such that the incident light angle to the meta-atoms along the metaline to be less than 20°, which means that

$$Distance\;\;between\;\;layers\;\; \ge \left( {\frac{100\;\upmu \text{m}}{{\tan (20^\circ )}} = 274\;\upmu \text{m}} \right)$$

(6)

Therefore, for this design, it is assumed that the distance between metalines as well as the distance between the fifth metaline and the output line is 300 µm. Also according to Fig. 4, each neuron value in this design is approximated as a slot group composed of two identical slots, in order to reduce the influence of mutual interference between adjacent slots.

Training is performed using 10 input combinations (4 input field distributions for AND, 4 input field distributions for OR, and 2 input field distributions for NOT) and for each input combination, the desired intensity distribution $I^{{des,\lambda_{j} ,f_{k} }}$ is defined as a rectangle centered over appropriate output detection region with $\sum\limits_{s\;over\quad \det ection\quad region}^{{}} {I_{s}^{{des,\lambda j,f_{k} }} } = 1$ at the output layer of DONN. The optimization problem involves 500 design variables (100 variables per metaline). The lengths of slots (variables) are initially set to 1.1 µm in the optimization and due to fabrication constraints, the minimum slot length is specified as 100 nm and the maximum slot length is set as 2.3 µm in order that the transmission phase changes within 0-to-2π range (with minimum feature size of 100 nm, the fabrication of metalines is feasible utilizing current deep UV photolithography technique used in silicon photonics foundry). For a specific input, when the output signal is accurately distributed such that the total intensity upon the expected detector corresponding to that input has significant value comparing to the other detector, the classification can be considered successful. Using the mathematical framework described in “Modeling” section, the lengths of slots (and correspondingly the transmission phase of meta-atoms) are adjusted in search for minimum of the cost function which is defined as the squared errors between the desired set of output intensity distributions and realized set of intensity distributions at a training iteration. Figure 5a describes the numerical performance of the designed all optical logic gate. These results are verified with Lumerical 2D FDTD (Fig. 5b) and 2.5D FDTD variational solver of Lumerical Mode Solution (Fig. 5c). It should be mentioned that both numerical simulations and 2D FDTD simulations are two-dimensional and the effective refractive index of the silicon slab waveguide is used as the refractive index of background material in the simulations. However, 2.5D FDTD simulations consider the 3D nature of the diffractive optical neural network (DONN) structure (Lumerical 2.5D varFDTD method offers comparable accuracy and versatility to that of 3D FDTD, while only requiring the simulation time and memory of a 2D FDTD simulation³¹). As is seen in Fig. 5, 2D FDTD results and 2.5D FDTD results are very near to each other, although not completely identical. Therefore, it can be stated that 2D modeling of the structure considering effective refractive index of the silicon slab waveguide makes few discrepancies in the full-wave electromagnetic results. However, there is a significant difference between the predicted results by numerical simulations and full-wave electromagnetic simulations. The discrepancy between the numerical results (Fig. 5a) and Full-wave electromagnetic simulations (Fig. 5b,c) is dominantly due to the local periodic approximation that is used in the numerical modeling. Locally periodic approximation is alternatively utilized in many numerical modeling of metasurface-optics^{22,23,24,25,26,32,33}, because of large memory requirements of full-wave simulators. Also, the assumption of unit-transmission-amplitude for all meta-atom geometries along metalines can cause some small discrepancies between the two results (the reason that we mention “small discrepancies” is that the transmission amplitude of all meta-atoms is very near to 1). Furthermore, multiple Fresnel-like reflections of the intermediate layers of the optical system is neglected in our numerical simulations, however, due to near-to-1 transmission amplitude of meta-atoms, the reflection from intermediate metalines is not significant.

Because of computational restrictions of full-wave simulators such as Lumerical, here we choose a very compact structure with as few neurons as possible to make the structure simulable with such a commercial package for the purpose of verification. In this case, the contrast ratios between the measured intensities of two designated regions drops. Higher contrast ratios can be achieved utilizing higher number of neurons in each metaline.

It is noteworthy to mention that it is possible to train the diffractive optical neural network to perform any other logic operations like NAND, NOR, XOR and XNOR or even all the seven basic logic operations at the same time. Depending on the number and complexity of the tasks that the neural network is trained for, the number of neurons, the number of diffractive layers, and the distance between layers may subject to change. Furthermore, it is possible to design the multifunctional logic gate such that it can perform other functions as well which ease the integration of the gates with other devices.

Also, we have demonstrated wavelength division multiplexed parallel computation at seven different wavelengths (1520, 1530, 1540, 1550, 1560, 1570, and 1580 nm). Figure 6 shows the λ-dependent transmission phase response of meta-atoms versus slot length when the slot width is fixed at 140 nm. Lumerical FDTD is exploited to calculate the λ-dependent transmission phase of meta-atoms. The distance between FDTD ports is again set to be 10 µm along the x-direction. Again, training is performed using 10 input combinations at seven different wavelengths in the 1520–1580 nm range. In this case, the cost function should be computed for 7 input wavelengths and 10 input field distributions at a given wavelength and 100 sample points along the output line according to Eq. (4).

Principally in training, wavelength-dependent phase response of the meta-atoms (Fig. 6) and therefore wavelength-dependent phase response of the metalines and the wavelength-dependent refractive index of silicon and thus the wavelength-dependent effective refractive index of the guided light confined in the silicon slab should be taken into account. $I^{{des,\lambda_{j} ,f_{k} }}$ is defined as before for each wavelength. A separate forward and backward propagation is performed for each of the seven wavelengths. The gradients computed for each wavelength are summed to obtain the overall gradient of the cost function with respect to all of the design variables. All other design parameters are like the ones in the training of the single-wavelength DONN of Fig. 5. The numerical performance of the wavelength-independent DONN at 1520–1580 nm range as an all-optical multi-functional logic gate is depicted in Fig. 7. This provides the capability to perform simultaneously different operations for each wavelength channel by varying the input conditions which means that totally independent parallel logic operations is possible up to the number of input channels with a single gate¹². Due to huge computational time required to verify all these results, they haven't been verified with 2.5D FDTD variational solver of Lumerical Mode Solution.

Discussion

Design verification

Because of computational restrictions encountered in a single simulation of our metasystem using full-wave simulation tools as was described in “Modeling” section, it was tried to choose an optical gate design that is as compact as possible. However, the challenge was designing an optical logic gate with 100% consistency between its numerical performance and the performance it shows when simulated with a commercial full-wave electromagnetic software. On the path we took to achieve full compliance between the two simulation results, we numerically trained a number of diffractive networks and verified their performance with Lumerical 2.5D FDTD. Table 2 reports the results of these investigations.

Table 2 The characteristics of a number of diffractive neural networks that operate as a multi-functional logic gate and the matching between their numerical testing results and full-wave simulation results.

Full size table

It can be inferred from the results reported in Table 2 that approximating a neuron value by a slot group composed of two identical slots doesn’t necessarily lead to better matching between the numerical results and full-wave simulation results. The main reason is that the number of meta-atoms in the metasystem composed of slot groups is twice less than the metasystem composed of individual slots. Also, according to Fig. 4, in our chosen architecture, using a slot group of two identical slots doesn’t make a significant difference in the effective refractive index of the slot comparing to when using an individual slot. So, this was not unexpected.

This issue is evident when comparing designs 1 and 2, designs 3 and 4, designs 6 and 7, as well as designs 13 and 14, which only differ in the number of slots in a meta-atom (and therefore the number of meta-atoms in each metaline). In addition, despite the fact that both designs 8 and 9 show 100% consistency between numerical testing results and 2.5D FDTD results, design 8 shows better consistency especially in terms of contrast ratios between logical values “0” and “1” comparing to design 9. Also, by comparing designs 2 and 4, designs 6 and 8, and designs 7 and 9 in Table 2, it can be deduced that increasing the distance between the layers from 200 to 300 µm results in better fitting of the numerical and full-wave simulation results. For a more detailed discussion on the design verification, please refer to supplementary note 2 and supplementary Table S1.

Design performance

Falling under the category of diffractive neural networks, the presented design has most of the advantages that multi-functional optical logic gate design proposed in²⁰ based on diffractive neural network actually has, that are mainly the non-requirement to precise control of the input light beams which reduces the instability of optical logic circuits and the capability of complete logic functionality in a single optical network (all seven logic operations) which makes the logic gates easy for device fabrication and system integration (see Table 3 for more details).

Table 3 Comparison between our design and the multifunctional optical logic gate based on DONN presented in²⁰.

Full size table

Other than the advantages offered by the multi-layered metasurface-based architecture, the presented architecture benefits from its CMOS foundary-compatible silicon photonics platform, fabrication compatible with very matured silicon technology, simple configuration and compactness, and operation wavelength of 1550 nm. These make our design suitable for monolithic integration with other silicon-based photonic and electronic integrated circuits.

Also, the low-loss nature of this configuration results in low power consumption (for more details on the loss of system please refer to supplementary note 5) and its low-latency leads to high computational speed (see Table 3). Furthermore, the broadband Boolean logic operability of our design (over 60 nm in free-space wavelength) have been demonstrated and its capability for wavelength multiplexed parallel computation, which realize the full potential of optical computation, is studied. Although in our work, the diffractive network was trained for 60 nm bandwidth operation, higher-bandwidth logic gates are easy to be trained.

The main weakness of the presented design is its poor binary contrast. But it should be considered that for the purpose of full-wave electromagnetic verifications, it was tried to design a structure as compact as possible. Our design is composed of 500 neurons which is one fifth of the number of neurons in the proposal of²⁰ that has 2520 neurons for multifunctional logic gate that functions as AND, NOT and OR. Therefore, it is expected that the binary contrast highly improves by using larger number of neurons.

Finally, the comparison of our logic gate with some recently reported researches in terms of architecture, dimensions, operation wavelength, operation bandwidth, contrast ratio, etc. is summarized is Table 4. The main advantage of our design over other works^{1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,17,18,19,35,36,37} is that a single structure with the same structural and geometrical parameters can be used for all logic operations. Furthermore, comparing to the logic gate proposals named in Table 4, the other benefits offered by our logic gate are its on-chip design, operation wavelength of 1.55 µm, capability for broadband operation which is suitable for wavelength division multiplexing, and high computational speed (up to 50 Gbps).

Table 4 Comparison between our design and some of the state-of-the-art optical logic gates.

Full size table

Conclusion

In conclusion, a simple, integrable, low-loss, high-speed and broadband on-chip multi-functional optical logic gate capable of performing AND, NOT, and OR functions have been demonstrated using a diffractive optical neural network which is composed of cascaded on-chip 1D-metasurfaces. Due to broadband operation of the proposed logic gate, it is capable for wavelength multiplexed parallel computation, which helps to realize the full potential of optical computing. While the proposed logic gate shows poor binary contrast, a larger network composed of higher number of neurons can be trained to surmount this problem.

Data availability

All data generated or analyzed during this study are included in this published article (and its Supplementary Information files).

References

Wei, H., Wang, Z., Tian, X., Käll, M. & Xu, H. Cascaded logic gates in nanophotonic plasmon networks. Nat. Commun. 2, 387 (2011).
Article ADS Google Scholar
Jandieri, V. et al. Functional all-optical logic gates for true time-domain signal processing in nonlinear photonic crystal waveguides. Opt. Express 28, 18317–18331 (2020).
Article ADS Google Scholar
Moroney, N. et al. Logic gates based on interaction of counterpropagating light in microresonators. J. Lightwave Technol. 38(6), 1414–1419 (2020).
Article ADS CAS Google Scholar
Liu, Y. et al. All-optical logic gates based on two-dimensional low-refractive-index nonlinear photonic crystal slabs. Opt. Express 19, 1945–1953 (2011).
Article ADS CAS Google Scholar
Wu, Y. D. All-optical logic gates by using multibranch waveguide structure with localized optical nonlinearity. IEEE J. Sel. Top. Quantum Electron. 11(2), 307–312 (2005).
Article ADS CAS Google Scholar
McCutcheon, M. et al. All-optical conditional logic with a nonlinear photonic crystal nanocavity. Appl. Phys. Lett. 95(22), 221102 (2009).
Article ADS Google Scholar
Wu, Y. D., Shih, T. T. & Chen, M. H. New all-optical logic gates based on the local nonlinear Mach-Zehnder interferometer. Opt. Express 16, 248–257 (2008).
Article ADS Google Scholar
Fouskidis, D. E., Zoiros, K. E. & Hatziefremidis, A. Reconfigurable all-optical logic gates (AND, NOR, NOT, OR) with quantum-dot semiconductor optical amplifier and optical filter. IEEE J. Sel. Top. Quantum Electron. 27(2), 1–15 (2021).
Article Google Scholar
Ma, S., Chen, Z., Sun, H. & Dutta, N. K. High speed all optical logic gates based on quantum dot semiconductor optical amplifiers. Opt. Express 18, 6417–6422 (2010).
Article ADS CAS Google Scholar
Xu, Q. & Lipson, M. All-optical logic based on silicon micro-ring resonators. Opt. Express 15, 924–929 (2007).
Article ADS Google Scholar
Yang, H. et al. Nanowire network-based multifunctional all-optical logic gates. Sci. Adv. 4(7), eaar7954 (2018).
Article ADS CAS Google Scholar
Kita, S., Nozaki, K., Takata, K., Shinya, A. & Notomi, M. Ultrashort low-loss Ψ gates for linear optical logic on Si photonics platform. Commun. Phys. 3, 33 (2020).
Article CAS Google Scholar
Ying, Z. & Soref, R. Electro-optical logic using dual-nanobeam Mach-Zehnder interferometer switches. Opt. Express 29, 12801–12812 (2021).
Article ADS CAS Google Scholar
He, L., Zhang, W. X. & Zhang, X. D. Topological all-optical logic gates based on two-dimensional photonic crystals. Opt. Express 27, 25841–25859 (2019).
Article ADS CAS Google Scholar
Li, Z., Chen, Z. & Li, B. Optical pulse controlled all-optical logic gates in SiGe/Si multimode interference. Opt. Express 13, 1033–1038 (2005).
Article ADS CAS Google Scholar
Sang, Y. et al. Broadband multifunctional plasmonic logic gates. Adv. Opt. Mater. 6, 1701368 (2018).
Article Google Scholar
Wei, H. et al. Quantum dot-based local field imaging reveals plasmon-based interferometric logic in silver nanowire networks. Nano Lett. 11, 471–475 (2011).
Article ADS CAS Google Scholar
Fu, Y. et al. All-optical logic gates based on nanoscale plasmonic slot waveguides. Nano Lett. 12, 5784–5790 (2012).
Article ADS CAS Google Scholar
Yang, Z., Fu, Y., Yang, J., Hu, C. & Zhang, J. Spin-encoded subwavelength all-optical logic gates based on single-element optical slot nanoantennas. Nanoscale 10(9), 4523–4527 (2018).
Article CAS Google Scholar
Qian, C. et al. Performing optical logic operations by a diffractive neural network. Light Sci. Appl. 9, 59 (2020).
Article ADS CAS Google Scholar
Wang, Z. et al. On-chip wavefront shaping with dielectric metasurface. Nat. Commun. 10(1), 3547 (2019).
Article ADS Google Scholar
Pestourie, R. et al. Inverse design of large-area metasurfaces. Opt. Express 26(26), 33732 (2018).
Article ADS CAS Google Scholar
Wu, Z., Zhou, M., Khoram, E., Liu, B. & Yu, Z. Neuromorphic metasurface. Photon. Res. 8(1), 46 (2019).
Article ADS Google Scholar
Backer, A. Computational inverse design for cascaded systems of metasurface optics. Opt. Express 27(21), 30308 (2019).
Article ADS Google Scholar
Zarei, S., Khavasi, A. Computational inverse design for cascaded systems of metasurface optics: comment. to be published in Opt. Express. https://doi.org/10.1364/OE.448757.
Fu, T. et al. On-chip photonic diffractive optical neural network based on a spatial domain electromagnetic propagation model. Opt. Express 29, 31924–31940 (2021).
Article ADS CAS Google Scholar
Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018).
Article ADS MathSciNet CAS Google Scholar
Weng, J. et al. Meta-neural-network for real-time and passive deep-learning-based object recognition. Nat. Commun. 11(1), 1–8 (2020).
Article ADS Google Scholar
Luo, Y. et al. Design of task-specific optical systems using broadband diffractive neural networks. Light Sci. Appl. 8(1), 1–14 (2019).
Article CAS Google Scholar
Li, J. et al. Spectrally encoded single-pixel machine vision using diffractive networks. Sci. Adv. 7(13), eabd7690 (2021).
Article ADS Google Scholar
https://www.lumerical.com/learn/whitepapers/lumericals-2-5d-fdtd-propagation-method/.
Wang, Z., Chang, L., Wang, F., Li, T. & Gu, T. Integrated photonic metasystem for image classifications at telecommunication wavelength. Nat. Commun. 13, 2131 (2022).
Article ADS CAS Google Scholar
Fu, T., Zang, Y., Huang, Y., Du, Z., Huang, H., Hu, C., et al. Photonic machine learning with on-chip diffractive optics. 13 May 2022, PREPRINT (Version 1) available at Research Square. https://doi.org/10.21203/rs.3.rs-1550655/v1.
Zarei, S., Marzban, M. & Khavasi, A. Integrated photonic neural network based on silicon metalines. Opt. Express 28(24), 36668 (2020).
Article ADS CAS Google Scholar
Dhasarathan, V., Sahu, S. K., Nguyen, T. K. & Palai, G. Realization of all logic gates using metamaterials based three-dimensional photonics structures: A future application of 3D photonics to optical computing. Optik 202, 163723 (2020).
Article ADS CAS Google Scholar
Nayak, S., Panda, M. & Palai, G. Realization of optical ADDER circuit using photonic structure and KNN algorithm. Optik 212, 164675 (2020).
Article ADS Google Scholar
Palai, G., Tripathy, M., Das, R. & Subudhi, P. K. Realization of all logic gates using hybrid grating structure: An application of silicon photonics. Optik 147, 256 (2017).
Article ADS CAS Google Scholar

Download references

Funding

Funding was provided by Iran National Science Foundation, 4002443.

Author information

Authors and Affiliations

Department of Electrical Engineering, Sharif University of Technology, Tehran, Iran
Sanaz Zarei & Amin Khavasi

Authors

Sanaz Zarei
View author publications
You can also search for this author in PubMed Google Scholar
Amin Khavasi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.K. supervised the research. S.Z. developed the design principle and performed FDTD simulations for meta-atoms. Also, the analytical model for the analysis of DONN architecture was formulated and verified by 2.5D variational solver of Lumerical Mode Solution by S.Z. S.Z. wrote the manuscript and both authors contributed to the discussion and analysis of the results and review of the manuscript.

Corresponding author

Correspondence to Sanaz Zarei.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zarei, S., Khavasi, A. Realization of optical logic gates using on-chip diffractive optical neural networks. Sci Rep 12, 15747 (2022). https://doi.org/10.1038/s41598-022-19973-0

Download citation

Received: 19 June 2022
Accepted: 07 September 2022
Published: 21 September 2022
DOI: https://doi.org/10.1038/s41598-022-19973-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.