Deep photonic network platform enabling arbitrary and broadband optical functionality

Najjar Amiri, Ali; Vit, Aycan Deniz; Gorgulu, Kazim; Magden, Emir Salih

doi:10.1038/s41467-024-45846-3

Download PDF

Article
Open access
Published: 16 February 2024

Deep photonic network platform enabling arbitrary and broadband optical functionality

Nature Communications volume 15, Article number: 1432 (2024) Cite this article

2537 Accesses
Metrics details

Subjects

Abstract

Expanding applications in optical communications, computing, and sensing continue to drive the need for high-performance integrated photonic components. Designing these on-chip systems with arbitrary functionality requires beyond what is possible with physical intuition, for which machine learning-based methods have recently become popular. However, computational demands for physically accurate device simulations present critical challenges, significantly limiting scalability and design flexibility of these methods. Here, we present a highly-scalable, physics-informed design platform for on-chip optical systems with arbitrary functionality, based on deep photonic networks of custom-designed Mach-Zehnder interferometers. Leveraging this platform, we demonstrate ultra-broadband power splitters and a spectral duplexer, each designed within two minutes. The devices exhibit state-of-the-art experimental performance with insertion losses below 0.66 dB, and 1-dB bandwidths exceeding 120 nm. This platform provides a tractable path towards systematic, large-scale photonic system design, enabling custom power, phase, and dispersion profiles for high-throughput communications, quantum information processing, and medical/biological sensing applications.

Integrated reconstructive spectrometer with programmable photonic circuits

Article Open access 11 October 2023

Broadband picometer-scale resolution on-chip spectrometer with reconfigurable photonics

Article Open access 25 June 2023

Programmable photonic circuits

Article 07 October 2020

Introduction

Photonic integrated circuits (PICs)^1,2 have significantly evolved over the last decade and are now essential technological components with critical importance in optical communications³, sensing^4,5, and computing^6,7,8. With the growing diversity and complexity of photonic applications, designing custom PICs with state-of-the-art performance metrics has become one of the most critical drivers of advancement in photonic systems. Traditional approaches relying on prior knowledge of relevant architectures, fundamental principles, and physical intuition yield a limited library of known devices and severely restrict the potential capabilities of the resulting photonic systems. More general approaches have recently emerged under the broad category of inverse/machine-optimized design^{9,10,11,12,13,14}, allowing for greater design flexibility than manual tuning of waveguide parameters. Through comprehensive searches over the complete domain of fabrication-compatible devices, various types of couplers¹⁵, polarization splitters^10,16, and spectral filters^9,14 have been proposed and demonstrated through these inverse-design methods. However, in these “free-form" design approaches, the degrees of design freedom are effectively controlled by the specified device footprint, which has key implications on the final device performance and the associated computational cost. While larger device footprints inherently provide the necessary design flexibility for complex and arbitrary optical functionality, they also rapidly scale the computational complexity of the necessary optimization process due to the physically-accurate electromagnetic simulations required^9,10,14,17. These requirements preclude the design of arbitrarily complex, ultra-broadband, or wavelength-specific photonic devices for the increasing number and variety of use cases and application requirements.

The ideal approach to photonic design must allow for arbitrarily-specified photonic functionality while maintaining low computational cost. In recent years, programmable PICs made from Mach-Zehnder interferometers (MZIs) have been proposed as a potential solution to this problem^{3,18,19,20,21}. These systems enable tuning of optical responses through active phase shifters to achieve wavelength-specific linear mappings for applications including high-speed and power-efficient optical signal routing^3,20,22, image/signal classification^{23,24,25,26,27}, and quantum computing^8,28. Yet, the potential utility of photonic interferometer networks extends well beyond these demonstrated capabilities, with critical implications towards the design of photonic systems with arbitrarily complex transfer functions.

In this paper, we introduce and experimentally demonstrate a highly-scalable framework for the design of photonic systems with arbitrarily-specified functionality, based on a deep photonic network architecture of custom-designed MZIs. Our architecture consists of a mesh of individually designed interferometers and is modeled by an equivalent computational network equipped with ultra-fast and physically-accurate simulation capabilities. In this network, each MZI is constructed from unique waveguide tapers, allowing for specific wavelength-dependent phase profiles to be achieved according to the target photonic functionality specified. The exact geometry of the individual interferometers is optimized by leveraging physics-informed machine learning capabilities in our design framework through a combination of rapid lookup of waveguide parameters and successive evaluation of photonic transfer matrices. Using this framework, we design ultra-broadband 50/50 and 75/25 power splitters and a spectral combiner/splitter, each in less than two minutes, with inherent fabrication compatibility on the 220-nm-thick silicon-on-insulator platform, and experimentally demonstrate state-of-the-art performance for all three devices. Our presented framework provides a path towards the systematic design of large-scale photonic systems with arbitrarily-specified, wavelength-dependent, or ultra-broadband responses.

Results

Deep photonic network architecture

The architecture of our deep photonic network consists of an input layer, a series of MZI layers, and an output layer, as shown in the schematic in Fig. 1a. This architecture based on a mesh of MZIs has the theoretical capability to implement any linear N × N input-output mapping in order to achieve arbitrary optical functionality^29,30,31. Input optical signal to the network is provided either externally by a series of couplers as shown, or by waveguides from upstream devices on-chip. The input optical signal is processed unidirectionally through layers of custom MZI interferometers, each with its own specific 2 × 2 mapping function denoted by T_i,j. This modular network is modeled using the transfer matrix description of each one of its constituent building blocks, in a modular configuration. Specifically, each MZI consists of two pairs of waveguide tapers with custom geometries and two directional couplers, as illustrated in Fig. 1(b). The overall transfer matrix for each MZI is described by the transfer matrices of these constituent blocks as:

$$\begin{array}{l}T(\lambda )={e}^{-j\varphi (\lambda )}\left[\begin{array}{ll}t(\lambda )&-jq(\lambda )\\ -jq(\lambda )&t(\lambda )\end{array}\right]\left[\begin{array}{ll}{e}^{-j{\theta }_{21}(\lambda )}&0\\ 0&{e}^{-j{\theta }_{22}(\lambda )}\end{array}\right]\\ \times {e}^{-j\varphi (\lambda )}\left[\begin{array}{ll}t(\lambda )&-jq(\lambda )\\ -jq(\lambda )&t(\lambda )\end{array}\right]\left[\begin{array}{ll}{e}^{-j{\theta }_{11}(\lambda )}&0\\ 0&{e}^{-j{\theta }_{12}(\lambda )}\end{array}\right]\end{array}$$

(1)

where t(λ), q(λ), and φ(λ) are the through- and cross-port amplitude coefficients and the phase response of the directional couplers, and θ₁₁(λ) through θ₂₂(λ) are the phases accumulated in corresponding waveguide tapers. The wavelength dependence of each one of these parameters plays a critical role in achieving arbitrary optical functionality in our networks. The directional couplers used throughout the network are identical and are designed to be approximately 50% couplers at 1550 nm (see Supplementary Section 1 for details). A schematic of this directional coupler and its simulated through-port transmission are shown in Fig. 1c. In contrast, all waveguide tapers are unique and custom-designed using a set of width and length parameters, as illustrated in Fig. 1d, which are determined through an iterative optimization algorithm. The phase accumulated through each custom waveguide taper is calculated as a differentiable function of these custom widths (w_i), taper length (L_θ), and input wavelength (λ), using the waveguide effective index n_eff(w, λ). This unique implementation allows the network to achieve wavelength-dependent phase profiles different from that of a straight waveguide, as demonstrated in the inset of Fig. 1d, enabling much higher degrees of freedom while maintaining the same device footprint. Our design framework then constructs the overall photonic integrated circuit through an arbitrary number of interferometric layers as shown in Fig. 1e.

**Fig. 1: Deep photonic network architecture and components.**

Simulation and optimization of the network’s optical response

Propagation of the complex optical amplitude through the network is carried out by a computational graph mimicking the physical network architecture. At each wavelength, the optical transformation carried out by the mesh of interferometers between N input channels and N output channels is represented by a computational graph. This architecture calculates the wavelength-dependent linear scattering matrix S(λ) of the entire deep photonic network according to

$$\begin{array}{rc}\mathop{\prod }\limits_{q=1}^{\lceil \frac{M}{2}\rceil }&\left[\begin{array}{lllll}{T}_{1,R+1-2q}&&&&\\ &&&&\\ &&\ddots &&\\ &&&&\\ &&&&{T}_{n,R+1-2q}\\ \end{array}\right]\\ &\times \left[\begin{array}{lllll}F&&&&\\ &{T}_{1,R-2q}&&&\\ &&\ddots &&\\ &&&{T}_{n-1,R-2q}&\\ &&&&F\end{array}\right]\end{array}$$

(2)

for networks with an even number of inputs, and by

$$\begin{array}{ll}\mathop{\prod }\limits_{q=1}^{\lceil \frac{M}{2}\rceil }&\left[\begin{array}{llll}F&&&\\ &{T}_{1,R+1-2q}&&\\ &&\ddots &\\ &&&{T}_{n,R+1-2q}\\ \end{array}\right]\\ &\times \left[\begin{array}{llll}{T}_{1,R-2q}&&&\,\\ &\ddots &&\\ &&{T}_{n,R-2q}&\\ &&&F\\ \end{array}\right]\end{array}$$

(3)

for networks with an odd number of inputs. Here, M represents the number of interferometric layers, $n=\lfloor \frac{N}{2}\rfloor$, $R=2\lceil \frac{M}{2}\rceil+1$, and F is a scalar indicating the phase accumulated through the topmost and bottommost arms of the network where no interferometer is present. Note that the very first matrix is omitted from the products when using an odd number of layers.

This computation involves integrating the waveguide effective index using the custom widths and lengths for each waveguide taper, and extracting the directional coupler through-port, cross-port coefficients, and phase response from the 3D-FDTD results. In order for our custom photonic networks to be optimized for user-defined optical functionality, these operations are implemented through a differentiable programming construct, enabling both fast parameter lookups and automatic calculation of relevant derivatives³². For calculating θ(λ), we numerically integrate the effective index throughout the length of the custom tapers using data obtained from Silicon Photonics Toolkit³³, an open-source software package providing access to several important propagation-related parameters in silicon waveguides as functions of wavelength and waveguide width. The directional coupler coefficients are similarly extracted from a differentiable interpolation of its 3D-FDTD simulation results. The result of this computation yields the complete network transfer function with a high degree of physical accuracy including the wavelength-dependent mappings for each input-output pair.

The ability to rapidly calculate a given network’s optical response as a differentiable function of its design parameters is critical from an optimization perspective. Using this capability, we construct an optimization procedure by iteratively modifying the waveguide tapers in order to obtain application-specific photonic networks with arbitrarily defined transfer functions. This procedure is illustrated for an example 1-input 4-output network in Fig. 2a. First, we initialize a network with the desired number of interferometric layers and input-output ports. We define the target optical transfer function of these input-output pairs (T_target(λ)), and assign semi-random width and length parameters to the constituent custom waveguide tapers. The network’s optical response is evaluated as a function of wavelength using the procedure described above and compared with the target transfer function. The difference between the calculated and target transfer functions is formulated as a mean squared error $J(x)=\frac{1}{Q}{\sum }_{\lambda }| {T}_{{{{{{{{\rm{calculated}}}}}}}}}(\lambda,\, x)-{T}_{{{{{{{{\rm{target}}}}}}}}}(\lambda ){| }^{2}$, where Q is the number of wavelengths and x are design parameters including widths and lengths of the custom tapers. Gradient of J(x) with respect to these design parameters ∇_x J is calculated through a back-propagation procedure. We then minimize this error by iteratively modifying the widths and lengths of waveguide tapers, as illustrated in Fig. 2b, using a gradient-based optimization algorithm³⁴. In addition to this error itself, we implemented numerous regularization schemes to achieve inherent fabrication compatibility by restricting waveguide widths from undergoing extreme changes in the custom tapers throughout the optimization procedure. Details regarding network initialization, convergence of this optimization process, and final resulting waveguide parameters can be found in Supplementary Section 2.

**Fig. 2: Optimization of an example 1-input 4-output photonic network.**

Arbitrary optical functionality with deep photonic networks

One of the key advantages of our proposed deep photonic network functionality is its ability to enable designs of photonic devices with arbitrary spectral specifications. We demonstrate how this capability allows for a universal design procedure for designing devices with ultra-broadband responses, and also devices with specific spectral features. As a proof of principle, this functionality is illustrated in Fig. 3 with three separate devices: two broadband power splitters with 50/50 and 75/25 splitting ratios operating within 1400-1600 nm, and a 1 × 2 spectral duplexer between 1450 nm and 1630 nm.

**Fig. 3: Optimization and final simulation results of power splitter and spectral duplexer deep photonic networks.**

Depending on the complexity of the desired functionality, our framework allows for the appropriate selection of hyperparameters of the deep photonic network including the number of interferometric layers and the number of custom widths in each waveguide taper. Details regarding the selection of hyperparameters can be found in Supplementary Section 3. Here, the power splitters are both designed with networks of three layers each, and the duplexer is designed with a network of six layers. For each custom waveguide taper in our devices, we used five trainable widths and a trainable length, resulting in a total of 24 parameters for each MZI in our photonic networks. The evolution of the resulting mean squared errors throughout the optimization processes are plotted in Fig. 3a–c, where convergence is achieved in several hundred iterations and, at most, a few minutes on a single Tesla V100 GPU. Details regarding the optimization time of the photonic networks and their scalability can be found in Supplementary Section 4.

The wavelength-dependent design capability of our network is illustrated in Fig. 3d–f, where we plot the transmission at one of the output ports for each one of the three devices as a function of wavelength. Throughout optimization, the output state evolves towards the target output functionality, as can be seen by the optical responses gradually approaching the desired 50%, 25% (for one output), and the spectrally duplexed outputs for the three devices, respectively. In Fig. 3g–i, the transmission spectra at both output ports are plotted for each device at their randomly-initialized states at the beginning of optimization, at an intermediate state where the devices have been partially trained, and at the final states of the optimized devices. The final device responses demonstrate a near-perfect match with the specified target functionality. These responses are verified by the propagation of the optical input in the final optimized devices, which are plotted using the electric field intensity from 3D-FDTD simulations in Fig. 3j–l. These simulation results confirm the expected outputs from our transfer matrix calculations that our networks are trained with. As expected, the power splitters achieve broadband operation; and the duplexer functions as a spectral splitter within its spectral design range, providing long-pass and short-pass outputs.

Experimental demonstration and analysis of network response

The experimental characterization results for the two power splitters are shown in Fig. 4a, b. For the 50/50 splitter, the maximum deviation from 50% transmission is as low as ± 6.42% for both output ports; and the insertion loss is measured to be less than 0.5 dB. As such, our network-based power splitter experimentally achieves a deviation of at most 0.6 dB within the 120 nm of measured bandwidth, and therefore a 1-dB bandwidth much wider than that. Similarly, for the 75/25 splitter, the deviations from the target transmission are within ±5.49% ( ±0.86 dB) and ±8.88% ( ±0.55 dB), for output ports number one and number two, respectively. The measured insertion loss is less than 0.61 dB for both output ports. These results indicate that the 75/25 splitter achieves a 1-dB bandwidth of at least 120 nm, our widest measurement range possible. The spectral duplexer’s experimental characterization results are shown in Fig. 4c. Within the pass-bands, a maximum loss of 11.45% (0.52 dB) and 15.30% (0.72 dB) are measured for the short-pass and long-pass outputs, respectively, and the insertion loss is measured about 0.66 dB (occurring at 1590 nm). The measured cutoff wavelength is around 1555.2 nm, compared to the specified target cutoff wavelength of 1550 nm. The extinction ratio between the two outputs is better than 15 dB for the majority of the wavelength range characterized, and only reaches 13.6 dB at the edge of the measured spectrum (1600 nm). All three devices experimentally exhibit state-of-the-art performance and a close match with the training objective transmission responses. Reflection in our networks was also characterized, and found to be around -30 dB for the majority of the measured spectrum with no practical influence on our device optimization processes (Supplementary Section 5). These results demonstrate and experimentally verify the universal capability of our design approach.

**Fig. 4: Experimental measurements and fabrication tolerance analysis of deep photonic networks.**

Next, we analyze the robustness of our deep photonic networks against fabrication variations. Specifically, we plot the resulting transmission responses from transfer matrix calculations under potential over-etch and under-etch scenarios in Fig. 4d–f up to a change of ±20 nm in the waveguide widths and gaps. The device responses are calculated by simulations of the network structures with updated waveguide tapers and directional couplers for the amounts of specified etch offsets. We observe minimal deviation of the transmission response from the ideal case with ± 10 nm over- and under-etch. At ±20 nm, we observe more significant changes in the simulated transmission responses, resulting from changes in the wavelength-dependent phase profiles in waveguide tapers and the shifted responses of the directional couplers, as expected. This is also demonstrated in Fig. 4g–i, where we plot the mean squared error of the resulting transmission with different over- and under-etch amounts. The calculated error increases with larger over-/under-etch amounts, indicating deteriorations in the resulting device performance. Functionally, we note that all three devices can still work as intended, with slightly inferior performance metrics up to the simulated ± 20 nm etch offsets.

Deep photonic network capability and fabrication robustness

The scalability of our deep photonic networks and the computational efficiency of our underlying simulation/optimization framework can provide highly capable networks with extremely large degrees of freedom to design arbitrarily complicated optical devices. For this architecture, the selection of the number of interferometric layers is a major design choice that determines the number of degrees of freedom for the network. While the trainability and capability of the resulting network increase with the number of layers at first, each additional layer also introduces additional propagation loss due to the waveguide bends added with each layer. This trade-off between device capability and insertion loss can be modeled by analyzing devices with different numbers of layers trained for the same objective functionality. In Fig. 5a, we plot the final mean squared error in the simulated transmission responses for different 50/50 power splitters designed with numbers of layers ranging from M = 2 to M = 60. As expected, the simulated error in the transmission response initially decreases and reaches a minimum with networks of 3 and 4 layers. However, with the increased number of layers, the accumulation of insertion loss through additional layers outweighs the benefits of increased network capability, and results in a larger calculated error and an inferior transmission response.

**Fig. 5: Influence of network size on final device performance.**

Similarly, while longer networks with more interferometric layers can provide larger degrees of freedom and more complex optical capabilities, they are also less robust to fabrication variations. Similar to the added optical loss, errors in the phase profiles add up through the additional layers and negatively affect the resulting device performance. We analyze the fabrication tolerance of 50/50 splitters constructed from different numbers of layers in Fig. 5b, where the mean squared error is plotted as a function of the etch offset. The results demonstrate that longer networks (with greater numbers of layers) are more sensitive to fabrication-induced changes due to the accumulation of phase and coupling errors within consecutive MZIs. For instance, while the minimum error calculated is similar for 3-layer and 4-layer splitters, the 4-layer network exhibits significantly worse performance with etch offsets reaching ± 20 nm. This analysis serves as an important guideline towards determining the appropriate number of layers for the design of specific structures using the demonstrated custom networks. Similar analyses for the 75/25 power splitter and the spectral duplexer can be found in Supplementary Section 3.

Multi-objective design capabilities

In addition to creating a single device with a single optical functionality, our design framework is also capable of utilizing multi-objective capabilities to create devices with more complex optical functionalities. We use this particular approach to demonstrate the design of deep photonic networks with more advanced capabilities including built-in tolerance against fabrication variations as well as scalability through different optical transfer functions across a larger number input-output pairs. For achieving fabrication tolerance, we simultaneously optimize the optical response of multiple different versions of a network, each one resulting from a different over-etch or under-etch scenario. Moreover, we also configure a photonic network as a combination of multiple different power splitters, in which the optical response depends on what port the optical input is received at. In this case, we define a more general figure of merit as a mean squared error including all possible combinations of fabrication variations and input ports as $J(x)=\frac{1}{Q}{\sum }_{{{\Omega }}}{\sum }_{{{\Delta }}w}{\sum }_{\lambda }| {T}_{{{{{{{{\rm{calculated}}}}}}}}}({{\Omega }},{{\Delta }}w,\lambda,x)-{T}_{{{{{{{{\rm{target}}}}}}}}}({{\Omega }},\lambda ){| }^{2}$ where the width offset parameter Δw represents the over-etch or under-etch perturbations in waveguide widths, and Ω indicates the input port selection, which now dictates the type of optical operation applied on the input signal. Consequently, the target transfer function T_target(Ω, λ) is now also a function of Ω. For this more general figure of merit, Q is the updated total number of combinations of all wavelengths, etch-offsets, and input port specifications.

This formulation allows us to design networks with more complex relationships between input-output pairs while simultaneously achieving tolerance against fabrication variations. We showcase this capability by designing a fabrication-tolerant photonic network with two inputs and three outputs, with a combined power splitter functionality, as illustrated by the device schematic in Fig. 6a. The target functionality for this device is configured such that light entering the center input is separated equally between the three outputs (1/3, 1/3, 1/3), whereas the light entering the top input is separated equally between only the top and bottom output ports (1/2, 0, 1/2) throughout the entire C-band. This network is constructed from four consecutive layers of interferometers as shown, resulting in a total footprint of 8 × 320 μm².

**Fig. 6: Multi-objective optimization of a deep photonic network with multiple different power splitter capabilities and tolerance against fabrication variations.**

For analysis of fabrication-tolerant design capability, we demonstrate the performance of networks designed both without and with tolerance to fabrication errors. The evolution of figures of merit throughout the optimization processes are plotted in Fig. 6b, c. In Fig. 6c, five different Δw offsets (-20 nm, -10 nm, 0 nm, 10 nm, 20 nm) were considered. In this fabrication-tolerant design, as the optimizer takes into account not a single network but five different networks simultaneously, the resulting figure of merit effectively includes optimizing the transfer function of a total of 5 × 4 = 20 MZIs. From this perspective, device optimization under fabrication errors inherently involves scaling to a larger number of interferometers, simply by the nature of this target functionality. While scaling in such artificial dimensions has obvious practical differences from spatial scaling in network depth or width, the resulting fabrication tolerance capability can be considerably more important for usability in application settings. For this specific example, the final figures of merit for the ideal and fabrication-tolerant networks were 1 × 10⁻⁵ and 5 × 10⁻⁵, respectively. As anticipated, the fabrication-tolerant device yields slightly worse performance as evidenced by the larger figure of merit. Moreover, increased complexity due to the consideration of multiple objectives for this device results in a greater number of iterations needed for convergence. However, despite doubling the number of iterations, we note that total optimization time recorded only increases by less than 5 seconds, underscoring the computational efficiency of the design framework (more details can be found in Supplementary Section 4). A comparison of performance for the two devices is shown in Fig. 6d, e. Despite the slightly larger figure of merit for the fabrication-tolerant network, both devices demonstrate near-perfect transmissions under ideal fabrication conditions. However, under non-zero Δw offsets, the fabrication-tolerant device maintains much flatter transmission spectra on all of its output ports throughout the entire C-band. This result demonstrates the ability of deep photonic networks to achieve more complex and multi-functional capabilities while simultaneously enabling much better robustness against fabrication errors across all output ports, for all objectives, through the entire design spectrum. We quantify these built-in fabrication tolerance capabilities further in Fig. 6f by plotting the mean squared error as a function of Δw for over-etch and under-etch scenarios ranging from -20 nm to 20 nm. While the ideal device clearly achieves a better absolute error under no fabrication errors (Δw = 0), the fabrication-tolerant network demonstrates larger tolerances by maintaining a significantly lower figure of merit in case of non-zero Δw. These results also demonstrate the practicality of our design framework for integration in a wide variety of applications and fabrication platforms, by giving system designers a choice in the final selection between different designs, which can be influenced by the specific fabrication procedures used.

Discussion

Our design framework provides a computationally efficient, physically accurate, and systematic methodology for creating deep photonic network architectures for on-chip arbitrary optical systems. The design framework is also capable of extended functionality for specific output configurations enabling band-pass filters with different bandwidths (Supplementary Section 6) as well as devices with constant dispersion profiles (Supplementary Section 7). For all of our demonstrations, while we only focused on silicon-based devices, the presented methodology is applicable in a wide variety of material platforms and spectral applications. Currently, each MZI in our deep photonic networks is 80 μm long and 4 μm wide, due to size of the directional couplers (Supplementary Section 1) and 10 μm-long custom tapers. Depending on the network width and depth, these dimensions result in footprints from 960 μm² to 1920 μm² for our experimentally demonstrated devices, which are either consistent with or smaller than those of integrated interferometer meshes in literature. These include programmable^3,7,35,36 photonic information processors whose responses also require additional electrical system stability, as well as meshes specifically targeting compact network structures with typical reported optical subsystem footprints ranging from 0.025 mm² to multiple mm² (not including electrical interfacing, metal routing, or contacts)^6,37,38,39. Moreover, our design framework also uniquely benefits from its ability to effectively combine multiple functional devices into a single photonic network, as demonstrated by the results in Fig. 6. Even though more complex optimization objectives may require longer devices with inherent size limitations (Supplementary Section 8), such multi-functional integration presents an additional and unique avenue towards achieving much higher on-chip integration density, while still maintaining broad optical operation bandwidths.

Our 50/50 and 75/25 power splitters demonstrate simulated 1dB bandwidths of over 200 nm, and experimentally measured 1dB bandwidths as wide as the entire measured spectrum of 120 nm. Both devices operate with insertion losses below 0.61 dB. In comparison to previous experimental demonstrations^{40,41,42,43,44,45,46,47,48,49,50}, these metrics represent the state-of-the-art performance in bandwidth, and illustrate comparable performance in insertion loss. Likewise, our duplexer demonstrates better experimental performance than devices with similar functionality^9,14,51,52, with less than 0.66 dB insertion loss, flat-top transmissions at both outputs, and a cutoff wavelength shift of only 5 nm. Despite the operation bandwidth reaching over 120 nm, this achieved spectral shift is also similar to reported metrics from literature where specific cutoff wavelengths for resonators, filters, or duplexers typically deviate from their targets by several nm^{9,14,51,52,53,54}. Depending on specific application requirements, this shift can be compensated through standard thermal tuning mechanisms^55,56. Similarly, based on application needs, the roll-off between the two bands may also be improved by optimizing with a tighter spectral placement of transmission targets shown in Fig. 3(i). These previously reported splitters and duplexers range approximately between 5 μm² and 6 mm² in footprint, depending on their operation principles and constituent waveguide structures. While some of these previous demonstrations using basic ring resonators^{53,54,57,58,59}, Y-junctions^44,50,60,61, and subwavelength grating waveguides^43,45,49,62 can achieve functionally similar operation within smaller footprints than our deep photonic networks, their capabilities remain limited to well-defined and fundamental operations with potentially narrower operation bandwidths. Even for devices obtained with free-form inverse-design techniques^{9,13,14,40,45,63}, the types, complexity, and bandwidth of possible optical operations are practically restricted by the inherent computational difficulty of addressing complicated objectives that require greater degrees of freedom and larger device sizes. In contrast, our networks naturally scale to a greater number of input-output pairs, with little change in their computational optimization performance (Supplementary Section 4). As a result, deep photonic networks allow for a wide and diverse array of demonstrated functional capabilities as complex as arbitrary, multi-functional, and inherently fabrication-tolerant power splitters, duplexers, band-pass filters, and dispersion compensators. As such, these networks not only advance the state-of-the-art in device performance, but also create new pathways for custom photonic system solutions.

In summary, our design framework enables highly scalable implementations of arbitrary transfer functions on-chip, by casting the problem of photonic design as a constrained optimization problem with inherent fabrication compatibility. By integrating accurate waveguide parameters and 3D-FDTD simulations into a physics-informed machine learning architecture, this methodology enables rapid yet accurate simulations of photonic devices and their scalable optimization. Our modular network design allows for a large number of degrees of freedom through custom layers of MZIs, allowing for complex photonic functionality, and therefore presents a tractable path forward for the design of large-scale integrated photonic systems. Moreover, as our computational design framework keeps track of complete phase information through the individual network components, it allows for the design of photonic networks with specific phase and dispersion profiles as a part of their target functionality. Due to the availability of rapid individual device simulations, our framework can also be configured to enable future designs with on-chip amplifiers and lasers^64,65, electrically-interfaced modulators and detectors⁶⁶, as well as structures with robustness against fabrication-induced variations⁶⁷. These capabilities present exciting novel directions in the design of photonic components with arbitrary transfer functions for use in next generation optical communication applications, neuromorphic photonic information processors, and medical/biological sensing.

Methods

Numerical simulations

The effective indices of silicon strip waveguides were extracted using Silicon Photonics Toolkit³³, an automatic differentiation-compatible open-source software package for the design of integrated photonic structures. This package enables fast lookup and evaluation of waveguide parameters on the 220 nm SOI platform, which is critically important for the rapid and scalable evaluation of our optical transfer functions. In our deep photonic networks, optical responses of the other components including directional couplers and waveguide bends were extracted from 3D-FDTD simulations performed with a maximum spatial discretization of 17 nm in all three dimensions. These responses including both amplitude and phase information were then linearly interpolated at 1000 wavelengths between 1.2 μm and 1.7 μm. The resulting interpolations were implemented as automatic differentiation-compatible lookup functions, and used during the performance evaluation of the constructed photonic networks.

Numerical optimization framework

Our deep photonic network optimization framework was built on an open-source, end-to-end deep learning library⁶⁸, enabling the use of state-of-the-art machine learning software constructs as well as access to modern hardware accelerators including GPUs and TPUs. In this framework, we model each interferometric structure as part of a physics-informed artificial neural network, and evaluate the amplitude and phase profiles of the transfer functions between each input/output pair using the automatic differentiation-compatible functions described above. This modular and highly parallelizable architecture allows for serial, parallel, or even residual types of connections between interferometric layers, which can also be used for constructing more complicated network topologies. The trainable parameters of our networks are iteratively optimized using adaptive moment estimation³⁴. During the optimizations, the learning rate was progressively reduced from 3 × 10⁻³ to 10⁻⁴ for ease and speed of convergence. For the design of the power splitters and the spectral duplexer, we used batch sizes of 32 and 21, respectively. A relative convergence was used for the stop condition of optimizations (see Supplementary Section 4 for details). All optimizations were performed using a single Tesla V100 GPU.

Device fabrication

After optimization, the final designed devices were converted to mask layouts using capabilities implemented in our design framework, through an open-source layout construction software library⁶⁹. Grating couplers were added at the inputs and outputs of the network in order for on- and off-chip light coupling. The devices were fabricated using standard 193 nm CMOS photolithography techniques on the SOI platform with a 220-nm-thick silicon device layer through IMEC’s multi-project-wafer foundry service.

Experimental measurements

For experimental characterization of deep photonic networks, our measurement procedures include standard steps to remove any losses due to on- and off-chip coupling of optical signals through grating couplers such as reflections⁷⁰ or potential mismatches between fiber, grating, or waveguide modes⁷¹. Our reported insertion losses refer to only the additional losses through the photonic networks themselves, after these grating coupler losses have been removed. The coupling losses have been measured at four separate fiber zenith angles between 8^∘ and 14^∘, using grating coupler test structures on the same chip as the measured deep photonic networks, and then combined together in order to accurately characterize as wide a measurement bandwidth as possible. All measurements have been performed using a continuous-wave tunable laser source (Santec TSL-710), an optical power meter (Santec MPM-210), and a polarization controller. The tunable source was operated using a wavelength sweep from 1480 nm to 1600 nm with a sampling rate of 40 ps to obtain the transmission characteristics of the measured structures. The spectral oscillations in our experimental measurements indicate presence of well-known Fabry-Perot interference due to reflections at fiber-to-chip interfaces⁷². These reflections are an inherent result of characterizing the devices on their own, with grating couplers directly connected to the inputs and outputs of our deep photonic networks. As parts of a larger photonic system, the networks can be directly connected by waveguides to other upstream and downstream on-chip devices, eliminating potential reflections at the grating interfaces and any associated spectral oscillations.

Data availability

The data that support the findings within this manuscript are available from the corresponding author upon request.

References

Chrostowski, L. & Hochberg, M. Silicon photonics design: from devices to systems. (Cambridge University Press, 2015).
Bogaerts, W. & Chrostowski, L. Silicon photonics circuit design: methods, tools and challenges. Laser Photonics Rev. 12, 1700237 (2018).
Article ADS Google Scholar
Zhuang, L., Roeloffzen, C. G., Hoekman, M., Boller, K.-J. & Lowery, A. J. Programmable photonic signal processor chip for radiofrequency applications. Optica 2, 854–859 (2015).
Article ADS Google Scholar
Hu, T. et al. Silicon photonic platforms for mid-infrared applications. Photonics Res. 5, 417–430 (2017).
Article CAS Google Scholar
Poulton, C. V. et al. Long-range lidar and free-space data communication with high-performance optical phased arrays. IEEE J. Sel. Top. Quantum Electron. 25, 1–8 (2019).
Article Google Scholar
Zhang, W. & Yao, J. Photonic integrated fieldprogrammable disk array signal processor. Nat. Commun. 11, 1–9 (2020).
ADS Google Scholar
Pérez, D. et al. Multipurpose silicon photonics signal processor core. Nat. Commun. 8, 1–9 (2017).
ADS Google Scholar
Carolan, J. et al. Universal linear optics. Science 349, 711–716 (2015).
Article MathSciNet CAS PubMed Google Scholar
Piggott, A. Y. et al. Inverse design and demonstration of a compact and broadband on-chip wavelength demultiplexer. Nat. Photonics 9, 374–377 (2015).
Article CAS ADS Google Scholar
Lu, J. & Vuckovic, J. Nanophotonic computational design. Opt. express 21, 13351–13367 (2013).
Article PubMed ADS Google Scholar
Qu, Y. et al. Inverse design of an integrated nanophotonics optical neural network. Sci. Bull. 65, 1177–1183 (2020).
Article Google Scholar
Tahersima, M. H. et al. Deep neural network inverse design of integrated photonic power splitters. Sci. Rep. 9, 1–9 (2019).
Article CAS Google Scholar
Molesky, S. et al. Inverse design in nanophotonics. Nat. Photonics 12, 659–670 (2018).
Article CAS ADS Google Scholar
Zhang, G., Xu, D.-X., Grinberg, Y. & LiboironLadouceur, O. Experimental demonstration of robust nanophotonic devices optimized by topological inverse design with energy constraint. Photonics Res. 10, 1787–1802 (2022).
Article Google Scholar
Piggott, A. Y. et al. Inverse-designed photonics for semiconductor foundries. ACS Photonics 7, 569–575 (2020).
Article CAS Google Scholar
Shen, B., Wang, P., Polson, R. & Menon, R. An integrated-nanophotonics polarization beamsplitter with 2.4 × 2.4 μm2 footprint. Nat. Photonics 9, 378–382 (2015).
Article CAS ADS Google Scholar
Jia, H., Zhou, T., Fu, X., Ding, J. & Yang, L. Inversedesign and demonstration of ultracompact silicon metastructure mode exchange device. Acs Photonics 5, 1833- 1838 (2018).
Article Google Scholar
Bogaerts, W. et al. Programmable photonic circuits. Nature 586, 207–216 (2020).
Article CAS PubMed ADS Google Scholar
Xu, X. et al. Self-calibrating programmable photonic integrated circuits. Nat. Photonics 16, 595–602 (2022).
Article CAS ADS Google Scholar
Pérez-López, D., López, A., DasMahapatra, P. & Capmany, J. Multipurpose self-configuration of programmable photonic circuits. Nat. Commun. 11, 1–11 (2020).
Article Google Scholar
Capmany, J. & Pérez, D., Programmable integrated photonics (Oxford University Press, 2020).
Marpaung, D., Yao, J. & Capmany, J. Integrated microwave photonics. Nat. photonics 13, 80–90 (2019).
Article CAS ADS Google Scholar
Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. photonics 11, 441–446 (2017).
Article CAS ADS Google Scholar
Ashtiani, F., Geers, A. J. & Aflatouni, F. An on-chip photonic deep neural network for image classification. Nature 606, 501–506 (2022).
Article CAS PubMed ADS Google Scholar
Shastri, B. J. et al. Photonics for artificial intelligence and neuromorphic computing. Nat. Photonics 15, 102–114 (2021).
Article CAS ADS Google Scholar
Feldmann, J. et al. Parallel convolutional processing using an integrated photonic tensor core. Nature 589, 52–58 (2021).
Article PubMed Google Scholar
Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018).
Article MathSciNet CAS PubMed ADS Google Scholar
Harris, N. C. et al. Large-scale quantum photonic circuits in silicon. Nanophotonics 5, 456–468 (2016).
Article CAS Google Scholar
Reck, M., Zeilinger, A., Bernstein, H. J. & Bertani, P. Experimental realization of any discrete unitary operator. Phys. Rev. Lett. 73, 58 (1994).
Article CAS PubMed ADS Google Scholar
Miller, D. A. Self-configuring universal linear optical component. Photonics Res. 1, 1–15 (2013).
Article ADS Google Scholar
Miller, D. A. Perfect optics with imperfect components. Optica 2, 747–750 (2015).
Article ADS Google Scholar
Bradbury, J. et al. JAX: composable transformations of Python+NumPy programs, http://github.com/google/jax (2018).
Vit, A., Gorgulu, K., Amiri, A. & Magden, E. S., Silicon photonics toolkit. Preprint at Optica Open: https://doi.org/10.1364/opticaopen.23098334.v1 (2023).
Kingma, D. P. & Ba, J., Adam: A method for stochastic optimization. Preprint at arXiv preprint arXiv: 1412.6980 (2014).
Tang, R., Tanomura, R., Tanemura, T. & Nakano, Y. Ten-port unitary optical processor on a silicon photonic chip. ACS Photonics 8, 2074–2080 (2021).
Article CAS Google Scholar
Annoni, A. et al. Unscrambling light-automatically undoing strong mixing between modes. Light.: Sci. Appl. 6, e17110–e17110 (2017).
Article CAS PubMed Google Scholar
Torrijos-Morán, L., Pérez-Galacho, D. & Pérez-López, D. Silicon programmable photonic circuits based on periodic bimodal waveguides. Laser Photonics Rev. 18, 2300505 (2023).
Article ADS Google Scholar
Ribeiro, A., Ruocco, A., Vanacker, L. & Bogaerts, W. Demonstration of a 4 × 4-port universal linear circuit. Optica 3, 1348–1357 (2016).
Article CAS ADS Google Scholar
Harris, N. C. et al. Quantum transport simulations in a programmable nanophotonic processor. Nat. Photonics 11, 447–452 (2017).
Article CAS ADS Google Scholar
Kim, J. et al. Experimental demonstration of inversedesigned silicon integrated photonic power splitters. Nanophotonics 11, 4581–4590 (2022).
Article CAS Google Scholar
Papadovasilakis, M. et al. Fabrication tolerant and wavelength independent arbitrary power splitters on a monolithic silicon photonics platform. Opt. Express 30, 33780–33791 (2022).
Article CAS PubMed ADS Google Scholar
Yao, R. et al. Compact and low-insertion-loss 1 × n power splitter in silicon photonics. J. Lightwave Technol. 39, 6253–6259 (2021).
Article CAS ADS Google Scholar
Shiran, H. & Liboiron-Ladouceur, O. et al. Dual-mode broadband compact 2 × 2 optical power splitter using sub-wavelength metamaterial structures. Opt. Express 29, 23864–23876 (2021).
Article CAS PubMed ADS Google Scholar
Lin, Z. & Shi, W. Broadband, low-loss silicon photonic y-junction with an arbitrary power splitting ratio. Opt. express 27, 14338–14343 (2019).
Article CAS PubMed ADS Google Scholar
Chang, W. et al. Inverse design and demonstration of an ultracompact broadband dual-mode 3 db power splitter. Opt. Express 26, 24135–24144 (2018).
Article CAS PubMed ADS Google Scholar
Chen, G. F. et al. Broadband silicon-on-insulator directional couplers using a combination of straight and curved waveguide sections. Sci. Rep. 7, 1–8 (2017).
Google Scholar
Wang, Y., Gao, S., Wang, K. & Skafidas, E. Ultrabroadband and low-loss 3 db optical power splitter based on adiabatic tapered silicon waveguides. Opt. Lett. 41, 2053–2056 (2016).
Article CAS PubMed ADS Google Scholar
Lu, Z. et al. Broadband silicon photonic directional coupler using asymmetric-waveguide based phase control. Opt. express 23, 3795–3808 (2015).
Article CAS PubMed ADS Google Scholar
Yun, H. et al. Broadband 2 × 2 adiabatic 3 db coupler using silicon-on-insulator sub-wavelength grating waveguides. Opt. Lett. 41, 3041–3044 (2016).
Article CAS PubMed ADS Google Scholar
Sun, C., Zhao, J., Wang, Z., Du, L. & Huang, W. Broadband and high uniformity y junction optical beam splitter with multimode tapered branch. Optik 180, 866–872 (2019).
Article CAS ADS Google Scholar
Xu, X.-B. et al. Flat-top optical filter via the adiabatic evolution of light in an asymmetric coupler. Phys. Rev. A 100, 023809 (2019).
Article CAS ADS Google Scholar
Magden, E. S. et al. Transmissive silicon photonic dichroic filters with spectrally selective waveguides. Nat. Commun. 9, 3009 (2018).
Article PubMed PubMed Central ADS Google Scholar
Dai, T. et al. Bandwidth and wavelength tunable optical passband filter based on silicon multiple microring resonators. Opt. Lett. 41, 4807–4810 (2016).
Article CAS PubMed ADS Google Scholar
Orlandi, P. et al. Reconfigurable silicon filter with continuous bandwidth tunability. Opt. Lett. 37, 3669–3671 (2012).
Article PubMed ADS Google Scholar
Enright, R. et al. A vision for thermally integrated photonics systems. Bell Labs Tech. J. 19, 31–45 (2014).
Article Google Scholar
Masood, A. et al. Comparison of heater architectures for thermal control of silicon photonic circuits. In 10th International Conference on Group IV Photonics, 83-84 (IEEE, 2013).
Melloni, A. Synthesis of a parallel-coupled ring-resonator filter. Opt. Lett. 26, 917–919 (2001).
Article CAS PubMed ADS Google Scholar
Matsuo, M., Yabuki, H. & Makimoto, M. Dual-mode stepped-impedance ring resonator for bandpass filter applications. IEEE Trans. Microw. Theory Tech. 49, 1235–1240 (2001).
Article ADS Google Scholar
Luo, S., Zhu, L. & Sun, S. A dual-band ring-resonator bandpass filter based on two pairs of degenerate modes. IEEE Trans. Microw. Theory Tech. 58, 3427–3432 (2010).
ADS Google Scholar
Tao, S. et al. Cascade wide-angle y-junction 1 × 16 optical power splitter based on silicon wire waveguides on siliconon-insulator. Opt. express 16, 21456–21461 (2008).
Article CAS PubMed ADS Google Scholar
Ozcan, C., Mojahedi, M. & Aitchison, J. S. Short, broadband, and polarization-insensitive adiabatic y-junction power splitters. Opt. Lett. 48, 4901–4904 (2023).
Article CAS PubMed ADS Google Scholar
Yang, N. & Xiao, J. A compact silicon-based polarization-independent power splitter using a threeguide directional coupler with subwavelength gratings. Opt. Commun. 459, 125095 (2020).
Article CAS Google Scholar
Wiecha, P. R., Arbouet, A., Girard, C. & Muskens, O. L. Deep learning in nano-photonics: inverse design and beyond. Photonics Res. 9, B182–B200 (2021).
Article Google Scholar
Zhou, Z. et al. Prospects and applications of on-chip lasers. Elight 3, 1–25 (2023).
Article PubMed PubMed Central Google Scholar
Li, N. et al. Monolithically integrated erbium-doped tunable laser on a cmos-compatible silicon photonics platform. Opt. Express 26, 16200–16211 (2018).
Article CAS PubMed ADS Google Scholar
Liu, K., Ye, C. R., Khan, S. & Sorger, V. J. Review and perspective on ultrafast wavelength-size electrooptic modulators. Laser Photonics Rev. 9, 172–194 (2015).
Article ADS Google Scholar
Pérez, D. & Capmany, J. Scalable analysis for arbitrary photonic integrated waveguide meshes. Optica 6, 19–27 (2019).
Article ADS Google Scholar
Trax: an end-to-end library for deep learning that focuses on clear code and speed, https://github.com/google/trax (2020).
Gabrielli, L. H. Gdstk (GDSII Tool Kit) a C++ library for creation and manipulation of GDSII and OASIS files, https://github.com/heitzmann/gdstk (2020).
Li, Y. et al. Compact grating couplers on silicon-oninsulator with reduced backreflection. Opt. Lett. 37, 4356–4358 (2012).
Article PubMed ADS Google Scholar
Taillaert, D. et al. Grating couplers for coupling between optical fibers and nanophotonic waveguides. Japanese. J. Appl. Phys. 45, 6071 (2006).
Article CAS Google Scholar
Wang, Y. et al. Focusing sub-wavelength grating couplers with low back reflections for rapid prototyping of silicon photonic circuits. Opt. express 22, 20652–20662 (2014).
Article PubMed ADS Google Scholar

Download references

Acknowledgements

This work was supported by the Marie Sklodowska Curie Fellowship (number 101032147) through the Horizon 2020 program of the European Commission, and by The Scientific and Technological Research Council of Turkey (grant number 119E195), both awarded to E.S.M.

Author information

Authors and Affiliations

Department of Electrical and Electronics Engineering, Koç University, Sariyer, Istanbul, 34450, Turkey
Ali Najjar Amiri, Aycan Deniz Vit, Kazim Gorgulu & Emir Salih Magden

Authors

Ali Najjar Amiri
View author publications
You can also search for this author in PubMed Google Scholar
Aycan Deniz Vit
View author publications
You can also search for this author in PubMed Google Scholar
Kazim Gorgulu
View author publications
You can also search for this author in PubMed Google Scholar
Emir Salih Magden
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.S.M. conceived the idea of deep photonic networks. A.D.V. created the design framework with simulation, optimization, and layout capabilities. A.N.A. and K.G. developed and revised separate modules of the design framework. A.N.A. designed and simulated the individual devices. K.G. finalized the mask layout for fabrication. A.N.A. performed the experimental characterization of the devices; and K.G. assisted the setup and experiments. E.S.M. supervised and coordinated the research. A.N.A. and E.S.M. wrote the manuscript with contributions from all co-authors.

Corresponding author

Correspondence to Emir Salih Magden.

Ethics declarations

Competing interests

A.N.A., A.D.V., K.G., and E.S.M. have filed a patent application in Turkey (2023/012306), and are in the process of filing a worldwide patent application for the photonic design framework as described in this work.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers and Alfredo de Rossi for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Najjar Amiri, A., Vit, A.D., Gorgulu, K. et al. Deep photonic network platform enabling arbitrary and broadband optical functionality. Nat Commun 15, 1432 (2024). https://doi.org/10.1038/s41467-024-45846-3

Download citation

Received: 26 May 2023
Accepted: 03 February 2024
Published: 16 February 2024
DOI: https://doi.org/10.1038/s41467-024-45846-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.