Low-Loss Photonic Reservoir Computing with Multimode Photonic Integrated Circuits

Katumba, Andrew; Heyvaert, Jelle; Schneider, Bendix; Uvin, Sarah; Dambre, Joni; Bienstman, Peter

doi:10.1038/s41598-018-21011-x

Download PDF

Article
Open access
Published: 08 February 2018

Low-Loss Photonic Reservoir Computing with Multimode Photonic Integrated Circuits

Andrew Katumba ORCID: orcid.org/0000-0001-7699-8426¹,
Jelle Heyvaert¹,
Bendix Schneider¹,
Sarah Uvin¹,
Joni Dambre² &
…
Peter Bienstman¹

Scientific Reports volume 8, Article number: 2653 (2018) Cite this article

4677 Accesses
39 Citations
14 Altmetric
Metrics details

Subjects

Abstract

We present a numerical study of a passive integrated photonics reservoir computing platform based on multimodal Y-junctions. We propose a novel design of this junction where the level of adiabaticity is carefully tailored to capture the radiation loss in higher-order modes, while at the same time providing additional mode mixing that increases the richness of the reservoir dynamics. With this design, we report an overall average combination efficiency of 61% compared to the standard 50% for the single-mode case. We demonstrate that with this design, much more power is able to reach the distant nodes of the reservoir, leading to increased scaling prospects. We use the example of a header recognition task to confirm that such a reservoir can be used for bit-level processing tasks. The design itself is CMOS-compatible and can be fabricated through the known standard fabrication procedures.

Reservoir computing based on a silicon microring and time multiplexing for binary and analog operations

Article Open access 02 August 2021

Massimo Borghi, Stefano Biasi & Lorenzo Pavesi

Physical reservoir computing with emerging electronics

Article 12 March 2024

Xiangpeng Liang, Jianshi Tang, … Huaqiang Wu

Photonic reservoir computing based on nonlinear wave dynamics at microscale

Article Open access 13 December 2019

Satoshi Sunada & Atsushi Uchida

Introduction

The surge in the volumes of data generated and consumed by devices has pushed the bounds on the scalability and speed of signal processing and computational systems. The consequence is a re-ignition of research into non-conventional information processing techniques that deviate from the well-established von-Neumann approach.

Analog computing has been proposed as a promising implementation in this regard in a number of alternative platforms. Analog computing, unlike conventional computing, relies on the information processing capabilities of certain physical systems. The processing usually hinges on the evolution of the internal state of the dynamical behavior of the physical system in response to an appropriately encoded input.

Analog information processing platforms are typically coupled with an adaptation and readout system to guide the evolution of the system state (or a subset of it) towards a solution whose accuracy metric is usually known beforehand. This is then followed by a readout that transforms the observed physical signal of interest into a form that is suitable for interpretation by digital computers. A prevalent way of implementing the adaptation of the computational system’s internal states is through machine learning.

Machine learning encompasses a set of techniques to teach computer systems how to perform complex tasks on previously unseen data, without explicitly programming them. Examples of tasks that are suitable for some form of machine learning are classification, regression or pattern recognition. The collection of machine learning techniques is indeed extensive and for every application the most appropriate technique has to be selected, depending on the application’s specific demands.

One important class of techniques are the so-called artificial neural networks (ANNs), that consist of a network of interconnected computational units, dubbed neurons. The layout and operation of the ANN is inspired by the structure and information processing mechanism of the human brain. Recurrent neural networks (RNNs), a subtype of neural networks, introduce memory into the network by creating directed interconnection cycles between neurons in order to tackle tasks with temporal extent.

Reservoir computing (RC)^1,2,3 was proposed as a methodology to ease the training of these recurrent networks, which was typically rather challenging. More recently, however, it has gained popularity as a computational paradigm to solve a variety of complex problems. It has been shown that RC performs very well on e.g. speech recognition and time series prediction. Reservoir computing initially emerged as a software-only technique and merely presented another algorithmic way of processing temporal data on digital computers. However, it has evolved into much more over the past decade. The RC system consists of three basic parts: the input layer which couples the input signal into a non-linear dynamical system, the “reservoir” (i.e. the recurrent neural network, which is kept untrained) and finally the output layer that typically linearly combines the states of the reservoir to provide the time-dependent output signal. An illustration of the reservoir computing system as is applied in this work is given in Fig. 1. To use the reservoir to solve a particular task, a machine learning algorithm is used to train a set of weights (the readout) using a set of known labeled example data, such that a linear combination of the optical signals recorded at each node approximates a desired output as closely as possible. These weights are then used to generate the output signal for any unseen subsequently injected input signal sequences. RC systems are fast to train and quickly converge to a global optimum. They have shown state-of-the-art performance on a range of complex tasks on time-dependent data (such as speech recognition, nonlinear channel equalisation, robot control, time series prediction, financial forecasting, handwriting recognition, etc.).

A key discovery was that the reservoir computing platform provides a natural framework for implementing hardware-based learning systems for which there may be only a limited ability to granularly influence the internal state of the dynamical system (reservoir). Examples of RC implementations in mechanical systems, memristive systems, atomic switch networks, boolean logic elements and photonic systems can be found in^4,5,6,7,8.

To date, experimental demonstrations of photonic reservoirs routinely achieve state-of-the-art performance on various information processing tasks. Implementations based on a single nonlinear node with a delayed feedback architecture have proven that photonic RC is competitive for analog information processing^{9,10,11,12,13,14,15,16,17}. Moreover, integrated photonic reservoirs can push computation speeds even higher for digital information processing. The performance of integrated photonic reservoirs has been studied numerically for networks of ring resonators^{18,19,20,21,22}, networks of SOAs⁷, and experimentally with networks of delay lines and splitters²³. Integrated photonic reservoirs are particularly compelling, especially when implemented in the CMOS platform as they can take advantage of its associated benefits for technology reuse and mass production.

However, while possessing numerous benefits, passive integrated photonic reservoirs are plagued by a number of issues, key among which is loss accumulation. Silicon photonics reservoirs are composed of nodes that are interconnected together in a planar topology such as that in Fig. 1. Since the interconnections between nodes are made up of spirals of a few centimeters, the material loss – ≈2 dB/cm for single-mode 220 nm Si waveguides – is important. An equally significant source of signal loss is the loss at combiner points. Indeed, based on supermode theory, combining single-mode waveguides in a Y-junction only has 100% transmission if the two inputs are exactly in phase. For anti-phase inputs, the transmission is 0%. Therefore, averaged over all possible phase differences, there is only a 50% transmission for each combiner traversed. An alternative way of expressing the same fact is by saying that if we only excite a single input of the combiner, we will have 50% transmission (Later on we will use this single-side excitation as a quicker way to model the average transmission for different phase differences). The 50% loss can quickly reach substantial values for large reservoirs with a lot of combiners. Using directional couplers instead of Y-junctions would in theory solve these issues, but they suffer from stringent fabrication tolerance requirements and narrow bandwidth. Both this combination loss and the propagation loss constrain the size of the reservoirs and hence limit the complexity of tasks they can tackle.

In this work we present an efficient passive photonic reservoir computing system that lowers the combination loss. In so doing, the system allows for upscaling the number of nodes in the design as loss build-up can be limited. Even a relatively modest improvement in trans-nodal transmission will yield a substantial overall gain, since splitting and combining of signals occurs a multitude of times before the signal is read out. This improvement hinges on using broader waveguides that hence support multiple modes.

One way to take advantage of multiple modes in reservoirs is by considering them as separate channels of computation and reading them out separately through a demultiplexing structure such as a cascade of asymmetrical directional couplers²⁴. The total number of modes supported by the network represents the factor of increase of the number of observables in the system and is an indicator for the performance improvement. A multimode reservoir also directly implies richer dynamics due to multiple mixing avenues that could happen between the modes. However, this approach is not the focus of the current work but will be investigated in a follow-up study.

A second way to take advantage of the multimodal character is the one that we pursue here, where we will focus on loss reduction. A critical component in this design is a novel multimode Y-junction structure that is used at the combiner/splitter points. The junction uses a taper section that is deliberately designed to be not perfectly adiabatic, ultimately resulting in energy efficiency benefits. Note that multimodal photonics is something which is typically not considered for other applications, because it leads to several complications (modal dispersion, more complex design, difficulty to selectively excite and maintain a select number of modes throughout the whole circuit, …). However, in the context of the RC computing paradigm, none of these are of any consequence.

The key advantage from using the multimode Y-junction comes from the fact that a portion of the light that was previously scattered into radiation modes of the single-mode structure can now be captured into the higher-order guided modes. Indeed, in a case where e.g. light is only sent into one of the two input arms of a single-mode Y-combiner, when entering the output waveguide 50% of the light radiates away, as mentioned previously. However, if the output waveguide of the combiner supports multiple modes, this transmission increases to 100% for a perfectly adiabatic taper. Still, since part of this combined light is now carried by a higher-order mode as opposed to by the fundamental mode, it is still a fact that this higher-order light will radiate away at the next combiner. This is where the next part of the design comes into play: by deliberately designing the Y-junction to be non-adiabatic to a certain extent, we can hope to get a degree of conversion from these higher-order modes back to the lower-order modes which can propagate unhindered. Still, this beneficial process competes with the harmful reverse process of conversion from lower-order to higher-order modes. Therefore, it is not a-priori obvious that there will be design that offers a bigger than 50% transmission averaged over different modal compositions of input excitation. The main contribution of this paper is to present exactly such a design.

Results

Design of a multimode Y-junction with low combiner loss

Waveguide cutoff conditions

To effectively design multimode waveguides and components, we must first determine the cutoff conditions for different modes. This has to be done numerically, as the well-established closed form 1D solution cannot easily be extended to practical 2D cases. Lumerical^© Mode Solutions, a commercial mode solver package, was used to determine the modes of the waveguides. The fully vectorial FFM solver in Fimmwave^©, which is based on the Film Mode Matching Method, was used to check the results for consistency.

For this work we assume oxide-clad 220 nm silicon at 1300 nm on the SOI material platform. From here on, we will only focus on the TE-like modes but a similar argument applies to TM-like modes (we also confirmed that there was negligible interaction between the two mode groups). From the mode simulations we obtained the cut-off points for each mode and can therefore choose what width to use in order for the waveguide to support a given number of modes (Fig. 2).

Relevant geometrical parameters of Y-junction

The Y-junction design for the simulations was composed of three sections as seen in Fig. 3. The input waveguide (section 1) of width w₁ is followed by a linear taper of length t (section 2) leading into a wider waveguide with width w₂. The third section starts off right after the taper and terminates into a split into the two output waveguides. The output waveguides (also referred to as arms) are all the same width w₁ as the input waveguide of section 1. ϕ is the angle between the two output waveguides and is determined by the bend radius of the output waveguides. The smaller the bend radius, the larger the value of ϕ and vice versa.

We shall refer to the splitter configuration as the case when the input is on the left side of Fig. 3 in section 1. For the combiner configuration, the junction is operated in the reverse sense.

In contrast to the single objective criterion of maximizing adiabaticity for the single-mode Y-junction, designing a low loss multi-mode Y-junction combiner is a more intricate procedure, as the interactions and evolution of the modes through the junction drive the choice of the best geometrical parameters. Since sweeping all the geometric parameters would be very time-consuming, the simulations proceeded as follows. For all designs considered, w₂ = 2w₁, and the bend radius for the two arms is set to 5 μm. Subsequently, the taper length t was fixed and the width of the input and output waveguides w₁ was scanned in the range 600 nm–1200 nm. Then, the taper length t was optimized to achieve maximum transmission for the Y-junction with the selected width.

Waveguide Width Optimisation

A first relevant parameter is the input and output waveguide width w₁ for the Y-junction, for which we will compare four values, inspired by different regimes in the dispersion diagram of Fig. 2. At 600 nm width, the first two modes (i.e. fundamental and first-order mode) are strongly guided. A third mode exists, but is guided less strongly. At a width of 800 nm, the first three modes are well-guided, but now a fourth mode (i.e. third-order mode with effective index of 1.46 with a cladding index of 1.4469) is weakly guided. The situation for a 1000 nm wide waveguide has evolved further towards four well-guided modes and a fifth (i.e. the fourth-order mode) that is barely guided with an effective index 1.4495. The last design, with a width of 1200 nm, guides this fourth-order mode better and does not support the fifth-order mode yet.

The transmission was then simulated for these Y-junction designs through propagation simulations. For this purpose Lumerical^© FDTD solutions, a commercial FDTD software package, was used.

If we consider the transmission results from the simulations for the combiner configuration with input to the upper arm only as presented in Table 1, the gains of moving to wider waveguides are evident. Compared to the single-mode fully adiabatic combiner, we already have a small improvement in the transmission from 50% to at least 53.18% occurring in the 1000 nm–1200 nm waveguide width region.

Table 1 Transmission for Y-junction combiners of different input and output waveguide widths w₁ when the input is a particular modal excitation.

Full size table

Each column in the table represents transmission values for excitation with a given input mode to a single arm only. The final column is an average transmission over all the previous columns for guided modes. Note that in a real reservoir, the modal composition at the input will take many different forms, and therefore this average transmission is used as a straightforward way to derive an approximate but relevant single figure-of-merit for the device, without having to deal with fully coherent multimodal simulations of an entire RC network. From the results, it is evident that for wider waveguides and junctions, the losses are substantially smaller than for smaller waveguides and junctions by virtue of the power ending up in higher order guided modes rather than being radiated away. As mentioned earlier, for the waveguide widths considered, the highest transmission is obtained in the 1000 nm–1200 nm range. One can argue that maximum transmission occurs in this region because it constitutes the widths at which power conversion from higher to lower order modes is most efficient for the given taper length t = 0.1 μm (t is chosen adhoc for this section). However, to reach the maximum of transmission for both the waveguide widths w₁ and taper lengths t, we need to probe the influence of the taper length t on the transmission. This will be the focus of the Taper Optimization section below.

Modal power distribution

Before optimizing the taper length t, we first make a detour to check the output modal power composition of the multimode Y-junction with respect to input excitation of a particular mode. We focus on the case of a 1 μm wide Y-junction with a 0.1 μm taper. Table 2 represents the transmission from the upper arm into the input in the combiner configuration, while Table 3 shows the transmission from the input into upper arm in the splitter configuration.

Table 2 Modal decomposition of Y-junction combiner with 1 μm wide waveguides and 0.1 μm long taper for single excitation from upper arm.

Full size table

Table 3 Modal decomposition of Y-junction splitter with 1 μm wide waveguides and a 0.1 μm long taper.

Full size table

First, we observe that indeed several modes participate in the guiding of power through the junction, and that the distribution of power over the different modes is very different for different input configurations. We can also see some modal conversion taking place from input in higher-order modes to output in lower-order, better guided modes.

Second, the results allow us to verify the reciprocity of the device (to within simulation tolerances). For example, the transmission from the fundamental mode to the first-order mode in the combiner configuration in Table 2 is 41.94%, whereas the transmission from the first-order mode to the fundamental mode in the splitter configuration is 41.90% in Table 3.

Taper Length Optimisation

In the Waveguide Width Selection section above, we demonstrated that by going multi-mode we can gain a small improvement in the transmission of the Y-junction combiner. However, to continue to boost the transmission of the multimode Y-junction, and especially when averaged over many possible input modal excitations, we need to further tailor the adiabaticity of the evolution of the input modeset by making appropriate changes to the geometry. We chose to do this by altering the taper making up section 2 of the design. The simulations for the choice of the waveguide width were done with a taper length t of 0.1 μm, we now select the w₁ = 1.0 μm design, which we found to be in the region of highest transmission in the first phase of our simulations, and track the transmission for taper lengths t = 0.1 μm, 1 μm, 2 μm and 2.5 μm.

The resulting transmission values for the multimode Y-junction combiner are plotted in Fig. 4 against taper length t for the case of excitation in the upper arm only. As input excitations, we use the fundamental, 1^st, 2^nd and 3^rd order mode. Each solid line corresponds to the denoted input mode and at the output a sum of the transmission across all output modes is plotted as the total transmission. The average transmission across all input modes per taper length t is also plotted and is the same figure-of-merit as used in the last column of Table 1. As anticipated, the optimization has resulted in higher transmission values as can be seen by comparing with the results of the t = 0.1 μm case (which was used in a previous section to determine the best waveguide width for the multimode Y-junction) to the longer taper lengths.

As we alluded to earlier, there are two conflicting trends at work which result in an optimum average transmission occurring at taper lengths around 2.0 μm. On one hand, increasing the taper length increases the adiabaticity which decreases the scattering losses for the guided supermodes and increases the radiation loss of the unguided supermodes. (Note that also for this multimode junction, in the limit of a perfectly adiabatic taper we expect 50% average transmission: 100% transmission for modes 0 and 1, and 0% transmission for modes 2 and 3, which will couple to unguided supermodes). On the other hand, having a certain degree of non-adiabaticity is beneficial to convert some of the higher-order modes into better guided lower-order modes, as shown earlier. This is the mechanism that allows us to get above a 50% average transmission. When looking at the transmission of e.g. modes 2 and 3 as a function of taper length, there is no clear trend, indicating a complicated modal mixing and conversion taking place.

At this point, although we could further optimize the design, simulation results already show an average transmission of 61% for the combiner, which is much better than the standard 50% average loss in single-mode junctions.

As this is the design we will be using in the reservoir, we carry out further simulations to check its performance when operated in the splitter configuration and obtain 42% average transmission per output arm. While the loss in the splitter is higher than the 50% of the fully adiabatic single mode case, we will confirm later that the improvement in the transmission of the combiner will yield net gains in the power efficiency of the reservoir.

Wavelength dependence

We additionally evaluated the impact the wavelength of operation could have on the performance of a passive reservoir system that uses this multimode Y-junction design. Figure 5 tracks the transmission to the fundamental mode of the output Y-junction combiner from input excitation with the fundamental mode for various wavelengths, in a 40 nm bandwidth around the target wavelength of 1300 nm. Results are given here for the Y-junction of width w₁ set to 1 μm and taper length t set to 2 μm. It can be seen that the curve is essentially flat and we can therefore conclude that variations in the wavelength will not impact the performance of the reservoir. Similarly flat curves were obtained for other input-output mode pairs.

Performance of the improved Y-junctions in photonic reservoirs

We focused our numerical simulations on a model of a 16-node passive integrated photonics reservoir with the nodes arranged in a swirl topology¹⁸ (see Fig. 1). This architecture adheres to the planarity constraints of the CMOS Silicon Photonics platform while simultaneously allowing for sufficient mixing of the input signals.

As mentioned, in our design of the multimode Y-junction, we used as a figure-of-merit the average transmission across all considered excitations with different modes. Therefore, we need to setup our simulations in a way that matches with this scenario. Specifically, at all points where a combiner is needed we use 61% transmission, and similarly for all splitting locations we use splitters with 42% efficiency. Additionally, as we are taking average powers across all modes, our reservoir simulations will not be coherent (as is the case in most of our previous works) but will rather calculate the time evolution of the intensity of the input signals.

The reservoir was tasked to solve the 3 bit header recognition task. The specific details of the task encoding have been discussed in our previous work¹⁸.

Error rates for different data rates

We evaluated and compared the error rates of the reservoir on the 3 bit header recognition task for multiple data rates for single-mode and multi-mode reservoirs. The input data stream is an NRZ OOK modulated signal with an oversampling factor of 24 and the maximum considered data rate is 32 Gbps.

An total input power of 100 mW was used and for each data rate errors were obtained for 10 different random initialisations of the input bit stream and input weights.

Figures 6 and 7 show the results from the error rate vs reservoir inter-delay (a normalization of the data rate to the interconnection delay between 2 adjacent nodes) experiments for the single-mode and the multimode reservoir cases respectively, for the case of input to node 0. For the most part there is no significant difference in performance when going from single to multi-mode reservoirs, which means there is no performance hit associated to moving to multimodal Y-junctions. In both cases we have regions of performance below the Soft Decision Forward Error Correction (SD-FEC) limit (corresponding to a BER of 2 × 10⁻²) which means we are able to reach error-free performance by applying FEC codes. To see the real benefit of this work, however, we need to look at energy efficiency.

Energy efficiency considerations

The key goal of this work is to demonstrate that we can achieve energy efficiency benefits by replacing a single-mode reservoir with a multi-mode version. First, if we track the error rate for the single-mode and multi-mode reservoirs on the 3 bit header recognition task against the input signal to noise ratio as plotted in Fig. 8, we observe that errors for the multi-mode reservoir are always lower than those of the single-mode case. The trend also tends to show higher divergence for higher signal to noise ratios. While at the level of the SD-FEC limit the difference seems small, it should be noted that this diverging trend will persist as we characterize for even lower BERs.

Next, we compare the overall loss of a single-mode reservoir with that of a multimode reservoir. Figure 9 indicates the per-node power ratios for the 16 node passive reservoir for the multi-mode versus single-mode case for the case of input to node 0. This power ratio is obtained by dividing the average power of the states at each node in the multimode reservoir by the power of the same node in the single-mode reservoir and is shown for the 3 bit header recognition as we considered for the performance evaluation. Because node 0 is used as the input in both cases, its power did not change and consequently its power ratio is equal to one. All other nodes have a power ratio above the horizontal red line that indicates where the ratio is equal to one. This means that more power is measured at all those nodes in the multimode reservoir. We observe that we can get up to 20% more power in nodes such as 4, 8 and 12 (they also happen to be the furthest from the input node in terms of optical path length).

We carried out one more experiment to further investigate how the improvement in the component energy efficiency affects the power distribution in the reservoir as we move to larger reservoirs. We re-simulated the 3 bit header recognition task but this time with a 6 × 6 (36 node) swirl reservoir. We observe, as shown in Fig. 10, that we now have nodes that have now more than 30% improvement in their power level (see e.g. nodes 6, 12, …). One can expect for the gains to increase even further as we move to larger networks. The additional power boost obtained for multimode reservoirs could be the difference between being below or above the noise floor and therefore could have significant impact on the performance and scalability of a real-world photonic reservoir computing setup.

Discussion

Numerical simulations have shown that we can improve the overall energy efficiency of silicon photonics integrated circuit reservoirs by replacing the typical single-mode components and waveguides with multimode versions. We used FDTD electromagnetic simulations to show that we can design a low-loss multimode Y-junction with widths ranging from 500 nm onwards but with a preference for 1000 nm to keep the circuit size down while maintaining the advantageous aspects of mode evolution at such a junction. We found that the best designs can be implemented with taper lengths around 2.0 μm at an optimum level of adiabaticity. We showed that the best design can yield a combiner with an average transmission of 61% as opposed to the 50% in the best case single-mode junction combiner. The same device operating as a splitter gives 42% transmission, less than the 50% of the single-mode case, however when the device is used in a reservoir, the improvements in the combiner transmission make up for this reduction in the splitter performance. Further parameter optimization can be used to get even better improvements on the loss.

We have demonstrated that constructing a reservoir based on the average transmission characteristics for our optimal Y-junction combiner/splitter design yields the same task performance but gains in energy efficiency, when computing with intensities of the input signal. For single-node input to the reservoir, energy at some nodes could be enhanced up to as much as 30% in the case of a 36 node reservoir. This highlights how the small improvements in a single component design yield benefits that scale with the size of the reservoir. Such an improvement paves the way for building reservoirs with a larger number of nodes than is currently possible and that can therefore solve more complex tasks.

In future work we will explore the operation of the coherent multimode reservoir and design a photonics chip that implements the multimode reservoir computing paradigm to take advantage of the improved energy efficiency.

Methods

Numerical simulations

For calculating the eigenmodes of the waveguides, Lumerical^© Mode Solutions, a commercial mode solver package, was used. The Film Mode Matching Method-based solver in Fimmwave^© was used to double-check the results. For the Lumerical^© varFDTD method selected for the propagation simulations and optimisation of the parameters of the Y-junctions, an auto non-uniform meshing strategy was chosen at level 5 accuracy and a conformal variant 1 mesh refinement was applied. The center wavelength was set to 1300 nm and a silicon-oxide cladding of refractive index 1.4469 surrounding the 220 nm high silicon waveguide structure with material parameters according to Palik’s model in the Lumerical material database was used. The geometries of the structures were designed with Luceda Ipkiss^© Photonic Design Environment, exported into the GDSII format and then imported into Lumerical^©. The varFDTD method was selected as it enables fast simulations (2D FDTD simulation speeds) with accuracy accuracy approaching that of 3D-FDTD simulations for devices with omni-directional in-plane propagation such as the MMI in this work.

Circuit simulations and task setup

We use custom time-domain circuit simulation scripts based on the Caphe software²⁵ and scikit-learn library²⁶ for the machine learning. Unless stated otherwise, a 4 × 4 (16 node) reservoir architecture was used to generate the states. This number of nodes was chosen as it is a design that is both cost-effective to produce with multi-project wafer runs, but also has a good performance on a number of tasks. In all cases, the length of the interconnections between the reservoir translates to a propagation time of 62.5 ps, matching the current generation of experimentally available chips¹⁸.

Once the states were obtained and transformed with an appropriate photodetector model taking into account various noise contributions and bandwidth limitations, the readout was trained using the scikit-learn library²⁶. For the training, 10,000 randomly chosen bits were fed into the reservoir and the resulting states were used for training with 5-fold cross-validation to optimise the model hyperparameters and yet another 10,000 for testing. We used a regularised ridge regression algorithm to train the linear readout. Testing is done on the best case resulting from the cross-validation. All reported error rates are related to the test data. With 10,000 bits for testing, error rates are reported at a confidence level of about 90%²⁷.

Data availability

The datasets generated and analysed for this work are available from the corresponding author on reasonable request.

References

Maass, W., Natschläger, T. & Markram, H. Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural computation 2560, 2531–2560 (2002).
Article MATH Google Scholar
Jaeger, H. & Haas, H. Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication. Sci. (New York, NY) 304, 78–80 (2004).
Article ADS CAS Google Scholar
Verstraeten, D., Schrauwen, B., D’Haene, M. & Stroobandt, D. An experimental unification of reservoir computing methods. Neural Networks 20, 391–403 (2007).
Article CAS PubMed MATH Google Scholar
Hauser, H., Ijspeert, A., Füchslin, R., Pfeifer, R. & Maass, W. Towards a theoretical foundation for morphological computation with compliant bodies. Biol. Cybern. 105, 355–370 (2011).
Article MathSciNet PubMed MATH Google Scholar
Sillin, H. O. et al. A theoretical and experimental study of neuromorphic atomic switch networks for reservoir computing. Nanotechnol. 24, 384004 (2013).
Article Google Scholar
Kulkarni, M. S. & Teuscher, C. Memristor-based reservoir computing. Proceedings of the 2012 IEEE/ACM International Symposium on Nanoscale Architectures – NANOARCH ’12 (pp. 226–232. ACM Press, New York, New York, USA, 2012).
Google Scholar
Vandoorne, K. Photonic reservoir computing with a network of coupled semiconductor optical amplifiers. Ph.D. thesis (2011).
Paquot, Y. et al. Optoelectronic Reservoir Computing. Sci. Reports 2, 287 (2012).
Article CAS Google Scholar
Vinckier, Q. et al. High-performance photonic reservoir computer based on a coherently driven passive cavity. Opt. 2, 438–446 (2015).
Google Scholar
Brunner, D., Soriano, M. C., Mirasso, C. R. & Fischer, I. Parallel photonic information processing at gigabyte per second data rates using transient states. Nat. communications 4, 1364 (2013).
Article ADS Google Scholar
Appeltant, L. et al. Information processing using a single dynamical node as complex system. Nat. communications 2, 468 (2011).
Article CAS Google Scholar
Larger, L. et al. Photonic information processing beyond Turing: an optoelectronic implementation of reservoir computing. Opt. Express 20, 3241 (2012).
Article ADS CAS PubMed Google Scholar
Duport, F., Schneider, B., Smerieri, A., Haelterman, M. & Massar, S. All-optical reservoir computing. Opt. Express 20, 22783 (2012).
Article ADS PubMed Google Scholar
Dejonckheere, A. et al. All-optical reservoir computer based on saturation of absorption. Opt. Express 22, 10868 (2014).
Article ADS PubMed Google Scholar
Soriano, M. C. et al. Optoelectronic reservoir computing: tackling noise-induced performance degradation. Opt. Express 21, 12 (2013).
Article ADS CAS PubMed Google Scholar
Nguimdo, R. M., Verschaffelt, G., Danckaert, J. & Van der Sande, G. Fast photonic information processing using semiconductor lasers with delayed optical feedback: Role of phase dynamics. Opt. Express 22, 8672 (2014).
Article ADS PubMed Google Scholar
Hicke, K. et al. Information Processing Using Transient Dynamics of Semiconductor Lasers Subject to Delayed Feedback. IEEE J. Sel. Top. Quantum Electron. 19, 1501610–1501610 (2013).
Article Google Scholar
Vandoorne, K., Dambre, J., Verstraeten, D., Schrauwen, B. & Bienstman, P. Parallel reservoir computing using optical amplifiers. IEEE transactions on neural networks 22, 1469–81 (2011).
Article PubMed Google Scholar
Mesaritakis, C., Papataxiarhis, V. & Syvridis, D. Micro ring resonators as building blocks for an all-optical high-speed reservoir-computing bit-pattern-recognition system. JOSA B (2013).
Fiers, M. A. A. et al. Nanophotonic reservoir computing with photonic crystal cavities to generate periodic patterns. IEEE Transactions on Neural Networks Learn. Syst. 25, 344–355 (2014).
Article Google Scholar
Zhang, H. et al. Integrated photonic reservoir computing based on hierarchical time-multiplexing structure. Opt. Express 22, 31356–31370 (2014).
Article ADS PubMed Google Scholar
Mesaritakis, C., Kapsalis, A. & Syvridis, D. All-optical reservoir computing system based on InGaAsP ring resonators for high-speed identification and optical routing in optical networks. vol. 9370, 937033 (2015).
Vandoorne, K. et al. Experimental demonstration of reservoir computing on a silicon photonics chip. Nat. communications 5, 3541 (2014).
Article Google Scholar
Dai, D., Wang, J. & Shi, Y. Silicon mode (de)multiplexer enabling high capacity photonic networks-on-chip with a single-wavelength-carrier light. Opt. Lett. 38, 1422 (2013).
Article ADS CAS PubMed Google Scholar
Fiers, M. et al. Time-domain and frequency-domain modeling of nonlinear optical components at the circuit-level using a node-based approach. J. Opt. Soc. Am. B 29, 896–900 (2012).
Article ADS CAS Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine Learning in {P}ython. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Jeruchim, M. Techniques for Estimating the Bit Error Rate in the Simulation of Digital Communication Systems. IEEE J. on Sel. Areas Commun. 2, 153–170 (1984).
Article ADS Google Scholar

Download references

Acknowledgements

This research was funded by the EU Horizon 2020 PHRESCO Grant (Grant No. 688579), and the BELSPO IAP P7-35 program Photonics@BE.

Author information

Authors and Affiliations

Photonics Research Group, Department of Information Technology, Ghent University - imec, Ghent, Belgium
Andrew Katumba, Jelle Heyvaert, Bendix Schneider, Sarah Uvin & Peter Bienstman
IDLab, Department of Electronics and Information Systems, Ghent University - imec, Ghent, Belgium
Joni Dambre

Authors

Andrew Katumba
View author publications
You can also search for this author in PubMed Google Scholar
Jelle Heyvaert
View author publications
You can also search for this author in PubMed Google Scholar
Bendix Schneider
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Uvin
View author publications
You can also search for this author in PubMed Google Scholar
Joni Dambre
View author publications
You can also search for this author in PubMed Google Scholar
Peter Bienstman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Andrew Katumba, Jelle Heyvaert and Sarah Uvin designed the multimode Y-junction. Andrew Katumba set up the circuit simulations and evaluated the performance of the photonic reservoir based on the multimode waveguides and Y-junction splitters/combiners. Andrew Katumba prepared the manuscript. Bendix Schneider, Peter Bienstman and Joni Dambre guided the interpretation of results and selection of experiments. All authors reviewed the manuscript.

Corresponding author

Correspondence to Andrew Katumba.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Katumba, A., Heyvaert, J., Schneider, B. et al. Low-Loss Photonic Reservoir Computing with Multimode Photonic Integrated Circuits. Sci Rep 8, 2653 (2018). https://doi.org/10.1038/s41598-018-21011-x

Download citation

Received: 04 October 2017
Accepted: 29 January 2018
Published: 08 February 2018
DOI: https://doi.org/10.1038/s41598-018-21011-x

This article is cited by

Connectome-based reservoir computing with the conn2res toolbox
- Laura E. Suárez
- Agoston Mihalik
- Bratislav Misic
Nature Communications (2024)
Reservoir computing based on a silicon microring and time multiplexing for binary and analog operations
- Massimo Borghi
- Stefano Biasi
- Lorenzo Pavesi
Scientific Reports (2021)
Learning function from structure in neuromorphic networks
- Laura E. Suárez
- Blake A. Richards
- Bratislav Misic
Nature Machine Intelligence (2021)
A brief review of integrated and passive photonic reservoir computing systems and an approach for achieving extra non-linearity in passive devices
- Dachuan Wu
- Yasha Yi
- Yuxiao Zhang
Science China Information Sciences (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Reservoir computing based on a silicon microring and time multiplexing for binary and analog operations

Physical reservoir computing with emerging electronics

Photonic reservoir computing based on nonlinear wave dynamics at microscale

Introduction

Results

Design of a multimode Y-junction with low combiner loss

Waveguide cutoff conditions

Relevant geometrical parameters of Y-junction

Waveguide Width Optimisation

Modal power distribution

Taper Length Optimisation

Wavelength dependence

Performance of the improved Y-junctions in photonic reservoirs

Error rates for different data rates

Energy efficiency considerations

Discussion

Methods

Numerical simulations

Circuit simulations and task setup

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Connectome-based reservoir computing with the conn2res toolbox

Reservoir computing based on a silicon microring and time multiplexing for binary and analog operations

Learning function from structure in neuromorphic networks

A brief review of integrated and passive photonic reservoir computing systems and an approach for achieving extra non-linearity in passive devices

Comments

Search

Quick links