Article | Open | Published:

# Benchmarking integrated linear-optical architectures for quantum information processing

## Abstract

Photonic platforms represent a promising technology for the realization of several quantum communication protocols and for experiments of quantum simulation. Moreover, large-scale integrated interferometers have recently gained a relevant role in quantum computing, specifically with Boson Sampling devices and the race for quantum supremacy. Indeed, various linear optical schemes have been proposed for the implementation of unitary transformations, each one suitable for a specific task. Notwithstanding, so far a comprehensive analysis of the state of the art under broader and realistic conditions is still lacking. In the present work we fill this gap, providing in a unified framework a quantitative comparison of the three main photonic architectures, namely the ones with triangular and square designs and the so-called fast transformations. All layouts have been analyzed in presence of losses and imperfect control over the internal reflectivities and phases, showing that the square design outperforms the triangular scheme in most operational conditions. Our results represent a further step ahead towards the implementation of quantum information protocols on large-scale integrated photonic devices.

## Introduction

Several milestone achievements in experimental quantum information are pushing the limits of integrated photonic technologies in numerous relevant applications. Single-photon sources1,2,3,4 and detectors5,6 are already providing remarkable results in first benchmark demonstrations, while a number of powerful techniques have been developed to fully characterize general quantum processes7,8,9,10,11,12. The miniaturization of complex interferometric schemes is thus expected to unlock stable and mass-produced large-scale quantum information protocols, among the others for teleportation13, logic gates14,15, quantum networks16,17 and light manipulation18,19. One further, fundamental feature of such platforms is the capability of adding dynamical reconfigurability to the circuits20,21,22,23, allowing for universal applications as for standard classical processors24,25,26. More in particular, a research area that well benefits from all the above-mentioned technologies is that of Boson Sampling27,28,29,30,31,32,33,34,35, where efficient sampling from linear photonic devices is expected to provide evidence of a quantum computational power beyond the reach of classical computers35,36,37.

In this context, it is essential to identify suitable architectures to implement large-size interferometric networks within an integrated platform. In ref.38, Reck et al. proposed a universal algorithm to implement an arbitrary unitary transformation by decomposing it in a suitable network of unit cells made up of only beam splitters and phase shifters. For each transformation to be implemented, it is sufficient to determine the correct set of parameters (beam splitter transmittivities and internal phases) without altering the overall interferometric layout. This architecture has been employed in first experimental instances of Boson Sampling30, where the capability to implement arbitrary Haar random unitaries is an essential ingredient for the demonstration of its computational complexity. However, this architecture lacks a perfect symmetry in its triangular layout, thus making it sensitive to internal losses that lower the adherence of the implemented transformation to the ideal one. Recently, two different architectures have been proposed for the implementation of linear optical networks. A first scheme has been reported in ref.39, which ultimately corresponds to a cunning rearrangement of the scheme of ref.38 in a symmetric layout. While keeping the same number of optical elements and the capability of implementing an arbitrary unitary transformation, this scheme presents reduced sensitivity to losses within the interferometer. A second scheme, inspired by the classical algorithm of Cooley and Tukey40 for the fast Fourier transform, has been proposed in refs41,42 and implemented experimentally in refs43,44,45,46 by exploiting the three-dimensional capabilities of femtosecond laser micromachining47,48. This layout, though not supporting arbitrary unitary evolutions, allows to implement a significant class of linear optical networks with a substantial reduction in the number of necessary optical elements. Such class of matrices includes the Hadamard ones, with a notable example provided by the Fourier transformation that is widely employed in a large set of quantum information protocols49,50. A crucial requirement towards the identification of optimal architectures for the implementation of large size interferometers is a detailed knowledge of the tolerance to fabricative errors, namely propagation losses and imperfections in the parameters of the optical elements. Indeed, in all quantum information applications including Boson Sampling51,52,53,54,55,56 the applicability of a given experimental platform is limited by the maximum amount of noise tolerable in the interfometric networks. Within this general framework, a thorough analysis of the tolerance of the proposed architectures in the presence of fabrication noise is still lacking.

In this article we bridge this gap, presenting a complete analysis of the performance of the main photonic architectures under imperfect operational conditions. The three interferometric layouts have been investigated in the general case, by admitting different levels of losses and noise in unitary transformations of increasing size. Specifically, following the approach commonly adopted for reconfigurable quantum circuits24,25, i.e. by modelling beam splitters as Mach-Zehnder interferometers with variable phases and two cascaded symmetric beam splitters, noise was added to their reflectivities and to phases in both Mach-Zehnders and outer phase shifters. For our numerical benchmark we employ as figures of merit the fidelity39,52,55,56 and the total variation distance (TVD)53,54,55,56, as good estimators of the distance between ideal and imperfect implementations in relevant applications. In particular, while depending also on external factors like purity and degree of distinguishability between the input photons, the TVD has been already identified as a key figure of merit for the goodness of multiphoton experiments53,54,55,56 (see Supplementary Note 3). The article is structured as follows. First, we briefly discuss the interferometric structure of the triangular, square and fast designs, whose main characteristics are outlined in Fig. 1. Thereafter, we compare the performances of the first two schemes, which were shown to be universal for unitary evolutions, for the implementation of Haar-random transformations of increasing size. Finally, we compare the operation of the three schemes for the implementation of Fourier and Sylvester interferometers, which represent the fundamental building blocks in a significant number of relevant quantum information protocols. Our analysis highlights the advantages and limitations of each scheme, providing an essential reference point for the design of future larger-scale photonic technologies, whose optimal configurations may well benefit from a joint integration.

## Results

### Designs for photonic architectures

Since the seminal work of Hurwitz57, it is known that every m × m unitary transformation can be decomposed in the action of $$\frac{m(m-1)}{2}$$ unitaries, each acting on a two-dimensional subspace of the Hilbert space. Reck et al.38 independently gave the first operational proof (R) that an actual (linear optical) implementation, consisting of only single-mode phase shifters and two-mode beam splitters, does exist for any discrete unitary operator. Recently, a new algorithm (C) for the decomposition of arbitrary unitary transformations has been introduced by Clements et al.39, which basically presents a higher resilience against propagation losses thanks to the compact and fully symmetric design. Both R and C decompositions are made up of a set of $$\frac{m(m-1)}{2}$$ unitaries $${T}_{k,k+1}^{(i)}$$, each coupling step by step modes k and k + 1 of the interferometer

$$\begin{array}{cc}{T}_{k,k+1}^{(i)}=(\begin{array}{cccc}1 & & & 0\\ & -\sin \,{\omega }_{i} & {e}^{-\iota {\varphi }_{i}}\,\cos \,{\omega }_{i} & \\ & \cos \,{\omega }_{i} & {e}^{-\iota {\varphi }_{i}}\,\sin \,{\omega }_{i} & \\ 0 & & & 1\end{array}), & \qquad \qquad \qquad {U}_{R,C}={\prod }_{i=1}^{\frac{m(m-1)}{2}}{T}_{k,k+1}^{(i)}\end{array}$$
(1)

where the order of the interactions is directly related to the triangular and square designs of, respectively, the R and C schemes. While being both universal for the decomposition of unitary transformations, the C-design presents some immediate advantages in terms of circuit depth, namely a more balanced mixing of the optical modes and less propagation losses thanks to a minimized operation area, which is also a crucial requirement for large-scale implementations.

The universality of the C- and R-designs for unitary decomposition comes however at the cost of ignoring possible symmetries of the transformations, which could reduce the complexity of specific implementations. A relevant example is represented for instance by the Hadamard transformations, whose symmetries are known to allow remarkable simplifications in their algorithmic construction40. Notable representatives of the Hadamard class are the Sylvester (U S) and Fourier (U F) transformations, described by 2n-dimensional unitary matrices given by

$$\begin{array}{cc}{U}^{S}{(2}^{n})=S{(2}^{n})=(\begin{array}{cc}S{(2}^{n-1}) & S{(2}^{n-1})\\ S{(2}^{n-1}) & -S{(2}^{n-1})\end{array}), & \qquad \qquad {U}_{a,b}^{F}{(2}^{n})=\frac{1}{\sqrt{{2}^{n}}}\,{e}^{2\pi i{\textstyle \tfrac{ab}{{2}^{n}}}}\end{array}$$
(2)

being S(20) = (1) and n any positive integer. More generally, starting from the linear-optical fast Fourier decomposition developed by Barak et al.41,42, it is possible to generalize their scheme to span a whole class of generalized Hadamard transformations58 by keeping fixed the interferometric structure and by tuning the parameters of the beam splitters and phase shifters. The essential structure of such fast architectures presents some interesting advantages with respect to the other universal schemes. First, the depth of the circuit scales only logarithmically with the size of the interferometer, i.e. with the number of optical modes, leading to an even more compact operation area and to reduced propagation losses. Moreover, the layout is fully symmetric and naturally fits a description in terms of qubit states, thanks to the binary interactions between the modes. A closed-form expression of the element U a,b of the most general 2n-dimensional fast unitary transformation has the form

$${U}_{\,a+\mathrm{1,}\,b+1}^{(n)}={e}^{i{b}_{r}{\varphi }_{r}^{{\xi }_{r}^{(a,b)}}+i\tfrac{\pi }{2}{a}_{r}^{(n)}\oplus {b}_{r}^{(n)}}\,\prod _{s=1}^{n}\cos \,{\chi }_{a,b}^{(n,s)}$$
(3)

where a, b [0, 2n − 1] label the input/output modes and some shorthand notations have been used, following the Einstein summation convention

$$\begin{array}{cc}{\chi }_{a,b}^{(n,s)}={\theta }_{s,{f}_{s}^{(a,b)}}\,-\,\frac{\pi }{2}\,|{a}_{s}^{(n)}-{b}_{s}^{(n)}|, & {\xi }_{r}^{(a,b)}=1+{2}^{n-r}+b\,{\rm{m}}{\rm{o}}{\rm{d}}\,{2}^{n-r}+\lfloor {\textstyle \tfrac{a}{{2}^{n-r+1}}}\rfloor \,{2}^{n-r+1}\end{array}$$
(4)

where $${f}_{s}^{(a,b)}=1+b+({a}_{r}^{(n)}\,-\,{b}_{r}^{(n)})\,{2}^{n-r}$$ and $${m}_{r}^{(n)}$$ equals the r-th digit of the n-bit binary representation of m, being $$\lfloor x\rfloor$$ the integer part of the real number x.

In general, any m-dimensional photonic architectures can be described in terms of consecutive layers s of optical elements L s , made up of a network of phase shifters and beam splitters mixing a subset of modes no more than once each. Specifically, each matrix L s consists in turn of a layer B (s) of beam splitters, coupling a set of pairs of modes (k 1, k 2), and $$\tfrac{m}{2}$$ phase shifters $${e}^{i{\varphi }_{s,{k}_{1}}}$$ placed for each pair on one of the two interacting modes (k 1, k 2). The particular sequence of mode interactions {(k 1, k 2)} depends on the triangular, square or fast designs. While clearer for the first two, the geometry of the third scheme for a 2n-dimensional unitary transformation is slightly more complex and arises from the binary representations of the optical modes. Using τ s,k for the beam splitters transmissivities on mode k, the beamsplitters layer is described by the matrix

$${B}_{{k}_{1},{k}_{2}}^{(s)}\equiv \{\begin{array}{ll}{\tau }_{s,{k}_{1}} & {k}_{1}={k}_{2}\\ i\,{(1-{\tau }_{s,{k}_{1}}^{2})}^{\mathrm{1/2}} & ({k}_{1},{k}_{2})\in {\{(\alpha ,\beta )\}}^{(n,s)}\\ 0 & otherwise\end{array}$$
(5)

where {(α, β)}(n,s) = {(a + 2sb, a + 2sb + 2s−1)} are the pairs of modes interacting in the layer s, with a {1, …, 2s−1}, b {0, …, 2ns − 1}. For example, the 8-dimensional quantum Fourier transform is obtained, modulo a relabeling of the output modes42, by choosing $${\tau }_{s,k}=\sqrt{{2}^{-1}}$$ and $${\varphi }_{\mathrm{2,7}}={\varphi }_{\mathrm{2,8}}={\varphi }_{\mathrm{3,4}}=\tfrac{\pi }{2}$$, $${\varphi }_{\mathrm{3,6}}=\tfrac{\pi }{4}$$ and $${\varphi }_{\mathrm{3,8}}=\tfrac{3\pi }{4}$$. Similarly, the Sylvester transformation corresponds to the choice $${\tau }_{s,k}=\sqrt{{2}^{-1}}$$ and ϕ s,k  = 0.

### Modelling non-ideal unitary implementations

We can now introduce the model adopted to probe the three architectures under non-ideal conditions. In the following we will refer to the C- (Clements et al.39), R– (Reck et al.38) and F- (Fast) designs looking at their fixed, solid photonic architectures. This aspect is especially relevant if we are to choose the layout of a fully reconfigurable quantum circuit, which is designed to be multi-purpose and optimal when averaging over all its applications of interest. In general, the two main factors affecting the implementation of photonic quantum circuits are propagation losses and imperfect settings of the parameters describing the optical elements.

#### Losses

Coupling losses at the input/output of any circuits remains a relevant aspect in practical situations; however, their effect can be regarded as independent of the internal photonic architecture adopted and, thus, they will not be included in our study. Propagation losses, occurring in both straight and bent waveguides, play instead the main role in spoiling multipath interference. Their impact was shown to be mitigated by a more compact and symmetric interferometric structure39, where optical modes interact with each other in a balanced way. Though photon losses unavoidably occurr all along the circuit, a simple and effective way to analyze their effect is that of inserting costant losses at the output of each two-mode unit cell.

#### Fabricative noise

Imperfect control over the fabricative parameters naturally leads to deviations from the ideal evolution on the circuit while keeping the unitarity of the process. The source of noise, arising from imperfect fabrication of the beam splitters and phase shifters associated to the $${T}_{k,k+1}^{(i)}$$ in Eq. (1), has a different nature depending on the practical realization. Recent technological achievements have enabled the realization of reconfigurable quantum circuits24,25, where generic beam splitters are implemented as Mach-Zehnder interferometers with two cascaded symmetric beam splitters and one tunable phase shift. In this case, which will be at the core of our analysis, noise arises from imperfect fabrication of the fixed symmetric beam splitters and from a non-perfect control over the thermo-electric or electro-optic phase manipulation, which becomes non-negligible for large-scale circuits or with high-speed tunings. More specifically, fabrication errors may have a systematic component and a completely random one. Systematic components can be compensated in principle by fabricating several devices, within the same substrate and different geometrical parameters. The use of thermo-optic phase shifters further allows to compensate for all fixed phase imperfections.

To focus our study around realistic values for the above mentioned sources of imperfections, we will consider femtosecond laser writing as the reference fabrication technology, as it is the only one capable to realize all the three considered architectures. In this case, typical bending losses with respect to straight waveguides are of the order of 0.5 dB/cm, which corresponds to 0.2 dB per directional coupler. Realistic values for fabrication errors at each directional coupler are of 0.21 rad for the phase shifters59 and about 1–2% for the beam splitter transmissivities. These values do not change significantly considering a 2D or 3D architecture and, concerning phase shifts, a quite higher accuracy down to about 0.01 rad can actually be reached with an active control through thermo-optic phase shifters23. Therefore, we can consider as realistic fabrication tolerances 0.01 for the beam splitter transmissivity and 0.01 rad for the active phase control.

Following the scheme of Fig. 2, our investigation on the performances of the three architectures was carried out by introducing various levels of noise on the optical elements and losses after each unit cell. Our Monte Carlo simulations proceed through the following steps:

1. 1.

Sample a unitary transformation U according to the Haar measure;

2. 2.

Apply C or R algorithms to retrieve the parameters (ω i , ϕ i ) according to Eq. (1);

3. 3.

Implement each $${T}_{k,k+1}^{(i)}$$ as a Mach-Zehnder with input phase shift ϕ i and τ i,1 = τ i,2 = 2−1/2:

$${T}^{(i)}\to -\iota \,(\begin{array}{cc}{\tau }_{i\mathrm{,2}} & \iota \sqrt{1\,-\,{\tau }_{i\mathrm{,2}}^{2}}\\ \iota \sqrt{1\,-\,{\tau }_{i\mathrm{,2}}^{2}} & {\tau }_{i\mathrm{,2}}\end{array})\,(\begin{array}{cc}{e}^{-\iota {\omega }_{i}} & 0\\ 0 & {e}^{\iota {\omega }_{i}}\end{array})\,(\begin{array}{cc}{\tau }_{i\mathrm{,1}} & \iota \sqrt{1\,-\,{\tau }_{i\mathrm{,1}}^{2}}\\ \iota \sqrt{1\,-\,{\tau }_{i\mathrm{,1}}^{2}} & {\tau }_{i\mathrm{,1}}\end{array})\,(\begin{array}{cc}1 & 0\\ 0 & {e}^{\iota {\varphi }_{i}}\end{array})$$
(6)
4. 4.

Introduce gaussian noise on the four parameters (τ i,1, τ i,2, ω i , ϕ i ), by sampling new values from a normal distribution centered on the ideal ones and with widths σ BS and σ PS respectively for (τ 1, τ 2) and for (ω, ϕ);

5. 5.

Generate new unit cells from the noisy values, adding possible losses diag(η i ) at the output, and rebuild the noisy U.

This simple procedure allows us to investigate the effect of noise and losses on the C- and R-designs for the implementation of Haar random unitaries. Incidentally, our numerical simulations confirm60 that predictions would be remarkably different if, instead of generating every time a new unitary according to the Haar measure, we directly generated sets of uniformly distributed random parameters (ω i , ϕ i ) to -mistakenly- speed up the calculation. Subsequently, the same analysis is repeated for all three architectures (C, R and F) focusing on the implementation of the Fourier and the Sylvester transformation.

### Benchmarking Haar-random interferometers

The results of our first analyses are shown in Fig. 3. The figure of merit adopted to compare the two architectures is the fidelity39 $${\mathcal F} ={|\tfrac{{\rm{Tr}}({U}_{imp}^{\dagger }U)}{\sqrt{m{\rm{Tr}}({U}_{imp}^{\dagger }{U}_{imp})}}|}^{2}$$, being U a m × m unitary transformation and U imp its imperfect implementation. Thanks to the rescaling factor in the denominator, accounting for the contribution of lossy implementations, this fidelity is particularly suitable to characterize non-unitary transformations. Figures 3a,b show the deterioration of $${\mathcal F}$$ for increasing values of losses η per unit cell and size m of the circuit for the C- and R- designs. In this simulation, optical elements are assumed to be immune to fabrication noises in order to isolate the η-contribution. As pointed out in ref.39, the C-design is more resilient to internal losses than the R- design, thanks to the almost total balance between the mode interactions. Moreover, as the heuristic non-linear fit suggests (see Supplementary Note 1), also the scaling of the fidelity is much more favorable for the former architecture, features that pushes C as a promising candidate for large-scale platforms or where, due to technical issues inherent to the specific implementation, it is not possible to guarantee a low level of losses in each optical element. We observe that, as already pointed out in refs26,39, inserting additional lossy elements in the R-scheme allows to compensate for the imbalance, at the price of significantly increasing the total amount of losses (see Supplementary Note 4). Figures 3c,d show instead the average effect of a noisy implementation of the optical components over the fidelity. Similarly to the previous analysis, our simulation is carried out for various levels of noise σ and different sizes m, aiming to retrieve a more complete feeling of its scaling. Noise is assumed to be of equal intensity at this stage on both beam splitters transmissivities (σ BS ) and phase shifts (σ PS ), i.e. σ = σ BS  = σ PS , in order to capture the scaling of the performance in a unique three-dimensional plot. Our simulations confirm previous qualitative estimates39 concerning the similarity of the scalings of the average fidelity in the two architectures, providing a clear and quantitative picture of its dependency on the fabrication noise over the optical elements. Such a similar trend is observed also when investigating their performance in the case of crosstalk between thermal shifters (see Supplementary Note 5).

After this preliminary stage, more general investigations have been carried out by considering different levels of noise over the fabrication transmissivities and phases, i.e. studying the practical situation where σ BS  ≠ σ PS . Results of this analysis are shown in the contour plots of Fig. 4 for different sizes of the interferometers, highlighting a number of interesting features. First, the qualitative dependency on the noise seems to remain fixed while increasing the dimension of the circuit, though the average fidelity rapidly drops to low values already at m = 64, for intensities of noise that are within the tolerances of current technology, in particular for reconfigurable circuits. Moreover, we observe that the two sources of noise affect the fidelity in a similar way, with an intensity approximately double for the one on the transmissivities in the Mach-Zehnder. Thus, while dynamic control over the phases can in principle mitigate the effects of noisy implementations, non-ideal values of the transmissivities of the symmetric beam splitters remain critical when looking at large-scale implementations. To focus our analysis on the circuit size allowed by current technology, we have considered as a reference current top-level implementations for quantum applications. Circuits with up to m = 16 have been reported for non-reconfigurable femtosecond laser written circuits59, m = 6 for universally reconfigurable lithographic circuits24 and m = 26 for reconfigurable lithographic circuits that do not implement a fully arbitrary unitary matrix25. Reasonable sizes achievable in the near future thus can be m = 32 or m = 64 depending on the level of reconfigurability required. However, looking at future large-scale implementations, we see that for m = 1024 even a very low level of noise makes the average fidelity drop to values as low as ~0.90. Note the shift in the axes scales between the four left contour plots (m ≤ 256) and the one on the right (m = 1024). Thus, for an efficient scaling a technology leap will be required not only in the circuit size but also on the level of fabrication tolerances.

So far, we adopted as figure of merit the fidelity of the quantum process. Though perfectly suitable to benchmark noisy transformations, this quantity fails in general to highlight more complex many-particle phenomena. This requirement becomes even more strict for instance in the context of Boson Sampling and, more specifically, in its validation, where multiphoton interference plays a key role to guarantee the computational complexity of the problem. Indeed, while Boson Sampling preserves its complexity even in lossy and imperfect conditions below a certain threshold51,54, multiphoton interference in Hadamard interferometers44 was shown to be a promising tool to correctly validate its operation. Partial deviation from their ideal symmetric structures can then spoil the interference effects in the output distributions. For this reason, we investigated the performance of noisy architectures also in the scope of multiphoton output probability distributions, employing as figure of merit the total variation distance (TVD) between the ideal (P) and actual ($$\tilde{P}$$) n-photon distributions: $$TVD(P,\,\tilde{P})=\tfrac{1}{2}\,{\sum }_{i}\,|{p}_{i}\,-\,{\tilde{p}}_{i}|$$. Indeed, the TVD plays a relevant role in testing statistical hypotheses since the quantity 1 − TVD is a lower bound on the sum of Type I and Type II error rates61. From a practical point of view, $$TVD(P,\tilde{P})$$ is also a simple and natural metric to quantify the discrepancy between the two probability distributions P and $$\tilde{P}$$. Results for this analysis are shown in Fig. 5: again, noise affects almost equally the C and R architectures when averaging over all the input/output states. The slight difference in favor of the R-design may not be practically appreciable in real experimental conditions and it rapidly becomes negligible for higher values of (n, m). Thus, the two architectures behave equally as far as lossless multiphoton investigations are concerned.

We conclude from our analysis that the C- and R-designs are ultimately equivalent in terms of resilience to noise when averaging over all input/output configurations. On one side, the single optical elements affect in a different way the elements of the unitary transformation in the two schemes, being the C and R parameters more localized respectively in the upper corner and central part of the unitary matrix. However, for practical noise levels and general multi-input applications, this difference does not give rise to any effective deviation between the two schemes. In contrast, the two architectures behave differently when it comes to lossy implementations, where the symmetric C-design outperforms the R scheme.

### Benchmarking Fourier and Sylvester interferometers

Though the C- and R-designs are universal for unitary evolutions, often it is desirable to have circuits optimized for specific relevant tasks. It is the case of the quantum Fourier or Sylvester transforms, which have importance on their own in different scopes of quantum information processing.

In this section we investigate the performance of the Fast (F) scheme42,43,44,45,46 for the implementation of generalized Hadamard transformations58 and compare it with the average performance of the two universal designs, following the same procedure outlined in Fig. 2. Figure 6 reproduces the analysis of Fig. 3 including the F architecture. Being the circuit completely symmetric, the structure is totally immune to constant propagation losses, beating even the highly resilient C-design. The F-design is also more resilient to noise, thanks to the reduced depth of the circuit which lowers the number of noisy optical elements from $$\frac{m}{2}(m-1)$$ to $$\tfrac{m}{2}\,\mathrm{log}\,m$$. A benchmark summary of this comparison is reported in Table 1 for both the quantum Fourier transform and the Sylvester interferometers.

## Conclusions

Photonic technologies promise to enable the application of several quantum information protocols, ranging from fundamental research to quantum computation and optical quantum networks. In this work we have provided a comprehensive analysis of the performance of the three main interferometric schemes, namely the triangular38 and square39 designs and the Fast architecture, under realistic conditions of losses and noise. Our investigations quantitatively address the issue of imperfect implementations for interferometers of increasing dimension, aiming to embrace both mid-term and long-term technological standards. Our results confirm the qualitative expectation that the square design performs way better, in terms of fidelity, between ideal and lossy evolutions with respect to the triangular one, even though the latter exhibits a slightly enhanced resilience to fabrication noise when considering also the total variation distance. Thus, we conclude that the square design is preferable in practical applications involving multi-input protocols and Haar-random generation, especially in high-dimension circuits where the issue of propagation losses becomes critical.

Fast architectures represent instead a specialized design to implement a significant class of unitary evolutions, highly optimized for the realization of Fourier and Sylvester quantum transformations. Our results quantitatively highlight the improved performance of this scheme with respect to the universal ones, thus making it the preferred choice when applicable. Furthermore, we have provided a closed-form expression for the elements of the 2n-dimensional unitary describing these circuits, which allows to suitably tailor single elements of the unitary transformations mapping them to the transmissivities and the phase shifts in the real optical implementation. Due to the deep relevance of this broad class of quantum routines, and thanks to the high enhancements offered by their optimized implementations, we expect Fast architectures to gain a key role among photonic platforms in synergy with the universal schemes, to fully benefit from the unique advantages of both designs.

During the completion of this manuscript, related work has been reported in refs60,62.

### Data availability

The datasets generated and analysed during the current study are available from the corresponding author on reasonable request.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## References

1. 1.

Silverstone, J. W. et al. Qubit entanglement between ring-resonator photon-pair sources on a silicon chip. Nat. Commun. 6, 7948 (2015).

2. 2.

Somaschi, N. et al. Near-optimal single-photon sources in the solid state. Nat. Photon. 10, 340–345 (2016).

3. 3.

Spring, J. B. et al. Chip-based array of near-identical, pure, heralded single-photon sources. Optica 4, 90–96 (2017).

4. 4.

Montaut, N. et al. High-efficiency plug-and-play source of heralded single photons. Phys. Rev. Applied 8, 024021 (2017).

5. 5.

Lita, A. E., Miller, A. J. & Nam, S. W. Counting near-infrared single-photons with 95% efficiency. Opt. Express 16, 3032–3040 (2008).

6. 6.

Calkins, B. et al. High quantum-efficiency photon-number-resolving detector for photonic on-chip information processing. Opt. Express 21, 22657–22670 (2008).

7. 7.

O’Brien, J. L. et al. Quantum process tomography of a controlled-NOT gate. Phys. Rev. Lett. 93, 080502 (2004).

8. 8.

Lobino, M. et al. Complete characterization of quantum-optical processes. Science 322, 563–566 (2008).

9. 9.

Laing, A. & O’Brien, J. L. Super-stable tomography of any linear optical device. Preprint at https://arxiv.org/abs/1208.2868 (2012).

10. 10.

Rahimi-Keshari, S. et al. Direct characterization of linear-optical networks. Opt. Express 21, 13450–13458 (2013).

11. 11.

Tillmann, M., Schmidt, C. & Walther, P. On unitary reconstruction of linear optical networks. J. Opt. 18, 114002 (2016).

12. 12.

Spagnolo, N. et al. Learning an unknown transformation via a genetic approach. Preprint at https://arxiv.org/abs/1610.03291 (2016).

13. 13.

Metcalf, B. J. et al. Quantum teleportation on a photonic chip. Nat. Photon. 8, 770–774 (2014).

14. 14.

Politi, A., Matthews, J. C. F. & O’Brien, J. L. Shor’s quantum factoring algorithm on a photonic chip. Science 325, 1221 (2009).

15. 15.

Crespi, A. et al. Integrated photonic quantum gates for polarization qubits. Nat. Commun. 2, 566 (2011).

16. 16.

Peruzzo, A. et al. Quantum walks of correlated photons. Science 329, 1500–1503 (2010).

17. 17.

Sansoni, L. et al. Two-particle bosonic-fermionic quantum walk via integrated photonics. Phys. Rev. Lett. 108, 010502 (2012).

18. 18.

Tanzilli, S. et al. On the genesis and evolution of Integrated Quantum Optics. Laser & Photon. Rev. 6, 115–143 (2012).

19. 19.

Orieux, A. & Diamanti, E. Recent advances on integrated quantum communications. J. Opt. 18, 083002 (2016).

20. 20.

Smith, B. J., Kundys, D., Thomas-Peter, N., Smith, P. G. R. & Walmsley, I. A. Phase-controlled integrated photonic quantum circuits. Opt. Express 17, 13516–13525 (2009).

21. 21.

Matthews, J. C. F., Politi, A., Stefanov, A. & O’Brien, J. L. Manipulation of multiphoton entanglement in waveguide quantum circuits. Nat. Photon. 3, 346–350 (2009).

22. 22.

Chaboyer, Z., Meany, T., Helt, L. G., Withford, M. J. & Steel, M. J. Tunable quantum interference in a 3D integrated circuit. Sci. Rep. 5, 9601 (2015).

23. 23.

Flamini, F. et al. Thermally-reconfigurable quantum photonic circuits at telecom wavelength by femtosecond laser micromachining. Light Sci. Appl. 4, e354 (2015).

24. 24.

Carolan, J. et al. Universal linear optics. Science 349, 711 (2015).

25. 25.

Harris, N. C. et al. Quantum transport simulations in a programmable nanophotonic processor. Nat. Photon. 11, 447–452 (2017).

26. 26.

Miller, D. A. Perfect optics with imperfect components. Optica 2, 747–750 (2015).

27. 27.

Broome, M. A. et al. Photonic boson sampling in a tunable circuit. Science 339, 794–798 (2013).

28. 28.

Spring, J. B. et al. Boson Sampling on a photonic chip. Science 339, 798–801 (2013).

29. 29.

Tillmann, M. et al. Experimental boson sampling. Nat. Photon. 7, 540–544 (2013).

30. 30.

Crespi, A. et al. Integrated multimode interferometers with arbitrary designs for photonic boson sampling. Nat. Photon. 7, 545–549 (2013).

31. 31.

Spagnolo, N. et al. General rules for bosonic bunching in multimode interferometers. Phys. Rev. Lett. 111, 130503 (2013).

32. 32.

Spagnolo, N. et al. Experimental validation of photonic boson sampling. Nat. Photon. 8, 615–620 (2014).

33. 33.

Carolan, J. et al. On the experimental verification of quantum complexity in linear optics. Nat. Photon. 8, 621–626 (2014).

34. 34.

Bentivegna, M. et al. Experimental scattershot boson sampling. Sci. Adv. 1, e1400255 (2015).

35. 35.

Aaronson, S. & Arkhipov, A. The computational complexity of linear optics. In Proceedings of the forty-third annual ACM symposium on Theory of computing, 333–342 (2011).

36. 36.

Aaronson, S. & Brod, D. J. BosonSampling with lost photons. Phys. Rev. A 93 (2015).

37. 37.

Latmiral, L., Spagnolo, N. & Sciarrino, F. Towards quantum supremacy with lossy scattershot boson sampling. New J. Phys. 18, 113008 (2016).

38. 38.

Reck, M., Zeilinger, A., Bernstein, H. J. & Bertani, P. Experimental realization of any discrete unitary operator. Phys. Rev. Lett. 73, 58 (1994).

39. 39.

Clements, W. R., Humphreys, P. C., Metcalf, B. J., Kolthammer, W. S. & Walmsley, I. A. Optimal design for universal multiport interferometers. Optica 3, 1460–1465 (2016).

40. 40.

Cooley, J. W. & Tukey, W. An algorithm for the machine calculation of complex Fourier series. Math. Comput. 19, 297–301 (1965).

41. 41.

Törmä, P., Jex, I. & Stenholm, S. Beam splitter realizations of totally symmetric mode couplers. J. Mod. Opt. 43, 245–251 (1996).

42. 42.

Barak, R. & Ben-Aryeh, Y. Quantum fast Fourier transform and quantum computation by linear optics. J. Opt. Soc. Am. B 24, 231–240 (2007).

43. 43.

Takiguchi, K., Oguma, M., Shibata, T. & Takahashi, H. Demultiplexer for optical orthogonal frequency-division multiplexing using an optical fast-Fourier-transform circuit. Opt. Lett. 34, 1828–1830 (2009).

44. 44.

Crespi, A. et al. Suppression law of quantum states in a 3D photonic fast Fourier transform chip. Nat. Commun. 7, 10469 (2016).

45. 45.

Flamini, F. et al. Observation of Majorization Principle for quantum algorithms via 3-D integrated photonic circuits. Preprint at https://arxiv.org/abs/1608.01141 (2016).

46. 46.

Viggianiello, N. et al. Experimental generalized quantum suppression law in Sylvester interferometers. Preprint at https://arxiv.org/abs/1705.08650 (2017).

47. 47.

Gattass, R. R. & Mazur, E. Femtosecond laser micromachining in transparent materials. Nat. Photon. 2, 219 (2008).

48. 48.

Spagnolo, N. et al. Three-photon bosonic coalescence in an integrated tritter. Nat. Commun. 4, 1606 (2013).

49. 49.

Shor, P. Algorithms for quantum computation: discrete logarithms and factoring. In Proceedings of the 35th Annual Symposium on the Foundations of Computer Science, 124–134 (1994).

50. 50.

Nielsen, M. & Chuang, I. Quantum Information and Quantum Computation (Cambridge, 2000).

51. 51.

Arkhipov, A. BosonSampling is robust against small errors in the network matrix. Phys. Rev. A 92, 062326 (2015).

52. 52.

Rohde, P. P. Optical quantum computing with photons of arbitrarily low fidelity and purity. Phys. Rev. A 86, 052321 (2012).

53. 53.

Rohde, P. P. & Ralph, T. C. Error tolerance of the boson-sampling model for linear optics quantum computing. Phys. Rev. A 85, 022332 (2012).

54. 54.

Leverrier, A. & Garcia-Patron, R. Analysis of circuit imperfections in BosonSampling. Quantum Information and Computation 15, 0489–0512 (2015).

55. 55.

Shchesnovich, V. S. Tight bound on the trace distance between a realistic device with partially indistinguishable bosons and the ideal BosonSampling. Phys. Rev. A 91, 063842 (2015).

56. 56.

Shchesnovich, V. S. Sufficient condition for the mode mismatch of single photons for scalability of the boson-sampling computer. Phys. Rev. A 89, 022333 (2014).

57. 57.

Hurwitz, A. Über die erzeugung der invarianten durch integration. Nachr. Ges. Wiss. Göttingen, 71–90 (1897).

58. 58.

Flamini, F. et al. Generalized quantum fast transformations via femtosecond laser writing technique. I.I.S. 23, 115–118 (2017).

59. 59.

Crespi., A. et al. Anderson localization of entangled photons in an integrated quantum walk. Nat. Photon. 7, 322–328 (2013).

60. 60.

Russell, N. J., Chakhmakhchyan, L., O’Brien, J. L. & Laing, A. Direct dialling of Haar random unitary matrices. New J. Phys. 19, 033007 (2017).

61. 61.

Lehmann, E. L. & Romano, J. P. Testing Statistical Hypotheses (Springer Texts in Statistics, 2005).

62. 62.

Burgwal, R. et al. Implementing random unitaries in an imperfect photonic network. Preprint at https://arxiv.org/abs/1704.01945v1 (2017).

## Acknowledgements

This work was supported by the ERC-Starting Grant 3D-QUEST (3D-Quantum Integrated Optical Simulation; grant agreement no. 307783): http://www.3dquest.eu, and by the H2020-FETPROACT-2014 Grant QUCHIP (Quantum Simulation on a Photonic Chip; grant agreement no. 641039): http://www.quchip.eu.

## Author information

### Affiliations

1. #### Dipartimento di Fisica, Sapienza Università di Roma, Piazzale Aldo Moro 5, I-00185, Roma, Italy

• Fulvio Flamini
• , Nicolò Spagnolo
• , Niko Viggianiello
•  & Fabio Sciarrino
2. #### Istituto di Fotonica e Nanotecnologie, Consiglio Nazionale delle Ricerche (IFN-CNR), Piazza Leonardo da Vinci, 32, I-20133, Milano, Italy

• Andrea Crespi
•  & Roberto Osellame
3. #### Dipartimento di Fisica, Politecnico di Milano, Piazza Leonardo da Vinci, 32, I-20133, Milano, Italy

• Andrea Crespi
•  & Roberto Osellame

### Contributions

F.F., N.S., N.V., A.C., R.O. and F.S. contributed to the analysis and to writing the manuscript. F.F. performed the numerical simulations.

### Competing Interests

The authors declare that they have no competing interests.

### Corresponding author

Correspondence to Fulvio Flamini.