Charting the circuit QED design landscape using optimal control theory

With recent improvements in coherence times, superconducting transmon qubits have become a promising platform for quantum computing. They can be flexibly engineered over a wide range of parameters, but also require us to identify an efficient operating regime. Using state-of-the-art quantum optimal control techniques, we exhaustively explore the landscape for creation and removal of entanglement over a wide range of design parameters. We identify an optimal operating region outside of the usually considered strongly dispersive regime, where multiple sources of entanglement interfere simultaneously, which we name the quasi-dispersive straddling qutrits (QuaDiSQ) regime. At a chosen point in this region, a universal gate set is realized by applying microwave fields for gate durations of 50 ns, with errors approaching the limit of intrinsic transmon coherence. Our systematic quantum optimal control approach is easily adapted to explore the parameter landscape of other quantum technology platforms.

For quantum technology to unfold its full potential, ultimate performance bounds must be known. This concerns all relevant steps for operating a device, such as state preparation or quantum gate implementation. One such bound is the empirical "quantum speed limit" that determines the shortest possible duration to carry out the task at hand [1][2][3]. Quantum optimal control (QOC) [4,5] has grown into a versatile tool for identifying these performance bounds. Typical control tasks include the preparation of nonclassical states, as shown in an experiment with a Bose-Einstein condensate [6], or the creation of entanglement and quantum error correction, as demonstrated with diamond spin systems [7,8]. To date, these tasks have been optimized for known, fixed parameters of the respective system. Here, we show that a fully numerical QOC approach can go even further and, using the most advanced control techniques, can map out the entire parameter landscape for the physical system at hand. To this end, we consider the task of realizing the fastest possible universal set of gates for two superconducting transmon qubits within the constraints of current lifetimes.
Superconducting transmon qubits [9] are one of the most promising architectures for quantum computing today. The development of circuit QED [10], a broad platform for quantum technology, in particular enabled the entanglement of spatially separated superconducting qubits via a shared transmission line resonator. The shared resonator can be used to implement two-qubit gates but is generally required to be decoupled when single-qubit gates are carried out.
In all these approaches, a wide range of possible parameters can be engineered. However, the parameter regimes where single-and two-quit gates can be faithfully operated are typically very different and the optimal choice of parameters is then not obvious. Identification of the optimal operating parameters is a well-defined control problem that we address here with QOC. While QOC has been used to realize specific quantum gates on superconducting qubits [19,23,[32][33][34][35][36][37], no systematic exploration of the full parameter space has been undertaken to date. In particular, most prior work has focused exclusively on the dispersive regime regime and explicitly avoided the regime of strong coupling. We show here that a fully numerical approach combined with advanced QOC techniques allows us to map the entire parameter landscape without restrictions due to approximations or model reduction. We can thus identify the global quantum speed limit for a universal set of gates for transmon qubits and analyze how gate errors vary with qubit-cavity-couplings, resonances, and cavity-mediated decay. This guides the decision for specific working points that promise a successful implementation of universal quantum computing using superconducting qubits. In particular, our results show that fast operation of both the single-and twoqubit gates needed for universal computation can be im-arXiv:1606.08825v4 [quant-ph] 3 Aug 2017 plemented in parameter regimes outside that typically explored to date. The results thus provide critical information for design decisions in circuit QED, or similarly complex quantum architectures.

Model and Parameters
Two superconducting transmons with a common transmission line resonator can be modeled by two anharmonic ladders coupled to a driven harmonic oscillator [9]. In the frame rotating with frequency ω r and within the rotating wave approximation, the Hamiltonian readŝ where ω c is the resonator ("cavity") frequency, ω 1,2 is the frequency of the first (second) qubit, α 1,2 is the qubit anharmonicity, g is the coupling strength between each qubit and the resonator, and (t) is the microwave control field in the cavity. Taking (t) ∈ C is equivalent to controlling the pulse amplitude and phase independently. It corresponds to a control field (t) ∈ R in the non-rotating frame whose driving frequency may deviate from ω r . In the following, we set = 1. Typically, the Hamiltonian (1) is treated perturbatively, in order to derive an effective model in which the cavity can be integrated out. This approach is only valid in the "dispersive regime" where the qubitcavity separation is much larger than the qubit-cavity coupling, |ω c − ω 1,2 | g. This limits the effective interaction ∝ g/(ω c − ω 1,2 ) between both qubits, except when resonances with higher transmon levels can be exploited [14,15]. Here, we forgo such a treatment in lieu of solving Eq. (1) numerically, allowing us to explore parameter regimes beyond the dispersive limit.
In order to limit the number of parameters that have to be varied, we focus on the two parameters that capture the essential physics of quantum gate implementation. The departure from the dispersive regime is characterized by ∆ c /g, with ∆ c = ω c − ω 1 . Secondly, resonances of the form |ω 1 − ω 2 | ≈ nα 1,2 , n = 1, 2, between different levels of the two transmons are known to aid in the implementation of entangling gates [14,15]; therefore, it is natural to express the qubit-qubit-detuning ∆ 2 = ω 2 −ω 1 in units of α = |α 1 + α 2 | /2. We can thus explore the entire parameter landscape in terms of ∆ c /g and ∆ 2 /α by keeping ω 1 , g, α 1 , and α 2 fixed and varying ω 2 and ω c . At each parameter point (∆ c /g, ∆ 2 /α), there is a unique minimum duration (a quantum speed limit) for any gate operation and pre-specified error. ω1/2π = 6.0 GHz ω2/2π = 5.0 -7.0 GHz (vary); ∆2 ≡ ω2 − ω1 ωc/2π = 4.5 -9.0 GHz (vary); ∆c ≡ ωc − ω1 α1/2π α2/2π  Table I. We include in our model the most relevant source of dissipation, spontaneous decay of the qubits with a decay rate γ (and the associated qubit lifetime τ γ ), and spontaneous decay of the cavity with decay rate κ (lifetime τ κ ). We assume the same decay rate for both qubits for convenience only. Different rates would result in correspondingly different error limits for single-qubit operations on each qubit, but not affect results otherwise.
Qubit interaction, entanglement, and spontaneous decay losses for zero external drive Creation of entanglement is typically considered the most difficult task in implementation of a universal set of gates. The use of dressed logical states results in a static qubit-qubit interaction ζ (for (t) = 0) that may be exploited to this end. The interaction is the result of the eigenenergies E 00 , E 01 , E 10 , and E 11 being shifted relative to the bare frame, which leads to [2] ζ = E 00 − E 01 − E 10 + E 11 . ( The value of ζ is shown in Fig. 1 (a) as a function of the two parameters ∆ c /g and ∆ 2 /α. For a horizontal and vertical slice through the parameter landscape at the point marked as X, Fig. 1 (d-g) shows how the dressed energy levels vary with ∆ c and ∆ 2 , and how they combine to ζ. The energy shifts become especially large when any of the (bare) qubits are near-resonant with the cavity (vertical line in panels a-c at ∆ c = ω c − ω 1 = 0), or with the "anharmonic transition" (horizontal lines at ∆ 2 ≈ ±α). They jump in sign when crossing through the resonances, see Gate duration after which the field-free evolution produces a fully entangling gate. The points labeled 1-10 mark the parameters for some existing implementations of entangling gates for fixed-frequency transmons, cf. Refs. [11][12][13][14][15][16][17][18][19][20]. For points 4, 2, 6, ∆c/g takes a value outside of plotted region (58, 95, and 43, respectively). The gate durations are 150 (1) , 220 (2) , 110 (3) , 200 (4) , 500 (5) , 350 (6) , 350 (7) , 50 (8) , 120 (9) , and 200 (10) ns. The horizontal gray dotted lines indicate ∆2/α = −2, 0, 2, for visual reference. (c) Ratio of the dressed qubit decay rate to the bare qubit decay rate. The point labeled by the red X indicates possible parameters where to implement a full universal set of gates. The bare qubit and cavity frequencies ω1, ω2, ωc at this point are shown in the inset, together with their "dressed" value E10, E01, E001, i.e., the eigenenergies of the corresponding logical levels, respectively the eigenenergy E001 of the eigenstate closest to the bare state |001 . (d, e) For a horizontal slice through the parameter space as indicated in panel a, value of ∆E01 ≡ E01 − ω2, ∆E10 ≡ E10 − ω1, ∆E11 ≡ E11 − ω1 − ω2, and ∆Ecav ≡ E001 − ωc. (f, g) Energy shifts for vertical slice as indicated in panel a. In panels (e, g), the difference between the solid orange and dotted blue curve is ζ, see Eq. (2). The gray region highlights the quasi-dispersive straddling qutrits (QuaDiSQ) regime, cf. Fig. 2 (n). The vertical thick dashed line and thin solid lines mark the parameters of point X, and the cavity-qubit or qubit-cavity resonances, respectively.
frequencies are very close. Such a large static interaction would allow for very fast entangling gates. The field-free evolution for a duration T induces an entangling gate with a concurrence [2] C 0 (ζ, T ) = sin ζT 2 .
A perfectly entangling gate is first reached after T π = π/ζ. This time is shown in Fig. 1 (b) as a function of ∆ c /g and ∆ 2 /α. The spontaneous decay of the qubit with a decay rate of γ implies a lower bound on the error of any two-qubit gate. As shown in Methods, for a fixed gate duration T , we find this bound to be The decay rate of the logical eigenstates may increase relative to the bare states, due to overlap with excited bare transmon or resonator levels with shorter lifetimes. The ratio of the effective ("dressed") qubit decay rate to the bare decay rate is shown in Fig. 1 (c) as a function of ∆ c /g and ∆ 2 /α. At the resonance lines, the increase in dissipation is most severe. However, even in the worst case, it is within only a factor of ≈ 2.3 of the bare rate. This is in contrast to the static interaction ζ and the duration to imple-ment an entangling gate, T π , whose values span several orders of magnitude. Choosing a specific point for gate implementation within the parameter space may therefore dramatically increase implementation speed without incurring substantially larger losses. It remains to be seen, however, whether parameter space points with a large static interaction and a small T π also allow for implementation of local operations.

Entanglement creation and removal
We now consider the use of a control field (t) = 0 for the realization of quantum gates. Before targeting specific gates that build up a universal set, we study a prerequisite-the basic capability to entangle and disentangle. To this end, we minimize and maximize entanglement, measured by the concurrence, while minimizing loss due to decay or leakage from the logical subspace, at all points of the parameter landscape. At each point, we employ a three-stage optimization, as described in Methods.
Any entanglement that is created or removed by application of a microwave pulse is relative to the entanglement for the field-free evolution. Therefore, Fig. 2 (a-c) shows the field-free entanglement C 0 (ζ, T ) (Eq. (3)) as a function of ∆ c /g and ∆ 2 /α for gate durations T =200, 50, and 10 ns. The oscillatory behavior in Fig. 2 (a, b) results from the fact that T π/ζ for these longer gate times.
The basic capability to carry out local operations, quantified as C SQ , is analyzed in Fig. 2 (d-f). Entanglement can be reduced below the field-free values in a large part of the parameter space by applying suitably shaped pulses, as reflected by the dark areas in Fig. 2 (df). For long gate durations (T = 200 ns), the concurrence can be brought close to zero over nearly the entire parameter landscape, see Fig. 2 (d). As the pulse duration gets shorter, the parameter region with significant fieldfree entanglement becomes smaller ( Fig. 2 (b, c)). At the same time, bringing the concurrence to zero becomes more difficult. This is true in particular along the resonance lines ω c ≈ ω 1 , ω 2 , for −α < ∆ 2 < α, which is the region for which there is substantial field-free entanglement even for very short gate durations (T = 10 ns, Fig. 2 (c)). Application of a pulse cannot further reduce the concurrence, see Fig. 2 (f). Thus, a speed limit for pulse-induced removal of the non-local nature of the interaction, (C 0 > 0 → C SQ = 0) is found around 10 ns.
The basic capability to create entanglement, quantified as C PE , with microwave control is shown in Fig. 2 (gi). Comparison with the field-free entanglement C 0 in Fig. 2 (a-c) reveals that adding microwave controls enlarges the region of parameter space where entanglement can be created, as expected. For long gate durations (T = 200 ns), entanglement can be generated in large parts of the parameter space, in particular in the region −α < ∆ 2 < α around the resonance of both qubits (∆ 2 ≈ 0). As the gate duration becomes shorter (T = 50, 10 ns), the region where perfect entanglers can be implemented shrinks ( Fig. 2 (h, i)). Still, even for very short gate durations, it is possible to generate pulseinduced perfect entanglers along the line ∆ 2 = 0.
Optimal QuaDiSQ regime for implementing a universal set of gates The realization of a full universal set of gates [39] requires a region in the parameter space that allows for both entangling and local gates. This is true both for tunable and fixed-frequency qubits, since in the former case the tuning range should be kept small in order to avoid dephasing (flux) noise. To identify such regions, we inspect the product It is noteworthy that points with large field-free entanglement ζ, cf. Fig. 1 (a), are not ideal candidates for fixed-frequency qubits. This is because each qubit transition is strongly dressed by the other qubit in this case. While it allows for the easy realization of an entangling (CPHASE) gate in under 10 ns [40], cf. Fig. 1 (b), it prevents single-qubit gate operations independent of the state of the other qubit. Similarly, the horizontal line in Fig. 2 (m-n) suffers from a resonance between the qubits (∆ 2 = 0) which impairs their individual addressability despite the absence of field-free entanglement.
We conclude that an optimal regime for fixedfrequency transmons requires simultaneously (1) very small static entangling strength ζ and (2) a small dispersive parameter (ω 1,2 − ω c )/g, while (3) also avoiding resonance of the qubit frequencies. This is the case in the region encircled by the turquoise dashed ellipse in Fig. 2 (n), for which we coin the name Quasi-Dispersive Straddling Qutrits (QuaDiSQ).
The quasi-dispersive regime of cavity-QED is defined as 1 < (ω 1,2 − ω c )/g < 10, between the near-resonant ((ω 1,2 −ω c )/g < 1) and the dispersive ((ω 1,2 −ω c )/g > 10) regime [41], both of which are excluded by conditions (1)- (3). Meanwhile, the minimal static coupling ζ between the qubits (cf. black/white disc in Fig. 1 (a,b), and crossing solid orange and dotted blue lines in Fig. 1 (e,f)) is enabled by one transmon frequency being situated between the first and second (|1 → |2 ) transition of the other transmon (hence "qutrits"), −1 < ∆ 2 /α < 1. This region is reminiscent of the so-called straddling regime of single-transmon circuit QED [9], with the distinction that the qutrits are straddling one another here, rather than the cavity. This allows the level repulsion to act with opposite signs, see Fig. 1 (d,f), effectively resulting in destructive interference and avoiding unwanted entanglement. The qubits being sufficiently separated still allows individual addressing with microwave pulses. In (k-n) Combined measure of success CPE × (1 − CSQ) for the ability to both produce local gates (CSQ = 0) and perfectly entangling gates (CPE = 1). In all panels, loss from the logical subspace (through excitation or dissipation) is indicated as transparency against the (black) background. The turquoise ellipse encircles the "QuaDiSQ" region which we identify as optimal for the control of entanglement, see text for details. The point marked by the red X is a candidate for the implementation of a universal set of gates.
particular, frequency crowding is also avoided, with the nearest unwanted transition at least 100 MHz detuned for all the relevant transitions. We emphasize that this optimized mechanism is a general principle for quantum information processing, whereby destructive interference can be engineered on a multi-qubit state to mutually cancel out level repulsions coming from nearby levels and thereby enable both driving of local and nonlocal transitions from the level.
Lastly, the decoherence rate is only marginally increased in the quasi-dispersive regime relative to the bare rate, cf. Fig. 1 (c). At the point marked X in Figs. 1, 2 the corresponding error limit is shown in Fig. 4  provides an effective optimum for these combined error mechanisms. The exact corresponding parameters are listed in Fig. 1 (c).

Quantum speed limit for a universal set of gates
A universal set requires one entangling gate, with many choices available. We therefore analyze which perfect entanglers can be implemented before determining the quantum speed limit for a universal set of gates. To this end, we employ the representation of two-qubit gates in the Weyl chamber (see Methods). Figure 3 indicates the perfect entanglers that are successfully implemented   Fig. 2 (g-i). The total Weyl chamber and the polyhedron of perfect entanglers are indicated by thin gray and bold magenta lines, respectively.
in Fig. 2 (g-i). For long gate durations (T = 200 ns, Fig. 3 (a)), a large part of the polyhedron of perfect entanglers is covered. That is, nearly any perfect entangler can easily be implemented. This changes as the pulse duration gets shorter. For T = 50 ns (Fig. 3 (b)), the reached perfect entanglers are focused around the L point (diagonal gates), and the lines Q-A 2 and Q-P, M-N (local equivalence classes of √ iSWAP-√ SWAP). We can therefore empirically identify the dominant effective qubit-qubit interaction term as a linear combination of σ xσx ,σ yσy , andσ zσz . For very short gate durations (T = 10 ns, Fig. 3 (c)), the obtained gates cluster strongly around the points Q and M (local equivalence class of √ iSWAP), corresponding to a dominant interaction term σ xσx +σ yσy =σ +σ− +σ −σ+ . This is in agreement with the interaction obtained from effective two-qubit models in the dispersive regime [42]. Our results suggest that targeting √ iSWAP (or a gate that is locally equivalent to √ iSWAP) is natural also in the non-dispersive case.
Analysis of the gate fidelities of the perfect entanglers allow us to extract the quantum speed limit for the generation of entanglement. The smallest error of any perfect entangler in Fig. 2 (g-i) is shown as ε PE avg in Fig. 4 (a), together with results for additional gate durations. ε PE avg is to be compared to the error ε 0 avg due to qubit decay, Eq. (4). For T > 10 ns, we find the errors to be limited by the qubit lifetime. For T = 5 ns, the error increases significantly from the lifetime-limited bound, indicating that the main source of error is now the lack of capability to create entanglement. We thus find T ≈ 10 ns to be the minimum time necessary to build entanglement.
Next, we determine the quantum speed limit for local gates in the QuaDiSQ regime, using the parameters of point X. The smallest gate error of a Hadamard gate on qubit 1 (H ⊗1 1) is shown as ε H1,X avg in Fig. 4 (b). We find that the Hadamard gate can be implemented near a lifetime-limited error bound up to a gate duration of 50 ns. Targeting the remaining single-qubit gates in the universal set, 1 1 ⊗ H, S π/8 ⊗1 1, and 1 1 ⊗ S π/8 (with S π/8 = diag[1, exp(−iπ/4)]) yields comparable errors. We therefore identify 50 ns as the speed limit for singlequbit operations.  (Figs. 1, 2) in the optimal QuaDiSQ regime, compared to the lifetime-limited error bound, ε 0,X avg , at point X, which differs from the global limit ε 0 avg by a factor of 1.2 due to the increase in the effective decay rate, cf. Fig. 1 (c). The minimum in ε H1,X avg may be taken as an estimate of the global quantum speed limit for implementing a universal set of gates.
For implementing a general perfect entangler at point X with a gate error close to the theoretical limit, we find a minimum gate duration of T = 20 ns, located in the Weyl chamber on the line Q-P. This is slightly larger than the global limit of 10 ns. It is worth noting that this entanglement rate is nonetheless much faster than what is achievable through resonant qubit-coupling (typically limited to the inverse of the cavity-mediated qubit-qubit coupling, ≈ 20 MHz at point X). This suggests that even for a frequency-tunable architecture, it can be advantageous to supplement static qubit coupling with microwave drives for even faster gates.
When decomposing an arbitrary computation of a quantum computer into a universal set of single and twoqubit gates, it is important to use an entangling operation in this set that yields the smallest possible number of elementary gates. To this end, the perfect entangler must be taken from a small subset of "special" perfect entanglers. Specifically, gates along the line L-A 2 in the Weyl chamber are known to be efficient [43]. The midpoint of this line, known as the BGATE (see Methods), allows for the most efficient realization of a universal set. It requires at most two applications in the decomposition of an arbitrary gate, compared to three for iSWAP. We therefore target a BGATE at parameter point X. Compared to iSWAP/ √ iSWAP, this entails small tradeoffs in the gate duration and in the complexity of the control pulses. We find a minimum gate duration for a BGATE of 50 ns.

Realization of a universal set of gates
The optimized pulses and the corresponding dynamics (in the interaction picture) for the entire universal set of gates, consisting of H ⊗1 1, 1 1 ⊗ H, S π/8 ⊗1 1, 1 1 ⊗ S π/8 , and BGATE, are shown in Fig. 5. The average gate error over the entire universal set is 4.3 × 10 −3 , compared to the lifetime-limited error bound of ε 0 avg = 3.6 × 10 −3 . In all cases, the achieved gate error is within a factor of 1.3 of the lifetime-limited error bound. It is correlated with the mean of the population outside the logical subspace, P outside , which in turn is correlated with the mean pulse amplitude (black doted horizontal lines). Population that is excited to these higher levels is more strongly affected by dissipation, because the decay rate scales as the square root of the (bare) quantum numbers. In fact, evaluating the gate error without dissipation (ε no diss. avg in Fig. 5) shows that the error is dominated by the decay of the qubit, not by failure to implement the desired gate. As qubit lifetimes increase with further technological advances of the transmon platform, errors will approach the value ε no diss. avg , consistently below the quantum error correction limit.
For all gates, we obtained the lowest error when the optimization was performed in a rotating frame with frequency centered exactly between the two qubit frequencies. While for the implementation of the Hadamard gate, the spectra show active frequencies largely around the two qubit transitions (indicated by the dashed green and orange lines), for the phase gate as well as the BGATE, the pulses also have strong off-resonant components. These are predominantly to the left of the qubit transitions, and thus affect the anharmonic transitions more strongly, driving pathways outside of the logical subspace.
The derivative of the complex phase, dφ/dt in Fig. 5 provides a rough estimate of the time-frequency characteristics of the optimized pulse. For several of the pulses, we find distinct differences in the active frequencies over the duration of the pulse. For example, for 1 1 ⊗ H, the pulse alternates between the two qubit frequencies. For 1 1 ⊗ S π/8 , the strongly off-resonant driving is interrupted by two brief periods of near-resonant driving. For the BGATE, the first half of the pulse acts near-resonant on the two qubits, whereas in the second half, strong offresonant kicks are applied. It will be interesting to see in future work whether the observed feature of alternating periods of near-resonant and off-resonant driving may be exploited in an analytic design of control fields near the quantum speed limit.
In practice, any waveform generator will also have bandwidth and filtering restrictions that must be taken into account [37,44,45]. In order to identify the quantum speed limit, we have not considered any such restrictions. Consequently, the pulses have bandwidths in the 500-1000 MHz range. While this is well within reach of the most current waveform generators [46], it may be beyond the limitations of older devices typically used in experiments [44]. In such a case, moving away from the speed limit of T = 50 ns to e.g. T = 100 ns may provide several practical advantages, as the longer gate durations give room for applying spectral constraints. Furthermore, gate errors will approach the lifetime limit more closely, giving some room to compensate for imperfections in the control scheme. In the supplementary material, we show pulses implementing a universal set of gates at T = 100 ns with a spectral constraint of ±200 MHz around the center of the rotating frame.
Robustness with respect to fluctuations can also be added as an explicit optimization objective [47]. For example, for a Hadamard gate on transmon 1 at a longer gate duration of T = 100 ns, we find that by simultaneously optimizing over multiple variations of the system we can account for errors in the pulse amplitude of 1% with relative ease, incurring a worst-case drop in the gate error from 7.3 × 10 −3 to 7.6 × 10 −3 , or 8.9 × 10 −5 to 3.7 × 10 −4 without spontaneous decay of the qubit. In general, both technical constraints and noise sources should be addressed with respect to a specific experimental setup. Optimal control techniques towards this end are readily available [48,49].

DISCUSSION
Superconducting qubits with a shared transmission line come with great tunability in system parameters. We have addressed the question of how to choose these parameters in order to implement a universal set of gates with the best possible errors and shortest possible gate durations. We have found that the parameter landscape may be fully characterized by the qubit-qubit detuning in units of the anharmonicity and the qubit-cavity detuning in units of the qubit-cavity coupling. Analysis of the field-free qubit dynamics revealed novel strategies for implementing both entangling and local quantum gates. Resonances between qubit levels, or with the cavity, can generate very large interactions even with-

Re[Ψ(t)]
Hadamard (2) ε no diss. avg is evaluated without any decay in the system. The dissipative gate error ε diss. avg should be compared to the lifetime-limited error bound ε 0,X avg = 3.6 × 10 −3 for an assumed lifetime of 13.3 µs. The dynamics are shown for each of the four logical basis states, in terms of amplitude and phase of the projection onto the logical basis states (color-coded) within the unit circle. The dashed circle marks an amplitude of 1/ √ 2. The values at initial (final) time are indicated by the colored squares (bullets). The population P outside outside of the logical subspace is plotted over time. The properties of the optimized pulse (t) for each gate are shown, from top to bottom, in terms of the pulse amplitude | |, the (smoothed) derivative of the complex phase dφ/dt, and the spectrum |F ( (t))| for frequencies δ relative to the rotating frame at ωr/2π = 5.9325 GHz. The derivative dφ/dt gives an approximation for δ(t). The dressed qubit frequencies are indicated as green and orange dashed lines, the dressed "anharmonic transitions" |1 → |2 as green and orange dotted lines. The black dotted lines correspond to the mean value of P outside and | |, respectively. out any external drive. In these regions of the parameter space, we showed that a quantum speed limit of 10 ns can be obtained for the controlled generation or removal of entanglement. However, a strong, always-on interaction cannot yield a fast universal set of quantum gates with low error, due to the difficulty to generate specific single qubit gates. These regions of the parameter space are therefore only of interest to setups that employ tunable qubits. For fixed-frequency qubits, high-fidelity quantum gates are best implemented with system parameters that do not yield any static interaction but where strong in-teraction can be engineered in a time-dependent fashion by a suitably shaped microwave pulse.
A key result of this work is the demonstration that the conditions for the realization of a universal set of gate for short gate durations are best met outside of the dispersive regime. Parameter regimes where the dispersive approximation is not valid have remained under-explored to date, since the Hamiltonian in this regime cannot easily be approximated to an analytically treatable model. In this work, we have used a fully numerical approach to explore the complete parameter space without restric-tions and thereby identified a novel parameter regime as the optimal operating regime for universal quantum computing in which the global quantum speed limits for a given architecture are attained. In this new regime, which we term the Quasi-Dispersive Straddling Qutrits (QuaDiSQ) regime, we have shown that two critical but competing requirements for realizing a universal set of gates are achieved through destructive interference of multiple near-resonances. Specifically, one can maintain the near-resonances that allow for fast entangling gates, while also minimizing static qubit-qubit interaction allowing for the implementation of local gates.
Using a universal set of gates consisting of the non-local BGATE, as well as Hadamard and phase gates on each of the two qubits, we derived control protocols to realize gates with errors within a factor of 1.3 of the lifetimelimited error bound for a gate duration of 50 ns. Given this identification of the new QuaDiSQ regime and characterization of the fundamental quantum speed limits, further requirements and constraints of a specific experimental setup can be taken into account. For example, by extending the gate duration to 100 ns, we can enforce a spectral width of the pulses of ±200 MHz around the center frequency. Robustness to experimental parameters such as variations in the pulse amplitude can further be taken into account for specific setups.
Other choices for the entangling operation are conceivable as well. For long gate durations, almost all perfect entanglers can be implemented. When the gate duration is shortened, only the √ iSWAP gate plus all gates that differ from √ iSWAP by local operations survive. This indicates that √ iSWAP is the most natural entangling gate for the transmon architecture. This is in agreement with earlier findings obtained in the dispersive regime [42], suggesting that effective two-qubit models may be still be qualitatively correct even when they break down quantitatively.
The approach advocated here of identifying a few key parameters and exploring those with quantum optimal control is not limited to circuit QED platforms nor quantum computation. In fact, the paradigm of quantum technology is to engineer an often complex quantum system to resemble a comparatively simple model Hamiltonian (an anharmonic ladder system in the case of the transmon). Quantum engineering ensures, on one hand, that only a few parameters will be relevant in the analysis of the possible dynamics. On the other hand, isolation of the required quantum features typically implies limited control over the system. Ensuring the desired functionality (here a universal set of quantum gates) despite limited control (no or limited tunability of the qubits in the present case) is possible by harnessing some of the complexity of the quantum system (using multiple interfering sources of entanglement, as in the QuaDiSQ regime in our example). This paradigm finds many realizations at the interface of quantum optics and solid-state physics, for the design of devices to tackle tasks such as sensing of external fields with quantum-enhanced sensitivity or transmission of quantum information with cryptographic capability. Quantum optimal control is essential to achieving this goal.

Optimization procedure
The results shown in Fig. 2 are the result of optimal control theory applied to a large sample of parameter points (∆ 2 /α, ∆ c /g). At each point, a multi-stage optimization procedure [50] is employed to either minimize or maximize entanglement, proceeding in three stages.
In the first stage, we presume the control pulse to take the form where B(t) is a Blackman shape, similar to a Gaussian, but exactly zero at initial and final time. In the rotating frame, the driving frequency ω r vanishes and instead is reflected in the Hamiltonian (1). The width of the shape is constant, extending over the entire duration. Thus, the control problem has only two free parameters, the peak amplitude E 0 and the driving frequency ω r . Note that a complex (t) would be equivalent to a time-dependent phase of the pulse in the non-rotating frame. However, we find that the results in Fig. 2 do not change significantly when allowing for multiple frequency components or allowing a non-zero phase. The first optimization stage consists in choosing ω r randomly from within a range of 1 GHz around the qubit and cavity frequencies, and scanning the pulse amplitude E 0 systematically for values ∈ [10, 900] MHz. The best pulses are selected according to the functionals This takes into account both the concurrence C and the error due to population loss, whereÛ is the projection of the time evolution operator onto the logical subspace. The functional is written as a product, since the concurrence is only well-defined for a population loss error near zero. The selected pulses are the starting point for the second stage, a simplex (Nelder-Mead) optimization of the two free parameters E 0 and ω r in Eq. (5), using the same optimization functionals (6), (7). The third and last stage of optimization relaxes the constraint imposed by the simple analytical form (5), and uses Krotov's method [51] to continue optimization of (t) for an arbitrary perfect entangler, respectively an arbitrary local gate. As any gradient-based optimization approach, Krotov's method requires a differentiable functional. Since the concurrence is non-analytic in the control, we cannot employ Eqs. (6), (7) and need to resort to optimization in the Weyl chamber as described below. The pulse shape is now allowed to take complex values. The total optimization functional also includes a term tr[Û †Û ]/4 to penalize loss from the logical subspace [52].
The optimization for a complete universal set of gates uses a similar three-stage procedure as outlined above, but optimizes towards the BGATE (up to local operations, see below) and towards the four local gates using the gate overlap with the target gateÔ,

Optimization in the Weyl chamber
Any two-qubit gateÛ ∈ SU(4) can be written according to the Cartan decomposition [53,54], whereσ x ,σ y ,σ z are the Pauli matrices, andk 1,2 ∈ SU(2) ⊗ SU(2) are single-qubit, "local" operations. Taking into account symmetries, the coefficients c 1 , c 2 , c 3 take values c 1 ∈ [0, π] and c 2 , c 3 ∈ [0, π/2]. They may be interpreted as coordinates in a three-dimensional space; all the points that represent the possible two-qubit gates up to local operations then form a quarter-pyramid called the Weyl-chamber. It is depicted in Fig. 3. The named edges and vertex midpoints of the Weyl chamber correspond to some of the "standard" two-qubit gates (CNOT, CPHASE, SWAP, iSWAP, etc. ). The point (π/2, π/4, 0), i.e., the midpoint of the line L-A 2 , defines the BGATE that is the optimal perfect entangler for the universal set of gates, with the canonical representation Separating a quantum gate into local and non-local contributions through the Cartan decomposition allows to optimize for a quantum gate up to local operations by minimizing the "local invariants functional" J LI . It evaluates the geometric distance to a target point in the Weyl chamber [52]. This is especially appropriate in the context of universal quantum gates, as we assume that arbitrary single-qubit gates can be implemented.
The majority of two-qubit entanglers which form a polyhedron with the edge points L, M, A 2 , Q, P, and N, indicated in magenta in Fig. 3. In order to optimize for an arbitrary perfect entangler using a gradient-based method, one can define a "perfect entanglers functional" J PE that minimizes the geometric distance to the surface of the polyhedron [55]. Note that the opposite, optimizing for an arbitrary local gate (non-entangling), is achieved by applying the local invariants functional towards the points O or A 1 . Only when we need to implement a specific gate (e.g., the single-qubit gates in the universal set) do we employ the gate overlap J sm .
Any of the functionals defined in the Weyl chamber assume thatÛ ∈ SU(4). When the model includes levels outside the logical subspace, as is the case here, some of the population may remain in those spurious levels at final time T . The resulting projectionŨ to the logical subspace may no longer be unitary. In this case, we must include the loss in the functional [52], and evaluate J LI or J PE using the closest unitary gateÛ. IfŨ has the singular value decompositionŨ =VΣŴ † , then the closest unitary isÛ = arg min When evaluating the optimization success using J LI , we must find the local operationsk 1 ,k 2 in the Cartan decomposition such that ε avg is minimized. This is done by parametrizing an arbitrary single-qubit gate aŝ cos θ e iφ1 sin θ e iφ2 − sin θ e −iφ2 cos θ e −iφ1 .
Withk 1 andk 2 consisting of two single-qubit gates each, this gives a total of 16 free parameters that are easily determined through numerical minimization. For J PE , the procedure is the same, except that the appropriate value of the Weyl coordinates c 1 , c 2 , c 3 must first be determined by projectingÛ onto the surface of the polyhedron of perfect entanglers (assumingÛ itself is not already a perfect entangler).

Average Gate Error
While the functionals J LI , J PE , or J sm are suitable for numerical purposes to steer an optimization, they are not directly accessible to measurement and can thus not be used to objectively evaluate the success of gate implementation. The experimentally relevant measure of success for implementing the target gateÔ with a dynamical map E is the average gate fidelity respectively the average gate error ε avg = 1 − F avg . The error is easily evaluated numerically [56]. We report all gate errors in terms of ε avg , independent of the functional used in the optimization. In order to derive a lower limit for the achievable gate error for a given qubit decay rate, i.e., the lifetime-limited error bound, we consider the Hamiltonian of Eq. (1) with g = 0 and (t) ≡ 0. The cavity can then be integrated out, and the transmon Hilbert space can be truncated to two levels. In the interaction picture and without dissipation, there is no time evolution (Ô = 1 1). The Liouvillevon-Neumann equation can then be solved analytically for the qubit decay rate γ. Plugging the result into the formula for ε avg yields

Equation of motion
In order to evaluate the average gate error for any of the optimized pulses, we numerically solve the full Liouville-von-Neumann equation with the Hamiltonian (1), and the Lindblad operatorŝ A 1 = √ γb 1 ,Â 2 = √ γb 2 , andÂ 3 = √ κâ. The Hilbert space of the two transmons and the cavity are truncated at 5 and 6 levels, respectively. This has been checked to be sufficient for all pulses considered here. For the purpose of optimization, solving Eq. (8) is numerically too expensive. Instead, we solve the Schrödinger equation with an non-Hermitian effective Hamiltonian to mimic population loss, We also increase the decay rate of the highest qubit and cavity level to infinity as a method to ensure that the optimized pulses are well-described in the truncated Hilbert space. We stress that this non-Hermitian Hamiltonian is only used to effectively penalize population in strongly dissipative levels during the optimization. The final optimized pulses are then evaluated by solving Eq. (8); all reported dynamics and errors are obtained from this density matrix evolution. Both the Liouvillevon-Neumann equation and the Schrödinger equation with a non-Hermitian Hamiltonian can be solved efficiently and to high precision using an expansion into Newton polynomials in a Krylov subspace [57], as implemented in the Fortran QDYN package.