Introduction

While quantum processors continue to progress towards practical use, the errors present in current systems are still the most limiting factor. A dominant error in superconductor-based quantum computers is decoherence. There have been several proposals to mitigate it, either by designing new qubit types1,2,3 or by further optimizing the existing designs, typically focusing on increasing the coherence times of the circuit4,5,6. However, in the latter case, one often encounters trade-offs between different circuit properties. For example in transmon qubits, which have emerged as the most widely-used qubit type in large-scale experiments7,8,9,10,11, the suppression of charge noise comes at the cost of low anharmonicity which sets a lower bound on the duration of single-qubit operations12.

This illustrates the importance of understanding the different errors affecting the quantum hardware, as well as the fact that an informed design of the circuit parameters must consider a plethora of possible error sources which are not necessarily limited only to the coherence properties of the circuit, but also include leakage errors in single qubit gates due to low anharmonicity, state preparation errors due to finite-temperature heating effects13,14,15, as well as the parity-switching error presented in this manuscript. To elaborate further, only taking into account the coherence properties of the transmon and the low anharmonicity, one way to achieve better performance is to increase the transmon anharmonicity (in order to suppress potential leakage errors) while keeping the frequency of the transmon fixed. The latter condition, under the assumption of constant quality factors, ensures the coherence properties of the circuit remain unchanged. However this inadvertently leads into the regime where the transmon charge dispersion becomes more significant. One of the errors that is exponentially more pronounced in the low EJ/EC regime is related to the presence of electron-like excitations of the superconducting condensate, referred to as quasiparticles and the charge-parity of the transmon, thus prompting us to analyze these effects further so that a trade-off between the different error sources can be made.

Quasiparticles can be created through several mechanisms, and are known to cause different types of incoherent errors in superconducting qubit realizations16,17. Particularly, quasiparticle tunneling across the Josephson junction results in energy relaxation and dephasing in superconducting qubits18,19,20,21,22,23.

Such detrimental quasiparticle-induced effects have, in turn, motivated research for finding mitigation strategies such as normal-metal traps24,25,26, band-gap engineering27,28,29 and improved qubit design30. There have also been efforts towards designing superconducting qubits that are expected to be intrinsically robust against quasiparticle tunneling31.

Suppression of charge-noise susceptibility of the transmon is achieved by adding a large shunt capacitor in parallel with a Josephson junction of a Cooper-pair box12. However, the energy levels of the transmon exhibit a weak 2e-periodic charge dispersion and are, thus, not completely independent of the offset charge on the transmon island. Since the presence of a quasiparticle shifts the island charge by e, the energy spectrum of the transmon can, thus, be divided into two distinct manifolds based on the parity of the number of quasiparticles on the island as shown in Fig. 1a.

Fig. 1: A schematic representation of the effect of a parity switch on a single transmon and on a two-qubit gate.
figure 1

a The energy diagram of a single offset-charge-sensitive transmon with EJ/EC = 10 with two distinct parity manifolds, marked with + and −. While the ground state \(\left\vert 0\right\rangle\) also comprises of two distinct parity levels, the difference between them is not visible. b Illustration of two parity switching mechanisms. The vertical axis represents the energy, with the left and right regions corresponding to the two sides of the Josephson junction and the middle region corresponding to the insulator. The light gray area corresponds to the density of states of a BCS superconductor on both sides of the junction. Orange: A high energy photon breaks a Cooper pair (dark blue) thus generating two quasiparticles (light blue), with one quasiparticle tunneling across the junction. Red: A preexisting quasiparticle tunnels across the junction. c The energy level diagram of the states involved in the operation of a diabatic CPHASE gate. The Rabi oscillation between the levels \(\left\vert 11\right\rangle \leftrightarrow \left\vert 02\right\rangle\) is marked with green arrows, however the larger charge dispersion of the second excited state means that both parity states of the \(\left\vert 02\right\rangle\) level cannot be considered degenerate anymore. d The lumped element model of the tunable-coupler circuit used in the implementation of diabatic CPHASE gates, consisting of two computational transmons (dark and light blue) referred to in the following as Qubits 1 and 2 (with labels q1 and q2), a flux-tunable coupler (green), denoted with the label c, and capacitive couplings between the transmons (black). The readout resonators and drive lines for the implementation of single-qubit gates are not included in the schematics.

Switching of the charge-parity can occur either due to a preexisting quasiparticle tunneling across the Josephson junction on to the transmon island, or due to photon-assisted breaking of a Cooper pair, as depicted in Fig. 1b16. The timescale of these stochastic parity-switches is referred to as the parity-switching time and, as we argue in the Results section, it is typically much shorter compared to the time needed to obtain meaningful statistics from the quantum computer.

Since the difference between the two parity manifolds is strongly suppressed, quasiparticle effects in transmons have mainly been analyzed in the context of quasiparticle-induced decoherence. However, the parity-dependent energy splitting of the higher-excited states is much larger compared to that of the first-excited and ground states, as shown in Fig. 1a. Therefore, parity-switching in the second excited state can potentially become a notable source of error for example in diabatic CZ (controlled-Z) or more generally CPHASE (controlled-phase) gates32. There are different implementations of this gate scheme, most notably, statically coupled transmons33 and parametric drives34, tunable couplers and parametric drives35 and the frequency-tunable coupler architecture35,36,37,38,39,40,41,42,43,44. Nonetheless, in all of these gate implementations, the second excited state of one transmon is populated during the gate operation, suggesting the gate could be susceptible to the charge-parity of one transmon in the system.

Besides the ability to perform fast and high-fidelity gate operations, one of the main reasons for the introduction of the tunable coupler is the fact that the static residual-ZZ interaction between the qubits can be completely suppressed by tuning the coupler transmon to a specific frequency36,38, thus leading to reduced cross-talk and spectator qubit related decoherence45. When determining this frequency, it has been shown that the level repulsion between the higher levels of the tunable coupler system are relevant36,39, thus implying that quasiparticle dynamics might affect our ability to effectively decouple the qubits. On the contrary, these level repulsions can serve as a means to enhance the ZZ interaction strength and facilitate the implementation of CPHASE gates entirely adiabatically, without populating the higher excited states46. Building upon this insight, we anticipate that parity switching may likewise exert a significant influence on the performance of adiabatic gate implementations.

In this paper, we develop an analytical theory of parity switches in a tunable coupler based architecture which is currently one of the most promising platforms for large-scale quantum computing7,8,10,11. We demonstrate that the effects of a parity switch in a two-qubit gate can be a relevant source of error, even in the transmon regime. Moreover, we show that this previously unidentified error can, in certain parameter regimes, be the dominant quasiparticle-induced error mechanism during a diabatic two-qubit gate, indicated by a comparison to currently achievable parity switching rates observed in superconducting circuits. Furthermore, we demonstrate that the inherent stochastic nature of parity-switching events limits the ability to suppress any unwanted longitudinal interactions between the qubits coupled through tunable couplers47. We find that the magnitude of the unwanted interactions make this effect relevant as coherence times advance into the millisecond regime48,49.

Secondly, we introduce a systematic and versatile analysis aimed at identifying the optimal qubit design parameters. This comprehensive analysis involves accurately modeling dominant error sources and utilizing a performance metric tailored specifically for the quantum circuit under consideration. Importantly, we utilize analytical expressions for various infidelity terms within the performance metric, which significantly aids the scalability of our approach. By examining the performance metric’s reliance on the transmon qubit design parameters, we delineate an optimal parameter range that maximizes overall performance. We demonstrate how our analysis and findings can offer invaluable guidance in the advancement of transmon-based quantum processors. Furthermore, since the established methodology in this paper is general-purpose, we anticipate its broader applicability across various hardware platforms.

Results

Charge-parity modeling

In this section, we first present how a parity switch affects a single transmon qubit. The Hamiltonian of an individual transmon, not taking into account potential higher order contributions to the Josephson energy50, is given by12,51

$$\hat{H}=4{E}_{C}{\left(\hat{n}-{n}_{g}+\frac{P-1}{4}\right)}^{2}-{E}_{J}\cos \hat{\phi },$$
(1)

where the operator \(\hat{n}\) represents the dimensionless charge and \(\hat{\phi }\) is the superconducting phase operator across the Josephson junction. The variables EC, EJ and ng represent the charging energy of an electron (i.e. the energy required to add a single electron of the Cooper-pair to the transmon island), Josephson energy and dimensionless offset charge, respectively. The variables \(\hat{n}\) and \(\hat{\phi }\) are related via the canonical commutation relation \([\hat{\phi },\hat{n}]=i\). Additionally we have included a discrete parity variable P { − 1, + 1}, corresponding to the parity of the number of electrons that have tunneled across the junction. The parity term has the same effect as a shift of the offset charge by Δng = 1/2.

Denoting the eigenenergies of the original transmon Hamiltonian in Eq. (1) as Ei with i {0, 1, 2, 3, . . . }, the difference between the energy levels of the different parity states can be asymptotically approximated by12

$${E}_{m}^{+}({n}_{g})-{E}_{m}^{-}({n}_{g})\simeq {\epsilon }_{m}\cos (2\pi {n}_{g}),$$
(2)

where the superscript refers to the parity and the charge dispersion ϵm is given by

$${\epsilon }_{m}\simeq {(-1)}^{m}{E}_{C}\frac{{2}^{4m+5}}{m!}\sqrt{\frac{2}{\pi }}{\left(\frac{{E}_{J}}{2{E}_{C}}\right)}^{\frac{m}{2}\,+\,\frac{3}{4}}{{{{\rm{e}}}}}^{-\sqrt{8{E}_{J}/{E}_{C}}}.$$
(3)

While the exponential suppression of the charge dispersion with the ratio EJ/EC is well-known and the main reason for the introduction of the transmon, the formula in Eq. (3) also predicts a significant increase in the charge dispersion of higher excited states. This means that even though the effect of a parity switch may be small in the computational subspace, the effect can be significantly more pronounced if higher-excited states are involved in the operation of two-qubit gates. For example, we find ϵ2/ϵ1 ~ 40 for EJ/EC ~ 50. This difference is even more pronounced since certain effects scale with the square of the charge dispersion, as we show in the following.

Consequently, the Hamiltonian of the single transmon in Eq. (1) can be approximated in the low-energy manifold and in the asymptotic limit of EJ/EC 1 as

$$\hat{H}/\hslash \simeq \left[\omega +\delta \omega (P,{n}_{g})\right]{\hat{a}}^{{\dagger} }\hat{a}+\frac{\alpha +\delta \alpha (P,{n}_{g})}{2}{\hat{a}}^{{\dagger} }{\hat{a}}^{{\dagger} }\hat{a}\hat{a},$$
(4)

where \(\hat{a}\) are bosonic annihilation operators, and \(\hslash \omega \equiv [{E}_{1}^{+}({n}_{g})+{E}_{1}^{-}({n}_{g})]/2\simeq \sqrt{8{E}_{J}{E}_{C}}-{E}_{C}\) and \(\hslash \alpha \equiv [{E}_{2}^{+}({n}_{g})+{E}_{2}^{-}({n}_{g})]/2 - 2\hslash\omega\simeq -{E}_{C}\)22 are the parity-averaged expressions for the transmon (angular) frequency and the anharmonicity in the asymptotic limit, respectively. Here, we have taken into account the fact that the different parities have almost identical parameters, and the small differences between them are taken into account with the two parameters depending on the parity, δω(P, ng) and δα(P, ng). The parity P therefore divides the eigenstates of the Hamiltonian in Eq. (1) into two distinct manifolds, as illustrated in Fig. 1a. Due to the rapid scaling of the charge dispersion shown in Eq. (3), we neglect in the following the effect of the parity switching on the first excited state, i.e. we set δω = 0, and only focus on the second excited state. Consequently, \(\delta \alpha (P,{n}_{g})=P{\epsilon }_{2}\cos (2\pi {n}_{g})/(2\hslash )\).

We consider a non-adiabatic, i.e. diabatic, CPHASE gate based on the two-qubit gate scheme using tunable couplers that was analyzed in refs. 36,37,38,39,40 with similar schemes proposed in refs. 35,41,42,43. We show the circuit schematics of the tunable-coupler setup in Fig. 1d. Here, the two computational transmons, which we refer to as Qubits 1 and 2 (q1,2), are capacitively coupled with each other and to a third, frequency-tunable, transmon which is referred to as the tunable coupler or simply coupler (c).

The main operation principle of the diabatic CPHASE gate is shown in Fig. 1c. The CPHASE gate is implemented by tuning the frequency of the coupler closer to the frequency of the computational transmons by using a flux pulse. The conditional phase is collected during a Rabi oscillation between the \(\left\vert 11\right\rangle\) and \(\left\vert 02\right\rangle\) states of the computational qubits, as illustrated in Fig. 1c. We model the circuit pictured in Fig. 1d with the Hamiltonian38,39

$${\hat{H}}/{\hslash} =\sum\limits_{i\in \{{{\rm{q}}_1},{\rm{c}},{{\rm{q}}_2}\}}{\omega }_{i}{\hat{a}}_{i}^{{\dagger} }{\hat{a}}_{i}+\frac{{\alpha }_{i}}{2}{\hat{a}}_{i}^{{\dagger} }{\hat{a}}_{i}^{{\dagger} }{\hat{a}}_{i}{\hat{a}}_{i}-\sum\limits_{\mathop {i,j \in \{ {{\rm{q}}_1},{\rm{c}},{{\rm{q}}_2}\} }\limits_{i \ne j}}{g}_{ij}({\hat{a}}_{i}^{\dagger}-{\hat{a}}_{i})({\hat{a}}_{j}^{\dagger}-{\hat{a}}_{j}).$$
(5)

Since the second excited state of one of the computational transmons is significantly populated during the gate operation, the non-degeneracy of the two parity levels can have a direct effect, and therefore quasiparticle tunneling and photon-assisted pair breaking in the transmon can become a notable source of error in the gate operation. Further details about the numerical modeling and definition of the computational basis in relation to the states of the three constituent transmons are found in the Methods section.

The tunable coupler circuit Hamiltonian in Eq. (5) is difficult to analyze and we must often rely on numerical studies39. It is therefore beneficial to introduce an effective Hamiltonian that can approximate the physics of the system. Similar to refs. 38,52, we introduce the Schrieffer-Wolff transformation as a means to decouple the computational transmon states from the coupler states, and assume that the decoupled coupler remains in the ground state during the gate operations. Unlike the approach taken in refs. 38,52, where the Hilbert space of the local transmons is truncated to the computational subspace, we also include the \(\left\vert 02\right\rangle\) state to account for the Rabi oscillation that is used to accumulate the conditional phase. More details of the Schrieffer-Wolff transformation can be found in the Methods section, where we show that the diabatic gate can be modeled with the following effective unitary

$$\hat{U}(t)\hat{=}\begin{array}{l}\begin{array}{ll}\quad\,\;\left\vert 00\right\rangle \quad\left\vert 01\right\rangle \quad\left\vert 10\right\rangle \qquad \left\vert 11\right\rangle\end{array}\\ \quad\left(\begin{array}{ccccccc}1&&0&&0&&0\\ 0&&1&&0&&0\\ 0&&0&&1&&0\\ 0&&0&&0&&\sqrt{{P}_{11}}{{{{\rm{e}}}}}^{{{{\rm{i}}}}\phi (t)}\\ \end{array}\right)\begin{array}{c}\left\vert 00\right\rangle \\ \left\vert 01\right\rangle \\ \left\vert 10\right\rangle \\ \left\vert 11\right\rangle \end{array}.\end{array}$$
(6)

Here, we have denoted the conditional phase with ϕ(t) and the population of the \(\left\vert 11\right\rangle\) state with P11. Note that this operator is not necessarily trace-preserving, as part of the population of the \(\left\vert 11\right\rangle\) state might remain in the \(\left\vert 02\right\rangle\) state, due to potential calibration errors. More explicitly, P11 is given by

$${P}_{11}(t)=1-\frac{2{\tilde{g}}_{11,02}^{2}}{{\Omega }^{2}}\left[1-\cos (\Omega t)\right],$$
(7)

where we have defined the qubit-qubit detuning and the Rabi frequency of the \(\left\vert 11\right\rangle \leftrightarrow \left\vert 02\right\rangle\) transition as \(\tilde{\Delta }={\tilde{\omega }}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\tilde{\omega }}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) and \(\Omega =\sqrt{{(\tilde{\Delta }-{\tilde{\alpha }}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})}^{2}+4{\tilde{g}}_{11,02}^{2}}\), respectively. The parameters with the tilde denote the perturbed parameters of the original full Hamiltonian from Eq. (5), which were derived by applying the Schrieffer-Wolff transformation (see Methods, more specifically Eqs. (32)–(35)). The conditional phase in Eq. (6) is given by

$$\phi (t)=\frac{1}{2}\left[({\tilde{\alpha }}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}-\tilde{\Delta })t+\pi \left(1-{{{\rm{sign}}}}\{\cos (\Omega t/2)\}\right)\right]+\arctan \left(\frac{\tilde{\Delta }-{\tilde{\alpha }}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}}{\Omega }\tan (\Omega t/2)\right).$$
(8)

Eqs. (7) and (8) can be used to assess the susceptibility of the gate parameters to a small perturbation, such as a parity switch. Due to the larger charge dispersion of the second excited state, and the fact that the second excited state of Qubit 2 (q2) is populated during the gate operation, we can assume that the main contribution of the parity switch is the perturbation of the anharmonicity \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\).

By treating the parity-dependent contribution to the anharmonicity δα from Eq. (4) as a small perturbation, we can obtain the parity-dependent expressions for the conditional phase and \(\left\vert 11\right\rangle\) population

$$\phi ({t}_{{{{\rm{g}}}}},{P}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})\,\approx \,{\phi }_{0}+{\left.\frac{\partial \phi }{\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}}\right| }_{t = {t}_{{{{\rm{g}}}}}}\delta {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}({P}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}},{n}_{g}),$$
(9)
$${P}_{11}({t}_{{{{\rm{g}}}}},{P}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})\,\approx \,1+\frac{1}{2}{\left.\frac{{\partial }^{2}{P}_{11}}{\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}^{2}}\right| }_{t = {t}_{{{{\rm{g}}}}}}{\left[\delta {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}({P}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}},{n}_{g})\right]}^{2},$$
(10)

where the Taylor expansion of the optimal gate parameters for a small perturbation of \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) evaluated at the mean (parity averaged) anharmonicity \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) from Eq. (4) was employed. At this point, we stress again that \(\delta \alpha (P,{n}_{g})=P{\epsilon }_{2}\cos (2\pi {n}_{g})/2\). While the above expressions are completely general also in the non-perturbative regime, the relations given in Eqs. (7) and (8) can be used to obtain analytical expressions for \(\partial \phi /\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) and \({\partial }^{2}{P}_{11}/\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}^{2}\), which determine the susceptibility of the gate to charge-parity switches.

We realize from Eq. (7) that the implementation of a high-fidelity gate with an arbitrary conditional phase ϕ0 requires P11(tg) = 1, otherwise some population remains outside of the computational subspace. Therefore, the following condition for the gate time tg must hold: Ωtg = n 2π, in which \(n\in {\mathbb{N}}\) is an integer number. This condition enables us to further simplify the relations for the susceptibility of the conditional phase to a parity switch, and we obtain up to the leading order,

$${\left.\frac{\partial \phi }{\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}}\right| }_{t = {t}_{{{{\rm{g}}}}}}\approx \frac{{t}_{{{{\rm{g}}}}}}{2},$$
(11)

and

$${\left.\frac{{\partial }^{2}{P}_{11}}{\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}^{2}}\right| }_{t = {t}_{{{{\rm{g}}}}}} \sim {{{\mathcal{O}}}}\left(\frac{{\tilde{g}}_{11,02}^{2}}{{({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}})}^{4}}\right).$$
(12)

In the derivation of Eq. (11) we have neglected the terms proportional to \({g}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}\rm{c}}{g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}\rm{c}}/{({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}-{\omega }_{{{{\rm{c}}}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})}^{3}\) and \({g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}\rm{c}}^{2}/{({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}-{\omega }_{{{{\rm{c}}}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})}^{3}\) and higher orders. Additionally Eq. (12) only contains the lowest order scaling of the result. A list of all the assumptions used in the derivation of Eqs. (11) and (12) is included in the Methods section.

Gate fidelity limitations

So far we have quantified how a parity switching event can affect the parameters of the gate unitary. In order to describe the gate performance in a quantum circuit, we also need to consider how frequently parity switching events occur.

Experiments of parity switching lifetimes typically show the parity switching time to lie in the broad range of TP ~ 100 μs − 1 s51,53,54,55,56,57. Even though the parity switching lifetime of transmons might increase in the future, e.g., due to better design and improved shielding, it appears that the parity lifetime may be fundamentally upper bounded by the high-energy quasiparticle burst events, which are observed to happen once every 10–50 s58,59,60,61.

Comparing the realistic range of parity lifetimes of superconducting qubits to the duration of a single two-qubit gate tg, which is typically in the range of tens to hundreds of nanoseconds36,37,41,43,62, we observe that tgTP. However, any meaningful application of a quantum computer will include re-running an algorithm, comprised of a large number of non-parallel gates Ngates, in order to reduce the statistical uncertainty of the observable being evaluated. Denoting the number of runs by Nshots, the uncertainty of any observable generally scales as \(1/\sqrt{{N}_{{{{\rm{shots}}}}}}\)63 and therefore Nshots must be large. All together, the time needed to execute a full algorithm Talg roughly scales as TalgNshotsNgatestg, but it may be realistically even longer, due to the time needed to measure and reinitialize the quantum computer, and any possible pulse schedule compilation of the control electronics64.

As an example, in refs. 7,9,65 state of the art devices were run for a total of several minutes in order to obtain meaningful results, which implies the following clear separation of timescales

$${t}_{{{{\rm{g}}}}}\,\ll \,{T}_{{{{\rm{P}}}}}\,\ll \,{T}_{{{{\rm{alg}}}}}.$$
(13)

The left side of Eq. (13) indicates that the probability for a parity switch occurring during the operation of a single gate is very low, while the right-hand side suggests that a large number of parity switches can occur during an execution of an algorithm. This means that the effect of the charge-parity switch (CPS) on a diabatic CPHASE gate can be described by the following Kraus operators66 acting on the two-qubit density matrix \(\hat{\rho }\)

$${{{{\rm{CPHASE}}}}}_{{{{\rm{CPS}}}}}[\hat{\rho }]={\hat{U}}_{-}\hat{\rho }{\hat{U}}_{-}^{{\dagger} }+{\hat{U}}_{+}\hat{\rho }{\hat{U}}_{+}^{{\dagger} },$$
(14)

where \({\hat{U}}_{\pm }\) are Kraus operators corresponding to the different parity implementations of the two-qubit gate. Equation (14) can be interpreted as a stochastic application of two different gate operators; by assuming that the target conditional phase is ϕ0, they can be written as

$${\hat{U}}_{\pm }(t)=\frac{1}{\sqrt{2}}\left(\begin{array}{cccc}1&0&0&0\\ 0&1&0&0\\ 0&0&1&0\\ 0&0&0&\sqrt{1-\frac{\delta {P}_{11}}{4}}\,{{{{\rm{e}}}}}^{i{\phi }_{0}\pm i\,\frac{\delta \phi }{2}}\\ \end{array}\right),$$
(15)

with \(\delta \phi =\partial \phi /\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\,{\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\cos (2\pi {n}_{g}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})/\hslash\) and \(\delta {P}_{11}={\partial }^{2}{P}_{11}/\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}^{2}\,{\left[{\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\cos (2\pi {n}_{g}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})/\hslash \right]}^{2}/2\), which is a result of Eqs. (9) and (10), together with \(\delta \alpha (P,{n}_{g})=P{\epsilon }_{2}\cos (2\pi {n}_{g})/(2\hslash )\). We have additionally acknowledged that both parities are equally likely. Note that the channel defined in Eq. (14) is not trace preserving when δP11 > 0. More specifically \({{{\rm{tr}}}}\{{{{{\rm{CPHASE}}}}}_{{{{\rm{CPS}}}}}[\hat{\rho }]\}\le {{{\rm{tr}}}}\{\hat{\rho }\}\), where the equality holds when δP11 = 0. In this case the map is both completely positive and trace preserving and the completeness relation of the Kraus operators \(\sum\limits_{i = +,-}{\hat{U}}_{i}^{{\dagger} }{\hat{U}}_{i}={\mathbb{I}}\) holds.

We have defined the Kraus operators in Eq. (15) so that there are small errors associated with each parity state. We show in the following that this corresponds to higher average gate fidelities, compared to having one parity state with a perfect fidelity and the second parity state with a larger error. This assumption is, therefore, equivalent to optimally calibrating the gate with respect to the parity switching error.

Using the Kraus operator description in Eq. (15) allows us to make a formal analysis of the fidelity of an arbitrary conditional phase gate in the presence of parity switches and resulting phase and leakage errors.

The average gate fidelity \({{{\mathcal{F}}}}\) in the presence of leakage can be computed by67,68

$${{{\mathcal{F}}}}[{{{{\rm{CPHASE}}}}}_{{{{\rm{CPS}}}}}]=\frac{\frac{1}{d}{\sum }_{i = +,-}{\left| {{{\rm{tr}}}}\left\{{\hat{U}}_{{{{\rm{CPHASE}}}}}^{{\dagger} }{\hat{U}}_{i}\right\}\right| }^{2}+1-L}{d+1},$$
(16)

with the leakage parameter \(L=1-{{{\rm{tr}}}}\left\{{\hat{U}}_{{{{\rm{CPHASE}}}}}^{{\dagger} }\left[\sum\limits_{i = +,-}{\hat{U}}_{i}{\hat{U}}_{i}^{{\dagger} }\right]{\hat{U}}_{{{{\rm{CPHASE}}}}}\right\}/d\). Here \({\hat{U}}_{{{{\rm{CPHASE}}}}}\) is the unitary operator of an ideal CPHASE gate and d is the dimension of the computational Hilbert space, which in our case is d = 4.

The fidelity \({{{\mathcal{F}}}}\) of the operation given in Eq. (14) for small perturbations δϕ and δP11 is given by the following expression

$${{{\mathcal{F}}}}\approx 1-\frac{3}{80}{\left[{\frac{\partial \phi }{\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}}\Bigg| }_{t = {t}_{{{{\rm{g}}}}}}{\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\cos (2\pi {n}_{g}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})/\hslash \right]}^{2}+\frac{1}{32}{\frac{{\partial }^{2}{P}_{11}}{\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}^{2}}\Bigg| }_{t = {t}_{{{{\rm{g}}}}}}{\left[{\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\cos (2\pi {n}_{g}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})/\hslash \right]}^{2}.$$
(17)

Here, we observe that the effect on the fidelity is of second order in the charge dispersion, due to the coherent nature of a conditional phase error. However, we have shown in Eq. (38) of the Methods section that the infidelity of a series of N gates is given by \(1-{{{{\mathcal{F}}}}}_{N}\approx N(1-{{{\mathcal{F}}}})\). This indicates that the error scales linearly with the number of gates, as is typical of incoherent errors, but quadratically in the error parameter δϕ, as is expected from a coherent error69,70.

In deriving Eq. (38), one can observe that the numerical prefactors in front of both terms (here 3/80 and 1/32) increase, and therefore the average gate fidelity decreases, if the gate is not calibrated in such a way that the error is equally distributed between both parity states. For this reason, we have chosen the Kraus operators as given in Eq. (15).

We conclude from Eq. (17), together with Eqs. (11) and (12), that the dominant effect of the parity switch event is the shift in the conditional phase, rather than leakage, since \({(\partial \phi /\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})}^{2}\,\gg \,{\partial }^{2}{P}_{11}/\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}^{2}\) in the perturbative regime considered in this manuscript. Moreover the magnitude of the shift in conditional phase is given by Eqs. (9) and (11). Further simplifying Eq. (38) therefore results in

$${{{\mathcal{F}}}}\approx 1-\frac{3}{320}{\left[{\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}{t}_{{{{\rm{g}}}}}\cos (2\pi {n}_{g}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})/\hslash \right]}^{2}.$$
(18)

Numerical simulations

Here, we compare the above results with the numerically exact treatment of full the Hamiltonian in Eq. (5). The parity switch effect is taken into account in the numerical experiments by considering the transmon Hamiltonian from Eq. (4). In other words, the system is simulated for all 23 = 8 possible parity states and the results are averaged accordingly.

According to Eq. (2), the magnitude of the effect also depends on the offset voltage of Qubit 2, which is typically not known. Thus, we assume for simplicity that \({n}_{g}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}=0\). Alternatively, as long as all the effects remain second order in the charge dispersion, one can also define an average charge dispersion as \({\bar{\epsilon }}_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}={\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\sqrt{\int\nolimits_{0}^{1}{{{\rm{d}}}}{n}_{g}{\cos }^{2}(2\pi {n}_{g})}={\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}/\sqrt{2}\).

To achieve high-fidelity gate simulations, it is crucial to carefully select the Hamiltonian parameters in Eq. (5). We have described how these parameters were chosen so that high-fidelity gates are possible with arbitrary ratios of EJ/EC of Qubit 2 in Table 1 of the Methods section. We also note that in general, we use parameters that closely resemble those in the implementation presented in ref. 36. We examine the effect of a parity-switch on the fidelity of a CZ gate and compare the perturbative analytical results to a full simulation of the Hamiltonian in Eq. (5). The pulse shape used in the simulations (defined in Eq. (27)) is parameterized by its amplitude A, ramping up time τb and plateau duration τc. The full duration of the flux pulse is therefore given by T = τc + 2τb.

The conditional phase of the gate in the simulations is obtained from propagating the state \(\left\vert \psi (t=0)\right\rangle =\frac{1}{2}{(1,1,1,1)}^{{{{\rm{T}}}}}\) and extracting the conditional phase of the \(\left\vert 11\right\rangle\) state. Figure 2a compares the analytical results in Eqs. (9) and (11) to the numerically obtained values and confirms that the two approaches agree up to a good accuracy. Since the magnitude of the charge dispersion in the numerical analysis is the same as in the analytical treatment, the error stems completely from the approximations made in evaluating \(\partial \phi /\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\). Furthermore, Fig. 2a also clearly shows that the parity-switching-induced shift in the conditional phase scales exponentially with the ratio of EJ/EC. This is due to the scaling of the charge dispersion in Eq. (3).

Fig. 2: Effects of a parity switch on the fidelity of a CZ gate.
figure 2

a Analytical predictions (red circles) of the conditional-phase difference of the CZ gate between the two parities. The theory predictions from Eqs. (11) and (9) are compared to the phase extracted from a full numerical simulation of the gate (blue crosses), with the Hamiltonian parameters from Table 1. The inset displays the relative absolute error between the analytical predictions and the numerical results. The x-axis represents the EJ/EC ratio (bottom) and second excited state charge dispersion (top) of the qubit with the higher frequency (Qubit 2). b Gate infidelity after a parity switch on Qubit 2, where the theory predictions (red circles) only take into account the effect on the conditional phase, according to Eq. (18). The green lines are estimates of the upper bound for the infidelity contribution of quasiparticle induced decoherence on the two-qubit gate system, with the parity lifetimes TP measured by different references: [1] Risté et al.54, [2] Diamond et al.53 and [3] Kurter et al.56. More specifically, references [1] and [2] have reported the values TP = 1.25 ms and TP = 2.5 ms respectively. In reference [3], parity-switching times between 1 ms and 1.5 s were demonstrated. For this reference, we have used the value of TP = 20 ms, which approximately corresponds to the median parity-switching time of all the samples. The gray region is the region in which errors due to unwanted transitions during the gate operation are more prominent and the effect of a parity switch is negligible. It therefore represents the lower bound of the infidelity, which is largely independent of the ratio EJ/EC.

In Fig. 2b, we show numerical data of the full gate fidelity after the parity switch and, as a comparison, the corresponding analytical result calculated using Eq. (18). The average gate fidelity in the numerical example is obtained from propagating a number of input states, reconstructing the effective superoperator of the gate from these simulations and subsequently using Eq. (16) to obtain the average gate fidelity of the gate. We observe that the infidelity of the numerical simulation flattens for EJ/EC 75, which is due to other errors in the gate implementation, such as leakage transitions during the pulse ramping up and down, which was confirmed by monitoring the population of the computational states during gate operation (see Supplementary Fig. 1 for more details). In this region (in gray) the effect of a parity switch is not seen since it is too small compared to other errors. On the other hand at lower values of EJ/EC, the numerical results overlap with the parity switching error predicted by the shift in the conditional phase. While clearly demonstrating the magnitude of the error, this result also shows that the leakage error contribution in Eq. (17) is negligible in the perturbative regime. The gate durations in Fig. 2 are typically 45 ns tg 60 ns, with τc ~ tg.

Additionally, Fig. 2b compares the magnitude of the error due to quasiparticle related decoherence to the parity switch induced error described in this work. Since the quasiparticle induced characteristic decay times T1 and Tϕ depend on a large number of parameters and there are two possible quasiparticle generating mechanisms (see Fig. 1b)16, we only provide an upper bound based on the parity switching time observed in the references cited in the caption of Fig. 2. This upper bound is determined by noting that in the computational subspace55

$${\Gamma }_{00}^{+-}+{\Gamma }_{11}^{+-}+{\Gamma }_{01}^{+-}+{\Gamma }_{10}^{+-}\approx 2/{T}_{{{{\rm{P}}}}},$$
(19)

where the rates \({\Gamma }_{ij}^{+-}\) represent the transition rates between states \(\left\vert {i}^{\pm }\right\rangle \to \left\vert {j}^{\mp }\right\rangle\) in different parity manifolds of a single transmon. Since amplitude damping noise has a larger effect on the fidelity compared to pure dephasing (see Table 2), we furthermore assume the worst-case scenario in which each quasiparticle-induced parity switching event results in a T1 decay, so that \(1/{T}_{1}^{{{{\rm{qp}}}}}={\Gamma }_{01}^{+-}+{\Gamma }_{10}^{+-}\approx 2/{T}_{{{{\rm{P}}}}}\). This expression for \({T}_{1}^{{{{\rm{qp}}}}}\), together with experimentally measured values of TP, provides an approximate upper bound for the magnitude of the effect of the decoherence. Even though each green line in Fig. 2b uses a constant measured TP, independent of EJ/EC, the plotted infidelity contribution is not constant due to the varying gate duration T = 2τb + τc.

We observe from Fig. 2b that depending on the ratio of EJ/EC and the parity lifetimes TP, the contribution of parity switching to the infidelity of the two-qubit gate system can dominate the contribution from quasiparticle-induced decoherence. Note that the two-qubit gates are also typically the noisiest building blocks of a quantum algorithm7,8,11,69. We have also provided additional numerical results in the Supplementary Results, more specifically Supplementary Fig. 2, showing that the effect of a parity switch on the leakage becomes the main contribution to the infidelity at shorter gate times, beyond the perturbative regime.

In order to compare the effect of the charge-parity switches to 1/f-charge noise, which was previously thought to dominate the low EJ/EC regime, we included a comparison of the infidelity of a two-qubit gate due to 1/f-type charge noise in the Supplementary Results and Methods, more specifically Supplementary Fig. 3, since the contribution of both of these error sources scales with the charge dispersion of the transmon. However, unlike the charge dispersion of the transmon which is maximized when \(\cos (2\pi {n}_{g})=1\), the low-frequency charge noise decoherence rate is maximal at \(\sin (2\pi {n}_{g})=1\). This means that these two errors are mutually exclusive, i.e. if we were hypothetically able to tune ng to the value where the charge dispersion of the transmon is equal to zero, that point corresponds to the maximal decay rate due to low-frequency charge noise12. Nonetheless, in the comparison we have assumed in both the charge-parity switching analysis and the low-frequency charge noise analysis that we are at the noise hotspot, i.e. the value of ng where the effects are maximal, thus slightly overestimating both errors. We observe that at lower EJ/EC the charge-parity switching error is dominant and vice versa. The crossover between the infidelity due to the charge-noise-induced dephasing in the computational subspace and the parity switching exhibits a crossover at EJ/EC ~ 80, which corresponds to a gate infidelity of \(1-{{{\mathcal{F}}}} \sim 1{0}^{-8}\), i.e. the charge-parity switching error is dominant for EJ/EC 80. However, due to the increased charge dispersion in the higher-excited levels of the transmon and the utilization of the second excited state, we have additionally analyzed the effect of the second excited state charge-noise-induced dephasing which was found to be much larger compared to the effects in the computational subspace. In this case, the crossover between the charge-parity error, which was again dominant at lower EJ/EC, was found to occur at EJ/EC ~ 65 and infidelities on the order of \(1-{{{\mathcal{F}}}} \sim 1{0}^{-6}\). We note here that our analysis overestimates the effect of the charge noise in a realistic scenario by neglecting the time correlations in the noise and should be treated as an upper bound of the magnitude of the effect. These results demonstrate that the parity-switching error is the dominant charge dispersion related error of the two-qubit gate in the regime of EJ/EC 65.

Parity-switching effects on the adiabatic interaction

The tunable coupler architecture analyzed thus far can also be used to implement an adiabatic CPHASE gate. In this case, the level repulsion between the states of the double-excitation manifold is used to obtain a phase shift of the \(\left\vert 11\right\rangle\) state, which accumulates the conditional phase with the rate

$${\zeta }_{{{{\rm{ZZ}}}}}={\omega }_{11}-{\omega }_{01}-{\omega }_{10}+{\omega }_{00},$$
(20)

where ωij are the eigenenergies of the eigenstates which form the computational basis, as described in the Methods section. In this gate implementation the population ideally never leaves the computational subspace, however we will show that, also in the adiabatic implementation, the gate will suffer from charge-parity switching errors of a similar magnitude as in the diabatic case.

Additionally, one of the main benefits of the tunable coupler architecture is the ability to also suppress any residual interactions between the qubits during idling38,39,47, with unwanted ZZ coupling strengths demonstrated to be below 1 kHz36. In this section, we also show that uncontrolled parity-switching sets a lower bound on the minimum achievable unwanted ZZ interaction in such systems.

In the presence of parity-switches in all three transmons of the tunable coupler system, the ZZ coupling rate of each of the distinct parity configurations will slightly differ. Since these differences are proportional to the charge dispersion, and therefore small, we can describe the parity-dependent ZZ coupling of this system with a first-order Taylor expansion, so that

$${\tilde{\zeta }}_{{{{\rm{ZZ}}}}}({P}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}},{P}_{{{{\rm{c}}}}},{P}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})\simeq {\zeta }_{{{{\rm{ZZ}}}}}^{0}+\sum\limits_{i\in \{{{{{\rm{{q}}}_{1},c,q}}}_{{{{\rm{2}}}}}\}}\frac{{P}_{i}}{2}\left(\frac{\partial {\zeta }_{{{{\rm{ZZ}}}}}}{\partial {\alpha }_{i}}{\epsilon }_{2}^{i}+\frac{\partial {\zeta }_{{{{\rm{ZZ}}}}}}{\partial {\omega }_{i}}{\epsilon }_{1}^{i}\right),$$
(21)

with Pi {−1, +1} i. While we have still assumed ϵ2ϵ1, we have also acknowledged that the first-excited-state charge dispersion and the derivatives ∂ζZZ/∂ωi can, in certain cases, be significantly larger compared to ∂ζZZ/∂αi. ζZZ therefore corresponds to the ZZ coupling strength of the system with parity-averaged parameters.

However, in order to evaluate the derivatives in Eq. (21), or the general value of the coupling rate ζZZ with fixed Hamiltonian parameters (without including parity effects), it is necessary to go to fourth-order perturbation theory in the coupling strengths, as was done in refs. 36,39. While the complete fourth-order expression is impractical for obtaining any analytical insight, by further assuming the hierarchy of the system parameters \({\Sigma }_{i{{{\rm{c}}}}}\,\gg \,{\Delta }_{i{{{\rm{c}}}}}\,\gg \,{g}_{i{{{\rm{c}}}}}\,\gg\, {g}_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}\), where Σij = ωi + ωj and Δij = ωi − ωj, for \(i,j\in \{{{{{\rm{{q}}}_{1},\; \rm{c},\; q}}}_{{{{\rm{2}}}}}\}\), the cumbersome expressions can be significantly simplified into39

$${\zeta }_{{{{\rm{ZZ}}}}}\approx \frac{2\left[\left({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right){\tilde{g}}_{01,10}^{2}-2\nu {\tilde{g}}_{01,10}\left(2{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}+\left({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right){\Delta }_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}\right)\right]}{({\Delta }_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}})({\Delta }_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}-{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})}\;+\;2{\nu }^{2}\left[4{\alpha }_{{{{\rm{c}}}}}+\frac{({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}){\Delta }_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}^{2}}{({\Delta }_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}})({\Delta }_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}-{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})}\right],$$
(22)

where \(\nu ={g}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}\rm{c}}{g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}\rm{c}}/(2{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}\rm{c}}{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}\rm{c}}) \sim 1{0}^{-3}\).

Focusing on the idling regime, where \({\zeta }_{{{{\rm{ZZ}}}}}^{0}=0\), the difference between the derivatives obtained from Eq. (22) and the numerically exact result is shown in Fig. 3a. We observe qualitatively good agreement in the vicinity of the two coupler idling frequencies of the system (vertical black dotted lines).

Fig. 3: Comparison of the analytical formulas with numerical results obtained via exact diagonalization for the adiabatic interaction.
figure 3

a Largest derivatives in Eq. (21) obtained numerically (solid lines) and from Eq. (22) (filled circles) using the parameters from Table 1 and \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}=-270\) 2π MHz. The x-axis represents the coupler frequency, with the two idling frequencies denoted with black dashed vertical lines. Additional plots of the first-order derivatives at different qubit detunings are plotted in Supplementary Fig. 4. b Comparison of numerical and analytical results for the root-mean-squared coupling strength at the idling point (defined in Eq. (23)), with the bar graph displaying the numerical values of the coupling for the 8 different parity states for the highlighted data point. The parameters of the simulation are chosen identically as in Fig. 2 and are listed in the Methods section, more specifically in Table 1. The parameters for the bar plot correspond to the parameters in panel a. c The susceptibility of an adiabatic CZ (ϕ0 = π) gate to a parity switch (defined in Eq. (24)) at different coupler frequencies and qubit detunings, measured in GHz (left axis) and in units of \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) (right axis), using the same parameters as in panel a. The black contours represent different values of \({\zeta }_{{{{\rm{ZZ}}}}}^{0}\), as indicated on the plot. The white dashed line indicates the detuning used in panel a.

Since the system has eight uncontrolled, rapidly (compared to experimental timescales, as in Eq. (13)) switching parity states, we define a parity-averaged idling interaction strength:

$${\langle {\tilde{\zeta }}_{{{{\rm{ZZ}}}}}^{2}\rangle }_{{{{\rm{CPS}}}}}=\frac{1}{{2}^{3}}\sum\limits_{{P}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}},{P}_{{{{\rm{c}}}}},{P}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\in \{-1,+1\}}{\left[{\tilde{\zeta }}_{{{{\rm{ZZ}}}}}({P}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}},{P}_{{{{\rm{c}}}}},{P}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})\right]}^{2}.$$
(23)

This definition is the parity-averaged mean-square of the idling ZZ strength defined in Eq. (21). Parity-averaged coupling strengths are shown in Fig. 3b, where we observe a good agreement between the analytical result and the numerical data. The possible values of the quantity \(\tilde{\zeta }_{{{{\rm{ZZ}}}}}({P}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}},{P}_{{{{\rm{c}}}}},{P}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})\) for the third data point (also shown in panel a) are visualized in the bar graph. The bar graph demonstrates that Qubit 2 (q2) is the dominant contributor to the perturbation, with small corrections due to the parity of Qubit 1 (q1). This can be seen by noting that the residual coupling strength is dependent mostly on the parity of Qubit 2 (q2). We have extended this analysis to larger values of \({\Delta }_{{{{{\rm{{q}}}_{1}}}}\rm{q}_{{{{\rm{2}}}}}}\) in the Supplementary Results (see Supplementary Fig. 4), to show that the parity-switching induced ZZ coupling strength is not always dependent on the parity of a single transmon in the system. The parity of the coupler in this set of parameters is largely irrelevant due to the high ratio of \({E}_{{J}_{{{{\rm{c}}}}}}/{E}_{{C}_{{{{\rm{c}}}}}}\approx 250\), but this is not always the case. For example, considering the gate implementation from ref. 41, the ratio of \({E}_{{J}_{{{{\rm{c}}}}}}/{E}_{{C}_{{{{\rm{c}}}}}}\) for the coupler is much smaller and therefore, the coupler parity in this implementation has a much larger effect on the strength of the residual ZZ coupling.

The above results therefore present a fundamental limit on the magnitude of unwanted interactions that can be achieved in the tunable coupler architecture. However, the overall effect on an algorithm is more complex as it depends on the duration of the execution (since the interaction is always “on”), which in turn depends on the coherence times. We estimate that this error becomes relevant if the coherence times are of the order of \(1/\sqrt{{\langle {\tilde{\zeta }}_{{{{\rm{ZZ}}}}}^{2}\rangle }_{{{{\rm{CPS}}}}}}\), since a significant unwanted conditional phase is accumulated if the time needed to run a single shot of a circuit (without the time needed to reinitialize and measure the qubits) TcircNgatestg, which is limited by the coherence times, is long enough such that \({T}_{{{{\rm{circ}}}}}\cdot \sqrt{{\langle {\tilde{\zeta }}_{{{{\rm{ZZ}}}}}^{2}\rangle }_{{{{\rm{CPS}}}}}} \sim 1\). For EJ/EC ~ 50, the coherence time (or algorithm duration) needed to observe the parity-switching-induced residual coupling strength is on the order of 1 ms. Note that current fabrication processes are indeed already approaching this value48,49.

Moving on the adiabatic CPHASE gate, we have observed in Fig. 3a, b that the state contributing most to the unwanted ZZ interaction in the idling regime is the second excited state of the high-frequency qubit, provided that \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\simeq {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\). This is due to the hybridization of the computational \(\left\vert 11\right\rangle\) state with the local transmon state \(\left\vert {0}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{0}_{{{{\rm{c}}}}}{2}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right\rangle\), which was also confirmed by the additional results in Supplementary Fig. 4. More importantly, this means that according to the same reasoning as in the diabatic gate case, the adiabatic gate can be described by the same pair of Kraus operators as defined in Eq. (15), and since we are assuming adiabaticity, we can automatically set δP11 = 0. Again, applying the same reasoning as in the diabatic case, the fidelity of the adiabatic gate is given by

$${\mathcal{F}} \approx 1 - \frac{3}{80} \left[\left.\frac{\partial \phi}{\partial \alpha_{{\mathrm{q}}_{\mathrm{2}}}}\right|_{t = t_{\mathrm{g}}} \epsilon_2^{{\mathrm{q}}_{\mathrm{2}}} \cos(2\pi n_g^{{\mathrm{q}}_{\mathrm{2}}})/\hbar \right]^2 = 1 - \frac{3}{80} \left[\mathop{\underbrace{\frac{\partial \zeta_{\mathrm{ZZ}}}{\partial \alpha_{{\mathrm{q}}_{\mathrm{2}}}} \frac{\phi_0}{|\zeta_{\mathrm{ZZ}}^0 |}}}\limits_{{\mathrm{Fig.}}\,3{\mathrm{c}}} \epsilon_2^{{\mathrm{q}}_{\mathrm{2}}} \cos(2\pi n_g^{{\mathrm{q}}_{\mathrm{2}}})/\hbar \right]^2.$$
(24)

In the second step we have again assumed a flat pulse with duration tg and neglected the ramping up and down phases of the flux pulse. With this simplification, the conditional phase is given by \({\phi }_{0}={\zeta }_{{{{\rm{ZZ}}}}}^{0}{t}_{{{{\rm{g}}}}}\). Contrasting the expressions in Eq. (24) and Eq. (18), we can see that if the quantity \(\partial {\zeta }_{{{{\rm{ZZ}}}}}/\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\cdot {\phi }_{0}/| {\zeta }_{{{{\rm{ZZ}}}}}^{0}|\) is on the order of ~10 ns, the error of the adiabatic gate is similar to the diabatic case (displayed in Fig. 2). More specifically, the conditional phase shift due to a charge-parity switch is given by \(\delta {\phi }_{{{{\rm{d}}}}}\approx {t}_{{{{\rm{g}}}}}/2\cdot {\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}/\hslash\) in the diabatic case (according to Eq. (11)) and by \(\delta {\phi }_{{{{\rm{a}}}}}\approx \partial {\zeta }_{{{{\rm{ZZ}}}}}/\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\cdot {\phi }_{0}/| {\zeta }_{{{{\rm{ZZ}}}}}^{0}| {\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}/\hslash\) in the adiabatic case (ignoring the factor \(\cos (2\pi {n}_{g}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})\)).

The quantity \(\partial {\zeta }_{{{{\rm{ZZ}}}}}/\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\cdot {\phi }_{0}/| {\zeta }_{{{{\rm{ZZ}}}}}^{0}|\) is numerically analyzed in Fig. 3c, where we can see that in the vicinity of the horizontal line where \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}={\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\), the hybridization with the second excited state results in an increased sensitivity to the charge-parity switches. However, the exact value of \(\partial {\zeta }_{{{{\rm{ZZ}}}}}/\partial {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\cdot {\phi }_{0}/| {\zeta }_{{{{\rm{ZZ}}}}}^{0}|\) is very dependent on the value of the qubit-qubit detuning, but generally the regime where \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}} < {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) is more favorable. In the regime where \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\approx {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\), the observed effect of a charge-parity switch on the fidelity (which is proportional to the square of the plotted values according to Eq. (24)) can be an order of magnitude larger than in the diabatic case. Conversely, in the regime where \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}} < {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\), the infidelity is up to two orders of magnitude smaller. More specifically, at EJ/EC = 50 (corresponding to \({\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\approx 250\,h\,{{{\rm{kHz}}}}\)), the maximal fidelity achievable in the range of the plot in Fig. 3c is \({{{\mathcal{F}}}} \sim 1-1{0}^{-7}\), and the minimal fidelity is \({{{\mathcal{F}}}} \sim 1-1{0}^{-2}\).

Optimal qubit parameters

Having established the magnitude of the parity-induced error on a two-qubit gate, we have shown that this error can be mitigated by increasing the EJ/EC ratio of the transmon. However, there are other error sources present in such architectures40, and while increasing the EJ/EC ratio will suppress the parity-switching errors, it may also increase the contribution of other possible error sources. Therefore, in order to find better parameters for future transmon-based quantum computers, we must evaluate the contributions of all errors affecting the system. In particular, we estimate optimal regions for the qubit parameters EJ and EC, where the errors contributing to the gate and state preparation infidelities are minimized.

We consider a number of different error sources relevant to superconducting qubits:

  1. 1.

    T1 decay due to the coupling to a bath of two-level systems71,72,73,74,75,76,77.

  2. 2.

    Tϕ pure dephasing due to the coupling to magnetic flux noise71,78,79,80.

  3. 3.

    Leakage affecting single-qubit gates due to low anharmonicity81.

  4. 4.

    State preparation errors due to finite-temperature heating effects13,14,15, without the presence of active reset.

  5. 5.

    Errors in the two-qubit gate operation due to parity switch effects that are analyzed in this manuscript.

In Methods, we show how the above error sources scale with the transmon Hamiltonian parameters EJ and EC. We have not included any errors related to the control and calibration of the individual gates, as such errors do not explicitly depend on the qubit parameters and their inclusion, therefore, would not significantly alter the presented results. Similarly, measurement errors are present, but have no explicit dependence on EJ and EC.

For simplicity, we consider the same tunable coupler system as in the previous section, but arranged in a square grid, as pictured in Fig. 4a. In order to avoid frequency crowding issues with such a connectivity82, the qubits in the array are divided into low and high-frequency transmons. Furthermore, to prevent next-nearest-neighbor frequency collisions, even qubits assigned to the high- or low-frequency groups must possess non-identical frequencies, as demonstrated in ref. 83. Nonetheless, the frequency spread within each group is considerably smaller compared to the disparity between the means of these two groups. While our analysis focuses on a single qubit pair, extending the results to encompass a parameter distribution is straightforward. Additionally, we will define a broader region of optimal parameters spanning several hundred MHz in frequency, ensuring that all transmons within the square grid can fit within this designated range.

Fig. 4: Determining optimal transmon parameters.
figure 4

a Schematic representation of the square grid architecture, with high- (dark blue) and low-frequency (light blue) transmons, connected via tunable couplers (green). b Values of EJ and EC of Qubit 2 (q2) from different experimental implementations of the tunable coupler transmon architecture. Both parameters are extracted from the reported \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) and \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\), and therefore the points are only an approximation50. The annotations refer to the following references [1] Collodo et al.46, [2] Xu et al.44 [3] Sung et al.36, [4] Wu et al.11, [5] Xu et al.37 and [6] Google Quantum AI8. The black lines represent constant ratios of EJ/EC and the red lines correspond to contours of constant qubit frequency \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\). c Schematic representation of the circuit used to infer the weights in Eqs. (25) and (26), with the state preparation pictured on the left (in purple), single-qubit π and π/2-rotations in green and blue respectively, and the two-qubit CZ gate in orange. N is the number of times the pictured circuit (without the state preparation) is repeated before measurement and therefore an integer determined by the reference coherence time. In our case, we consider \(N=\left\lfloor \frac{{T}_{1,0}}{10({t}_{{{{\rm{TQG}}}}}+2{t}_{{{{\rm{SQG}}}}})}\right\rfloor\). d The function \(1-{{{\mathcal{P}}}}\) defined in Eq. (26) plotted for different values of the second (higher frequency) transmon EJ and EC. We consider a single-qubit gate implemented with a Gaussian DRAG pulse with a duration of tSQG = 16 ns, a two-qubit gate duration of tTQG = 50 ns and three different reference coherence times indicated on top of each panel. For all three cases, the reference Tϕ,0 = T1,0 at EJ = 12 h GHz and EC = 0.2 h GHz for Qubit 2 (q2) with the parameters of Qubit 1 (q1) given in Table 1. All the parameters (transmon parameters and decay times) are scaled accordingly to different values of EJ and EC, and for each qubit individually, as described in Tables 1 and 2. The striped green area marks the region in the plot with the lowest values of \(1-{{{\mathcal{P}}}}\), defined by the 10th percentile of the plotted values. The darker green contour is obtained with a density matrix simulation of the circuit from panel c with the same errors, but instead of evaluating the function \(1-{{{\mathcal{P}}}}\), it is obtained by minimizing the infidelity of the state before measurement. The lighter green dotted contour is the optimal region obtained using the advanced noise model from Table 2.

In the following, we consider two noise models. It has been demonstrated that the gate infidelity due to decoherence is, up to first order, independent of the unitary dynamics84. Therefore, in the basic noise model, presented in Table 2, both T1 and T2 decay are modeled by applying the corresponding Kraus operators after the gate operation. We further assume that the detuning between the computational transmons is consistently \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\approx {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\), and the anharmonicity of the qubits is similar. Given these conditions, we parameterize the entire system in terms of EJ and EC of the higher-frequency transmon. Examples of parameter values, obtained from experimental demonstrations of CZ gates, are plotted in Fig. 4b.

In the advanced noise model (see Table 2), we account for the population leaving the computational subspace during the operation of the diabatic CZ gate85. Additionally, in the advanced noise model we also consider the qubits to be detuned by \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}/2\approx {\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}/2\) during the execution of single-qubit gates (in order to prevent driving unwanted leakage transitions)86. The low-frequency qubit is then tuned so that \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}={\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) only during the two-qubit gate operation, thus increasing the flux-noise susceptibility during a CZ gate, according to Eq. (47).

In order to quantify the performance of an algorithm execution with a specified pair of parameters EJ and EC in mind, we define a performance metric \({{{\mathcal{P}}}}\) which we will then maximize. We further define \({{{\mathcal{P}}}}\) as one minus a weighted sum of the infidelity contributions of all the relevant errors listed in Table 2. This sum can be written as

$$\begin{array}{l}1-{{{\mathcal{P}}}}=\sum\limits_{i={T}_{1},{T}_{\phi },{{{\rm{parity}}}}}{w}_{{{{\rm{TQG}}}},i}(1-{{{{\mathcal{F}}}}}_{{{{\rm{TQG}}}},i})+\sum\limits_{i={{{{\rm{{q}}}_{1},\rm{q}}}}_{{{{\rm{2}}}}}}\sum\limits_{j={T}_{1},{T}_{\phi },{{{\rm{leak.}}}}}{w}_{{{{\rm{SQG}}}},i,j}(1-{{{{\mathcal{F}}}}}_{{{{\rm{SQG}}}},i,j})\\\qquad\qquad+\sum\limits_{i={{{{\rm{{q}}}_{1},\rm{q}}}}_{{{{\rm{2}}}}}}{w}_{{{{\rm{SP}}}},i}(1-{{{{\mathcal{F}}}}}_{{{{\rm{SP}}}},i}),\end{array}$$
(25)

where the summation runs across all infidelity contributions, or more explicitly for the simple algorithm pictured in Fig. 4c and the basic noise model from Table 2

$$\begin{array}{lll}1-{{{\mathcal{P}}}}\,=\,\frac{2}{5}({\Gamma }_{1}^{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}+{\Gamma }_{1}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}){t}_{{{{\rm{TQG}}}}}+\frac{1}{5}({\Gamma }_{\phi }^{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}+{\Gamma }_{\phi }^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}){t}_{{{{\rm{TQG}}}}}+\frac{3}{80}{\left(\frac{{t}_{{{{\rm{TQG}}}}}}{2\hslash }{\epsilon }_{2}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right)}^{2}\\ \qquad\qquad+\,2\left[\frac{1}{3}{\Gamma }_{1}^{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{t}_{{{{\rm{SQG}}}}}+\frac{1}{6}{\Gamma }_{\phi }^{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{t}_{{{{\rm{SQG}}}}}\right]+\frac{1}{3}{P}_{{{{\rm{leak.}}}}}^{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}+2\left[\frac{1}{3}{\Gamma }_{1}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}{t}_{{{{\rm{SQG}}}}}+\frac{1}{6}{\Gamma }_{\phi }^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}{t}_{{{{\rm{SQG}}}}}\right]+\frac{1}{3}{P}_{{{{\rm{leak.}}}}}^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\\ \qquad\qquad+\,\left[\frac{10({t}_{{{{\rm{TQG}}}}}+2{t}_{{{{\rm{SQG}}}}})}{{T}_{1,0}}\right]\left({P}_{\left\vert 1\right\rangle }^{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}+{P}_{\left\vert 1\right\rangle }^{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right).\end{array}$$
(26)

The first term in Eq. (25) and first line in Eq. (26) correspond to the errors of the two-qubit gate, the second term and line to single-qubit gate errors and the last to the state preparation error. The performance metric for the advanced noise model can be derived from Table 2.

A similar fidelity approximation was defined in refs. 7,87. We have introduced additional weights in the sum, in order to account for the relative number of single and two-qubit gates, and also to correctly take into account the fact that the error in state preparation occurs only once, while the gate error is significantly amplified after a number of applications of the operation. As the determination of the position of maximal \({{{\mathcal{P}}}}\) within the (EC, EJ) landscape, using which we find the optimal range for parameters, depends exclusively on the relative values of the weights wi,j, we proceed by assuming that all error terms associated with the two-qubit gate are assigned weights of wTQG = 1. As pictured in Fig. 4c, we analyze a circuit where we perform four single-qubit gates per each two-qubit gate, with half those single-qubit gates being π rotations which are more susceptible to leakage. Note that this ratio of single to two-qubit gates arises naturally with the introduction of randomized compiling into the algorithm69. The weights \({w}_{{{{\rm{SQG}}}},{T}_{1,\phi }}\) for the decoherence induced infidelity during a single-qubit gate therefore have a value of 2 (since a π and π/2 rotation are applied) per qubit, while the leakage error wSQG,leak. = 1 per each qubit. Lastly, the thermal state preparation error is weighted so that wSP = 10(tTQG + 2tSQG)/T1,0, since this quantity is the inverse of the approximate number of single and two-qubit gates that we can perform within a specified coherence time T1,0 at the reference point. Therefore, the performance metric \({{{\mathcal{P}}}}\) approximates the fidelity of the simple circuit displayed in Fig. 4c.

The values of \(1-{{{\mathcal{P}}}}\) from Eq. (26) for certain parameters are plotted in Fig. 4d. Due to the fact that the maximum of \({{{\mathcal{P}}}}\) is an optimal solution only for the circuit pictured in Fig. 4c, we have plotted an optimal region of parameters defined by the 10th percentile of the points with the lowest value of \(1-{{{\mathcal{P}}}}\). This region is defined so that the parameters of the system can be further fine-tuned within the optimal area. This optimal region is also contrasted with the optimal region obtained with a density matrix simulation of the same circuit, and the good agreement between the curves shows that \({{{\mathcal{P}}}}\) is a valid performance metric. We also observe good alignment between the optimal regions of the two distinct considered noise models, with the second model anticipated to offer greater accuracy. The discrepancy is larger at lower coherence times, since both models differ mainly in the decoherence modeling of the diabatic CZ gate. Further numerical tests have shown that the more accurate infidelity formulas are the main reason for the difference85, rather than the increased flux-sensitivity of Qubit 1 during two-qubit gates, or the differing transmon idling frequency configurations.

The color scale corresponding to the \(1-{{{\mathcal{P}}}}\) has four distinct less-favorable regions, corresponding to the error sources in Table 2. For small EC and, consequently, low anharmonicity \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\), the overall fidelities are low due to relatively large errors caused by leakage during single-qubit gates. For small EJ and large EC, EJ/EC has a low value and therefore the system experiences relatively large errors arising from parity switching events. If both EJ and EC are large, the coherence times are short and, therefore, the dominating source of infidelity. Moving perpendicularly to the constant frequency contours towards lower values of EC and EJ, results in lower frequencies and, thus increased errors caused by thermal excitations.

On the other hand, when considering near-future transmons with coherence times close to 0.5 ms48,49 in Fig. 4d, we observe that the optimal region is shifted towards larger values of EC, and is even limited by the parity-switching effects in the bottom right corner, thus demonstrating the importance of this effect in future quantum processing unit (QPU) design, when two-qubit gate infidelities surpass the limit of 10−3 infidelity.

By comparing the data in Fig. 4b, d, we observe that the implementation by Xu et al.37 is close to the parity-switching induced error region. Since the pure dephasing times of the system were not reported we are not able to assess the relative contribution of the parity switching error to this gate implementation. However, using the results shown in Eqs. (11) and (17), we observe that the parity-switching-error induced infidelity for the parameters is estimated to be \(1-{{{\mathcal{F}}}}\approx 2\cdot 1{0}^{-3}\). As a comparison, the reported Γ1 decay rates (measured in the idling configuration) in the reference were \(1/{\Gamma }_{1}^{{{\rm{q}}}_{{\rm{1}}}}=20.8\,{{\upmu}} {{{\rm{s}}}}\) and \(1/{\Gamma }_{1}^{{{\rm{q}}}_{{\rm{2}}}}=28.8\,\upmu {{\rm{s}}}\)37. Together with the reported gate time, these decay times correspond to an infidelity contribution of approximately 6  10−4, as given by the expression in Table 2. These results show that for this specific implementation of the diabatic CZ gate, the described effects of the parity switches are comparable, and possibly even greater than the T1 decay induced infidelity.

The shifts of the optimal region with increasing coherence times seen in Fig. 4d mean that the qubit parameters must be adapted to the currently achievable coherence times. Figure 5a shows how the performance metric \({{{\mathcal{P}}}}\) increases with the achievable coherence properties of the system, provided that the parameters EJ and EC are adapted to the coherence times. On the other hand, Fig. 5b displays the value of the function \({{{\mathcal{P}}}}\) without changing EJ and EC. Although the performance metric \({{{\mathcal{P}}}}\) on panels a and b is initially largely coherence limited, the assumption of fixed parameters EJ and EC in the simulations of Fig. 5b shows that simply increasing the coherence times does not necessarily result in better fidelities if the effects of other sources of error are not taken into account. In this case, the parity switching error becomes the dominant error source, as seen from the upper panel in Fig. 5b.

Fig. 5: The performance metric at different coherence times.
figure 5

The relative contributions to the value of \(1-{{{\mathcal{P}}}}\) (in Eq. (25)) from the error sources listed in Table 2 (top) and the value of the performance metric \({{{\mathcal{P}}}}\) as a function of coherence times with assuming Tϕ,0 = T1,0 at the reference point (bottom). Here we considered the same reference point as used in Fig. 4d. a A well-designed system, where the parameters EJ and EC are optimally adapted so that \({{{\mathcal{P}}}}\) is maximized for each value of the coherence time on the x-axis. b \({{{\mathcal{P}}}}\) at various coherence times while keeping EJ and EC fixed, corresponding to the values marked as point [5] in Fig. 4b.

It is also important to mention what other aspects of successfully operating a transmon tunable coupler based quantum computer were not included in the presented analysis.

  • We are neglecting any possible cross-talk effects between next-nearest neighbors.

  • Idling errors were discussed in this work, but not included in this analysis, as their contribution depends significantly on the specific algorithm being implemented. Taking also this effect into account, the area with low EJ/EC is even less favorable.

  • Other sources of decoherence are expected to have smaller contributions and therefore do not significantly affect the findings presented here.

  • The TLS environment is random, meaning that the scaling shown in Table 2 only holds for the average of a large number of qubits. Additionally, the T1 decay rate is also heavily influenced by the design, i.e., the exact geometry of the capacitor pads of the transmon, meaning that the presented results are only valid for a comparison of qubits with similar designs.

  • Recently, two-qubit gates with tunable couplers idled below the qubit frequencies have been demonstrated35,41. In such implementations, the presence of thermal coupler excitations limit the gate performance at lower qubit frequencies in a more complex manner. Our analysis in the low frequency regime therefore only holds for the original tunable coupler proposals, where the coupler is idled above the computational transmons.

While the results in Figs. 4d and 5a, b demonstrate how the knowledge about the error contributions can be used to guide the design of superconducting circuits, the scheme can easily be generalized to any type of quantum processor and any set of error sources. The only requirement is that the infidelity contribution of each error source depending on the tunable parameters of the processor can be evaluated either analytically or numerically. Especially the latter is typically feasible for errors which are local, i.e. do not affect a large number of components.

The schematic in Fig. 6 shows how the optimization procedure can be generalized to different quantum processor parameters and errors. The preparatory phase includes first identifying the parameters of the quantum processor we wish to optimize as well as the error sources whose contributions depend on these parameters (step 1). In the transmon example this corresponds to EJ and EC of the high-frequency qubit and the list of error sources in Table 2. In the second step an analysis of the infidelity contributions of the selected error sources with respect to the tunable parameters is performed, as described in Table 2. At this point, analytical formulas, such as the one in Eq. (18), are invaluable. However, in certain cases numerical simulations are needed to obtain these relations.

Fig. 6: General quantum processor parameter optimization procedure.
figure 6

The procedure is composed of two parts: the preparatory phase and the actual optimization. The optimization loop is composed of first obtaining the necessary data to evaluate the performance metric from Eq. (25) (step 3), after which we can determine whether the value of \({{{\mathcal{P}}}}\) is good enough or whether the optimization should continue. In the latter case, the performance metric is extrapolated to different values of the quantum processor parameters (from step 1.b) we wish to tune as plotted in Fig. 4d and a better parameter set is identified in step 5, based on the results from step 4. Afterwards, a new device is fabricated with the improved parameter set and the cycle repeats.

Once the preparatory phase is complete, the optimization step of the processor can commence by first obtaining the scaling coefficients needed to evaluate the infidelity contributions of each error (step 3 in Fig. 6), and hence evaluate the performance metric \({{{\mathcal{P}}}}\). This step can include measurements of the average coherence times of the qubits, their effective temperatures, etc. In some cases, such as for the parity switching error described in this manuscript, no additional measurements are required. Once the performance metric is evaluated one can decide whether to proceed with the optimization based on a predefined condition which can be chosen arbitrarily. As an example, in Fig. 4d, the convergence condition is satisfied if the current QPU parameters are within the optimal region defined by the top 10th percentile. If the value of the performance metric was found to be insufficient, a new set of parameters is chosen based on the the landscape of the performance metric (step 5), as shown in Fig. 4d.

Discussion

We have provided a framework for the optimization of circuit parameters that can be used to guide the future design of transmon-based quantum computers. Our findings reveal the presence of a distinct global performance peak within the EJ and EC parameter space, which has not been identified before. Moreover, our optimization procedure can be straightforwardly extended to more error sources, provided that the scaling of the error as a function of the system parameters is known analytically, or the infidelity contribution can be evaluated numerically. The latter is typically realistic as long as the error is sufficiently local, i.e., it depends only on the parameters of a handful of transmons at most. While we have based our analysis on transmon qubits connected via tunable couplers, the same principles, albeit with different error sources, can be applied to different types of qubits2 or co-design chips88. Additionally, more parameters than just EJ and EC can be optimized, e.g., also the gate durations can be considered as free parameters since they are realistically easy to adjust in experiments. The limiting factor here are the errors for which the analytical behavior is unknown and numerical interpolation in a large parameter space is too demanding.

We have chosen to present a range of optimal parameters, rather than prescribing a single optimal value for EJ and EC since additional improvements of the noise models are expected to improve the accuracy and reliability of the procedure. An extension of the framework in the tunable coupler example would include also optimizing the EJ and EC of the coupler transmon, as well as the couplings between the three transmons of the system. We are not aware of any analytical results linking the gate infidelity with these parameters, however it is known that the coupler coherence affects the effective qubit decoherence due to the hybridization of the states39,40. Correctly taking into account the effects of different coupling strengths necessitates a good understanding of the effect of the state hybridization on single-qubit rotations52,86 and the two-qubit gate duration (Eq. (11)). We believe that the inclusion of pulse-level numerical modeling, as presented in ref. 40, can also take into account such effects. Another straightforward improvement of our framework would be to consider a set of (possibly random) parameters for EJ and EC in the evaluation of the performance metric in order to better mimic the spread in transmon frequencies required to avoid unwanted resonances in a more realistic lattice82. This would also allow us to better estimate the effects of flux noise.

It is important to acknowledge the potential influence of other noise mechanisms, not explicitly addressed in this study, on the optimal design parameters. Specifically for superconducting circuits, the list of error sources in Table 2 can be expanded to include more contributions to the circuit coherence times, such as losses due to charge-coupling to an impedance, current fluctuations in the flux-bias lines, etc. All of these contributions to the coherence times can be evaluated using open-source software such as scqubits89. We believe that even taking into account cross-talk effects is possible, using similar approaches as in ref. 87. Nonetheless, the results in Fig. 4d show that the optimal parameter regimes presented are robust to small perturbations in the idling configuration, and higher-order corrections to the noise models. Our findings, as depicted in Fig. 4d, reveal that a two-fold variation in the reference coherence time only marginally adjusts the optimal parameter domain. This observation underscores the robustness of our results, suggesting that the presence of additional, potentially sub-leading noise mechanisms omitted from our simulations are unlikely to precipitate a drastic alteration in the presented outcomes.

Additionally, we have established how parity switching affects the commonly implemented tunable-coupler mediated diabatic and adiabatic CZ gate in a transmon based quantum computer, both analytically and numerically. We have shown that the parity switching error can be the main quasiparticle-related error source of the two-qubit gate. Moreover, we have demonstrated that the experimental implementation of the gate presented in ref. 37 may have a comparable, if not larger, contribution of parity-switching errors compared to all T1 decay mechanisms.

While the tunable-coupler-based diabatic and adiabatic CPHASE gates are more relevant due to their implementations in leading large-scale experiments7,8,10,11, we believe that the effects described in this manuscript should be considered in any current or future transmon-based quantum gate which utilizes higher-excited states, such as the gate schemes proposed in refs. 34,35,90. Accordingly, we do not believe that the parity-switching effects play a major role in implementing iSWAP-like interactions in the tunable coupler architecture, since the states involved in the interaction are not significantly hybridized with the second excited state of any transmon. Furthermore, the effects on parity-switches on more general fSIM gates91 are expected to be of a similar magnitude as in the adiabatic CZ case presented in Fig. 3c.

One of the primary anticipated advantages of incorporating tunable couplers into the system is the potential for on-demand complete suppression of ZZ-type interactions among the qubits. However, our research in this context has revealed that the stochastic nature of parity switches imposes constraints on this proposition, practically establishing a lower bound on the achievable minimum ZZ coupling strength. The magnitude of this “always on" interaction should be an important consideration when running longer algorithms. More specifically, this effect becomes relevant if the algorithm is long enough to accumulate a considerable conditional phase due to the unwanted coupling strengths shown in Fig. 3b. Since current coherence times are approaching the 1 ms limit48,49, the residual idling strength can become relevant if the described effects are not taken into account in the design of the transmon parameters.

One way of mitigating the parity-switching effects would be to attempt to tune the offset charge ng to the point where both parity manifolds are degenerate. However, such a solution is not practical, since the environmental charge noise would result in a drift of the offset charge ng as was demonstrated in refs. 92,93. We further note that \(\cos (2\pi {n}_{g})=0\) is the low-frequency charge noise hotspot12 in which the qubit frequency is maximally sensitive to the fluctuations of the offset charge. Therefore, the qubit is expected to have lower coherence times at this particular value for ng.

Methods

Modeling the tunable coupler circuit and diabatic CPHASE gate

The dependence of the transmon parameters on the parity and offset charge is not explicitly shown in Eq. (5) in order to simplify the notation. However, we note that the whole system has 23 = 8 distinct parity states. We emphasize also that the couplings between the transmons in Eq. (5), \({g}_{ij}={\beta }_{ij}\sqrt{{\omega }_{i}{\omega }_{j}}\) also depend on the frequencies, meaning that while \({g}_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}\) is constant, \({g}_{{{{{\rm{q}}}}}_{1,2}{{{\rm{c}}}}}\) is time dependent. The dimensionless prefactors βij depend on the coupling capacitances, as well as self-capacitances of the transmons in the lumped-element circuit model38.

The computational basis in this scheme is formed by the eigenstates of the Hamiltonian from Eq. (5) in the idling configuration (defined below), rather than the local (uncoupled) transmon states39. Since the couplings act only as a perturbation to the uncoupled states, we identify the full Hamiltonian eigenstates corresponding to the uncoupled states. More specifically, the computational state \(\left\vert ij\right\rangle\), i, j {0, 1} is the eigenstate \(\left\vert \psi \right\rangle\) of the Hamiltonian in Eq. (5) with the maximal overlap \(| \langle \psi | {i}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{0}_{{{{\rm{c}}}}}{j}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\rangle |\). This notation is employed throughout this manuscript and we typically omit the subscripts q1,2 and c. The kets with three indices (e.g. \(\left\vert ijk\right\rangle\)) always denote the local (uncoupled) Fock states of the three-transmon system. The kets with only two indices (e.g. \(\left\vert ij\right\rangle\)) are used to denote the eigenstates of the whole system that are closest to the local (uncoupled) state \(\left\vert i0j\right\rangle\). We denote with ωij the angular frequency of the computational state \(\left\vert ij\right\rangle\). The eigenenergies of these states are also used to computed the ZZ interaction strength ζZZ, as defined in Eq. (20).

However, since the coupler frequency is tunable, it is possible to find one or two frequencies ωc for which ζZZ = 036,38,41, provided that the qubit-qubit detuning \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\in [{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}},-{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}]\). These special frequencies are referred to as the coupler idling frequencies and denoted with \({\omega }_{{{{\rm{c}}}}}^{{{{\rm{idle}}}}}\).

A variety of pulse shapes can be used to implement the gate effectively and without inducing too many unwanted transitions36,39,62. Even though our analytical analysis makes minimal assumptions about the pulse shape, we need to choose a specific form for the numerical simulations. In our simulations we use the flattop Gaussian pulse described by the formula36,46

$$f(t)=\frac{1}{2}\left[{{{\rm{erf}}}}\left(\frac{t-{\tau }_{{{{\rm{b}}}}}}{\sqrt{2}\sigma }\right)-{{{\rm{erf}}}}\left(\frac{t-{\tau }_{{{{\rm{b}}}}}-{\tau }_{{{{\rm{c}}}}}}{\sqrt{2}\sigma }\right)\right]-C.$$
(27)

The flattop Gaussian is obtained by a convolution of a step function with duration τc and a Gaussian with parameter σ. The reasoning behind this choice is that the convolution of the flattop pulse with a Gaussian strongly suppresses the spectral component of the flattop pulse at higher frequencies, thus reducing the probability for unwanted transitions. An additional rise time of τb is also introduced, which we fix to \({\tau }_{{{{\rm{b}}}}}=2\sqrt{2}\sigma\). Since any gate must have a finite duration, we introduce a cut-off at time T = 2τb + τc. The constant C is then introduced to correct for the discontinuity at the beginning and the end of a pulse with finite duration. The coupler frequency is varied accordingly \({\omega }_{{{{\rm{c}}}}}(t)={\omega }_{{{{\rm{c}}}}}^{{{{\rm{idle}}}}}-Af(t)\). More details about the numerical simulations are given in ref. 40. The parity of the whole system is then switched from one state to the other according to the charge dispersion given by Eqs. (2) and (3).

Since in our analytical approach we approximate the realistic pulse shape shown in Eq. (27) with a square pulse (i.e., we neglect any dynamics during the ramping up and down of the flux pulse), we have to be careful when comparing the results to those obtained with the numerical data. Particularly, the pulse duration T in the numerics is in general different than the gate duration tg we have defined in our analytical derivation, since the full duration T also includes the time needed to ramp the pulse up and down. When we compare our analytical results with numerical simulations, we first extract the number n of Rabi oscillations from the simulated data by monitoring the population of the \(\left\vert 002\right\rangle\) state during the gate operation. After n is determined, the analytical gate time is adjusted such that tg = nπ/Ω. The effective gate time obtained in this manner does not differ significantly from the duration of the flat part of the pulse, typically less than 5 ns.

The Schrieffer–Wolff transformation

The Schrieffer-Wolff (SW) transformation used in this manuscript was first introduced in ref. 94 and a similar transformation has been applied to the computational subspace of the two-qubit system in refs. 38,52. The aim of the SW transformation in our case is not to diagonalize the system, but rather to decouple the coupler states from the computational transmons, thus enabling us to only study the reduced system, i.e. we want to simplify the full Hamiltonian into a more tractable reduced model, containing only the relevant states (i.e. the states which have a significant population).

The reason for this is the fact that the computational basis of the system is defined by the eigenstates of the system which have the maximum overlap with states of the form \(\left\vert {i}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{0}_{{{{\rm{c}}}}}{j}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right\rangle\), i.e. with the coupler always in the ground state. Any excitations of the coupler therefore lead to errors, so in order to analyze the ideal gate dynamics, we constrain ourselves only to the energy levels of the computational subspace and the second excited state used in the Rabi oscillation during the CPHASE gate operation.

In general, a SW transformation is obtained by noting that any unitary operator can be written as \(\hat{U}={{{{\rm{e}}}}}^{\hat{S}}={\mathbb{1}}+\hat{S}+\frac{1}{2}{\hat{S}}^{2}+\ldots \,\), where \(\hat{S}\) is anti-hermitian, \(\hat{S}=-{\hat{S}}^{{\dagger} }\). Consequently a unitary transformation of an arbitrary Hamiltonian \(\hat{H}\) can be expanded in terms of \(\hat{S}\) as

$$\hat{U}\hat{H}{\hat{U}}^{{\dagger} }=\hat{H}+\left[\hat{S},\hat{H}\right]+\frac{1}{2}\left[\hat{S},\left[\hat{S},\hat{H}\right]\right]+{{{\mathcal{O}}}}({\hat{S}}^{3}).$$
(28)

As is typical in perturbation theory, we introduce the parameter α for bookkeeping purposes, and split the full Hamiltonian into a diagonal part and two off-diagonal perturbations, so that \(\hat{H}={\hat{H}}_{0}+\alpha {\hat{V}}_{1}+{\alpha }^{2}{\hat{V}}_{2}\). Additionally we rewrite the operator \(\hat{S}\) as a first order operator \(\hat{S}\to \alpha \hat{S}\), since if the perturbation is small, \(\hat{U}\) should be close to identity. In our case, \(\hat{H}\) is given in Eq. (5) and the first-order perturbation \({\hat{V}}_{1}=-\sum\limits_{i = {{{{\rm{{q}}}_{1},\rm{q}}}}_{{{{\rm{2}}}}}}\hslash {g}_{i{{{\rm{c}}}}}({\hat{a}}_{i}^{{\dagger} }-{\hat{a}}_{i})({\hat{a}}_{{{{\rm{c}}}}}^{{\dagger} }-{\hat{a}}_{{{{\rm{c}}}}})\) corresponds to the capacitive couplings of the two transmons (Qubits 1 and 2, q1,2) to the coupler (c), the direct coupling between the qubits \({\hat{V}}_{2}=-\hslash {g}_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}({\hat{a}}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}^{{\dagger} }-{\hat{a}}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}})({\hat{a}}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}^{{\dagger} }-{\hat{a}}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})\) is a second order perturbation, while \({\hat{H}}_{0}\) is a sum of the three independent anharmonic oscillator Hamiltonians. This hierarchy is chosen due to the fact that in all practical scenarios \({g}_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}\ll\;{g}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}\rm{c}},{g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}\rm{c}}\)38.

By plugging the ansaetze into Eq. (28), and grouping the terms with the same order of α, we obtain

$$\hat{U}\hat{H}{\hat{U}}^{{\dagger} }={\hat{H}}_{0}+\alpha \left({\hat{V}}_{1}+\left[\hat{S},{\hat{H}}_{0}\right]\right)+{\alpha }^{2}\left({\hat{V}}_{2}+\left[\hat{S},{\hat{V}}_{1}\right]+\frac{1}{2}\left[\hat{S},\left[\hat{S},{\hat{H}}_{0}\right]\right]\right)+{{{\mathcal{O}}}}({\alpha }^{3}).$$
(29)

Looking at the first-order term, it is natural to choose \(\hat{S}\) such that \(\left[\hat{S},{\hat{H}}_{0}\right]=-{\hat{V}}_{1}\), i.e. so that we cancel any couplings to the coupler states up to lowest order. However in order to do so and account for the couplings to the higher state correctly, we generalize the transformation from refs. 38,52

$$\begin{array}{ll}{\hat{S}}_{i}\,=\,\sum\limits_{{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}},{n}_{{{{\rm{c}}}}}\in \{0,1\}}\sqrt{({n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}+1)({n}_{{{{\rm{c}}}}}+1)}\left[\frac{{g}_{\rm{q}_{i}\rm{c}}}{{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}\rm{c}}\,+\,{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}-{n}_{{{{\rm{c}}}}}{\alpha }_{{{{\rm{c}}}}}}\left({\hat{\pi }}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}^{{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}+1,{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}}{\hat{\pi }}_{{{{\rm{c}}}}}^{{n}_{{{{\rm{c}}}}},{n}_{{{{\rm{c}}}}}+1}-{\hat{\pi }}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}^{{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}},{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}+1}{\hat{\pi }}_{{{{\rm{c}}}}}^{{n}_{{{{\rm{c}}}}}+1,{n}_{{{{\rm{c}}}}}}\right)\right.\\ \qquad\quad-\,\left.\frac{{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}\rm{c}}}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}\rm{c}}\,+\,{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}\,+\,{n}_{{{{\rm{c}}}}}{\alpha }_{{{{\rm{c}}}}}}\left({\hat{\pi }}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}^{{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}+1,{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}}{\hat{\pi }}_{{{{\rm{c}}}}}^{{n}_{{{{\rm{c}}}}}+1,{n}_{{{{\rm{c}}}}}}-{\hat{\pi }}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}^{{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}},{n}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}+1}{\hat{\pi }}_{{{{\rm{c}}}}}^{{n}_{{{{\rm{c}}}}},{n}_{{{{\rm{c}}}}}+1}\right)\right],\\ \end{array}$$
(30)
$$\hat{S}={\hat{S}}_{1}+{\hat{S}}_{2}.$$
(31)

We have additionally defined \({\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}\rm{c}}={\omega }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}-{\omega }_{{{{\rm{c}}}}}\), \({\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}\rm{c}}={\omega }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}+{\omega }_{{{{\rm{c}}}}}\) and the operators \({\hat{\pi }}_{k}^{n,m}=\left\vert n\right\rangle \left\langle m\right\vert\), acting in the Hilbert space of \(k\in \{{{{{{\rm{q}}}}}_{1},{{{\rm{c}}}},{{{\rm{q}}}}}_{{{{\rm{2}}}}}\}\).

Since we have assumed the coupler remains in the ground state at all times, the effective Hamiltonian is defined on the set of states \(\{\left\vert {0}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{0}_{{{{\rm{c}}}}}{0}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right\rangle ,\left\vert {0}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{0}_{{{{\rm{c}}}}}{1}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right\rangle ,\left\vert {1}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{0}_{{{{\rm{c}}}}}{0}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right\rangle ,\left\vert {1}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{0}_{{{{\rm{c}}}}}{1}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right\rangle ,\left\vert {0}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}{0}_{{{{\rm{c}}}}}{2}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\right\rangle \}\). Additionally, we neglect any couplings outside of the subspace of interest, however the resulting effective Hamiltonian still contains terms coupling the levels \(\left\vert 000\right\rangle \leftrightarrow \left\vert 101\right\rangle\) and \(\left\vert 000\right\rangle \leftrightarrow \left\vert 002\right\rangle\). These couplings are neglected in the rotating-wave approximation, as these transitions do not conserve the total occupation number.

By additionally setting the energy of the ground state to zero we arrive at the effective subspace Hamiltonian from Eq. (36). The perturbative parameter values are given by

$${\tilde{\omega }}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}={\omega }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}+\frac{{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}^{2}}{{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}}+\frac{2{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}^{2}}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}}+\frac{{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}^{2}}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}},$$
(32)
$${\tilde{\alpha }}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}={\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}-\frac{2{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}^{2}}{{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}}+\frac{2{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}^{2}}{{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}}+\frac{4{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}^{2}}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}}+\frac{{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}^{2}}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}}-\frac{3{g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}^{2}}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}+2{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}},$$
(33)
$${\tilde{g}}_{01,10}={g}_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}+\frac{{g}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}{\rm{c}}}{g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}}{2}\left(\frac{1}{{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}{\rm{c}}}}+\frac{1}{{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}}-\frac{1}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}{\rm{c}}}}-\frac{1}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}}\right),$$
(34)
$${\tilde{g}}_{11,02}=\sqrt{2}\left[{g}_{{{{{\rm{{q}}}_{1}\rm{q}}}}_{{{{\rm{2}}}}}}+\frac{{g}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}{\rm{c}}}{g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}}{2}\left(\frac{1}{{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}{\rm{c}}}}+\frac{1}{{\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}}-\frac{1}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}{\rm{c}}}}-\frac{1}{{\Sigma }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}}\right)\right].$$
(35)

Deriving the effective diabatic gate unitary

We consider here the truncated Hilbert space spanned by the states \(\{\left\vert 00\right\rangle ,\left\vert 01\right\rangle ,\left\vert 10\right\rangle ,\left\vert 11\right\rangle ,\left\vert 02\right\rangle \}\). In our analytical considerations, we thus neglect the possibility of leakage, which is a good assumption for high-fidelity gates. We emphasize that the leakage effects are included in our numerical simulations which have been made with the full Hamiltonian defined in Eq. (5).

We additionally neglect the counter-rotating terms within the truncated Hilbert space and, after applying the Schrieffer-Wolff trnasofrmation to the Hamiltonian in Eq. (5), we obtain

$${\hat{H}}_{\rm{eff}}/ \hbar\hat{=}\begin{array}{*{20}{l}}\begin{array}{*{20}{ccccc}}\,\,\,\,{\left|{00}\right\rangle} & \,\,\,\quad{\left|{01}\right\rangle}& \quad{\left|{10}\right\rangle} &\quad\,\,\,{\left|{11}\right\rangle}&\qquad\,\,\,\,{\left|{02}\right\rangle}\end{array}\\\left( \begin{array}{*{20}{ccccc}}0 & \qquad0 & 0 & 0 & 0& \\0 & \qquad\tilde{\omega}_{{\mathrm{q}}_2} & \tilde{g}_{01,10} &0 & 0\\0 & \qquad\tilde{g}_{01,10} & \tilde{\omega}_{{\mathrm{q}}_1} & 0 & 0\\0 & \qquad0 & 0 & \tilde{\omega}_{{\mathrm{q}}_1} + \tilde{\omega}_{{\mathrm{q}}_2} & \tilde{g}_{11,02} \\0 & \qquad0 & 0 & \tilde{g}_{11,02} & 2\tilde{\omega}_{{\mathrm{q}}_2} + \tilde{\alpha}_{{\mathrm{q}}_2}\end{array}\!\!\!\!\!\!\right)\begin{array}{l}{\left|{00}\right\rangle}\\{\left|{01}\right\rangle} \\{\left|{10}\right\rangle} \\{\left|{11}\right\rangle} \\{\left|{02}\right\rangle}\end{array},\end{array}$$
(36)

where the parameters with the tilde denote the perturbed parameters of the original full Hamiltonian from Eq. (5) (see Eqs. (32)–(35)) up to second order in \({g}_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}{\rm{c}}}/({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{i}}}}}}-{\omega }_{{{{\rm{c}}}}})\). Without any loss of generality, we have also assumed that the second excited state of the second qubit (q2) is used to perform the Rabi oscillation.

The pulse shape (see Eq. (27)) used to implement a diabatic CPHASE gate can typically be divided into three distinct stages: (i) fast sweep of the coupler frequency to an operation value close to the resonance with the qubits; (ii) long constant-frequency plateau at the operation frequency; (iii) fast sweep back to idling frequency of the coupler. Since the plateau is typically longer compared to the fast sweeps36, and the effective coupling strengths \({\tilde{g}}_{01,10}\) and \({\tilde{g}}_{11,02}\) are significantly smaller if the coupler is further detuned, we assume that the plateau stage is the only contributor to the dynamics and the effective Hamiltonian from Eq. (36) is constant in time. We show later that this approximation is valid for realistic gate durations close to 40 ns and longer. If this approximation cannot be made, the form of the effective Hamiltonian remains unchanged. However, the analytical results derived in the remainder of this section become more complex due to the time dependence of the effective Hamiltonian.

The effective Hamiltonian in Eq. (36) is block-diagonal. Using matrix exponentiation, one can readily obtain an effective unitary time-evolution operator. Assuming that the computational transmons are detuned by approximately one anharmonicity, i.e. \({\tilde{\omega }}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\tilde{\omega }}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\approx {\tilde{\alpha }}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\), the amplitude of the Rabi oscillation between the single-excitation states \(\left\vert 01\right\rangle\) and \(\left\vert 10\right\rangle\) is lower than that of the transition between \(\left\vert 11\right\rangle\) and \(\left\vert 02\right\rangle\) because \(| {\tilde{\omega }}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\tilde{\omega }}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}| \gg {\tilde{g}}_{01,10}\). Thus, we can neglect the interaction between the single-excitation states.

Since we are interested in the operator acting on the computational subspace of the system, we further truncate the subspace by excluding the non-computational \(\left\vert 02\right\rangle\) state. After also accounting for the single-qubit phases, which is typically done via virtual Z-rotations95, we obtain the effective time-evolution operator \(\hat{U}(t)\) from Eq. (6). Equation (7) for P11 is derived by recognizing the block-diagonal structure of the effective Hamiltonian in Eq. (36). This simplifies the analysis, reducing it to a standard two-level Rabi oscillation63. Equation (8) is obtained by subtracting the phases of the single-excitation states from the phase of the \(\left\vert 11\right\rangle\) state.

List of assumptions

Here we summarize all of the approximations involved in deriving Eqs. (11) and (12) in the main text. These expressions are valid under the following assumptions:

  1. 1.

    The initial assumptions used to derive the effective Hamiltonian in Eq. (36) are valid, meaning that:

    1. (a)

      The second order perturbation theory used for the Schrieffer-Wolff transformation is valid, i.e. \({g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}^{2}/{({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}-{\omega }_{{{{\rm{c}}}}})}^{2}\ll 1\).

    2. (b)

      The gate has low leakage outside of the considered subspace, spanned by the computational states and second excited state of the relevant qubit.

    3. (c)

      The rotating wave approximation for the counter-rotating coupling terms is justified.

  2. 2.

    The Rabi oscillation between the states \(\left\vert 01\right\rangle \leftrightarrow \left\vert 10\right\rangle\) is negligible compared to the Rabi oscillation between \(\left\vert 11\right\rangle \leftrightarrow \left\vert 02\right\rangle\). In the perturbative regime this is fulfilled if \(\sqrt{{\tilde{\Delta }}^{2}+4{\tilde{g}}_{01,10}^{2}}\ll \Omega\) and \({\tilde{g}}_{01,10}^{2}/({\tilde{\Delta }}^{2}+4{\tilde{g}}_{01,10}^{2})\ll {\tilde{g}}_{11,02}^{2}/{\Omega }^{2}\).

  3. 3.

    The coupler frequency in the interaction regime is relatively constant. If this is not the case, the time dependency of the perturbative parameters must be taken into account.

  4. 4.

    The gate has low leakage outside of the computational subspace.

  5. 5.

    The simplified formula in Eq. (11) additionally neglects the terms proportional to \({g}_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}{\rm{c}}}{g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}/{({\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})}^{3}\) and \({g}_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}^{2}/{({\Delta }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}{\rm{c}}}+{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}})}^{3}\), and smaller.

  6. 6.

    We have assumed that the main contributor to the perturbation is the Qubit 2 (q2) whose second excited state is populated during the gate operation. However, if the system is designed in such a way that the charge dispersion of any of the other two transmons is significantly larger, their effects might not be negligible anymore.

Note that both assumptions about the leakage are automatically fulfilled if the gate has a high-fidelity.

Fidelity scaling

Scaling of the fidelity with the number of gates, i.e. computing the average gate fidelity of sequential application of N gates, can be found by using Eq. (14) to first define the map corresponding to a series of gates

$${\left({{{{\rm{CPHASE}}}}}_{{{{\rm{CPS}}}}}\right)}^{N}[\hat{\rho }]=\sum\limits_{k=0}^{N}\left(\begin{array}{c}N\\ k\end{array}\right){\hat{U}}_{-}^{k}{\hat{U}}_{+}^{(N-k)}\hat{\rho }\,{\hat{U}}_{-}^{(N-k)}{\hat{U}}_{+}^{k},$$
(37)

where we have used the fact that the operators in Eq. (15) are diagonal and therefore commute with each other and also \({\hat{U}}_{+}^{{\dagger} }={\hat{U}}_{-}\). Using the effective Kraus operators from Eq. (37), and only considering the conditional phase error, we combine this with the fidelity definition from Eq. (16), and obtain

$${{{\mathcal{F}}}}[{\left({{{{\rm{CPHASE}}}}}_{{{{\rm{CPS}}}}}\right)}^{N}]=\frac{d+\sum\nolimits_{k = 0}^{N}\left(\begin{array}{c}N\\ k\end{array}\right){\left| {{{\rm{tr}}}}\left\{{\left({\hat{U}}_{{{{\rm{CPHASE}}}}}^{{\dagger} }\right)}^{N}{\hat{U}}_{-}^{k}{\hat{U}}_{+}^{(N-k)}\right\}\right| }^{2}}{{d}^{2}+d}$$
(38)
$$=\frac{4+\frac{1}{{2}^{N}}\mathop\sum\nolimits_{k = 0}^{N}\left(\begin{array}{c}N\\ k\end{array}\right)\left[10+6\cos \left(\frac{\delta \phi }{2}(N-2k)\right)\right]}{20}$$
(39)
$$\approx 1-\frac{3}{80}{(\delta \phi )}^{2}\frac{1}{{2}^{N}}\mathop\sum\limits_{k=0}^{N}\left(\begin{array}{c}N\\ k\end{array}\right){(N-2k)}^{2}$$
(40)
$$=1-\frac{3}{80}N{(\delta \phi )}^{2}.$$
(41)

This result indicates that calibrating the gate such that the error is evenly split between the two parities not only increases the single-gate fidelity but also leads to a more advantageous scaling of the infidelity N, compared to the purely coherent error case for which the error scales as N269.

Gate parameters for high-fidelity simulations

Finding good gate parameters, both for the Hamiltonian as well as for the pulse for high-fidelity simulations, is not a trivial task. Here, we discuss how to find optimal Hamiltonian parameters at different qubit frequencies and anharamonicities. In general, the only prerequisite is that the qubits are detuned by approximately one anharmonicity, which can be seen from Eq. (36).

Table 1 Table of Hamiltonian parameters (Eq. (5)) with which high-fidelity CZ gates are possible

As seen from Table 1, we keep some of the parameters in the simulation fixed, while others depend on \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\), which is varied. These parameters are based on the experimental values from ref. 36 and the coupling coefficients from the Hamiltonian in Eq. (5) are related to βij via the following relation \({g}_{ij}={\beta }_{ij}\sqrt{{\omega }_{i}{\omega }_{j}}\), as described in the main text. The small perturbation to \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}\) is there in order to ensure that the energy levels are significantly non-degenerate for perturbation theory to apply. This is also completely realistic as a typical fabrication procedure results in seemingly random deviations from the designed values. More specifically, the gates simulated in Fig. 2a, b are obtained by varying \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) with values \({\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\in [-195, -230, -250, -270, -300]\,\) 2π MHz. Additionally, in order to certify that the analyzed effect is not limited to the choice of parameters presented above, the second data point of Fig. 2a, b at EJ/EC ≈ 50, was generated in the same way as in Table 1, but with the change to \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}=5.1\cdot 2\pi {{{\rm{GHz}}}}\). The coupler frequency in the idling configuration \({\omega }_{{{{\rm{c}}}}}^{{{{\rm{idle}}}}}\), i.e. before and after performing a gate, is determined by diagonalizing the Hamiltonian to fulfill the condition ζZZ = 0 from Eq. (20). The pulse parameters from Eq. (27) are obtained by numerically optimizing the fidelity of the gate, with fixed σ = 5 ns and \({\tau }_{{{{\rm{b}}}}}=2\sqrt{2}\sigma\). Typical values of the amplitude are A ~ 1 − 1.2  2π GHz and τc ~ 60 ns.

Scaling of transmon error sources

In this section we derive the scaling of the noise parameters shown in Table 2 with the transmon energies EJ and EC. The noise is modeled by appending the appropriate noise channel after the gate unitary, and the calculated infidelity is thus independent of the unitary dynamics, as was shown in ref. 84.

Table 2 Scaling analysis for the most relevant error sources in an architecture of flux-tunable transmons, with two noise models

The T1 decay time of transmon devices is believed to be currently limited by the presence of a discrete number of environmental two-level systems (TLSs) which couple to the qubit via their electrical dipole71,72,73,74,75,76,77. Therefore assuming that the charge noise in the system is weak enough, the interaction Hamiltonian between a transmon and an environmental TLS can be derived from Eq. (1), by replacing \({n}_{g}\to {n}_{g}+\delta {\hat{n}}_{g}\), as in refs. 72,77, so that

$${\hat{H}}_{{{{\rm{q-TLS}}}}}=8{E}_{C}\,\hat{n}\otimes \delta {\hat{n}}_{g}=-4\sqrt{2}{E}_{C}{\left(\frac{{E}_{J}}{8{E}_{C}}\right)}^{\frac{1}{4}}{{{\rm{i}}}}(\hat{a}-{\hat{a}}^{{\dagger} })\otimes \delta {\hat{n}}_{g}.$$
(42)

In the above equation, we have used the asymptotic expression for the number operator \(\hat{n}\), derived already in ref. 12. The operator \(\delta {\hat{n}}_{g}\) is defined in the Hilbert space of the TLS, and is related to the parameters of the TLS - more specifically, its electrical dipole75. More importantly, \(\delta {\hat{n}}_{g}\) does not explicitly depend on EJ and EC.

Since the majority of the T1 experiments on transmons display exponential decays96, the dynamics can be captured by the Lindblad equation. As a second order approximation is assumed in the derivation of the Lindblad equation, the resulting decay rates are found to be proportional to the square of the coupling coefficient of the transmon to the TLS environment66. In our case this translates to

$${\Gamma }_{1}\propto {E}_{C}^{3/2}{E}_{J}^{1/2}.$$
(43)

Here we have omitted the noise spectrum of the environment, since current models assume a flat noise spectrum without any dependence on the qubit frequency75.

While it is not strictly necessary for the computational transmons to be flux-tunable, it is often desired as flux-tunability additionally enables the implementation of an iSWAP gate with the same architecture36,38. Having slow flux tunability is also desirable in order to avoid resonances with TLSs in the environment which can severely limit the T1 decay time76,77, as well as helping with the issue of frequency crowding82.

However, unlike the random noise spectrum of a TLS environment producing T1 dynamics, the noise spectrum of magnetic-flux noise is typically observed to have a 1/fα frequency dependence71,78,79,80, with α ~ 1. The large noise-spectrum amplitude at lower frequencies means that the long-time correlation results in non-Markovian dynamics97. Assuming that the noise is slow enough, in order for the adiabatic approximation to hold, the interaction Hamiltonian due to a slowly fluctuating magnetic environment \(\delta \hat{\Phi }\) is given by12

$${\hat{H}}_{{{{\rm{q-flux}}}}}=\frac{\partial \hat{H}}{\partial \Phi }\otimes \delta \hat{\Phi }=\hslash \frac{\partial \omega }{\partial \Phi }{\hat{a}}^{{\dagger} }\hat{a}\otimes \delta \hat{\Phi }.$$
(44)

Similarly to the T1-decay scenario, we make the assumption that the environment operator \(\delta \hat{\Phi }\) remains independent of the transmon parameters. In reality, the magnitude of this operator is contingent upon the inner product between the magnetic dipole operator of the spins and the surface vector of the SQUID loop.

The form of the flux dispersion ∂ω/∂Φ of a split-junction transmon, with Josephson energies \({E}_{{J}_{1}}\) and \({E}_{{J}_{2}}\), is determined by the relations12

$${E}_{J}(\Phi )={E}_{J\Sigma }\cos (\pi \Phi )\sqrt{1+{d}^{2}{\tan }^{2}(\pi \Phi )},$$
(45)
$$\hslash \omega (\Phi )=\sqrt{8{E}_{C}{E}_{J}(\Phi )}-{E}_{C},$$
(46)

with \({E}_{J\Sigma }={E}_{{J}_{1}}+{E}_{{J}_{2}}\) and \(d=| {E}_{{J}_{1}}-{E}_{{J}_{2}}| /({E}_{{J}_{1}}+{E}_{{J}_{2}})\). We assume that d 1, since values of d ≈ 1 result in overall longer coherence times, and the computational transmons need to be tuned over a smaller frequency range, typically on the order of ~100 MHz36. In this regime, the flux dispersion can be approximated as

$$\frac{\partial \omega }{\partial \Phi }=(d-1)\sqrt{8{E}_{C}{E}_{J\Sigma }}\frac{\pi \sin (2\pi \Phi )}{2}+{{{\mathcal{O}}}}({[d-1]}^{2}),$$
(47)

and the decay rate scales with the coupling coefficient98,99,

$${\Gamma }_{\phi }(\Phi )\propto \left| \frac{\partial \omega }{\partial \Phi }\right|.$$
(48)

Ideally, the system is designed so that both the idling frequency of the computational qubits (i.e. the frequency at which single-qubit rotations are performed), as well as the computational transmon frequency during the two-qubit gate operation are positioned at the first-order flux-insensitive sweet-spot. In that case, it can be seen from Eq. (47), that the flux noise decay rate is identical in both the idling configuration as during the gate operation. This is true for both computational qubits.

While we have explicitly addressed only the first-order flux dispersion ∂ω/∂Φ, it is evident that the aforementioned conclusion remains valid even if ∂ω/∂Φ = 0. In such cases, we must consider the second order dispersion ∂2ω/∂Φ2. A simplified scaling relationship of Γϕ with EC and EJ can be derived under the condition that the applied flux Φ is independent of EJ and EC, or equivalently, when we are interested in the flux-averaged decay rate \({\bar{\Gamma }}_{\phi }=\int\nolimits_{0}^{1}{{{\rm{d}}}}\Phi \,{\Gamma }_{\phi }(\Phi )\). This yields the relation:

$${\Gamma }_{\phi }\propto \sqrt{{E}_{C}{E}_{J\Sigma }},$$
(49)

which holds for all values of d.

The averaging over the flux is relevant, in a more realistic scenario, where the frequencies of the system may be calibrated in order to avoid resonances with individual TLSs and other elements on the chip87. Due to fabrication inaccuracies83 and the stochastic nature of the TLSs, it is impossible to deterministically predict the optimal frequencies in an array of transmons, which prompts us to consider the flux-averaged decay rate scaling in Eq. (49).

Realistic gate implementations frequently involve fast flux tuning of one of the computational transmons to enhance gate fidelity36,91. This flux tuning influences the flux dispersion of the qubit and consequently impacts the decay rate during the two-qubit gate implementation. While predicting the exact detunings of all qubits proves challenging, in the advanced noise model, we incorporate the flux dependence of the dephasing rate from Eq. (49) to better model the dephasing of one of the transmons during the operation of the two-qubit gate. Typically, tuning the low-frequency transmon (q1) proves more advantageous, as the second excited state of Qubit 2 is inherently more susceptible to decoherence.

In the advanced noise model, we therefore consider that the frequency of Qubit 1 satisfies \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}={\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}/2\,\approx \,{\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}/2\) for all values of EJ and EC during single-qubit gate operation. This will reduce any potential leakage due to driving the \(\left\vert 10\right\rangle \leftrightarrow \left\vert 02\right\rangle\) transition instead of \(\left\vert 10\right\rangle \leftrightarrow \left\vert 11\right\rangle\)86. More specifically we consider Qubit 1 to be weakly tunable (d = 0.9) and we assume its frequency is shifted so that the two-qubit gate resonance condition is satisfied, i.e. \({\omega }_{{{{{\rm{q}}}}}_{{{{\rm{1}}}}}}-{\omega }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}={\alpha }_{{{{{\rm{q}}}}}_{{{{\rm{2}}}}}}\) during the two-qubit gate operation. Otherwise the transmons are assumed to be idling 10 MHz away from the sweet-spot. This information is sufficient to compute the decay rate of Qubit 1 at a different flux bias, by using Eq. (47). We believe that pinpointing more specific detuning values would require pulse-level simulation.

The low anharmonicity of the transmon is a limiting factor in the operation of single-qubit gates40,81 as fast operations drive a part of the population from the computational subspace into the second excited state. A straightforward and effective scheme for mitigating this effect while implementing single-qubit gates known as Derivative Removal by Adiabatic Gate (DRAG) was presented in ref. 81.

Since typical pulse amplitudes used to perform single-qubit operations are typically much lower compared to the qubit frequency, and the drive is resonant with the qubit, we assume that the rotating-wave approximation is accurate. In the frame rotating with the qubit frequency, the effective Hamiltonian depends on the pulse parameters, anharmonicity, and the detuning between the drive and qubit frequencies. This indicates that the amount of leakage does not explicitly depend on the qubit frequency.

Thus, the average gate fidelity, similar to the one defined in Eq. (16), but in this case for a single-qubit rotation around the x or y-axes, depends only on the qubit anharmonicity (or charging energy), since α − EC in the transmon limit. This means that even though analytical results are not available, the relationship can be determined numerically and interpolated. This relationship, albeit with different parameters, has already been plotted in ref. 81, and generally follows a dependence of \({P}_{{{{\rm{leak.}}}}}\propto {E}_{C}^{-\gamma }\), 5 γ 6, with higher exponents observed at lower EC. The independence of the single-qubit gate infidelity of the qubit frequency (within the transmon regime) was also verified numerically.

The single-qubit gate parameters assumed in Figs. 4d and Fig 5 were a DRAG Gaussian pulse, with a σ = 4 ns and a total duration tSQG = 4σ. The amplitudes of both DRAG components are numerically optimized before interpolating the dependence of Pleak. on α, which was used to generate Fig. 4d. The pulse drive frequency is assumed to be resonant with the qubit. More details are available in ref. 40. We note here, that most of the SQG infidelity is due to leakage, rather than phase errors.

While other error sources affect the gate performance, the thermal-excitation error considered here only affects the state preparation. The average gate fidelity, which is defined as the fidelity averaged over Haar random distributed input states, is therefore not applicable, since this error only affects one input state. We therefore replace the average gate infidelity67 with a state infidelity. This can also be qualitatively thought of as replacing the Haar random distribution with a delta-like distribution with a peak at the \(\left\vert 0\right\rangle\) state. By modeling the thermal excitation as a bit-flip channel \({{{\mathcal{E}}}}[\hat{\rho }]=(1-{P}_{\left\vert 1\right\rangle })\hat{\rho }+{P}_{\left\vert 1\right\rangle }{\hat{\sigma }}_{x}\hat{\rho }{\hat{\sigma }}_{x}\) it is straightforward to see that the state infidelity63

$${{{\mathcal{F}}}}=\left\langle 0\right\vert {{{\mathcal{E}}}}[\left\vert 0\right\rangle \left\langle 0\right\vert ]\left\vert 0\right\rangle =1-{P}_{\left\vert 1\right\rangle }=\frac{1}{1+{{{{\rm{e}}}}}^{-\beta \omega }},$$
(50)

where we have additionally assumed that the temperature is low enough such that the population of the higher-excited states is negligible.