Introduction

Superconducting quantum processors have demonstrated elements of surface code quantum error correction1,2,3,4,5,6, establishing themselves as promising candidates for fault-tolerant quantum computing. Nonetheless, imperfections in hardware and control introduce physical errors that corrupt quantum information7 and could limit scalability. Even if a large enough quantum processor with a high enough performance limit to implement error correction can be manufactured, there is no guarantee that a control strategy will be able to reach that limit.

Frequency-tunable architectures4,8,9,10,11,12,13,14,15,16,17,18,19 are uniquely positioned to mitigate computational errors since most physical error mechanisms are frequency dependent20,21,22,23,24,25,26,27,28,29,30,31 (Fig. 1a–d). However, to leverage this architectural feature, qubit frequency trajectories must be choreographed over quantum algorithms to simultaneously execute quantum operations while mitigating errors.

Fig. 1: Frequency optimization.
figure 1

a Our quantum processor with N = 68 frequency-tunable superconducting transmon qubits represented as a graph. Nodes are qubits (e.g., black dot) and edges are engineered interactions between them (e.g., blue and green bars). b A quantum algorithm (A) comprising single- and two-qubit gates with one qubit (qj) distinguished. c Corresponding qubit frequency trajectories (F), parameterized by single-qubit idle (fj for qubit qj) and two-qubit interaction (fij for qi and qj) frequencies. Quantum computational errors depend strongly on frequency trajectories since most physical error mechanisms are frequency dependent (red dots are non-exhaustive examples). Namely, pulse distortion errors (1) increase with larger frequency excursions. Relaxation errors (2) increase near relaxation hotspots, for example due to two-level-system defects (TLS, horizontal resonance). Stray coupling errors (3) increase near frequency collisions between coupled computational elements. Dephasing errors (4) increase towards lower frequencies, where qubit flux-sensitivity grows. d We leverage our understanding of physical error mechanisms (M) to estimate the algorithm’s error (E) and then optimize it with respect to qubit frequency trajectories. e We employ the Snake optimizer, which can solve optimization problems at an arbitrary dimension (D), controlled by the scope parameter (S). These graphs show possible idle (nodes) and interaction (edges) frequency optimization variables (blue) at one Snake optimization step for scopes ranging from \(S={S}_{\max }\) (global limit, FD optimization) to S = 1 (local limit, 1D optimization). f Snake optimization threads (progress horizontally) for three scopes (increase downwards). Snake’s high configurability enables it to scalably overcome frequency optimization complexity and be adapted to a variety of quantum operations, algorithms, and architectures.

Choreographing frequency trajectories is a complex optimization problem due to engineered and parasitic interactions among computational elements27 and their environment22,26,29, hardware32 and control28 inhomogeneities, performance fluctuations26,33, and competition between error mechanisms. Mathematically, the problem is non-convex, highly constrained, time-dynamic, and expands exponentially with processor size.

Past research into overcoming these complexities employed frequency partitioning strategies that faced difficulties scaling with realistic hardware imperfections34,35 or whose scalability is not well understood4,13,17,36. To overcome the limitations of these strategies, we proposed the Snake optimizer37,38 and employed an early version in past reports39,40,41,42,43,44,45,46,47,48. However, a complete optimization strategy has not been developed around it, it has not been rigorously benchmarked, and processors large enough to investigate its scalability have only recently become available. Whether high-performance configurations exist at scale and whether they can be quickly discovered and stabilized are open questions.

Here we address these questions by developing a control optimization strategy around Snake that can scalably overcome the complexity of problems like frequency optimization within the high performance, high stability, and low runtime requirements of an industrial system. The strategy introduces generic frameworks for building processor-scale optimization models, training them for various quantum algorithms, and adapting to their unique optimization landscapes via Snake. This flexible approach can be applied to a variety of quantum operations, algorithms, and architectures. We believe it will be an important element in scaling quantum control and realizing commercially valuable quantum computations.

We investigate the prospects of this strategy for optimizing quantum gates for error correction in superconducting qubits. We demonstrate that it strongly suppresses physical error rates, approaching the surface code threshold for fault tolerance on our processor with tens of qubits. To pave the way towards much larger processors, we demonstrate Snake “healing” and “stitching”, which were designed to stabilize performance over long timescales and geometrically parallelize optimization. Finally, we introduce a simulation environment that emulates our quantum computing stack and combine it with optimization, healing, and stitching to project the scalability of our strategy towards thousands of qubits.

Results

Quantum hardware

Our hardware platform is a Sycamore processor6 with N = 68 frequency-tunable transmon qubits on a two-dimensional lattice. Engineered tunable coupling exists between 109 nearest-neighbor pairs49,50. We configure our control system and processor to execute the surface code gate set, which includes single-qubit XY rotations (SQ) and two-qubit controlled-Z (CZ) gates10 (Supplementary Note 1). SQ gates are implemented via microwave pulses resonant with qubits’ respective \(\left\vert 0\right\rangle \leftrightarrow \left\vert 1\right\rangle\) idle frequencies (fi for qubit qi executing SQi). CZ gates are implemented by sweeping neighboring qubits into \(\left\vert 11\right\rangle \leftrightarrow \left\vert 02\right\rangle\) resonance near respective interaction frequencies (fij for qubits qi and qj executing CZij) and actuating their couplers. The N = 68 idle and ~2N = 109 interaction frequencies—which constitute one frequency configuration F with dimension F = 177 ~ 3N—parameterize qubit frequency trajectories, which we seek to optimize.
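
To make this parameterization concrete, the following minimal sketch shows one way a frequency configuration could be represented in code; the class name, field names, and frequency values are illustrative, not taken from our control stack:

```python
from dataclasses import dataclass

@dataclass
class FrequencyConfiguration:
    """One frequency configuration F (frequencies in GHz, hypothetical values)."""
    idle: dict          # qubit index i -> idle frequency f_i
    interaction: dict   # frozenset({i, j}) -> interaction frequency f_ij

    @property
    def dimension(self) -> int:
        # On our processor: 68 idles + 109 interactions = 177 ~ 3N.
        return len(self.idle) + len(self.interaction)

# A two-qubit fragment of a configuration.
F = FrequencyConfiguration(
    idle={0: 6.421, 1: 6.658},
    interaction={frozenset({0, 1}): 6.540},
)
assert F.dimension == 3
```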

Performance benchmark

We evaluate the performance of frequency configurations via the parallel two-qubit cross-entropy benchmarking algorithm (CZXEB, see Supplementary Note 6)39,51. CZXEB executes cycles of parallel SQ gates followed by parallel CZ gates, benchmarking them in a context representative of many quantum algorithms. Most relevant to this study is that CZXEB reflects the structure of the surface code’s parity checks and has empirically served as a valuable performance proxy of logical error6. The processed output of CZXEB is the benchmark distribution ec, in which each value is one qubit pair’s average error per cycle ec,ij, which includes error contributions from respective SQi, SQj, and CZij gates. Benchmarks are generally not normally distributed across a processor and are thus reported via percentiles as \(50.0{\%}_{(2.5-50)\%}^{(97.5-50)\%}\) and plotted as quantile boxplots. The wide range from the 2.5th to the 97.5th percentile is the distribution spread, which spans ±2 standard deviations (±2σ) for normally distributed data.
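
As a minimal sketch of this reporting convention, the snippet below summarizes a synthetic benchmark distribution by its median and percentile offsets; the function name and the log-normal toy data are ours, for illustration only:

```python
import numpy as np

def percentile_summary(ec):
    """Median with (97.5-50)% and (2.5-50)% percentile offsets."""
    p2p5, p50, p97p5 = np.percentile(ec, [2.5, 50.0, 97.5])
    return f"{p50:.1e} (+{p97p5 - p50:.1e} / -{p50 - p2p5:.1e})"

# Benchmarks are generally not normally distributed; emulate that with a
# log-normal toy distribution of per-pair cycle errors e_c,ij.
rng = np.random.default_rng(0)
ec = rng.lognormal(mean=np.log(7e-3), sigma=0.5, size=109)
print(percentile_summary(ec))
```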

Optimization model

We approach frequency optimization as a model-based problem. In turn, we must define an algorithm error estimator E that is representative of the performance of the target quantum algorithm A at the optimizable frequency configuration F (Fig. 1d). This problem is hard because the estimator must be fast for scalability, predictive for scaling projections, and physical for metrology investigations. We introduce a flexible framework for overcoming these competing requirements that can be adapted to define the optimization landscapes of a variety of quantum operations, algorithms, and architectures.

Our framework corresponds to the decomposition \(E(F\,|\,A,D)={\sum }_{g\in A}{\sum }_{m\in M}{w}_{g,m}(A)\,{\epsilon }_{g,m}({F}_{g,m}\,|\,D)\), where the sums run over all gates g ∈ A and all known physical error mechanisms m ∈ M. ϵg,m are algorithm-independent error components that depend on some subset of frequencies Fg,m ⊆ F and can be computed from relevant characterization data D (Supplementary Notes 2 and 3). wg,m are algorithm-dependent weights that capture algorithmic context via training on benchmarks that are sufficiently representative of A. Defining the estimator thus maps to defining the target quantum algorithm, defining the algorithm-independent error components, and then training the algorithm-dependent weights.
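
A minimal sketch of this decomposition, assuming error components are supplied as a callable; all names here are illustrative, and the constant component stands in for the physics models described below:

```python
def algorithm_error(gates, mechanisms, weights, error_component, F, D):
    """E(F | A, D) = sum over g in A, m in M of w_{g,m}(A) * eps_{g,m}(F_{g,m} | D)."""
    return sum(
        weights[(g, m)] * error_component(g, m, F, D)
        for g in gates
        for m in mechanisms
    )

# Toy usage: one gate, two mechanisms, constant error components.
gates = ["CZ_01"]
mechanisms = ["relaxation", "dephasing"]
weights = {("CZ_01", "relaxation"): 1.0, ("CZ_01", "dephasing"): 0.5}
eps = lambda g, m, F, D: 1e-3  # stand-in for a physics model evaluated at F, D
print(algorithm_error(gates, mechanisms, weights, eps, F=None, D=None))
```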

We set our target quantum algorithm to CZXEB to gear the estimator towards the surface code’s parity checks. Furthermore, since CZXEB is also our benchmarking algorithm, we can associate the performance of optimized frequency configurations with our optimization strategy. We then define error components corresponding to dephasing23,24,25, relaxation23,26,30,31, stray coupling27, and frequency-pulse distortion28 over qubit frequency trajectories. The relevant characterization data include qubit flux-sensitivity spectra, energy-relaxation rate spectra, parasitic stray coupling parameters, and pulse distortion parameters, which are measured prior to optimization. Finally, we train the weights via a protocol52 that we developed specifically to reduce the risk of overfitting (Supplementary Note 4). It constrains weights via homogeneity and symmetry assumptions and then leverages the frequency tunability of our architecture to train them on single- and two-qubit gate benchmarks taken in configurations of variable complexity.
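
Our actual training protocol is detailed in Supplementary Note 4; purely as an illustration of the idea, the sketch below fits a handful of shared, physically non-negative weights to synthetic benchmarks by non-negative least squares, assuming (as we do) that homogeneity and symmetry constraints collapse per-gate weights into a few shared classes:

```python
import numpy as np
from scipy.optimize import nnls

# Rows: benchmarked configurations. Columns: weight classes that remain after
# homogeneity/symmetry constraints. X[b, m] is the summed error component of
# class m in benchmark b; y[b] is the measured benchmark error.
rng = np.random.default_rng(1)
true_w = np.array([1.2, 0.8, 2.0, 0.5])
X = rng.uniform(0.0, 1e-3, size=(200, 4))
y = X @ true_w + rng.normal(0.0, 1e-5, size=200)

w, _ = nnls(X, y)  # non-negative least squares keeps weights physical
print("fitted weights:", np.round(w, 2))
```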

The resulting algorithm error estimator represents a comprehensive understanding of physical errors across our processor. It spans ~4 × 10⁴ error components, has only 16 trainable weights for the full processor, and is trained and tested on ~6500 benchmarks. Despite its scale, it can still be evaluated ~100 times/s on a desktop. Furthermore, it can predict CZXEB cycle errors in the wide range ~3–40 × 10⁻³ within two factors of experimental uncertainty (Supplementary Note 4). In total, the estimator fulfils our speed, predictivity, and physicality requirements.

Optimization strategy

Finding an optimized frequency configuration from the algorithm error estimator maps to solving \({F}^{*}={{{\mbox{argmin}}}}_{F}E\). This problem is hard for several reasons. First, all F ~ 3N idle and interaction frequencies are interdependent due to engineered and parasitic interactions between nearest and next-nearest neighbor qubits. Second, the estimator has numerous local minima since most error mechanisms and hardware constraints compete, and since it is built from noisy characterization data. Finally, there are \(\sim {k}^{F} \sim {k}^{3N}\) possible configurations, where k is the number of options per frequency, as constrained by hardware and control specifications and inhomogeneities. In total, the problem is highly constrained, non-convex, and expands exponentially with processor size. We developed the Snake optimizer38 to scalably overcome the complexity of control optimization problems like frequency optimization.

Snake implements a graph-based algorithm that maps the variable frequency configuration F onto a graph and then launches an optimization thread from some seed frequency (Fig. 1e, f). It then finds all unoptimized frequencies FS within a neighborhood whose size is bounded by the scope parameter S and constructs the Snake estimator ES. ES contains all terms in E that depend only on FS, which serve as optimization variables, and on previously optimized frequencies that are algorithmically relevant (F* ∩ A), which serve as fixed constraints. Snake then solves \({F}_{S}^{*}={{{\mbox{argmin}}}}_{{F}_{S}}{E}_{S}\), updates F*, traverses, and repeats until all frequencies have been optimized. Frequency configurations are typically optimized from multiple seeds in parallel, and the one that minimizes the algorithm error estimator is benchmarked.
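
The sketch below illustrates this traversal for a generic frequency graph; it is a simplified picture, not the production Snake implementation (which also configures seed strategy, traversal order, and the inner-loop optimizer, Supplementary Note 5). Here `local_argmin` stands in for the solver of \({F}_{S}^{*}={{{\mbox{argmin}}}}_{{F}_{S}}{E}_{S}\):

```python
from collections import deque

def snake_optimize(graph, seed, scope, local_argmin):
    """Sketch of a Snake thread: breadth-first traversal from a seed node.

    graph: dict node -> set of neighboring nodes (nodes are frequency variables).
    local_argmin(free, fixed): returns optimized values for the free variables
        given previously optimized frequencies as fixed constraints.
    """
    optimized = {}              # F*: node -> optimized frequency
    frontier = deque([seed])
    visited = {seed}
    while frontier:
        node = frontier.popleft()
        # Collect unoptimized frequencies F_S within a scope-bounded neighborhood.
        free, layer = [node], {node}
        for _ in range(scope - 1):
            layer = {n for u in layer for n in graph[u]} - set(free) - set(optimized)
            free.extend(layer)
        free = [n for n in free if n not in optimized]
        if free:
            optimized.update(local_argmin(free, dict(optimized)))
        for n in graph[node]:   # traverse outwards
            if n not in visited:
                visited.add(n)
                frontier.append(n)
    return optimized

# Toy usage on a 4-node line graph with a trivial inner-loop "optimizer".
line = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}
argmin = lambda free, fixed: {n: 6.5 + 0.01 * n for n in free}
print(snake_optimize(line, seed=0, scope=2, local_argmin=argmin))
```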

Snake’s favorable scaling properties derive from the scope S, which tunes the greediness of its optimization between the local and global limits38. By tuning the scope within \(1\le S\le {S}_{\max }\), we can bound the number of frequencies optimized at each traversal step to 1 ≤ FS ≤ F, where FS ~ S² and \({S}_{\max } \sim \sqrt{3N}\) (Fig. 1e). In turn, we can split one complex ~3N-dimensional problem over \(\sim {k}^{3N}\) configurations into ~3N/S² simpler ~S²-dimensional problems over \(\sim {k}^{{S}^{2}}\) configurations each. Such splitting terminates at S = 1, where Snake optimizes ~3N one-dimensional problems over ~k configurations each. Importantly, the intermediate dimensional problems with \(S \, < \, {S}_{\max }\) are exponentially smaller than the global problem and independent of processor size.
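
The arithmetic behind this splitting can be made explicit with a short sketch (the function and the choice of k = 100 options per frequency are illustrative):

```python
import math

def subproblem_summary(N, S, k=100):
    """Approximate Snake subproblem count and size at scope S for N qubits."""
    total_vars = 3 * N                      # idles + interactions ~ 3N
    vars_per_step = min(S**2, total_vars)   # F_S ~ S^2, capped at the global limit
    steps = math.ceil(total_vars / vars_per_step)
    log10_configs = vars_per_step * math.log10(k)  # log10(k^{F_S}) per subproblem
    return steps, vars_per_step, log10_configs

for S in (1, 2, 4, 15):                     # S_max ~ sqrt(3N) ~ 15 for N = 68
    print(f"S={S}: {subproblem_summary(68, S)}")
```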

Snake is not expected to discover globally optimal configurations. However, if it can find sufficiently performant configurations for the target quantum algorithm — for example with errors below the fault-tolerance threshold3—it will solve the scaling complexity problem. Namely, we will not be faced with an exponentially expanding problem as our processors scale, but linearly more problems with bounded configuration spaces. Furthermore, since Snake’s seed strategy, traversal strategy, inner-loop optimizer, and scope are highly configurable (Supplementary Note 5), it should be adaptable to overcome similar scaling complexities in other control problems and hardware.

Validating performance

To experimentally investigate whether Snake can actually find performant frequency configurations at some intermediate dimension, we optimize our processor at scopes ranging from S = 1 (177 1D local problems) to \(S={S}_{\max }\) (one 177D global problem) and benchmark CZXEB. We evaluate configurations by comparing their benchmarks against three performance standards (Fig. 2a). First, the baseline standard references benchmarks taken in a random frequency configuration, which establishes the average performance of the hardware and control system without frequency optimization (\({e}_{c}=16.{7}_{-10.6}^{+267.1}\times 1{0}^{-3}\) at N = 68). Second, the outlier standard references a constant cycle error, above which gates are considered performance outliers (ec = 15.0 × 10⁻³ for all N). Third, the crossover standard references published benchmarks from the same processor that reached the surface code’s crossover regime, which approaches the error correction threshold (\({e}_{c}=6.{2}_{-2.5}^{+7.6}\times 1{0}^{-3}\) at N = 49)3,6. This standard establishes what we consider high performance, while recognizing that much higher performance will be necessary to implement error correction in practice.

Fig. 2: Optimization and healing performance.
figure 2

a CZXEB cycle error benchmarks (ec, boxes, left axis) and calibration failures (gray bars, right axis in (c)) for the random baseline (red), outlier (orange diamond), and crossover (green) performance standards used to evaluate frequency configurations and our optimization strategy. Each box shows the 2.5, 25, 50, 75, and 97.5th percentiles and mean (see annotations on the baseline). The standards' means are extended across panels for comparison. b Benchmarks for configurations optimized at different scopes (S) ranging from S = 1 (local limit, 1D optimization) to \(S={S}_{\max }\) (global limit, FD optimization). Intermediate dimensional optimization (2 ≤ S ≤ 4) outperforms both local and global optimization, finding configurations near the crossover standard. S = 4 (≤21D optimization) performs best, with the lowest mean error, but S = 2 (≤5D optimization) offers a better balance between performance and runtime, and is set as our default. c Benchmarks for each configuration in (b) after healing, which significantly suppresses performance outliers. Each box in (a), (b), and (c) corresponds to a distinct configuration. d Benchmark heatmaps illustrating optimization and (e) healing of targeted gates in the S = 5 (≤29D optimization) configuration. Each hexagon corresponds to the cycle error for one pair (ec,ij). Performant gates are blue, outliers are red, and unoptimized and targeted gates are gray.

The wide performance gap between the baseline and crossover standards is closed via frequency optimization (Fig. 2b). Namely, intermediate dimensional optimization (2 ≤ S ≤ 4) approaches the crossover standard (\({e}_{c}=7.{2}_{-2.5}^{+19.9}\times 1{0}^{-3}\) in ~130 s at S = 2) while suppressing performance outliers, with <10% of gates above the outlier standard and <0.5% failing calibrations, which would prevent benchmarking. However, local (\({e}_{c}=9.{8}_{-6.0}^{+231.8}\times 1{0}^{-3}\) in ~6 s at S = 1) and global (\({e}_{c}=10.{8}_{-5.7}^{+145.0}\times 1{0}^{-3}\) in ~6500 s at \(S={S}_{\max }\)) optimization only marginally outperform the baseline standard. The optimal scope is S = 4 (≤21D optimization), but we default to S = 2 (≤5D optimization), which offers a better balance between performance and runtime (Supplementary Note 5). We now interpret these results.

First, the fact that we see performance variations between configurations illustrates that poor frequency choices cannot be compensated for by other components of our control system and that optimization is critical. Second, the fact that local optimization underperforms illustrates that frequency optimization is a non-local problem and that tradeoffs between gates must be considered. Third, the fact that global optimization underperforms even after an hour of searching illustrates the difficulty of navigating the configuration space, even on our relatively small processor. Finally, the fact that optimization at relatively low intermediate dimensions found the most performant configurations is consistent with relatively local engineered and parasitic interactions and suggests that Snake can navigate our architecture’s configuration space in a way that should scale to larger processors.

Stabilizing performance

Stabilizing performant configurations is as difficult and important as finding them. Namely, a processor’s optimization landscape constantly evolves and performance outliers emerge on timescales ranging from seconds to months, with the most catastrophic due to TLS defects fluctuating into the path of qubit frequency trajectories26,33. Unfortunately, even a low percentage of outliers can significantly degrade the performance of a quantum algorithm6. However, re-optimizing all gates of a processor when a low percentage of outliers are detected is unscalable from a runtime perspective and introduces the risk of degrading performant gates.

By design, Snake healing can surgically re-optimize outliers, nominally much faster than full re-optimization and without degrading performant gates38. To investigate the viability of healing, we heal all configurations generated by the variable-scope experiment described above (Fig. 2c–e), targeting poorly performing gates (Supplementary Note 8). From the perspective of stability, the progressively worse configurations emulate the performance of our processor over progressively longer timescales following optimization. Healing suppresses outliers by ~48% averaged over configurations, typically runs >10× faster than full re-optimization, and rarely degrades performant gates. Furthermore, heals can be applied repeatedly and parallelized for sufficiently sparse outliers. These results demonstrate the viability of healing for scalably suppressing outliers to stabilize performance.
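
In the spirit of the traversal sketch above, healing can be pictured as re-running the inner-loop optimization only around flagged outliers while pinning all other frequencies; again, this is an illustrative simplification rather than the production implementation:

```python
def heal(graph, F_star, benchmarks, threshold, scope, local_argmin):
    """Sketch of Snake healing: surgically re-optimize outlier gates.

    benchmarks: dict node -> measured cycle error for the gate(s) at that node.
    Frequencies outside the outliers' neighborhoods stay fixed, so performant
    gates are not degraded.
    """
    outliers = {n for n, e in benchmarks.items() if e > threshold}
    healed = dict(F_star)
    for node in outliers:
        # Free the outlier and its scope-bounded neighborhood; fix all else.
        free, layer = {node}, {node}
        for _ in range(scope - 1):
            layer = {n for u in layer for n in graph[u]} - free
            free |= layer
        fixed = {n: f for n, f in healed.items() if n not in free}
        healed.update(local_argmin(sorted(free), fixed))
    return healed
```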

Impact of metrology

We now consider the impact of the algorithm error estimator’s composition on Snake’s performance. In particular, the dephasing, relaxation, stray coupling, and pulse distortion error components may be interpreted as distinct error mitigation strategies that can be activated independently. To isolate their impact and to understand their interplay, we progressively activate them in all combinations, optimize, and benchmark CZXEB (Fig. 3a).
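
The ablation itself is a simple sweep over the power set of mitigation strategies; a sketch, with the optimize and benchmark steps left as placeholders, is:

```python
from itertools import combinations

STRATEGIES = ("dephasing", "relaxation", "stray coupling", "pulse distortion")

# All 2^4 - 1 non-empty combinations of error mitigation strategies.
for r in range(1, len(STRATEGIES) + 1):
    for active in combinations(STRATEGIES, r):
        print(active)
        # 1) activate only these error components in the estimator,
        # 2) optimize the frequency configuration with Snake,
        # 3) benchmark CZXEB (placeholders here).
```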

Fig. 3: Optimization performance versus error mitigation strategy.
figure 3

a CZXEB cycle error benchmarks (ec, black boxes, left axis) and calibration failures (gray bars, right axis) for configurations optimized with all combinations of dephasing, relaxation, stray coupling, and frequency-pulse distortion error mitigation strategies activated (see lower matrix). The random baseline (red), outlier (orange), and crossover (green) standards are shown and their means are extended across the panel for comparison. b Idle frequency (fi, first row), interaction frequency (fij, second row), and cycle error (ec,ij, third row) heatmaps for the baseline standard with no mitigation strategies activated (first column), configurations with only one strategy activated (central columns), and the default configuration with all strategies activated (last column). As more mitigation strategies are progressively activated (from left to right in (a)), cycle errors and calibration failures trend downwards, highlighting the impact of metrology on the performance of our optimization strategy.

To build intuition for the impact of each error mitigation strategy, we inspect frequency configurations optimized with only one mitigation strategy activated (Fig. 3b). Most are visually structured, with inhomogeneities arising from fabrication imperfections in the processor’s parameters. Dephasing mitigation biases qubits towards their maximum frequencies, where flux sensitivity vanishes9. Relaxation mitigation biases qubits away from relaxation hotspots driven by coupling to the control9 and readout circuitry22,53, packaging environment29, and random TLS defects26. Stray-coupling mitigation disperses qubits to avoid frequency collisions between parasitically coupled gates27. Finally, pulse-distortion mitigation biases idles towards a multi-layered checkerboard, with neighbors at one of two symmetric \(\left\vert 11\right\rangle \leftrightarrow \left\vert 02\right\rangle\) CZ resonances10, and interactions towards resonance between the idles, to minimize frequency excursions. The inversion of frequencies at the eastern edges of the processor was triggered by fabrication imperfections that broke the symmetry between CZ resonances. This observation highlights non-trivial interplay between error mitigation and hardware inhomogeneities.

Interestingly, while some of these mitigation strategies alone may find performant configurations at the scale of several qubits, none of them substantially outperforms the random baseline configuration at the scale of our processor. As we progressively activate mitigation strategies, competition between error mechanisms causes frequency configurations to lose visual structure, while performance approaches the crossover standard. Analyzing error contributions in optimized configurations, we confirm that activating mitigation strategies selectively and effectively suppresses their corresponding error components, while only weakly impacting others (Supplementary Note 8). These results support our interpretation of error components as error mitigation strategies and demonstrate that our optimizer can effectively reconcile their competition and suppress them. More generally, they highlight the impact of error metrology on the performance of our optimization strategy.

Performance scalability

We are finally ready to investigate Snake’s scalability. To do so, we conduct a scaling experiment that may be valuable for evaluating the prospects of any quantum hardware and control system. Namely, we optimize, heal, and benchmark hundreds of configurations of our processor ranging in size from N = 2 to 68 (Fig. 4a). As before, we reference the crossover standard. However, we now reference multiple baseline standards that correspond to unoptimized random configurations of variable size. Despite the irregular shapes of some configurations, we find surprisingly clear scaling trends.

Fig. 4: Optimization scalability.
figure 4

a Experimental and b simulated CZXEB cycle error benchmarks (ec, boxes) in optimized (black) and unoptimized baseline (red) configurations of variable size. Simulated processors have size and connectivity corresponding to surface code logical qubits with distance d. The crossover standard (green), outlier standard (orange), and stitched configurations (purple) are shown for comparison. The solid lines are fits of the saturation model to the optimized (black) and baseline (red) benchmark means. Some boxes have been horizontally shifted to reduce overlap. In (a), N < 40 boxes combine benchmarks from multiple configurations to boost statistics. The x axis in (b) is linear in d with N = 2d² − 1 and the shaded region illustrates the experimentally accessible regime of our processor. c Benchmark heatmaps illustrating stitching of our N = 68 processor and (d) N = 1057 simulated processor. Outliers are not substantially amplified at seams (dashed lines), which is our primary concern. The dashed regions in (d) illustrate that stitching the d = 23 logical qubit with R = 4 is equivalent to stitching four d = 11 logical qubits.

CZXEB cycle errors grow and then saturate with increasing N in both optimized and unoptimized configurations. Furthermore, mean cycle errors are well-represented by the model \(\langle {e}_{c}(N)\rangle={e}_{{\rm{sat}}}-{e}_{{\rm{scale}}}\exp (-N/{N}_{{\rm{sat}}})\), where Nsat is the qubit saturation constant, escale is the error penalty in scaling gates from small to large systems, and esat is the saturated error. Fitting this model to the empirical benchmarks, we find that optimized configurations saturate near the crossover standard, with best-fit parameters Nsat = 22 ± 10 (±1σ), escale = 3.1 ± 0.4 × 10⁻³, and esat = 7.5 ± 0.4 × 10⁻³.
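
For concreteness, a minimal sketch of fitting this saturation model with `scipy.optimize.curve_fit`; the data below are synthetic, generated near our best-fit values purely for illustration:

```python
import numpy as np
from scipy.optimize import curve_fit

def saturation_model(N, e_sat, e_scale, N_sat):
    """<e_c(N)> = e_sat - e_scale * exp(-N / N_sat)."""
    return e_sat - e_scale * np.exp(-N / N_sat)

# Synthetic stand-in for mean optimized cycle errors vs configuration size N.
N = np.array([2.0, 4, 8, 17, 31, 49, 68])
ec_mean = saturation_model(N, 7.5e-3, 3.1e-3, 22.0)
ec_mean += np.random.default_rng(2).normal(0.0, 2e-4, N.size)

popt, pcov = curve_fit(saturation_model, N, ec_mean, p0=[8e-3, 3e-3, 20.0])
perr = np.sqrt(np.diag(pcov))  # 1-sigma parameter uncertainties
for name, val, err in zip(["e_sat", "e_scale", "N_sat"], popt, perr):
    print(f"{name} = {val:.2e} +/- {err:.1e}")
```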

To estimate Snake’s performance advantage, we make several comparisons. From the empirical benchmarks, we compare the mean cycle error ratios \(\langle {e}_{c}^{{\rm{base}}}\rangle /\langle {e}_{c}^{{\rm{snake}}}\rangle\) in isolation (N = 2) and in parallel at scale (N = 68), which are 3.1 ± 0.5 and 6.4 ± 1.0, respectively. Remarkably, the optimized N = 68 configuration outperforms unoptimized N = 2 configurations by 2.3 ± 0.4×. Furthermore, the optimized N = 68 configuration has a ~40× narrower benchmark distribution spread than the unoptimized N = 68 configuration. From the saturation model, we compare the scaling penalty ratio \({e}_{{\rm{scale}}}^{{\rm{base}}}/{e}_{{\rm{scale}}}^{{\rm{snake}}}\) and the saturated cycle error ratio \({e}_{{\rm{sat}}}^{{\rm{base}}}/{e}_{{\rm{sat}}}^{{\rm{snake}}}\), which are 5.6 ± 1.8 and 3.7 ± 0.7, respectively. These comparisons illustrate that Snake achieves a significant performance advantage up to N = 68.

To investigate Snake’s future scalability, we simulate processors much larger than those manufactured to date. To do so, we developed a generative model that can produce simulated processors of arbitrary size and connectivity with simulated characterization data that are nearly indistinguishable from our processor’s54. We generate simulated processors ranging in size from N = 17 to 1057, with connectivity corresponding to distance-3 to distance-23 surface code logical qubits (d = 3 to 23 with N = 2d² − 1)55. We optimize simulated processors exactly like our processor and predict CZXEB benchmarks via our estimator (Supplementary Note 7). Simulated benchmarks reproduce the saturation trends seen in experiment, building trust in our simulation environment and results (Fig. 4b). Furthermore, they project that Snake’s performance advantage should scale to a d = 23 logical qubit with N = 1057.

Runtime scalability

Despite the promising performance outlook, practically scaling to thousands of qubits will require Snake to be geometrically parallelized. Namely, even though optimization runtimes scale nearly linearly with processor size (~3.6 ± 0.1 s added per qubit at S = 2), optimization threads at N = 1057 take ~1.4 h. This exceeds our runtime budget of 0.5 h (Supplementary Note 5), which was chosen for compatibility with operating large surface codes.

By design, Snake stitching can split a processor into R disjoint regions, optimize them in parallel, and stitch the resulting configurations together38. Stitching makes optimization runtimes scale sub-linearly with processor size, which should in principle enable scalability towards N ~ 10⁴ with R = 128 within our runtime budget (Supplementary Note 5). In practice, however, stitching risks amplifying outliers at seams, where Snake must reconcile constraints between independently optimized configurations.
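
A crude runtime model makes the motivation for stitching explicit. The sketch below assumes per-region runtime is simply (N/R) qubits times the measured ~3.6 s/qubit slope; it ignores thread startup and seam-healing overheads, which is why projections to N ~ 10⁴ call for larger region counts (such as the R = 128 quoted above) in practice:

```python
SECONDS_PER_QUBIT = 3.6      # measured runtime slope at S = 2
BUDGET_S = 0.5 * 3600        # 0.5 h runtime budget

def regions_needed(N):
    """Smallest power-of-two R such that (N / R) * slope fits the budget."""
    R = 1
    while (N / R) * SECONDS_PER_QUBIT > BUDGET_S:
        R *= 2
    return R

for N in (68, 1057):
    print(f"N={N}: R={regions_needed(N)}")
```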

To investigate the viability of stitching, we stitch and heal our N = 68 processor with R = 2 (Fig. 4c) as well as an N = 1057 (d = 23) simulated processor with R = 4 (Fig. 4d). We chose convenient stitch geometries, but believe they will ultimately need to be optimized (Supplementary Note 8). Experimental data are limited, but outliers are not amplified at seams and stitched configurations perform as well as their unstitched counterparts (\({e}_{c}=6.{4}_{-1.8}^{+4.4}\times 1{0}^{-3}\) for N = 68 and \({e}_{c}=6.{3}_{-2.9}^{+4.3}\times 1{0}^{-3}\) for N = 1057). Finally, we note that stitching the d = 23 logical qubit with R = 4 is equivalent to stitching four d = 11 logical qubits into a 4-logical-qubit processor55, which illustrates how larger surface codes may be optimized.

Discussion

We introduced a control optimization strategy that combines generic frameworks for building, training, and navigating the optimization landscapes presented by a variety of quantum operations, algorithms, and architectures. It offers a significant performance advantage for quantum gates on our superconducting quantum processor with tens of qubits, approaching the surface code threshold for fault tolerance, and shows promise for scalability towards logical qubits with thousands of qubits. A recent demonstration of error suppression in a scaled-up surface code logical qubit6 enabled by this strategy underscores its potential.

Elements of our strategy have also been employed to optimize quantum operations including measurement56 and SWAP gates40, and quantum algorithms for optimization40, metrology41,42, simulation43,44,45,46,47, and beyond-classical computation39,48. The strategy should also find value in quantum hardware beyond superconducting circuits20, which faces control challenges with similar scaling complexities. Choreographing the trajectories of electrons in quantum dots57,58,59, of shuttled ions in ion traps60,61,62, or of neutral atoms in reconfigurable atom arrays63,64 is a promising application (Supplementary Note 3) of contemporary interest.

Looking towards commercially valuable quantum computations, significant challenges remain. The larger and more performant processors that are necessary to implement them will be susceptible to error mechanisms that are currently irrelevant or yet to be discovered. Furthermore, we expect that stabilizing performance over long computations that may span days65 will present significant hurdles. Towards that end, Snake’s model-based approach can leverage historical characterization data to forecast and optimize around failures before they happen66,67. Finally, even though we expect that model-based optimization will remain critical for injecting metrological discoveries into control optimization for the foreseeable future, Snake can also deploy model-free reinforcement learning agents68,69,70, which may reduce the burden of developing performance estimators (Supplementary Note 5). The techniques presented here should complement the numerous other control, hardware, and algorithm advancements necessary to realize commercial quantum applications.