Scalable randomised benchmarking of non-Clifford gates

Cross, Andrew W; Magesan, Easwar; Bishop, Lev S; Smolin, John A; Gambetta, Jay M

doi:10.1038/npjqi.2016.12

Published: 26 April 2016

npj Quantum Information volume 2, Article number: 16012 (2016)

Abstract

Randomised benchmarking is a widely used experimental technique to characterise the average error of quantum operations. Benchmarking procedures that scale to enable the characterisation of n-qubit circuits rely on efficient procedures for manipulating those circuits and, as such, have been limited to subgroups of the Clifford group. However, universal quantum computers require additional, non-Clifford gates to approximate arbitrary unitary transformations. We define a scalable randomised benchmarking procedure over n-qubit unitary matrices that correspond to protected non-Clifford gates for a class of stabiliser codes. We present efficient methods for representing and composing group elements, sampling them uniformly and synthesising corresponding poly(n)-sized circuits. The procedure provides experimental access to two independent parameters that together characterise the average gate fidelity of a group element.

Introduction

A key step to realising a large-scale universal quantum computer is demonstrating that decoherence and other realistic imperfections are small enough to be overcome by fault-tolerant quantum computing protocols.^1,2 Randomised benchmarking (RB)^3–6 has become a standard experimental technique for characterising the average error of quantum gates partly because of its insensitivity to state preparation and measurement errors. Benchmarking provides robust estimates of average gate fidelity^6,7 and it can characterise specific interleaved gate errors,^8,9 addressability errors¹⁰ and leakage errors.^11–13

RB techniques that efficiently scale to many qubits have been limited to subgroups of gates in the Clifford group, as computations with this group are tractable.⁶ However, the Clifford group is not enough for general quantum computations.¹⁴ Previous work generalises RB to groups that include non-Clifford gates,^15,16 but only on single qubits, a significant limitation. Methods for bounding the average fidelity of specific types of non-Clifford gates have also been considered.¹⁷

We present a scalable RB procedure that includes important non-Clifford circuits, such as circuits composed from $T = \sqrt[4]{Z}$ and controlled-NOT (CNOT) gates that naturally occur in fault-tolerant quantum computations. The n-qubit matrix groups we study are a generalisation of the standard dihedral group and coincide in some cases with protected gates in stabiliser codes, such as k-dimensional colour codes.¹⁸ Circuits built from these gates cannot be universal but do constitute significant portions of magic state distillation protocols,^19,20 repeat-until-success circuits²¹ and the vital quantum Fourier transform.²² We show that there are efficient methods for representing and composing group elements, sampling them uniformly and synthesising corresponding circuits whose size grows polynomially with the number of qubits n. The benchmarking procedure provides experimental access to two independent noise parameters through exponential decays of average sequence fidelities.

Results

The quantum circuits we consider are products of CNOT gates $Λ_{12} (X) | u, v 〉 : = | u, u \oplus v 〉$ , bit-flip gates $X | u 〉 : = | u \oplus 1 〉$ and single-qubit m-phase gates $Z_{m} | u 〉 : = ω_{m}^{u} | u 〉$ , where $ω_{m} = e^{i 2 π / m}$ . More concisely, the circuits of interest are given by the group

\begin{matrix} (1) & G_{m} : = 〈 Λ_{i j} (X), X (j), Z_{m} (j) 〉 / 〈 ω_{m} 〉 . \end{matrix}

We call this group a CNOT-dihedral group, as it is generated by CNOTs and a single-qubit dihedral group. Although we prove certain results for general m, we focus mainly on the case of m=2^k. This case affords efficient benchmarking and contains non-Clifford gates of interest, such as $T = \sqrt[4]{Z}$ , controlled- $\sqrt{Z}$ (defined as $Λ_{12} (\sqrt{Z}) | u, v 〉 : = i^{u v} | u, v 〉$ ) and controlled–controlled–Z (defined as $Λ_{123} (Z) | u, v, w 〉 : = {(- 1)}^{u v w} | u, v, w 〉$ ), which is locally equivalent to a Toffoli gate.

Our interest in the dihedral group was motivated by symmetries of stabiliser codes. However, another group that may have similar properties is $G_{p, m} : = 〈 Λ^{(p)} (X), X (j), Z_{m} (j) 〉 / 〈 ω_{m} 〉$ where Λ^(p)(X) is a p-controlled-NOT gate. Not all entangling gates are suitable for randomised benchmarking though. Our arguments imply that the group $〈 Λ_{i j} (Z), X (j), Z_{m} (j) 〉$ does not yield an efficient benchmarking procedure, as twirling over this group produces a map with exponentially many parameters.

The benchmarking procedure we present here both generalises ref. 16 and extends naturally to interleaving gates to estimate individual gate fidelities.^8,9 The procedure closely follows ref. 10, but we describe it in some detail for completeness. Choose a sequence of ℓ+1 unitary gates in which the first ℓ gates are uniformly random elements $g_{j_{1}}$, $g_{j_{2}}$, …, $g_{j_{ℓ}}$ of $G_{2^{k}}$ and the (ℓ+1)st gate is $g_{\mathbf{j}_{ℓ}}^{-1} := g_{j_{1}}^{†} \dots g_{j_{ℓ}}^{†}$, where $\mathbf{j}_{ℓ}$ denotes the ℓ-tuple $(j_{1}, \dots, j_{ℓ})$ labelling the sequence. We show later that elements of $G_{2^{k}}$ can be efficiently sampled and $g_{\mathbf{j}_{ℓ}}^{-1}$ can be efficiently computed. For each sequence, we prepare an input state ρ, apply $S_{\mathbf{j}_{ℓ}} := g_{\mathbf{j}_{ℓ}}^{-1} g_{j_{ℓ}} \dots g_{j_{1}}$ and measure an operator E.

Assuming each gate $g_{i}$ has an associated error $\mathcal{E}_{i} (ρ)$, the sequence $S_{\mathbf{j}_{ℓ}}$ is implemented as

\begin{matrix} (2) & {\tilde{S}}_{\mathbf{j}_{ℓ}} := \mathcal{E}_{\mathbf{j}_{ℓ}^{-1}} \circ g_{\mathbf{j}_{ℓ}}^{-1} \circ (\bigcirc_{i = 1}^{ℓ} [\mathcal{E}_{j_{i}} \circ g_{j_{i}}]) \end{matrix}

\begin{matrix} (3) & = \mathcal{E}_{\mathbf{j}_{ℓ}^{-1}} \circ (\bigcirc_{i = 1}^{ℓ} [{\tilde{g}}_{j_{i}}^{†} \circ \mathcal{E}_{j_{i}} \circ {\tilde{g}}_{j_{i}}]) \end{matrix}

where each ${\tilde{g}}_{j_{i}} \in G_{2^{k}}$. The overlap with E is $Tr [E {\tilde{S}}_{\mathbf{j}_{ℓ}} (ρ)]$. Averaging this overlap over K independent sequences of length ℓ gives an estimate of the average sequence fidelity $F_{seq} (ℓ, E, ρ) := Tr [E {\tilde{S}}_{ℓ} (ρ)]$, where ${\tilde{S}}_{ℓ} (ρ) := \frac{1}{K} \sum_{\mathbf{j}_{ℓ}} {\tilde{S}}_{\mathbf{j}_{ℓ}} (ρ)$ is the average quantum channel.

Defining $\mathcal{E}$ to be the average of the errors $\mathcal{E}_{i}$ and assuming for all i that $δ \mathcal{E}_{i} := \mathcal{E}_{i} - \mathcal{E}$ is small, the average quantum channel is

\begin{matrix} (4) & {\tilde{S}}_{ℓ} (ρ) = \mathcal{E} \circ {[{\bar{\mathcal{E}}}_{G_{2^{k}}}]}^{\circ ℓ} (ρ) + O (δ \mathcal{E}) \end{matrix}

where ${\bar{\mathcal{E}}}_{G_{2^{k}}}$ is the $G_{2^{k}}$-twirl of $\mathcal{E}$ (see Materials and methods). The leftmost error operator $\mathcal{E}$ is attributed to measurement error and perturbs the measurement operator E to a new operator E′. We decompose the input state and this final measurement operator in the Pauli basis to give $ρ = \sum_{P} x_{P} P / 2^{n}$ and $E' = \sum_{P} e_{P} P$. Neglecting the $O (δ \mathcal{E})$ term, the average sequence fidelity is

\begin{matrix} (5) & F_{seq} (ℓ, E, ρ) = Tr [E' {({\bar{\mathcal{E}}}_{G_{2^{k}}})}^{\circ ℓ} (ρ)] = A_{Z} α_{Z}^{ℓ} + A_{R} α_{R}^{ℓ} + e_{I} \end{matrix}

where $A_{Z} = \sum_{P \in \mathcal{Z} \setminus \{I\}} e_{P} x_{P}$ and $A_{R} = \sum_{P \in \mathcal{P} \setminus \mathcal{Z}} e_{P} x_{P}$.

To see this, it is convenient to express ${\bar{\mathcal{E}}}_{G_{2^{k}}}$ in a corresponding Liouville representation $R^{\bar{\mathcal{E}}}$ (see Materials and methods). In this representation, $R^{\bar{\mathcal{E}}}$ is diagonal with three distinct diagonal elements corresponding to sets of Pauli operators: the identity I has value 1, the Z-type Pauli operators $\mathcal{Z} \setminus \{I\}$ have value α_Z and the remaining Pauli operators $\mathcal{P} \setminus \mathcal{Z}$ have value α_R. The Pauli operator P then contributes $e_{P} x_{P} α^{ℓ}$ to $F_{seq} (ℓ, E, ρ)$, where α is one of 1, α_Z or α_R depending on P.

In a spirit similar to simultaneous RB,¹⁰ each of the two exponential decays $α_{Z}^{ℓ}$ and $α_{R}^{ℓ}$ can be observed by choosing appropriate input states. For example, if we choose the input state $| 0 \dots 0 〉$, then $F_{seq} = e_{I} + A_{0} α_{Z}^{ℓ}$ where $A_{0} = \sum_{P \in \mathcal{Z} \setminus \{I\}} e_{P}$. On the other hand, if we choose $| + \dots + 〉 := 2^{- n / 2} \sum_{b \in \{0, 1\}^{n}} | b 〉$, then $F_{seq} = e_{I} + A_{+} α_{R}^{ℓ}$ where $A_{+} = \sum_{P \in \mathcal{X} \setminus \{I\}} e_{P}$. State preparation errors may lead to deviation from a single exponential decay, but this is detectable. The channel parameters α_Z and α_R can be extracted by fitting the average sequence fidelity. The corresponding depolarising channel parameter is a weighted average $α = (α_{Z} + 2^{n} α_{R}) / (2^{n} + 1)$, and the average gate error is given by $r = (2^{n} - 1) (1 - α) / 2^{n}$ (see ref. 6).
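As a worked illustration of this last step, the conversion from the two fitted decay parameters to an average gate error can be written as a small Python helper. This is only a sketch: the function names are chosen here for illustration, and the inputs `alpha_z` and `alpha_r` are assumed to come from separate fits of the two decays.

```python
def depolarising_parameter(alpha_z, alpha_r, n):
    """Weighted average alpha = (alpha_Z + 2^n alpha_R) / (2^n + 1)."""
    d = 2 ** n
    return (alpha_z + d * alpha_r) / (d + 1)


def average_gate_error(alpha_z, alpha_r, n):
    """Average gate error r = (2^n - 1)(1 - alpha) / 2^n."""
    d = 2 ** n
    alpha = depolarising_parameter(alpha_z, alpha_r, n)
    return (d - 1) * (1 - alpha) / d


# Hypothetical fitted values for a two-qubit experiment.
r = average_gate_error(alpha_z=0.98, alpha_r=0.97, n=2)
print(f"average gate error r = {r:.4f}")
```

For these hypothetical inputs the weighted average is α = (0.98 + 4 × 0.97)/5 = 0.972, which gives r = 3 × 0.028/4 = 0.021.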

The Materials and Methods section is devoted to proving the technical results that enable the benchmarking procedure such as a canonical decomposition of G_m, efficient computation within $G_{2^{k}}$ and twirling over $G_{2^{k}}$ to obtain the averaged quantum channel.

Discussion

Our results enable scalable benchmarking of a natural family of non-Clifford circuits related to quantum error-correcting codes. In principle, our procedure allows efficient benchmarking of isolated non-Clifford gates, as well as large sub-circuits for state distillation^19,20 or repeat-until-success protocols.²¹ These sub-circuits can be characterised with our procedure using physical gates or logical gates on protected qubits. Together with standard Clifford benchmarking, our procedure enables characterisation of the full range of gates used in the leading fault-tolerant quantum computing protocols. As multi-qubit benchmarking is well within experimental reach,^9,23,24 we expect an optimised implementation of our procedure to be quite practical.

Several natural questions arise from this work. First, one might address the asymptotically optimal cost of circuit synthesis for elements of the CNOT-dihedral groups, as well as the practical question of finding optimal circuit decompositions for elements of the smallest groups. We expect optimal circuits are computationally hard to find as n grows, but experimentally it is important to minimise the number of gates. Second, unlike the Clifford group, the CNOT-dihedral group is not a 2-design.⁵ It would be interesting to find a group (or set) that contains a non-Clifford gate, is a 2-design and allows benchmarking to be done efficiently. Third, our results show that we can efficiently perform RB. However, we have not addressed the precise sense in which quantum computations over the CNOT-dihedral group can be efficiently simulated. This may be a subtle problem.^25,26 Last, there are generalised stabiliser formalisms, such as that of ref. 27, and it is natural to ask whether one of these describes how this group acts on some set of states.

Materials and methods

This section is devoted to proving the various results used in the benchmarking procedure: canonical decomposition of G_m, efficient computation in G_m and twirling over G_m, each of which is interesting in its own right. Let m be general and let us briefly set some notation. The matrix representation of G_m is set by identifying g∈G_m with the matrix that maps $| 0^{n} 〉 := | 00 \dots 0 〉$ to $| b 〉 := | b_{1} b_{2} \dots b_{n} 〉$ with unit phase. We define the phase-flip gates $Z | u 〉 := {(- 1)}^{u} | u 〉$ and controlled-Z gates $Λ_{12} (Z) | u, v 〉 := {(- 1)}^{u v} | u, v 〉$. The support of a bit string v∈{0, 1}ⁿ is $supp (v) = \{j \mid v_{j} = 1\} \subseteq [n] := \{1, 2, \dots, n\}$. We refer to v and its support interchangeably, treating v as a set and vice versa. Let U be a single-qubit gate and U(v) denote the gate acting as U only on qubits in the support of v. Given J⊆[n] or elements i, j, … ∈[n], we also use the shorthand U(J) and U(i, j, …). $\mathcal{P} := 〈 X (j), Z (j) 〉 / 〈 i 〉$ denotes the n-qubit Pauli group and we define $\mathcal{X} := 〈 X (j) \mid j \in [n] 〉$, $\mathcal{Z} := 〈 Z (j) \mid j \in [n] 〉$, $c \mathcal{X} := 〈 Λ_{i j} (X) \mid i, j \in [n], i \neq j 〉$ and $c \mathcal{Z} := 〈 Λ_{i j} (Z) \mid i, j \in [n], i < j 〉$.

Canonical form of G_m

Our first goal will be to put G_m in a canonical form (the main result is contained in Theorem 1). The rewriting identities shown in Figure 1 allow us to commute diagonal elements of G_m through Λ_ij(X) and X(j) gates. The rules for bit-flip gates are a special case of the CNOT rules. The following Lemma follows directly from definitions and formalises the role of the rewriting identities in understanding the group’s structure.

Lemma 1: Let W_m denote the subgroup of diagonal matrices of G_m and let $Π = 〈 Λ_{i j} (X), X (j) 〉$ denote the subgroup of permutation matrices. Then, G_m is isomorphic to a semi-direct product of groups G_m≃W_m⋊Π.

The proof of Lemma 1 is given in the Supplementary Material. Note that by definition $Π = \mathcal{X} ⋊ c \mathcal{X}$. As $\mathcal{X} ≃ \mathbb{F}_{2}^{n}$ and $c \mathcal{X} ≃ {GL}_{n} (\mathbb{F}_{2})$, each element π∈Π can be associated with an n-bit string $c \in \mathbb{F}_{2}^{n}$ and an n by n invertible 0–1 linear transformation $B \in {GL}_{n} (\mathbb{F}_{2})$ such that $π | b 〉 = | B b \oplus c 〉$. Here $\mathbb{F}_{2}$ denotes the field with two elements. Furthermore, $| Π | = 2^{n} \prod_{ℓ = 0}^{n - 1} (2^{n} - 2^{ℓ})$.
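The counting formula for |Π| can be checked by brute force for small n. The following Python sketch is an illustration only (the function names are assumptions made here): it enumerates every n by n 0–1 matrix, keeps those that act bijectively on n-bit strings, multiplies by the 2ⁿ choices of c and compares the result with the closed form.

```python
from itertools import product


def order_pi(n):
    """Closed form |Pi| = 2^n * prod_{l=0}^{n-1} (2^n - 2^l)."""
    size = 2 ** n
    for l in range(n):
        size *= 2 ** n - 2 ** l
    return size


def count_affine_permutations(n):
    """Count pairs (B, c) with B invertible over F_2 by exhaustive search."""
    strings = list(product((0, 1), repeat=n))
    invertible = 0
    for entries in product((0, 1), repeat=n * n):
        B = [entries[i * n:(i + 1) * n] for i in range(n)]
        images = {
            tuple(sum(B[i][j] * b[j] for j in range(n)) % 2 for i in range(n))
            for b in strings
        }
        if len(images) == len(strings):  # b -> Bb is a bijection
            invertible += 1
    return invertible * 2 ** n  # each B pairs with any of the 2^n shifts c


for n in (1, 2, 3):
    print(n, order_pi(n), count_affine_permutations(n))
```

The exhaustive count grows as 2^(n²) and is only meant as a sanity check for very small n.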

It remains to better understand W_m (see Lemma 3 for the main result). Let D_m denote the group of 2ⁿ by 2ⁿ diagonal unitary matrices D with elements $〈 b | D | b 〉 = ω_{m}^{f (b)}$. Here $f : \mathbb{F}_{2}^{n} \to \mathbb{Z}_{m}$ is a function that assigns mth roots of unity to the diagonal and $\mathbb{Z}_{m}$ is the ring of integers modulo m. Since G_m is generated by permutation matrices and products of m-phase gates, W_m⊆D_m.

Let $ℛ \subset \mathbb{Z}_{m} [x_{1}, \dots, x_{n}]$ denote the polynomial ring whose elements are $p (x) := p (x_{1}, \dots, x_{n}) = \sum_{α \in \{0, 1\}^{n}} p_{α} x^{α}$ where α=α₁ … α_n is a multi-index, $p_{α} \in \mathbb{Z}_{m}$ and $x^{α} = x_{1}^{α_{1}} \dots x_{n}^{α_{n}}$ is a monomial. The multi-index takes values in {0, 1}ⁿ as a convenient notation, as we will evaluate p(x) on binary strings, so $x_{j}^{2} = x_{j}$. The degree of a monomial is denoted $| α |$. We mainly consider ℛ as an additive group. The next Lemma follows from the definition of group isomorphism and the fact that each function f(b) can be expressed as a polynomial in ℛ.

Lemma 2: Let p(b) denote evaluation of p on the n-bit binary string b=b₁ … b_n with operations in $\mathbb{Z}_{m}$. The function $Φ : ℛ \to D_{m}$ given by $〈 b | Φ (p) | b 〉 = ω_{m}^{p (b)}$ is a group isomorphism.

The proof of Lemma 2 is given in the Supplementary Material. The rewriting identities give the action of Π on W_m by conjugation. Let ${\bar{W}}_{m} : = 〈 Z_{m} (j) 〉$ . On the basis of a similar application of the rewriting identities as in Lemma 1, $W_{m} = 〈 π {\bar{W}}_{m} π^{†} | π \in Π 〉$ . As $W_{m} \subseteq D_{m} ≃ ℛ$ , Φ⁻¹ associates a polynomial in ℛ to each element of W_m. By our chosen convention, matrices representing elements w∈W_m are given modulo a global phase factor $〈 ω_{m} 〉$ such that $w | 0^{n} 〉 = | 0^{n} 〉$ . Therefore, the preimages Φ⁻¹(w) have zero constant term—i.e., p_α=0 when $| α | = 0$ . Through Φ, the rewriting identities define an action of Π on ℛ that, respectively, takes x₁x₂ … x_px_j to

\begin{matrix} (6) & - 2 x_{1} x_{2} \dots x_{p} x_{i} x_{j} + x_{1} x_{2} \dots x_{p} x_{i} + x_{1} x_{2} \dots x_{p} x_{j} \end{matrix}

and x₁x₂…x_px_ix_j to

\begin{matrix} (7) & - x_{1} x_{2} \dots x_{p} x_{i} x_{j} + x_{1} x_{2} \dots x_{p} x_{i} . \end{matrix}

Equation (6) increments the degree of a monomial and multiplies its coefficient by −2, whereas Equation (7) does not change the degree. Another way to understand iterated applications of Equation (6) is to observe that

\begin{matrix} (8) & x_{1} \oplus x_{2} \oplus \dots \oplus x_{N} = \sum_{α \in \{0, 1\}^{N}, | α | \neq 0} {(- 2)}^{| α | - 1} x^{α} . \end{matrix}

This identity shows that single-qubit Z_m gates acting on mod 2 linear combinations of input bits are equivalent to products of certain controlled-phase gates.
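Equation (8) is an identity over the integers and can be confirmed numerically. The short Python check below is a sketch for illustration (the function name is an assumption made here); it evaluates the right-hand side on every bit string of length up to 5 and compares it with the parity of the string.

```python
from itertools import combinations, product


def xor_as_polynomial(bits):
    """Right-hand side of Eq. (8), evaluated over the integers."""
    total = 0
    for t in range(1, len(bits) + 1):
        for alpha in combinations(range(len(bits)), t):
            if all(bits[j] for j in alpha):  # monomial x^alpha equals 1
                total += (-2) ** (t - 1)
    return total


for N in range(1, 6):
    for bits in product((0, 1), repeat=N):
        assert xor_as_polynomial(bits) == sum(bits) % 2
print("Eq. (8) holds for all bit strings with N <= 5")
```

The check reflects the binomial identity Σ_t C(w, t)(−2)^(t−1) = (1 − (−1)^w)/2 for a string with w ones.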

There is an element of W_m corresponding to each monomial term of non-zero degree, and the coefficient of this term has the form $p_{α} \in {(- 2)}^{| α | - 1} \mathbb{Z}_{m}$, as we will now see (see Supplementary Materials for further details). We choose a subset of qubits J, fix any j∈J and define a permutation gate and corresponding polynomial

\begin{matrix} (9) & π_{J, j} : = \prod_{\begin{matrix} k \in J \\ k \neq j \end{matrix}} Λ_{k j} (X); p_{J} (x) : = \sum_{\begin{matrix} α \subseteq J \\ | α | \neq 0 \end{matrix}} {(- 2)}^{| α | - 1} x^{α} . \end{matrix}

By Equation (8), $Φ (p_{J}) = π_{J, j} Z_{m} (j) π_{J, j}^{†} \in W_{m}$; i.e., this circuit has a polynomial with one term of degree $| J |$. As $Φ (x_{j}) = Z_{m} (j)$, scaled monomials of successive degrees |α| and with coefficients in ${(- 2)}^{| α | - 1} \mathbb{Z}_{m}$ can be generated inductively by composing these circuits. Take all linear combinations of these over $\mathbb{Z}_{m}$ to find

Lemma 3: W_m is isomorphic to the subgroup $W < ℛ$ given by

\begin{matrix} (10) & \{ p \in ℛ \mid p_{\emptyset} = 0 \text{ and } \forall α \neq \emptyset, p_{α} \in {(- 2)}^{| α | - 1} \mathbb{Z}_{m} \} . \end{matrix}

We can now directly compute $| G_{m} |$ .

Corollary 1

\begin{matrix} (11) & | G_{m} | = 2^{n} \prod_{ℓ = 0}^{n - 1} (2^{n} - 2^{ℓ}) \prod_{t = 1}^{n} {(\frac{\mathrm{LCM} (2^{t - 1}, m)}{2^{t - 1}})}^{\binom{n}{t}} . \end{matrix}

Proof: Let o_m(a)=LCM(a, m)/a denote the order of a in $\mathbb{Z}_{m}$. Observe that ${(- 2)}^{| α | - 1} \mathbb{Z}_{m} ≃ \mathbb{Z}_{o_{m} (2^{| α | - 1})}$ as additive groups. Therefore, W_m is isomorphic to a direct product of additive cyclic groups $A_{m} := \prod_{t = 1}^{n} \mathbb{Z}_{o_{m} (2^{t - 1})}^{\binom{n}{t}}$. This shows that $| G_{m} | = | A_{m} | | Π |$.
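Corollary 1 is straightforward to evaluate for small cases. The Python sketch below is illustrative only (function names are assumptions made here); it computes |Π|, |W_m| and |G_m| directly from Equation (11).

```python
from math import comb, gcd


def order_pi(n):
    """|Pi| = 2^n * prod_{l=0}^{n-1} (2^n - 2^l)."""
    size = 2 ** n
    for l in range(n):
        size *= 2 ** n - 2 ** l
    return size


def order_w(n, m):
    """|W_m| = prod_{t=1}^{n} o_m(2^(t-1))^C(n,t), with o_m(a) = LCM(a, m) / a."""
    size = 1
    for t in range(1, n + 1):
        a = 2 ** (t - 1)
        lcm = a * m // gcd(a, m)
        size *= (lcm // a) ** comb(n, t)
    return size


def order_g(n, m):
    """Order of the CNOT-dihedral group G_m on n qubits, Eq. (11)."""
    return order_pi(n) * order_w(n, m)


print(order_g(1, 8))  # 16: the single-qubit group generated by X and T
print(order_g(2, 8))  # 6144 = 24 * 256
```

For n = 2 and m = 8 the factors are |Π| = 24 and |W_8| = 8² × 4 = 256, in agreement with the count of 3 × 2 + 2 × 1 = 8 bits obtained from the degree-by-degree formula for m = 2^k.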

Putting everything together, we have

Theorem 1: Any element of G_m can be written in canonical form as the composition of a sequence of phase gates (comprising an element of W_m whose form is given in Lemma 3), a sequence of CNOT gates and a sequence of bit-flip gates.

Efficient computation in $G_{2^{k}}$

Our next goal is to present efficient methods for computing with G_m. Suppose we fix the value of m so that it is not a function of n. Any labelling of group elements will have length proportional to $s = \log_{2} | G_{m} |$. If m is odd, then $\log_{2} | G_{m} | = (2^{n} - 1) \log_{2} m + \log_{2} | Π |$, whereas if m=2^k then $\log_{2} | G_{2^{k}} | = \sum_{t = 1}^{k} (k - t + 1) \binom{n}{t} + \log_{2} | Π |$. Therefore, s=Ω(2ⁿ) whenever m is odd (see Supplementary Materials for further details), and in general we cannot efficiently represent elements of G_m as the number of qubits grows. However, s=O(n^k) for the special case m=2^k, and the story is different. We focus on this special case for the remainder of this article.

An element g∈G_m can be written as a product g=uvw where w∈W_m is a diagonal matrix, $v \in c \mathcal{X}$ is a CNOT circuit and $u \in \mathcal{X}$ is a tensor product of bit-flips. This transforms n-qubit quantum states as $g | b 〉 = ω_{m}^{p (b)} | B b \oplus c 〉$ where $p \in W$, $B \in {GL}_{n} (\mathbb{F}_{2})$ and $c \in \mathbb{F}_{2}^{n}$. Group elements are in bijective correspondence with the triples (p, B, c). The polynomial p has maximum degree k and at most $\sum_{t = 0}^{k} \binom{n}{t} = O (n^{k})$ nonzero coefficients, each contained in $\mathbb{Z}_{2^{k}}$.

The product of group elements g₁, g₂∈G_m,

\begin{matrix} (12) & g_{2} g_{1} | b 〉 = ω_{m}^{p_{1} (b) + p_{2} (B_{1} b \oplus c_{1})} | B_{2} B_{1} b \oplus B_{2} c_{1} \oplus c_{2} 〉, \end{matrix}

is given by the triple

\begin{matrix} (13) & (p_{1} (x) + p_{2} (B_{1} x \oplus c_{1}), B_{2} B_{1}, B_{2} c_{1} \oplus c_{2}) . \end{matrix}

The products B₂B₁ and B₂c₁⊕c₂ can be computed in O(n³) time, and polynomials in $W$ can be added in O(n^k) ring operations. We need to show that p₂(B₁x⊕c₁) can also be computed efficiently.

Consider a triple (p, B, c) and let B_j denote the jth row of B and J_j=supp(B_j). Define x′=Bx⊕c. Then, for any j∈[n], using Equations (8) and (9),

\begin{matrix} (14) & x_{j}^{'} (x) = (\underset{ℓ \in J_{j}}{\oplus} x_{ℓ}) \oplus c_{j} = \begin{cases} p_{J_{j}} (x) & \text{if } c_{j} = 0 \\ 1 - p_{J_{j}} (x) & \text{if } c_{j} = 1 \end{cases} \end{matrix}

has maximum degree k. When we substitute $x' = x_{1}^{'} \dots x_{n}^{'}$ into the degree k polynomial p(x), computations occur with coefficients in $\mathbb{Z}_{2^{k}}$. We compute each monomial $(x')^{α}$ with O(k) multivariate polynomial multiplications, each of which can be done term-by-term in $O (n^{2 k + 1})$ ring operations. We compute the term ${(- 2)}^{| α | - 1} p_{α} (x')^{α}$ with an additional $O (n^{k})$ ring operations to multiply each term of $(x')^{α}$ by ${(- 2)}^{| α | - 1} p_{α}$ and accumulate the result. There are $O (n^{k})$ terms in p(x), so the total number of ring operations to compute p(x′) is $O (n^{3 k + 1})$. If c ≠ 0ⁿ, then it is possible that p(x′) has a non-zero constant term. With additional $O (n^{k})$ ring operations, p(x′) can be mapped to an equivalent polynomial in $W$.
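The composition rule of Equations (12)–(14) can be sketched in Python. This is a minimal illustration rather than the authors' implementation: the dictionary representation of polynomials (monomial support mapped to a coefficient modulo m), the 0-based qubit labels and the function names `xor_poly`, `poly_mul`, `substitute`, `compose` and `act` are all assumptions made here, and no attempt is made to match the operation counts quoted above.

```python
from itertools import combinations, product


def xor_poly(support, c, m):
    """Eq. (14): polynomial mod m equal to (XOR of x_l, l in support) XOR c."""
    poly = {}
    for t in range(1, len(support) + 1):
        coeff = (-2) ** (t - 1) % m
        if coeff == 0:  # all higher-degree coefficients vanish as well
            break
        for alpha in combinations(sorted(support), t):
            poly[frozenset(alpha)] = coeff
    if c:  # the case 1 - p_J(x)
        poly = {a: (-v) % m for a, v in poly.items()}
        poly[frozenset()] = 1 % m
    return poly


def poly_mul(p, q, m):
    """Multiply multilinear polynomials; x_j^2 = x_j, so supports merge by union."""
    out = {}
    for a, u in p.items():
        for b, v in q.items():
            out[a | b] = (out.get(a | b, 0) + u * v) % m
    return {a: v for a, v in out.items() if v}


def substitute(p, B, c, m):
    """Return p(Bx XOR c) as a polynomial in x."""
    n = len(B)
    xprime = [xor_poly([l for l in range(n) if B[j][l]], c[j], m) for j in range(n)]
    out = {}
    for alpha, coeff in p.items():
        term = {frozenset(): coeff % m}
        for j in alpha:
            term = poly_mul(term, xprime[j], m)
        for a, v in term.items():
            out[a] = (out.get(a, 0) + v) % m
    return {a: v for a, v in out.items() if v}


def compose(g2, g1, m):
    """Triple for g2 g1 (g1 acts first), Eq. (13), with the constant term dropped."""
    (p1, B1, c1), (p2, B2, c2) = g1, g2
    n = len(B1)
    p = dict(p1)
    for a, v in substitute(p2, B1, c1, m).items():
        p[a] = (p.get(a, 0) + v) % m
    p = {a: v for a, v in p.items() if v and a}  # drop zeros and the global phase
    B = [[sum(B2[i][l] * B1[l][j] for l in range(n)) % 2 for j in range(n)]
         for i in range(n)]
    c = [(sum(B2[i][l] * c1[l] for l in range(n)) + c2[i]) % 2 for i in range(n)]
    return p, B, c


def act(g, b, m):
    """Return (phase exponent, output bit string) of g|b>."""
    p, B, c = g
    n = len(b)
    phase = sum(v for a, v in p.items() if all(b[j] for j in a)) % m
    out = tuple((sum(B[i][l] * b[l] for l in range(n)) + c[i]) % 2 for i in range(n))
    return phase, out


m = 8
g1 = ({frozenset([0]): 1, frozenset([0, 1]): 2},
      [[1, 0, 0], [1, 1, 0], [0, 0, 1]], [0, 0, 1])
g2 = ({frozenset([1]): 3, frozenset([0, 1, 2]): 4},
      [[1, 0, 1], [0, 1, 0], [0, 0, 1]], [1, 0, 0])
g21 = compose(g2, g1, m)
offsets = set()
for b in product((0, 1), repeat=3):
    ph1, b1 = act(g1, b, m)
    ph2, b2 = act(g2, b1, m)
    ph, out = act(g21, b, m)
    assert out == b2
    offsets.add((ph1 + ph2 - ph) % m)
assert len(offsets) == 1  # the two agree up to one global phase
```

The final loop compares the composed triple against sequential application on every basis state; the only permitted discrepancy is a single b-independent phase, which corresponds to the dropped constant term.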

Uniformly sampling from $G_{2^{k}}$ is equivalent to uniformly and independently sampling from $W$, ${GL}_{n} (\mathbb{F}_{2})$ and $\mathbb{F}_{2}^{n}$. This can be done efficiently, as elements of $W$ have maximum degree k; see also ref. 28 and the Supplementary Materials for further details.
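A simple way to realise this sampling is sketched below in Python. Note the assumptions: the invertible matrix B is drawn by rejection sampling (draw a random 0–1 matrix, keep it if it has full rank), which is a plain substitute for the more refined method of ref. 28; polynomials are stored as dictionaries from monomial supports to coefficients; and the function names are chosen here for illustration.

```python
import random
from itertools import combinations


def rank_gf2(rows):
    """Rank over F_2 of a 0-1 matrix given as a list of rows."""
    rows = [int("".join(map(str, r)), 2) for r in rows]
    rank = 0
    while rows:
        pivot = rows.pop()
        if pivot:
            rank += 1
            low = pivot & -pivot  # lowest set bit serves as the pivot column
            rows = [r ^ pivot if r & low else r for r in rows]
    return rank


def sample_element(n, k, rng):
    """Draw a triple (p, B, c) uniformly for G_{2^k} on n qubits."""
    m = 2 ** k
    while True:  # rejection sampling of an invertible B
        B = [[rng.randrange(2) for _ in range(n)] for _ in range(n)]
        if rank_gf2(B) == n:
            break
    c = [rng.randrange(2) for _ in range(n)]
    p = {}
    for t in range(1, min(n, k) + 1):
        scale = (-2) ** (t - 1)
        order = 2 ** (k - t + 1)  # size of the subgroup (-2)^(t-1) Z_{2^k}
        for alpha in combinations(range(n), t):
            coeff = scale * rng.randrange(order) % m
            if coeff:
                p[frozenset(alpha)] = coeff
    return p, B, c


rng = random.Random(2016)
p, B, c = sample_element(n=3, k=3, rng=rng)
print(len(p), B, c)
```

Each degree-t coefficient is drawn uniformly from the subgroup of Lemma 3 by multiplying a uniform integer below 2^(k−t+1) by (−2)^(t−1). A uniformly random 0–1 matrix is invertible with probability above 0.28 for every n, so the expected number of rejection rounds stays bounded.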

Given a triple (p, B, c), we synthesise a corresponding circuit from products of CNOT gates, bit-flip gates and single-qubit m-phase gates. Our goal is to efficiently synthesise a circuit whose size (number of gates) is polynomial in n but not to optimise this circuit. We independently synthesise circuits coinciding with p, B and c. As c corresponds to X(c), and a CNOT circuit for B can be found by Gaussian elimination,¹⁴ the new part of the algorithm synthesises a circuit for p.
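The Gaussian-elimination step for B can be sketched as follows. This Python fragment is an unoptimised illustration under stated assumptions: a gate is recorded as a hypothetical `(control, target)` pair with 0-based qubit labels, the gate list is read in the order the gates are applied, and the function names are chosen here.

```python
from itertools import product


def cnot_circuit(B):
    """CNOT list implementing |b> -> |Bb> for an invertible 0-1 matrix B."""
    n = len(B)
    M = [list(row) for row in B]
    ops = []

    def add_row(src, dst):
        # Row operation "dst += src" is a CNOT with control src and target dst.
        M[dst] = [a ^ b for a, b in zip(M[dst], M[src])]
        ops.append((src, dst))

    for col in range(n):
        if not M[col][col]:
            # Raises StopIteration if B is singular.
            pivot = next(r for r in range(col + 1, n) if M[r][col])
            add_row(pivot, col)
        for r in range(n):
            if r != col and M[r][col]:
                add_row(col, r)
    # The row operations reduce B to the identity, so B is their product in
    # the original order; as a circuit they must be applied in reverse.
    return ops[::-1]


def apply_circuit(gates, b):
    """Apply a list of (control, target) CNOTs to a bit string."""
    b = list(b)
    for control, target in gates:
        b[target] ^= b[control]
    return b


B = [[1, 1, 0], [0, 1, 1], [1, 1, 1]]
gates = cnot_circuit(B)
for b in product((0, 1), repeat=3):
    expected = [sum(B[i][j] * b[j] for j in range(3)) % 2 for i in range(3)]
    assert apply_circuit(gates, b) == expected
print(gates)
```

This elimination uses at most O(n²) CNOT gates, which is sufficient for a polynomial-size circuit but is not claimed to be optimal.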

We describe the circuit synthesis for p informally. The algorithm proceeds in k rounds. Begin by initialising a working polynomial q(x)←p(x), set a round counter t←k and set a quantum circuit U←I. Here ‘←’ denotes assignment. In round t, we synthesise a circuit corresponding to a polynomial p^(t)(x) that coincides with q(x) on its degree t terms. For each of the $O (n^{t})$ degree-t terms ${(- 2)}^{| α | - 1} p_{α} x^{α}$ of q(x), we apply the constant-sized circuit $g_{α} := π_{J, j} {(Z_{2^{k}} (j))}^{p_{α}} π_{J, j}^{†}$ setting U←g_αU, where J=supp(α) as in the proof of Lemma 3. Because products of diagonal gates correspond to sums of polynomials, the product of the g_α corresponds to $p^{(t)} (x) := \sum_{α \subseteq [n], | α | = t} p_{α} p_{J} (x)$. Therefore, we update q(x)←q(x)−p^(t)(x), which now has maximum degree t−1, decrement the round counter and proceed to the next round. The algorithm terminates when q(x)=0 and t=0. The total algorithm run-time and circuit size of the output U is $O (n^{k})$.
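The round structure of this synthesis can be sketched at the level of polynomials. In the Python fragment below, which is an illustration under assumptions and not the authors' code, a gate is recorded as a hypothetical pair `(J, power)` standing for π_{J,j} (Z_{2^k}(j))^power π_{J,j}† with any j in J, polynomials are dictionaries from monomial supports to coefficients modulo 2^k, and the function names are chosen here.

```python
from itertools import combinations


def p_subset(J, m):
    """Eq. (9): polynomial p_J of the circuit pi_{J,j} Z_m(j) pi_{J,j}^dagger."""
    poly = {}
    for t in range(1, len(J) + 1):
        coeff = (-2) ** (t - 1) % m
        if coeff:
            for alpha in combinations(sorted(J), t):
                poly[frozenset(alpha)] = coeff
    return poly


def synthesise_phase(p, k):
    """Return a gate list [(J, power)] whose polynomials sum to p modulo 2^k."""
    m = 2 ** k
    q = {a: v % m for a, v in p.items() if v % m}
    gates = []
    for t in range(k, 0, -1):  # rounds t = k, k-1, ..., 1
        for alpha in [a for a in q if len(a) == t]:
            unit = 2 ** (t - 1)
            coeff = q[alpha]
            assert coeff % unit == 0, "p is not of the form given in Lemma 3"
            # Solve (-2)^(t-1) * power = coeff (mod 2^k) for power.
            power = (-1) ** (t - 1) * (coeff // unit) % (m // unit)
            gates.append((alpha, power))
            for a, v in p_subset(alpha, m).items():
                new = (q.get(a, 0) - power * v) % m
                if new:
                    q[a] = new
                else:
                    q.pop(a, None)
    assert not q, "terms of degree above k must vanish modulo 2^k"
    return gates


def polynomial_of(gates, m):
    """Sum of power * p_J over the gate list, reduced modulo m."""
    total = {}
    for J, power in gates:
        for a, v in p_subset(J, m).items():
            total[a] = (total.get(a, 0) + power * v) % m
    return {a: v for a, v in total.items() if v}


p = {frozenset([0]): 1, frozenset([2]): 5,
     frozenset([0, 1]): 2, frozenset([0, 1, 2]): 4}
gates = synthesise_phase(p, k=3)
assert polynomial_of(gates, 8) == p
print(gates)
```

Subtracting power × p_J in round t removes the chosen degree-t term and only modifies terms of lower degree supported inside J, so each round leaves a working polynomial of degree at most t−1.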

Twirling over $G_{2^{k}}$

A quantum channel is a completely positive trace-preserving map whose operator sum decomposition is $\mathcal{E} (ρ) = \sum_{k} A_{k} ρ A_{k}^{†}$ where $\sum_{k} A_{k}^{†} A_{k} = I$. The twirl of $\mathcal{E}$ over a finite group G (G-twirl) is given by

\begin{matrix} (15) & {\bar{\mathcal{E}}}_{G} (ρ) := \frac{1}{| G |} \sum_{U \in G} U^{†} \mathcal{E} (U ρ U^{†}) U . \end{matrix}

In what follows, we use several facts about group twirls. If G=AB is a direct product of groups, then ${\bar{\mathcal{E}}}_{G} (ρ) = {\bar{({\bar{\mathcal{E}}}_{A})}}_{B} (ρ)$, and if A is a normal subgroup of G (denoted $A ⊲ G$), then ${\bar{\mathcal{E}}}_{G} (ρ) = {\bar{({\bar{\mathcal{E}}}_{A})}}_{G / A} (ρ)$, where the twirl over the factor group G/A is over a set of coset representatives. Twirling any map over the Pauli group produces a Pauli channel.⁵ Consider a Pauli channel $\mathcal{E} (ρ) = \sum_{Q \in \mathcal{P}} η_{Q} Q ρ Q$. Twirl this channel over any finite group G that has a permutation action on the set $\mathcal{P}$. The orbit of $P \in \mathcal{P}$ is $O_{P} := \{V^{†} P V \mid V \in G\}$ and the stabiliser is $S_{P} := \{V \in G \mid V^{†} P V = P\}$. The orbits define an equivalence relation P~Q if and only if O_P=O_Q. This relation partitions $\mathcal{P}$ into a disjoint union of orbits. By the orbit-stabiliser theorem and Lagrange’s theorem,²⁹ $| O_{P} | = | G / S_{P} | = | G | / | S_{P} |$. Therefore, the twirl, Equation (15), can be written

\begin{matrix} (16) & {\bar{\mathcal{E}}}_{G} (ρ) = \sum_{C \in \mathcal{C}} \sum_{P \in O_{C}} (\frac{\sum_{Q \in O_{C}} η_{Q}}{| O_{C} |}) P ρ P, \end{matrix}

where $\mathcal{C}$ is a set of representative elements, one from each orbit.

These facts allow us to compute the twirl over $G_{2^{k}}$ when k>1 by expressing it as a sequence of twirls. We begin by decomposing the group. Let ${\tilde{W}}_{2^{k}} := W_{2^{k}} / ({\bar{W}}_{2^{k}} \setminus \{I\})$ and recall that ${\bar{W}}_{2^{k}} := 〈 Z_{2^{k}} (j) 〉$; then $W_{2^{k}} = {\tilde{W}}_{2^{k}} {\bar{W}}_{2^{k}}$. As $c \mathcal{Z} ⊲ {\tilde{W}}_{2^{k}}$ and $\mathcal{Z} ⊲ {\bar{W}}_{2^{k}}$, we form the corresponding factor groups. Therefore, an element $w \in W_{2^{k}}$ can be written as $w = \tilde{w} \bar{w} = {\tilde{w}}_{1} {\tilde{w}}_{2} {\bar{w}}_{1} {\bar{w}}_{2}$ where ${\tilde{w}}_{1}$ labels cosets ${\tilde{w}}_{1} c \mathcal{Z}$, ${\tilde{w}}_{2} \in c \mathcal{Z}$, ${\bar{w}}_{1}$ labels cosets ${\bar{w}}_{1} \mathcal{Z}$ and ${\bar{w}}_{2} \in \mathcal{Z}$. Finally, by Lemma 1, any element $g \in G_{2^{k}}$ factors as g=uvw where $u \in \mathcal{X}$, $v \in c \mathcal{X}$ and $w \in W_{2^{k}}$. Therefore, we have $g = u {\bar{w}}_{2}^{'} v {\tilde{w}}_{2}^{'} {\bar{w}}_{1} {\tilde{w}}_{1}$ where ${\bar{w}}_{2}^{'} = v {\bar{w}}_{2} v^{†} \in \mathcal{Z}$.

Our strategy is to use the decomposition to express the $G_{2^{k}}$-twirl as a sequential $\mathcal{P}$-twirl, $c \mathcal{X}$-twirl, $c \mathcal{Z}$-twirl, ${\bar{W}}_{2^{k}} / \mathcal{Z}$-twirl and ${\tilde{W}}_{2^{k}} / c \mathcal{Z}$-twirl. Each twirl can be computed in a straightforward manner using the facts we have described, and it reduces the number of independent parameters describing the channel until we have twirled over the whole of $G_{2^{k}}$ (see Supplementary Materials for further details). The final twirled map is

\begin{matrix} (17) & \bar{\mathcal{E}} (ρ) = β_{I} ρ + β_{Z} \sum_{P \in \mathcal{Z} \setminus \{I\}} P ρ P + β_{R} \sum_{P \in \mathcal{P} \setminus \mathcal{Z}} P ρ P . \end{matrix}

In the Liouville representation in the Pauli basis, which has matrix elements $R_{P Q}^{(\bar{\mathcal{E}})} = Tr (P \bar{\mathcal{E}} (Q)) / 2^{n}$ where P and Q are n-qubit Pauli operators, this map has three diagonal blocks corresponding to I, $\mathcal{Z} \setminus \{I\}$ and $\mathcal{P} \setminus \mathcal{Z}$ with elements 1, α_Z:=1−4ⁿβ_R and α_R:=1−2ⁿβ_Z−(4ⁿ−2ⁿ)β_R, respectively.
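The relation between the weights β and the decay parameters α can be checked by direct enumeration. The Python sketch below rests on two standard facts that are assumptions of the illustration rather than statements from the text: a Pauli channel multiplies each Pauli operator Q by the sum of its weights with sign +1 for Paulis commuting with Q and −1 otherwise, and trace preservation fixes β_I = 1 − (2ⁿ−1)β_Z − (4ⁿ−2ⁿ)β_R. Paulis are labelled by hypothetical bit-vector pairs (x, z), and the function name is chosen here.

```python
from fractions import Fraction
from itertools import product


def twirled_channel_eigenvalues(n, beta_z, beta_r):
    """Eigenvalue of the map in Eq. (17) on each n-qubit Pauli, labelled (x, z)."""
    beta_i = 1 - (2 ** n - 1) * beta_z - (4 ** n - 2 ** n) * beta_r
    paulis = [(x, z) for x in product((0, 1), repeat=n)
              for z in product((0, 1), repeat=n)]

    def weight(pauli):
        x, z = pauli
        if any(x):  # not Z-type
            return beta_r
        return beta_z if any(z) else beta_i

    def commute(p, q):
        (xp, zp), (xq, zq) = p, q
        overlap = (sum(a * b for a, b in zip(xp, zq))
                   + sum(a * b for a, b in zip(zp, xq)))
        return overlap % 2 == 0

    return {q: sum(weight(p) if commute(p, q) else -weight(p) for p in paulis)
            for q in paulis}


n, beta_z, beta_r = 2, Fraction(1, 100), Fraction(1, 1000)
alpha_z = 1 - 4 ** n * beta_r
alpha_r = 1 - 2 ** n * beta_z - (4 ** n - 2 ** n) * beta_r
for (x, z), value in twirled_channel_eigenvalues(n, beta_z, beta_r).items():
    if any(x):
        assert value == alpha_r
    elif any(z):
        assert value == alpha_z
    else:
        assert value == 1
print(alpha_z, alpha_r)
```

Exact rational arithmetic is used so that the comparison with the closed-form expressions is free of rounding error.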

References

1. Gottesman, D. An introduction to quantum error correction and fault-tolerant quantum computation. Preprint at http://arxiv.org/abs/0904.2557 (2009).
2. Raussendorf, R. & Harrington, J. Fault-tolerant quantum computation with high threshold in two dimensions. Phys. Rev. Lett. 98, 190504 (2007).
3. Emerson, J., Alicki, R. & Zyczkowski, K. Scalable noise estimation with random unitary operators. J. Opt. B Quantum Semiclassical Opt. 7, S347 (2005).
4. Knill, E. et al. Randomized benchmarking of quantum gates. Phys. Rev. A 77, 012307 (2008).
5. Dankert, C., Cleve, R., Emerson, J. & Livine, E. Exact and approximate unitary 2-designs and their application to fidelity estimation. Phys. Rev. A 80, 012304 (2009).
6. Magesan, E., Gambetta, J. & Emerson, J. Robust randomized benchmarking of quantum processes. Phys. Rev. Lett. 106, 180504 (2011).
7. Wallman, J. & Flammia, S. Randomized benchmarking with confidence. New J. Phys. 16, 103032 (2014).
8. Magesan, E. et al. Efficient measurement of quantum gate error by interleaved randomized benchmarking. Phys. Rev. Lett. 109, 080505 (2012).
9. Gaebler, J. et al. Randomized benchmarking of multiqubit gates. Phys. Rev. Lett. 108, 260503 (2012).
10. Gambetta, J. et al. Characterization of addressability by simultaneous randomized benchmarking. Phys. Rev. Lett. 109, 240504 (2012).
11. Epstein, J., Cross, A., Magesan, E. & Gambetta, J. Investigating the limits of randomized benchmarking protocols. Phys. Rev. A 89, 062321 (2014).
12. Wallman, J., Barnhill, M. & Emerson, J. Characterization of leakage errors via randomized benchmarking. Phys. Rev. Lett. 115, 060501 (2015).
13. Chasseur, T. & Wilhelm, F. Complete randomized benchmarking protocol accounting for leakage errors. Phys. Rev. A 92, 042333 (2015).
14. Gottesman, D. Stabilizer Codes and Quantum Error Correction. PhD dissertation (Caltech, 1997).
15. Barends, R. et al. Rolling quantum dice with a superconducting qubit. Phys. Rev. A 90, 030303(R) (2014).
16. Dugas, A., Wallman, J. & Emerson, J. Characterizing universal gate sets via dihedral benchmarking. Phys. Rev. A 92, 060302(R) (2015).
17. Kimmel, S., da Silva, M. P., Ryan, C., Johnson, B. & Ohki, T. Robust extraction of tomographic information via randomized benchmarking. Phys. Rev. X 4, 011050 (2014).
18. Bombin, H. & Martin-Delgado, M. Topological computation without braiding. Phys. Rev. Lett. 98, 160502 (2007).
19. Bravyi, S. & Kitaev, A. Universal quantum computation with ideal Clifford gates and noisy ancillas. Phys. Rev. A 71, 022316 (2005).
20. Duclos-Cianci, G. & Poulin, D. Reducing the quantum computing overhead with complex gate distillation. Phys. Rev. A 91, 042315 (2015).
21. Paetznick, A. & Svore, K. Repeat-until-success: Non-deterministic decomposition of single-qubit unitaries. Quant. Inf. Comp. 14, 15/16 (2014).
22. Nielsen, M. & Chuang, I. Quantum Computation and Quantum Information. (Cambridge Univ. Press, 2000).
23. Corcoles, A. et al. Process verification of two-qubit quantum gates by randomized benchmarking. Phys. Rev. A 87, 030301(R) (2013).
24. Kelly, J. et al. Optimal quantum control using randomized benchmarking. Phys. Rev. Lett. 112, 240504 (2014).
25. Ni, X. & Van den Nest, M. Commuting quantum circuits: efficient classical simulations versus hardness results. Quant. Inf. Comp. 13, 54–72 (2013).
26. Jozsa, R. & Van den Nest, M. Classical simulation complexity of extended Clifford circuits. Quant. Inf. Comp. 14, 633–648 (2014).
27. Ni, X., Buerschaper, O. & Van den Nest, M. A non-commuting stabilizer formalism. J. Math. Phys. 56, 052201 (2015).
28. Randall, D. Efficient generation of random nonsingular matrices. Random Struct. Algorithms 4, 111–118 (1993).
29. Artin, M. Algebra. (Prentice Hall, 1991).

Acknowledgements

All authors acknowledge support from ARO under contract W911NF-14-1-0124.

Author information

Authors and Affiliations

IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
Andrew W Cross, Easwar Magesan, Lev S Bishop, John A Smolin & Jay M Gambetta

Contributions

A.W.C. proved the main results with substantial contributions from E.M. and J.M.G. L.S.B. and A.W.C. implemented twirling operations and computed group orders. J.A.S. contributed significantly to early discussions. All authors contributed to writing the manuscript.

Corresponding author

Correspondence to Andrew W Cross.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Additional information

Supplementary Information accompanies the paper on the npj Quantum Information website (http://www.nature.com/npjqi)

Supplementary information

Supplemental Materials: Scalable randomized benchmarking of non-Clifford gates (TXT 18 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/

About this article

Cite this article

Cross, A., Magesan, E., Bishop, L. et al. Scalable randomised benchmarking of non-Clifford gates. npj Quantum Inf 2, 16012 (2016). https://doi.org/10.1038/npjqi.2016.12

Received: 20 October 2015
Revised: 17 March 2016
Accepted: 28 March 2016
Published: 26 April 2016
DOI: https://doi.org/10.1038/npjqi.2016.12
