Introduction

To demonstrate an interesting quantum algorithm, such as Shor’s factoring algorithm1, a quantum computer needs to implement more than \(10^{10}\) logical operations, which means that the error rate of each logical operation must be much less than \(10^{-10}\) (see ref. 2). With limited quantum devices and imperfect operations3,4, quantum information needs to be protected by quantum error-correcting codes to achieve fault-tolerant quantum computation5. If a quantum state is encoded in a stabilizer code6,7, the syndrome of an error that has occurred can be measured without disturbing the quantum information of the state. A quantum stabilizer code constructed from a sparse graph is favorable since it affords a two-dimensional layout or simple quantum error-correction procedures. This includes the families of surface and toric codes8, color codes9, random bicycle codes10, and generalized hypergraph-product (GHP) codes11,12.

For a general stabilizer code, the decoding problem of finding the most probable coset of degenerate errors with a given error syndrome is hard13,14, so an efficient decoding procedure with good performance is desired. The complexity of a decoding algorithm is usually a function of the code length N. Edmonds’ minimum-weight perfect matching (MWPM)15 can be used to decode a surface or toric code16,17,18,19; the complexity of MWPM is \(O({N}^{3})\), which can be reduced to \(O({N}^{2})\) if local matching is used, with minor performance loss19,20,21. Duclos-Cianci and Poulin proposed a renormalization group (RG) decoder, which uses a strategy analogous to the decoding of a concatenated code, to decode a toric (or surface) code with complexity proportional to \(N\log (\sqrt{N})\)22. Both MWPM and RG can be generalized for color codes23,24,25,26,27.

On the other hand, most sparse quantum codes can be decoded by belief propagation (BP)10,28,29,30 or its variants with additional processes31,32,33,34,35. BP is an iterative algorithm and the decoding complexity per iteration is O(Nj)30,36, where j is the mean column-weight of the check matrix of a quantum code. In general, an average number of iterations proportional to \(\log \log N\) is sufficient for BP decoding37,38. In practice, a maximum number of iterations \({T}_{\max }\) proportional to \(\log \log N\), with a large enough constant, is chosen. Thus the overall decoding complexity of BP is \(O(Nj{T}_{\max })\) or \(O(Nj\log \log N)\).

Although BP seems to have the lowest complexity, the long-standing problem is that BP does not perform well on quantum codes with high degeneracy unless additional complex processes are included31,33. (We say that a code has high degeneracy or is highly degenerate if it has many stabilizers of weight lower than its minimum distance.) The Tanner graph of a stabilizer code inevitably contains many short cycles, which deteriorate the message-passing process in BP10,31, especially for codes with high degeneracy33,34,39. Any message-passing or neural network decoder may suffer from this issue. One may consider variants of BP with additional efforts in pre-training by neural networks40,41,42,43 or post-processing33,34 such as ordered statistics decoding (OSD)44, but these methods may not be practical for large codes. In this paper, we address this long-standing BP problem by devising an efficient quaternary BP decoding algorithm with memory effects (cf. Eq. (10)), abbreviated MBP, so that the degeneracy of quantum codes can be exploited. Moreover, many known decoders in the literature treat Pauli X and Z errors separately as binary errors, which may incur additional computation overhead or performance loss. MBP handles quaternary errors directly.

The problem of hard-decision decoding of a classical code is like an energy-minimization problem in a neural network45, where an energy function measures the parity-check satisfaction (denoted by \({J}_{S}\)).

It is known that BP has been used for energy minimization in statistical physics31,38,46. Moreover, an iterative decoder based on the gradient descent optimization of the energy function has been proposed47. These motivate us to consider a soft-decision generalization of the energy function with variables that are log-likelihood ratios (LLRs) of Pauli errors and to make connections between BP and the gradient descent algorithm. We define an energy function with an additional term \({J}_{D}\) that measures the distance between a recovery operator and the initial channel statistics. Then we show that BP in the log domain is like a gradient descent optimization for this generalized energy function but with more elegant step updates48. This explains why the conventional BP may work well on a nondegenerate quantum code10, since this is similar to the classical case47.

For a highly-degenerate quantum code, many low-weight stabilizers correspond to local minima in the energy topology, so the conventional BP easily gets trapped in these local minima near the origin. This suggests that we should use a larger step (which can be controlled by message normalization)30. However, this alone is not enough, since the energy-minimization process may not converge if large steps are taken. An observation from neural networks is that inhibitions (edges with negative weights) between neurons can enhance the perception capability of a network and improve the pattern-recognition accuracy49,50,51,52,53. MBP is mathematically formulated to have this inhibition functionality, which helps to resist wrong beliefs passing on the Tanner graph (due to short cycles37) or to effectively accelerate the search in a gradient descent optimization. An important feature of MBP is that no additional computation is required; thus, the complexity of MBP remains the same as that of the conventional BP with message normalization.

The performance of MBP can be further improved by choosing an appropriate step-size for each error syndrome. However, it is difficult to determine the step-size precisely. If the step-size is too large, MBP may return incorrect solutions or diverge. We propose to choose the step-size from an ε-net so that it can be determined adaptively. This adaptive scheme will be called AMBP. The overall complexity of AMBP is still \(O(Nj\log \log N)\) since the chosen ε is independent of N.

Another technique adopted in MBP is fixed initialization54,55. The energy function and energy topology are defined according to the channel statistics. If MBP performs well for certain channel statistics (say, at a certain depolarizing rate \({\epsilon }_{0}\)), it means that MBP can correctly determine most syndrome-and-error pairs on that topology. Thus it is better to decode using this energy topology, regardless of the true channel statistics. This technique works for any quantum code.

Computer simulations of MBP on various quantum codes are performed. Note that MBP naturally extends to a more complicated error model of simultaneous data and measurement errors56; however, perfect syndrome measurements are assumed in this paper since we focus on the algorithm and performance of BP on degenerate quantum codes. In RESULTS, we demonstrate the decoding of quantum bicycle codes10, a highly-degenerate GHP code33, and the (rotated) surface or toric codes57,58. Our simulation results show that MBP performs significantly better than the conventional BP. In particular, any error that is degenerate with the target error up to a stabilizer can also serve as the target, and MBP is able to locate such errors. (See more discussions and examples about energy minimization and the memory effects of MBP in ref. 48.)

Results

Computer simulations

Our main results are based on an efficient decoder for quantum codes, the quaternary MBP (MBP4) (see Algorithm 1). MBP4 has a configurable step-size, which is scaled by a positive constant \({\alpha }^{-1}\). For comparison, the conventional quaternary BP (BP4) is similar to MBP4 with α = 1 but without the additional memory effects. MBP4 can be extended to AMBP4 (Algorithm 2). We simulate the decoding performance of various quantum codes by MBP4 and AMBP4 in the following. The message-update schedule will be denoted by a prefix parallel/serial28.

For an [[N, K, D]] quantum code that encodes K logical qubits into N physical qubits with minimum distance D, if every error of weight smaller than or equal to t is correctable, its logical error rate is

$${P}_{{{{\rm{BDD}}}}}(t)\,\triangleq\, 1-\mathop{\sum }\limits_{i = 0}^{t}{{N}\choose{i}}{\epsilon }^{i}{(1-\epsilon )}^{N-i}$$
(1)

at depolarizing rate ϵ, using bounded distance decoding (BDD). Let r × BDD denote the case that any N-fold Pauli error of weight \(\le \,t=\lfloor \frac{rD-1}{2}\rfloor\) is correctable so that the logical error rate is \({P}_{{{{\rm{BDD}}}}}(\lfloor \frac{rD-1}{2}\rfloor )\). If D is unknown, we may directly specify BDD with some t instead of r × BDD for comparison. Usually, a good classical decoding procedure has a correction radius between 1 × BDD and 2 × BDD37. However, the degeneracy of a quantum code is not considered in BDD; we may have decoding performance much better than 2 × BDD in the quantum case. In addition, the optimal achievable decoding performance is unknown13,14, so r × BDD serves as a good benchmark.
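For concreteness, Eq. (1) and the r × BDD benchmark can be evaluated with a few lines of Python. This is a minimal sketch; the function names below are ours, not from the original paper.

```python
from math import comb, floor

def p_bdd(N: int, t: int, eps: float) -> float:
    """Eq. (1): logical error rate of an ideal bounded-distance decoder
    correcting every error of weight <= t at depolarizing rate eps."""
    return 1.0 - sum(comb(N, i) * eps**i * (1.0 - eps)**(N - i)
                     for i in range(t + 1))

def p_r_bdd(N: int, D: int, r: float, eps: float) -> float:
    """r x BDD: errors of weight up to t = floor((r*D - 1)/2) are correctable."""
    return p_bdd(N, floor((r * D - 1) / 2), eps)

# Example: 1 x BDD vs 2 x BDD for an [[882, 48, 16]] code at eps = 0.05.
print(p_r_bdd(882, 16, 1, 0.05), p_r_bdd(882, 16, 2, 0.05))
```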

The mean weight of the rows in the check matrix of a quantum code is called the row-weight and denoted by k. If the row-weight is small, then the quantum code has many low-weight stabilizers. We say that a quantum code is more degenerate if the row-weight of its check matrix is smaller compared to the minimum distance of the code. We will see that MBP4 improves on the conventional BP4 more when the tested code is more degenerate. In our simulations, the normalization factor α for the step-size in MBP4 is chosen to be roughly proportional to k and inversely proportional to the depolarizing rate ϵ. (See the analysis in ref. 48.)

A relatively larger step-size may be needed for a highly-degenerate quantum code to decode those errors with a weight larger than the row-weight of the check matrix. We use an ε-net of α to adaptively determine the best value \({\alpha }^{* }\) for each error syndrome (Algorithm 2, denoted AMBP4). Since MBP4 is queried as a subroutine in AMBP4 at most \({\varepsilon }^{-1}\) times, the computation complexity of AMBP4 is higher. If ε is independent of N, then the asymptotic complexity remains the same. How to determine \({\alpha }^{* }\) efficiently is worth further study.

We briefly explain how to interpret the simulation results. Let \({{{{\mathcal{G}}}}}_{N}\) be the N-fold Pauli group, \({{{\mathcal{S}}}}\subset {{{{\mathcal{G}}}}}_{N}\) be the stabilizer group that defines a quantum code, and \({{{{\mathcal{S}}}}}^{\perp }\subset {\{I,X,Y,Z\}}^{\otimes N}\) denote the set of operators with phase +1 that commute with \({{{\mathcal{S}}}}\) in \({{{{\mathcal{G}}}}}_{N}\). Let \({n}_{{{{\rm{tot}}}}}\) be the number of tested error samples for a data point in the simulation of the performance curve of a code. Suppose that \({{{{\bf{E}}}}}^{(i)}\) and \({\hat{{{{\bf{E}}}}}}^{(i)}\in {\{I,X,Y,Z\}}^{\otimes N}\) are the tested and estimated errors, respectively, for i = 1, 2, …, \({n}_{{{{\rm{tot}}}}}\). Denote

$${n}_{{{{\rm{0}}}}}=\#\,{{{\rm{of}}}}\,{{{\rm{pairs}}}}\,({{{{\bf{E}}}}}^{(i)},{\hat{{{{\bf{E}}}}}}^{(i)}):{\hat{{{{\bf{E}}}}}}^{(i)}\,\ne\, {{{{\bf{E}}}}}^{(i)},$$
(2)
$${n}_{{{{\rm{e}}}}}=\#\,{{{\rm{of}}}}\,{{{\rm{pairs}}}}\,({{{{\bf{E}}}}}^{(i)},{\hat{{{{\bf{E}}}}}}^{(i)}):{\hat{{{{\bf{E}}}}}}^{(i)}\notin \pm\! {{{{\bf{E}}}}}^{(i)}{{{\mathcal{S}}}},$$
(3)
$${n}_{{{{\rm{u}}}}}=\#\,{{{\rm{of}}}}\,{{{\rm{pairs}}}}\,({{{{\bf{E}}}}}^{(i)},{\hat{{{{\bf{E}}}}}}^{(i)}):{\hat{{{{\bf{E}}}}}}^{(i)}{{{{\bf{E}}}}}^{(i)}\in {{{{\mathcal{S}}}}}^{\perp }\setminus \pm {{{\mathcal{S}}}}.$$
(4)

Empirically, we have the classical block error rate \(P(\hat{{{{\bf{E}}}}}\,\ne\, {{{\bf{E}}}})={n}_{{{{\rm{0}}}}}/{n}_{{{{\rm{tot}}}}}\), the quantum logical error rate \(P(\hat{{{{\bf{E}}}}}\,\notin\, \pm {{{\bf{E}}}}{{{\mathcal{S}}}})={n}_{{{{\rm{e}}}}}/{n}_{{{{\rm{tot}}}}}\), and the undetected error rate \(P(\hat{{{{\bf{E}}}}}{{{\bf{E}}}}\in {{{{\mathcal{S}}}}}^{\perp }\setminus \pm {{{\mathcal{S}}}})={n}_{{{{\rm{u}}}}}/{n}_{{{{\rm{tot}}}}}\).

Since \((\hat{{{{\bf{E}}}}}\,\notin\, \pm {{{\bf{E}}}}{{{\mathcal{S}}}})\subseteq (\hat{{{{\bf{E}}}}}\,\ne\, {{{\bf{E}}}})\), by Bayes’ rule, we have

$$\begin{array}{l}P(\hat{{{{\bf{E}}}}}\,\notin \pm {{{\bf{E}}}}{{{\mathcal{S}}}})=P(\hat{{{{\bf{E}}}}}\,\notin \pm {{{\bf{E}}}}{{{\mathcal{S}}}},\hat{{{{\bf{E}}}}}\,\ne\, {{{\bf{E}}}})\\ \qquad\qquad\qquad\,=P(\hat{{{{\bf{E}}}}}\,\ne\, {{{\bf{E}}}})\times P(\hat{{{{\bf{E}}}}}\,\notin \pm {{{\bf{E}}}}{{{\mathcal{S}}}}| \hat{{{{\bf{E}}}}}\,\ne\, {{{\bf{E}}}})\\\qquad\qquad\qquad\,=\frac{{n}_{{{{\rm{0}}}}}}{{n}_{{{{\rm{tot}}}}}}\times \frac{{n}_{{{{\rm{e}}}}}}{{n}_{{{{\rm{0}}}}}}.\end{array}$$
(5)

Usually, a classical decoding strategy is to lower \({n}_{0}/{n}_{{{{\rm{tot}}}}}\), which means that the target error needs to be accurately located from a given syndrome. Such a strategy has a performance limit due to short cycles or strong degeneracy of the code. If a decoder converges to any one of the degenerate errors, the decoding succeeds. A better strategy also has to lower the ratio \({n}_{{{{\rm{e}}}}}/{n}_{0}\), which will be called the error suppression ratio, by exploiting the degeneracy.
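As an illustration, the counters in Eqs. (2)–(4) and the rates in Eq. (5) could be tallied as in the sketch below. The predicates `is_logical_error` and `is_undetected` are hypothetical placeholders: deciding whether \(\hat{{{{\bf{E}}}}}{{{\bf{E}}}}\) lies in \(\pm {{{\mathcal{S}}}}\) or in \({{{{\mathcal{S}}}}}^{\perp }\setminus \pm {{{\mathcal{S}}}}\) requires symplectic linear algebra over GF(2) that is omitted here.

```python
def tally(pairs, is_logical_error, is_undetected):
    """pairs: list of (E, E_hat) tuples of Pauli strings; the two predicates
    stand in for the coset-membership tests of Eqs. (3) and (4)."""
    n_tot = len(pairs)
    n_0 = sum(e_hat != e for e, e_hat in pairs)                  # Eq. (2)
    n_e = sum(is_logical_error(e, e_hat) for e, e_hat in pairs)  # Eq. (3)
    n_u = sum(is_undetected(e, e_hat) for e, e_hat in pairs)     # Eq. (4)
    return {
        "block_error_rate": n_0 / n_tot,       # P(E_hat != E)
        "logical_error_rate": n_e / n_tot,     # P(E_hat not in +-E S)
        "undetected_error_rate": n_u / n_tot,  # P(E_hat E in S_perp \ +-S)
        "error_suppression_ratio": n_e / n_0 if n_0 else 0.0,  # Eq. (5)
    }
```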

In the simulations, \({{{{\bf{E}}}}}^{(i)}\) is drawn from a memoryless depolarizing error model and then decoded as \({\hat{{{{\bf{E}}}}}}^{(i)}\). The pairs \(({{{{\bf{E}}}}}^{(i)},{\hat{{{{\bf{E}}}}}}^{(i)})\) are collected until we have 100 logical error events for a data point; otherwise, an error bar between two crosses is used to show the 95% confidence interval (1.96 times the standard error of the mean). If the maximum number of iterations \({T}_{\max }\) is reached but BP does not converge, the decoding stops and the error sample is counted as a logical error. \({T}_{\max }\) is chosen to match the literature for comparison whenever it was specified. (Empirically, \({T}_{\max }\) is chosen to be much larger than the average number of iterations τ.) We will see that MBP4 significantly improves on BP4, with faster convergence (smaller τ) and a lower logical error rate (with \({n}_{{{{\rm{e}}}}}/{n}_{0} < 1\)).
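For reference, one common way to compute such an interval, assuming a binomial model for the error counts (our sketch, not the authors' code), is:

```python
def ci95(n_events: int, n_tot: int):
    """95% confidence interval for an empirical rate n_events / n_tot,
    taken as 1.96 standard errors of the mean of a Bernoulli sample."""
    p = n_events / n_tot
    half = 1.96 * (p * (1.0 - p) / n_tot) ** 0.5
    return p - half, p + half
```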

Bicycle codes

MacKay et al. constructed families of random bicycle codes, which are sparse-graph codes with performance possibly close to the quantum Gilbert–Varshamov rate10. To obtain an [[N, K]] random bicycle code, the row-weight k is chosen and two random circulant matrices are generated accordingly to define the check matrix of a quantum code of rate \(\frac{K}{N}\) (after proper row-deletion). Since the minimum distance of the code is no larger than k by construction, the code may have a high decoding error-floor when k is small. For [[3786, 946]] bicycle codes, MacKay et al. showed that a code of row-weight k ≥ 24 can have good BP decoding performance. However, the decoding complexity is lower for a check matrix with a smaller k, and the syndrome measurements are simpler. Thus we would like to have a good decoder for random bicycle codes of small row-weight.

We first construct bicycle codes with the same parameters as in ref. 10. Figure 1a shows the conventional BP4 performance on [[3786, 946]] bicycle codes for row-weights k = 24, 20, 16, 12. It shows that the code of row-weight 24 is able to achieve a logical error rate of \(10^{-4}\) before hitting the error-floor. Also shown in Fig. 1a are the performance curves from MacKay et al. using binary BP (BP2), which treats Pauli X and Z errors separately. It can be seen that BP4 performs better than BP2 because the correlations between X errors and Z errors are considered28.

Fig. 1: Performance of parallel BP4 and MBP4 on the [[3786, 946]] bicycle codes of different row-weights (k), based on \({T}_{\max }=90\).
figure 1

a MBP4 with α = 1 (conventional BP4). b MBP4 with appropriate α > 1 chosen for each ϵ. c Average numbers of iterations in a (dotted lines) and b (solid lines). The [M04] curves in a are obtained using a conversion from bit-flip error rate to depolarizing error rate, based on Fig. 6 and Eq. (40) in ref. 10. In a and b, 100 logical error events are collected for each data point; otherwise, an error bar between two crosses indicates the 95% confidence interval.

Now we show that the performance is significantly improved with MBP4, as shown in Fig. 1b. The error-floor performance is improved, and the code of row-weight 16 is able to achieve a logical error rate of \(10^{-6}\). The minimum distance of a bicycle code is usually unknown, so it is hard to compare its performance with r × BDD for some r. However, we know that the minimum distance of a bicycle code is no larger than k. Thus the 1 × BDD performance of a code with k = 16 is at most \({P}_{{{{\rm{BDD}}}}}(\lfloor \frac{16-1}{2}\rfloor )\). On the other hand, the performance of MBP4 on the code of row-weight 16 is close to \({P}_{{{{\rm{BDD}}}}}(t)\) with t between 140 and 200, depending on the logical error rate, as shown in Fig. 1b, which is better than 17 × BDD.

The average numbers of iterations are shown in Fig. 1c. The convergence behavior is good since the average number of iterations decreases as the physical error rate decreases. It can be seen that the average number of iterations using MBP4 for k = 12 decreases more than in the other three cases of k = 16, 20, and 24 since there are more lower-weight stabilizers and hence more low-weight degenerate errors.

Next, we study whether MBP4 improves the error suppression ratio \({n}_{{{{\rm{e}}}}}/{n}_{0}\) defined in Eq. (5). Detailed error counts \({n}_{0}\), \({n}_{{{{\rm{e}}}}}\), \({n}_{{{{\rm{u}}}}}\), and \({n}_{{{{\rm{tot}}}}}\) of (M)BP4 on the two codes of k = 16 and k = 12 at depolarizing rates ϵ = 0.027, 0.037, and 0.049 are provided in Table 1. MBP4 has \({n}_{{{{\rm{e}}}}}/{n}_{0} < 1\) for these two codes if ϵ ≤ 0.049. Note that \({n}_{{{{\rm{e}}}}}/{n}_{0}\) is small if the decoder finds degenerate errors most of the time. We observe that the decoder exploits the degeneracy more for a code with lower-weight stabilizers. If the depolarizing rate is smaller, the ratio \({n}_{{{{\rm{e}}}}}/{n}_{0}\) is smaller for MBP4 on both codes. For k ≤ 12, the minimum distance of a bicycle code would be too small to have a low error-floor.

Table 1 Numbers of various events in the simulations of BP4 and MBP4 on the bicycle codes of row-weights 16 and 12.

We remark that the conventional BP4 has \({n}_{{{{\rm{e}}}}}/{n}_{0}\approx 1\) in most cases when k ≥ 16. Also listed in Table 1 are the numbers of undetected errors, which are nonzero for k = 12. However, the ratio \({n}_{{{{\rm{u}}}}}/{n}_{{{{\rm{tot}}}}}\) tends to be small.

To further improve the performance of these bicycle codes, we use AMBP4 with \({\alpha }^{* }\in \{2.4,2.3,\ldots ,0.5\}\). Herein we consider the serial schedule because it accelerates the message update and enlarges the error-correction radius in finite iterations. The performance curves in Fig. 1b are significantly improved, as shown in Fig. 2.

Fig. 2: Performance of serial AMBP4 on [[3786, 946]] bicycle codes.
figure 2

Each [[3786, 946]] bicycle code is constructed with a specific row-weight k. Several BDD performance curves are provided for reference.

In the case of quantum communication, we may focus on a target logical error rate of \(10^{-4}\) (see ref. 10), where quantum retransmission is possible if necessary59. Consider \(\epsilon =\frac{t}{N}\) for large N. The quantum Gilbert–Varshamov rate6,60 states that there exists a code of rate \(\frac{1}{4}\) with the target logical error rate at ϵ = 0.063. One can see from Fig. 2 that the [[3786, 946]] bicycle code with k = 16 has this target logical error rate at ϵ = 0.057, which is close to the quantum Gilbert–Varshamov rate.

Generalized hypergraph-product code

Herein we consider the decoding of an [[882, 48, 16]] GHP code constructed in ref. 33. This code has row-weight 8, which is less than its minimum distance, so the code is highly degenerate. The performance of this code under each decoding strategy is shown in Fig. 3. The conventional BP4, whether parallel or serial, does not perform well enough. On the other hand, we find that most errors can be decoded by MBP4 with α ∈ [1.2, 1.5]. The results can be further improved by AMBP4 with \({\alpha }^{* }\in \{1.5,1.49,\ldots ,0.5\}\), for both the parallel and serial schedules.

Fig. 3: The performance of the [[882, 48, 16]] GHP code.
figure 3

The maximum number of iterations is \({T}_{\max }=32\). The decoding error rate for the curve \(P(\hat{{{{\bf{E}}}}}\,\ne\, {{{\bf{E}}}})\) is the classical block error rate; for the other (non-BDD) curves, it is the logical error rate. The [PK19] curves are from ref. 33.

For reference, we also plot the performance curves in the literature33 in Fig. 3. The curve “[PK19] BP” is quaternary BP with a layered schedule, and the curve “[PK19] BP-OSD-ω” is BP with OSD and additional post-processing. In addition to BP and OSD, BP-OSD-ω has to sort out \({2}^{\omega }\) errors in ω unreliable coordinates, so its complexity is high. For this [[882, 48, 16]] GHP code, as shown in Fig. 3, the performance of AMBP4 is better than that of BP-OSD-ω with ω = 15. The complexity of AMBP4 is low enough that we can simulate down to lower logical error rates.

We also plot several r × BDD performance curves for comparison. Observe that the curve of serial AMBP4 has a slope roughly aligned with 1 × BDD, but its performance is close to 8 × BDD at a logical error rate of \(10^{-6}\), since more low-weight errors are corrected. We also draw the curve of the classical block error rate \(P(\hat{{{{\bf{E}}}}}\,\ne\, {{{\bf{E}}}})={n}_{{{{\rm{0}}}}}/{n}_{{{{\rm{tot}}}}}\), which becomes the logical error rate after multiplication by the ratio \({n}_{{{{\rm{e}}}}}/{n}_{0}\). Figure 3 shows that the improvement by the ratio \({n}_{{{{\rm{e}}}}}/{n}_{0}\) is quite significant, which means that AMBP4 is able to exploit the code degeneracy for better performance.

Surface and toric codes

In this subsection, we simulate the surface codes with a 45° rotation for lower overhead57,58. Our analysis can be applied to rotated toric codes as well.

An \([[{L}^{2},1,L]]\) surface code for an odd integer L can be defined on an L × L square lattice. Figure 4a provides an example with L = 5. A stabilizer generator of a surface code has weight 2 or 4, independent of the minimum distance. Consequently, a large surface code is highly-degenerate. As mentioned in the Introduction, the conventional BP cannot handle highly-degenerate quantum codes since there could be many errors of similar likelihood, so BP will hesitate among these errors. The decoding performance curves of the conventional (parallel) BP4 and (serial) MBP4 on several surface codes are shown in Fig. 5. It can be seen that the conventional BP4 does not work well on these surface codes. Moreover, the logical error rate is worse for a surface code with a larger minimum distance.

Fig. 4: The lattice representations of (rotated) surface and toric codes.
figure 4

a \([[{L}^{2},1,L]]\) surface code with L = 5. b \([[{L}^{2},2,L]]\) toric code with L = 4. In both figures, a qubit is represented by a yellow box numbered from 1 to N. Since the toric code is defined on a torus, there are orange boxes on the right and bottom in b, each representing the qubit of the same number. An X- or Z-type stabilizer is indicated by a label W ∈ {X, Z} between its neighboring qubits. For example, in a, the label X between qubits 1 and 2 indicates \({X}_{1}{X}_{2}\), and the label Z between qubits 1, 2, 6, and 7 indicates \({Z}_{1}{Z}_{2}{Z}_{6}{Z}_{7}\).

Fig. 5: Performance curves of several surface codes using the conventional BP4 and serial (A)MBP4.
figure 5

The maximum number of iterations is \({T}_{\max }=150\). The technique of fixed initialization is used with \({\epsilon }_{0}=0.013\).

On the other hand, serial MBP4 is able to decode the surface codes, as shown in Fig. 5. For L = 17, the decoding performance of serial MBP4 with α = 0.65 is around 1 × BDD to 2 × BDD, which agrees with Gallager’s expectation on BP decoding of classical codes37.

How MBP4 decodes the surface codes is examined as follows. As previously discussed, \({n}_{{{{\rm{e}}}}}/{n}_{0}\) would be small if a decoder can find degenerate errors of the target, which is indeed the case for MBP4, as shown in Fig. 6a. We also observe undetected error events in the serial MBP4 decoding. For the conventional BP4, we have \({n}_{{{{\rm{e}}}}}/{n}_{0}\approx 1\) and an undetected error rate ≈ 0 for L > 7. Thus the improvement of serial MBP4 over BP4 comes at the cost of some undetected error events, as shown in Fig. 6b. (A similar phenomenon was also observed in the neural BP decoder42.) This unwanted phenomenon is not surprising, since a large step-size is used so that BP may jump too far, causing logical errors. (We remark that this is not a random search; otherwise, the ratio \({n}_{{{{\rm{e}}}}}/{n}_{0}\) would be as large as 3/4 since there are four logical operators, I, X, Y, and Z, for a logical qubit.) However, the undetected error rate is smaller for larger L, so this is acceptable for the purpose of fault-tolerant quantum computation. Figure 6c compares the average numbers of iterations for serial MBP4 and the conventional BP4. It can be seen that serial MBP4 uses fewer iterations than the conventional BP4, and yet the performance of serial MBP4 is better. This means that the convergence behavior of serial MBP4 is more accurate and the computation is more economical and effective.

Fig. 6: Some statistical results of serial MBP4 (α = 0.65) on surface codes (solid lines).
figure 6

a The error suppression ratio \({n}_{{{{\rm{e}}}}}/{n}_{0}\). b Undetected error rate. c Average numbers of iterations (solid lines); also shown in c are the numbers for the conventional BP4 (dotted lines). In b, an error bar between two crosses indicates the 95% confidence interval.

Next, we verify that the runtime of MBP4 is O(Nj) per iteration. First, consider the toric codes, with mean column-weight j = 4. We test serial MBP4 with α = 0.75 at a depolarizing rate of 0.32 on one core (4.9 GHz) of an Intel i9-9900K machine. The average runtime per iteration is shown in Fig. 7, and it is clearly linear in N. Then we consider the surface codes, which have a mean column-weight slightly smaller than 4. As expected, the average runtime per iteration is again linear in N, and the slope is smaller than that for the toric codes, as shown in Fig. 7.

Fig. 7: Almost linear runtime of MBP4.
figure 7

The average runtime of MBP4 on each toric or surface code is plotted.

Although MBP4 succeeds in decoding topological codes in our simulations, it is also observed that the performance of MBP4 on surface codes saturates for large L, i.e., the slope of the performance curve does not increase as L increases. For better decoding performance, we use AMBP4 with \({\alpha }^{* }\in \{1.0,0.99,\ldots ,0.5\}\), and the performance for L = 17 is greatly improved, as shown in Fig. 5. In Fig. 8, we plot the performance of AMBP4 for each surface code of lattice size L ∈ {3, 5, 7, …, 17}, and an error threshold of about 16% is observed. Similarly, a slightly higher error threshold of roughly 17.5% can be observed on the toric codes using AMBP4 decoding48.

Fig. 8: The threshold performance of serial AMBP4 on the surface codes.
figure 8

The dashed line represents the error rate without error-correction.

Finally, we compare various polynomial-time decoders in terms of error thresholds and computation complexity in Table 2. Let \({\epsilon }_{{{{\rm{surf}}}}}\) and \({\epsilon }_{{{{\rm{toric}}}}}\) denote the error thresholds for the surface and toric codes, respectively. Certain decoders can approach the quantum hashing bound (which is roughly 18.9%6,60,61) for the surface or toric codes, but they will not be considered here due to their high complexities62,63,64. MWPM achieves \({\epsilon }_{{{{\rm{surf}}}}}\approx {\epsilon }_{{{{\rm{toric}}}}}=15.5 \%\)17,19. RG combined with BP (RG-BP) achieves \({\epsilon }_{{{{\rm{toric}}}}}=16.4 \%\)22. The matrix product states (MPS) decoder achieves \(17 \% \le {\epsilon }_{{{{\rm{surf}}}}}\le 18.5 \%\) with a specified complexity of \(O({N}^{2})\)65. Union-find (UF) has complexity almost linear in N, but its decoding performance is slightly worse than that of MWPM66. BP-assisted MWPM (BP-MWPM) has high thresholds for both the surface and toric codes, but its complexity is \(O({N}^{2.5})\)32. AMBP4 achieves roughly \({\epsilon }_{{{{\rm{surf}}}}}=16 \%\) and \({\epsilon }_{{{{\rm{toric}}}}}=17.5 \%\), and its complexity is only \(O(N\log \log N)\). Thus AMBP4 is very competitive in both decoding performance and computation complexity.

Table 2 The error thresholds and computation complexities of various decoders on the surface codes (\({\epsilon }_{{{{\rm{surf}}}}}\)) and toric codes (\({\epsilon }_{{{{\rm{toric}}}}}\)) over depolarizing errors. An entry is marked “−” if the value is not provided in the literature.

Discussion

We analyzed the energy topology of BP and proposed an efficient BP decoding algorithm for quantum codes called MBP. MBP exploits the degeneracy of a quantum code by finding degenerate errors of the target. MBP is competitive in both decoding performance and computation complexity. The reader can find a detailed comparison of the thresholds and complexities of MWPM- or BP-based decoders on various topological codes (including color codes and XZZX codes) over depolarizing errors in Table II of ref. 67.

It is known that BP can be treated as a recurrent neural network (RNN)68. Similarly, our MBP induces an RNN with inhibition but without a pre-training process. This may explain why RNN decoders can work on degenerate codes42. Thus, one may consider an MBP-based neural network decoder, which naturally generalizes the BP-based neural networks42,68. One would then have an adjustable parameter \({\alpha }_{mn,i}\) for each edge (m, n) at iteration i.

In AMBP4, one has to find a proper value for \({\alpha }^{* }\). An efficient strategy to select \({\alpha }^{* }\) is desired. A clue is that \({\alpha }^{* }\) should be related to the properties of the error syndrome. For example, a syndrome vector of high weight usually corresponds to an error of high weight, and a smaller value of α should be chosen.

Our decoder can be extended for fault-tolerant quantum computation with imperfect quantum gates, following the initial study of BP decoding for both data and syndrome errors56. This is our ongoing work.

Methods

BP with additional memory effects (MBP)

Decoding an [[N, K]] quantum code subject to an (unknown) error \({{{\bf{E}}}}\in {{{{\mathcal{G}}}}}_{N}\) is to estimate an \(\hat{{{{\bf{E}}}}}\in \pm {{{\bf{E}}}}{{{\mathcal{S}}}}\), given a check matrix \({{{\bf{S}}}}\in {\{I,X,Y,Z\}}^{M\times N}\) (where M ≥ N − K), a syndrome \({{{\bf{z}}}}\in {\{0,1\}}^{M}\), a real α > 0, and initial LLRs \({\{{{{{\mathbf{\Lambda }}}}}_{n} = ({{{{\mathbf{\Lambda }}}}}_{n}^{X},{{{{\mathbf{\Lambda }}}}}_{n}^{Y},{{{{\mathbf{\Lambda }}}}}_{n}^{Z})\in {{\mathbb{R}}}^{3}\}}_{n = 1}^{N}\) of the error rate at each qubit (see ref. 30). The error syndrome \({{{\bf{z}}}}\in {\{0,1\}}^{M}\) is defined by

$${{{{\bf{z}}}}}_{m}\,\triangleq\, \left\{\begin{array}{ll}0,&{{{\rm{if}}}}\,{{{\bf{E}}}}\,{{{\rm{and}}}}\,{{{{\bf{S}}}}}_{m}\,{{{\rm{commute}}}};\\ 1,&{{{\rm{if}}}}\,{{{\bf{E}}}}\,{{{\rm{and}}}}\,{{{{\bf{S}}}}}_{m}\,{{{\rm{anticommute}}}};\end{array}\right.$$

where \({{{{\bf{S}}}}}_{m}\) is the m-th row of S. For simplicity, an N-fold Pauli operator is represented by \({{{\bf{E}}}}=({{{{\bf{E}}}}}_{1},{{{{\bf{E}}}}}_{2},\ldots ,{{{{\bf{E}}}}}_{N})\in {\{I,X,Y,Z\}}^{N}\).

The LLR value \({{{{\mathbf{\Lambda }}}}}_{n}^{W}\,\triangleq\, \ln ({{{{\bf{p}}}}}_{n}^{I}/{{{{\bf{p}}}}}_{n}^{W})\) for W ∈ {X, Y, Z} is initialized by a distribution vector \({{{{\bf{p}}}}}_{n}=({{{{\bf{p}}}}}_{n}^{I},{{{{\bf{p}}}}}_{n}^{X},{{{{\bf{p}}}}}_{n}^{Y},{{{{\bf{p}}}}}_{n}^{Z})=(1-{\epsilon }_{0},\frac{{\epsilon }_{0}}{3},\frac{{\epsilon }_{0}}{3},\frac{{\epsilon }_{0}}{3})\) for independent depolarizing errors. The value of \({\epsilon }_{0}\) can be the channel error rate ϵ or an independently fixed value in [0, 1]. When the LLR is initialized by a fixed value, this is referred to as fixed initialization.
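For concreteness, a small sketch of this setup in Python follows (our own encoding, not the authors' code): Paulis are encoded as integers 0, 1, 2, 3 for I, X, Y, Z, so two single-qubit Paulis anticommute exactly when both are non-identity and distinct.

```python
import numpy as np

def anticommute(a: int, b: int) -> int:
    """<a,b> = 1 iff single-qubit Paulis a, b anticommute (0=I,1=X,2=Y,3=Z)."""
    return int(a != 0 and b != 0 and a != b)

def syndrome(S: np.ndarray, E: np.ndarray) -> np.ndarray:
    """z_m = sum_n <E_n, S_mn> mod 2: 1 iff E anticommutes with row S_m."""
    M, N = S.shape
    return np.array([sum(anticommute(E[n], S[m, n]) for n in range(N)) % 2
                     for m in range(M)], dtype=int)

def init_llrs(N: int, eps0: float) -> np.ndarray:
    """Lambda_n^W = ln(p_n^I / p_n^W) for the depolarizing prior
    (1 - eps0, eps0/3, eps0/3, eps0/3). Fixed initialization keeps eps0
    constant in place of the true channel rate."""
    return np.full((N, 3), np.log(3.0 * (1.0 - eps0) / eps0))
```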

We denote \({{{\mathcal{N}}}}(m)=\{n:{{{{\bf{S}}}}}_{mn}\,\ne\, I\}\) and \({{{\mathcal{M}}}}(n)=\{m:{{{{\bf{S}}}}}_{mn}\,\ne\, I\}\). Define functions \({\lambda }_{W}:{{\mathbb{R}}}^{3}\to {\mathbb{R}}\)

$${\lambda }_{W}({\gamma }^{X},{\gamma }^{Y},{\gamma }^{Z})\,\triangleq\, \ln \frac{1+{{{{\rm{e}}}}}^{-{\gamma }^{W}}}{{{{{\rm{e}}}}}^{-{\gamma }^{X}}+{{{{\rm{e}}}}}^{-{\gamma }^{Y}}+{{{{\rm{e}}}}}^{-{\gamma }^{Z}}-{{{{\rm{e}}}}}^{-{\gamma }^{W}}}$$
(6)

for W ∈ {X, Y, Z}. Also define an operation \(\boxplus\): for k real scalars \({a}_{1},{a}_{2},\ldots ,{a}_{k}\in {\mathbb{R}}\),

$$\mathop\boxplus \limits_{n\; =\; 1}^{k}\,{a}_{n}\,\triangleq\, 2{\tanh }^{-1}\left(\mathop{\prod }\nolimits_{n = 1}^{k}\tanh \frac{{a}_{n}}{2}\right).$$
(7)

We abbreviate the notation \({{{\mathcal{M}}}}(n)\setminus \{m\}\) as \({{{\mathcal{M}}}}(n)\setminus m\).
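Eqs. (6) and (7) translate directly into code; below is a sketch under the same integer encoding as above (the clipping guard against \(\tanh^{-1}(\pm 1)\) is our numerical safeguard, not part of the paper).

```python
import numpy as np

def lambda_W(gamma, W: int) -> float:
    """Eq. (6): gamma = (gamma^X, gamma^Y, gamma^Z); W in {1,2,3} for X,Y,Z,
    so W - 1 indexes the array."""
    e = np.exp(-np.asarray(gamma, dtype=float))
    return float(np.log((1.0 + e[W - 1]) / (e.sum() - e[W - 1])))

def boxplus(values) -> float:
    """Eq. (7): 2 * arctanh( prod_n tanh(a_n / 2) )."""
    p = np.prod(np.tanh(np.asarray(values, dtype=float) / 2.0))
    p = np.clip(p, -1.0 + 1e-12, 1.0 - 1e-12)  # numerical guard for arctanh
    return float(2.0 * np.arctanh(p))
```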

Algorithm 1:

Quaternary MBP (MBP4)

Input: \({{{\bf{S}}}}\in {\{I,X,Y,Z\}}^{M\times N}\), \({{{\bf{z}}}}\in {\{0,1\}}^{M}\), \({T}_{\max }\in {{\mathbb{Z}}}_{+}\), a real α > 0, and initial LLRs \({\{({{{{\mathbf{\Lambda }}}}}_{n}^{X},{{{{\mathbf{\Lambda }}}}}_{n}^{Y},{{{{\mathbf{\Lambda }}}}}_{n}^{Z})\in {{\mathbb{R}}}^{3}\}}_{n = 1}^{N}\).

Initialization. For n = 1 to N and \(m\in {{{\mathcal{M}}}}(n)\), let

$${{{{\mathbf{\Gamma }}}}}_{n\to m}^{W}={{{{\mathbf{\Lambda }}}}}_{n}^{W},\,\,W\in \{X,Y,Z\}.$$

Horizontal Step. For m = 1 to M and \(n\in {{{\mathcal{N}}}}(m)\), compute

$${{{{\mathbf{\Delta }}}}}_{m\to n}={(-1)}^{{{{{\bf{z}}}}}_{m}} \mathop\boxplus\limits_ {n^{\prime} \in {{{\mathcal{N}}}}(m)\setminus n}{\lambda }_{{{{{\bf{S}}}}}_{mn^{\prime} }}({{{{\mathbf{\Gamma }}}}}_{n^{\prime} \to m}).$$
(8)

Vertical Step. For n = 1 to N and W ∈ {X, Y, Z}, compute

$${{{{\mathbf{\Gamma }}}}}_{n}^{W}={{{{\mathbf{\Lambda }}}}}_{n}^{W}+\frac{1}{\alpha }\mathop{\sum}\limits_{{m\in {{{\mathcal{M}}}}(n)}\atop{\langle W,{{{{\bf{S}}}}}_{mn}\rangle =1}}{{{{\mathbf{\Delta }}}}}_{m\to n}.$$
(9)
  • (Hard Decision.) Let \(\hat{{{{\bf{E}}}}}=({\hat{{{{\bf{E}}}}}}_{1},{\hat{{{{\bf{E}}}}}}_{2},\ldots ,{\hat{{{{\bf{E}}}}}}_{N})\), where \({\hat{{{{\bf{E}}}}}}_{n}=I\) if \({{{{\mathbf{\Gamma }}}}}_{n}^{W} \,>\, 0\) for all W ∈ {X, Y, Z}, and \({\hat{{{{\bf{E}}}}}}_{n}=\arg \mathop{\min }\limits_{W\in \{X,Y,Z\}}{{{{\mathbf{\Gamma }}}}}_{n}^{W}\) otherwise.

  • If \(\langle \hat{{{{\bf{E}}}}},{{{{\bf{S}}}}}_{m}\rangle ={{{{\bf{z}}}}}_{m}\,\forall \,m\), halt and return “CONVERGE”;

  • Otherwise, if the maximum number of iterations \({T}_{\max }\) is reached, halt and return “FAIL”;

  • (Fixed Inhibition.) Otherwise, for n = 1 to N, \(m\in {{{\mathcal{M}}}}(n)\), and W ∈ {X, Y, Z}, compute

    $${{{{\mathbf{\Gamma }}}}}_{n\to m}^{W}={{{{\mathbf{\Gamma }}}}}_{n}^{W}-\,\langle W,{{{{\bf{S}}}}}_{mn}\rangle {{{{\mathbf{\Delta }}}}}_{m\to n}.$$
    (10)
  • Repeat from the horizontal step.

Motivated by the energy topology of a degenerate quantum code and gradient descent energy optimization, we propose MBP4 in Algorithm 1. MBP4 has variable-to-check messages \({\lambda }_{{{{{\bf{S}}}}}_{mn}}({{{{\mathbf{\Gamma }}}}}_{n\to m})\) and check-to-variable messages \({{{{\mathbf{\Delta }}}}}_{m\to n}\), similar to the log-domain BP in ref. 30; however, the message \({{{{\mathbf{\Delta }}}}}_{m\to n}\) is used differently when generating \({{{{\mathbf{\Gamma }}}}}_{n\to m}^{W}\) in Eq. (10). In particular, we can rewrite Eq. (10) as

$$\begin{array}{ll}{{{{\mathbf{\Gamma}}}}}_{n\to m}^{W}={{{{\mathbf{\Lambda }}}}}_{n}^{W}+\left(\frac{1}{\alpha }\mathop{\sum}\nolimits_{{m^{\prime} \in {{{\mathcal{M}}}}(n)}\atop{\langle W,{{{{\bf{S}}}}}_{m^{\prime} n}\rangle =1}}{{{{\mathbf{\Delta }}}}}_{m^{\prime} \to n}\right)\\ \qquad\quad-\langle W,{{{{\bf{S}}}}}_{mn}\rangle {{{{\mathbf{\Delta }}}}}_{m\to n}.\end{array}$$
(11)

The term \(-\langle W,{{{{\bf{S}}}}}_{mn}\rangle {{{{\mathbf{\Delta }}}}}_{m\to n}\) is called the inhibition, which provides adequate strength to resist wrong beliefs looped in the short cycles. Unlike refs. 28,30, where the corresponding inhibition is also scaled by 1/α, we suggest keeping this inhibition strength as in Eq. (11), since this part is the belief inherited in check node m, and it must remain unchanged when we update the belief in variable n to make the decoding less affected by the short cycles. Consequently, this introduces additional memory effects in MBP. How to choose the factor α is an intriguing question; see ref. 48 for more discussion. For reference, MBP4 can also be defined in the linear domain.
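Putting Eqs. (8)–(11) together, a compact, runnable sketch of Algorithm 1 with the parallel schedule follows, under the integer Pauli encoding used in the earlier sketches (0, 1, 2, 3 for I, X, Y, Z); the helpers are repeated so the block is self-contained. This is our own illustrative transcription, not the authors' implementation, and it omits the efficiency simplification of Remark 1 below.

```python
import numpy as np

def _lam(g, W):                 # Eq. (6); W in {1,2,3} for X,Y,Z
    e = np.exp(-np.asarray(g, dtype=float))
    return np.log((1.0 + e[W - 1]) / (e.sum() - e[W - 1]))

def _boxplus(vals):             # Eq. (7), with a numerical guard for arctanh
    p = np.prod(np.tanh(np.asarray(vals, dtype=float) / 2.0))
    return 2.0 * np.arctanh(np.clip(p, -1.0 + 1e-12, 1.0 - 1e-12))

def _anti(a, b):                # <a,b> for single-qubit Paulis (0=I,1=X,2=Y,3=Z)
    return int(a != 0 and b != 0 and a != b)

def mbp4(S, z, T_max, alpha, Lambda):
    """Parallel-schedule MBP4. S: (M,N) integer check matrix; z: (M,) bits;
    Lambda: (N,3) initial LLRs for W = X, Y, Z."""
    M, N = S.shape
    N_of = [np.flatnonzero(S[m]) for m in range(M)]     # N(m)
    M_of = [np.flatnonzero(S[:, n]) for n in range(N)]  # M(n)
    # Initialization: Gamma_{n->m}^W = Lambda_n^W.
    Gamma_nm = {(n, m): Lambda[n].astype(float)
                for n in range(N) for m in M_of[n]}
    E_hat = np.zeros(N, dtype=int)
    for _ in range(T_max):
        # Horizontal step, Eq. (8).
        Delta = {(m, n): (-1) ** z[m] * _boxplus(
                     [_lam(Gamma_nm[(n2, m)], S[m, n2])
                      for n2 in N_of[m] if n2 != n])
                 for m in range(M) for n in N_of[m]}
        # Vertical step, Eq. (9): the sum of Delta messages is scaled by 1/alpha.
        Gamma = np.array([[Lambda[n, W - 1]
                           + sum(Delta[(m, n)] for m in M_of[n]
                                 if _anti(W, S[m, n])) / alpha
                           for W in (1, 2, 3)] for n in range(N)])
        # Hard decision: I if all three LLRs are positive, else the minimizer.
        E_hat = np.where(np.all(Gamma > 0, axis=1), 0,
                         np.argmin(Gamma, axis=1) + 1)
        # Syndrome check: <E_hat, S_m> = z_m for all m?
        if all(sum(_anti(E_hat[n], S[m, n]) for n in N_of[m]) % 2 == z[m]
               for m in range(M)):
            return "CONVERGE", E_hat
        # Fixed inhibition, Eq. (10): this term is NOT scaled by 1/alpha.
        for n in range(N):
            for m in M_of[n]:
                for W in (1, 2, 3):
                    Gamma_nm[(n, m)][W - 1] = (Gamma[n, W - 1]
                                               - _anti(W, S[m, n]) * Delta[(m, n)])
    return "FAIL", E_hat
```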

Remark 1

In Algorithm 1, one can verify that

$${\lambda }_{{{{{\bf{S}}}}}_{mn}}({{{{\mathbf{\Gamma }}}}}_{n\to m})={\lambda }_{{{{{\bf{S}}}}}_{mn}}({{{{\mathbf{\Gamma }}}}}_{n})-{{{{\mathbf{\Delta }}}}}_{m\to n}.$$
(12)

It is more efficient to update \({\lambda }_{{{{{\bf{S}}}}}_{mn}}({{{{\mathbf{\Gamma }}}}}_{n\to m})\) in this way: for each n, computing \({\lambda }_{{{{{\bf{S}}}}}_{mn}}({{{{\mathbf{\Gamma }}}}}_{n})\) needs at most three computations of \({\lambda }_{{{{{\bf{S}}}}}_{mn}}(\cdot )\) for \({{{{\bf{S}}}}}_{mn}\in \{X,Y,Z\}\); otherwise, directly computing \({\lambda }_{{{{{\bf{S}}}}}_{mn}}({{{{\mathbf{\Gamma }}}}}_{n\to m})\) needs \(| {{{\mathcal{M}}}}(n)|\) (usually ≥ 3) computations of \({\lambda }_{{{{{\bf{S}}}}}_{mn}}(\cdot )\). On the other hand, the computation in the horizontal step can be simplified as in Remarks 1 and 4 of ref. 30. Then the MBP4 complexity is proportional to the number of edges Nj per iteration, and thus the overall complexity is \(O(Nj{T}_{\max })\) or \(O(Nj\log \log N)\).

Adaptive MBP

Herein we propose a variation of MBP4 with α chosen adaptively, as shown in Algorithm 2. The value of α controls the search radius of MBP4. Typically, a fixed α is chosen so that BP focuses on an error-correction region between 1 × BDD and 2 × BDD. For highly-degenerate codes, we intend to correct errors in a much wider region, and thus we need to consider variations in α. More specifically, α should be chosen according to the given syndrome vector. Precisely determining the required value of α helps to achieve the desired performance, but in general it is difficult to do so. Generating a solution by referring to multiple instances of the decoder is an important technique in Monte Carlo sampling methods (cf. parallel tempering in ref. 62), as well as in neural networks (cf. Fig. 4 of ref. 68). This is like using an ε-net of α. Thus we run multiple instances of MBP4 with different values of α and choose a valid (syndrome-matched) solution with the largest (most conservative) value of α. This value of α is adaptively chosen and denoted by \({\alpha }^{* }\), so Algorithm 2 is referred to as AMBP4. For this algorithm, the prefix parallel/serial is used to indicate the schedule type of the oracle function MBP4.

Note that in Algorithm 2, each value of \({\alpha }_{i}\) is tested sequentially; these \({\alpha }_{i}\)’s can be tested in parallel if the physical resources for implementation are available.

Algorithm 2:

Adaptive MBP4 (AMBP4)

Input: \({{{\bf{S}}}}\in {\{I,X,Y,Z\}}^{M\times N}\), \({{{\bf{z}}}}\in {\{0,1\}}^{M}\), \({T}_{\max }\in {{\mathbb{Z}}}_{+}\), \({\{{{{{\mathbf{\Lambda }}}}}_{n} = ({{{{\mathbf{\Lambda }}}}}_{n}^{X},{{{{\mathbf{\Lambda }}}}}_{n}^{Y},{{{{\mathbf{\Lambda }}}}}_{n}^{Z})\in {{\mathbb{R}}}^{3}\}}_{n = 1}^{N}\), a sequence of real values \({\alpha }_{1} > {\alpha }_{2} > \cdots > {\alpha }_{l} > 0\), and an oracle function MBP4.

Initialization: Let i = 1.

MBP Step: Run MBP\({}_{4}({{{\bf{S}}}},\,{{{\bf{z}}}},\,{T}_{\max },\,{\alpha }_{i},\,\{{{{{\mathbf{\Lambda }}}}}_{n}\})\), which will return “CONVERGE” or “FAIL” with estimated \(\hat{{{{\bf{E}}}}}\in {\{I,X,Y,Z\}}^{N}\).

Adaptive Check:

  • If the return indicator is “CONVERGE”, then return “SUCCESS” (with valid \(\hat{{{{\bf{E}}}}}\) and \({\alpha }^{* }={\alpha }_{i}\));

  • Let i ← i + 1. If i > l, return “FAIL” (with invalid \(\hat{{{{\bf{E}}}}}\));

  • Otherwise, repeat from the MBP Step.
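A minimal sketch of Algorithm 2 follows, assuming the mbp4 function from the sketch in the previous subsection; the example α-net mirrors the one used for the surface codes.

```python
import numpy as np

def ambp4(S, z, T_max, alphas, Lambda):
    """Algorithm 2: try MBP4 with alpha_1 > alpha_2 > ... > alpha_l and return
    the first syndrome-matched estimate, i.e., the most conservative alpha*."""
    E_hat = None
    for alpha in alphas:              # sequential here; parallel is possible
        flag, E_hat = mbp4(S, z, T_max, alpha, Lambda)
        if flag == "CONVERGE":
            return "SUCCESS", E_hat, alpha   # alpha* = alpha
    return "FAIL", E_hat, None

# Example net used for the surface codes: alpha* in {1.0, 0.99, ..., 0.5}.
alphas = np.arange(1.0, 0.495, -0.01)
```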