Security of quantum key distribution from generalised entropy accumulation

Metger, Tony; Renner, Renato

doi:10.1038/s41467-023-40920-8

Article
Open access
Published: 29 August 2023

Nature Communications volume 14, Article number: 5272 (2023)

Abstract

The goal of quantum key distribution (QKD) is to establish a secure key between two parties connected by an insecure quantum channel. To use a QKD protocol in practice, one has to prove that a finite size key is secure against general attacks: no matter the adversary’s attack, they cannot gain useful information about the key. A much simpler task is to prove security against collective attacks, where the adversary is assumed to behave identically and independently in each round. In this work, we provide a formal framework for general QKD protocols and show that for any protocol that can be expressed in this framework, security against general attacks reduces to security against collective attacks, which in turn reduces to a numerical computation. Our proof relies on a recently developed information-theoretic tool called generalised entropy accumulation and can handle generic prepare-and-measure protocols directly without switching to an entanglement-based version.

Introduction

Quantum key distribution (QKD) considers the following scenario: two parties, Alice and Bob, can communicate via an insecure quantum channel and an authenticated classical channel. An insecure quantum channel allows the adversary to intercept and tamper with any quantum state sent across the channel; an authenticated classical channel is one where an adversary can read every message sent across the channel, but cannot impersonate either party; for example, the adversary cannot convince Bob that a certain message was sent by Alice when in fact, it was not. Using these resources, Alice and Bob would like to establish a secure shared key, i.e. a piece of information that is known to both of them, but entirely unknown to an adversary Eve^1,2.

The key difficulty in establishing the security of a QKD protocol is that one has to take into consideration any possible attack that the adversary Eve may perform. For example, in one round of the protocol, Eve may gather a piece of quantum side information about the quantum state sent via the insecure channel. This piece of side information could be combined with side information from previous rounds to plan Eve’s attack for the next round, resulting in a very complicated multi-round attack. Additionally, Alice and Bob can only execute a certain finite number of rounds, introducing statistical finite-size effects. A security proof that takes both of these challenges into account is called a finite-size security proof against general attacks (also referred to as coherent attacks)^3,4,5. Such a proof is required to safely deploy a QKD protocol in practice.

Due to the difficulty of proving finite-size security against general attacks, many protocols are first analysed for collective attacks, for which very general numerical techniques have been developed (see e.g. refs. ^{6,7,8,9,10,11,12,13,14}). A security proof against collective attacks assumes that Alice and Bob execute infinitely many rounds of the protocol and that Eve behaves independently and identically in each round. This is also called the i.i.d. asymptotic setting. These assumptions are of course unrealistic, but a collective attack proof is a useful theoretical tool as it can often be converted into a finite-size proof against general attacks; this is called a reduction to i.i.d.

There are a number of existing techniques for performing such a reduction to i.i.d. These techniques are very powerful, but typically require additional assumptions on the protocol and can significantly lower the amount of key that can be extracted compared to the collective attack scenario. The most widely used ones are either based on the quantum de Finetti theorem¹⁵ (and the related post-selection technique¹⁶) or the entropy accumulation theorem (EAT)¹⁷.

The quantum de Finetti theorem and related methods such as the post-selection technique rely on the permutation-symmetry between different rounds of the protocol to reduce general to collective attacks. While not every protocol possesses this permutation symmetry naturally, it can usually be enforced by including an additional “symmetrisation step” in the protocol. The main downside of these techniques is that the bounds they achieve scale unfavourably with the dimension of the underlying Hilbert space, i.e. the Hilbert space that contains the states sent from Alice to Bob. This means that these techniques only yield useful bounds for protocols with a small Hilbert space dimension, e.g. the BB84 or B92 protocols^1,18. However, practical implementations of QKD protocols do not always satisfy this requirement; for example, many protocols use laser pulses as the means by which Alice sends a quantum state to Bob^19,20, and such laser pulses are described in a Fock space whose dimension is in principle unbounded. While methods for truncating the Fock space have been developed²¹, this introduces additional complications and may lead to weak bounds if the dimension of the truncated Fock space remains large.

In contrast, the EAT provides bounds that do not depend on the dimension of the underlying Hilbert space. This dimension-independence of the second-order terms means that the EAT can also be used to prove security for device-independent or semi-device-independent protocols²².

The main downside of the EAT for security proofs is that it requires new side information to be output in a round-by-round manner subject to a Markov condition between rounds, and once side information has been output it cannot be updated anymore. In general, it is not possible to model the way that Eve actively intercepts quantum states and updates her side information in a prepare-and-measure protocol by a process that outputs side information in a round-by-round manner subject to the Markov condition. As a consequence, the EAT cannot “naturally” deal with general prepare-and-measure protocols. Instead, one first has to convert a prepare-and-measure protocol into an entanglement-based protocol. This can be done as follows: if Alice prepares one of a set of pure states $\{|\psi^{j}\rangle_{Q}\}_{j}$ with probability p(j) and stores the index j specifying the state in her register A, we can replace this by Alice preparing a state $|\tilde{\psi}\rangle_{AQ}=\sum_{j}\sqrt{p(j)}\,|j\rangle_{A}|\psi^{j}\rangle_{Q}$ and later measuring her system A. Then, we can model Eve’s attack by replacing this state $|\tilde{\psi}\rangle_{AQ}$ by an arbitrary state $|\hat{\psi}\rangle_{AQE}$ prepared by Eve, subject to the constraint that Alice’s marginal, which Eve cannot access in the prepare-and-measure protocol, is “correct”, i.e. $\tilde{\psi}_{A}=\hat{\psi}_{A}$. This additional constraint is an artificial one in the sense that it is not something that Alice and Bob check in the actual protocol, and it is unclear how it can be incorporated into a security proof using the EAT in a natural way. As a result, it appears difficult or impossible to use the EAT to obtain reasonable finite-size key rates for prepare-and-measure protocols except in very simple cases.
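The source-replacement step described above can be checked numerically. The following is a minimal sketch, assuming (purely for illustration) that the ensemble consists of the four BB84 states with uniform p(j) = 1/4; the helper name `purified_source` is hypothetical. It builds the purified state, confirms that measuring A reproduces the original ensemble, and computes the marginal on A that Eve's replacement state would have to match.

```python
import numpy as np

# Illustrative ensemble (an assumption of this sketch): the four BB84 states.
ket0 = np.array([1.0, 0.0])
ket1 = np.array([0.0, 1.0])
states = [ket0, ket1, (ket0 + ket1) / np.sqrt(2), (ket0 - ket1) / np.sqrt(2)]
p = np.full(4, 0.25)

def purified_source(states, p):
    """Return sum_j sqrt(p(j)) |j>_A |psi^j>_Q as an array of shape (dim_A, dim_Q)."""
    return np.stack([np.sqrt(pj) * s for pj, s in zip(p, states)])

psi_AQ = purified_source(states, p)  # row j holds sqrt(p(j)) * psi^j

# Measuring A in the computational basis yields j with probability p(j)
# and leaves Q in the state |psi^j>.
probs = np.sum(np.abs(psi_AQ) ** 2, axis=1)
post_states = psi_AQ / np.sqrt(probs)[:, None]

# Alice's marginal on A: entries sqrt(p(j) p(k)) <psi^k|psi^j>.
rho_A = psi_AQ @ psi_AQ.conj().T
```

The off-diagonal entries of `rho_A` encode the overlaps between the prepared states, which is why the marginal constraint is non-trivial whenever the states are not mutually orthogonal.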

In addition to these general techniques for reducing security against general attacks to security against collective attacks, there are also more specialised techniques that directly prove security against general attacks without an explicit reduction to collective attacks. Perhaps the most common of these are phase-error correction and entropic uncertainty techniques, both of which use the complementarity of different measurements in the protocol as the starting point for a security proof (see e.g., refs. ^{23,24,25,26,27,28}). These security proofs usually give very tight bounds for “symmetric” protocols (i.e. protocols relying on mutually unbiased measurement bases, even though these bases need not be chosen with equal probability) where they can be applied naturally, and can also be extended to symmetric protocols with experimental imperfections that slightly break the symmetry, e.g. using the reference state technique^29,30. In addition, various other proof techniques that use the symmetry of specific protocols have been developed (see e.g. refs. ^31,32,33).

In this work, we show that security against collective attacks implies finite-size security against general attacks for a broad class of protocols. The main feature of our security proof is its generality: while many existing security proofs work well for particular protocols, our approach works for any generic protocol satisfying a few structural assumptions. Furthermore, it provides a natural way of proving security against general attacks, with the proof being in close correspondence to the structure of the original protocol, whereas previous techniques often required the protocol to be transformed into a theoretically equivalent one to fit into the framework of a particular proof technique. In particular, our technique can be applied directly to prepare-and-measure protocols without transforming them into an entanglement-based version. As a sample application, we show that a direct application of our general framework yields the first asymptotically tight finite-size security proof against general attacks for the B92 protocol. Importantly, our technique provides bounds that are independent of the dimension of the underlying Hilbert space; instead, the bound depends only on the number of possible classical outputs that Alice and Bob may receive. This is particularly relevant for photonic QKD protocols, where the underlying Hilbert space is a Fock space with unbounded dimension^34,35, and is also useful for (semi-)device-independent protocols. For our security proof, we employ the generalised entropy accumulation theorem (GEAT), a recent information-theoretic result³⁶ that resembles the EAT discussed above, but allows a more flexible model of side information; this enables us to circumvent many of the difficulties in applying the EAT and deal with prepare-and-measure protocols directly, while retaining the advantages of the EAT, most importantly its dimension-independence.

Results

Framework for prepare-and-measure protocols

Our main result, Theorem 4, shows that for a broad class of prepare-and-measure protocols, security against collective attacks implies security against general attacks. To make this result easy to use, we phrase it as a security statement for a general “template protocol”; many existing prepare-and-measure protocols can be viewed as an instance of this template protocol, and their security then follows from the security of the general template protocol. For protocols that do not fit exactly into this template, the security proof can usually be adapted easily from our proof of Theorem 4.

Our template protocol is described formally in Box 1; here, we make a few additional remarks regarding this general protocol, using the notation introduced in Box 1. Firstly, without loss of generality, we can assume that the cq-state ψ_UQ is of the form ${\psi }_{UQ}={\sum }_{u}p(u)\left|u\right\rangle \left\langle u\right|\otimes \left|\psi \right\rangle {\left\langle \psi \right|}_{Q| u}$ for a probability distribution p(u) and pure states $\left|\psi \right\rangle {\left\langle \psi \right|}_{Q| u}$. This means that Alice chooses a value u according to p(u) and then sends the pure state $\left|\psi \right\rangle {\left\langle \psi \right|}_{Q| u}$ to Bob. The reason that we can assume that $\left|\psi \right\rangle {\left\langle \psi \right|}_{Q| u}$ is pure is that if Alice wanted to send a mixed state, she could express that mixed state as a mixture of pure states, send one of those pure states, and later “forget” which of the pure states she sent as part of the map RK.

Secondly, in the protocol in Box 1, Bob measures a POVM {N^(v)} with outcomes $v\in\mathcal{V}$. More commonly, we think of Bob as choosing an input y according to some distribution q(y) and receiving an output $b\in\mathcal{B}$. This can be described by a collection of POVMs $\{\tilde{N}_{y}^{(b)}\}_{b\in\mathcal{B}}$, one for each possible input y. For example, Bob might choose uniformly at random whether to measure a qubit in the computational or Hadamard basis. In that case, y would be the basis choice, and for each y, $\{\tilde{N}_{y}^{(b)}\}_{b\in\mathcal{B}}$ is the measurement in the chosen basis. However, since Bob’s measurements are trusted, the distinction between inputs and outputs is unnecessary: we can convert a set of POVMs $\{\tilde{N}_{y}^{(b)}\}_{b\in\mathcal{B}}$ with an input distribution q(y) into an equivalent single POVM $\{N^{(v)}\}_{v\in\mathcal{V}}$ by choosing $\mathcal{V}=\mathcal{Y}\times\mathcal{B}$ and $N^{(y,b)}=q(y)\tilde{N}_{y}^{(b)}$. This satisfies the required property of a POVM:

$$\begin{array}{r}\mathop{\sum}\limits_{y,b}{N}^{(y,b)}=\mathop{\sum}\limits_{y}q(y)\mathop{\sum}\limits_{b}{\tilde{N}}_{y}^{(b)}=\mathop{\sum}\limits_{y}q(y){\mathbb{1}}={\mathbb{1}},\end{array}$$

(1)

where the first equality holds by the definition of N^(y, b), the second uses the fact that $\{\tilde{N}_{y}^{(b)}\}_{b\in\mathcal{B}}$ is a POVM for each y, and the third uses the fact that q(y) is a probability distribution. One can think of N^(y, b) as first choosing $y\in\mathcal{Y}$ according to q(y) and then measuring $\{\tilde{N}_{y}^{(b)}\}$ on the state, providing (y, b) as output.
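The conversion from input-dependent POVMs to a single POVM can be sketched in a few lines. This is an illustrative example only: the function names `merge_povms` and `is_povm` are hypothetical, and the concrete measurements are the computational and Hadamard basis example mentioned above with uniform q(y).

```python
import numpy as np

def merge_povms(q, povms):
    """Combine POVMs indexed by input y and a distribution q(y) into a
    single POVM with elements N^(y,b) = q(y) * N~_y^(b)."""
    return {(y, b): q[y] * elem
            for y, povm in povms.items()
            for b, elem in povm.items()}

def is_povm(elements, atol=1e-12):
    """Check that every element is positive semidefinite and that the
    elements sum to the identity."""
    elems = list(elements.values())
    dim = elems[0].shape[0]
    positive = all(np.linalg.eigvalsh(e).min() >= -atol for e in elems)
    complete = np.allclose(sum(elems), np.eye(dim), atol=atol)
    return positive and complete

# Computational basis (y = 0) or Hadamard basis (y = 1), chosen uniformly.
H = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2)
proj = {b: np.outer(np.eye(2)[b], np.eye(2)[b]) for b in (0, 1)}
povms = {0: proj, 1: {b: H @ proj[b] @ H for b in (0, 1)}}
q = {0: 0.5, 1: 0.5}

N = merge_povms(q, povms)  # four elements, labelled by (y, b)
```

Calling `is_povm(N)` checks both positivity and the normalisation in Eq. (1) for this example.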

Thirdly, the function PD describes the total information exchanged during the public discussion (Step (2)) for one round i of the protocol. The details of how the public discussion takes place are of no concern to the protocol: in general, Alice and Bob may exchange multiple rounds of back-and-forth communication during this step, and PD describes the transcript of the entire exchange. For example, in a protocol that includes a sifting step, the public discussion would include the information necessary to decide which rounds to sift out; the actual sifting would occur in the raw key generation step, where Alice’s function RK can use the information from the public discussion to put a special symbol (e.g. ⊥) as the raw key for rounds that are sifted out.

Additionally, the protocol distinguishes between information I_i published during Step (2) and error correction information EC published during Step (4). The difference between these two steps is that I_i may only depend on the inputs U_i and V_i generated during the i-th round of measurements. This means that I_i is generated in a round-by-round manner and will enter in the single-round security statement (or collective attack bound, see “Definition 2”). In contrast, EC is global information of a fixed length λ_EC, i.e. it can depend arbitrarily on information generated during all rounds of the protocol, but to obtain a good key rate, λ_EC should be as short as possible. We note that the bound on the length of EC is needed in Supplementary Eq. (5), where we use it to remove the error correction information from the conditioning system; one can replace Supplementary Eq. (5) by a slightly more sophisticated chain rule that subtracts a (one-shot) mutual information between EC and Sⁿ. In that case, the protocol needs to specify an upper bound on this mutual information instead of the length λ_EC.

Finally, we note that in the protocol in Box 1, Alice and Bob first perform error correction, and afterwards Bob uses his error-corrected guess for Alice’s raw key for the purposes of the statistical check. An alternative that is commonly used in existing QKD protocols is that Alice and Bob publish part of their data in a separate parameter estimation step before the error correction step and use this public information to run a statistical check. Our protocol in Box 1 can easily be modified to include protocols of this form. For the modified protocol, the security proof stays exactly the same, except that the reduction from Theorem 4 to Claim 10 now follows almost trivially and does not need the argument from Supplementary Note A.

Example: BB84 protocol as an instance of Box 1. To gain further intuition for the protocol in Box 1, we describe how to reproduce the well-known BB84 protocol as an instance of our general protocol in Box 1. In the BB84 protocol, Alice sends a random state from the set $\{\left|0\right\rangle,\left|1\right\rangle,\left|+\right\rangle,\left|-\right\rangle \}$, where $\left|\pm \right\rangle=\frac{\left|0\right\rangle \pm \left|1\right\rangle }{\sqrt{2}}$ are the Hadamard basis states. As her information U_i, Alice records which state she sent, i.e. she records the basis x ∈ {0, 1} and the value a ∈ {0, 1}. Hence, for the BB84 protocol,

$$\begin{array}{r}{\psi }_{UQ}=\frac{1}{4}\mathop{\sum}\limits_{x,a\in \{0,1\}}\left|x,a\right\rangle {\left\langle x,a\right|}_{U}\otimes {H}^{x}\left|a\right\rangle {\left\langle a\right|}_{Q}{H}^{x},\end{array}$$

(2)

where H is the Hadamard gate and H⁰ = id, H¹ = H. Bob’s measurements output a basis choice y ∈ {0, 1} and the outcome b of a single-qubit measurement in that basis (with y = 0 corresponding to the computational and y = 1 to the Hadamard basis). Therefore, his measurements are described by a POVM on system Q consisting of elements

$$\begin{array}{r}{N}^{(y,b)}=\frac{1}{2}{H}^{y}\left|b\right\rangle \left\langle b\right|{H}^{y}.\end{array}$$

(3)

During the public discussion phase, Alice and Bob publish their basis choices x_i and y_i for each of the rounds. Therefore, for U_i = (x_i, a_i) and V_i = (y_i, b_i),

$$\begin{array}{r}{I}_{i}={{{{{{{\rm{PD}}}}}}}}({U}_{i},{V}_{i})=({x}_{i},{y}_{i}).\end{array}$$

(4)

To generate her raw key, for each round Alice checks whether the basis choices x_i and y_i are the same: if so, she uses her recorded value a_i for the raw key, and otherwise she discards that round. Formally,

$$\begin{array}{r}{S}_{i}={{{{{{{\rm{RK}}}}}}}}({U}_{i},{I}_{i})={{{{{{{\rm{RK}}}}}}}}(({x}_{i},{a}_{i}),({x}_{i},{y}_{i}))=\left\{\begin{array}{ll}{a}_{i}\quad &{{{{{{{\rm{if}}}}}}}}\,{x}_{i}={y}_{i},\\ \perp \quad &{{{{{{{\rm{otherwise}}}}}}}}.\end{array}\right.\end{array}$$

(5)

Finally, for the statistical check in Step (6), Bob checks whether his guess ${\hat{S}}^{n}$ for Alice’s string matches his own raw data. In fact, Bob can only do this check on a small subset of indices i. The reason is that for our definition of collective attack bounds (“Definition 2”) and the security proof (Theorem 4), we are bounding the entropy conditioned on the systems Cⁿ, i.e. we are essentially assuming that all of the statistical information gets leaked to Eve. Hence, Bob chooses a value T_i at random with $\Pr \left[{T}_{i}=1\right]=\gamma$ (where γ is the testing probability, and the choice of T_i can formally be included into V_i), and then sets

$${\hat{C}}_{i}={{{{{{{\rm{EV}}}}}}}}({V}_{i},{I}_{i},{\hat{S}}_{i})={{{{{{{\rm{EV}}}}}}}}(({y}_{i},{b}_{i}),({x}_{i},{y}_{i}),{\hat{S}}_{i})$$

(6)

$$=\left\{\begin{array}{ll}\perp \quad &{{{{{{{\rm{if}}}}}}}}\,{x}_{i}\, \ne \, {y}_{i}\,{{{{{{{\rm{or}}}}}}}}\,{T}_{i}=0,\hfill \\ 1\quad &{{{{{{{\rm{if}}}}}}}}\,{x}_{i}={y}_{i},{T}_{i}=1,\, {{{{{{{\rm{and}}}}}}}}\,{b}_{i}={\hat{S}}_{i},\\ 0\quad &{{{{{{{\rm{otherwise}}}}}}}}.\hfill\end{array}\right.$$

(7)

Intuitively, ⊥ denotes that no useful check can be performed in this round, “1” means the check has passed, and “0” means the check has failed.
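The three classical functions of the BB84 instance, Eqs. (4) to (7), translate directly into code. The sketch below is illustrative: it represents ⊥ by Python's `None` and, following the remark that T_i can be included in V_i, takes V_i to be the triple (y_i, b_i, T_i). Both are representation choices made here, not prescribed by the text.

```python
BOT = None  # stands in for the special symbol ⊥

def PD(u, v):
    """Public discussion, Eq. (4): publish the basis choices (x, y)."""
    x, _a = u
    y, _b, _t = v
    return (x, y)

def RK(u, i):
    """Raw key generation, Eq. (5): keep a if the bases agree, else ⊥."""
    _x, a = u
    x_pub, y_pub = i
    return a if x_pub == y_pub else BOT

def EV(v, i, s_hat):
    """Round evaluation, Eqs. (6)-(7): ⊥ if the round is sifted out or not
    tested, 1 if Bob's outcome matches his guess for Alice's bit, else 0."""
    _y, b, t = v
    x_pub, y_pub = i
    if x_pub != y_pub or t == 0:
        return BOT
    return 1 if b == s_hat else 0

# One tested round with matching bases and no error.
u, v = (0, 1), (0, 1, 1)
info = PD(u, v)
raw = RK(u, info)
label = EV(v, info, raw)
```

In this round the bases agree, so `raw` equals Alice's bit and the check passes; changing Bob's basis to 1 would make both `raw` and `label` equal to `BOT`.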

Box 1 General prepare-and-measure QKD protocol

Protocol arguments:

$n\in\mathbb{N}$ : number of rounds.

ψ_UQ : quantum state prepared by Alice, where U is classical with alphabet $\mathcal{U}$ and Q is quantum.

$\{N^{(v)}\}_{v\in\mathcal{V}}$ : POVM acting on Hilbert space $\mathcal{H}_{Q}$ describing Bob’s trusted measurements (where $\mathcal{V}$ is some finite set of possible outcomes).

$\mathrm{PD}:\mathcal{U}\times\mathcal{V}\to\mathcal{I}$ : function describing transcript of public discussion (where $\mathcal{I}$ is some finite alphabet).

$\mathrm{RK}:\mathcal{U}\times\mathcal{I}\to\mathcal{S}$ : function describing Alice’s raw key generation (where $\mathcal{S}$ is the alphabet of the raw key).

$\mathrm{EV}:\mathcal{V}\times\mathcal{I}\times\mathcal{S}\to\mathcal{C}$ : function “evaluating” each round by assigning a label from the alphabet $\mathcal{C}$.

$\lambda_{\mathrm{EC}}\in\mathbb{N}_{0}$ : length of bit string exchanged during error correction step.

k_CA > 0 : required amount of single-round entropy generation.

ε_KV, ε_PA > 0 : tolerated errors during key validation and privacy amplification steps.

$\mathrm{CA}:\mathbb{P}(\mathcal{C})\to\mathbb{R}$ : affine function corresponding to collective attack bound.

$l\in\mathbb{N}$ : length of final key.

Protocol steps:

(1) Data generation: Alice prepares ${\psi }_{{U}^{n}{Q}^{n}}={\psi }_{UQ}^{\otimes n}$ and sequentially sends the systems Q₁, …, Q_n to Bob via a public quantum channel. For each i ∈ {1, …, n}, Bob measures ${\{{N}^{(v)}\}}_{v\in {{{{{{{\mathcal{V}}}}}}}}}$ on register Q_i and records the outcome in register V_i.

(2) Public discussion: for each i ∈ {1, …, n}, Alice and Bob publicly exchange information I_i = PD(U_i, V_i).

(3) Raw key generation: for each i ∈ {1, …, n}, Alice computes S_i = RK(U_i, I_i).

(4) Error correction: Alice and Bob publicly exchange information ${{{{{{{\rm{EC}}}}}}}}\in {\{0,1\}}^{{\lambda }_{{{{{{{{\rm{EC}}}}}}}}}}$, which can depend on Uⁿ, Vⁿ, and Iⁿ. Bob computes ${\hat{S}}^{n}({{{{{{{\rm{EC}}}}}}}},{V}^{n},{I}^{n})\in {{{{{{{{\mathcal{S}}}}}}}}}^{n}$.

(5) Raw key validation: Alice chooses a function ${{{{{{{\rm{HASH}}}}}}}}:{{{{{{{{\mathcal{S}}}}}}}}}^{n}\to {\{0,1\}}^{\lceil \log (1/{\varepsilon }_{{{{{{{{\rm{KV}}}}}}}}})\rceil }$ from a universal hash family ${{{{{{{\mathcal{F}}}}}}}}$ (Definition 5) according to the associated probability distribution ${P}_{{{{{{{{\mathcal{F}}}}}}}}}$ and publishes a description of HASH and the value HASH(Sⁿ). Bob computes ${{{{{{{\rm{HASH}}}}}}}}({\hat{S}}^{n})$ and aborts the protocol if ${{{{{{{\rm{HASH}}}}}}}}({S}^{n})\, \ne \, {{{{{{{\rm{HASH}}}}}}}}({\hat{S}}^{n})$.

(6) Statistical check: for each i ∈ {1, …, n}, Bob sets $\hat{C}_{i}=\mathrm{EV}(V_{i},I_{i},\hat{S}_{i})$. Bob then computes $k=\mathrm{CA}(\mathsf{freq}(\hat{C}^{n}))$. If k < k_CA, he aborts the protocol.

(7) Privacy amplification: Alice and Bob convert their registers Sⁿ and ${\hat{S}}^{n}$ to a binary representation, obtaining strings of length m. Alice chooses a seed μ ∈ {0, 1}^m uniformly at random and publishes her choice. Alice and Bob compute l-bit strings K = EXT(Sⁿ, μ) and $\hat{K}={{{{{{{\rm{EXT}}}}}}}}({\hat{S}}^{n},\mu )$, respectively, where EXT: {0, 1}^m × {0, 1}^m → {0, 1}^l is a quantum-proof strong $(l+\lceil 2\log (1/{\varepsilon }_{{{{{{{{\rm{PA}}}}}}}}})\rceil,{\varepsilon }_{{{{{{{{\rm{PA}}}}}}}}})$-extractor (Definition 6).
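The statistical check in Step (6) is a purely classical computation, and a minimal sketch may help fix ideas. The helper names below are hypothetical, and the affine map used in the example has placeholder coefficients chosen only to exercise the code; it is not a collective attack bound for any actual protocol.

```python
from collections import Counter

def freq(labels):
    """Empirical distribution of the labels C_1, ..., C_n."""
    n = len(labels)
    return {c: count / n for c, count in Counter(labels).items()}

def make_affine(weights, offset=0.0):
    """Affine map p -> offset + sum_c weights[c] * p(c) on distributions
    over the label alphabet."""
    def ca(p):
        return offset + sum(weights[c] * prob for c, prob in p.items())
    return ca

def statistical_check(labels, ca, k_ca):
    """Step (6): compute k = CA(freq(labels)) and accept iff k >= k_CA."""
    k = ca(freq(labels))
    return k >= k_ca, k

# Placeholder data and coefficients (assumptions of this sketch).
labels = [1, 1, 1, 0, None, None, None, None]  # None stands in for ⊥
ca = make_affine({1: 1.0, 0: -1.0, None: 0.0})
accepted, k = statistical_check(labels, ca, k_ca=0.2)
```

Because CA is affine in the observed frequencies, the check reduces to a weighted count of the labels, which is what makes it cheap for Bob to evaluate.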

Modelling Eve’s attack

In the protocol in Box 1, Eve can obtain information about the final key K in two ways: firstly, Eve can observe the classical information published by Alice and Bob during the protocol, e.g. the error correction information EC. In a security proof, this is easy to handle, as Alice and Bob have full control over what information they publish. Secondly, Eve can intercept the quantum systems Q_i sent from Alice to Bob in Step (1). This is much harder to analyse in a security proof as Eve can perform arbitrary operations on the systems Q_i and we need to bound the amount of information Eve can gain about Alice’s and Bob’s raw key from tampering with the systems Q_i without being detected. The set of actions Eve performs on the systems Q_i is called Eve’s attack.

In principle, Eve could collect all of the n systems Q₁, …, Q_n, perform an arbitrary quantum channel ${{{{{{{\mathcal{A}}}}}}}}:{Q}^{n}\to E{Q}^{n}$, and send the output on systems Qⁿ to Bob. The system E would be kept by Eve and would contain her (potentially quantum) side information about the final key.

To analyse the security of a prepare-and-measure protocol with the GEAT, we need to introduce an extra condition.

Condition 1

Eve can be in possession of only one of the systems Q_i at any given time.

Since Alice sends the systems Q₁, …, Q_n sequentially in Step (1), this additional condition means that Eve’s most general attack also takes a sequential form. More formally, with this condition, the most general attack Eve can perform is described by a sequence of maps $\mathcal{A}_{i}:E_{i-1}^{\prime}Q_{i}\to E_{i}^{\prime}Q_{i}$, where $E_{i}^{\prime}$ are arbitrary quantum systems that contain Eve’s side information after having intercepted the i-th system Q_i. (The system $E_{0}^{\prime}$ can be chosen to be trivial without loss of generality, but we will not need this for our security proof.)

In fact, it is easy for Alice and Bob to enforce Condition 1 by checking that system Q_i has arrived on Bob’s side before Q_{i+1} is sent. The downside of this simple strategy is that if Alice and Bob are far apart, it limits the number of signals that can be sent per unit time.

To circumvent this, Alice and Bob can agree on a “schedule” on which signals are transmitted, i.e. they decide when Alice will send out each signal, so Bob, being aware of its travel time without Eve’s interference, knows when to expect to receive it. Then, assuming that Eve cannot significantly speed up the transmission of signals, this would ensure that Condition 1 is satisfied without Alice having to wait for Bob’s confirmation to send the next signal (see Supplementary Fig. 1 for an illustration of this). Whether or not the assumption that Eve cannot significantly speed up the transmission of signals is realistic depends on the specific QKD setup: for example, if signals are transmitted from Alice to Bob through vacuum (e.g. in satellite-to-satellite QKD), they travel at the speed of light and cannot be sped up further by Eve, so Condition 1 can be enforced by sending signals on a pre-agreed schedule without issues.

On the other hand, if Alice and Bob exchange signals via a (very long) optical fibre, Eve could in principle extract the signal at the start of the fibre, transmit it through free space, and then re-insert it into the fibre on Bob’s side. Since the speed of light in a fibre is slower than in free space, this would enable Eve to have simultaneous access to a (relatively small) set of s sped-up signals, perform some attack involving this set of signals, and then feed the “first” of these signals to Bob in such a way that it arrives at the time expected by Bob; then, Eve could add the next sped-up signal to her set, apply another attack to that set of s signals, and so on. Such an attack would violate Condition 1, but it would go unnoticed by Alice and Bob since the signals do arrive at the expected times on Bob’s end.

Setting aside the question of how realistic it is for Eve to perform such an attack, this issue can be addressed by relaxing Condition 1 so that instead of requiring Eve to be in possession of only one signal at a time, we allow her to be in possession of s signals at a time. To prove security under this weakened condition, we can divide the signals into interleaved groups such that any two signals within a group are s rounds apart, use a standard chain rule for min-entropies (or Rényi entropies) to divide the total entropy into a sum of group-wise entropies, and simply apply our analysis at the level of these groups. Our proof then goes through essentially unchanged, although the resulting second-order terms in the key rate will depend on the allowed number s of signals available to Eve at a time. We explain this modification in more detail in Supplementary Note C and focus on the case where Condition 1 holds exactly in the main text for simplicity.
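One way to read the interleaving described above is that group r collects the rounds r, r + s, r + 2s, and so on. The following sketch implements that reading; the function name is hypothetical, and the exact grouping used in Supplementary Note C is not reproduced here, so this should be taken as an illustration rather than the authors' construction.

```python
def interleaved_groups(n, s):
    """Split the round indices 1..n into s groups so that consecutive
    indices within a group are exactly s rounds apart."""
    if s < 1:
        raise ValueError("s must be a positive integer")
    return [list(range(r, n + 1, s)) for r in range(1, s + 1)]

groups = interleaved_groups(7, 3)
# groups == [[1, 4, 7], [2, 5], [3, 6]]
```

With this grouping, any window of s consecutive rounds contains at most one signal from each group, which is the property the group-wise analysis appears to rely on.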

We have now seen how to model Eve’s general attack under Condition 1. In contrast to such general sequential attacks, collective attacks only allow Eve to perform the same independent attack in each round of the protocol. Hence, a collective attack can be modelled by a map ${{{{{{{\mathcal{A}}}}}}}}:Q\to EQ$, which Eve applies in each round of the protocol, so Eve’s full attack over n rounds is given by the tensor product map ${{{{{{{{\mathcal{A}}}}}}}}}^{\otimes n}:{Q}^{n}\to {E}^{n}{Q}^{n}$. Proving security against this restricted class of attacks is typically much easier than proving security against general attacks. However, we stress that, unlike Condition 1, the assumption that Eve performs only a collective attack cannot be enforced by Alice and Bob. Therefore, a security proof that only considers collective attacks is insufficient for practical applications.

Collective attack bounds

If one restricts Eve to performing collective attacks, it is known that in the limit n → ∞ of many rounds the key rate is given by a simple entropic expression that only involves quantities corresponding to a single round of the protocol³⁷. Note that the entropic expression for the key rate in ref. 37 already includes information leaked to Eve during the error correction step, assuming an optimal error-correcting protocol. Our Definition 2 does not include a term corresponding to this; instead, in Box 1 we assume that the error correction information has length at most λ_EC, which we can later subtract from the length of the final key that can be generated.

More formally, we can view a collective attack bound as a map that takes as input the statistics corresponding to a single round of the protocol and outputs a lower bound on a certain conditional entropy, which specifies how much key can safely be extracted from a state with those statistics.

Definition 2

(Collective attack bound for Box 1) Fix arguments ${\psi }_{UQ},{\{{N}^{(v)}\}}_{v\in {{{{{{{\mathcal{V}}}}}}}}},{{{{{{{\rm{PD}}}}}}}},\, {{{{{{{\rm{RK}}}}}}}}$, and EV for the protocol in Box 1. Suppose that Alice and Bob run a single round (i.e. n = 1) of the protocol in Box 1 with these arguments up to (and including) Step (3). For a collective attack ${{{{{{{\mathcal{A}}}}}}}}:Q\to QE$, denote the state at the end of Step (3) as ν_UVSIE. Let ν_UVSIEC be an extension of this state, where C = EV(V, I, S). A collective attack bound (for the choice of parameters fixed above) is a map ${{{{{{{\rm{CA}}}}}}}}:{\mathbb{P}}({{{{{{{\mathcal{C}}}}}}}})\to {\mathbb{R}}$ such that for any collective attack ${{{{{{{\mathcal{A}}}}}}}}$, the state ν_UVSIEC (which depends on ${{{{{{{\mathcal{A}}}}}}}}$) satisfies

$$\begin{array}{r}H{(S| IEC)}_{\nu }\ge {{{{{{{\rm{CA}}}}}}}}({\nu }_{C}).\end{array}$$

(8)

Security against general attacks

Having introduced our framework for general prepare-and-measure protocols and collective attack bounds, we can now state the main technical result of this paper, namely that a collective attack bound implies a security statement against general attacks. For this, we first recall the security definition for QKD, namely the notions of correctness, secrecy, and completeness¹⁵. This security definition is composable, meaning that the key generated by a protocol satisfying this definition can safely be used for other protocols³⁸.

Definition 3

(Correctness, secrecy, and completeness) Consider a QKD protocol in which Alice and Bob can decide whether or not to abort the protocol. Let ${\rho }_{K\hat{K}E}$ be the final state at the end of the protocol (for a given initial state), where K and $\hat{K}$ are Alice’s and Bob’s version of the final key, respectively, and E contains all side information available to the adversary Eve at the end of the protocol. The protocol is called ε^cor-correct, ${\varepsilon }^{\sec }$-secret, and ε^comp-complete if the following holds:

(i)
Correctness. For any actions of the adversary Eve:
$$\begin{array}{r}\Pr \left[K\, \ne \, \hat{K}\wedge \,{{\mbox{not abort}}}\,\right]\le {\varepsilon }^{{{{{{{{\rm{cor}}}}}}}}}.\end{array}$$
(9)
(ii)
Secrecy. For any actions of the adversary Eve:
$$\begin{array}{r}{\left\|{\rho }_{KE\wedge {{\Omega }}}-{\tau }_{K}\otimes {\rho }_{E\wedge {{\Omega }}}\right\|}_{1}\le {\varepsilon }^{\sec },\end{array}$$
(10)
where τ_K is the maximally mixed state on system K, Ω is the event that the protocol does not abort, and ${\rho }_{\wedge {{\Omega }}}=\Pr \left[{{\Omega }}\right]{\rho }_{| {{\Omega }}}$ is the subnormalised state conditioned on Ω (see Methods Subsection “Notation” for details). Note that here and throughout the paper, we use the difference in trace norm, not the trace distance. The latter has an additional normalisation factor of $\frac{1}{2}$.
(iii)
Completeness. For a given noise model for the protocol there exists an honest behaviour for the adversary Eve such that
$$\begin{array}{r}\Pr \left[{{{{{{{\rm{abort}}}}}}}}\right]\le {\varepsilon }^{{{{{{{{\rm{comp}}}}}}}}}.\end{array}$$
(11)

Note that correctness and secrecy must hold for any behaviour of Eve (and also any noise model), while completeness is concerned with the honest implementation of the protocol. Correctness and secrecy bound the probability of Alice and Bob receiving different or insecure keys without detecting this fact and aborting the protocol. Completeness says that the protocol is robust against a given noise model in the sense that for this noise model, the probability of aborting the protocol is small if Eve behaves honestly. It is common to combine the correctness and secrecy parameters and call a protocol $({\varepsilon }^{{{{{{{{\rm{cor}}}}}}}}}+{\varepsilon }^{\sec }/2)$-secure, where the factor of 1/2 arises because our definition of secrecy uses the difference in trace norm, not the trace distance, which has an additional factor of 1/2.
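The secrecy condition in Equation (10) can be made concrete in the special case where Eve's side information is classical. The following is a minimal sketch under that assumption; the function name `secrecy_distance` and the two toy distributions are hypothetical illustrations, not part of the protocol in Box 1. For classical E, the trace norm reduces to a sum of absolute differences between the subnormalised joint distribution and its ideal counterpart.

```python
from itertools import product


def secrecy_distance(joint, key_bits):
    """Trace-norm secrecy distance of Eq. (10) for classical side information.

    joint[(k, e)] = Pr[K = k, E = e, not abort], i.e. the distribution is
    subnormalised by the probability of not aborting.
    """
    n_keys = 2 ** key_bits
    # Marginal of Eve's information, still weighted by Pr[not abort].
    p_e = {}
    for (k, e), p in joint.items():
        p_e[e] = p_e.get(e, 0.0) + p
    total = 0.0
    for e, pe in p_e.items():
        for k in range(n_keys):
            # Compare with the ideal state: uniform key, independent of e.
            total += abs(joint.get((k, e), 0.0) - pe / n_keys)
    return total


# Ideal case: a uniform 1-bit key independent of Eve's bit, never aborting.
ideal = {(k, e): 0.25 for k, e in product(range(2), range(2))}
# Leaky case: Eve's bit equals the key bit; the protocol accepts half the time.
leaky = {(0, 0): 0.25, (1, 1): 0.25}

d_ideal = secrecy_distance(ideal, 1)
d_leaky = secrecy_distance(leaky, 1)
```

In the leaky example the distance is weighted by the acceptance probability, which reflects the use of the subnormalised state in Definition 3: an insecure key only counts against secrecy to the extent that the protocol fails to abort.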

Our main result is that the protocol in Box 1 satisfies the correctness and secrecy conditions. Formally, we show the following.

Theorem 4

Fix any choice of arguments $n,{\psi }_{UQ},{\{{N}^{(v)}\}}_{v\in {{{{{{{\mathcal{V}}}}}}}}},\, {{{{{{{\rm{PD}}}}}}}},\, {{{{{{{\rm{RK}}}}}}}},\, {{{{{{{\rm{EV}}}}}}}},\, {k}_{{{{{{{{\rm{CA}}}}}}}}},\, {\lambda }_{{{{{{{{\rm{EC}}}}}}}}},\, {\varepsilon }_{{{{{{{{\rm{KV}}}}}}}}}$, and ε_PA for Box 1. Let ${{{{{{{\rm{CA}}}}}}}}:{\mathbb{P}}({{{{{{{\mathcal{C}}}}}}}})\to {\mathbb{R}}$ be an affine collective attack bound for this choice of arguments. For any ε_s, ε_a > 0 and α ∈ (1, 3/2), choose a final key length l that satisfies

$$\begin{array}{l}l\le n\,{k}_{{{{{{{{\rm{CA}}}}}}}}}-n\,\frac{\alpha -1}{2-\alpha }\,\frac{\ln (2)}{2}{V}^{2}-\frac{g({\varepsilon }_{s})+\alpha \log (1/{\varepsilon }_{a})}{\alpha -1}\\ \quad -n\,{\left(\frac{\alpha -1}{2-\alpha }\right)}^{2}{K}^{{\prime} }(\alpha )-\lceil 2\log (1/{\varepsilon }_{{{{{{{{\rm{PA}}}}}}}}})\rceil -\lceil \log (1/{\varepsilon }_{{{{{{{{\rm{KV}}}}}}}}})\rceil -{\lambda }_{{{{{{{{\rm{EC}}}}}}}}},\end{array}$$

(12)

where g(ε_s), V, and ${K}^{{\prime} }(\alpha )$ are defined in Theorem 9. With this choice of parameters and assuming that Condition 1 holds, the protocol in Box 1 is ε^cor-correct and ${\varepsilon }^{\sec }$-secret for

$$\begin{array}{r}{\varepsilon }^{{{{{{{{\rm{cor}}}}}}}}}={\varepsilon }_{{{{{{{{\rm{KV}}}}}}}}},\qquad {\varepsilon }^{\sec }=\max \{{\varepsilon }_{{{{{{{{\rm{PA}}}}}}}}}+4\,{\varepsilon }_{s},2\,{\varepsilon }_{a}\}+2\,{\varepsilon }_{{{{{{{{\rm{KV}}}}}}}}}.\end{array}$$

(13)

We prove this theorem in “Methods” subsection “Proof of main theorem”. In addition, we also show completeness; since this is much more straightforward and only uses standard techniques, we defer this to Supplementary Note B.
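As a small illustration of Equation (13), the following sketch computes the correctness and secrecy parameters from the individual error parameters. The function name and the numerical values are hypothetical; they are chosen only so that the result matches the target values quoted for Fig. 1 and are not the choices made in Supplementary Note F.

```python
import math


def security_parameters(eps_kv, eps_pa, eps_s, eps_a):
    """Return (eps_cor, eps_sec) as given by Eq. (13)."""
    eps_cor = eps_kv
    eps_sec = max(eps_pa + 4 * eps_s, 2 * eps_a) + 2 * eps_kv
    return eps_cor, eps_sec


# Hypothetical parameter choice (an assumption for illustration only).
eps_cor, eps_sec = security_parameters(
    eps_kv=5e-11, eps_pa=1e-10, eps_s=2e-10, eps_a=4.5e-10
)
```

Because the secrecy parameter involves a maximum, it is natural (though not required) to balance the two arguments, as in the example where both equal 9·10⁻¹⁰.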

Sample application: B92 protocol

We now demonstrate how to apply our framework, using the B92 protocol as an example. The B92 protocol has no natural entanglement-based analogue (i.e. an equivalent entanglement-based protocol that does not require “artificial” constraints on the reduced state on Alice’s side and still achieves the same key rate as the prepare-and-measure version of B92) and therefore cannot be analysed with the original EAT. Nonetheless, the B92 protocol is very simple, and therefore provides arguably the easiest example to demonstrate the application of our framework to a protocol that cannot be analysed with the EAT. Furthermore, while there exist analytic security proofs of B92 using entropic uncertainty relations^25,39, these techniques yield key rates that are far from optimal even in the asymptotic regime. This is in contrast to highly symmetric protocols such as BB84, where entropic uncertainty relations yield essentially tight proofs²⁸.

We emphasise that the purpose of this section is to illustrate our general results with a simple example, not to derive the tightest possible key rates for a particular protocol. We leave the analysis of more complicated protocols, where deriving the collective attack bound may be more involved, for future work. In Supplementary Note G, we also sketch how to express the decoy state BB84 protocol as an instance of our framework and how to derive a collective attack bound for it, demonstrating that the widely-used decoy state technique also naturally fits within our framework.

We also note that very recent work⁴⁰ has analysed the performance of the EAT on entanglement-based QKD protocols (and prepare-and-measure protocols that have a natural entanglement-based analogue) and found that it provides better key rates than previous methods. Since our GEAT-based security proof produces essentially the same key rates as the EAT in cases where both methods can be applied, this suggests that our framework will provide very good key rates also in cases where the EAT cannot be applied.

We start by giving an informal description of the B92 protocol and the intuition behind it. Then, we show how to view the B92 protocol as an instance of our general protocol in Box 1. Using the technique from “Results” subsection “Collective attack bounds” to derive a collective attack bound, we can then apply Theorem 4 to obtain a security statement for general attacks. To illustrate the result, we numerically compute the key rate for different choices of the number of rounds and tolerated noise level in Fig. 1.

Fig. 1: Key rates for the B92 protocol as a function of the depolarising probability p for ${\varepsilon }^{{{{{{{{\rm{cor}}}}}}}}}=5\cdot 1{0}^{-11},{\varepsilon }^{\sec }=1{0}^{-9}$, and ε^comp = 10⁻².

Each round of the B92 protocol works as follows: Alice chooses a bit u ∈ {0, 1} uniformly at random. If u = 0, she prepares the state ${\left|\psi \right\rangle }_{Q}=\left|0\right\rangle$, whereas if u = 1, she prepares ${\left|\psi \right\rangle }_{Q}=\left|+\right\rangle$. She sends ${\left|\psi \right\rangle }_{Q}$ to Bob, who chooses y ∈ {0, 1} uniformly at random and measures the system Q in the computational basis if y = 0 and the Hadamard basis if y = 1. If he obtains outcome “1” (when measuring in the computational basis) or “-” (when measuring in the Hadamard basis), he sets v = y ⊕ 1. Otherwise, he sets v = ⊥. In the sifting step, Bob announces in which rounds he recorded v = ⊥, and Alice sets u = ⊥ for those rounds, too. The bits u and v from all of the rounds form the raw key. To detect possible tampering by Eve, Alice and Bob compare their values of u and v on a subset of rounds.

The intuition behind this protocol is the following: the secret information that will make up the key is encoded in Alice’s basis choice u (where u = 0 corresponds to the computational and u = 1 to the Hadamard basis). When Bob receives the system Q he tries to find out which basis the state was prepared in. For this, he guesses a basis y and measures Q in this basis. Suppose he chose y = 0, i.e. the computational basis, and assume that Eve did not tamper with the system Q. Then, if he obtains outcome “1” he concludes that Alice cannot have prepared the state $\left|0\right\rangle$ and therefore must have chosen u = 1. Accordingly, he sets v = 1 = y ⊕ 1. If Bob obtains outcome “0” he cannot deduce Alice’s basis choice as both the states $\left|0\right\rangle$ and $\left|+\right\rangle$ may produce outcome “0” when measured in the computational basis, so he sets v = ⊥. Likewise, if he chose y = 1 and obtains outcome “-”, this provides conclusive evidence that Alice cannot have prepared the state $\left|+\right\rangle$, so he sets v = 0 = y ⊕ 1, whereas the outcome “+” is inconclusive. If Eve tries to tamper with the system Q, she is likely to disturb the state as she does not know which basis it was prepared in. Therefore, Alice and Bob will detect this tampering when comparing their values of u and v.
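The intuition above can be checked by enumerating all outcomes of a single noiseless round. The sketch below is an illustration only: it assumes no tampering by Eve, uses real amplitudes, and represents the inconclusive value ⊥ by `None`; all names are hypothetical.

```python
import math

S = 1 / math.sqrt(2)
KET = {"0": (1.0, 0.0), "1": (0.0, 1.0), "+": (S, S), "-": (S, -S)}


def overlap_sq(a, b):
    """Return |<a|b>|^2 for real two-dimensional vectors."""
    return (a[0] * b[0] + a[1] * b[1]) ** 2


def round_distribution():
    """Exact joint distribution of (u, v) for one noiseless B92 round."""
    dist = {}
    for u in (0, 1):
        prepared = KET["0"] if u == 0 else KET["+"]
        for y in (0, 1):
            outcomes = ("0", "1") if y == 0 else ("+", "-")
            for out in outcomes:
                # Pr[u] * Pr[y] = 1/4, times the Born probability of `out`.
                prob = 0.25 * overlap_sq(KET[out], prepared)
                # Conclusive outcomes are "1" and "-"; None stands for ⊥.
                v = y ^ 1 if out in ("1", "-") else None
                dist[(u, v)] = dist.get((u, v), 0.0) + prob
    return dist


dist = round_distribution()
p_conclusive = sum(p for (u, v), p in dist.items() if v is not None)
p_error = sum(p for (u, v), p in dist.items() if v is not None and v != u)
```

In this noiseless setting a quarter of the rounds are conclusive and none of the conclusive rounds yield v ≠ u, so any observed disagreement between u and v signals noise or tampering.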

We now give a more formal description of the B92 protocol as an instance of the protocol in Box 1. As for the BB84 protocol described in Results Subsection “Framework for prepare-and-measure protocols”, this means specifying the arguments ${\psi }_{UQ},{\{{N}^{(v)}\}}_{v\in {{{{{{{\mathcal{V}}}}}}}}},{{{{{{{\rm{PD}}}}}}}},{{{{{{{\rm{RK}}}}}}}}$, and EV. For each round Alice chooses a bit U_i uniformly at random and prepares $\left|0\right\rangle$ or $\left|+\right\rangle$ based on her choice, so

$$\begin{array}{r}{\psi }_{UQ}=\frac{1}{2}(\left|0\right\rangle {\left\langle 0\right|}_{U}\otimes \left|0\right\rangle {\left\langle 0\right|}_{Q}+\left|1\right\rangle {\left\langle 1\right|}_{U}\otimes \left|+\right\rangle {\left\langle+\right|}_{Q}).\end{array}$$

(14)

Bob measures in either the computational or Hadamard basis and uses the outcome to determine V_i ∈ {0, 1, ⊥} as described before. This measurement is described by the following POVM:

$$\begin{array}{r}{N}^{(0)}=\frac{1}{2}\left|-\right\rangle \left\langle -\right|,\ {N}^{(1)}=\frac{1}{2}\left|1\right\rangle \left\langle 1\right|,\ {N}^{(\perp )}=\frac{1}{2}(\left|0\right\rangle \left\langle 0\right|+\left|+\right\rangle \left\langle+\right|).\end{array}$$

(15)
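As a sanity check on Equation (15), the following sketch builds the three POVM elements from Bob's two projective measurements, each chosen with probability 1/2, and sums them. The helper names are hypothetical, and the matrices are stored as plain nested lists so that no external library is needed.

```python
import math

S = 1 / math.sqrt(2)


def projector(vec):
    """Return |vec><vec| as a 2x2 nested list (real amplitudes only)."""
    return [[vec[i] * vec[j] for j in range(2)] for i in range(2)]


def combine(*weighted):
    """Return the sum of weight * matrix over (weight, matrix) pairs."""
    return [
        [sum(w * m[i][j] for w, m in weighted) for j in range(2)]
        for i in range(2)
    ]


P0, P1 = projector((1.0, 0.0)), projector((0.0, 1.0))
P_PLUS, P_MINUS = projector((S, S)), projector((S, -S))

# Bob picks each basis with probability 1/2, hence the factors of 1/2.
POVM = {
    0: combine((0.5, P_MINUS)),  # y = 1, outcome "-"
    1: combine((0.5, P1)),  # y = 0, outcome "1"
    "inconclusive": combine((0.5, P0), (0.5, P_PLUS)),
}

povm_sum = combine(*[(1.0, element) for element in POVM.values()])
```

The sum equals the identity up to floating-point error, confirming that the three elements form a valid POVM.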

During the public discussion phase, Bob informs Alice which rounds were inconclusive, i.e. yielded outcome ⊥. Therefore,

$$\begin{array}{r}{I}_{i}={{{{{{{\rm{PD}}}}}}}}({U}_{i},{V}_{i})=\left\{\begin{array}{ll}\perp \quad &{{{{{{{\rm{if}}}}}}}}\,{V}_{i}=\perp,\\ \top \quad &{{{{{{{\rm{otherwise}}}}}}}}.\end{array}\right.\end{array}$$

(16)

To generate her raw key Sⁿ, Alice uses her bits U_i and discards the rounds for which Bob’s measurement outcome was inconclusive, which she knows from the value of I_i:

$$\begin{array}{r}{S}_{i}={{{{{{{\rm{RK}}}}}}}}({U}_{i},{I}_{i})=\left\{\begin{array}{ll}\perp \quad &{{{{{{{\rm{if}}}}}}}}\,{I}_{i}=\perp,\\ {U}_{i}\quad &{{{{{{{\rm{otherwise}}}}}}}}.\end{array}\right.\end{array}$$

(17)

To generate the statistics ${\hat{C}}_{i}$, Bob will check whether his guess ${\hat{S}}^{n}$ for Alice’s raw key agrees with his own raw data Vⁿ. As for the BB84 protocol described in Results Subsection “Framework for prepare-and-measure protocols”, Bob can only do so on a small fraction γ of rounds because Definition 2 includes the classical statistics as a conditioning system. Therefore, Bob chooses a value T_i at random with $\Pr \left[{T}_{i}=1\right]=\gamma$ (the choice of T_i can formally be included into V_i or one can view EV as a randomised rather than deterministic function). If T_i = 0, he sets ${\hat{C}}_{i}=\perp$, i.e. ${{{{{{{{\rm{EV}}}}}}}}}_{{T}_{i}=0}({V}_{i},{I}_{i},{\hat{S}}_{i})=\perp$. Otherwise, he sets ${\hat{C}}_{i}={{{{{{{{\rm{EV}}}}}}}}}_{{T}_{i}=1}({V}_{i},{I}_{i},{\hat{S}}_{i})$ to

$$\begin{array}{r}\left\{\begin{array}{ll}{\mathtt{fail}}\quad &{{{{{{{\rm{if}}}}}}}}\,{\hat{S}}_{i}=0\wedge {V}_{i}=1\,{{{{{{{\rm{or}}}}}}}}\,{\hat{S}}_{i}=1\wedge {V}_{i}=0,\\ {\mathtt{inc}}\quad &{{{{{{{\rm{if}}}}}}}}\,{V}_{i}=\perp,\hfill\\ \varnothing \quad &{{{{{{{\rm{else}}}}}}}}.\hfill\end{array}\right.\end{array}$$

(18)

Of course, the functions ${{{{{{{{\rm{EV}}}}}}}}}_{{T}_{i}=0}$ and ${{{{{{{{\rm{EV}}}}}}}}}_{{T}_{i}=1}$ can be combined into a single function EV to formally fit into the framework of Box 1.
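The classical post-processing in Equations (16) to (18) can be written as three small functions. This is a sketch under stated assumptions: the string labels (`"bot"`, `"top"`, `"fail"`, `"inc"`, `"ok"`) are hypothetical stand-ins for ⊥, ⊤, fail, inc, and ∅, and the test flag T_i is passed to `ev` as an explicit argument, corresponding to the combined function EV mentioned above.

```python
BOT = "bot"  # stands for the symbol ⊥
TOP = "top"  # stands for the symbol ⊤


def pd(u, v):
    """Public discussion, Eq. (16): announce whether the round was inconclusive."""
    return BOT if v == BOT else TOP


def rk(u, i):
    """Raw key, Eq. (17): keep Alice's bit only in conclusive rounds."""
    return BOT if i == BOT else u


def ev(v, i, s_hat, t):
    """Statistics, Eq. (18), combined with the test-round flag t."""
    if t == 0:
        return BOT  # not a test round
    if (s_hat, v) in ((0, 1), (1, 0)):
        return "fail"  # Bob's guess of Alice's key bit disagrees with his data
    if v == BOT:
        return "inc"  # inconclusive measurement outcome
    return "ok"  # stands for the symbol ∅


# One conclusive, error-free test round with u = 1 and v = 1.
i_example = pd(1, 1)
s_example = rk(1, i_example)
c_example = ev(1, i_example, s_example, t=1)
```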

We need to derive an affine collective attack bound ${{{{{{{\rm{CA}}}}}}}}({\nu }_{C})=\overrightarrow{\lambda }\cdot {\overrightarrow{\nu }}_{C}+{c}_{\overrightarrow{\lambda }}$ for the B92 protocol, where ${\overrightarrow{\nu }}_{C}$ denotes the probability vector of distribution ν_C as in Results Subsection “Collective attack bounds”. For this, we use the steps and notation from Methods Subsection “Deriving collective attack bounds”; we recommend skipping the remainder of this derivation on a first reading and returning to it after reading that Methods subsection.

In the notation of Methods Subsection “Deriving collective attack bounds”, the state ${\tilde{\psi }}_{PQ}$ is given by

$$\begin{array}{r}{\tilde{\psi }}_{PQ}=\frac{1}{\sqrt{2}}({\left|0\right\rangle }_{P}\otimes {\left|0\right\rangle }_{Q}+{\left|1\right\rangle }_{P}\otimes {\left|+\right\rangle }_{Q}).\end{array}$$

(19)

For any state ${\hat{\psi }}_{PQ}$ chosen by Eve, the statistics observed by Alice and Bob are described by

$$\begin{array}{r}{\overrightarrow{\nu }}_{C}=\,{{\mbox{Tr}}}\,\left[\overrightarrow{{{\Gamma }}}{\hat{\psi }}_{PQ}\right],\end{array}$$

(20)

where $\overrightarrow{{{\Gamma }}}=({{{\Gamma }}}_{{\mathtt{fail}}},{{{\Gamma }}}_{{\mathtt{inc}}},{{{\Gamma }}}_{\varnothing },{{{\Gamma }}}_{\perp })$ with

$${{{\Gamma }}}_{{\mathtt{fail}}}=\gamma (\left|0\right\rangle {\left\langle 0\right|}_{P}\otimes {N}_{Q}^{(1)}+\left|1\right\rangle {\left\langle 1\right|}_{P}\otimes {N}_{Q}^{(0)}),$$

(21)

$${{{\Gamma }}}_{{\mathtt{inc}}}=\gamma {{\mathbb{1}}}_{P}\otimes {N}_{Q}^{(\perp )},$$

(22)

$${{{\Gamma }}}_{\varnothing }=\gamma \,{\mathbb{1}}-{{{\Gamma }}}_{{\mathtt{fail}}}-{{{\Gamma }}}_{{\mathtt{inc}}},$$

(23)

$${{{\Gamma }}}_{\perp }=(1-\gamma ){{\mathbb{1}}}_{P}\otimes {{\mathbb{1}}}_{Q},$$

(24)

and $\,{{\mbox{Tr}}}\,\left[\overrightarrow{{{\Gamma }}}{\hat{\psi }}_{PQ}\right]$ is shorthand for the vector whose entries are the traces of ${\hat{\psi }}_{PQ}$ with the individual elements of $\overrightarrow{{{\Gamma }}}$. We can now directly apply the method from Methods Subsection “Deriving collective attack bounds” to find a collective attack bound ${{{{{{{\rm{CA}}}}}}}}({\nu }_{C})=\overrightarrow{\lambda }\cdot {\overrightarrow{\nu }}_{C}+{c}_{\overrightarrow{\lambda }}$: we can heuristically choose a $\overrightarrow{\lambda }$ and then determine ${c}_{\overrightarrow{\lambda }}$ by solving the convex optimisation problem from Equation (70) using the Matlab package CVXQUAD⁴¹. Note that one can pick $\overrightarrow{\lambda }$ by any numerical optimisation technique such as Matlab’s fminsearch: since $\overrightarrow{\lambda }$ can be chosen heuristically, it is not an issue if such an optimisation method does not have a convergence guarantee. In contrast, to determine ${c}_{\overrightarrow{\lambda }}$ one must use an optimisation method that guarantees a lower bound in order to ensure that the collective attack bound is valid. This is why it is important that ${c}_{\overrightarrow{\lambda }}$ be determined via a convex optimisation problem for which one can certify the solution by duality. For our numerical implementation, we employ additional simplifications to the optimisation problem from Equation (70) using the steps described in Supplementary Note E. This helps with numerical performance, but is not strictly necessary.

As our noise model for an honest implementation, we consider the depolarising channel with depolarising probability p, i.e. the channel that maps ρ ↦ (1 − p)ρ + pτ, where τ is the maximally mixed state. We determine the key rate as a function of p, i.e. we determine the amount of key that can safely be generated from any potentially dishonest implementation that produces the same statistics as the honest implementation with noise level p. To this end, for every value of p we first determine the statistics produced by an honest implementation with that noise level. We then choose a collective attack bound and parameters for Theorem 4 that ensure that the protocol is ε^cor-correct, ${\varepsilon }^{\sec }$-secret, and ε^comp-complete for that noise level and ${\varepsilon }^{{{{{{{{\rm{cor}}}}}}}}}=5\cdot 1{0}^{-11},\, {\varepsilon }^{\sec }=1{0}^{-9}$, and ε^comp = 10⁻². Finally, we choose the key length to be the largest integer l that satisfies the condition in Equation (12). We provide the choice of parameters in detail in Supplementary Note F and plot the resulting key rate in Fig. 1 for different numbers of rounds n. We again note that the choice of parameters here is largely arbitrary and not optimised as the purpose of this example is only to illustrate the use of our general framework.
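The first step described above, determining the statistics of an honest implementation at noise level p, can be sketched as follows. The code assumes the states of Equation (14), the POVM of Equation (15), and a testing probability γ; the function names and the output labels (`"fail"`, `"inc"`, `"ok"`, `"untested"`, standing for fail, inc, ∅, and ⊥) are hypothetical. The numerical values in the usage line are arbitrary and not those of Supplementary Note F.

```python
import math

S = 1 / math.sqrt(2)


def projector(vec):
    """Return |vec><vec| as a 2x2 nested list (real amplitudes only)."""
    return [[vec[i] * vec[j] for j in range(2)] for i in range(2)]


def combine(*weighted):
    """Return the sum of weight * matrix over (weight, matrix) pairs."""
    return [
        [sum(w * m[i][j] for w, m in weighted) for j in range(2)]
        for i in range(2)
    ]


def depolarise(rho, p):
    """Apply rho -> (1 - p) rho + p * identity / 2 to a qubit state."""
    return combine((1 - p, rho), (p / 2, [[1.0, 0.0], [0.0, 1.0]]))


def trace_product(a, b):
    """Return Tr[a b] for 2x2 matrices."""
    return sum(a[i][j] * b[j][i] for i in range(2) for j in range(2))


def honest_statistics(p, gamma):
    """Distribution of the statistic C for an honest run with depolarising noise p."""
    p0, p1 = projector((1.0, 0.0)), projector((0.0, 1.0))
    plus, minus = projector((S, S)), projector((S, -S))
    povm = {0: combine((0.5, minus)), 1: combine((0.5, p1)),
            None: combine((0.5, p0), (0.5, plus))}
    prepared = {0: p0, 1: plus}  # Alice's states for u = 0 and u = 1
    p_fail = p_inc = 0.0
    for u, rho in prepared.items():
        noisy = depolarise(rho, p)
        # Each u occurs with probability 1/2; an error means v = 1 - u.
        p_fail += 0.5 * trace_product(povm[1 - u], noisy)
        p_inc += 0.5 * trace_product(povm[None], noisy)
    return {"fail": gamma * p_fail, "inc": gamma * p_inc,
            "ok": gamma * (1 - p_fail - p_inc), "untested": 1 - gamma}


stats = honest_statistics(p=0.04, gamma=0.1)
```

For this noise model the test-round probabilities work out to γp/4 for fail, γ(3 − p)/4 for inc, and γ/4 for ∅, so only the failure and inconclusive frequencies carry information about p.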

Discussion

We have introduced a proof technique for analysing the security of QKD protocols in the finite-size regime against general attacks. This technique is best understood as a general procedure for converting a security proof in the i.i.d. asymptotic setting into a finite-size security proof against general attacks. To apply our technique, one can express a protocol of interest as an instance of our template protocol in Box 1, derive a collective attack bound (either using the general numerical technique described in “Results” subsection “Collective attack bounds” or by reusing an existing analysis in the i.i.d. asymptotic setting), and apply our Theorem 4 to obtain finite-size key rates against general attacks. Unlike previous techniques, our method can be applied directly to prepare-and-measure protocols and does not depend on the dimension of the underlying Hilbert space, allowing for a simple analysis of photonic prepare-and-measure protocols.

While we have provided a simple illustrative example of applying our framework to the well-known B92 protocol (Results Subsection “Sample application: B92 protocol”), which is not amenable to treatment with the EAT, and sketched the analysis of the BB84 decoy-state protocol (Supplementary Note G), we leave it for future work to analyse more practical protocols and optimise the bounds one can obtain for those protocols. This is especially relevant given that commercial QKD systems may become increasingly prevalent in the near future. In particular, it would be interesting to see whether our framework can be used to prove the security of the differential phase-shift⁴² and coherent one-way⁴³ QKD protocols. These protocols (and related ones using similar ideas) are relatively practical to implement, but notoriously hard to analyse.

Methods

Notation

The set of states for a quantum system A (with associated Hilbert space ${{{{{{{{\mathcal{H}}}}}}}}}_{A}$) is given by ${{{{{{{\rm{S}}}}}}}}(A)=\{\rho \in {{{{{{{\rm{Pos}}}}}}}}(A)\,| \,\,{{\mbox{Tr}}}\,\left[\rho \right]=1\}$, where Pos(A) is the set of positive operators on ${{{{{{{{\mathcal{H}}}}}}}}}_{A}$. If A is a quantum system and X is a classical system with alphabet ${{{{{{{\mathcal{X}}}}}}}}$, we call ρ ∈ S(XA) a cq-state and can expand it as ${\rho }_{XA}={\sum }_{x\in {{{{{{{\mathcal{X}}}}}}}}}\left|x\right\rangle \left\langle x\right|\otimes {\rho }_{A,x}$ for subnormalised ρ_A,x ∈ Pos(A). For ${{\Omega }}\subset {{{{{{{\mathcal{X}}}}}}}}$, we define the partial and conditional states

$$\begin{array}{r}{\rho }_{XA\wedge {{\Omega }}}=\mathop{\sum}\limits_{x\in {{\Omega }}}\left|x\right\rangle \left\langle x\right|\otimes {\rho }_{A,x}\,{{{{{{{\rm{and}}}}}}}}\,{\rho }_{XA| {{\Omega }}}=\frac{1}{{\Pr }_{\rho }\left[{{\Omega }}\right]}{\rho }_{XA\wedge {{\Omega }}},\end{array}$$

(25)

where ${\Pr }_{\rho }\left[{{\Omega }}\right] := \,{{\mbox{Tr}}}\,\left[{\rho }_{XA\wedge {{\Omega }}}\right]$. If Ω = {x}, we also write ρ_XA∣x for ρ_XA∣Ω. The set of quantum channels from system A to ${A}^{{\prime} }$ is denoted as ${{{{{{{\rm{CPTP}}}}}}}}(A,{A}^{{\prime} })$. The trace norm (sum of the singular values) of an operator L on ${{{{{{{{\mathcal{H}}}}}}}}}_{A}$ is denoted as ${\left\|L\right\|}_{1}$.

We will deal with two different entropies, the von Neumann entropy and the min-entropy, which are defined as follows. Let ρ_AB ∈ S(AB) be a quantum state. Then the conditional von Neumann entropy of A conditioned on B is given by

$$\begin{array}{r}H{(A| B)}_{\rho }=-{{\mbox{Tr}}}\,\left[{\rho }_{AB}\log {\rho }_{AB}\right]+\,{{\mbox{Tr}}}\,\left[{\rho }_{B}\log {\rho }_{B}\right].\end{array}$$

(26)

For ε ∈ [0, 1], the ε-smoothed min-entropy of A conditioned on B is

$$\begin{array}{r}{H}_{\min }^{\varepsilon }{(A| B)}_{\rho }=-\log \mathop{\inf }\limits_{{\tilde{\rho }}_{AB}}\mathop{\inf }\limits_{{\sigma }_{B}\in {{{{{{{\rm{S}}}}}}}}(B)}{\left\|{\sigma }_{B}^{-\frac{1}{2}}{\tilde{\rho }}_{AB}{\sigma }_{B}^{-\frac{1}{2}}\right\|}_{\infty }\,,\end{array}$$

(27)

where ${\left\|\cdot \right\|}_{\infty }$ denotes the spectral norm and the first infimum is taken over all states ${\tilde{\rho }}_{AB}\in {{{{{{{{\mathcal{B}}}}}}}}}_{\varepsilon }({\rho }_{AB})$ in the ε-ball around ρ_AB (in terms of the purified distance⁴⁴).
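To build intuition for the two entropies, it can help to evaluate them in the fully classical case with ε = 0. The sketch below assumes that both A and B are classical, in which case Equation (26) reduces to H(AB) − H(B) and Equation (27) reduces to −log Σ_b max_a p(a, b), i.e. minus the logarithm of the optimal guessing probability. The function names and the example distribution are hypothetical.

```python
import math


def conditional_entropy(joint):
    """H(A|B) = H(AB) - H(B) in bits for a classical distribution joint[(a, b)]."""
    p_b = {}
    for (a, b), p in joint.items():
        p_b[b] = p_b.get(b, 0.0) + p
    h_ab = -sum(p * math.log2(p) for p in joint.values() if p > 0)
    h_b = -sum(p * math.log2(p) for p in p_b.values() if p > 0)
    return h_ab - h_b


def conditional_min_entropy(joint):
    """Non-smoothed H_min(A|B) = -log2 sum_b max_a p(a, b) for classical A, B."""
    best = {}
    for (a, b), p in joint.items():
        best[b] = max(best.get(b, 0.0), p)
    return -math.log2(sum(best.values()))


# A is a uniform bit; B is a copy of A that is flipped with probability 1/4.
joint = {(0, 0): 0.375, (0, 1): 0.125, (1, 0): 0.125, (1, 1): 0.375}
h_vn = conditional_entropy(joint)
h_min = conditional_min_entropy(joint)
```

In this example the min-entropy is strictly smaller than the von Neumann entropy, which is the generic situation and the reason why finite-size bounds on the min-entropy contain correction terms relative to the asymptotic rate.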

Universal hashing and randomness extraction

To check that Alice’s and Bob’s keys are the same, our general QKD protocol will make use of a universal hash family, and to extract a secure key from Alice’s and Bob’s raw data we will use a randomness extractor. Here, we briefly define what these primitives achieve. We refer to ref. ¹⁵ for a more detailed exposition and explanation of their construction.

Definition 5

(Universal hash family) Let M be a set. A family ${{{{{{{\mathcal{F}}}}}}}}$ of functions from M to {0, 1}^l with a probability distribution ${P}_{{{{{{{{\mathcal{F}}}}}}}}}$ over ${{{{{{{\mathcal{F}}}}}}}}$ is called a universal hash family if for any $x\, \ne \, {x}^{{\prime} }\in M,{\Pr }_{f}[f(x)=f({x}^{{\prime} })]\le {2}^{-l}$.
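A standard textbook example of a universal hash family is multiplication by a uniformly random binary matrix over GF(2). The sketch below verifies the collision bound of Definition 5 exhaustively for small parameters; it is offered as an illustration and is not necessarily the construction used in ref. ¹⁵. The function names and the choice m = 3, l = 2 are assumptions.

```python
from itertools import product


def apply_matrix(rows, x_bits):
    """Multiply a binary matrix (tuple of rows) by a bit vector over GF(2)."""
    return tuple(sum(r * x for r, x in zip(row, x_bits)) % 2 for row in rows)


def collision_probability(m, l, x, x_prime):
    """Exact Pr_f[f(x) = f(x')] for f uniform over all l-by-m binary matrices."""
    row_space = list(product((0, 1), repeat=m))
    collisions = 0
    total = 0
    for rows in product(row_space, repeat=l):
        total += 1
        if apply_matrix(rows, x) == apply_matrix(rows, x_prime):
            collisions += 1
    return collisions / total


m, l = 3, 2
inputs = list(product((0, 1), repeat=m))
worst = max(
    collision_probability(m, l, x, xp)
    for x in inputs
    for xp in inputs
    if x != xp
)
```

The worst-case collision probability over all pairs of distinct inputs equals 2^(−l), so the bound in Definition 5 holds with equality for this family.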

Definition 6

(Quantum-proof strong extractor^15,45,46) A function EXT: {0, 1}^m × {0, 1}^d → {0, 1}^l is a quantum-proof strong (k, ε_EXT)-extractor if for any ρ_SE ∈ Pos(SE) with $\,{{\mbox{Tr}}}\,\left[\rho \right]\le 1$ (and S classical with dimension 2^m) for which ${H}_{\min }{(S| E)}_{\rho }\ge k$, we have

$$\begin{array}{r}{\left\|{{{{{{{\rm{EXT}}}}}}}}({\rho }_{SE}\otimes {\tau }_{D})-{\tau }_{K}\otimes {\rho }_{E}\otimes {\tau }_{D}\right\|}_{1}\le {\varepsilon }_{{{{{{{{\rm{EXT}}}}}}}}},\end{array}$$

(28)

where τ_D and τ_K are maximally mixed states of dimension 2^d and 2^l, respectively, and the map EXT acts on the classical systems S and D. The input on system D is called the seed of the extractor.

This definition of extractors makes use of the non-smoothed min-entropy ${H}_{\min }{(S| E)}_{\rho }$. It is straightforward to modify this condition so that it only requires a lower bound on the smooth min-entropy: if EXT is a quantum-proof strong (k, ε_EXT)-extractor as in Definition 6 and ρ_SE satisfies ${H}_{\min }^{\varepsilon }{(S| E)}_{\rho }\ge k$, then

$$\begin{array}{r}{\left\|{{{{{{{\rm{EXT}}}}}}}}({\rho }_{SE}\otimes {\tau }_{D})-{\tau }_{K}\otimes {\rho }_{E}\otimes {\tau }_{D}\right\|}_{1}\le {\varepsilon }_{{{{{{{{\rm{EXT}}}}}}}}}+4\varepsilon .\end{array}$$

(29)

To see that this is the case, note that ${H}_{\min }^{\varepsilon }{(S| E)}_{\rho }\ge k$ means that there exists a ${\rho }^{{\prime} }$ within ε purified distance of ρ for which ${H}_{\min }{(S| E)}_{{\rho }^{{\prime} }}\ge k$. By the relation between purified distance and trace distance⁴⁴, we have ${\left\|\rho -{\rho }^{{\prime} }\right\|}_{1}\le 2\varepsilon$. Then, Equation (29) follows from the triangle inequality and because applying the map EXT cannot increase the trace distance.
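Spelled out, the triangle-inequality argument can be written as the following chain, where ${\rho }^{{\prime} }$ is the state from the previous paragraph. This is our own expansion of the step, using only the facts stated above together with the observation that the partial trace also cannot increase the trace norm, so that ${\left\|{\rho }_{E}-{\rho }_{E}^{{\prime} }\right\|}_{1}\le 2\varepsilon$:

$$\begin{array}{l}{\left\|{\rm{EXT}}({\rho }_{SE}\otimes {\tau }_{D})-{\tau }_{K}\otimes {\rho }_{E}\otimes {\tau }_{D}\right\|}_{1}\\ \quad \le {\left\|{\rm{EXT}}({\rho }_{SE}\otimes {\tau }_{D})-{\rm{EXT}}({\rho }_{SE}^{{\prime} }\otimes {\tau }_{D})\right\|}_{1}+{\left\|{\rm{EXT}}({\rho }_{SE}^{{\prime} }\otimes {\tau }_{D})-{\tau }_{K}\otimes {\rho }_{E}^{{\prime} }\otimes {\tau }_{D}\right\|}_{1}\\ \qquad +{\left\|{\tau }_{K}\otimes {\rho }_{E}^{{\prime} }\otimes {\tau }_{D}-{\tau }_{K}\otimes {\rho }_{E}\otimes {\tau }_{D}\right\|}_{1}\\ \quad \le 2\varepsilon+{\varepsilon }_{{\rm{EXT}}}+2\varepsilon .\end{array}$$

The first and third terms are each bounded by 2ε, and the middle term is bounded by ε_EXT by applying Definition 6 to ${\rho }^{{\prime} }$.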

For the purposes of QKD, a simple construction based on two-universal hashing¹⁵ provides sufficiently good parameters. We also note that more involved constructions exist that require shorter seeds, but this is typically not a concern for QKD applications (see e.g. ref. ⁴⁶ for a very efficient example using Trevisan’s extractor).

Lemma 7

(ref. ¹⁵) There exist quantum-proof strong (k, ε_EXT)-extractors EXT: {0, 1}^m × {0, 1}^d → {0, 1}^l for d = m and $l\le k-2\log (1/{\varepsilon }_{{{{{{{{\rm{EXT}}}}}}}}})$.

Generalised entropy accumulation

In this section, we introduce the GEAT from ref. ³⁶. Most of this section is taken directly from³⁶ and we refer to the introduction of that paper for a more detailed description of the setting and how it compares to the EAT¹⁷. Consider a sequence of channels ${{{{{{{{\mathcal{M}}}}}}}}}_{i}\in {{{{{{{\rm{CPTP}}}}}}}}({R}_{i-1}{E}_{i-1},{C}_{i}{A}_{i}{R}_{i}{E}_{i})$ for i ∈ {1, …, n}, where C_i are classical systems with common alphabet ${{{{{{{\mathcal{C}}}}}}}}$. In the context of cryptographic protocols, one should think of E_i as Eve’s side information after the i-th round, R_i as some internal system of a device, A_i as the protocol’s output in the i-th round, and C_i as classical statistics that determine whether the protocol aborts (e.g. by checking the number of rounds on which A_i does not satisfy a certain property). For all results in this paper, R_i can be chosen to be trivial. However, for (semi-)device-independent applications, the systems R_i are important because they can be used to describe the internal memory of the untrusted devices. As this is an interesting direction for future work, we state the theorem in full generality here.

We require that these channels ${{{{{{{{\mathcal{M}}}}}}}}}_{i}$ satisfy the following condition: defining ${{{{{{{{\mathcal{M}}}}}}}}}_{i}^{{\prime} }={{{{{{{{\rm{Tr}}}}}}}}}_{{C}_{i}}\circ {{{{{{{{\mathcal{M}}}}}}}}}_{i}$ (where ${{{{{{{{\rm{Tr}}}}}}}}}_{{C}_{i}}$ is the partial trace over system C_i and ∘ is the composition of channels), there exists a channel ${{{{{{{\mathcal{T}}}}}}}}\in {{{{{{{\rm{CPTP}}}}}}}}({A}^{n}{E}_{n},{C}^{n}{A}^{n}{E}_{n})$ such that ${{{{{{{{\mathcal{M}}}}}}}}}_{n}\circ \cdots \circ {{{{{{{{\mathcal{M}}}}}}}}}_{1}={{{{{{{\mathcal{T}}}}}}}}\circ {{{{{{{{\mathcal{M}}}}}}}}}_{n}^{{\prime} }\circ \cdots \circ {{{{{{{{\mathcal{M}}}}}}}}}_{1}^{{\prime} }$ and ${{{{{{{\mathcal{T}}}}}}}}$ has the form

$${{{{{{{\mathcal{T}}}}}}}}({\omega }_{{A}^{n}{E}_{n}})=\mathop{\sum}\limits_{y\in {{{{{{{\mathcal{Y}}}}}}}},z\in {{{{{{{\mathcal{Z}}}}}}}}}\left({{{\Pi }}}_{{A}^{n}}^{(y)}\otimes {{{\Pi }}}_{{E}_{n}}^{(z)}\right){\omega }_{{A}^{n}{E}_{n}}\left({{{\Pi }}}_{{A}^{n}}^{(y)}\otimes {{{\Pi }}}_{{E}_{n}}^{(z)}\right) \otimes \left|r(y,z)\right\rangle {\left\langle r(y,z)\right|}_{{C}^{n}},$$

(30)

where $\{{{{\Pi }}}_{{A}^{n}}^{(y)}\}$ and $\{{{{\Pi }}}_{{E}_{n}}^{(z)}\}$ are families of mutually orthogonal projectors on Aⁿ and E_n, and $r:{{{{{{{\mathcal{Y}}}}}}}}\times {{{{{{{\mathcal{Z}}}}}}}}\to {{{{{{{\mathcal{C}}}}}}}}$ is a deterministic function. Intuitively, this condition says that the classical statistics can be reconstructed “in a projective way” from systems Aⁿ and E_n at the end of the protocol. In particular, this requirement is always satisfied if the statistics are computed from classical information contained in Aⁿ and E_n, which is the case for the applications in this paper. We note that the statistics are still generated in a round-by-round manner; Eq. (30) merely asserts that they could be reconstructed from the final state.

Let ${\mathbb{P}}$ be the set of probability distributions on the alphabet ${{{{{{{\mathcal{C}}}}}}}}$ of C_i, and let ${\tilde{E}}_{i-1}$ be a system isomorphic to ${R}_{i-1}{E}_{i-1}$. For any $q\in {\mathbb{P}}$ we define the set of states

$${{{\Sigma }}}_{i}(q)=\big\{{\nu }_{{C}_{i}{A}_{i}{R}_{i}{E}_{i}{\tilde{E}}_{i-1}}={{{{{{{{\mathcal{M}}}}}}}}}_{i}({\omega }_{{R}_{i-1}{E}_{i-1}{\tilde{E}}_{i-1}})\left.\right| \omega \in {{{{{{{\rm{S}}}}}}}}({R}_{i-1}{E}_{i-1}{\tilde{E}}_{i-1})\,\,{{\mbox{and}}}\,\,{\nu }_{{C}_{i}}=q\big\},$$

(31)

where ${\nu }_{{C}_{i}}$ denotes the probability distribution over ${{{{{{{\mathcal{C}}}}}}}}$ with the probabilities given by $\Pr \left[c\right]=\left\langle c\right|{\nu }_{{C}_{i}}\left|c\right\rangle$. In other words, Σ_i(q) is the set of states that can be produced at the output of the channel ${{{{{{{{\mathcal{M}}}}}}}}}_{i}$ and whose reduced state on C_i is equal to the probability distribution q.

Definition 8

A function $f:{\mathbb{P}}\to {\mathbb{R}}$ is called a min-tradeoff function for $\{{{{{{{{{\mathcal{M}}}}}}}}}_{i}\}$ if it satisfies

$$\begin{array}{r}f(q)\le \mathop{\min }\limits_{\nu \in {{{\Sigma }}}_{i}(q)}H{({A}_{i}| {E}_{i}{\tilde{E}}_{i-1})}_{\nu }\quad \forall i=1,\ldots,n\,.\end{array}$$

(32)

Note that if ${{{\Sigma }}}_{i}(q)={{\emptyset}}$, then f(q) can be chosen arbitrarily.

Our result will depend on some simple properties of the min-tradeoff function, namely the maximum and minimum of f, the minimum of f over valid distributions, and the maximum variance of f:

$${\mathsf{Max}}(f):=\mathop{\max }\limits_{q\in {\mathbb{P}}}f(q),$$

(33)

$${\mathsf{Min}}(f):=\mathop{\min }\limits_{q\in {\mathbb{P}}}f(q),$$

(34)

$${{\mathsf{Min}}}_{{{\Sigma }}}(f):=\mathop{\min }\limits_{q:{{\Sigma }}(q)\ne {{\emptyset}}}f(q),$$

(35)

$${\mathsf{Var}}(f):=\mathop{\max }\limits_{q:{{\Sigma }}(q)\ne {{\emptyset}}}\mathop{\sum}\limits_{x\in {{{{{{{\mathcal{C}}}}}}}}}q(x)f{({\delta }_{x})}^{2}-{\left(\mathop{\sum}\limits_{x\in {{{{{{{\mathcal{C}}}}}}}}}q(x)f({\delta }_{x})\right)}^{2},$$

(36)

where Σ(q) = ⋃_iΣ_i(q) and δ_x is the distribution with all the weight on element x. We write freq(Cⁿ) for the distribution on ${{{{{{{\mathcal{C}}}}}}}}$ defined by ${\mathsf{freq}}({C}^{n})(c)=\frac{| \{i\in \{1,\ldots,n\}:{C}_{i}=c\}| }{n}$. We also recall that in this context, an event Ω is defined by a subset of ${{{{{{{{\mathcal{C}}}}}}}}}^{n}$, and for a state ${\rho }_{{C}^{n}{A}^{n}{E}_{n}{R}_{n}}$ we write ${\Pr }_{\rho }\left[{{\Omega }}\right]={\sum }_{{c}^{n}\in {{\Omega }}}\,{{\mbox{Tr}}}\,\big[{\rho }_{{A}^{n}{E}_{n}{R}_{n},{c}^{n}}\big]$ for the probability of the event Ω and

$$\begin{array}{r}{\rho }_{{C}^{n}{A}^{n}{E}_{n}{R}_{n}| {{\Omega }}}=\frac{1}{{\Pr }_{\rho }\left[{{\Omega }}\right]}\mathop{\sum}\limits_{{c}^{n}\in {{\Omega }}}\left|{c}^{n}\right\rangle {\left\langle {c}^{n}\right|}_{{C}^{n}}\otimes {\rho }_{{A}^{n}{E}_{n}{R}_{n},{c}^{n}}\end{array}$$

(37)

for the state conditioned on Ω. With this, we can finally state the GEAT of³⁶.
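The quantities freq(Cⁿ) and the variance expression inside Equation (36) are straightforward to evaluate for a concrete string of statistics. The sketch below relies on the fact that an affine function on the probability simplex satisfies f(q) = Σ_x q(x) f(δ_x), so it is fully specified by its values on the point distributions δ_x. The alphabet labels and the values of f(δ_x) are hypothetical, and `variance_at` evaluates the variance at one given q rather than performing the maximisation in Equation (36).

```python
from collections import Counter


def freq(c_n, alphabet):
    """Empirical distribution freq(C^n) of a sequence over the given alphabet."""
    counts = Counter(c_n)
    return {c: counts[c] / len(c_n) for c in alphabet}


def affine_f(q, f_delta):
    """Evaluate an affine f via f(q) = sum_x q(x) f(delta_x)."""
    return sum(q[x] * f_delta[x] for x in q)


def variance_at(q, f_delta):
    """Variance of f(delta_X) for X distributed according to q, as in Eq. (36)."""
    mean = affine_f(q, f_delta)
    return sum(q[x] * f_delta[x] ** 2 for x in q) - mean ** 2


alphabet = ("fail", "inc", "ok", "untested")
# Hypothetical values of f on the point distributions (illustration only).
f_delta = {"fail": -1.0, "inc": 0.0, "ok": 1.0, "untested": 0.5}

c_n = ["untested"] * 6 + ["ok"] * 2 + ["inc", "fail"]
q = freq(c_n, alphabet)
f_value = affine_f(q, f_delta)
var_value = variance_at(q, f_delta)
```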

Theorem 9

(GEAT³⁶) Consider a sequence of channels ${{{{{{{{\mathcal{M}}}}}}}}}_{i}\in {{{{{{{\rm{CPTP}}}}}}}}({R}_{i-1}{E}_{i-1},{C}_{i}{A}_{i}{R}_{i}{E}_{i})$ for i ∈ {1, …, n}, where C_i are classical systems with common alphabet ${{{{{{{\mathcal{C}}}}}}}}$ and the sequence $\{{{{{{{{{\mathcal{M}}}}}}}}}_{i}\}$ satisfies Equation (30) and the following no-signalling condition: for each ${{{{{{{{\mathcal{M}}}}}}}}}_{i}$, there exists a channel ${{{{{{{{\mathcal{R}}}}}}}}}_{i}\in {{{{{{{\rm{CPTP}}}}}}}}({E}_{i-1},{E}_{i})$ such that ${{{{{{{{\rm{Tr}}}}}}}}}_{{A}_{i}{R}_{i}{C}_{i}}\circ {{{{{{{{\mathcal{M}}}}}}}}}_{i}={{{{{{{{\mathcal{R}}}}}}}}}_{i}\circ {{{{{{{{\rm{Tr}}}}}}}}}_{{R}_{i-1}}$. Let $\varepsilon \in (0,1),\alpha \in (1,3/2),{{\Omega }}\subset {{{{{{{{\mathcal{C}}}}}}}}}^{n},{\rho }_{{R}_{0}{E}_{0}}\in {{{{{{{\rm{S}}}}}}}}({R}_{0}{E}_{0})$, and f be an affine min-tradeoff function with $h=\mathop{\min }\limits_{{c}^{n}\in {{\Omega }}}f({\mathsf{freq}}({c}^{n}))$. Then,

$$\begin{array}{l}{H}_{\min }^{\varepsilon }{({A}^{n}| {E}_{n})}_{{{{{{{{{\mathcal{M}}}}}}}}}_{n}\circ \cdots \circ {{{{{{{{\mathcal{M}}}}}}}}}_{1}{({\rho }_{{R}_{0}{E}_{0}})}_{| {{\Omega }}}}\ge n\,h-n\,\frac{\alpha -1}{2-\alpha }\,\frac{\ln (2)}{2}{V}^{2}\\ \quad -\frac{g(\varepsilon )+\alpha \log (1/{\Pr }_{{\rho }^{n}}\left[{{\Omega }}\right])}{\alpha -1}-n\,{\left(\frac{\alpha -1}{2-\alpha }\right)}^{2}{K}^{{\prime} }(\alpha )\,,\end{array}$$

(38)

where $\Pr \left[{{\Omega }}\right]$ is the probability of observing event Ω, and

$$g(\varepsilon )=-\log (1-\sqrt{1-{\varepsilon }^{2}})\le \log (2/{\varepsilon }^{2}),$$

(39)

$$V=\log (2{d}_{A}^{2}+1)+\sqrt{2+{\mathsf{Var}}(f)},$$

(40)

$${K}^{{\prime} }(\alpha )= \frac{{(2-\alpha )}^{3}}{6{(3-2\alpha )}^{3}\ln 2}\,{2}^{\frac{\alpha -1}{2-\alpha }(2\log {d}_{A}+{\mathsf{Max}}(f)-{{\mathsf{Min}}}_{{{\Sigma }}}(f))}\\ {\ln }^{3}\left({2}^{2\log {d}_{A}+{\mathsf{Max}}(f)-{{\mathsf{Min}}}_{{{\Sigma }}}(f)}+{e}^{2}\right),$$

(41)

with ${d}_{A}={\max }_{i}\dim ({A}_{i})$.
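The right-hand side of Eq. (38) is straightforward to evaluate numerically. The following Python sketch does so under stated assumptions: all logarithms written as log are taken to base 2, the function and parameter names (`geat_lower_bound`, `pr_omega`, `var_f`, `max_f`, `min_sigma_f`) are illustrative and not from the paper, and g(ε) is computed through the algebraically equivalent form $-\log ({\varepsilon }^{2}/(1+\sqrt{1-{\varepsilon }^{2}}))$ to avoid floating-point cancellation for small ε.

```python
import math


def g(eps: float) -> float:
    """Eq. (39), rewritten using 1 - sqrt(1 - x) = x / (1 + sqrt(1 - x))."""
    x = eps ** 2
    return -math.log2(x / (1.0 + math.sqrt(1.0 - x)))


def V(d_A: int, var_f: float) -> float:
    """Eq. (40); var_f stands for Var(f)."""
    return math.log2(2 * d_A ** 2 + 1) + math.sqrt(2.0 + var_f)


def K_prime(alpha: float, d_A: int, max_f: float, min_sigma_f: float) -> float:
    """Eq. (41); max_f and min_sigma_f stand for Max(f) and Min_Sigma(f)."""
    t = 2 * math.log2(d_A) + max_f - min_sigma_f
    r = (alpha - 1) / (2 - alpha)
    prefactor = (2 - alpha) ** 3 / (6 * (3 - 2 * alpha) ** 3 * math.log(2))
    return prefactor * 2 ** (r * t) * math.log(2 ** t + math.e ** 2) ** 3


def geat_lower_bound(n, h, eps, alpha, pr_omega, d_A, var_f, max_f, min_sigma_f):
    """Right-hand side of Eq. (38) for given parameters."""
    if not (0 < eps < 1 and 1 < alpha < 1.5 and 0 < pr_omega <= 1):
        raise ValueError("need eps in (0,1), alpha in (1,3/2), pr_omega in (0,1]")
    r = (alpha - 1) / (2 - alpha)
    first_order = n * h
    variance_term = n * r * (math.log(2) / 2) * V(d_A, var_f) ** 2
    smoothing_term = (g(eps) + alpha * math.log2(1 / pr_omega)) / (alpha - 1)
    remainder_term = n * r ** 2 * K_prime(alpha, d_A, max_f, min_sigma_f)
    return first_order - variance_term - smoothing_term - remainder_term
```

Since α is a free parameter in (1, 3/2), one would typically maximise the returned value over α for fixed n; the sketch leaves that outer optimisation to the caller.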

We briefly comment on the main differences between the GEAT as stated above and the EAT from¹⁷. The GEAT deals with a sequence of channels ${{{{{{{{\mathcal{M}}}}}}}}}_{i}\in {{{{{{{\rm{CPTP}}}}}}}}({R}_{i-1}{E}_{i-1},{C}_{i}{A}_{i}{R}_{i}{E}_{i})$ that can update both the internal memory register R_i and the side information register E_i (subject to the no-signalling condition), i.e. change these states to e.g. incorporate additional side information obtained in the protocol or account for measurements performed in response to the user’s input. In contrast, the EAT does not allow the side information register to be updated. More formally, the EAT deals with channels ${{{{{{{{\mathcal{M}}}}}}}}}_{i}^{{\prime} }\in {{{{{{{\rm{CPTP}}}}}}}}({R}_{i-1},{C}_{i}{A}_{i}{R}_{i}{I}_{i})$, where I_i is side information produced in each round that cannot be updated in the future. The final side information at the end of such a process is EIⁿ, where E can be any additional side information from the initial state of the process that was never updated during the process. If the side information registers I_i satisfy the Markov condition Aⁱ⁻¹ ↔ Iⁱ⁻¹E ↔ I_i (see ref. ¹⁷ for a more detailed explanation), then the EAT gives a lower bound on ${H}_{\min }^{\varepsilon }{({A}^{n}| {I}^{n}E)}_{{{{{{{{{\mathcal{M}}}}}}}}}_{n}^{{\prime} }\circ \cdots \circ {{{{{{{{\mathcal{M}}}}}}}}}_{1}^{{\prime} }{({\rho }_{{R}_{0}})}_{| {{\Omega }}}}$ similar to the one in Theorem 9.

We can now see at a high level why the EAT cannot be used to deal with prepare-and-measure protocols directly: in a prepare-and-measure protocol, the adversary Eve intercepts the quantum state sent from Alice to Bob in each round and updates her side information based on that. Therefore, any technique used to deal with such protocols must allow for the side information to be updated like in the GEAT; the more restrictive scenario considered in the EAT does not capture this kind of protocol.

We also note that the GEAT is strictly more general than the EAT (see [ref. ³⁶, Section 1] for a proof). Hence, any application that can be treated with the EAT can also be treated with the GEAT (up to some very minor loss in second-order parameters), and the resulting proofs are often much more straightforward; see [ref. ³⁶, Section 5.2] for an example.

Proof of main theorem

In this section, we prove our main result, Theorem 4, i.e. we show that the protocol in Box 1 is correct and secret.

Proof of Theorem 4

For the correctness statement, we need to show that $\Pr [K\, \ne \, \hat{K}\wedge {{{{{{{\mbox{not \,abort}}}}}}}}]\le {\varepsilon }_{{{{{{{{\rm{KV}}}}}}}}}$. To see that this is the case, we note that due to the check in Step (5), the protocol not aborting implies that ${{{{{{{\rm{HASH}}}}}}}}({S}^{n})={{{{{{{\rm{HASH}}}}}}}}({\hat{S}}^{n})$. Furthermore, from Step (7) we see that $K\, \ne \, \hat{K}$ implies that ${S}^{n} \ne \, {\hat{S}}^{n}$. Therefore, it suffices to show that

$$\begin{array}{r}\Pr \left[{S}^{n}\, \ne \, {\hat{S}}^{n}\wedge {{{{{{{\rm{HASH}}}}}}}}({S}^{n})={{{{{{{\rm{HASH}}}}}}}}({\hat{S}}^{n})\right]\le {\varepsilon }_{{{{{{{{\rm{KV}}}}}}}}}.\end{array}$$

(42)

Since Alice chooses the function HASH at random from a universal hash family, this follows directly from Definition 5 and completes the correctness proof.

The remainder of the proof will be concerned with the secrecy condition. As explained in “Results” subsection “Modelling Eve’s attack”, assuming Condition 1 we can model a general attack by a sequence of channels

$$\begin{array}{r}{{{{{{{{\mathcal{A}}}}}}}}}_{i}:{E}_{i-1}^{{\prime} }{Q}_{i}\to {E}_{i}^{{\prime} }{Q}_{i}.\end{array}$$

(43)

Alice, Bob, and Eve’s joint final state at the end of the protocol therefore contains systems

$$\begin{array}{r}{U}^{n}{V}^{n}{I}^{n}{S}^{n}{\hat{S}}^{n}{\hat{C}}^{n}K\hat{K}{E}_{n}^{{\prime} }{E}^{{\prime} }.\end{array}$$

(44)

Here, ${E}_{n}^{{\prime} }$ is Eve’s system after using the maps ${{{{{{{{\mathcal{A}}}}}}}}}_{1},\ldots,{{{{{{{{\mathcal{A}}}}}}}}}_{n}$; ${E}^{{\prime} }$ stores the additional classical information published after Step (4), i.e., the error correction information EC, a description of the hash function HASH, the hash value HASH(Sⁿ), and the seed μ; and the other systems are labelled as in Box 1. This means that Eve’s full side information is given by ${I}^{n}{E}_{n}^{{\prime} }{E}^{{\prime} }$. Throughout the proof, we will denote the final state at the end of the protocol by ${\rho }_{{U}^{n}{V}^{n}{I}^{n}{S}^{n}{\hat{S}}^{n}{C}^{n}K\hat{K}{E}_{n}^{{\prime} }{E}^{{\prime} }}$.

By Definition 3, we need to show that

$$\begin{array}{rcl}{\left\|{\rho }_{K{I}^{n}{E}_{n}^{{\prime} }{E}^{{\prime} }\wedge {{\Omega }}}-{\tau }_{K}\otimes {\rho }_{{I}^{n}{E}_{n}^{{\prime} }{E}^{{\prime} }\wedge {{\Omega }}}\right\|}_{1}\le \max \{{\varepsilon }_{{{{{{{{\rm{PA}}}}}}}}}+4\,{\varepsilon }_{s},2\,{\varepsilon }_{a}\}+2\,{\varepsilon }_{{{{{{{{\rm{KV}}}}}}}}},\end{array}$$

(45)

where Ω is the event that the protocol does not abort and τ_K is the maximally mixed state on system K of dimension ∣K∣ = 2^l. Since the protocol’s final state arises by application of a strong extractor in Step (7), we can reduce Eq. (45) to an entropic statement. This step requires careful technical treatment because the statistical check in Step (6) uses the systems ${\hat{C}}^{n}$, which are computed from ${\hat{S}}^{n}$. However, ${\hat{S}}^{n}$ is Bob’s guess for Alice’s string Sⁿ and depends on the global error correction information EC, i.e., it cannot be generated in a round-by-round manner as required for the GEAT. The intuition for circumventing this issue is as follows: if ${\hat{S}}^{n}\, \ne \, {S}^{n}$, then the protocol is likely to abort anyway because of Step (5); on the other hand, if ${\hat{S}}^{n}={S}^{n}$, then we can replace ${\hat{S}}^{n}$ by Sⁿ, and the latter is generated in a round-by-round manner. Following this intuition, we can show that the entropy bound in Claim 10 implies Theorem 4. We give a formal proof of this step in Supplementary Note A and continue here with proving the required entropy bound. We also note that for protocols that include a separate parameter estimation step rather than using Bob’s guess for Alice’s raw key, Claim 10 implies Theorem 4 almost immediately. □

Claim 10

Let Ω_C be the event that CA(freq(Cⁿ)) ≥ k_CA (i.e. the statistical check (Step (6)) passes using the values Cⁿ). Continuing with the notation from before, for any α ∈ (1, 3/2):

$${H}_{\min }^{{\varepsilon }_{s}}{({S}^{n}| {I}^{n}{C}^{n}{E}_{n}^{{\prime} })}_{{\rho }_{| {{{\Omega }}}_{C}}}\ge n\,{k}_{{{{{{{{\rm{CA}}}}}}}}}-n\,\frac{\alpha -1}{2-\alpha }\,\frac{\ln (2)}{2}{V}^{2}-\frac{g({\varepsilon }_{s})+\alpha \log (1/\Pr \left[{{{\Omega }}}_{C}\right])}{\alpha -1}\\ -n\,{\left(\frac{\alpha -1}{2-\alpha }\right)}^{2}{K}^{{\prime} }(\alpha ),$$

(46)

with g(ε_s), V, and ${K}^{{\prime} }(\alpha )$ as in Theorem 9.

Proof.

To make use of the GEAT, we need to write ${\rho }_{{S}^{n}{I}^{n}{C}^{n}{E}_{n}^{{\prime} }| {{{\Omega }}}_{C}}$ as the result of a sequential application of a quantum channel. For this we fix an attack ${{{{{{{{\mathcal{A}}}}}}}}}_{1},\ldots,{{{{{{{{\mathcal{A}}}}}}}}}_{n}$ and define

$$\begin{array}{r}{{{{{{{{\mathcal{M}}}}}}}}}_{i}:{E}_{i-1}^{{\prime} }\to {S}_{i}{I}_{i}{C}_{i}{E}_{i}^{{\prime} }\end{array}$$

(47)

as the following channel: given a quantum system ${\omega }_{{E}_{i-1}^{{\prime} }}$,

(i) create the state ${\psi }_{{U}_{i}{Q}_{i}}$ (defined in Step (1) of Box 1),
(ii) apply the attack map ${{{{{{{{\mathcal{A}}}}}}}}}_{i}:{Q}_{i}{E}_{i-1}^{{\prime} }\to {Q}_{i}{E}_{i}^{{\prime} }$ to ${\psi }_{{U}_{i}{Q}_{i}}\otimes {\omega }_{{E}_{i-1}^{{\prime} }}$,
(iii) measure ${\{{N}^{(v)}\}}_{v\in {{{{{{{\mathcal{V}}}}}}}}}$ on system Q_i and store the result in register V_i,
(iv) set I_i = PD(U_i, V_i),
(v) set S_i = RK(U_i, I_i),
(vi) set C_i = EV(V_i, I_i, S_i),
(vii) trace out registers U_i and V_i.
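The classical part of this channel, Steps (iv) to (vii), together with the frequency distribution freq(Cⁿ), can be illustrated with a short Python sketch. The concrete rules for PD, RK, and EV below are hypothetical BB84-like choices made only for illustration (Box 1 leaves these functions general), and the quantum Steps (i) to (iii) are not simulated: the pair (u, v) is assumed to be given.

```python
from collections import Counter

# Hypothetical toy rules: u = (a, x) is Alice's (basis, bit),
# v = (b, y) is Bob's (basis, outcome). Basis 0 generates key, basis 1 tests.

def PD(u, v):
    """Public discussion: announce both bases, and Alice's bit in test basis."""
    (a, x), (b, _y) = u, v
    return (a, b, x if a == 1 else None)

def RK(u, i):
    """Raw key: keep Alice's bit only if both parties used basis 0."""
    _a, x = u
    a_pub, b_pub, _ = i
    return x if (a_pub, b_pub) == (0, 0) else None

def EV(v, i, s):
    """Evaluation: compare Bob's outcome with the announced bit in test rounds."""
    _b, y = v
    a_pub, b_pub, x_pub = i
    if (a_pub, b_pub) == (1, 1):
        return "pass" if y == x_pub else "fail"
    return "none"

def classical_round(u, v):
    """Steps (iv)-(vii): compute (S_i, I_i, C_i) and discard U_i, V_i."""
    i = PD(u, v)
    s = RK(u, i)
    c = EV(v, i, s)
    return s, i, c

def freq(cs):
    """Empirical distribution freq(C^n) of a list of observed values."""
    n = len(cs)
    return {c: k / n for c, k in Counter(cs).items()}
```

For instance, `freq([classical_round(u, v)[2] for u, v in rounds])` gives the statistics that the check in Step (6) would be applied to, for any list `rounds` of sampled (u, v) pairs.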

Comparing the steps of the protocol and Supplementary Eq. (1) with this definition of ${{{{{{{{\mathcal{M}}}}}}}}}_{i}$, we see that the marginal of ρ on systems ${S}^{n}{I}^{n}{C}^{n}{E}_{n}^{{\prime} }$ is the same as the output of the maps ${{{{{{{{\mathcal{M}}}}}}}}}_{i}$:

$$\begin{array}{r}{\rho }_{{S}^{n}{I}^{n}{C}^{n}{E}_{n}^{{\prime} }}={{{{{{{{\mathcal{M}}}}}}}}}_{n}\circ \cdots \circ {{{{{{{{\mathcal{M}}}}}}}}}_{1}({\omega }_{{E}_{0}^{{\prime} }}),\end{array}$$

(48)

where ${\omega }_{{E}_{0}^{{\prime} }}$ is the initial state of Eve’s side information (which can be chosen to be trivial without loss of generality as explained in Results Subsection “Modelling Eve’s attack”). If we define the systems ${E}_{i}={I}^{i}{C}^{i}{E}_{i}^{{\prime} }$, then by suitable tensoring with the identity map and copying the register C_i we can view ${{{{{{{{\mathcal{M}}}}}}}}}_{i}$ as a map

$$\begin{array}{r}{\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}:{E}_{i-1}\to {S}_{i}{E}_{i}{C}_{i}.\end{array}$$

(49)

With this we can also express the final state (which technically now includes two copies of Cⁿ, one explicit and one part of E_n) as

$$\begin{array}{r}{\rho }_{{S}^{n}{E}_{n}{C}^{n}}={\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{n}\circ \cdots \circ {\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{1}({\omega }_{{E}_{0}}).\end{array}$$

(50)

With this notation, the entropy on the l.h.s. of Equation (46) can be written as

$${H}_{\min }^{{\varepsilon }_{s}}{({S}^{n}| {I}^{n}{C}^{n}{E}_{n}^{{\prime} })}_{{\rho }_{| {{{\Omega }}}_{C}}}={H}_{\min }^{{\varepsilon }_{s}}{({S}^{n}| {E}_{n})}_{{\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{n}\circ \cdots \circ {\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{1}{({\omega }_{{E}_{0}})}_{| {{{\Omega }}}_{C}}}.$$

(51)

We want to apply Theorem 9 to derive the desired lower bound in Eq. (46). For this, we first need to check that the required conditions on the maps ${\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}$ are satisfied. The condition in Eq. (30) is clearly satisfied as the systems C_i are themselves included in the conditioning system ${E}_{n}$. The no-signalling condition in Theorem 9 is also trivially satisfied in this case since there is no system R_i.

We now need to argue that the collective attack bound ${{{{{{{\rm{CA}}}}}}}}:{\mathbb{P}}({{{{{{{\mathcal{C}}}}}}}})\to {\mathbb{R}}$ used as an argument in Box 1 is a min-tradeoff function for the maps $\{{\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}\}$. By Definition 8, we need to show that for any i, attack ${{{{{{{{\mathcal{A}}}}}}}}}_{i}:{Q}_{i}{E}_{i-1}^{{\prime} }\to {Q}_{i}{E}_{i}^{{\prime} }$ (in the definition of ${\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}$, see Step (ii)), and state ${\omega }_{{E}_{i-1}{\tilde{E}}_{i-1}}^{i-1}$ (where ${\tilde{E}}_{i-1}\equiv {E}_{i-1}$), the following holds:

$$\begin{array}{r}{{{{{{{\rm{CA}}}}}}}}({\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}{({\omega }^{i-1})}_{{C}_{i}})\, \le \, H{({S}_{i}| {E}_{i}{\tilde{E}}_{i-1})}_{{\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}({\omega }^{i-1})}.\end{array}$$

(52)

For the rest of the proof, we fix an arbitrary choice of i, ωⁱ⁻¹, and ${{{{{{{{\mathcal{A}}}}}}}}}_{i}$. To relate Eq. (52) to the definition of collective attack bounds (“Definition 2”), we construct a collective attack ${{{{{{{{\mathcal{A}}}}}}}}}^{{\prime} }:{Q}_{i}\to {Q}_{i}{E}_{i}{\tilde{E}}_{i-1}$ such that

$$\begin{array}{r}{\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}{({\omega }^{i-1})}_{{S}_{i}{C}_{i}{E}_{i}{\tilde{E}}_{i-1}}={\nu }_{{S}_{i}{C}_{i}{E}_{i}{\tilde{E}}_{i-1}},\end{array}$$

(53)

where ν is defined as in Definition 2, i.e. ν is the state produced by running a single round of the protocol in Box 1 with the attack ${{{{{{{{\mathcal{A}}}}}}}}}^{{\prime} }$. Of course, ${{{{{{{{\mathcal{A}}}}}}}}}^{{\prime} }$ will depend on i, ωⁱ⁻¹, and ${{{{{{{{\mathcal{A}}}}}}}}}_{i}$. This is not a problem since Definition 2 holds for any collective attack, i.e., to show that Eq. (52) holds for any i, ωⁱ⁻¹, and ${{{{{{{{\mathcal{A}}}}}}}}}_{i}$, we can first fix an arbitrary choice, construct a “custom” collective attack that shows Eq. (52) for that choice, and then apply the condition in Definition 2 to that choice.

It is easy to check that Eq. (53) is satisfied for the following choice of ${{{{{{{{\mathcal{A}}}}}}}}}^{{\prime} }$: given a state ${\sigma }_{Q}$, ${{{{{{{{\mathcal{A}}}}}}}}}^{{\prime} }$ first creates the (fixed) state ${\omega }_{{E}_{i-1}{\tilde{E}}_{i-1}}^{i-1}$ and then applies the (fixed) attack ${{{{{{{{\mathcal{A}}}}}}}}}_{i}$ to ${\sigma }_{Q}\otimes {\omega }_{{E}_{i-1}{\tilde{E}}_{i-1}}^{i-1}$ (with Q_i = Q).

Then, since CA is a collective attack bound, Eq. (52) follows from Definition 2:

$${{{{{{{\rm{CA}}}}}}}}({\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}{({\omega }^{i-1})}_{{C}_{i}})={{{{{{{\rm{CA}}}}}}}}({\nu }_{{C}_{i}})\le H{({S}_{i}| {E}_{i}{\tilde{E}}_{i-1}{C}_{i})}_{\nu }=H{({S}_{i}| {E}_{i}{\tilde{E}}_{i-1})}_{{\tilde{{{{{{{{\mathcal{M}}}}}}}}}}_{i}({\omega }^{i-1})}.$$

(54)

Compared to Definition 2, we have dropped the explicit conditioning on I ≔ I_i since I_i is already part of E_i, and in the last equality we can drop C_i since it is also part of E_i.

This means that the function CA is a min-tradeoff function for the protocol in Box 1. By definition, for any cⁿ ∈ Ω_C, CA(freq(cⁿ)) ≥ k_CA, so that h ≥ k_CA in Theorem 9 with f = CA. Hence, Claim 10 follows by applying Theorem 9. □

Having proved correctness and secrecy, we turn our attention to the completeness of the protocol in Box 1, i.e. we need to bound the probability that the protocol aborts when Eve does not interfere in the protocol, but the channel between Alice and Bob may be noisy. In the protocol, Alice sends a quantum system Q to Bob. If the channel connecting Alice and Bob is noisy, instead of Alice’s and Bob’s joint state in each round being ψ_UQ, the joint state is ${{{{{{{\mathcal{N}}}}}}}}({\psi }_{UQ})$ for some channel ${{{{{{{\mathcal{N}}}}}}}}:Q\to Q$. This channel ${{{{{{{\mathcal{N}}}}}}}}$ describes the noise model for Box 1. Note that the channel ${{{{{{{\mathcal{N}}}}}}}}$ is not something that needs to be added explicitly to the description of Box 1: formally, ${{{{{{{\mathcal{N}}}}}}}}$ can be viewed as Eve’s attack, i.e. we can model the implementation of the protocol in Box 1 with a noisy channel and honest Eve by saying that Eve’s attack is described by ${{{{{{{\mathcal{N}}}}}}}}$. This also means that when we proved correctness and secrecy, we only needed to prove this for any behaviour of Eve, not any noise model, since the noise model can be included in Eve’s actions.

For a given noise model ${{{{{{{\mathcal{N}}}}}}}}$, we need to choose the length of the error correction string λ_EC to be sufficiently long such that Bob’s guess ${\hat{S}}^{n}$ for Alice’s raw key Sⁿ is correct with high probability, and as a consequence the check in Step (5) passes. Furthermore, we need to choose the threshold k_CA to be sufficiently low that an honest noisy state passes Step (6) with high probability. The precise choice of parameters can be worked out using the properties of the error correcting code in Step (4) and statistical tail bounds for Step (6). We provide the details in Supplementary Note B.

Deriving collective attack bounds

Our main result, Theorem 4, turns an affine collective attack bound (defined in “Definition 2”) into a security statement against general attacks. Therefore, the main step one has to perform to use our framework is finding such an affine collective attack bound for a protocol of interest. In this section, we give a numerical method for finding collective attack bounds for the protocol in Box 1 based on ideas from refs. ^7,47. Combined with Theorem 4, this means that the problem of finding key rate bounds against general attacks for any instance of the protocol in Box 1 is reduced to a numerical computation.

We begin by noting that we can rewrite the condition Eq. (8) from “Definition 2” as follows: for any probability distribution ${\nu }_{C}^{*}\in {\mathbb{P}}({{{{{{{\mathcal{C}}}}}}}})$ we require that

$$\begin{array}{r}\mathop{\inf }\limits_{\nu {{{{{{{\rm{s.t.}}}}}}}}\,{\nu }_{C}={\nu }_{C}^{*}}H{(S| IEC)}_{\nu }\ge {{{{{{{\rm{CA}}}}}}}}({\nu }_{C}^{*}),\end{array}$$

(55)

where the infimum is over all states ν that can result from a collective attack and have statistics ${\nu }_{C}^{*}$ (and the infimum is infinite if there is no such state). In the language of the GEAT, a collective attack bound is essentially a min-tradeoff function for a certain sequence of maps associated with Box 1. More details on how a collective attack bound serves as a min-tradeoff function can be found in the proof of Claim 10.

Since we are interested in an affine lower bound, we write the probability distribution ν_C as a probability vector ${\overrightarrow{\nu }}_{C}$ and, following^12,48, make the ansatz

$$\begin{array}{r}{{{{{{{\rm{CA}}}}}}}}({\overrightarrow{\nu }}_{C})=\overrightarrow{\lambda }\cdot {\overrightarrow{\nu }}_{C}+{c}_{\overrightarrow{\lambda }}\end{array}$$

(56)

for some vector $\overrightarrow{\lambda }$ of the same dimension as ${\overrightarrow{\nu }}_{C}$ and a constant ${c}_{\overrightarrow{\lambda }}$. We treat $\overrightarrow{\lambda }$ as a parameter that will be chosen heuristically. For example, one can choose $\overrightarrow{\lambda }$ by numerically estimating the gradient of the function ${\nu }_{C}^{{\prime} }\mapsto \mathop{\inf }\limits_{\nu {{{{{{{\rm{s.t.}}}}}}}}\,{\nu }_{C}={\nu }_{C}^{{\prime} }}H{(S| IEC)}_{\nu }$ around a particular choice of classical statistics ${\nu }_{C}^{*}$ that has been observed in an experimental realisation of the protocol, although this choice is not necessarily optimal and $\overrightarrow{\lambda }$ should be numerically optimised if one wants to obtain the best possible key rates.

Having chosen $\overrightarrow{\lambda }$ heuristically, we need to compute a value of ${c}_{\overrightarrow{\lambda }}$ that ensures that $\overrightarrow{\lambda }\cdot {\overrightarrow{\nu }}_{C}+{c}_{\overrightarrow{\lambda }}$ is a valid min-tradeoff function. Inserting our ansatz into Equation (8), we see that for any fixed $\overrightarrow{\lambda }$, a valid choice of ${c}_{\overrightarrow{\lambda }}$ is one that satisfies

$$\begin{array}{r}{c}_{\overrightarrow{\lambda }}\le \mathop{\inf }\limits_{\nu }H{(S| IEC)}_{\nu }-\overrightarrow{\lambda }\cdot {\overrightarrow{\nu }}_{C}.\end{array}$$

(57)

The infimum here is taken over the states ν described in “Definition 2”. To avoid confusion, we emphasise that the infimum here is taken over all such states ν, not just ones with a specific classical distribution ${\nu }_{C}^{*}$ as considered in Eq. (55). As explained in ref. ¹², one can view the optimisation in Eq. (57) as arising from the Lagrange dual of Eq. (55), but we will not make use of this relation here explicitly.
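The logic of the ansatz in Eq. (56) and the condition in Eq. (57) can be illustrated on a toy example. The Python sketch below assumes, purely for illustration, that the statistics have two outcomes with ν_C = (1 − q, q) and that the constrained infimum in Eq. (55) is given by the textbook BB84 expression 1 − h(q), where h is the binary entropy; this stand-in curve is not derived with the method of this section. The slope λ is estimated by a finite difference around an observed q₀, and c_λ is estimated by minimising over a grid. A grid minimum is only an estimate of the infimum, not a certified bound.

```python
import math


def h2(q: float) -> float:
    """Binary entropy in bits."""
    if q <= 0.0 or q >= 1.0:
        return 0.0
    return -q * math.log2(q) - (1 - q) * math.log2(1 - q)


def entropy_curve(q: float) -> float:
    """Toy stand-in for the infimum in Eq. (55), assumed to be 1 - h(q)."""
    return 1.0 - h2(q)


def affine_bound_at(q0: float, step: float = 1e-6, grid_size: int = 10000):
    """Heuristic lambda (finite-difference gradient) and matching c_lambda."""
    slope = (entropy_curve(q0 + step) - entropy_curve(q0 - step)) / (2 * step)
    lam = (0.0, slope)  # coefficients acting on nu_C = (1 - q, q)
    # Eq. (57): c_lambda <= inf over states of [entropy - lambda . nu_C],
    # approximated here by a minimum over a grid of q values.
    c = min(entropy_curve(k / grid_size) - slope * (k / grid_size)
            for k in range(grid_size + 1))
    return lam, c


def CA(nu, lam, c):
    """Affine ansatz of Eq. (56)."""
    return sum(l * p for l, p in zip(lam, nu)) + c
```

Because the toy curve is convex, the resulting affine function is its tangent at q₀: it never exceeds the curve (up to the grid resolution) and touches it at the observed statistics, which is the situation in which an affine bound loses nothing at first order.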

To tackle this optimisation problem, we consider an entanglement-based version of the protocol in Box 1 using the source-replacement scheme explained in ref. ⁶. As explained in the introduction, switching to an entanglement-based version of a prepare-and-measure protocol generally requires introducing “artificial” constraints on Eve’s actions. These artificial constraints are troublesome when applying the EAT to the entanglement-based version, but here we take a different approach: we only use the entanglement-based version to derive a collective attack bound (for which the artificial constraints do not present a problem). This collective attack bound also applies to the original prepare-and-measure protocol and in Theorem 4 we apply the GEAT with this collective attack bound to the prepare-and-measure protocol directly. We emphasise that the method for deriving a collective attack bound and our Theorem 4 are entirely independent: Theorem 4 does not depend on how the collective attack bound was derived and does not make use of an entanglement-based protocol itself.

In Box 1 Alice prepares the state

$$\begin{array}{r}{\psi }_{UQ}=\mathop{\sum}\limits_{u}p(u)\left|u\right\rangle \left\langle u\right|\otimes \left|\psi \right\rangle {\left\langle \psi \right|}_{Q| u}\end{array}$$

(58)

and sends system Q to Bob. It is clear that Alice could equivalently prepare the state

$$\begin{array}{r}{\left|\tilde{\psi }\right\rangle }_{PQ}=\mathop{\sum}\limits_{u}\sqrt{p(u)}{\left|u\right\rangle }_{P}\otimes {\left|\psi \right\rangle }_{Q| u},\end{array}$$

(59)

send system Q to Bob, and only afterwards measure her own system P in the computational basis, storing the outcome in system U. Eve would now apply her collective attack ${{{{{{{\mathcal{A}}}}}}}}:Q\to QE$ to system Q of $\tilde{\psi }$, so the state after Eve’s attack would be ${\tilde{\psi }}_{PQE}$. We can replace this attack by giving Eve the ability to prepare a state ${\hat{\psi }}_{PQE}$ directly and distribute P and Q to Alice and Bob, respectively. This kind of attack clearly gives Eve more power. In fact, it gives Eve too much power: in order to still obtain a good key rate, we need to enforce the additional constraint that Alice’s marginal of the state $\hat{\psi }$ is the same as her marginal of the state $\tilde{\psi }$ she would have prepared herself, i.e. ${\hat{\psi }}_{P}={\tilde{\psi }}_{P}$. It is easy to see that even with this additional constraint, this latter kind of attack is still at least as general as any collective attack on the prepare-and-measure protocol described before. Note that the condition ${\hat{\psi }}_{P}={\tilde{\psi }}_{P}$ is not a physical constraint that Alice checks in an actual protocol, but rather the aforementioned additional artificial constraint. Nonetheless, we can impose this artificial constraint on the optimisation problem used to calculate the collective attack bound.
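The source-replacement construction can be checked numerically on a small example. The following NumPy sketch builds the purified state of Eq. (59), computes Alice's marginal on P (the fixed matrix that enters the artificial constraint), and confirms that measuring P in the computational basis reproduces the original ensemble. The four BB84 states and the uniform distribution p(u) are an illustrative choice, and the function names are not from the paper.

```python
import numpy as np


def source_replacement_state(p, states):
    """|psi~>_{PQ} = sum_u sqrt(p(u)) |u>_P (x) |psi_u>_Q, as in Eq. (59)."""
    dim_p = len(p)
    basis = np.eye(dim_p)
    return sum(np.sqrt(p[u]) * np.kron(basis[u], states[u]) for u in range(dim_p))


def marginal_on_P(psi, dim_p, dim_q):
    """Reduced state on P; entry (u, u') is sqrt(p(u) p(u')) <psi_u'|psi_u>."""
    m = psi.reshape(dim_p, dim_q)  # row u holds sqrt(p(u)) * psi_u
    return m @ m.conj().T


def measure_P(psi, u, dim_p, dim_q):
    """Probability of outcome u on P and the resulting normalised state on Q."""
    vec = psi.reshape(dim_p, dim_q)[u]
    prob = float(np.vdot(vec, vec).real)
    return prob, np.outer(vec, vec.conj()) / prob


# Illustrative ensemble: the four BB84 states with uniform probabilities.
p = [0.25, 0.25, 0.25, 0.25]
states = [
    np.array([1, 0], dtype=complex),
    np.array([0, 1], dtype=complex),
    np.array([1, 1], dtype=complex) / np.sqrt(2),
    np.array([1, -1], dtype=complex) / np.sqrt(2),
]
psi_tilde = source_replacement_state(p, states)
rho_P = marginal_on_P(psi_tilde, 4, 2)
```

The off-diagonal entries of `rho_P` encode the overlaps between the signal states, which is why fixing Alice's marginal constrains Eve in a non-trivial way when the states are non-orthogonal.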

For a fixed instance of Box 1, we can now view the state ν in “Definition 2” as a function of ${\hat{\psi }}_{PQE}$:

$${\nu }_{ESIC}(\hat{\psi })=\mathop{\sum}\limits_{u,v}{{{\mbox{Tr}}}}_{PQ}\left[\left|u\right\rangle {\left\langle u\right|}_{P}\otimes {N}_{Q}^{(v)}{\hat{\psi }}_{PQE}\right]\otimes \left|{{{{{{{\rm{RK}}}}}}}}(u,i)\right\rangle {\left\langle \right|}_{S}\otimes \left|{{{{{{{\rm{PD}}}}}}}}(u,v)\right\rangle {\left\langle \right|}_{I}\otimes \left|{{{{{{{\rm{EV}}}}}}}}(v,i,s)\right\rangle {\left\langle \right|}_{C}.$$

(60)

Here, $\left|{{{{{{{\rm{RK}}}}}}}}(u,{{{{{{{\rm{PD}}}}}}}}(u,v))\right\rangle \left\langle \,\right|$ is shorthand for the projector $\left|{{{{{{{\rm{RK}}}}}}}}(u,{{{{{{{\rm{PD}}}}}}}}(u,v))\right\rangle \left\langle {{{{{{{\rm{RK}}}}}}}}(u,{{{{{{{\rm{PD}}}}}}}}(u,v))\right|$, and i and s are shorthand for PD(u, v) and RK(u, i), respectively. We can therefore write the optimisation problem from Equation (57) as

$$\mathop{\inf }\limits_{{\hat{\psi }}_{PQE}}H{(S| IEC)}_{\nu }-\overrightarrow{\lambda }\cdot {\overrightarrow{\nu }}_{C}$$

(61)

$${{{{{{{\rm{s.t.}}}}}}}}\,{\hat{\psi }}_{PQE}\ge 0,\quad \,{{\mbox{Tr}}}\,\left[\hat{\psi }_{PQE}\right]=1,\quad {\hat{\psi }}_{P}={\tilde{\psi }}_{P},$$

(62)

where $\nu=\nu (\hat{\psi })$, and without loss of generality we can restrict the optimisation to pure states on PQE with E ≡ PQ.

A lot of work in QKD has been focused on numerical methods for this kind of optimisation problem (see e.g. refs. ^6,7,13,49,50). The key difficulty is that we need a lower bound on the infimum of a concave function $H{(S| IEC)}_{\nu (\hat{\psi })}$. Here we use a method from refs. ^7,47 to turn this optimisation problem into a convex one. As a first step, we observe that in the definition of ν we can incorporate the classical functions RK, PD, and EV into Alice’s and Bob’s measurements by defining

$$\begin{array}{r}{M}_{PQ}^{(s,i,c)}=\mathop{\sum}\limits_{u,v:\,\left\{\begin{array}{c}{{{{{{{\rm{RK}}}}}}}}(u,i)=s,\\ {{{{{{{\rm{PD}}}}}}}}(u,v)=i,\\ {{{{{{{\rm{EV}}}}}}}}(v,i,s)=c\end{array}\right.}\left|u\right\rangle {\left\langle u\right|}_{P}\otimes {N}_{Q}^{(v)}.\end{array}$$

(63)

Then, we can write ν_ESIC as

$$\begin{array}{r}\nu=\mathop{\sum}\limits_{s,i,c}{{{\mbox{Tr}}}}_{PQ}\left[{M}_{PQ}^{(s,i,c)}{\hat{\psi }}_{PQE}\right]\otimes \left|s\right\rangle {\left\langle s\right|}_{S}\otimes \left|i\right\rangle {\left\langle i\right|}_{I}\otimes \left|c\right\rangle {\left\langle c\right|}_{C}.\end{array}$$

(64)
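The coarse-graining in Eq. (63) amounts to grouping the fine-grained operators |u⟩⟨u| ⊗ N^(v) by the classical labels they lead to. A minimal NumPy sketch of this grouping follows; the BB84-like measurement and the concrete rules for PD, RK, and EV are hypothetical choices for illustration, and `coarse_grained_povm` is not a function from the paper.

```python
import numpy as np
from collections import defaultdict


def coarse_grained_povm(dim_p, N, PD, RK, EV):
    """Return {(s, i, c): M^(s,i,c)} on P (x) Q, following Eq. (63).

    N maps each outcome label v to Bob's POVM element N^(v) on Q.
    """
    dim_q = next(iter(N.values())).shape[0]
    dim = dim_p * dim_q
    M = defaultdict(lambda: np.zeros((dim, dim), dtype=complex))
    for u in range(dim_p):
        proj_u = np.zeros((dim_p, dim_p))
        proj_u[u, u] = 1.0  # |u><u| on Alice's register P
        for v, N_v in N.items():
            i = PD(u, v)
            s = RK(u, i)
            c = EV(v, i, s)
            M[(s, i, c)] += np.kron(proj_u, N_v)
    return dict(M)


# Hypothetical toy instance: u in {0,1,2,3} encodes (basis, bit) = divmod(u, 2),
# and Bob measures in a uniformly random basis, v = (basis, outcome).
kets = {
    (0, 0): np.array([1, 0]), (0, 1): np.array([0, 1]),
    (1, 0): np.array([1, 1]) / np.sqrt(2), (1, 1): np.array([1, -1]) / np.sqrt(2),
}
N = {v: 0.5 * np.outer(k, k.conj()) for v, k in kets.items()}

def PD(u, v):
    a, x = divmod(u, 2)
    return (a, v[0], x if a == 1 else None)  # announce bases, and bit if testing

def RK(u, i):
    return u % 2 if (i[0], i[1]) == (0, 0) else None  # key only from basis 0

def EV(v, i, s):
    if (i[0], i[1]) == (1, 1):
        return "pass" if v[1] == i[2] else "fail"
    return "none"

M = coarse_grained_povm(4, N, PD, RK, EV)
```

Since every pair (u, v) is assigned to exactly one label (s, i, c), the operators returned form a valid POVM on PQ whenever the N^(v) form a POVM on Q.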

Remembering that we can assume that ${\hat{\psi }}_{PQE}$ is pure, we now define the pure state

$$\left|{\nu }^{1}\right\rangle=\mathop{\sum}\limits_{s,i,c}\sqrt{{M}_{PQ}^{(s,i,c)}}{\left|\hat{\psi }\right\rangle }_{PQE}{\left|s\right\rangle }_{S}{\left|i\right\rangle }_{I}{\left|i\right\rangle }_{{I}^{{\prime} }}{\left|c\right\rangle }_{C}{\left|c\right\rangle }_{{C}^{{\prime} }}.$$

(65)

We observe that

$$\begin{array}{r}{\nu }_{EIC}={\nu }_{EIC}^{1}.\end{array}$$

(66)

Following the proof of [ref. ⁴⁷, Theorem 1], a direct calculation shows that

$$\begin{array}{r}H{(S| IEC)}_{\nu }=D\left({\nu }_{PQSIC}^{1}\,\big\| \,{{{{{{{{\mathcal{P}}}}}}}}}_{S}({\nu }_{PQSIC}^{1})\right),\end{array}$$

(67)

where ${{{{{{{{\mathcal{P}}}}}}}}}_{S}$ is the pinching map ${{{{{{{{\mathcal{P}}}}}}}}}_{S}({\nu }^{1})={\sum }_{s\in {{{{{{{\mathcal{S}}}}}}}}}\left|s\right\rangle {\left\langle s\right|}_{S}{\nu }^{1}\left|s\right\rangle {\left\langle s\right|}_{S}$. We can view ${\nu }_{PQSIC}^{1}$ as a linear function of ${\hat{\psi }}_{PQ}$:

$${\nu }_{PQSIC}^{1}({\hat{\psi }}_{PQ})=\mathop{\sum}\limits_{s,{s}^{{\prime} },i,c}\sqrt{{M}_{PQ}^{(s,i,c)}}{\hat{\psi }}_{PQ}\sqrt{{M}_{PQ}^{({s}^{{\prime} },i,c)}} \otimes \left|s\right\rangle {\left\langle {s}^{{\prime} }\right|}_{S}\otimes \left|i\right\rangle {\left\langle i\right|}_{I}\otimes \left|c\right\rangle {\left\langle c\right|}_{C}.$$

(68)

Furthermore, the relative entropy is jointly convex. Therefore, for a given $\overrightarrow{\lambda }$, a valid choice for ${c}_{\overrightarrow{\lambda }}$ can be found by solving the following convex optimisation problem:

$${c}_{\overrightarrow{\lambda }}=\mathop{\inf }\limits_{{\hat{\psi }}_{PQ}}D\left({\nu }_{PQSIC}^{1}\,\big\| \,{{{{{{{{\mathcal{P}}}}}}}}}_{S}({\nu }_{PQSIC}^{1})\right)-\overrightarrow{\lambda }\cdot {\overrightarrow{\nu }}_{C}$$

(69)

$${{{{{{{\rm{s.t.}}}}}}}}\,{\hat{\psi }}_{PQ}\ge 0,\quad \,{{\mbox{Tr}}}\,\left[{\hat{\psi }}_{PQ}\right]=1,\quad {\hat{\psi }}_{P}={\tilde{\psi }}_{P},$$

(70)

where ${\nu }_{PQSIC}^{1}$ and ν_C are linear functions of ${\hat{\psi }}_{PQ}$. To solve this optimisation problem, we can use standard techniques from convex optimisation. In particular, in refs. ^41,51,52 techniques have been developed to bound the relative entropy from below by a sequence of semidefinite programs (SDPs). These SDPs can then be solved using standard SDP solvers, and the solution to the dual SDP provides a certified lower bound. Alternatively, one can also turn any feasible choice of ${\hat{\psi }}_{PQ}$ (ideally close to the optimal attack) into a certified lower bound using the techniques from refs. ^6,7.
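For a fixed candidate state, the relative-entropy term in Eq. (69) can be evaluated directly, because for any pinching map the identity D(ρ‖𝒫(ρ)) = H(𝒫(ρ)) − H(ρ) holds (log 𝒫(ρ) is block-diagonal, so Tr[ρ log 𝒫(ρ)] = Tr[𝒫(ρ) log 𝒫(ρ)]). The NumPy sketch below uses this identity. It only evaluates the objective at a given state and does not perform the optimisation or produce a certified bound; it also assumes, for simplicity, that the register S is the first tensor factor, which differs from the ordering PQSIC used in the text.

```python
import numpy as np


def von_neumann_entropy(rho, tol=1e-12):
    """Entropy in bits; eigenvalues below tol are treated as zero."""
    evals = np.linalg.eigvalsh(rho)
    evals = evals[evals > tol]
    return float(-np.sum(evals * np.log2(evals)))


def pinch_first_factor(rho, dim_s, dim_rest):
    """Pinching map P_S: keep only the blocks <s| rho |s> on the first factor."""
    r = rho.reshape(dim_s, dim_rest, dim_s, dim_rest)
    out = np.zeros_like(r)
    for s in range(dim_s):
        out[s, :, s, :] = r[s, :, s, :]
    return out.reshape(rho.shape)


def pinching_relative_entropy(rho, dim_s, dim_rest):
    """D(rho || P_S(rho)) computed as H(P_S(rho)) - H(rho)."""
    pinched = pinch_first_factor(rho, dim_s, dim_rest)
    return von_neumann_entropy(pinched) - von_neumann_entropy(rho)
```

As a sanity check, a state that is already block-diagonal in S gives zero, while a qubit S in the state |+⟩ gives one bit, matching the intuition that the quantity measures the coherence between different key values.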

We note that many protocols have additional structure that allows the optimisation problem in Eq. (70) to be simplified before tackling it numerically. Additionally, if the map EV from Box 1 has a particular structure that distinguishes between “test rounds”, in which Alice and Bob use their measurement outcomes to check whether Eve tampered with the protocol, and “data rounds”, in which Alice and Bob generate the raw data for their key, the derivation of a collective attack bound can be further simplified. We refer to [ref. ⁵³, Section V.A] for a detailed explanation of this method and to Results Subsection “Sample application: B92 protocol” for an example of its use in our context.

Data availability

No experimental data was collected as part of this work.

Code availability

Code for reproducing Fig. 1 is available from the authors upon request.

References

Bennett, C. H. & Brassard, G. Quantum cryptography: public key distribution and coin tossing. in Proceedings of IEEE International Conference on Computers, Systems and Signal Processing. pp. 8, vol. 175 (1984).
Ekert, A. K. Quantum cryptography based on Bell’s theorem. Phys. Rev. Lett. 67, 661 (1991).
Meyer, T., Kampermann, H., Kleinmann, M. & Bruß, D. Finite key analysis for symmetric attacks in quantum key distribution. Phys. Rev. A 74, 042340 (2006).
Scarani, V. & Renner, R. Quantum cryptography with finite resources: unconditional security bound for discrete-variable protocols with one-way postprocessing. Phys. Rev. Lett. 100, 200501 (2008).
Cai, R. Y. Q. & Scarani, V. Finite-key analysis for practical implementations of quantum key distribution. New J. Phys. 11, 045024 (2009).
Coles, P. J., Metodiev, E. M. & Lütkenhaus, N. Numerical approach for unstructured quantum key distribution. Nat. Commun. 7, 1 (2016).
Winick, A., Lütkenhaus, N. & Coles, P. J. Reliable numerical key rates for quantum key distribution. Quantum 2, 77 (2018).
Wang, Y., Primaatmaja, I. W., Lavie, E., Varvitsiotis, A. & Lim, C. C. W. Characterising the correlations of prepare-and-measure quantum networks. npj Quant. Inf. 5, 17 (2019).
Primaatmaja, I. W., Lavie, E., Goh, K. T., Wang, C. & Lim, C. C. W. Versatile security analysis of measurement-device-independent quantum key distribution. Phys. Rev. A 99, 062332 (2019).
Brown, P., Fawzi, H. & Fawzi, O. Device-independent lower bounds on the conditional von Neumann entropy. arXiv preprint arXiv:2106.13692 (2021).
Brown, P., Fawzi, H. & Fawzi, O. Computing conditional entropies for quantum correlations. Nat. Commun. 12, 1 (2021).
Tan, E. Y.-Z., Schwonnek, R., Goh, K. T., Primaatmaja, I. W. & Lim, C. C.-W. Computing secure key rates for quantum cryptography with untrusted devices. npj Quant. Inf. 7, 1 (2021).
Hu, H., Im, J., Lin, J., Lütkenhaus, N. & Wolkowicz, H. Robust interior point method for quantum key distribution rate computation. Quantum 6, 792 (2022).
Araújo, M., Huber, M., Navascués, M., Pivoluska, M. & Tavakoli, A. Quantum key distribution rates from semidefinite programming. Quantum 7, 1019 (2023).
Renner, R. Security of quantum key distribution. Int. J. Quant. Inf. 6, 1 (2008).
Christandl, M., König, R. & Renner, R. Postselection technique for quantum channels with applications to quantum cryptography. Phys. Rev. Lett. 102, 020504 (2009).
Dupuis, F., Fawzi, O. & Renner, R. Entropy accumulation. Commun. Math. Phys. 379, 867 (2020).
Bennett, C. H. Quantum cryptography using any two nonorthogonal states. Phys. Rev. Lett. 68, 3121 (1992).
Scarani, V. et al. The security of practical quantum key distribution. Rev. Mod. Phys. 81, 1301 (2009).
Xu, F., Ma, X., Zhang, Q., Lo, H.-K. & Pan, J.-W. Secure quantum key distribution with realistic devices. Rev. Mod. Phys. 92, 025002 (2020).
Beaudry, N. J., Moroder, T. & Lütkenhaus, N. Squashing models for optical measurements in quantum communication. Phys. Rev. Lett. 101, 093601 (2008).
Arnon-Friedman, R., Renner, R. & Vidick, T. Simple and tight device-independent security proofs. SIAM J. Comput. 48, 181 (2019).
Lo, H.-K. & Chau, H. F. Unconditional security of quantum key distribution over arbitrarily long distances. Science 283, 2050 (1999).
Shor, P. W. & Preskill, J. Simple proof of security of the BB84 quantum key distribution protocol. Phys. Rev. Lett. 85, 441 (2000).
Tamaki, K., Koashi, M. & Imoto, N. Unconditionally secure key distribution based on two nonorthogonal states. Phys. Rev. Lett. 90, 167904 (2003).
Boileau, J.-C., Tamaki, K., Batuwantudawe, J., Laflamme, R. & Renes, J. M. Unconditional security of a three state quantum key distribution protocol. Phys. Rev. Lett. 94, 040503 (2005).
Koashi, M. Simple security proof of quantum key distribution based on complementarity. New J. Phys. 11, 045018 (2009).
Tomamichel, M., Lim, C. C. W., Gisin, N. & Renner, R. Tight finite-key analysis for quantum cryptography. Nat. Commun. 3, 1 (2012).
Pereira, M., Kato, G., Mizutani, A., Curty, M. & Tamaki, K. Quantum key distribution with correlated sources. Sci. Adv. 6, eaaz4487 (2020).
Pereira, M. et al. Modified BB84 quantum key distribution protocol robust to source imperfections. Phys. Rev. Res. 5, 023065 (2023).
Christandl, M., Renner, R. & Ekert, A. A generic security proof for quantum key distribution. arXiv preprint quant-ph/0402131 (2004).
Renner, R., Gisin, N. & Kraus, B. Information-theoretic security proof for quantum-key-distribution protocols. Phys. Rev. A 72, 012332 (2005).
Abruzzo, S., Kampermann, H., Mertz, M. & Bruß, D. Quantum key distribution with finite resources: Secret key rates via Rényi entropies. Phys. Rev. A 84, 032321 (2011).
Lütkenhaus, N. Security against individual attacks for realistic quantum key distribution. Phys. Rev. A 61, 052304 (2000).
Inamori, H., Lütkenhaus, N. & Mayers, D. Unconditional security of practical quantum key distribution. Eur. Phys. J. D 41, 599 (2007).
Metger, T., Fawzi, O., Sutter, D. & Renner, R. Generalised entropy accumulation. arXiv preprint arXiv:2203.04989 (2022).
Devetak, I. & Winter, A. Distillation of secret key and entanglement from quantum states. Proc. R. Soc. A: Math. Phys. Eng. Sci. 461, 207 (2005).
Portmann, C. & Renner, R. Security in quantum cryptography. Rev. Mod. Phys. 94, 025008 (2022).
Tamaki, K. & Lütkenhaus, N. Unconditional security of the Bennett 1992 quantum key-distribution protocol over a lossy and noisy channel. Phys. Rev. A 69, 032316 (2004).
George, I., Lin, J., van Himbeeck, T., Fang, K. & Lütkenhaus, N. Finite-key analysis of quantum key distribution with characterized devices using entropy accumulation. arXiv preprint arXiv:2203.06554 (2022).
Fawzi, H., Saunderson, J. & Parrilo, P. A. Semidefinite approximations of the matrix logarithm. Found. Comput. Math. 19, 259 (2019).
Inoue, K., Waks, E. & Yamamoto, Y. Differential phase shift quantum key distribution. Phys. Rev. Lett. 89, 037902 (2002).
Stucki, D., Brunner, N., Gisin, N., Scarani, V. & Zbinden, H. Fast and simple one-way quantum key distribution. Appl. Phys. Lett. 87, 194108 (2005).
Tomamichel, M. Quantum Information Processing With Finite Resources: Mathematical Foundations, Vol. 5 (Springer, 2015).
König, R. & Renner, R. Sampling of min-entropy relative to quantum knowledge. IEEE Trans. Inf. Theory 57, 4760 (2011).
De, A., Portmann, C., Vidick, T. & Renner, R. Trevisan’s extractor in the presence of quantum side information. SIAM J. Comput. 41, 915 (2012).
Coles, P. J. Unification of different views of decoherence and discord. Phys. Rev. A 85, 042103 (2012).
Tan, E. Y. Z. et al. Improved DIQKD protocols with finite-size analysis. Quantum 6, 880 (2022).
Bunandar, D., Govia, L. C., Krovi, H. & Englund, D. Numerical finite-key analysis of quantum key distribution. npj Quant. Inf. 6, 1 (2020).
George, I., Lin, J. & Lütkenhaus, N. Numerical calculations of the finite key rate for general quantum key distribution protocols. Phys. Rev. Res. 3, 013274 (2021).
Fawzi, H. & Fawzi, O. Efficient optimization of the quantum relative entropy. J. Phys. A: Math. Theor. 51, 154003 (2018).
Fawzi, H. Rational Upper/lower Bounds On Log. https://github.com/hfawzi/cvxquad/blob/master/doc/log_approx_bounds.pdf (2021).
Dupuis, F. & Fawzi, O. Entropy accumulation with improved second-order term. IEEE Trans. Inf. Theory 65, 7596 (2019).


Acknowledgements

We thank Rotem Arnon-Friedman, Omar Fawzi, Marcus Haberland, Christoph Pacher, Joseph M. Renes, Martin Sandfuchs, and David Sutter for helpful discussions. We are especially grateful to Ernest Tan for helpful explanations regarding numerical methods for computing collective attack bounds. Both authors were supported by the National Centres of Competence in Research (NCCRs) QSIT and SwissMAP (both funded by the Swiss National Science Foundation), the Air Force Office of Scientific Research (AFOSR) via project No. FA9550-19-1-0202, the SNSF project No. 200021_188541 and the QuantERA project eDICT.

Author information

Authors and Affiliations

Institute for Theoretical Physics, ETH Zürich, 8093, Zürich, Switzerland
Tony Metger & Renato Renner

Authors

Tony Metger
Renato Renner

Contributions

T.M. developed the proofs and wrote the manuscript with input from R.R.; R.R. guided and supervised the project.

Corresponding author

Correspondence to Tony Metger.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Metger, T., Renner, R. Security of quantum key distribution from generalised entropy accumulation. Nat Commun 14, 5272 (2023). https://doi.org/10.1038/s41467-023-40920-8


Received: 17 January 2023
Accepted: 16 August 2023
Published: 29 August 2023
DOI: https://doi.org/10.1038/s41467-023-40920-8
