Quantum causal unravelling

Bai, Ge; Wu, Ya-Dong; Zhu, Yan; Hayashi, Masahito; Chiribella, Giulio

doi:10.1038/s41534-022-00578-4

Download PDF

Article
Open access
Published: 21 June 2022

Quantum causal unravelling

npj Quantum Information volume 8, Article number: 69 (2022) Cite this article

1905 Accesses
6 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Complex processes often arise from sequences of simpler interactions involving a few particles at a time. These interactions, however, may not be directly accessible to experiments. Here we develop the first efficient method for unravelling the causal structure of the interactions in a multipartite quantum process, under the assumption that the process has bounded information loss and induces causal dependencies whose strength is above a fixed (but otherwise arbitrary) threshold. Our method is based on a quantum algorithm whose complexity scales polynomially in the total number of input/output systems, in the dimension of the systems involved in each interaction, and in the inverse of the chosen threshold for the strength of the causal dependencies. Under additional assumptions, we also provide a second algorithm that has lower complexity and requires only local state preparation and local measurements. Our algorithms can be used to identify processes that can be characterized efficiently with the technique of quantum process tomography. Similarly, they can be used to identify useful communication channels in quantum networks, and to test the internal structure of uncharacterized quantum circuits.

Concurrence percolation threshold of large-scale quantum networks

Article Open access 29 July 2022

Symmetries in quantum networks lead to no-go theorems for entanglement distribution and to verification techniques

Article Open access 25 January 2022

Multipartite entanglement analysis from random correlations

Article Open access 09 June 2020

Introduction

Many processes in nature arise from sequences of basic interactions, each involving a small number of physical systems. Determining the causal structure of these interactions is important both for basic science and for engineering. Often, however, the sequence of interactions giving rise to a process of interest may not be directly accessible to experiments. For example, scattering experiments in high energy physics can probe the relation between a set of incoming particles and a set of outgoing particles, but typically cannot access the individual events taking place within the scattering region. In this and similar scenarios, a fundamental problem is to characterize the causal structure of the interactions by accessing only the inputs and outputs of the process of interest, while treating the intermediate steps as a black box. We call this problem, illustrated in Fig. 1, the causal unravelling of an unknown physical process. Explicitly, the problem of causal unravelling is to determine whether an unknown process can be broken down into a sequence of simpler interactions, to determine the order of such interactions, and to determine which systems take part in each interaction. Causal unravelling can be viewed as a special case of the broader problem of causal discovery^1,2, namely the task to identify the causal relations between a given set of variables. In the broad class of causal discovery problems, the distinctive features of causal unravelling are that (i) the goal is to identify a linear causal structure, corresponding to the sequence of interactions underlying the given process, and (ii) certain variables are a priori known to be ‘inputs’ (and therefore potential ‘causes’), while other variables are a priori known to be ‘outputs’ (and therefore potential ‘effects’). This scenario often arises in experimental physics, where the input/output structure is typically clear from the design of the experiment, as in the aforementioned example of scattering experiments. In principle, candidate answers can be extracted from a full tomographic characterization of the process under consideration. However, the complexity of process tomography grows exponentially in the number of inputs and outputs, making this approach unfeasible when the process involves a large number of systems.

In the classical domain, the problem of causal unravelling can be efficiently addressed with a variety of algorithms developed for the general problem of causal discovery^1,2,3. Classical causal discovery algorithms often formulate causal relationships with graphical models and solve such models by structure learning algorithms, such as PC algorithm (named after Peter Spirtes and Clark Glymour)¹, Greedy Equivalence Search⁴, and Max-Min Hill Climbing⁵. These algorithms cover a wide variety of problems, by making different sets of assumptions on the process under consideration. Typical assumptions include causal sufficiency—meaning that no variables are hidden—and causal faithfulness—meaning that the conditional independences among the variables are precisely those associated to an underlying graph used to model the causal structure. In general, however, causal discovery is intrinsically a hard problem: when no assumption is made, the complexity of all the known algorithms becomes exponential in the worst case over all possible instances^6,7.

In the quantum domain, the problem of causal unravelling is made even more challenging by the presence of correlations that elude a classical explanation^8,9. In recent years, the quantum extension of the notion of causal model has been addressed in a series of works^{10,11,12,13,14,15}, providing a solid conceptual foundation to the field of quantum causal discovery. On the algorithmic side, however, the study of quantum causal models remained relatively underdeveloped. Specific instances of quantum causal discovery were studied in refs. ^16,17,18, showing that quantum resources offer appealing advantages. These examples, however, were limited to simple instances, typically involving a small number of variables and/or a small number of hypotheses on the causal structure. In more general scenarios, one approach could be to perform quantum process tomography and then to infer the causal structure from the full description of the process under consideration¹⁹. As in the classical case, however, the number of queries needed by a full process tomography grows exponentially with the number of systems involved in the process, making this approach impractical as the size of the problem increases.

In this paper, we provide an efficient algorithm for unravelling the causal structure of multipartite quantum processes without resorting to full process tomography. Our algorithm is similar to the PC algorithm¹ for classical causal discovery, in that it is based on a set of tests that establish the independence relations between subsets of input and output systems. We show that the algorithm has the following features:

1.
The number of independence tests needed to infer the causal structure scales polynomially with the number of inputs/outputs of the process. This feature is possible thanks to the special structure of the causal unravelling problem, where the goal is to establish a linear ordering of the interactions giving rise to the process under consideration.
2.
The independence tests produce, as a byproduct, an estimate of the strength of correlation between the various inputs and outputs of the process. In ‘Methods’, we show that this estimate can be obtained by performing a number of measurements that grows polynomially with the dimension of the systems under consideration, and that the number of measurements needed to conclude independence scales polynomially with the number of systems.
3.
The algorithm is exact whenever the process has bounded information loss, and satisfies a form of causal faithfulness property, namely that the strength of the causal relations, when present, is above a given threshold. When these assumptions are not satisfied, the algorithm produces an approximate result. The details of the approximate case are in Supplementary Note 6.

Moreover, the efficiency of our algorithm can be further boosted in special cases, including (i) the case where each input of a given interaction has a non-trivial causal influence on all the outputs of subsequent interactions, and (ii) the case where the process belongs to a special case of Markovian processes^12,19,20,21, where each output depends only on one previous input and each input affects only one later output. We study these cases in the ‘Results’, where we devise an alternative algorithm that only requires local state preparations and local measurements. The number of queries to the process is only logarithmic in the number of input and output wires, thanks to a method that efficiently determines the correlations between input–output pairs as described in the ‘Methods’.

The remaining parts of this paper are structured as follows. In ‘Results’, we first formulate the quantum causal unravelling problem. In the second subsection, we give the main body of the efficient causal unravelling algorithm, discuss the assumptions and analyze its efficiency. The third subsection of ‘Results’ talks about the alternative algorithm designed for special cases. At the end of ‘Results’, we briefly talk about a generalization of our algorithm. Future works, interpretations and the applications of the causal unravelling algorithms are addressed in the ‘Discussion’. The ‘Methods’ section contains the detailed implementation of the independence tests used by the algorithms in ‘Results’.

Results

Problem formulation

Let us start by giving a precise formulation of the problem of quantum causal unravelling. In this problem, an experimenter is given access to a multipartite quantum process, with inputs labelled as ${A}_{1},\ldots ,{A}_{{n}_{{{{\rm{in}}}}}}$, and outputs labelled as ${B}_{1},\ldots ,{B}_{{n}_{{{{\rm{out}}}}}}$. Note that, in general, different labels may refer to the same physical system: for example, system A₁ could be a single photon with a given frequency, entering in the interaction region, and system B₁ could be a single photon with the same frequency, exiting the interaction region. Mathematically, the process is described by a quantum channel, that is, a completely positive trace-preserving (CPTP) linear map ${{{\mathcal{C}}}}$ transforming operators on the tensor product space ${{{{\mathcal{H}}}}}_{{A}_{1}}\otimes \cdots \otimes {{{{\mathcal{H}}}}}_{{A}_{{n}_{{{{\rm{in}}}}}}}$ to operators on the tensor product space ${{{{\mathcal{H}}}}}_{{B}_{1}}\otimes \cdots \otimes {{{{\mathcal{H}}}}}_{{B}_{{n}_{{{{\rm{out}}}}}}}$. Note that, without loss of generality, one can always assume n_in = n_out = n, as this condition can be satisfied by adding a number of dummy systems with one-dimensional Hilbert space. In the following, we will denote by $L({{{\mathcal{H}}}})$ the set of linear operators on a generic Hilbert space ${{{\mathcal{H}}}}$, and by $S({{{\mathcal{H}}}})$ the subset of density operators on ${{{\mathcal{H}}}}$, that is, the subset of operators $\rho \in L({{{\mathcal{H}}}})$ that are positive semidefinite and have unit trace.

The problem of causal unravelling is to determine whether a multipartite process can be broken down into a sequence of interactions, as in Fig. 2, and, in the affirmative case, to determine which systems are involved in each interaction. Mathematically, the problem is to find a partition ${\{{P}_{i}\}}_{i = 1}^{m}$ of the set {A₁, …, A_n} and a partition ${\{{Q}_{i}\}}_{i = 1}^{m}$ of the set {B₁, …, B_n}, such that the multipartite process can be decomposed into a sequence of interactions, with the i-th interaction involving input systems in P_i and output systems in Q_i. Such a sequential structure matches the framework of quantum combs^22,23. A quantum comb is a quantum process that can be broken down into a sequence of interactions ${{{{\mathcal{C}}}}}_{1},\ldots ,{{{{\mathcal{C}}}}}_{k}$ as in Fig. 2, while each interaction ${{{{\mathcal{C}}}}}_{i}$ is a CPTP map and is called a tooth of the comb. References ^22,23 give a set of necessary and sufficient conditions for determining whether a given process conforms to a quantum comb, which is equivalent to whether the process admits a causal unravelling with partitions {P_i} and {Q_i}. We say a process ${{{\mathcal{C}}}}$ has a causal unravelling (P₁, Q₁), …, (P_m, Q_m) if it can be decomposed into the form of a quantum comb with m teeth as in Fig. 2.

**Fig. 2: Decomposition of a physical process into a sequence of m interactions.**

The causal unravelling of a given multipartite process determines the possible signalling relations between inputs and outputs. With respect to this decomposition, input systems at a given time can only signal to output systems at later times. The resulting pattern can be graphically illustrated by a causal graph^1,2,14, as shown in Fig. 3. Note that the causal graph includes all the signalling relations that are in principle compatible with the structure of the interactions. However, a specific quantum process may not exhibit any signalling from a specific input to a specific output, even though the causal structure of the interactions would in principle permit it. When this occurs, the causal unravelling of a process may not be unique. For example, consider a bipartite process with inputs {A₁, A₂} and outputs {B₁, B₂}, with the property that A₁ signals to B₁ but not to B₂ and A₂ signals to B₂ but not to B₁. It is impossible to decide whether the signalling A₁ → B₁ happens before or after A₂ → B₂ since they are causally uncorrelated. Therefore, the process admits two causal unravellings: (A₁, B₁), (A₂, B₂) and (A₂, B₂), (A₁, B₁). In this case, any of the two options is a valid solution of the causal unravelling problem.

**Fig. 3: Causal graph of a quantum process with causal unravelling (A₁, B₁), (A₂, B₂), (A₃, B₃), (A₄, B₄).**

Generally, causal discovery is a hard problem, even in the classical setting^6,7. However, we will now show that, under a few assumptions, the more specific problem of quantum causal unravelling defined above can be solved efficiently. The first assumption is that the basic interactions appearing in the causal unravelling involve a small number of systems, independent of the number of inputs and outputs. At the fundamental level, this assumption is motivated by the fact that interactions are local, and typically involve a small number of systems. Mathematically, the assumption is that the cardinality of all the sets in the partitions ${\{{P}_{i}\}}_{i = 1}^{m}$ and ${\{{Q}_{i}\}}_{i = 1}^{m}$ is no larger than a constant c independent of n. For simplicity, we will first restrict our attention to the special case c = 1, meaning that the process can be broken down into interactions involving only one input and one output at a time. We will discuss larger c at the end of the ‘Results’ and in Supplementary Note 7. Hereafter, we will denote by ${\mathrm{Comb}}$[(A₁, B₁), …, (A_n, B_n)] the set of quantum combs with n teeth where the i-th tooth has input A_i and output B_i.

In the basic c = 1 scenario illustrated above, the problem of causal unravelling is to find out which pair of systems is involved in the first interaction, which pair is involved in the second, and so on. Formally, our goal is to identify an ordering of the inputs and outputs, (A_σ(1), B_π(1)), …, (A_σ(n), B_π(n)) with σ and π being permutations of {1, …, n}, such that ${{{\mathcal{C}}}}\in {\mathrm{Comb}}[({A}_{\sigma (1)},{B}_{\pi (1)}),\ldots ,({A}_{\sigma (n)},{B}_{\pi (n)})]$.

To quantify the efficiency of our algorithms, we will focus on the sample complexity, namely the number of black-box queries to the channel ${{{\mathcal{C}}}}$, and on the computational complexity, including additional quantum and classical computation time measured by the number of elementary quantum gates and classical operations.

Efficient quantum causal unravelling

We now provide an efficient quantum algorithm for quantum causal unravelling. The main idea of the algorithm is to recursively find the last interaction in the decomposition of a given process. To illustrate this idea, consider the example of Fig. 3. When we remove B₄, the node A₄ becomes disconnected from all the other nodes, meaning that the state of system A₄ does not affect the state of the other systems. By testing this independence condition, we can in principle check whether the graph of Fig. 3 is an appropriate model for the process under consideration. In general, suppose (A_x, B_y) are the input and output of the last interaction. Since A_x signals to only B_y, if we ignore B_y, A_x is independent of the joint system formed by all systems excluding A_x and B_y. This gives the following criterion that the last interaction must satisfy:

Proposition 1.²² The last interaction of a process ${{{\mathcal{C}}}}$ involves the input/output pair (A_x, B_y) if and only if A_x is independent of A_≠xB_≠y, the joint system containing all systems other than A_x and B_y.

More details can be found in Supplementary Note 1. By scanning the possible pairs (A_x, B_y), we can find if one of them satisfies the above criterion, and in the affirmative case, we can assign that pair to the last interaction. Note that, in general, there may be more than one pair that satisfy the required condition. After one pair is found, one can reduce the problem to a smaller graph containing n − 1 inputs and n − 1 outputs. If at every step a suitable pair is found, then the final result is a valid causal unravelling of the original process. If at one step no pair can be found, the algorithm will then conclude that no causal unravelling with c = 1 exists for the remaining subgraph. At this point, the algorithm can continue by considering causal unravellings with higher values of c for the remaining subgraph, which is discussed at the end of the ‘Results’ and detailed in Supplementary Note 7.

A description of the algorithm is provided in Algorithm 1. The algorithm runs in a recursive manner: for an n-tooth comb, it finds the last tooth of the comb, remove the tooth (feeds the input with an arbitrary state and discards its output), and reduces the problem to finding the causal unravelling of an (n − 1)-tooth comb. We repeat the above procedure until we reach the bottom case n = 1, and thus obtain the order of all inputs and outputs. If the last tooth cannot be found in some iteration, it means that the current channel cannot be further decomposed, and the algorithm will output the trivial causal unravelling (P, Q) where P (Q) is the set of all input (output) wires of the current channel.

Algorithm 1

Efficient quantum causal unravelling algorithm

We now discuss the efficiency of this algorithm. First, we show that the number of independence tests is polynomial in n. Let T_test(n) be the number of independence tests required for an n-to-n channel. In this algorithm, an independence test is performed for at most each of the n² input–output pairs (A_x, B_y), resulting in O(n²) independence tests. After finding a last tooth, the problem size is reduced to n − 1. Therefore, T_test(n) can be given by the recursive relation T_test(n) = O(n²) + T_test(n − 1) with T_test(1) = O(1), solving which gives T_test(n) = O(n³).

Second, we show that the independence tests can be efficiently realized. This is a non-trivial problem, because testing whether two generic systems are in a product state is computationally hard in the worst-case scenario²⁴. Nevertheless, in the ‘Methods’ we design a quantum circuit that performs the independence tests efficiently under assumptions of bounded information loss and causal faithfulness. Our circuit converts the independence test to the estimation of the distance between quantum states, which is done by the SWAP test²⁵.

Now, we give the efficiency guarantee for our algorithm. We first discuss the exact case, when our algorithm produces the exact causal unravelling of ${{{\mathcal{C}}}}$ based on two assumptions. We will use a few parameters related to the Choi state²⁶ of the process ${{{\mathcal{C}}}}$, defined as the state $C:= ({{{\mathcal{C}}}}\otimes {{{\mathcal{I}}}})(\left|I\rangle \,\right\rangle \left\langle \,\langle I\right|)/{d}_{{{{\rm{in}}}}}$, where d_in is the total dimension of all the input systems, and $\left|I\rangle \,\right\rangle =\mathop{\sum }\nolimits_{j = 1}^{{d}_{{{{\rm{in}}}}}}\left|j\right\rangle \otimes \left|j\right\rangle$ is the canonical (unnormalized) maximally entangled state. The rank of the Choi state C is called the Kraus rank of the process ${{{\mathcal{C}}}}$. Since a unitary evolution, namely a process without information loss, has Kraus rank equal to one, the Kraus rank could be interpreted as the degree of information loss introduced by the process.

An important parameter entering into the analysis is the degree of independence between systems, defined in the following. For a state ρ, we say two disjoint subsystems S and T are independent if ρ_ST = ρ_S ⊗ ρ_T, where ρ_S, ρ_T and ρ_ST are the marginal states of ρ on the subsystems S, T and the joint system ST, respectively. The degree of independence between S and T is then defined as ${\chi }_{1}{\left(S;T\right)}_{\rho }:= \parallel {\rho }_{ST}-{\rho }_{S}\otimes {\rho }_{T}{\parallel }_{1}$, where $\parallel X{\parallel }_{1}:= {{\mathrm{Tr}}}\,\left[\sqrt{{X}^{{\dagger} }X}\right]$ denotes the trace norm. Clearly, ${\chi }_{1}{\left(S;T\right)}_{\rho }=0$ if and only if ρ_ST = ρ_S ⊗ ρ_T. More generally, the trace distance is related to the probability to distinguish the states, and thus ${\chi }_{1}{\left(S;T\right)}_{\rho }$ measures the probability that an observer correctly decides whether the subsystems are independent or correlated. In the following, we will apply this definition to the Choi state C of the process ${{{\mathcal{C}}}}$. With this choice, ${\chi }_{1}{\left(S;T\right)}_{C}$ satisfies the conditions for a quantum causality measure, as defined in ref. ²⁷.

To facilitate the efficiency analysis of our algorithm, we first put the process ${{{\mathcal{C}}}}$ in a standard form where all input and output wires have the same dimension d_A. This standard form does not limit the generality of the quantum process we investigate. For a process ${{{\mathcal{C}}}}$ with input dimensions ${d}_{{A}_{1}},\ldots ,{d}_{{A}_{n}}$ and output dimensions ${d}_{{B}_{1}},\ldots ,{d}_{{B}_{n}}$, we can pick ${d}_{A}=\max \{{d}_{{A}_{1}},\ldots ,{d}_{{A}_{n}},{d}_{{B}_{1}},\ldots ,{d}_{{B}_{n}}\}$ and regard each input or output wire of ${{{\mathcal{C}}}}$ as a subspace of a d_A-dimensional system, thus transforming ${{{\mathcal{C}}}}$ into a process whose wires all have dimension d_A. In our analysis, we will use the Kraus rank of the process in the standard form to characterize the information loss. We assume that the information loss is bounded, which is given by the following assumption:

Assumption 1. The Kraus rank of ${{{\mathcal{C}}}}$, after transforming it into the standard form, is bounded by a polynomial of n.

To ensure that the algorithm outputs the correct causal unravelling, we further require that the process satisfies a form of causal faithfulness, meaning that the strength of causal relations is either zero or above a threshold. Using χ₁ as a quantitative measure of correlation, we adopt the following assumption:

Assumption 2. There exists a number ${\chi }_{\min } > 0$ such that, for any two disjoint sets of wires S and T being tested for independence, either

1.
${\chi }_{1}{\left(S;T\right)}_{C}=0$, or
2.
${\chi }_{1}{\left(S;T\right)}_{C}\ge {\chi }_{\min }$,

where C is the Choi state of the quantum process ${{{\mathcal{C}}}}$.

The threshold ${\chi }_{\min }$ determines the resolution of the independence tests. To guarantee the correctness of Algorithm 1, the independence tests must be precise enough to detect correlations above this threshold with high probability. The efficiency and correctness of Algorithm 1 are given in the following theorem, whose proof is in Supplementary Note 2:

Theorem 1. Under Assumptions 1 and 2, for any confidence parameter κ₀ > 0, Algorithm 1 satisfies the following conditions:

1.
With probability 1 − κ₀, the output of Algorithm 1 is a correct causal unravelling for ${{{\mathcal{C}}}}$.
2.
The number of queries to ${{{\mathcal{C}}}}$ is in the order of
$${T}_{{{{\rm{sample}}}}}=O\left({n}^{3}{d}_{A}^{2}{r}_{{{{\mathcal{C}}}}}^{2}{\chi }_{\min }^{-4}\log (n{\kappa }_{0}^{-1})\right)$$
(1)

where ${r}_{{{{\mathcal{C}}}}}$ is the Kraus rank of ${{{\mathcal{C}}}}$ in the standard form with all wires having dimension d_A.
3.
The computational complexity is in the order of $O({T}_{{{{\rm{sample}}}}}n\log {d}_{A})$.

Theorem 1 guarantees that, under appropriate assumptions, the sample complexity of our algorithm is polynomial in the number of input and output systems of the process under consideration. This feature is in stark contrast with the exponential complexity of full process tomography. As a consequence, our algorithm offers a speedup over for other algorithms, such as the one proposed in ref. ¹⁹, which require process tomography as an intermediate step.

Assumptions 1 and 2 guarantee that Algorithm 1 produces an exact causal unravelling. However, both assumptions can be lifted if we only require an approximate causal unravelling, meaning that the process ${{{\mathcal{C}}}}$ is within a certain error of another process compatible with the causal unravelling output by the algorithm. In Supplementary Note 6, we formulate this approximate case and prove that the error is small under the condition that every marginal Choi state of ${{{\mathcal{C}}}}$ obtained by taking only the first k inputs and k − 1 outputs in the causal unravelling of ${{{\mathcal{C}}}}$ has polynomial rank up to a small error. This condition can be verified efficiently during the execution of Algorithm 1.

Causal unravelling with local observations

We now show that, under some assumptions on the input–output relations, one can design algorithms that have much lower sample complexity in terms of n compared to Algorithm 1, and are more experimentally friendly, in that they require only local state preparation and local measurements.

In the ‘Methods’, we show an efficient algorithm to detect the pairwise correlations between input and output wires of ${{{\mathcal{C}}}}$ with local state preparation and local measurements. The algorithm computes a Boolean matrix ind_ij such that, with high probability, for every i and j, A_i and B_j are approximately independent whenever ind_ij = true, and are correlated whenever ind_ij = false. With some assumptions, this Boolean matrix ind_ij is sufficient to give the exact causal unravelling. The first case is given by the following assumption on the process ${{{\mathcal{C}}}}$:

Assumption 3. ${{{\mathcal{C}}}}$ is a quantum comb in ${\mathrm{Comb}}$[(A_σ(1), B_π(1)), …, (A_σ(n), B_π(n))], and there exists a constant ${\chi }_{\min } \,>\, 0$ such that, for any pair of input and output wires A_σ(i) and B_π(j), if j ≥ i, then ${\chi }_{1}{\left({A}_{\sigma (i)};{B}_{\pi (j)}\right)}_{C}\ge {\chi }_{\min }$.

Assumption 3 indicates a non-trivial correlation between any pair consisting of an input system and an output system, with the property that the input system appears before the output system in the overall causal order. In other words, for any j ≥ i, ${C}_{{A}_{\sigma (i)},{B}_{\pi (j)}}$ is away from ${C}_{{A}_{\sigma (i)}}\otimes {C}_{{B}_{\pi (j)}}$ by distance ${\chi }_{\min }$. Meanwhile, Assumption 3 defines a total order of the input (output) wires, and ensures a unique causal unravelling that ${{{\mathcal{C}}}}$ is compatible with. Under this assumption, if A_i is the k-th input, namely i = σ(k), A_i is correlated with n − k + 1 output wires including every output B_π(j) with j ≥ k, and is independent of the other outputs. In other words, if we find A_i is correlated with exactly c_A(i) output wires, it must be the (n − c_A(i) + 1)-th input. With this, the order of input wires can be exactly determined, and a similar statement can be applied to order the output wires.

In Supplementary Note 4, we give the details of this algorithm, and analyze its efficiency given by the following theorem:

Theorem 2. For a quantum comb ${{{\mathcal{C}}}}\in {\mathsf{Comb}}[({A}_{\sigma (1)},{B}_{\pi (1)}),\ldots ,({A}_{\sigma (n)},{B}_{\pi (n)})]$ satisfying Assumption 3, there is an algorithm that satisfies the following conditions:

1.
With probability 1 − κ, the algorithm outputs the correct causal unravelling (A_σ(1), B_π(1)), …, (A_σ(n), B_π(n)).
2.
The algorithm uses only local state preparations and local measurements and the number of queries to ${{{\mathcal{C}}}}$ is in the order of
$$N=O\left({d}_{A}^{6}{d}_{B}^{6}{\chi }_{\min }^{-2}\log (n{d}_{A}{d}_{B}{\kappa }^{-1})\right),$$
(2)

${where}\,{d}_{A}:= \mathop{\max }\limits_{i}{d}_{{A}_{i}},{d}_{B}:= \mathop{\max }\limits_{j}{d}_{{B}_{j}}$.
3.
The computational complexity is in the order of $O(Nn(n+{d}_{A}+{d}_{B}^{4}))$.

Note the sample complexity of this algorithm grows only logarithmically with n.

A similar idea could be adopted to the case where the process belongs to a special case of Markovian processes^12,19,20,21, where each output depends only on one previous input and each input affects only one later output. This indicates that the process is decomposable to a tensor product of n channels each with one input and one output. In our problem, the order of inputs and outputs is unknown, and we have the following assumption:

Assumption 4. The process C is a tensor product of n channels, ${{{\mathcal{C}}}}{ = \bigotimes }_{i = 1}^{n}{{{{\mathcal{C}}}}}_{i}$ with ${{{{\mathcal{C}}}}}_{i}:{{{{\mathcal{H}}}}}_{{A}_{i}}\to {{{{\mathcal{H}}}}}_{{B}_{\pi ^{\prime} (i)}}$ for some permutation $\pi ^{\prime}$.

Since each output is related to at most one input and each input affects at most one output, after obtaining ind_ij, we can obtain the causal unravelling by matching each input–output pair (A_i, B_j) with ind_ij = false.

If there exists a threshold ${\chi }_{\min } \,>\, 0$ such that either ${\chi }_{1}{\left({A}_{i};{B}_{j}\right)}_{C}=0$ or ${\chi }_{1}{\left({A}_{i};{B}_{j}\right)}_{C}\ge {\chi }_{\min }$ holds for every A_i and B_j, then the algorithm has the same complexity as in Theorem 2 that is logarithmic in n. However, in case a threshold ${\chi }_{\min }$ is not known, we can still show that the algorithm is efficient yet produces an approximate answer with an error bound defined by the diamond norm²⁸. The diamond norm, also known as the completely bounded trace norm, is a distance measure between channels defined for ${{{\mathcal{C}}}},{{{\mathcal{D}}}}:L({{{{\mathcal{H}}}}}_{A})\to L({{{{\mathcal{H}}}}}_{B})$ as $\parallel {{{\mathcal{C}}}}-{{{\mathcal{D}}}}{\parallel }_{\lozenge }:= \mathop{\max }\limits_{\rho \in S({{{{\mathcal{H}}}}}_{A}\otimes {{{{\mathcal{H}}}}}_{A})}\parallel ({{{\mathcal{C}}}}\otimes {{{{\mathcal{I}}}}}_{A})(\rho )-({{{\mathcal{D}}}}\otimes {{{{\mathcal{I}}}}}_{A})(\rho ){\parallel }_{1}$, where ${{{{\mathcal{I}}}}}_{A}:L({{{{\mathcal{H}}}}}_{A})\to L({{{{\mathcal{H}}}}}_{A})$ is the identity map. The diamond norm measures the maximum probability to distinguish two channels, and is tighter than the trace distance since $\parallel {{{\mathcal{C}}}}-{{{\mathcal{D}}}}{\parallel }_{\lozenge }\le \parallel C-D{\parallel }_{1}$ for all channels ${{{\mathcal{C}}}}$ and ${{{\mathcal{D}}}}$ with Choi states C and D. The error bound is stated in the following theorem, whose proof is in Supplementary Note 5.

Theorem 3. For a quantum process ${{{\mathcal{C}}}}$ satisfying Assumption 4, there is an algorithm that outputs a causal unravelling (A₁, B_π(1)), …, (A_n, B_π(n)) satisfying the following conditions:

1.
With probability 1 − κ, the causal unravelling is approximately correct in the following sense:
$$\exists {{{\mathcal{D}}}}\in {\mathsf{Comb}}[({A}_{1},{B}_{\pi (1)}),\ldots ,({A}_{n},{B}_{\pi (n)})],\,\parallel {{{\mathcal{C}}}}-{{{\mathcal{D}}}}{\parallel }_{\lozenge }\le \varepsilon \,.$$
(3)
2.
The algorithm uses only local state preparations and local measurements and the number of queries to ${{{\mathcal{C}}}}$ is in the order of
$$N=O\left({n}^{2}{d}_{A}^{8}{d}_{B}^{6}{\varepsilon }^{-2}\log (n{d}_{A}{d}_{B}{\kappa }^{-1})\right),$$
(4)

$where\,{d}_{A}:= \mathop{\max }\limits_{i}{d}_{{A}_{i}}$ and ${d}_{B}:= \mathop{\max }\limits_{j}{d}_{{B}_{j}}$.
3.
The computational complexity is in the order of $O(Nn(n+{d}_{A}+{d}_{B}^{4}))$.

Causal unravelling with interactions between more inputs and outputs

In the algorithms shown so far, we assumed that the process under consideration admits a causal unravelling where each interaction involves exactly one input and one output. More generally, Algorithm 1 can be easily extended to the scenario where each interaction involves at most c inputs and c outputs of the original process. Instead of considering each wire separately, the idea is to consider a subset of at most c input (output) wires and perform independence tests on the subsets.

In Algorithm 1, one enumerates an input–output pair (A_x, B_y) and checks whether it is the last tooth by performing an independence test between A_x and A_≠xB_≠y. To deal with larger c, we replace this procedure by enumerating a subset of input wires $P\subset \{{A}_{1},\ldots ,{A}_{{n}_{{{{\rm{in}}}}}}\}$ and a subset of output wires $Q\subset \{{B}_{1},\ldots ,{B}_{{n}_{{{{\rm{out}}}}}}\}$, satisfying ∣P∣ ≤ c and ∣Q∣ ≤ c. Then we check whether (P, Q) is the last tooth of ${{{\mathcal{C}}}}$, which, according to Proposition 1, is equivalent to checking the independence between P and A_∉PB_∉Q ≔ {A_i∣A_i ∉ P} ∪ {B_j∣B_j ∉ Q}. The independence tests can still be implemented with SWAP tests. Like Algorithm 1, after we decide (P, Q) to be the last tooth, the wires in P and Q are removed from consideration, and the problem is reduced to the causal unravelling of a smaller channel. This process is done recursively until one reaches the bottom case. We give the detailed algorithm and analysis in Supplementary Note 7. For constant c, under some assumptions, the complexity of the algorithm is still polynomial in n and d_A, while the exponent depends on c.

Discussion

In this paper we developed an efficient algorithm for discovering linear causal structures between the inputs and outputs of a multipartite quantum process. Our algorithm provides a partial solution to the more general quantum causal discovery problem, whose goal is to produce a full causal graph describing arbitrary causal correlations in an arbitrary set of quantum variables. Our algorithm can be used as the first step for quantum causal discovery, and to obtain the full causal structure, additional tests may be adopted to detect signalling between more subsets of input and output wires. Since the most general quantum causal discovery problem is intrinsically hard, an interesting direction for future work is to examine to what extent the problem of quantum causal discovery can be solved by an efficient algorithm in scenarios beyond the linear structure analysed in this work.

The efficiency of our algorithms relies on some assumptions. For Algorithm 1, the low-rank assumption is the key in both the exact case (Assumption 1) and the approximate case discussed in Supplementary Note 6. This assumption avoids the computational difficulty of deciding whether a completely general state is a product state²⁴. Physically, a quantum process has a low rank if the number of uncontrolled particles entering and/or exiting the interaction region is small. In this picture, the uncontrolled particles in the input can be regarded as sources of environmental noise, and the uncontrolled particles in the output are responsible for information loss in the process. Intuitively, without the low-rank assumption, the causal correlations will be obscured by the noise, and it will be hard to discover them without additional prior knowledge. On the other hand, if one has prior knowledge, as in the case of processes satisfying Assumptions 3 and 4, the low-rank assumption may be lifted.

The ability to infer the underlying causal structure of a process is useful for a variety of applications. Classically, discovering causal relationships is the goal of many research areas with numerous applications in social and biomedical sciences¹ such as the construction of the gene expression network²⁹. Causal discovery allows us to understand complex systems whose internal structures are not directly accessible, and to discover possible models for the internal mechanisms. Quantum causal discovery, likewise, enables the modelling of quantum physical processes with inaccessible internal structure, for example, discovering the individual interactions in scattering experiments. Below we list some specific examples where our causal unravelling algorithms can be applied to the detection of correlations and the modelling of internal structures of complex processes.

First, causal relations among quantum variables are relevant to the study of quantum networks^30,31,32, where the presence of a causal relation between two systems can be used to test whether it is possible to send signals from one node to another. In a realistic setting, the signalling patterns within a quantum network may change dynamically, depending on the number of users of the network at a given moment of time, on the way the messages are routed from the senders to the receivers, and also on changes in the environment, which may affect the availability of transmission paths between nodes. Such a dynamical structure occurs frequently in classical wireless networks^33,34, and is likely to arise in a future quantum internet. In this context, our algorithms provide an efficient way to detect dynamical changes in the availability of data transmission paths.

The detection of causal relations is also relevant to the verification of quantum devices, as it can be used as an initial test to determine whether a given quantum device generates input–output correlations with a desired causal structure. Such a test could serve as an initial screening to rule out devices that are not suitable for a given task, and could be followed by more refined quantum benchmarks³⁵ which quantify how well the device performs a desired task. In this context, the benefit of the causal unravelling test is that it could save the effort of performing more refined tests in case the process under consideration does not comply with the desired causal structure.

Finally, our causal unravelling algorithm can be used as a preliminary step to full process tomography. By detecting the causal structure of multipartite quantum processes, one can sometimes design a tailor-made tomography scheme that ignores unnecessary correlations, and achieves full process tomography without requiring an exponentially large number of measurement setups. For example, a process that admits a causal unravelling with systems of bounded dimension at every step can be efficiently represented by a tensor network state^36,37, for which tomography can be performed efficiently³⁸. In a quantum communication network, efficient tomography of the transmission paths is crucial for the design of encoding, decoding and calibration schemes for more efficient data transmission. In physics experiments, the causal structure and tomography data are useful for modelling the underlying physical process, for example, by finding the smallest quantum model that reproduces the observed data^39,40,41. More generally, characterizing the causal structure of a multipartite process as a tensor network enables the use of efficient protocols that exploits the tensor network structure, including simulation protocols^37,42,43 and compression protocols⁴⁴.

Methods

Efficient tests for the last tooth via the SWAP test

In the ‘Results’, we have given the framework of Algorithm 1. In this section, we discuss how the tests for the last tooth, namely line 5 of Algorithm 1, can be carried out efficiently.

Consider a process ${{{\mathcal{C}}}}$ of three input wires A₁, A₂, A₃ and three output wires B₁, B₂, B₃, and suppose that we want to test whether (A₁, B₁) is the last tooth, which is equivalent to the independence test between A₁ and A₂A₃B₂B₃ according to Proposition 1. Testing the independence between A₁ and A₂A₃B₂B₃ can be converted to the estimation of ${\chi }_{1}{\left({A}_{1};{A}_{2}{A}_{3}{B}_{2}{B}_{3}\right)}_{C}=\parallel {C}_{{A}_{1}{A}_{2}{A}_{3}{B}_{2}{B}_{3}}-{C}_{{A}_{1}}\otimes {C}_{{A}_{2}{A}_{3}{B}_{2}{B}_{3}}{\parallel }_{1}$, which is the distance between marginal Choi states. Each copy of ${C}_{{A}_{1}{A}_{2}{A}_{3}{B}_{2}{B}_{3}}$ or ${C}_{{A}_{2}{A}_{3}{B}_{2}{B}_{3}}$ can be prepared with one use of the process ${{{\mathcal{C}}}}$. Note that ${C}_{{A}_{1}}$ equals to ${I}_{{A}_{1}}/{d}_{{A}_{1}}$ by definition of a CPTP map. Given the ability to prepare the marginal Choi states, we now consider the estimation of their distance, which gives the value of ${\chi }_{1}{\left({A}_{1};{A}_{2}{A}_{3}{B}_{2}{B}_{3}\right)}_{C}$. In the following, we first talk about the estimation of another distance measure, the Hilbert–Schmidt distance, and use it to bound the trace distance as used by χ₁.

Generally, the Hilbert–Schmidt distance between two states ρ and σ is defined as ∥ρ − σ∥₂, where $\parallel X{\parallel }_{2}:= \sqrt{{{\mathrm{Tr}}}\,[{X}^{{\dagger} }X]}$ denotes the Schatten 2-norm, also known as the Frobenius norm. It is related to the trace distance by the following inequality⁴⁵:

$$2{\left\Vert \rho -\sigma \right\Vert }_{2}^{2}\le {\left\Vert \rho -\sigma \right\Vert }_{1}^{2}\le \frac{4{\mathrm{rank}}(\rho ){\mathrm{rank}}(\sigma )}{{\mathrm{rank}}(\rho )+{\mathrm{rank}}(\sigma )}{\left\Vert \rho -\sigma \right\Vert }_{2}^{2}.$$

(5)

The Hilbert–Schmidt distance between two quantum states can be estimated via SWAP tests²⁵. The SWAP test uses the quantum circuit in Fig. 4 to estimate ${{\mathrm{Tr}}}\,[\rho \sigma ]$ for two given quantum states ρ and σ.

If we run the circuit in Fig. 4 for N times and let c₊ be the number of times observing outcome $\left|+\right\rangle$, then 2c₊/N − 1 is an estimate of ${{\mathrm{Tr}}}\,[\rho \sigma ]$. The algorithm that yields an estimate of ${{\mathrm{Tr}}}\,[\rho \sigma ]$ is as follows, which produces an estimate with error no more than ε with probability 1 − κ as shown in Lemma 1.

Function 1: SWAPTEST(ρ, σ, ε, κ)

	Input: Quantum states ρ and σ (accessed by oracles that generate the states), error threshold ε, confidence κ
	Output: Approximate value of ${{\mathrm{Tr}}}\,[\rho \sigma ]$
1	$N\leftarrow \lceil 2{\varepsilon }^{-2}\log (2/\kappa )\rceil;$
2	Run the circuit in Fig. 4 for N times. Let c₊ be the number of outcome $\left\|+\right\rangle;$
3	Return 2c₊/N − 1.

Lemma 1. With probability 1 − κ, the SWAP test estimates ${{\mathrm{Tr}}}\,[\rho \sigma ]$ within error ε.

Proof. For each run, the probability that the outcome is $\left|+\right\rangle$ is $(1+{{\mathrm{Tr}}}\,[\rho \sigma ])/2$. By Hoeffding’s inequality,

$$\begin{array}{l}\Pr \left[\left|\frac{{c}_{+}}{N}-\frac{1+{{\mathrm{Tr}}}\,[\rho \sigma ]}{2}\right|\le \varepsilon /2\right]\ge 1-2{e}^{-{\varepsilon }^{2}N/2}\\ \Pr \left[\left|2{c}_{+}/N-1-{{\mathrm{Tr}}}\,[\rho \sigma ]\right|\le \varepsilon \right]\ge 1-2{e}^{-{\varepsilon }^{2}(2{\varepsilon }^{-2}\log (2/\kappa ))/2}=1-\kappa \end{array}$$

(6)

□

The SWAP test is efficient in the sense that its circuit complexity is linear in the number of qubits representing the systems. For d-dimensional states ρ and σ, the controlled-SWAP gate acting on ρ and σ can be realized with $\lceil \log d\rceil$ controlled-SWAP gates acting on each pair of corresponding qubits of ρ and σ, and thus can be implemented with $O(\log d)$ gates.

Running SWAP tests on ρ ⊗ σ, ρ ⊗ ρ and σ ⊗ σ, we are able to obtain estimates for ${{\mathrm{Tr}}}\,[\rho \sigma ]$, ${{\mathrm{Tr}}}\,[{\rho }^{2}]$ and ${{\mathrm{Tr}}}\,[{\sigma }^{2}]$, respectively. From these estimates we can compute the Hilbert–Schmidt distance between ρ and σ according to the following equation:

$${\left\Vert \rho -\sigma \right\Vert }_{2}^{2}={{\mathrm{Tr}}}\,[{\rho }^{2}]+{{\mathrm{Tr}}}\,[{\sigma }^{2}]-2{{\mathrm{Tr}}}\,[\rho \sigma ].$$

(7)

Coming back to the independence tests, by applying SWAP tests on marginal Choi states ${C}_{{A}_{1}{A}_{2}{A}_{3}{B}_{2}{B}_{3}}$ and ${C}_{{A}_{1}}\otimes {C}_{{A}_{2}{A}_{3}{B}_{2}{B}_{3}}$, we can estimate their Hilbert-Schimidt distance. Using Eq. (5), we can bound their trace distance, obtain a bound on ${\chi }_{1}{\left({A}_{1};{A}_{2}{A}_{3}{B}_{2}{B}_{3}\right)}_{C}$, determine the independence between A₁ and A₂A₃B₂B₃, and decide whether (A₁, B₁) is the last tooth. This procedure is summarized in Function 2.

Function 2: checklast(${{{\mathcal{C}}}}$, _Ax, B_y, ε, δ, κ)

This completes Algorithm 1. The choices of ε and δ are done in the detailed error analysis in Supplementary Note 2. To ensure the correctness and efficiency, we need to further ensure that Eq. (5) provides a useful bound by giving upper bounds on the ranks of the marginal Choi states. Such upper bounds can be obtained from Assumption 1, and the details are in Supplementary Note 2.

We have shown the possibility of using the SWAP test as an efficient test of independence. The SWAP test could in principle be replaced by any algorithm for estimating the Hilbert–Schmidt distance, with possibly lower sample complexity⁴⁶. In a quantum network scenario, causal unravelling may be performed by multiple parties, each of which has access to one input system or one output system. A bonus of the SWAP test is that, since a controlled-SWAP gate for two multipartite states can be decomposed into controlled-SWAP gates on the local systems of each party, only the control qubit needs to be transferred from one party to another in each round of the test. In addition, the control qubit is measured after each round, and therefore parties do not need to carry quantum memories over subsequent rounds.

Testing the independence between all input–output pairs

In this section, we show the algorithm to test the independence between every pair of input and output wires, resulting in a Boolean matrix ind_ij. The first step is to collect statistics of the channel ${{{\mathcal{C}}}}$. When collecting data, we need to ensure that the information encoded in the quantum data are preserved, for which purpose we use informationally complete POVMs⁴⁷. The elements of such a POVM on Hilbert space ${{{\mathcal{H}}}}$ form a spanning set of $L({{{\mathcal{H}}}})$, and the number of elements can be chosen as ${(\dim {{{\mathcal{H}}}})}^{2}$.

We pick an informationally complete POVM ${\{{P}_{{A}_{i},\alpha }\}}_{\alpha = 1}^{{d}_{{A}_{i}}^{2}}$ for each input wire A_i and ${\{{Q}_{{B}_{j},\beta }\}}_{\beta = 1}^{{d}_{{B}_{j}}^{2}}$ for each output wire B_j. We pick N to be the number of queries to the channel ${{{\mathcal{C}}}}$. In each query, we measure the Choi state of ${{{\mathcal{C}}}}$ with the POVM

$${\left\{{P}_{{A}_{1},{\alpha }_{1}}\otimes \cdots \otimes {P}_{{A}_{n},{\alpha }_{n}}\otimes {Q}_{{B}_{1},{\beta }_{1}}\otimes \cdots \otimes {Q}_{{B}_{n},{\beta }_{n}}\right\}}_{{\alpha }_{1},\ldots ,{\alpha }_{n},{\beta }_{1},\ldots ,{\beta }_{n}}$$

(8)

This POVM can be realized with local measurements on each wire. Let $({\alpha }_{1}^{(k)},\ldots ,{\alpha }_{n}^{(k)},{\beta }_{1}^{(k)},\ldots ,{\beta }_{n}^{(k)})$ be the outcome of the k-th query. We can write the outcomes in a matrix as shown in Fig. 5.

**Fig. 5: Matrix of measurement outcomes.**

This matrix will be used to determine the independence between every pair of input and output wires. When we investigate A_i and B_j, we only look at the corresponding columns in the matrix and check the independence from data in these columns. Since we are using an informationally complete POVM, the independence between the two columns of the matrix is equivalent to the independence between the input A_i and output B_j. With this idea, we will be able to compute a Boolean matrix ind_ij such that ind_ij = true if and only if A_i and B_j are independent, up to an error related to N, the number of queries.

One may notice that the POVM in Eq. (8) is enough for a full process tomography of ${{{\mathcal{C}}}}$. This is true if we pick an exponentially large N as large as the square of the product of all input and output dimensions. However, to compute ind_ij with a small error, N does not need to be large. We show that N could be chosen to grow only logarithmically with n, far less than a full process tomography.

Lemma 2. Given a channel ${{{\mathcal{C}}}}$ with input wires A₁, …, A_n and output wires B₁, …, B_n, there exists an algorithm that satisfies the following conditions:

1.
With probability 1 − n²κ₀, the algorithm produces an output satisfying:
1. (a)
  if ind_i,j = false, then ${\chi }_{1}({C}_{{A}_{i},{B}_{j}}) \,>\, {\chi }_{-}-{\varepsilon }_{0}$
2. (b)
  if ind_i,j = true, then ${\chi }_{1}({C}_{{A}_{i},{B}_{j}})\le {\chi }_{-}+{\varepsilon }_{0}$
  
  where χ₋ > 0 is a threshold that can be freely chosen.
2.
The number of queries to C is in the order of
$$N=O\left({d}_{A}^{6}{d}_{B}^{6}{\varepsilon }_{0}^{-2}\log ({d}_{A}{d}_{B}{\kappa }_{0}^{-1})\right).$$
(9)

The detailed algorithm and the proof are in Supplementary Note 3.

The Boolean matrix ind_ij gives a partial order of the wires: if ind_ij = false, then A_i must be before B_j. This partial order may not produce the full ordering of the wires, but will be a convenient initial guess for the causal unravelling, since its complexity is much lower than the general algorithm Algorithm 1 for large n. In the ‘Results’, we show that under Assumptions 3 or 4, this partial order is enough to infer the full order.

Furthermore, the matrix of outcomes (Fig. 5) is a conversion from the quantum process to classical data, making it possible to adopt classical causal discovery algorithms^1,3,4,5. For example, one could use algorithms based on the graph model, and try to find a graph in the form of Fig. 3 that best fits the observation data in Fig. 5.

Data availability

The authors declare that the data supporting the findings of this study are available within the paper and in the Supplementary Information files.

References

Spirtes, P., Glymour, C. N., Scheines, R. & Heckerman, D. Causation, Prediction, and Search (MIT Press, 2000).
Pearl, J. Causality (Cambridge University Press, 2009).
Heinze-Deml, C., Maathuis, M. H. & Meinshausen, N. Causal structure learning. Annu. Rev. Stat. Appl. 5, 371–391 (2018).
Article MathSciNet Google Scholar
Chickering, D. M. Optimal structure identification with greedy search. J. Mach. Learn. Res. 3, 507–554 (2002).
MathSciNet MATH Google Scholar
Tsamardinos, I., Brown, L. E. & Aliferis, C. F. The max-min hill-climbing Bayesian network structure learning algorithm. Mach. Learn. 65, 31–78 (2006).
Article Google Scholar
Chickering, D. M. In Learning from Data 121–130 (Springer, 1996).
Chickering, M., Heckerman, D. & Meek, C. Large-sample learning of Bayesian networks is np-hard. J. Mach. Learn. Res. 5, 1287–1330 (2004).
MathSciNet MATH Google Scholar
Wood, C. J. & Spekkens, R. W. The lesson of causal discovery algorithms for quantum correlations: causal explanations of Bell-inequality violations require fine-tuning. N. J. Phys. 17, 033002 (2015).
Article Google Scholar
Van Himbeeck, T. et al. Quantum violations in the instrumental scenario and their relations to the Bell scenario. Quantum 3, 186 (2019).
Article Google Scholar
Henson, J., Lal, R. & Pusey, M. F. Theory-independent limits on correlations from generalized Bayesian networks. N. J. Phys. 16, 113043 (2014).
Article Google Scholar
Pienaar, J. & Brukner, Č. A graph-separation theorem for quantum causal models. N. J. Phys. 17, 073020 (2015).
Article Google Scholar
Costa, F. & Shrapnel, S. Quantum causal modelling. N. J. Phys. 18, 063032 (2016).
Article Google Scholar
Allen, J.-M. A., Barrett, J., Horsman, D. C., Lee, C. M. & Spekkens, R. W. Quantum common causes and quantum causal models. Phys. Rev. X 7, 031021 (2017).
Google Scholar
Barrett, J., Lorenz, R. & Oreshkov, O. Quantum causal models. Preprint at https://arxiv.org/abs/1906.10726 (2019).
Barrett, J., Lorenz, R. & Oreshkov, O. Cyclic quantum causal models. Nat. Commun. 12, 885 (2021).
Article ADS Google Scholar
Ried, K. et al. A quantum advantage for inferring causal structure. Nat. Phys. 11, 414–420 (2015).
Article Google Scholar
Fitzsimons, J. F., Jones, J. A. & Vedral, V. Quantum correlations which imply causation. Sci. Rep. 5, 1–7 (2015).
Google Scholar
Chiribella, G. & Ebler, D. Quantum speedup in the identification of cause–effect relations. Nat. Commun. 10, 1472 (2019).
Article ADS Google Scholar
Giarmatzi, C. & Costa, F. A quantum causal discovery algorithm. npj Quantum Inf. 4, 1–9 (2018).
Article Google Scholar
Pollock, F. A., Rodríguez-Rosario, C., Frauenheim, T., Paternostro, M. & Modi, K. Operational Markov condition for quantum processes. Phys. Rev. Lett. 120, 040405 (2018).
Article ADS Google Scholar
Berk, G. D., Garner, A. J., Yadin, B., Modi, K. & Pollock, F. A. Resource theories of multi-time processes: a window into quantum non-Markovianity. Quantum 5, 435 (2021).
Article Google Scholar
Chiribella, G., D’Ariano, G. M. & Perinotti, P. Quantum circuit architecture. Phys. Rev. Lett. 101, 060401 (2008).
Article ADS Google Scholar
Chiribella, G., D’Ariano, G. M. & Perinotti, P. Theoretical framework for quantum networks. Phys. Rev. A 80, 022339 (2009).
Article ADS MathSciNet Google Scholar
Gutoski, G., Hayden, P., Milner, K. & Wilde, M. M. Quantum interactive proofs and the complexity of separability testing. Theory Comput. 11, 59 (2015).
Article MathSciNet Google Scholar
Buhrman, H., Cleve, R., Watrous, J. & De Wolf, R. Quantum fingerprinting. Phys. Rev. Lett. 87, 167902 (2001).
Article ADS Google Scholar
Choi, M.-D. Completely positive linear maps on complex matrices. Linear Algebra Appl. 10, 285–290 (1975).
Article MathSciNet Google Scholar
Jia, D. Quantifying causality in quantum and general models. Preprint at https://arxiv.org/abs/1801.06293 (2018).
Kitaev, A. Y., Shen, A., Vyalyi, M. N. & Vyalyi, M. N. Classical and Quantum Computation. Number 47 (American Mathematical Soc., 2002).
Spirtes, P. et al. Constructing Bayesian Network Models of Gene Expression Networks from Microarray Data (2000).
Kimble, H. J. The quantum internet. Nature 453, 1023–1030 (2008).
Elliott, C. Building the quantum network. N. J. Phys. 4, 46 (2002).
Article Google Scholar
Wehner, S., Elkouss, D. & Hanson, R. Quantum internet: a vision for the road ahead. Science 362, eaam9288 (2018).
Johnson, D. B. & Maltz, D. A. In Mobile Computing 153–181 (Springer, 1996).
Royer, E. M. & Toh, C.-K. A review of current routing protocols for ad hoc mobile wireless networks. IEEE Pers. Commun. 6, 46–55 (1999).
Article Google Scholar
Bai, G. & Chiribella, G. Test one to test many: a unified approach to quantum benchmarks. Phys. Rev. Lett. 120, 150502 (2018).
Article ADS Google Scholar
Fannes, M., Nachtergaele, B. & Werner, R. F. Finitely correlated states on quantum spin chains. Commun. Math. Phys. 144, 443–490 (1992).
Article ADS MathSciNet Google Scholar
Verstraete, F., Murg, V. & Cirac, J. I. Matrix product states, projected entangled pair states, and variational renormalization group methods for quantum spin systems. Adv. Phys. 57, 143–224 (2008).
Article ADS Google Scholar
Cramer, M. et al. Efficient quantum state tomography. Nat. Commun. 1, 149 (2010).
Article ADS Google Scholar
Gu, M., Wiesner, K., Rieper, E. & Vedral, V. Quantum mechanics can reduce the complexity of classical models. Nat. Commun. 3, 762 (2012).
Article ADS Google Scholar
Monras, A. & Winter, A. Quantum learning of classical stochastic processes: the completely positive realization problem. J. Math. Phys. 57, 015219 (2016).
Article ADS MathSciNet Google Scholar
Thompson, J., Garner, A. J., Vedral, V. & Gu, M. Using quantum theory to simplify input–output processes. npj Quantum Inf. 3, 1–8 (2017).
Article ADS Google Scholar
Shi, Y.-Y., Duan, L.-M. & Vidal, G. Classical simulation of quantum many-body systems with a tree tensor network. Phys. Rev. A 74, 022320 (2006).
Article ADS Google Scholar
Vidal, G. Class of quantum many-body states that can be efficiently simulated. Phys. Rev. Lett. 101, 110501 (2008).
Article ADS Google Scholar
Bai, G., Yang, Y. & Chiribella, G. Quantum compression of tensor network states. N. J. Phys. 22, 043015 (2020).
Article MathSciNet Google Scholar
Coles, P. J., Cerezo, M. & Cincio, L. Strong bound between trace distance and Hilbert-Schmidt distance for low-rank states. Phys. Rev. A 100, 022103 (2019).
Article ADS Google Scholar
Bădescu, C., O’Donnell, R. & Wright, J. Quantum state certification. In Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing 503–514 (2019).
Prugovečki, E. Information-theoretical aspects of quantum measurement. Int. J. Theor. Phys. 16, 321–331 (1977).
Article Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China through grant 11675136, the Hong Kong Research Grant Council through grants 17300918 and 17307520, and though the Senior Research Fellowship Scheme SRFS2021-7S02, the Croucher Foundation, and the John Templeton Foundation through grant 61466, The Quantum Information Structure of Spacetime (qiss.fr). Research at the Perimeter Institute is supported by the Government of Canada through the Department of Innovation, Science and Economic Development Canada and by the Province of Ontario through the Ministry of Research, Innovation and Science. The opinions expressed in this publication are those of the authors and do not necessarily reflect the views of the John Templeton Foundation. The work of M. Hayashi was supported in part by Guangdong Provincial Key Laboratory (Grant No. 2019B121203002).

Author information

Authors and Affiliations

QICI Quantum Information and Computation Initiative, Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong, China
Ge Bai, Ya-Dong Wu, Yan Zhu & Giulio Chiribella
HKU-Oxford Joint Laboratory for Quantum Information and Computation, Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong, China
Ge Bai, Ya-Dong Wu, Yan Zhu & Giulio Chiribella
Shenzhen Institute for Quantum Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
Masahito Hayashi
Guangdong Provincial Key Laboratory of Quantum Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
Masahito Hayashi
Graduate School of Mathematics, Nagoya University, Nagoya, 464-8602, Japan
Masahito Hayashi
Department of Computer Science, University of Oxford, Parks Road, Oxford, OX1 3QD, UK
Giulio Chiribella
Perimeter Institute for Theoretical Physics, 31 Caroline Street North, Waterloo, N2L 2Y5, Ontario, Canada
Giulio Chiribella

Authors

Ge Bai
View author publications
You can also search for this author in PubMed Google Scholar
Ya-Dong Wu
View author publications
You can also search for this author in PubMed Google Scholar
Yan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Masahito Hayashi
View author publications
You can also search for this author in PubMed Google Scholar
Giulio Chiribella
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.B., Y.-D.W., Y.Z. and G.C. contributed to the development of the first algorithm. M.H. and G.B. proposed and refined the algorithms with local observations. G.C. proposed the problem and supervised this project. All authors discussed extensively the research presented in this paper and contributed to the writing of this manuscript.

Corresponding author

Correspondence to Giulio Chiribella.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Quantum Causal Unravelling - Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bai, G., Wu, YD., Zhu, Y. et al. Quantum causal unravelling. npj Quantum Inf 8, 69 (2022). https://doi.org/10.1038/s41534-022-00578-4

Download citation

Received: 01 October 2021
Accepted: 19 May 2022
Published: 21 June 2022
DOI: https://doi.org/10.1038/s41534-022-00578-4

	Input: Quantum states ρ and σ (accessed by oracles that generate the states), error threshold ε, confidence κ
	Output: Approximate value of \({{\mathrm{Tr}}}\,[\rho \sigma ]\)
1	\(N\leftarrow \lceil 2{\varepsilon }^{-2}\log (2/\kappa )\rceil;\)
2	Run the circuit in Fig. 4 for N times. Let c₊ be the number of outcome \(\left\|+\right\rangle;\)
3	Return 2c₊/N − 1.