# Variational consistent histories as a hybrid algorithm for quantum foundations

## Abstract

Although quantum computers are predicted to have many commercial applications, less attention has been given to their potential for resolving foundational issues in quantum mechanics. Here we focus on quantum computers’ utility for the Consistent Histories formalism, which has previously been employed to study quantum cosmology, quantum paradoxes, and the quantum-to-classical transition. We present a variational hybrid quantum-classical algorithm for finding consistent histories, which should revitalize interest in this formalism by allowing classically impossible calculations to be performed. In our algorithm, the quantum computer evaluates the decoherence functional (with exponential speedup in both the number of qubits and the number of times in the history) and a classical optimizer adjusts the history parameters to improve consistency. We implement our algorithm on a cloud quantum computer to find consistent histories for a spin in a magnetic field and on a simulator to observe the emergence of classicality for a chiral molecule.

## Introduction

The foundations of quantum mechanics (QM) have been debated for the past century1,2, including topics such as the Einstein–Podolsky–Rosen (EPR) paradox, hidden-variable theories, Bell’s Theorem, Born’s rule, and the role of measurements in QM. This also includes the quantum-to-classical transition, i.e., the emergence of classical behavior (objectivity, irreversibility, lack of interference, etc.) from quantum laws3,4,5.

The Consistent Histories (CH) formalism was introduced by Griffiths6, Omnès7, Gell-Mann, and Hartle to address some (though not all) of the aforementioned issues8. One inventor considered CH to be “the Copenhagen interpretation done right”6, as it resolves some of the paradoxes of QM by enforcing strict rules for logical reasoning with quantum systems. In this formalism, the Copenhagen interpretation’s focus on measurements as the origin of probabilities is replaced by probabilities for sequences of events (histories) to occur, and hence by avoiding measurements it avoids the measurement problem. The sets of histories whose probabilities are additive (as the histories do not interfere with each other) are considered to be consistent and are thus the only ones able to be reasoned about in terms of classical probability and logic7.

Regardless of one’s opinion of the philosophical interpretation (on which this paper is agnostic), this computational framework has proven useful in applications such as attempting to solve the cosmological measure problem9,10, understanding quantum jumps11, and evaluating the arrival time for particles at a detector12,13,14. One of the main reasons that this framework has not received more attention and use is that carrying out the calculations for non-trivial cases (e.g., discrete systems of appreciable size or continuous systems that do not admit approximate descriptions by exactly solvable path integrals) can be difficult11,15. Although numerical approaches have been attempted16,17, they require exponentially scaling resources as either the number of times considered or the system size grows. This makes classical numerical approaches unusable for any but the simplest cases.

With the impending arrival of the first noisy intermediate-scale quantum computers18, the field of variational hybrid quantum-classical algorithms (VHQCAs), which make the most of short quantum circuits combined with classical optimizers, has been taking off. VHQCAs have now been demonstrated for a myriad of tasks ranging from factoring to finding ground states, among others19,20,21,22,23,24,25,26. The VHQCA framework potentially brings the practical applications of quantum computers years closer to fruition.

Here we present a scalable VHQCA for the CH formalism. Our algorithm achieves an exponential speedup over classical methods both in terms of the system size and the number of times considered. It will allow exploration beyond toy models, such as the quantum-to-classical transition in mesoscopic quantum systems. We implement this algorithm on IBM’s superconducting qubit quantum processor and obtain results in good agreement with theoretical expectations, suggesting that useful implementations of our algorithm may be feasible on near-term quantum devices.

## Results

### Consistent histories background

In the CH framework27,28,29, a history $${\cal{Y}}^{\boldsymbol{\alpha }}$$ is a sequence of properties (i.e., projectors onto the appropriate subspaces) at a succession of times $$t_1 < t_2 < \ldots < t_k$$,

$${\cal{Y}}^{\boldsymbol{\alpha }} = (P_1^{\alpha _1},P_2^{\alpha _2}, \ldots ,P_k^{\alpha _k}),$$
(1)

where $$P_j^{\alpha _j}$$ is chosen from a set $$P_j$$ of projectors that sum to the identity at time $$t_j$$. For example, for a photon passing through a sequence of diffraction gratings and then striking a screen, a history could be that the photon passed through one slit in the first grating, another slit in the second, and so on. Clearly, we find interference between such histories unless there is some sense in which the photon’s path has been recorded. As there is interference, we cannot add the probabilities of the different histories classically and expect to correctly predict where the photon strikes the screen.

The CH framework provides tools for determining when a family (i.e., a set that sums to the multi-time identity operator) of histories $${\cal{F}} = \{ {\cal{Y}}^{\boldsymbol{\alpha }}\}$$ exhibits interference, which is not always obvious. In this framework, one defines the so-called class operator

$${\cal{C}}^{\boldsymbol{\alpha }} = P_k^{\alpha _k}(t_k)P_{k - 1}^{\alpha _{k - 1}}(t_{k - 1}) \ldots P_1^{\alpha _1}(t_1),$$
(2)

which is the time-ordered product of the projection operators (now in the Heisenberg picture and hence explicitly time dependent) in history $${\cal{Y}}^{\boldsymbol{\alpha }}$$. If the system is initially described by a density matrix ρ, the degree of interference or overlap between histories $${\cal{Y}}^{\boldsymbol{\alpha }}$$ and $${\cal{Y}}^{{\boldsymbol{\alpha }}\prime }$$ is

$${\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime ) = {\mathrm{Tr}}\left( {{\cal{C}}^{\boldsymbol{\alpha }}\rho \,{\cal{C}}^{{\boldsymbol{\alpha }}\prime \dagger }} \right).$$
(3)

This quantity is called the decoherence functional. The consistency condition for a family of histories $${\cal{F}}$$ is then

$${\mathrm{Re}}({\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )) = 0,\quad \forall {\boldsymbol{\alpha }} \ne {\boldsymbol{\alpha }}\prime .$$
(4)

If and only if this condition holds do we say that $${\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }})$$ is the probability for history $${\cal{Y}}^{\boldsymbol{\alpha }}$$. For computational convenience, we will instead work with a stronger condition28:

$${\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime ) = 0{\mathrm{,}}\quad \forall {\boldsymbol{\alpha }} \ne {\boldsymbol{\alpha }}\prime .$$
(5)

As we are presenting a numerical algorithm, it will also be useful to consider approximate consistency, where we merely insist that the interference is small in the following sense:

$$|{\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )|^2 \le \varepsilon ^2{\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}){\cal{D}}({\boldsymbol{\alpha }}\prime ,{\boldsymbol{\alpha }}\prime ){\mathrm{,}}\quad \forall {\boldsymbol{\alpha }} \ne {\boldsymbol{\alpha }}\prime ,$$
(6)

which guarantees that probability sum rules for $${\cal{F}}$$ are satisfied within an error of ε30.
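The definitions in Eqs. (2)–(6) can be checked by brute force on a very small system. The following is a minimal classical sketch, not part of VCH itself: the helper names (`projectors_from_basis`, `decoherence_functional`, `consistency_report`) and the single-qubit example with trivial dynamics are illustrative assumptions. It builds the class operators, evaluates the decoherence functional, and reports whether the weak, strong, and approximate consistency conditions hold.

```python
import itertools
import numpy as np

def projectors_from_basis(basis):
    """Rank-one projectors onto the columns of a unitary matrix."""
    return [np.outer(basis[:, i], basis[:, i].conj()) for i in range(basis.shape[1])]

def decoherence_functional(rho, propagators, projector_sets):
    """D(alpha, alpha') = Tr(C^alpha rho C^alpha'^dagger), Eq. (3).

    propagators[j] is the Schrodinger-picture propagator from time 0 to t_j,
    so the Heisenberg-picture projector of Eq. (2) is U_j^dagger P U_j.
    """
    labels = list(itertools.product(*[range(len(ps)) for ps in projector_sets]))
    class_ops = []
    for alpha in labels:
        c = np.eye(rho.shape[0], dtype=complex)
        for u, ps, a in zip(propagators, projector_sets, alpha):
            c = (u.conj().T @ ps[a] @ u) @ c  # later times multiply from the left
        class_ops.append(c)
    d = np.array([[np.trace(ca @ rho @ cb.conj().T) for cb in class_ops]
                  for ca in class_ops])
    return labels, d

def consistency_report(d, tol=1e-12):
    """Check Eqs. (4) and (5), and find the smallest epsilon satisfying Eq. (6)."""
    probs = np.real(np.diag(d))
    off = d - np.diag(np.diag(d))
    eps = 0.0
    for a, b in itertools.combinations(range(len(probs)), 2):
        if abs(d[a, b]) > tol:
            eps = max(eps, abs(d[a, b]) / np.sqrt(probs[a] * probs[b]))
    return {
        "weak": bool(np.all(np.abs(off.real) < tol)),   # Eq. (4)
        "strong": bool(np.all(np.abs(off) < tol)),      # Eq. (5)
        "epsilon": float(eps),                          # Eq. (6)
    }

# Illustrative example (an assumption, not from the paper): one qubit in |0>,
# trivial dynamics, and two-time histories built from the Z and X bases.
identity = np.eye(2)
z_set = projectors_from_basis(np.eye(2))
x_set = projectors_from_basis(np.array([[1, 1], [1, -1]]) / np.sqrt(2))
rho0 = np.diag([1.0, 0.0])

_, d_zx = decoherence_functional(rho0, [identity, identity], [z_set, x_set])
_, d_xz = decoherence_functional(rho0, [identity, identity], [x_set, z_set])
report_zx = consistency_report(d_zx)  # Z first, then X
report_xz = consistency_report(d_xz)  # X first, then Z
```

In this toy case, asking about Z and then X gives a consistent family, whereas asking about X and then Z does not, because the two X outcomes interfere at the later time.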

To study consistency arising purely from decoherence (i.e., records in the environment), researchers have proposed a functional that instead takes a partial trace over E, which is (a subsystem of) the environment31,32:

$${\cal{D}}_{{\mathrm{pt}}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime ) = {\mathrm{Tr}}_{\mathrm{E}}\left( {{\cal{C}}^{\boldsymbol{\alpha }}\rho \,{\cal{C}}^{{\boldsymbol{\alpha }}\prime \dagger }} \right).$$
(7)

With this modification, the consistency condition is

$${\cal{D}}_{{\mathrm{pt}}} ({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime ) = {\mathbf{0}},\quad \forall {\boldsymbol{\alpha }} \ne {\boldsymbol{\alpha }}\prime ,$$
(8)

where 0 is the zero matrix. Instead of only signifying the lack of interference, partial-trace consistency singles out whether or not the records of the histories in the environment interfere. It is noteworthy that the full-trace condition of Eq. (5) is satisfied when this partial-trace consistency is satisfied, but the converse does not hold31.
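To illustrate the difference between Eqs. (5) and (8), the sketch below evaluates the partial-trace functional for a system qubit and one environment qubit. The two states and the helper names are illustrative assumptions: in one case the environment holds no record of the system, and in the other it holds a perfect record. Single-time histories are used, so the full-trace condition holds trivially in both cases.

```python
import numpy as np

def trace_out_env(op, dim_s, dim_e):
    """Partial trace over the second (environment) tensor factor."""
    return np.einsum("aebe->ab", op.reshape(dim_s, dim_e, dim_s, dim_e))

def partial_trace_functional(rho, class_ops, dim_s, dim_e):
    """Matrix-valued D_pt(alpha, alpha') = Tr_E(C^alpha rho C^alpha'^dagger), Eq. (7)."""
    return [[trace_out_env(ca @ rho @ cb.conj().T, dim_s, dim_e) for cb in class_ops]
            for ca in class_ops]

# Single-time histories of the system qubit in the computational basis.
p0, p1 = np.diag([1.0, 0.0]), np.diag([0.0, 1.0])
class_ops = [np.kron(p, np.eye(2)) for p in (p0, p1)]

# Assumed example states: |+>|0> (no record) and (|00> + |11>)/sqrt(2) (record).
plus = np.array([1.0, 1.0]) / np.sqrt(2)
no_record = np.kron(plus, np.array([1.0, 0.0]))
with_record = np.array([1.0, 0.0, 0.0, 1.0]) / np.sqrt(2)

d_pt_no_record = partial_trace_functional(np.outer(no_record, no_record), class_ops, 2, 2)
d_pt_with_record = partial_trace_functional(np.outer(with_record, with_record), class_ops, 2, 2)

# Hilbert-Schmidt norm of the off-diagonal block, and its trace (the full-trace D).
norm_no_record = np.linalg.norm(d_pt_no_record[0][1])
norm_with_record = np.linalg.norm(d_pt_with_record[0][1])
full_trace_no_record = np.trace(d_pt_no_record[0][1])
```

Without a record the off-diagonal block is non-zero even though its trace vanishes, so the family is full-trace consistent but not partial-trace consistent. With a record in the environment, the block itself vanishes.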

With this formalism in hand, we can now see why classical numerical schemes for CH have faced difficulty. For example, consider histories of a collection of n spin-1/2 particles for k time steps, depicted in Fig. 1. The number of histories is $$2^{nk}$$ and hence there are $$\sim 2^{2nk}$$ decoherence functional elements. Furthermore, evaluating each decoherence functional element $${\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )$$ requires the equivalent of a Hamiltonian simulation of the system, i.e., the multiplication of $$2^n \times 2^n$$ matrices. This means modern clusters would take centuries to evaluate the consistency of a family of histories with k = 2 time steps and n = 10 spins. Given this limitation, we can see why, for the most part, only toy models have been analyzed in this framework thus far.
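The counting in this paragraph is easy to reproduce. The short sketch below (the function name `history_counts` is an illustrative assumption) evaluates the number of histories, the number of decoherence functional elements, and the matrix dimension for the quoted case of n = 10 spins and k = 2 time steps.

```python
def history_counts(n_spins, k_times):
    """Counts for fine-grained histories of n spin-1/2 particles at k times."""
    n_histories = 2 ** (n_spins * k_times)  # 2^(nk) histories
    n_elements = n_histories ** 2           # ~2^(2nk) decoherence functional elements
    matrix_dim = 2 ** n_spins               # each element involves 2^n x 2^n matrices
    return n_histories, n_elements, matrix_dim

n_histories, n_elements, matrix_dim = history_counts(n_spins=10, k_times=2)
print(n_histories, n_elements, matrix_dim)
# prints: 1048576 1099511627776 1024
```

Roughly a million histories already give about 10^12 functional elements, each of which requires products of 1024 × 1024 matrices.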

### Hybrid algorithm for finding consistent histories

We refer to our VHQCA as Variational Consistent Histories (VCH), see Fig. 2. VCH takes as its inputs a physical model (i.e., an initial state ρ and a Hamiltonian H) and some ansatz for the types of projectors to consider. It outputs the following: (1) a family $${\cal{F}}$$ of histories that is (approximately) full and/or partial-trace consistent in the form of projection operators prepared on a quantum computer, (2) the probabilities of the most likely histories $${\cal{Y}}^{\boldsymbol{\alpha }}$$ in $${\cal{F}}$$, and (3) a bound on the consistency parameter ε.

VCH involves a parameter optimization loop, where a quantum computer evaluates a cost function that quantifies the family’s inconsistency, while a classical optimizer adjusts the family (i.e., varies the projector parameters) to reduce the cost. Classical optimizers for VHQCAs are actively being investigated26,33 and one is free to choose the classical optimizer on an empirical basis.

To compute the cost, it is noteworthy that the elements of the decoherence functional form a positive semi-definite matrix with trace one. In VCH, we exploit this property to encode $${\cal{D}}$$ in a quantum state $$\sigma ^{\mathrm{A}}$$, whose matrix elements are $$\langle{\boldsymbol{\alpha }}|\sigma ^{\mathrm{A}}|{\boldsymbol{\alpha }}\prime\rangle = {\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )$$. Step b of Fig. 2 shows a quantum circuit that prepares $$\sigma ^{\mathrm{A}}$$ (see Supplementary Note 2 for more details). This circuit transforms an initial state ρ ⊗ |0〉〈0| on systems SA, where S simulates the physical system of interest and A is an ancilla system, into a state $$\sigma ^{{\mathrm{SA}}}$$ whose marginal is $$\sigma ^{\mathrm{A}}$$. For the full-trace consistency, we introduce a global measure of the (in)consistency that quantifies how far $$\sigma ^{\mathrm{A}}$$ is from being diagonal, which serves as our cost function:

$$C: = \mathop {\sum}\limits_{{\boldsymbol{\alpha }} \ne {\boldsymbol{\alpha }}\prime } {\left| {{\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )} \right|^2} = D_{{\mathrm{HS}}}(\sigma ^{\mathrm{A}},{\cal{Z}}^{\mathrm{A}}(\sigma ^{\mathrm{A}})),$$
(9)

where $$D_{{\mathrm{HS}}}$$ is the Hilbert–Schmidt distance and $${\cal{Z}}^{\mathrm{A}}(\sigma ^{\mathrm{A}})$$ is the dephased (all off-diagonal elements set to zero) version of $$\sigma ^{\mathrm{A}}$$. This quantity goes to zero if and only if $${\cal{F}}$$ is consistent. For the partial-trace case, we arrive at a similar cost function but with $$\sigma ^{\mathrm{A}}$$ replaced by $$\sigma ^{{\mathrm{SA}}}$$:

$$C_{{\mathrm{pt}}}: = \mathop {\sum}\limits_{{\boldsymbol{\alpha }} \ne {\boldsymbol{\alpha }}\prime } {\left\| {{\cal{D}}_{{\mathrm{pt}}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )} \right\|_{{\mathrm{HS}}}^2} = D_{{\mathrm{HS}}}\left( {\sigma ^{{\mathrm{SA}}},{\cal{Z}}^{\mathrm{A}}\left( {\sigma ^{{\mathrm{SA}}}} \right)} \right).$$
(10)

Here, the notation $${\cal{Z}}^{\mathrm{A}}(\sigma ^{{\mathrm{SA}}})$$ indicates that the dephasing operation only acts on system A and the absolute squares of Eq. (9) have been generalized to Hilbert–Schmidt norms, $$\left\| M \right\|_{{\mathrm{HS}}}^2: = {\mathrm{Tr}}(M^\dagger M)$$. In the Methods section, we present quantum circuits that compute these cost functions from two copies of $$\sigma ^{\mathrm{A}}$$ or $$\sigma ^{{\mathrm{SA}}}$$. Derivations of the second equalities in Eqs (9) and (10) can be found in Supplementary Note 3. We remark that alternative cost functions may be useful, e.g., to penalize families $${\cal{F}}$$ with high entropy (see Methods) or to obtain a larger cost gradient by employing local instead of global observables (see ref. 26).
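The encoding of the decoherence functional in an ancilla state, and the full-trace cost of Eq. (9), can be emulated classically for a pure initial state. The sketch below is an assumption-laden illustration rather than the circuit of Fig. 2: it builds the branch vectors obtained by alternately evolving and projecting, stacks them into the joint state with the ancilla label as the first index, and traces out S. The helper names are hypothetical, and $$D_{{\mathrm{HS}}}$$ is taken to be the squared Hilbert–Schmidt norm of the difference, which is what the first equality in Eq. (9) requires.

```python
import itertools
import numpy as np

def history_branches(psi, step_unitaries, projector_sets):
    """Rows are the unnormalized branches P_k U_k ... P_1 U_1 |psi>, one per history.

    step_unitaries[j] propagates from t_{j-1} to t_j (Schrodinger picture).
    """
    labels = list(itertools.product(*[range(len(ps)) for ps in projector_sets]))
    rows = []
    for alpha in labels:
        v = np.asarray(psi, dtype=complex)
        for u, ps, a in zip(step_unitaries, projector_sets, alpha):
            v = ps[a] @ (u @ v)  # evolve to the next time, then project
        rows.append(v)
    return labels, np.array(rows)

def ancilla_state(branches):
    """sigma^A = Tr_S of the joint pure state; its entries are D(alpha, alpha')."""
    return branches @ branches.conj().T

def dephase(sigma):
    """Set all off-diagonal elements to zero."""
    return np.diag(np.diag(sigma))

def hs_distance(a, b):
    """Squared Hilbert-Schmidt norm of a - b (assumed convention for D_HS)."""
    diff = a - b
    return float(np.real(np.trace(diff.conj().T @ diff)))

# Illustrative example: qubit in |0>, no dynamics, X-basis then Z-basis projectors.
x_basis = np.array([[1, 1], [1, -1]]) / np.sqrt(2)
x_set = [np.outer(x_basis[:, i], x_basis[:, i]) for i in range(2)]
z_set = [np.diag([1.0, 0.0]), np.diag([0.0, 1.0])]
labels, branches = history_branches([1.0, 0.0], [np.eye(2), np.eye(2)], [x_set, z_set])

sigma_a = ancilla_state(branches)
cost = hs_distance(sigma_a, dephase(sigma_a))
off_diagonal = sigma_a - dephase(sigma_a)
cost_direct = float(np.sum(np.abs(off_diagonal) ** 2))  # left-hand side of Eq. (9)
```

Both ways of evaluating the cost agree, and `sigma_a` is a valid density matrix, which is what allows it to be prepared as a quantum state.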

The parameter optimization loop results in an approximately consistent family, $${\cal{F}}$$, of histories, where the consistency parameter ε is upper bounded in terms of the final cost (see Methods). In Step c in Fig. 2, we then generate the probabilities for the most likely histories by repeatedly preparing $$\sigma ^{\mathrm{A}}$$ and measuring in the standard basis, where the measurement frequencies give the probabilities (an alternative circuit that reads out any one of the exponentially many elements $${\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )$$ is introduced in Supplementary Note 4). Step e shows how one prepares the set of projection operators for any given history in $${\cal{F}}$$. These projectors can then be characterized with an efficient number of observables (i.e., avoiding full state tomography) to learn important information about the histories.

Let us discuss the scaling of VCH. With the potential exceptions of the Hamiltonian evolution and the projection operators, the complexity of our quantum circuits (i.e., the gate count, circuit depth, and total number of required qubits) scales linearly with both the system size n and the number of times k considered. The complexity of Hamiltonian evolution to some accuracy is problem dependent, but we typically expect polynomial scaling in n for physical systems with properties like translational symmetry34. On the other hand, we consider the circuit depth for preparing the history projectors to be a refinement parameter. One can begin with a short-depth ansatz for the projectors and incrementally increase the depth to refine the ansatz, potentially improving the approximate consistency. We therefore expect the overall scaling of our quantum circuits to be polynomial in n and k for the anticipated use cases of VCH.

The complexity of minimizing our non-convex cost function is unknown, which is typical for VHQCAs. As classical methods for finding consistent families also involve optimizing over some parameterization for the projectors, classical methods also need to deal with this optimization complexity issue.

Although the number of required repetitions of the probability readout step can scale inefficiently in n and k for certain families of histories, we assume that minimizing the cost outputs a family $${\cal{F}}$$ for which the probability readout step is efficient (see Methods for elaboration on this point).

This scaling behavior means that for systems that can be tractably simulated on a quantum computer and whose properties of interest are simple to implement, we achieve an exponential speedup and reduction in the needed resources as compared with classical approaches to this problem.

### Experimental implementations

Spin in a magnetic field. We now present an experimental demonstration of VCH on a cloud quantum computer. See Supplementary Note 5 for further details on this implementation. We examine the two-time histories of a spin-1/2 particle in a magnetic field $$B\hat z$$, whose Hamiltonian is $$H = - \gamma B\sigma _z$$. The histories we consider have a time step Δt between the initial state (chosen to be ρ = | + 〉〈 + |, with $$| + \rangle = 1/\sqrt 2 (|0\rangle + |1\rangle )$$) and first projector, as well as between the first and second projector, chosen so that γBΔt = 2 rad. In addition, we only consider projectors onto the xy plane of the Bloch sphere, parameterized by their azimuth. For this model, Fig. 3 shows the landscape of the cost in Eq. (9) for the ibmqx5 quantum processor35 as well as a simulator whose precision was limited by imposing the same finite statistics as were collected with the quantum processor. Several minima found by running VCH on ibmqx5 are superimposed on the landscape (all points found below a noise threshold were considered to be equally valid minima). As these minima correspond reasonably well to theoretically consistent families, this represents a successful proof-of-principle implementation of VCH.
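A noiseless classical emulation of this single-spin model gives a feel for the cost landscape. The sketch below is not the ibmqx5 experiment or the finite-statistics simulator; it assumes ħ = 1, exact arithmetic, and a simple grid scan in place of the classical optimizer, and all function names are illustrative.

```python
import numpy as np

GAMMA_B_DT = 2.0  # gamma * B * dt in radians, as quoted in the text

def xy_projectors(phi):
    """Projectors onto the equatorial Bloch states at azimuth phi and phi + pi."""
    up = np.array([1.0, np.exp(1j * phi)]) / np.sqrt(2)
    down = np.array([1.0, -np.exp(1j * phi)]) / np.sqrt(2)
    return [np.outer(up, up.conj()), np.outer(down, down.conj())]

def step_unitary(angle=GAMMA_B_DT):
    """U = exp(-i H dt) for H = -gamma B sigma_z (hbar = 1 assumed)."""
    return np.diag([np.exp(1j * angle), np.exp(-1j * angle)])

def cost(phi1, phi2):
    """Full-trace cost of Eq. (9) for two-time histories with azimuths phi1, phi2."""
    psi = np.array([1.0, 1.0], dtype=complex) / np.sqrt(2)  # initial state |+>
    u = step_unitary()
    branches = np.array([p2 @ u @ p1 @ u @ psi
                         for p1 in xy_projectors(phi1)
                         for p2 in xy_projectors(phi2)])
    d = branches @ branches.conj().T  # decoherence functional as a Gram matrix
    off = d - np.diag(np.diag(d))
    return float(np.sum(np.abs(off) ** 2))

# Coarse grid scan over both azimuths (a stand-in for the classical optimizer).
phis = np.linspace(0.0, 2.0 * np.pi, 41, endpoint=False)
landscape = np.array([[cost(a, b) for b in phis] for a in phis])
i_min, j_min = np.unravel_index(np.argmin(landscape), landscape.shape)
best_angles = (phis[i_min], phis[j_min])

# Azimuth of the state after the first time step, from its Bloch vector.
psi1 = step_unitary() @ (np.array([1.0, 1.0]) / np.sqrt(2))
state_azimuth = float(np.angle(psi1[1] / psi1[0]))
```

In this idealized model, choosing the first projector basis along the azimuth of the evolved state (`state_azimuth`) makes the first outcome deterministic, so the cost vanishes for any second azimuth. On hardware, such exact zeros are replaced by minima below a noise threshold, as described above.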

Chiral molecule. To highlight applications that will be possible on future hardware, we now turn to a simulated use of VCH to observe the quantum-to-classical transition for a chiral molecule36,37. The chiral molecule has been modeled as a two-level system where the right |R〉 and left |L〉 chirality states are described as $$|R\rangle /|L\rangle = | + \rangle /| - \rangle = \frac{1}{{\sqrt 2 }}\left( {|0\rangle \pm |1\rangle } \right)$$37. A chiral molecule in isolation would tunnel between |R〉 and |L〉, but we consider the molecule to be in a gas, where collisions with other molecules convey information about the molecule’s chirality to its environment. This information transfer is modeled by a rotation by angle $$\theta _x$$ about the x axis of an environment qubit, controlled on the system’s chirality, and for simplicity we suppose such collisions are evenly spaced at five points in time (see Supplementary Note 5 for further details). We then consider simple families of stationary histories37, where the projector set corresponds to the same basis at all five times (just after a collision occurs). Letting $$\theta _z$$ be the precession angle due to tunneling in the time between collisions, we can then explore the competition between decoherence and tunneling. Figure 4 shows our results for this model. Notably, we observe the transition from a quantum regime, where the chirality is not consistent, to a classical regime, where the chirality is both consistent and stable over time.

## Discussion

We expect VCH to revitalize interest in the CH approach to QM by increasing its practical utility. Making it possible to apply the tools and concepts of quantum foundations to a wide array of physical situations, as VCH will, is an important step for our understanding of the physical world. Specifically, by providing an exponential speedup and reduction in resources over classical methods, VCH will provide a way to study phenomena including the quantum-to-classical transition31,32,38, dynamics of quantum phase transitions39, quantum biological processes40, conformational changes41, and many other complex phenomena that so far have been computationally intractable. In addition, VCH could be applied to study quantum algorithms themselves42. In order to highlight such potential applications and examine their resource requirements, we now focus on two of them: the emergence of classical diffusive dynamics in quantum spin systems and the appearance of defined pathways in protein folding.

In the context of nuclear magnetic resonance (NMR) experiments, it has long been known that systems with many spins obey a classical diffusion equation while smaller spin systems undergo Rabi oscillations. Despite the long history of spin diffusion studies43,44,45, there is still no derivation of the transition from quantum oscillations to classical diffusion that can predict the size of the system where we should find that transition, or the nature of the transition. Applying VCH to the study of histories of spin systems would clarify this point by showing the scale and abruptness with which the diffusive behavior emerges. As spin diffusion has been observed for systems as small as ~30,000 spins46, we estimate that between $$\sim 10^2$$ and $$\sim 10^3$$ qubits would allow us to study this transition. For more details about this application, see Supplementary Note 6.

In the protein-folding community, there are currently two main schools of thought on how proteins fold. The first is that proteins fold along well-determined pathways with discrete folding units (foldons)47, whereas the second is that there should be a funnel in the energy landscape of folding configurations, causing the system to explore a wide range of configurations before settling into the final state48. The deterministic pathways of the foldon model are favored by NMR experiments, raising the question of whether these views can be reconciled47. By providing the means to study the dynamic emergence of classical paths, i.e., the quantum-to-classical transition for proteins, VCH could resolve this discrepancy. For this purpose, we estimate that between $$\sim 10^3$$ and $$\sim 10^4$$ qubits will be needed. See Supplementary Note 6 for more details on this application and resource estimate.

Finally, our work highlights the synergy of two distinct fields, quantum foundations and quantum computational algorithms, and hopefully will inspire further research into their intersection.

## Methods

### Evaluation of the cost

Figure 5 shows the circuits for computing the full-trace cost (partial-trace cost) from two copies of $$\sigma ^{\mathrm{A}}$$ ($$\sigma ^{{\mathrm{SA}}}$$). It is noteworthy that both costs can be written as a difference of purities:

$$C = {\mathrm{Tr}}((\sigma ^{\mathrm{A}})^2) - {\mathrm{Tr}}({\cal{Z}}^{\mathrm{A}}(\sigma ^{\mathrm{A}})^2),$$
(11)

$$C_{{\mathrm{pt}}} = {\mathrm{Tr}}((\sigma ^{{\mathrm{SA}}})^2) - {\mathrm{Tr}}({\cal{Z}}^{\mathrm{A}}(\sigma ^{{\mathrm{SA}}})^2).$$
(12)

The $${\mathrm{Tr}}((\sigma ^{\mathrm{A}})^2)$$ and $${\mathrm{Tr}}((\sigma ^{{\mathrm{SA}}})^2)$$ terms are computed via the Swap Test, with a depth-two circuit and classical post-processing that scales linearly in the number of qubits49,50. A similar but even simpler circuit, called the Diagonalized Inner Product (DIP) Test26, calculates the $${\mathrm{Tr}}({\cal{Z}}^{\mathrm{A}}(\sigma ^{\mathrm{A}})^2)$$ term with a depth-one circuit and no post-processing. Finally, the $${\mathrm{Tr}}({\cal{Z}}^{\mathrm{A}}(\sigma ^{{\mathrm{SA}}})^2)$$ term is evaluated with the Partial-DIP Test26, a depth-two circuit that is a hybridization of the Swap Test and the DIP Test.
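The two purities in Eq. (11) correspond to simple two-copy expectation values, which can be verified numerically. The sketch below checks the underlying identities with dense matrices; it does not emulate the Swap Test or DIP Test circuits themselves. The random test state and the function names are illustrative assumptions.

```python
import numpy as np

def purity_via_swap(sigma):
    """Tr(sigma^2) evaluated as Tr(SWAP (sigma x sigma)), the Swap Test identity."""
    d = sigma.shape[0]
    swap = np.zeros((d * d, d * d))
    for i in range(d):
        for j in range(d):
            swap[i * d + j, j * d + i] = 1.0  # SWAP |i>|j> = |j>|i>
    return float(np.real(np.trace(swap @ np.kron(sigma, sigma))))

def dephased_purity(sigma):
    """Tr(Z(sigma)^2): the chance that standard-basis outcomes on two copies agree."""
    p = np.real(np.diag(sigma))
    return float(np.sum(p ** 2))

def full_trace_cost(sigma):
    """Eq. (11): difference of the two purities."""
    return purity_via_swap(sigma) - dephased_purity(sigma)

# Random 4 x 4 density matrix standing in for sigma^A (two-time qubit histories).
rng = np.random.default_rng(seed=7)
a = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
sigma = a @ a.conj().T
sigma = sigma / np.trace(sigma)

cost_from_purities = full_trace_cost(sigma)
off_diagonal = sigma - np.diag(np.diag(sigma))
cost_from_elements = float(np.sum(np.abs(off_diagonal) ** 2))  # Eq. (9)
```

The agreement between `cost_from_purities` and `cost_from_elements` is the content of Eq. (11) for the full-trace case.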

One does not know a priori how many histories will be characterized in the probability readout step (Fig. 2c). Due to statistical noise, the probabilities of more likely histories will be determined with greater relative precision than those of less likely histories. Hence, it is reasonable to set a precision (or statistical noise) threshold, $$\varepsilon _{{\mathrm{max}}}$$. Let $$N_{{\mathrm{readout}}}$$ be the number of repetitions of the probability readout circuit. Then, histories $${\cal{Y}}^{\boldsymbol{\alpha }}$$ whose bitstring α occurs with frequency $$f_{\boldsymbol{\alpha }} < \sqrt {N_{{\mathrm{readout}}}} /\varepsilon _{{\mathrm{max}}}$$ should be ignored, as their probabilities $$p({\boldsymbol{\alpha }}) = f_{\boldsymbol{\alpha }}/N_{{\mathrm{readout}}}$$ were not characterized with the desired precision. We separate $${\cal{F}}$$ into the set $${\cal{F}}_c$$ of histories whose probabilities are above the precision threshold (which we previously referred to loosely as the most likely histories) and the set of all other histories in $${\cal{F}}$$:

$${\cal{F}} = {\cal{F}}_c \cup \overline {{\cal{F}}_c} .$$
(13)
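The thresholding rule above amounts to a simple filter on the measured counts. The following sketch assumes the readout results are available as a dictionary from bitstrings to counts; the function name `split_family` and the example counts are hypothetical, not measured data.

```python
import math

def split_family(counts, eps_max):
    """Split measured histories into F_c and its complement.

    counts maps each observed bitstring alpha to its frequency f_alpha.
    Histories with f_alpha < sqrt(N_readout) / eps_max are left out of F_c.
    Returns the probabilities of histories in F_c and the total probability
    assigned to the complement (including histories that were never observed).
    """
    n_readout = sum(counts.values())
    threshold = math.sqrt(n_readout) / eps_max
    characterized = {alpha: f / n_readout
                     for alpha, f in counts.items() if f >= threshold}
    residual = 1.0 - sum(characterized.values())
    return characterized, residual

# Hypothetical counts from 10,000 repetitions of the readout circuit.
example_counts = {"00": 9000, "01": 900, "10": 90, "11": 10}
f_c, residual_probability = split_family(example_counts, eps_max=0.5)
```

With these numbers the threshold is 200 counts, so only the two most frequent histories are kept and the remaining probability of 0.01 is assigned to the complement.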

Computational complexity can be hidden in the value of $$N_{{\mathrm{readout}}}$$ needed to obtain a desired precision for the probabilities of histories of interest. This issue is closely connected to the entropy of the set $$\{ {\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }})\}$$, or equivalently, the entropy of the quantum state $${\cal{Z}}^{\mathrm{A}}(\sigma ^{\mathrm{A}})$$. When $${\cal{Z}}^{\mathrm{A}}(\sigma ^{\mathrm{A}})$$ is high entropy, an exponentially large number of histories may have non-zero probability and hence $$N_{{\mathrm{readout}}}$$ would need to grow exponentially. VCH is therefore better suited to applications where there is a small subset of the histories that are far more probable than the rest. In the parameter optimization loop, one can select for families with this property by penalizing families for which $${\cal{Z}}^{\mathrm{A}}(\sigma ^{\mathrm{A}})$$ has high entropy. Specifically, by noting that $$P: = {\mathrm{Tr}}({\cal{Z}}^{\mathrm{A}}(\sigma ^{\mathrm{A}})^2)$$ can be efficiently computed via the circuit in Fig. 5a, one can modify the cost functions in Eqs (9) and (10) to be $$\tilde C = C/P$$ and $$\tilde C_{{\mathrm{pt}}} = C_{{\mathrm{pt}}}/P$$.
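The effect of dividing by the purity P can be seen by comparing two families with the same raw cost but different entropy. The sketch below uses two hand-built 4 × 4 matrices as stand-ins for $$\sigma ^{\mathrm{A}}$$; they are illustrative assumptions, not outputs of VCH.

```python
import numpy as np

def normalized_cost(sigma_a):
    """Return (C / P, C, P), with P the purity of the dephased state."""
    diag = np.real(np.diag(sigma_a))
    purity_dephased = float(np.sum(diag ** 2))                     # P
    raw_cost = float(np.real(np.trace(sigma_a @ sigma_a))) - purity_dephased  # C
    return raw_cost / purity_dephased, raw_cost, purity_dephased

def toy_state(diagonal, coherence):
    """Density matrix with the given diagonal and one pair of off-diagonal entries."""
    sigma = np.diag(np.asarray(diagonal, dtype=float)).astype(complex)
    sigma[0, 1] = sigma[1, 0] = coherence
    return sigma

peaked = toy_state([0.97, 0.01, 0.01, 0.01], coherence=0.005)   # low entropy
uniform = toy_state([0.25, 0.25, 0.25, 0.25], coherence=0.005)  # high entropy

c_tilde_peaked, c_peaked, p_peaked = normalized_cost(peaked)
c_tilde_uniform, c_uniform, p_uniform = normalized_cost(uniform)
```

Both toy families have the same raw cost, but the high-entropy one has a modified cost almost four times larger, so an optimizer minimizing the modified cost would favor the more predictable family.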

We remark that classicality is intimately connected to predictability, with the emergence of classicality linked to the so-called predictability sieve51,52. As the CH formalism is typically used to find classical families, this implies predictable families (i.e., families with low entropy or high purity P) are arguably of the most interest. Hence, our modified cost function $$\tilde C$$ also serves to select those consistent families with histories that are the most predictable and therefore the most classical.

### Approximate consistency

Here we discuss how VCH outputs an upper bound on the consistency parameter ε. Let us first relate the cost C to ε. For any pair of histories $${\cal{Y}}^{\boldsymbol{\alpha }}$$ and $${\cal{Y}}^{{\boldsymbol{\alpha }}\prime }$$ in $${\cal{F}}$$,

$$\left| {{\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )} \right|^2 \le C/2,$$
(14)

which follows from Eq. (9) and the fact that $$|{\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )| = |{\cal{D}}({\boldsymbol{\alpha }}\prime ,{\boldsymbol{\alpha }})|$$. Let us define

$$\varepsilon _{{\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime }: = \sqrt {\frac{C}{{2{\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}){\cal{D}}({\boldsymbol{\alpha }}\prime ,{\boldsymbol{\alpha }}\prime )}}} .$$
(15)

Then it follows from Eq. (14) that

$$|{\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime )|^2 \le \varepsilon _{{\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime }^2{\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }}){\cal{D}}({\boldsymbol{\alpha }}\prime ,{\boldsymbol{\alpha }}\prime ),$$
(16)

which corresponds to the approximate consistency condition from Eq. (6). Hence, probability sum rules for these two histories are satisfied within error $$\varepsilon _{{\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime }$$, which can be calculated from Eq. (15) for histories in $${\cal{F}}_c$$, as the probabilities are known for these histories.

Next, consider histories in $$\overline {{\cal{F}}_c}$$. As we do not have enough information to differentiate these histories, we advocate combining the elements of $$\overline {{\cal{F}}_c}$$ into a single coarse-grained history $${\cal{Y}}^{\boldsymbol{\gamma }}$$.

Let $${\cal{Y}}^{\boldsymbol{\beta }}$$ be the least likely history in $${\cal{F}}_c$$. Then defining $$\delta ^2 = {\cal{D}}({\boldsymbol{\gamma }},{\boldsymbol{\gamma }})/{\cal{D}}({\boldsymbol{\beta }},{\boldsymbol{\beta }})$$, we can make use of the positive semi-definite property of $$\sigma ^{\mathrm{A}}$$ to write:

$$|{\cal{D}}({\boldsymbol{\gamma }},{\boldsymbol{\beta }})|^2 \le {\cal{D}}({\boldsymbol{\gamma }},{\boldsymbol{\gamma }}){\cal{D}}({\boldsymbol{\beta }},{\boldsymbol{\beta }}) = \delta ^2{\cal{D}}({\boldsymbol{\beta }},{\boldsymbol{\beta }})^2.$$
(17)

As $${\cal{Y}}^{\boldsymbol{\beta }}$$ is the least likely history in $${\cal{F}}_c$$, this expression then lets us bound the error on the probability sum rule (giving a weaker approximate consistency condition30) between $${\cal{Y}}^{\boldsymbol{\gamma }}$$ and any $${\cal{Y}}^{\boldsymbol{\alpha }} \in {\cal{F}}_c$$ as:

$$\begin{array}{*{20}{l}} {|{\cal{D}}({\boldsymbol{\gamma }},{\boldsymbol{\alpha }})|} \hfill & \le \hfill & {\delta {\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }})} \hfill \\ {} \hfill & \le \hfill & { \delta ({\cal{D}}({\boldsymbol{\gamma }},{\boldsymbol{\gamma }}) + {\cal{D}}({\boldsymbol{\alpha }},{\boldsymbol{\alpha }})).} \hfill \end{array}$$
(18)

It is then possible to characterize the approximate consistency of the histories of $${\cal{F}}$$ pairwise with $$\varepsilon _{{\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime }$$ and δ. Alternatively, to give an upper bound on the overall consistency ε, we take the greatest of these pairwise bounds:

$$\varepsilon \le {\mathrm{max}}(\{ \varepsilon _{{\boldsymbol{\alpha }},{\boldsymbol{\alpha }}\prime }\} \cup \{ \delta \} ).$$
(19)
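Equations (15)–(19) translate into a few lines of arithmetic once the final cost and the readout probabilities are known. The sketch below is one possible implementation under an explicit simplification: it approximates $${\cal{D}}({\boldsymbol{\gamma }},{\boldsymbol{\gamma }})$$ by the residual probability not assigned to $${\cal{F}}_c$$, thereby neglecting interference among the discarded histories. The function name and the example numbers are hypothetical.

```python
import math

def consistency_bound(cost, probs_c):
    """Upper bound on epsilon from Eq. (19).

    cost    : final value of the full-trace cost C.
    probs_c : probabilities D(alpha, alpha) of the histories in F_c.
    Assumption: D(gamma, gamma) is approximated by 1 - sum(probs_c).
    """
    p = sorted(probs_c)
    d_gamma = max(0.0, 1.0 - sum(p))  # coarse-grained leftover history
    d_beta = p[0]                     # least likely history in F_c
    delta = math.sqrt(d_gamma / d_beta)
    # Eq. (15) is largest for the two least likely histories in F_c.
    eps_pair = math.sqrt(cost / (2.0 * p[0] * p[1])) if len(p) > 1 else 0.0
    return max(eps_pair, delta), eps_pair, delta

# Hypothetical inputs: a small final cost and two well-characterized histories.
eps_bound, eps_pair, delta = consistency_bound(cost=2e-4, probs_c=[0.9, 0.09])
```

In this example the bound is dominated by δ, i.e., by the probability weight left in the coarse-grained history rather than by the residual cost.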

For those applications where we are working with the partial-trace consistency, the notion of approximate consistency is somewhat more obscured. In order to generate probabilities and bound ε, we therefore recommend evaluating the full-trace cost function at the minimum found with the partial-trace cost. This approach is helpful, as any partial-trace consistent family will also be full-trace consistent and the partial-trace consistency does not directly allow one to discuss probabilities in the same way. Taking this approach allows us to then directly utilize the approximate consistency framework above.

## Data Availability

The data used to create the figures in this article are available upon request. Requests should be sent to the corresponding author.

## References

1. 1.

Wheeler, J. A. and Zurek, W. H. (eds). Quantum Theory and Measurement (Princeton Series in Physics) (Princeton Univ. Press, 2016).

2. 2.

Auletta, G. Foundations and Interpretation of Quantum Mechanics (World Scientific, 2000) https://doi.org/10.1142/4194.

3. 3.

Joos, E. & Zeh, H. D. The emergence of classical properties through interaction with the environment. Z. Phys. B Condens. Matter 59, 223–243 (1985).

4. 4.

Zurek, W. H. Decoherence, einselection, and the quantum origins of the classical. Rev. Mod. Phys. 75, 715–775 (2003).

5. 5.

Schlosshauer, M. A. Decoherence: and the Quantum-to-Classical Transition (Springer Science & Business Media, 2007).

6. 6.

Griffiths, R. B. Consistent histories and the interpretation of quantum mechanics. J. Stat. Phys. 36, 219–272 (1984).

7. 7.

Omnès, R. Logical reformulation of quantum mechanics. I. foundations. J. Stat. Phys. 53, 893–932 (1988).

8. 8.

Gell-Mann, M. & Hartle, J. B. Quantum mechanics in the light of quantum cosmology. In Proc. 3rd International Symposium Foundations of Quantum Mechanics in the Light of New Technology (1989).

9. 9.

Hartle, J. & Hertog, T. One bubble to rule them all. Phys. Rev. D. 95, 123502 (2017).

10. 10.

Lloyd, S. Decoherent histories approach to the cosmological measure problem. Preprint at https://arxiv.org/abs/1608.05672 (2016).

11. 11.

Brun, T. A. Quantum jumps as decoherent histories. Phys. Rev. Lett. 78, 1833–1837 (1997).

12. 12.

Halliwell, J. J. & Yearsley, J. M. Quantum arrival time formula from decoherent histories. Phys. Lett. A 374, 154–157 (2009).

13. 13.

Halliwell, J. J. & Yearsley, J. M. Arrival times, complex potentials, and decoherent histories. Phys. Rev. A. 79, 062101 (2009).

14. 14.

Charis Anastopoulos & Savvidou, Ntina Time of arrival and localization of relativistic particles. J. Math. Phys. 60, 032301 (2019).

15. 15.

Brun, T. A. Quasiclassical equations of motion for nonlinear Brownian systems. Phys. Rev. D. 47, 3383–3393 (1993).

16. 16.

Pohle, H.-J. How to calculate decoherence matrices numerically. Phys. A Stat. Mech. its Appl. 213, 435–449 (1995).

17. 17.

Schmidtke, D. & Gemmer, J. Numerical evidence for approximate consistency and markovianity of some quantum histories in a class of finite closed spin systems. Phys. Rev. E 93, 012125 (2016).

18. 18.

Preskill, J. Quantum computing in the NISQ era and beyond. Quantum 2, 79 (2018).

19. 19.

Peruzzo, A. et al. A variational eigenvalue solver on a photonic quantum processor, Nature. Communications 5, 4213 (2014).

20. 20.

Anschuetz, E. R., Olson, J. P., Aspuru-Guzik, A. & Cao, Y. Variational quantum factoring. Preprint at https://arxiv.org/abs/1808.08927 (2018).

21.

Farhi, E., Goldstone, J. & Gutmann, S. A quantum approximate optimization algorithm. Preprint at https://arxiv.org/abs/1411.4028 (2014).

22.

Romero, J., Olson, J. P. & Aspuru-Guzik, A. Quantum autoencoders for efficient compression of quantum data. Quantum Sci. Technol. 2, 045001 (2017).

23.

Li, Y. & Benjamin, S. C. Efficient variational quantum simulator incorporating active error minimization. Phys. Rev. X 7, 021050 (2017).

24.

Johnson, P. D., Romero, J., Olson, J., Cao, Y. & Aspuru-Guzik, A. QVECTOR: an algorithm for device-tailored quantum error correction. Preprint at https://arxiv.org/abs/1711.02249 (2017).

25.

Khatri, S. et al. Quantum-assisted quantum compiling. Quantum 3, 140 (2019).

26.

LaRose, R., Tikku, A., O’Neel-Judy, É., Cincio, L. & Coles, P. J. Variational quantum state diagonalization. npj Quantum Inf. 5, 57 (2019).

27.

Griffiths, R. B. Consistent Quantum Theory (Cambridge Univ. Press, 2001).

28.

Halliwell, J. J. A review of the decoherent histories approach to quantum mechanics. Ann. N. Y. Acad. Sci. 755, 726–740 (1995).

29.

Hohenberg, P. C. Colloquium: an introduction to consistent quantum theory. Rev. Mod. Phys. 82, 2835–2844 (2010).

30.

Dowker, H. F. & Halliwell, J. J. Quantum mechanics of history: the decoherence functional in quantum mechanics. Phys. Rev. D 46, 1580–1609 (1992).

31.

Riedel, C. J., Zurek, W. H. & Zwolak, M. Objective past of a quantum universe: redundant records of consistent histories. Phys. Rev. A 93, 032126 (2016).

32.

Finkelstein, J. Definition of decoherence. Phys. Rev. D 47, 5430–5433 (1993).

33.

McClean, J. R., Romero, J., Babbush, R. & Aspuru-Guzik, A. The theory of variational hybrid quantum-classical algorithms. New J. Phys. 18, 023023 (2016).

34.

Berry, D. W., Childs, A. M., Cleve, R., Kothari, R. & Somma, R. D. Simulating Hamiltonian dynamics with a truncated Taylor series. Phys. Rev. Lett. 114, 090502 (2015).

35.

IBM Q 16 Rueschlikon backend specification, https://github.com/Qiskit/qiskit-backend-information/tree/master/backends/rueschlikon/V1 (2018).

36.

Trost, J. & Hornberger, K. Hund’s paradox and the collisional stabilization of chiral molecules. Phys. Rev. Lett. 103, 023202 (2009).

37.

Coles, P. J., Gheorghiu, V. & Griffiths, R. B. Consistent histories for tunneling molecules subject to collisional decoherence. Phys. Rev. A 86, 042111 (2012).

38.

Paz, J. P. & Zurek, W. H. Environment-induced decoherence, classicality, and consistency of quantum histories. Phys. Rev. D 48, 2728–2738 (1993).

39.

Zurek, W. H., Dorner, U. & Zoller, P. Dynamics of a quantum phase transition. Phys. Rev. Lett. 95, 105701 (2005).

40.

Allegra, M., Giorda, P. & Lloyd, S. Global coherence of quantum evolutions based on decoherent histories: theory and application to photosynthetic quantum energy transport. Phys. Rev. A 93, 042312 (2016).

41.

Liu, H. et al. Quantum mechanics simulation of protein dynamics on long timescale. Proteins 44, 484–489 (2001).

42.

Poulin, D. Classicality of quantum information processing. Phys. Rev. A 65, 042319 (2002).

43.

Bloembergen, N. On the interaction of nuclear spins in a crystalline lattice. Physica 15, 386–426 (1949).

44.

Lowe, I. J. & Gade, S. Density-matrix derivation of the spin-diffusion equation. Phys. Rev. 156, 817–825 (1967).

45.

Dumez, J.-N. Many Body Dynamics in Nuclear Spin Diffusion. Thesis, École normale supérieure de Lyon (ENS Lyon) (2011).

46.

Adachi, S., Kaji, R., Furukawa, S., Yokoyama, Y. & Muto, S. Nuclear spin depolarization via slow spin diffusion in single InAlAs quantum dots observed by using erase-pump-probe technique. J. Appl. Phys. 111, 103531 (2012).

47.

Englander, S. W. & Mayne, L. The case for defined protein folding pathways. Proc. Natl Acad. Sci. USA 114, 8253–8258 (2017).

48.

Eaton, W. A. & Wolynes, P. G. Theory, simulations, and experiments show that proteins fold by multiple pathways. Proc. Natl Acad. Sci. USA 114, E9759–E9760 (2017).

49.

Garcia-Escartin, J. C. & Chamorro-Posada, P. Swap test and Hong-Ou-Mandel effect are equivalent. Phys. Rev. A 87, 052330 (2013).

50.

Cincio, L., Subaşı, Y., Sornborger, A. T. & Coles, P. J. Learning the quantum algorithm for state overlap. New J. Phys. 20, 113022 (2018).

51.

Zurek, W. H. Preferred states, predictability, classicality and the environment-induced decoherence. Prog. Theor. Phys. 89, 281–312 (1993).

52.

Dalvit, D. A. R., Dziarmaga, J. & Zurek, W. H. Predictability sieve, pointer states, and the classicality of quantum trajectories. Phys. Rev. A 72, 062101 (2005).

## Acknowledgements

We thank IBM for the use of their quantum processor. The views expressed in this article are those of the authors and not of IBM. This work was supported by the U.S. Department of Energy (DOE), Office of Science, Office of High Energy Physics, QuantISED program, and also by the U.S. DOE, Office of Science, Basic Energy Sciences, Materials Sciences and Engineering Division, Condensed Matter Theory Program. All authors acknowledge support from the LDRD program at Los Alamos National Laboratory (LANL). L.C. was also supported by the DOE through the J. Robert Oppenheimer fellowship. A.T.S. and P.J.C. additionally acknowledge support from the LANL ASC Beyond Moore’s Law project. Finally, W.H.Z. acknowledges partial support by the Foundational Questions Institute grant FQXi-1821 and Franklin Fetzer Fund, a donor advised fund of the Silicon Valley Community Foundation.

## Author information

All authors contributed to the preparation and revision of the manuscript. P.J.C. invented the algorithm and developed the basic formalism. A.A. designed and carried out the experimental implementations, analyzed the results, and contributed to the formalism. L.C., A.T.S., and W.H.Z. consulted on all stages of the project.

Correspondence to Patrick J. Coles.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Peer review information: Nature Communications thanks Jochen Gemmer and other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Arrasmith, A., Cincio, L., Sornborger, A.T. et al. Variational consistent histories as a hybrid algorithm for quantum foundations. Nat Commun 10, 3438 (2019) doi:10.1038/s41467-019-11417-0