Abstract
Time has a fundamentally different character in quantum mechanics and in general relativity. In quantum theory events unfold in a fixed order while in general relativity temporal order is influenced by the distribution of matter. When matter requires a quantum description, temporal order is expected to become nonclassical—a scenario beyond the scope of current theories. Here we provide a direct description of such a scenario. We consider a thought experiment with a massive body in a spatial superposition and show how it leads to entanglement of temporal orders between timelike events. This entanglement enables accomplishing a task, violation of a Bell inequality, that is impossible under local classical temporal order; it means that temporal order cannot be described by any predefined local variables. A classical notion of a causal structure is therefore untenable in any framework compatible with the basic principles of quantum mechanics and classical general relativity.
Introduction
Quantum mechanics forces us to question the view that physical quantities (such as spin, position or energy) have predefined values: Bell’s theorem shows that if observable quantities were determined by some locally defined classical variables, it would be impossible to accomplish certain tasks, such as the violation of Bell’s inequalities, whereas such tasks are possible according to quantum mechanics^{1,2} and have been realised in experiments^{3,4,5,6}. However, the causal relations between events remain fixed in quantum theory: whether an event A is in the past, in the future, or spacelike separated from another event B is predefined by the location of such events in spacetime^{7,8}. In contrast, in general relativity, spacetime itself is dynamical: the presence of massive objects affects local clocks and thus causal relations between events defined with respect to them. Nonetheless, the dynamical causal structure of general relativity is still classically predefined: the causal relation between any pair of events is uniquely determined by the distribution of matter-energy degrees of freedom (DOFs) in their past light cone. In other words, causal relations are always determined by local classical variables. This picture is expected to change if we consider quantum states of gravitating DOFs: if a massive system is prepared in a superposition of two distinct states, each yielding an observably different causal structure for future events, would it be possible to observe causal relations that display genuine quantum features?
A main obstacle in the analysis of macroscopic superpositions of gravitating bodies is that, in the absence of a classical spacetime manifold, it becomes unclear how to identify spacelike surfaces on which quantum states are defined, or global fields of timelike vectors to define time evolution. Indeed, some models even postulate that such superpositions are simply not valid physical states and must decohere (or collapse) fast enough to preserve a classical description of spacetime and dynamical laws^{9,10,11,12,13}. A very different mindset underlies various quantum gravity frameworks^{14}, where quantum features of the metric and therefore of the causal relations are indeed expected. However, to date, none of the quantum gravity frameworks has been applied to analyse such an epitomic example as superpositions of spacetimes with macroscopically distinct causal structures. Therefore, it is unclear whether there exists any phenomenology unequivocally associated with quantum causal structures, or whether quantum gravity frameworks can circumvent or directly address the objections against superpositions of manifolds. Independently, quantum formalisms have recently been developed to study quantum causal structures at an abstract level in the context of quantum-information processing^{8,15,16}. However, although quantum features of spacetime are among the motivations for these studies, no direct link with quantum gravity has yet been established.
This work provides the first direct analysis of quantum causal relations arising from a spatial superposition of a massive object. We show how the temporal order between timelike events can become superposed or even entangled. We further discuss a thought experiment, an admissible albeit remote physical scenario, where these nonclassical causal relations arise among physical events. In order to prove their nonclassicality, we formulate a Bell-type theorem for temporal order: we define a task that cannot be accomplished if the time order between the events is predetermined by local variables, while the task becomes possible if the events are in a spacetime region affected by the gravitational field of a massive object in an appropriate quantum state. Our approach provides a method to directly describe scenarios so far considered to be out of reach for standard theoretical physics. We show explicitly how to overcome the difficulties with describing superpositions of metrics that motivated collapse models. On the other hand, our result is independent of the high-energy completion of any specific quantum gravity framework: we do not assume any new physics, and the results are based entirely on well-established, low-energy general relativity and on quantum mechanics. Our results are therefore robust against particular mathematical approaches to quantising gravity, thus providing a benchmark for specific frameworks. Furthermore, the time and energy scale at which entangled temporal order arises is closer than the Planck scale typically invoked in this context, and is also far from the scale given by the decoherence models, which therefore do not preclude quantum features of spacetime from arising. Our results thus reveal that both the above approaches are missing crucial intuition and correct physical understanding of the phenomena associated with causal structures at the interface of quantum and gravitational physics. In turn, our work provides a robust method to quantitatively assess these phenomena, helping to build correct physical intuition for quantum causal structures.
Results
Dynamical causal structure in general relativity
In classical general relativity, the causal structure is the structure of light cones of the spacetime metric^{17,18}. As the matter-energy DOFs determine the metric through Einstein’s equations, the causal structure of a region of spacetime is dynamical: it depends on the state of the matter-energy in its past light cone. A major obstacle towards a quantum theory of gravity is that it is not clear how to transpose the mathematical notion of causal relations to scenarios where matter DOFs can be in general quantum states, as such scenarios seem to preclude the use of any underlying spacetime manifold with respect to which events, light cones and causal relations could be defined. To overcome this obstacle, our approach is to start from a physical understanding of events and their causal relations. Even in classical general relativity a physical event cannot be directly identified with a point on a spacetime manifold, a fundamental aspect of the theory captured mathematically by diffeomorphism invariance^{19}. Although it can be debated whether or not spacetime points have an intrinsic physical meaning, a natural way to define diffeomorphism-invariant events is to specify them operationally, relative to physical systems; for example, positions and proper times of physical systems used as clocks^{20}. We adopt this notion of events throughout the work. Causal relations are then understood as the possibility to exchange non-faster-than-light signals (or, more generally, physical systems) between operationally defined events.
The presence of massive bodies generally alters the relative rates at which clocks tick. For example, in a weak-field limit, a clock in a gravitational potential Φ exchanging signals with an identical clock far away from the source of Φ, where the potential effectively vanishes, will appear to tick slower by a factor \(\sqrt {1 + 2\frac{{\mathrm{\Phi }}}{{c^2}}}\). In classical physics, this leads to the well-tested time-dilation^{21,22} and redshift effects^{23}. When the clocks are described as quantum systems, new effects arise from the combination of quantum and general relativistic theories. For a clock in superposition of different distances to the mass, its time-keeping DOFs become entangled with the clock’s position^{24,25,26}. This entanglement implies a universal decoherence mechanism for generic macroscopic systems under time dilation^{27,28}. The regime of low-energy quantum systems in curved spacetime can be described within a framework of general relativistic composite quantum particles^{29}. Here we additionally exploit the fact that only the distance between a clock and a mass has physical significance, and due to the linearity of quantum theory this must also hold for a superposition of different distances. (There is no difference in the relative ticking rates of two clocks whether we think that the clocks are being positioned at different distances, possibly in a superposition, from the mass, or that the mass is positioned at different distances from the clocks^{30}.)
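To give a feeling for the size of the weak-field factor \(\sqrt{1 + 2\Phi/c^2}\), the following minimal sketch evaluates it for a Newtonian potential \(\Phi = -GM/r\). The choice of the Earth as the source, the numerical constants and the function name `dilation_factor` are illustrative assumptions and are not taken from the text.

```python
import math

G = 6.674e-11       # gravitational constant, m^3 kg^-1 s^-2
C = 299_792_458.0   # speed of light, m/s

def dilation_factor(mass_kg: float, r_m: float) -> float:
    """Weak-field factor sqrt(1 + 2*Phi/c^2) with Phi = -G*M/r."""
    phi = -G * mass_kg / r_m
    return math.sqrt(1.0 + 2.0 * phi / C**2)

# Illustrative values (assumptions): Earth's mass and mean radius.
M_EARTH = 5.972e24  # kg
R_EARTH = 6.371e6   # m

factor = dilation_factor(M_EARTH, R_EARTH)
# Time lost by the surface clock per day of far-away time, in seconds.
lag_per_day = (1.0 - factor) * 86_400.0
print(f"factor = {factor:.12f}")
print(f"lag per far-away day = {lag_per_day * 1e6:.1f} microseconds")
```

The factor differs from one by less than a part in a billion, which is why the protocol relies on long accumulation times rather than on large instantaneous rate differences.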
Consider two agents, a and b, with two initially synchronised clocks, each following a fixed world line. A third agent prepares one of two mass configurations, \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\) or \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\), so as to induce time dilation between the clocks of a and b. If configuration \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\) is prepared, event A—defined by the clock of agent a showing proper time t_{a} = τ^{*}—will be in the past light cone of the event B, which is defined in an analogous way: by the clock of agent b showing proper time t_{b} = τ^{*}. If configuration \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\) is prepared, event B will be in the past light cone of event A. To keep the world lines of the agents independent of the mass configuration, their laboratories can be embedded in tight enough trapping potentials, that is, much stronger than the gravitational field (which is feasible since our protocol does not require macroscopic source masses, see Methods). In Supplementary Note 4 we discuss other mass configurations, which have the desired effect on temporal order, but for which the agents a, b can remain inertial.
A possible way to realise configuration \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\) is to place an approximately point-like body of mass M closer to b than to a, see Fig. 1. The light-cone structure of the resulting spacetime is fully determined by the metric tensor g_{μν}, for which we adopt the sign convention (−, +, +, +). In isotropic coordinates in the first-order post-Newtonian expansion the metric components are^{31} \(g_{{\mathrm{00}}}(r) = - \left( {1 + 2\frac{{{\mathrm{\Phi }}(r)}}{{c^2}}} \right)\) and \(g_{ij}(r) = \delta _{ij}\left( {1 + 2\frac{{{\mathrm{\Phi }}(r)}}{{c^2}}} \right)^{ - 1}\), i, j = 1, 2, 3, where \({\mathrm{\Phi }}(r) = - \frac{{GM}}{r}\) is the gravitational potential and r is the spatial distance between the mass and the event where the metric is evaluated. For an event with a spatial coordinate R_{a} and the mass at a spatial coordinate r_{M} (where the spatial coordinates are defined, for example, by a far-away agent as in Fig. 1), we have r ≡ |R_{a} − r_{M}|. Note that we use a common coordinate system to describe the different mass configurations and the associated spacetime metrics. Operationally, we can associate such coordinates with the far-away agent, whose local clocks are not affected by the change in the matter distribution. However, this is only a convenient interpretation: we can always think of the coordinates in analogy to gauge fixing, since any physical prediction regarding proper times of the clocks and exchange of the signals will not depend on the choice of coordinates.
We consider that a and b remain at fixed coordinate distances from the mass, r_{a} and r_{b} = r_{a} − h, respectively, and find the parameters for which event A ends up in the past light cone of B for \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\) (and vice versa for \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\)). An infinitesimal proper time element along a world line at a distance r from the mass is given by \({\mathrm{d}}\tau (r) = \sqrt { - g_{{\mathrm{00}}}(r)} \,{\mathrm{d}}t,\) where t is the coordinate time, and a photon travelling in the radial direction from r_{a} reaches r_{b} after a coordinate time \(T_{c} = \frac{1}{c}\int_{r_{\mathrm{b}}}^{r_{\mathrm{a}}} {\mathrm{d}}r'\sqrt { - \frac{{g_{rr}(r')}}{{g_{{\mathrm{00}}}(r')}}}\). Therefore, if the photon is emitted at the local time t_{a} = τ^{*}, it reaches r_{b} when b’s local time is \(\bar t_{\mathrm{b}} = \sqrt { - g_{{\mathrm{00}}}(r_{\mathrm{b}})} \left( {\frac{{\tau ^ \ast }}{{\sqrt { - g_{{\mathrm{00}}}(r_{\mathrm{a}})} }} + T_{c}} \right)\), assuming that the local clocks are synchronised so that t_{a} = 0 and t_{b} = 0 coincide with the coordinate time t = 0. For
\[\tau ^ \ast \ge \frac{T_{c}}{\frac{1}{\sqrt { - g_{{\mathrm{00}}}(r_{\mathrm{b}})} } - \frac{1}{\sqrt { - g_{{\mathrm{00}}}(r_{\mathrm{a}})} }} \tag{1}\]
we have \(\bar t_{\mathrm{b}} \le \tau ^ \ast\), which means that there is enough time for a not-faster-than-light signal emitted at event A (defined by t_{a} = τ^{*}) to travel the distance h and reach agent b at event B (defined by t_{b} = τ^{*}). This means that event A is in the causal past of event B as required. For example, for \(h \ll r_{\mathrm{a}}\) condition (1) is satisfied for \(\tau ^ \ast > \frac{{2r_{\mathrm{a}}^2c}}{{GM}}\). Configuration \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\) can be arranged analogously, by placing the mass closer to a than to b. Then, the condition \(\tau ^ \ast > \frac{{2r_{\mathrm{b}}^2c}}{{GM}}\), for \(h \ll r_{\mathrm{b}}\), ensures that B is in the causal past of A. Note that with the above conditions on τ^{*} the events A and B are always timelike separated, but have different time orders for the two mass configurations: these conditions guarantee that the time order between A and B is swapped in all reference frames.
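The requirement \(\bar t_{\mathrm{b}} \le \tau^\ast\) can be checked numerically. The sketch below is a minimal illustration under stated assumptions: it uses the metric components quoted above, for which the integrand of \(T_c\) reduces to \(1/(1 - r_s/r)\) with \(r_s = 2GM/c^2\), and it picks a solar-mass source with \(r_{\mathrm{a}} = 10^6\) m and \(h = 10^4\) m purely so that the effect is resolvable in double precision. These numbers and all function names are hypothetical and are not the parameters considered by the authors.

```python
import math

G = 6.674e-11       # gravitational constant, m^3 kg^-1 s^-2
C = 299_792_458.0   # speed of light, m/s

def lapse(mass, r):
    """sqrt(-g_00(r)) = sqrt(1 - 2GM/(r c^2))."""
    return math.sqrt(1.0 - 2.0 * G * mass / (r * C**2))

def light_travel_time(mass, r_b, r_a):
    """Coordinate time T_c of a radial photon between r_b and r_a.

    The integrand 1/(1 - r_s/r) integrates in closed form to
    (r_a - r_b) + r_s * ln((r_a - r_s)/(r_b - r_s)).
    """
    r_s = 2.0 * G * mass / C**2
    return ((r_a - r_b) + r_s * math.log((r_a - r_s) / (r_b - r_s))) / C

def arrival_time_at_b(mass, r_a, r_b, tau_star):
    """Local time of b when a photon emitted at t_a = tau_star arrives."""
    t_emit = tau_star / lapse(mass, r_a)  # coordinate time of emission
    return lapse(mass, r_b) * (t_emit + light_travel_time(mass, r_b, r_a))

def min_tau_star(mass, r_a, r_b):
    """Smallest tau_star for which the arrival time at b is <= tau_star."""
    gain = 1.0 / lapse(mass, r_b) - 1.0 / lapse(mass, r_a)
    return light_travel_time(mass, r_b, r_a) / gain

# Illustrative parameters (assumptions, chosen for numerical convenience).
M = 1.989e30            # kg
R_A = 1.0e6             # m
R_B = R_A - 1.0e4       # m, i.e. h = 10 km

tau_min = min_tau_star(M, R_A, R_B)
sufficient = 2.0 * R_A**2 * C / (G * M)   # bound quoted in the text for h << r_a
print(f"threshold tau* = {tau_min:.3f} s, quoted sufficient bound = {sufficient:.3f} s")
print("A precedes B at 2*threshold:",
      arrival_time_at_b(M, R_A, R_B, 2 * tau_min) <= 2 * tau_min)
```

For these values the threshold lies below the quoted bound \(2r_{\mathrm{a}}^2c/(GM)\), consistent with the latter being a sufficient condition. For laboratory-scale masses the difference of the two lapse factors falls below double precision, so an arbitrary-precision type would be needed there.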
The example above simply illustrates that in general relativity causal structure is dynamical and depends on the stress-energy tensor of the matter DOFs: preparing different matter distributions on a spacelike hypersurface can result in different causal relations between events in its causal future.
Quantum control of temporal order
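When A is in the past light cone of B, a physical system can in principle be transferred from A to B. Consider a quantum system S initially prepared in state \(\left| \psi \right\rangle ^{\mathrm{S}}\), which undergoes a unitary U_{A} at event A (at the spacetime location where the clock of agent a marks proper time τ^{*}) and a unitary U_{B} at event B. Such ordered events can therefore result in the following state of S:
\[\left| {\tilde \psi _1} \right\rangle ^{\mathrm{S}} = U_{\mathrm{B}}U_{\mathrm{A}}\left| \psi \right\rangle ^{\mathrm{S}}. \tag{2}\]
If B is before A, and S is prepared in the same initial state, the final state of S is
\[\left| {\tilde \psi _2} \right\rangle ^{\mathrm{S}} = U_{\mathrm{A}}U_{\mathrm{B}}\left| \psi \right\rangle ^{\mathrm{S}}. \tag{3}\]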
A situation can therefore be arranged such that state (2) is produced for configuration \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\) and (3) is produced for \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\). (We ignore a possible additional time evolution between the two events for simplicity.) Different mass configurations can result in different temporal orders of local operations, which holds in quantum as well as in classical theory. Let us make the following assumptions:

(a) Macroscopically distinguishable states of physical systems can be assigned orthogonal quantum states.
(b) Gravitational time dilation in a classical limit reduces to that predicted by general relativity.
(c) The quantum superposition principle holds (regardless of the mass or nature of the involved system).
Even though the above assumptions hold in the standard quantum and general relativistic frameworks, it is not known if a fundamental theory of quantum gravity satisfies them. Our aim is to investigate their consequences for the notion of temporal order.
The coordinates introduced in the previous section define a foliation of spacetime into equal-time slices. As long as no horizons are present in any of the considered configurations, such slices define spacelike hypersurfaces. With each hypersurface one can associate a Hilbert space, containing the quantum states of interest at the given time. The time coordinate corresponds to the time t in Fig. 1 and is operationally defined as the time measured by the local clock of the far-away agent (not affected by the mass configurations). These quantum states can be understood operationally as states assigned by the far-away agent. However, as discussed in the previous section, such an interpretation is not strictly necessary but is merely a convenient way to define the relevant mathematical objects and to carry out the calculations.
The two mass configurations \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}},{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\) can thus be assigned quantum states \(\left| {{\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}} \right\rangle ^{\mathrm{M}}\), \(\left| {{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}} \right\rangle ^{\mathrm{M}}\). By assumption (a) these states are orthogonal. Since each state individually satisfies the classical limit (the mass is sufficiently localised around a single world line), following assumption (b), the system S will evolve as in Eqs. (2) or (3) depending on whether the mass is in state \(\left| {{\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}} \right\rangle ^{\mathrm{M}}\) or \(\left| {{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}} \right\rangle ^{\mathrm{M}}\), respectively. Finally, by assumption (c), a superposition \(\left| {{\mathrm{K}}_ + } \right\rangle ^{\mathrm{M}}: = \frac{1}{{\sqrt 2 }}\left( {\left| {{\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}} \right\rangle ^{\mathrm{M}} + \left| {{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}} \right\rangle ^{\mathrm{M}}} \right)\) is a physically allowed mass configuration, and will yield the following final state of the joint system:
\[\left| {\psi _{{\mathrm{sup}}}} \right\rangle ^{{\mathrm{MS}}} = \frac{1}{{\sqrt 2 }}\left( {\left| {{\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}} \right\rangle ^{\mathrm{M}}U_{\mathrm{B}}U_{\mathrm{A}}\left| \psi \right\rangle ^{\mathrm{S}} + \left| {{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}} \right\rangle ^{\mathrm{M}}U_{\mathrm{A}}U_{\mathrm{B}}\left| \psi \right\rangle ^{\mathrm{S}}} \right). \tag{4}\]
An explicit calculation showing how this state arises is presented in Methods. We note that not only classical gravity but also semiclassical^{14} and stochastic gravity^{32} theories would not yield Eq. (4) since these frameworks describe gravitational interactions in terms of classical, possibly stochastic, variables, thus violating assumption (c).
Note that, given a specific physical system used as a clock, it is possible to simulate its time dilation using nongravitational interactions. For example, an electric field can shift atomic energy levels and thus “time dilate” a clock based on atomic transitions. Therefore, one can produce a state analogous to (4) without using gravity. However, only gravity can alter the relative ordering of events independently of the nature of the systems and interactions used as clocks, due to the universality of time dilation: the preparation and manipulation of the massive object can be carried out without any knowledge of other aspects of the protocol. Such a universality underpins a fundamental distinction between our gravitational protocol and other, nongravitational, methods to control causal relations between operationally defined events^{33,34,35,36,37,38,39}. (See also Supplementary Note 4 for further discussion.)
Finally, the state (4) is the result of a process wherein the order of operations on a target system (S) is determined by the quantum state of a control system (position of the massive body). Such a process is known as a quantum switch^{15} and has been studied as a possible quantum-information resource^{40,41,42,43,44}. The state \(\left| {\psi _{{\mathrm{sup}}}} \right\rangle ^{{\mathrm{MS}}}\) is a superposition of two amplitudes corresponding to different predefined, classical orders between events A and B. Note that, if the control system is discarded, the reduced state of S is
\[\frac{1}{2}\left( {\left| {\tilde \psi _1} \right\rangle \left\langle {\tilde \psi _1} \right| + \left| {\tilde \psi _2} \right\rangle \left\langle {\tilde \psi _2} \right|} \right), \tag{5}\]
which is indistinguishable from a probabilistic mixture of \(|\tilde \psi _1\rangle\) and \(|\tilde \psi _2\rangle\). The state in Eq. (5) can be interpreted as arising from events A and B with a classical, albeit unknown, temporal order. Therefore, any protocol aimed at operationally testing quantum features of temporal order necessarily requires a measurement of the control system.
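The statement that discarding the control leaves only a classical mixture of the two orders can be illustrated with a small numerical sketch. Everything specific here is an assumption made for illustration: the target S is taken to be a qubit, the control is modelled as a two-level system spanned by the two mass configurations, and the unitaries `U_A` (Pauli Z) and `U_B` are arbitrary example choices, not ones prescribed by the text.

```python
import numpy as np

# Illustrative single-qubit choices (assumptions, not fixed by the text).
psi = np.array([1, 0], dtype=complex)                        # initial state |0>
U_A = np.array([[1, 0], [0, -1]], dtype=complex)             # Pauli Z
U_B = np.array([[1, 1j], [1j, 1]], dtype=complex) / np.sqrt(2)

psi_1 = U_B @ U_A @ psi   # A before B
psi_2 = U_A @ U_B @ psi   # B before A

# Control basis: first vector = mass configuration "A before B",
# second vector = mass configuration "B before A".
k_ab = np.array([1, 0], dtype=complex)
k_ba = np.array([0, 1], dtype=complex)

# Joint control-target state: equal superposition of the two orders.
psi_sup = (np.kron(k_ab, psi_1) + np.kron(k_ba, psi_2)) / np.sqrt(2)

# Reduced state of S: partial trace over the control (first tensor factor).
rho = np.outer(psi_sup, psi_sup.conj()).reshape(2, 2, 2, 2)
rho_S = np.einsum("kikj->ij", rho)

# Equal-weight classical mixture of the two definite-order outcomes.
mixture = 0.5 * (np.outer(psi_1, psi_1.conj()) + np.outer(psi_2, psi_2.conj()))

print("reduced state equals the classical mixture:", np.allclose(rho_S, mixture))
```

Because the two control states are orthogonal, the cross terms vanish under the partial trace, which is why a measurement of the control in a superposition basis is needed to expose the coherence.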
Bell’s theorem for temporal order
The above argument shows that superpositions of massive objects can in principle result in a coherent quantum control of temporal order between events. However, one might question whether such a conclusion has a direct physical meaning or whether it relies on a particular interpretation of state (4). Furthermore, the state assignment is defined in terms of a given coordinate system, while we would like to base our conclusions on coordinate-independent physical events. Since the very meaning of quantum states and measurements might be put into question in the absence of a classical spacetime, a proof of nonclassical causal relations should not rely on the validity of the quantum formalism. In the following we show that it is possible to probe the nature of temporal order irrespective of the validity of quantum theory. We formulate a theory-independent argument, which does not rely on the quantum framework and provides means to exclude the very possibility of explaining data from a hypothetical experiment in terms of a classical temporal order (which can be stochastic and dynamical) within a broad class of probabilistic theories, not limited to quantum mechanics. Our formulation is analogous to Bell’s theorem for local hidden variables^{1,2} (see Methods) and we thus refer to the theorem below as Bell’s theorem for temporal order of events. The core of the argument is simple: given a bipartite system prepared in a separable state, it is not possible to violate any bipartite Bell inequality by performing local operations (transformations and measurements) on the two parts, as long as the local operations are applied in a definite order.
The scenario involves a bipartite system with subsystems S_{1} and S_{2} and a system M that can influence the temporal order of events. For j = 1, 2, each system S_{j} undergoes two transformations, \(T_{{\mathrm{A}}_j}\) and \(T_{{\mathrm{B}}_j}\), at spacetime events A_{j}, B_{j}, respectively. Each system is then measured at an event C_{j} according to some measurement setting i_{j}, producing a measurement outcome o_{j}. Additionally, M is measured at an event D, spacelike separated from both C_{1} and C_{2}, producing an outcome z, see Fig. 2. We now define the notion of classical order between events:
Definition 1: A set of events is classically ordered if, for each pair of events A and B, there exists a spacelike surface and a classical variable λ defined on it that determines the causal relation between A and B: for each given λ, either \({\mathrm{A}}\preccurlyeq {\mathrm{B}}\) (A in the past causal cone of B), \({\mathrm{B}}\preccurlyeq {\mathrm{A}}\) (B in the past causal cone of A) or \({\mathrm{A}}\parallel {\mathrm{B}}\) (A and B spacelike separated).
Classically ordered events do not necessarily form a partially ordered set: classical order can be dynamical (the order between two events can depend on some operation performed in the past, i.e. some agent can prepare λ) and stochastic (λ might be distributed according to some probability, and not specified deterministically)^{45,46}.
Bell’s theorem for temporal order. No states, set of transformations and measurements which obey assumptions 1–5 below can result in a violation of the Bell inequalities:

1. Local state: The initial state ω of S_{1}, S_{2} and M is separable (as defined in Methods).

2. Local operations: All transformations performed on the systems are local (as defined in Methods).

3. Classical order: The events at which operations (transformations and measurements) are performed are classically ordered.

4. Spacelike separation: Events (A_{1}, B_{1}) are spacelike separated from events (A_{2}, B_{2}); C_{1}, C_{2}, and D are pairwise spacelike separated.

5. Free choice: The measurement choices in the Bell measurement are independent of the rest of the experiment. (This is a standard assumption necessary in Bell-like theorems.)
More formally, let us denote by \({\Bbb T} = (T_{{\mathrm{A}}_{\mathrm{1}}},T_{{\mathrm{B}}_{\mathrm{1}}},T_{{\mathrm{A}}_{\mathrm{2}}},T_{{\mathrm{B}}_{\mathrm{2}}})\) the set of all local transformations irrespective of their order. The thesis of the theorem can be rephrased as: the conditional probability
\[P(o_1,o_2 \mid i_1,i_2,z,{\Bbb T},\omega ) \tag{6}\]
produced under assumptions 1–5 does not violate Bell’s inequalities for any value of z. The proof of the theorem is presented in Methods.
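The kind of bound at stake can be made concrete with the CHSH inequality. The sketch below is not the proof given in Methods; it is the standard textbook argument, under the assumption that for each value of the classical variable λ (which may also fix the order of the local operations) the outcomes o_{1}, o_{2} = ±1 are determined by the local settings i_{1}, i_{2} alone. Enumerating all such deterministic assignments, and then mixing them with arbitrary weights to model a stochastic λ, never exceeds the value 2. The function and variable names are illustrative.

```python
import random
from itertools import product

def chsh(a, b):
    """CHSH expression for deterministic outcomes a[i1], b[i2] in {+1, -1}."""
    return a[0] * b[0] + a[0] * b[1] + a[1] * b[0] - a[1] * b[1]

# All deterministic local response functions: one outcome per setting.
strategies = list(product([+1, -1], repeat=2))

# One CHSH value per pair of local strategies (per value of lambda).
values = [chsh(a, b) for a in strategies for b in strategies]
classical_bound = max(abs(v) for v in values)

# A stochastic lambda is a probability distribution over these strategies;
# the resulting CHSH value is a convex combination of the entries above.
random.seed(0)
weights = [random.random() for _ in values]
mixed = sum(w * v for w, v in zip(weights, values)) / sum(weights)

print("largest deterministic |CHSH| value:", classical_bound)
print("example mixture stays within the bound:", abs(mixed) <= classical_bound)
```

Any observed value above this bound, conditioned on z, therefore cannot be reproduced by such locally predetermined assignments.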
Violation of Bell inequalities for temporal order
Here we show how the gravitational quantum control of temporal order from the first section can result in events whose temporal order is entangled: a bipartite quantum system, initially in a product state \(\left| {\psi _1} \right\rangle ^{{\mathrm{S}}_{\mathrm{1}}}\left| {\psi _2} \right\rangle ^{{\mathrm{S}}_{\mathrm{2}}}\), is sent to two different regions of space such that a_{1}, b_{1} and c_{1} only interact with S_{1}, while a_{2}, b_{2} and c_{2} only interact with S_{2}. Agents a_{1}, a_{2} perform, respectively, the unitaries \(U_{{\mathrm{A}}_{\mathrm{1}}}\), \(U_{{\mathrm{A}}_{\mathrm{2}}}\) at the events A_{1}, A_{2}, while agents b_{1}, b_{2} perform the unitaries \(U_{{\mathrm{B}}_{\mathrm{1}}}\), \(U_{{\mathrm{B}}_{\mathrm{2}}}\) at the events B_{1}, B_{2}. Finally, c_{1} and c_{2} measure S_{1} and S_{2} at events C_{1} and C_{2}, respectively, see Fig. 3. Assume that a massive system can be prepared in two configurations, \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\) and \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\), such that \({\mathrm{A}}_{\mathrm{1}} \prec {\mathrm{B}}_{\mathrm{1}} \prec {\mathrm{C}}_{\mathrm{1}}\) (A_{1} in the past light cone of B_{1}, etc.) and \({\mathrm{A}}_{\mathrm{2}} \prec {\mathrm{B}}_{\mathrm{2}} \prec {\mathrm{C}}_{\mathrm{2}}\) for \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\), while \({\mathrm{B}}_{\mathrm{1}} \prec {\mathrm{A}}_{\mathrm{1}} \prec {\mathrm{C}}_{\mathrm{1}}\) and \({\mathrm{B}}_{\mathrm{2}} \prec {\mathrm{A}}_{\mathrm{2}} \prec {\mathrm{C}}_{\mathrm{2}}\) for \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\), and such that the events are spacelike separated as per assumption 4, which can always be achieved by having the groups sufficiently separated.
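If the mass is prepared in superposition \(\frac{1}{{\sqrt 2 }}\left( {\left| {{\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}} \right\rangle ^{\mathrm{M}} + \left| {{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}} \right\rangle ^{\mathrm{M}}} \right)\), the joint state of the mass and the systems after the application of the unitaries is
\[\frac{1}{{\sqrt 2 }}\left( {\left| {{\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}} \right\rangle ^{\mathrm{M}}U_{{\mathrm{B}}_{\mathrm{1}}}U_{{\mathrm{A}}_{\mathrm{1}}}\left| {\psi _1} \right\rangle ^{{\mathrm{S}}_{\mathrm{1}}}U_{{\mathrm{B}}_{\mathrm{2}}}U_{{\mathrm{A}}_{\mathrm{2}}}\left| {\psi _2} \right\rangle ^{{\mathrm{S}}_{\mathrm{2}}} + \left| {{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}} \right\rangle ^{\mathrm{M}}U_{{\mathrm{A}}_{\mathrm{1}}}U_{{\mathrm{B}}_{\mathrm{1}}}\left| {\psi _1} \right\rangle ^{{\mathrm{S}}_{\mathrm{1}}}U_{{\mathrm{A}}_{\mathrm{2}}}U_{{\mathrm{B}}_{\mathrm{2}}}\left| {\psi _2} \right\rangle ^{{\mathrm{S}}_{\mathrm{2}}}} \right). \tag{7}\]
Agent d at the event D measures the mass in the superposition basis \(| \pm \rangle = \frac{1}{{\sqrt 2 }}\left( {|{\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\rangle \pm |{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\rangle } \right)\). Conditioned on the outcome, the joint state of S_{1} and S_{2} reads
\[\frac{1}{{\sqrt 2 }}\left( {U_{{\mathrm{B}}_{\mathrm{1}}}U_{{\mathrm{A}}_{\mathrm{1}}}\left| {\psi _1} \right\rangle ^{{\mathrm{S}}_{\mathrm{1}}}U_{{\mathrm{B}}_{\mathrm{2}}}U_{{\mathrm{A}}_{\mathrm{2}}}\left| {\psi _2} \right\rangle ^{{\mathrm{S}}_{\mathrm{2}}} \pm U_{{\mathrm{A}}_{\mathrm{1}}}U_{{\mathrm{B}}_{\mathrm{1}}}\left| {\psi _1} \right\rangle ^{{\mathrm{S}}_{\mathrm{1}}}U_{{\mathrm{A}}_{\mathrm{2}}}U_{{\mathrm{B}}_{\mathrm{2}}}\left| {\psi _2} \right\rangle ^{{\mathrm{S}}_{\mathrm{2}}}} \right). \tag{8}\]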
If the states \(U_{{\mathrm{B}}_{\mathrm{1}}}U_{{\mathrm{A}}_{\mathrm{1}}}\left| {\psi _1} \right\rangle ^{{\mathrm{S}}_{\mathrm{1}}}\), \(U_{{\mathrm{B}}_{\mathrm{2}}}U_{{\mathrm{A}}_{\mathrm{2}}}\left| {\psi _2} \right\rangle ^{{\mathrm{S}}_{\mathrm{2}}}\) are orthogonal to \(U_{{\mathrm{A}}_{\mathrm{1}}}U_{{\mathrm{B}}_{\mathrm{1}}}\left| {\psi _1} \right\rangle ^{{\mathrm{S}}_{\mathrm{1}}}\), \(U_{{\mathrm{A}}_{\mathrm{2}}}U_{{\mathrm{B}}_{\mathrm{2}}}\left| {\psi _2} \right\rangle ^{{\mathrm{S}}_{\mathrm{2}}}\), respectively, then the state (8) is maximally entangled. Local measurements can thus be performed on subsystems S_{1}, S_{2} whose outcomes will violate Bell inequalities, conditioned on the measurement outcome at D (see Supplementary Note 2 for an example).
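A minimal numerical sketch of this step is given below. It is not the example of Supplementary Note 2: the qubit systems, the initial state |0⟩ and the unitaries `U_A` (Pauli Z) and `U_B` are illustrative assumptions, chosen only because they make the two orderings orthogonal, and the same choices are used for both subsystems. The maximal CHSH value of the conditional two-qubit state is then evaluated with the standard Horodecki criterion, \(2\sqrt{u_1 + u_2}\), where \(u_1, u_2\) are the two largest eigenvalues of \(T^{\mathrm{T}}T\) and \(T_{ij} = \mathrm{Tr}[\rho\,\sigma_i \otimes \sigma_j]\).

```python
import numpy as np

# Pauli matrices.
X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)

# Illustrative choices (assumptions), identical for both subsystems.
psi0 = np.array([1, 0], dtype=complex)
U_A = Z
U_B = (np.eye(2) + 1j * X) / np.sqrt(2)

fwd = U_B @ U_A @ psi0   # order A before B
rev = U_A @ U_B @ psi0   # order B before A
overlap = abs(np.vdot(fwd, rev))   # vanishes for these choices

# Conditional state of S1 and S2 for outcome "+" at D.
state_plus = (np.kron(fwd, fwd) + np.kron(rev, rev)) / np.sqrt(2)
rho = np.outer(state_plus, state_plus.conj())

# Correlation matrix T_ij = Tr[rho (sigma_i x sigma_j)].
paulis = [X, Y, Z]
T = np.array([[np.trace(rho @ np.kron(p, q)).real for q in paulis]
              for p in paulis])

# Horodecki criterion: maximal CHSH value over all local measurements.
eigs = np.sort(np.linalg.eigvalsh(T.T @ T))
chsh_max = 2.0 * np.sqrt(eigs[-1] + eigs[-2])

print(f"overlap of the two orderings: {overlap:.2e}")
print(f"maximal CHSH value: {chsh_max:.6f} (classical bound: 2)")
```

With orthogonal orderings the conditional state is maximally entangled, so the criterion returns the Tsirelson value \(2\sqrt 2\), above the bound of 2 obeyed under classical order.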
The above thought experiment can in principle be realised in a scenario where it is meaningful to argue that assumptions 1, 2, 4 and 5 are satisfied. Violation of the Bell inequality would then imply that assumption 3 does not hold, proving nonclassicality of temporal order. In order to maximally violate the inequality, the time-dilated clocks of the agents need to decorrelate from the systems S_{i}. In the Methods section we present a particular scenario using photons that also satisfies this requirement. In Supplementary Note 3 we present two concrete examples of our thought experiment, using as the systems S_{i} polarisation states of photons, depicted in Supplementary Fig. 1, or spatial modes of a quantum field, depicted in Supplementary Fig. 2.
Discussion
The nonclassical causal structures discussed in this work arise in a semiclassical, albeit nonperturbative, regime where no explicit quantisation of the gravitational field is needed (which is complementary to the regime of most quantum gravity frameworks^{14}). Our approach shows that general relativity and standard quantum mechanics are sufficient to analyse scenarios involving superpositions of macroscopically different classical backgrounds. Not only is there no tension between the two frameworks, but there is also no ambiguity in the prediction of physical effects that arise: for each probability amplitude, the time-dilation effects introduced by the mass can be treated classically. The considered processes involve a simple superposition of such amplitudes and the final probability amplitude is given by the usual Feynman sum. Note that, even though no explicit quantisation of the metric is used, the amplitudes in the Feynman sum do correspond to macroscopically distinct spacetime metrics: this is because each of these amplitudes contains a different causal structure, which determines the metric up to a conformal factor^{17,18}. Quantisation of the metric is therefore implicit in our result, in a similar way as in recently considered witnesses for quantum gravity in interferometric scenarios^{47,48,49}.
A practical realisation of the Bell test for time order would be extremely challenging, even in light of current efforts to prepare superposition states of massive objects and test their gravitational interactions^{50,51,52,53,54}. However, there would be far-reaching consequences if such a test were fundamentally impossible: this would imply that time order, and thus time itself, can be described with a classical parameter even in spacetimes originating from a quantum state of a massive object, with no need to invoke any other mechanism, such as those of refs. ^{9,10,11,12,13}, that would decohere these states (see also Supplementary Note 5 for further discussion). On the other hand, since these mechanisms postulate a specific decoherence time of spatial superpositions, one could think that they preclude the preparation of nonclassical causal structures. This is not the case: the time required to complete our protocol can be shorter than the decoherence time postulated by these models (see Methods). Thus, contrary to some motivations^{11,13}, these models do not enforce a fundamentally classical spacetime with a fixed causal structure (i.e. there is a parameter regime where entangled causal structures could form but the decoherence postulated by these models is negligible). Finally, classical temporal order also could not be excluded in a scenario where massive bodies can be prepared in quantum states but one (or more) of the assumptions 1, 2, 4, and 5 cannot be satisfied for some fundamental reason. We note that in particular the notion of locality may be fundamentally limited in the context of quantum gravity^{55,56}.
We should note that proof-of-principle realisations of indefinite causal order, analogous to the examples discussed here, have been demonstrated in the laboratory. However, such realisations cannot be interpreted as proofs of nonclassical spacetime in the sense of general relativity; see Supplementary Note 4 for a discussion of the key differences between the gravitational and other methods for a quantum control of temporal order. The full extent of the relation between gravitational and nongravitational realisations of quantum causal structures merits an in-depth study of its own.
A crucial aspect of Bell’s theorem for temporal order is that it provides a theory-independent result: it applies to any framework where causal relations are described classically, such as classical, semiclassical^{14} and stochastic gravity^{32} theories. Moreover, the joint validity of the quantum superposition principle and gravitational time dilation, assumptions (a)–(c), suffices for a maximal possible violation of the bound. Therefore, a classical notion of temporal order is untenable in any theory compatible with these basic principles. Finally, the way in which a nonclassical causal structure can be engineered by exploiting time dilation from a massive body in a quantum state reveals a close connection between the information-theoretic framework of quantum combs/process matrices and joint effects of quantum mechanics and general relativity.
Methods
Quantum gravitational control of temporal order
According to the Einstein equations, a massive object gives rise to a spacetime metric g_{μν}, μ, ν = 0, ..., 3, which in isotropic coordinates and a post-Newtonian expansion reads^{31}: \(g_{00}(r) = -\left(1 + 2\frac{\Phi(r)}{c^2}\right)\), \(g_{ij}(r) = \delta_{ij}\left(1 - 2\frac{\Phi(r)}{c^2}\right)\), i, j = 1, 2, 3, where r denotes the distance to the location of the mass. In other words, if a test mass or a clock is positioned at a spatial coordinate R_{a} as described by a faraway agent (as in Fig. 1) and the massive object is at a coordinate r_{M}, then r = |R_{a} − r_{M}|, which for clarity we denote below by R_{a} − r_{M}. It is important to note that the same coordinates describe scenarios where the mass is placed at different locations at a finite distance from r_{M}, as long as it remains far from an asymptotic region so that the spatial and temporal coordinates of the faraway agent remain unaffected (i.e. are those of flat Minkowski spacetime). In these coordinates, the Hamiltonian of a clock (a particle with internal DOFs) reads
(see e.g. refs. ^{57,58,59}) where P^{i}, i = 1, 2, 3 are the components of the momentum operator, and Ω_{a} is the internal Hamiltonian, describing the local time evolution of the internal DOFs. Note that we can restrict ourselves to an effectively one-dimensional scenario, so only one of the spatial coordinates has been kept in the above expression. In the first post-Newtonian expansion and considering that both the clock and the mass follow fixed world lines at constant R_{a} and r_{M}, respectively, Eq. (9) becomes \(H_{\mathrm{a}} \approx \Omega_{\mathrm{a}}\left(1 + \frac{\Phi(R_{\mathrm{a}} - r_M)}{c^2}\right)\) (Eq. (10)).
The asymptotic time coordinate t defines spacelike hypersurfaces that are independent of the location of the mass and on which one can define states of all the involved systems (the clocks, the target systems and the mass itself), and Hamiltonian (10) describes their time evolution with respect to t. Due to the interactions between the mass and the clocks, effected by the spacetime metric, which contains the potential Φ(R_{a} − r_{M}), the time evolution of the clocks depends on their relative distance R_{a} − r_{M} to the mass. Crucially, by the definition of t and the Hamiltonian, our description covers both types of mass configuration considered: the mass can be semiclassically localised around a single spatial coordinate r or in a superposition of different spatial coordinates, and the associated states belong to the same Hilbert space associated with a spacelike hypersurface labelled by t. We thus have all the tools to analyse time evolution in the presence of a superposition state of the mass, even though it leads to a quantifiably nonclassical causal structure.
With respect to t and the associated foliation of spacetime, the evolution of the clock, which at t = 0 is in an internal state |s_{a}(τ_{0})〉, where τ_{0} denotes the clock’s proper time at t = 0, reads \(e^{-iH_{\mathrm{a}}t}|s_{\mathrm{a}}(\tau_0)\rangle = |s_{\mathrm{a}}(\tau_0 + \tau(R_{\mathrm{a}} - r_M,t))\rangle\) (Eq. (11)),
where \(\tau(R_{\mathrm{a}} - r_M,t) := t\left(1 + \frac{\Phi(R_{\mathrm{a}} - r_M)}{c^2}\right)\) is the proper time elapsing for the clock at a radial distance R_{a} − r_{M} from the mass when the elapsed coordinate time is t; and for clarity we set ħ = 1.
Before continuing on to the gravitational quantum control, we give an example of an internal Hamiltonian, state, and evolution. Let us take Ω_{a} = E_{0}|0〉〈0| + E_{1}|1〉〈1| and \(|s_{\mathrm{a}}(\tau_0 = 0)\rangle = \frac{1}{\sqrt 2}(|0\rangle + |1\rangle)\), which describe, for example, an atom in an equal superposition of two electronic energy levels |0〉, |1〉 with energies E_{0}, E_{1}, respectively. Under H_{a} from Eq. (10), the internal state |s_{a}(0)〉 from Eq. (11) evolves as \(|s_{\mathrm{a}}(0)\rangle \mapsto \frac{1}{\sqrt 2}\left(e^{-iE_0\tau(R_{\mathrm{a}} - r_M,t)}|0\rangle + e^{-iE_1\tau(R_{\mathrm{a}} - r_M,t)}|1\rangle\right)\) (Eq. (12)),
which is simply |s_{a}(τ(R_{a} − r_{M}, t))〉.
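The proper-time relation and the two-level clock example above can be checked numerically. The following is a minimal sketch, assuming the Newtonian form Φ(r) = −GM/r for the potential; the function names and the parameter values are hypothetical and chosen only for illustration, with ħ = 1 as in the text.

```python
import numpy as np

G = 6.674e-11   # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8     # speed of light, m/s

def potential(r, mass):
    """Newtonian potential Phi(r) = -G M / r (assumed form of Phi)."""
    return -G * mass / r

def proper_time_shift(t, r, mass):
    """tau(r, t) - t = t * Phi(r) / c^2, to first post-Newtonian order.

    The shift is returned separately because it is far below double
    precision relative to t for laboratory-scale masses.
    """
    return t * potential(r, mass) / C**2

def clock_state(tau, e0, e1):
    """Balanced two-level clock after proper time tau (hbar = 1)."""
    return np.array([np.exp(-1j * e0 * tau), np.exp(-1j * e1 * tau)]) / np.sqrt(2)

# Hypothetical values: a 1e-7 kg mass, clocks at 1 um and 2 um, t = 1 s.
mass, t = 1e-7, 1.0
shift_near = proper_time_shift(t, 1e-6, mass)
shift_far = proper_time_shift(t, 2e-6, mass)
lag = shift_far - shift_near      # positive: the nearer clock runs slower

# Clock state at the nearer position (energies in arbitrary units).
state = clock_state(t + shift_near, e0=0.0, e1=1.0)
```

The sign of `lag` is the point of the example: the clock closer to the mass accumulates less proper time, which is the effect that the protocol below uses to control the order of events.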
We now use the above to show how the quantum superposition principle and general relativity lead to the prediction that quantised matter acts as a quantum control of temporal order. To this end, we assume conditions (a)–(c) from the Results section and consider two clocks positioned at R_{a} and R_{b}, respectively. The Hamiltonian of clock a is thus Eq. (10) and fully analogously for b, \(H_{\mathrm{b}} \approx \Omega_{\mathrm{b}}\left(1 + \frac{\Phi(R_{\mathrm{b}} - r_M)}{c^2}\right)\). The clocks are initially synchronised with each other and with a clock of the distant agent so that at t_{0} = 0 both clocks are at τ_{0} = 0. We further consider a target system, for example, a mode of the electromagnetic field, initially in a state |ψ〉^{S}, on which an operation \({\cal{O}}_{\mathrm{A}}\) is performed at an event A = (R_{a}, τ_{a} = τ^{*}) and an operation \({\cal{O}}_{\mathrm{B}}\) at an event B = (R_{b}, τ_{b} = τ^{*}), where τ_{a}, τ_{b} refer to the proper times of the clocks a, b, respectively. We effectively represent these operations as \({\cal{O}}_{\mathrm{A}} = \delta(\tau_{\mathrm{a}} - \tau^\ast, r - R_{\mathrm{a}}){\mathrm{O}}_{\mathrm{A}}\), where δ(τ_{a} − τ^{*}, r − R_{a}) is a Dirac delta distribution and O_{A} is an operator (e.g. describing rotation of the polarisation of an electromagnetic field mode by a particular half-wave plate) independent of time and location. The total Hamiltonian reads
which for simplicity assumes trivial time evolution of the mass and of the target system between the application of the operations. We furthermore consider the following initial (at t_{0} = 0) state of the mass, clocks and the target system:
where positions r_{L}, r_{R} of the mass refer to the configurations in the left and the right panel of Fig. 1, respectively, that is, they realise configurations \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\) and \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\): for |r_{L}〉 the mass is at a distance r_{a} = r_{L} − R_{a} from clock a and at r_{b} = r_{a} − h from b, while for |r_{R}〉 the relative distances are swapped and the mass is at a distance r_{a} − h from a and at r_{a} from b. After coordinate time t such that τ(r_{a}, t) > τ^{*} (where \(\tau ^ \ast > \frac{{2r_{\mathrm{b}}^2c}}{{GM}}\), see main text) the state evolves to
The order of applying unitary transformations \(U_{\mathrm{A}} = e^{-i{\mathrm{O}}_{\mathrm{A}}}\) and \(U_{\mathrm{B}} = e^{-i{\mathrm{O}}_{\mathrm{B}}}\) to the target system is controlled by the position of the mass, which due to time dilation changes causal relations between events A and B. Swapping the mass distribution, |r_{L}〉 → |r_{R}〉, |r_{R}〉 → |r_{L}〉, and letting the state evolve for another time interval t results in the final state where the clocks become synchronised again
where τ_{f} = τ(r_{a}, t) + τ(r_{a} − h, t). Measuring the mass in a superposition basis |r_{L}〉^{M} ± |r_{R}〉^{M} prepares the target system in the corresponding superposition state U_{B}U_{A}|ψ〉^{S} ± U_{A}U_{B}|ψ〉^{S}.
The above example demonstrates that under very conservative assumptions a spatial superposition of a mass generates a quantum-controlled application of unitary operations. More fundamentally, this effect stems from the superposition of different causal structures associated with the superposed states of the mass.
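The quantum-controlled application of the unitaries can be sketched with small matrices. In the sketch below the mass is modelled as a two-level control system and the target as a qubit; the specific unitaries U_A, U_B and the target state are assumptions chosen for illustration, and the clock states are left out because they factor out once the clocks are synchronised again.

```python
import numpy as np

# Assumed example unitaries on a qubit target (not fixed by the text above).
U_A = np.array([[1, 0], [0, -1]], dtype=complex)                 # sigma_z
U_B = np.array([[1, 1j], [1j, 1]], dtype=complex) / np.sqrt(2)   # (1 + i sigma_x)/sqrt(2)
psi = np.array([1, 0], dtype=complex)                            # target state |0>

# Mass at r_L realises A before B (U_B U_A acts on the target);
# mass at r_R realises B before A (U_A U_B acts on the target).
branch_L = U_B @ U_A @ psi
branch_R = U_A @ U_B @ psi

# Joint state of (mass, target) with the mass in (|r_L> + |r_R>)/sqrt(2).
r_L = np.array([1, 0], dtype=complex)
r_R = np.array([0, 1], dtype=complex)
joint = (np.kron(r_L, branch_L) + np.kron(r_R, branch_R)) / np.sqrt(2)

def project_mass(joint_state, sign):
    """Project the mass onto (|r_L> + sign |r_R>)/sqrt(2).

    Returns the unnormalised conditional state of the target.
    """
    bra = (r_L + sign * r_R).conj() / np.sqrt(2)
    return bra @ joint_state.reshape(2, 2)

target_plus = project_mass(joint, +1)    # proportional to (U_B U_A + U_A U_B)|psi>
target_minus = project_mass(joint, -1)   # proportional to (U_B U_A - U_A U_B)|psi>
```

With this particular choice the two branches are orthogonal, so the mass and the target are maximally entangled before the measurement of the mass; other choices of unitaries give only partial distinguishability of the two orders.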
Proof of Bell’s theorem for temporal order
Bell’s theorem in general asserts that, under certain assumptions, the correlations between the outcomes of independent measurements on two subsystems must satisfy a class of inequalities. The two measuring parties are referred to as Alice and Bob. In every experimental run, each of them measures one of two properties of the subsystem they receive. For each of the properties, one of two outcomes is obtained, for convenience chosen to be ±1. Bell’s inequalities follow from the conjunction of the following assumptions: (1) measurement results are determined by properties that exist prior to and independent of the experiment (hidden variables); (2) results obtained at one location are independent of any measurements or actions performed at spacelike separation (locality); (3) any process that leads to the choice of which measurement will be carried out is independent from other processes in the experiment (free choice). The outcomes of Alice, A(i, λ), and of Bob, B(j, λ), thus only depend on their own choice of setting, index i or j, and on the property of the system, variable λ. The correlation between outcomes A(i, λ) and B(j, λ) for the measurement choices i, j is described by \(E(A_i,B_j) = {\int} {\mathrm{d}}\lambda P(\lambda )A(i,\lambda )B(j,\lambda )\), where P(λ) is the probability distribution over the properties of the systems. It is straightforward to check that one possible inequality satisfied by the correlations E(A_{i}, B_{j}) is the so-called Clauser–Horne–Shimony–Holt inequality: E(A_{1}, B_{1}) + E(A_{1}, B_{2}) + E(A_{2}, B_{1}) − E(A_{2}, B_{2}) ≤ 2. Crucially, quantum theory allows for the left-hand side of this inequality to reach a value >2, and experimental measurements of this (and other inequalities) have confirmed such a violation^{3,4,5,6}. The significance of the violations of Bell’s inequalities is in showing that neither nature nor quantum mechanics obeys all three assumptions mentioned above.
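The bound of 2 and its quantum violation can be verified directly. The sketch below enumerates all local deterministic assignments of outcomes and then evaluates the same combination for a singlet state of two qubits; the choice of state and of measurement angles is an assumption made for illustration (it is a textbook setting, not one taken from this paper).

```python
import itertools
import numpy as np

def chsh(E):
    """CHSH combination for a 2x2 table of correlations E[i][j]."""
    return E[0][0] + E[0][1] + E[1][0] - E[1][1]

# Local deterministic strategies: outcomes A(i), B(j) in {+1, -1} fixed in advance.
classical_max = max(
    chsh([[a[i] * b[j] for j in range(2)] for i in range(2)])
    for a in itertools.product([1, -1], repeat=2)
    for b in itertools.product([1, -1], repeat=2)
)

# Quantum example: singlet state, spin measurements in the x-z plane.
sx = np.array([[0, 1], [1, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)
singlet = np.array([0, 1, -1, 0], dtype=complex) / np.sqrt(2)

def spin(theta):
    """Spin observable along an axis at angle theta from z in the x-z plane."""
    return np.cos(theta) * sz + np.sin(theta) * sx

def correlation(theta_a, theta_b):
    """Expectation value of the product of the two +/-1 outcomes."""
    op = np.kron(spin(theta_a), spin(theta_b))
    return float(np.real(singlet.conj() @ op @ singlet))

alice = [0.0, np.pi / 2]
bob = [5 * np.pi / 4, 3 * np.pi / 4]
quantum_value = chsh([[correlation(a, b) for b in bob] for a in alice])
```

Because any distribution P(λ) is a mixture of deterministic assignments, the enumeration already covers the general hidden-variable case; the quantum value for these angles is 2√2.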
The assumption of classical order is sufficient to derive causal inequalities^{16,60}: tasks that, without any further assumptions, cannot be performed on a classical causal structure. However, it is not possible to violate causal inequalities using quantum control of order^{45,61}, which is why additional assumptions were required in the present context. It is an open question whether a gravitational implementation of a scenario that does allow for a violation of causal inequalities is possible.
The theorem we have formulated is theory-independent, but not fully device-independent, as it requires the notions of a physical state and a physical transformation (in addition to the measured probability distributions), which we introduce below before proceeding to the proof. A discussion of the present work in the context of the theory-dependent framework of causally nonseparable quantum processes^{16,45,61} and the fully theory- and device-independent approach of causal inequalities^{16,60} is presented in Supplementary Note 1.
We consider a sufficiently broad framework to describe physical systems that can undergo transformations and measurements, similar to generalised probabilistic theories^{62,63,64}. This framework is more general than quantum or classical theory and we thus need to define key notions required in the proof. In this framework, a state ω is a complete specification of the probabilities P(o|i, ω) for observing outcome o given that a measurement with setting i is performed on the system. We are interested in situations where a system can be split up in subsystems, say S_{1} and S_{2}, with spacelike separated agents performing independent operations on S_{1} and S_{2}. We say ω is a product state, and write ω = ω_{1} ⊗ ω_{2}, if probabilities for local measurements factorise as P(o_{1}, o_{2}|i_{1}, i_{2}, ω) = P(o_{1}|i_{1}, ω_{1})P(o_{2}|i_{2}, ω_{2}). If state \(\omega _1^f\) is prepared for system S_{1} and state \(\omega _2^f\) is prepared for system S_{2}, according to a probability distribution P(f) for some variable f, we write \(\omega = {\int} {\mathrm{d}}f{\kern 1pt} P(f)\omega _1^f \otimes \omega _2^f\) and say the state is separable. Probabilities are then given by the corresponding mixture: \(P(o_1,o_2|i_1,i_2,\omega ) = {\int} {\mathrm{d}}f\,P(o_1|i_1,\omega _1^f)P(o_2|i_2,\omega _2^f)P(f)\). Note that for such a decomposition Bell inequalities cannot be violated^{1,65}.
A physical transformation of the system is represented by a function \(\omega \mapsto T(\omega )\). To make our arguments precise we need a notion of local transformations, namely, realised at the time and location defined by a local clock. If S_{1} is the subsystem on which a local transformation T_{1} acts, and S_{2} labels the DOFs spacelike separated from T_{1}, then, by definition, T_{1} transforms product states as \(\omega _1 \otimes \omega _2 \mapsto T_1(\omega _1) \otimes \omega _2\) and separable states by convex extension. How local operations act on general, nonseparable states can depend on the particular physical theory; however, action on separable states will suffice for our purposes. We further need to define how the local transformations combine. This depends on their relative spatiotemporal locations: if transformations T_{1}, T_{2} are spacelike separated they combine as (T_{1} ⊗ T_{2})(ω_{1} ⊗ ω_{2}) = T_{1}(ω_{1}) ⊗ T_{2}(ω_{2}), which follows from the definition above; if T_{1} is in the future of T_{2}, we define their combination as (T_{1} ∘ T_{2})(ω) = T_{1}(T_{2}(ω)). (For simplicity, we omit possible additional transformations taking place between the specified events, as they are of no consequence for our argument.)
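The two composition rules can be made concrete in the simplest instance of such a framework. The sketch below uses classical probability theory as an assumed example: states are probability vectors and transformations are column-stochastic matrices. All numerical values and the name `T_1_later` are hypothetical.

```python
import numpy as np

# Classical probability theory as one (assumed) instance of the framework:
# states are probability vectors, transformations are column-stochastic matrices.
omega_1 = np.array([0.9, 0.1])
omega_2 = np.array([0.3, 0.7])
T_1 = np.array([[0.8, 0.2], [0.2, 0.8]])   # local transformation on S_1
T_2 = np.array([[0.5, 0.1], [0.5, 0.9]])   # local transformation on S_2

def product(state_1, state_2):
    """Product state of S_1 and S_2 as a joint probability vector."""
    return np.kron(state_1, state_2)

# Spacelike-separated transformations combine as a tensor product.
spacelike = np.kron(T_1, T_2) @ product(omega_1, omega_2)

# Time-ordered transformations on the same subsystem compose, and the
# result generally depends on the order in which they are applied.
T_1_later = np.array([[0.6, 0.3], [0.4, 0.7]])   # hypothetical second map on S_1
first_then_later = T_1_later @ (T_1 @ omega_1)
later_then_first = T_1 @ (T_1_later @ omega_1)
```

The tensor-product rule reproduces T_1(ω_1) ⊗ T_2(ω_2) on product states, while the two compositions on S_1 give different output states, which is why the temporal order of local operations can leave an observable trace.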
Proof. Assumption (1) says that there is a random variable f determining the local states \(\omega _1^f\), \(\omega _2^f\) of systems S_{1}, S_{2}, respectively. Assumption (3) says there is a random variable λ that determines the order of events. In general, the two variables can be correlated by some joint probability distribution P(λ, f). By assumption (4), events labelled A_{1}, B_{1} are spacelike separated from events A_{2}, B_{2} and the order between events within each set (A_{j}, B_{j}), j = 1, 2 can be defined by a permutation σ_{j}. Most generally, there is a probability P(σ_{j}|λ) that the permutation σ_{j} is realised for a given λ. By assumption (2), for each given order the system undergoes a transformation \(T^{\sigma _1} \otimes T^{\sigma _2}\), where \(T^{\sigma _1}\) is the transformation obtained by composing \(T_{{\mathrm{A}}_{\mathrm{1}}}\) and \(T_{{\mathrm{B}}_{\mathrm{1}}}\) in the order corresponding to the permutation σ_{1} and similarly for \(T^{\sigma _2}\). (For example, if σ_{1} corresponds to the order \({\mathrm{A}}_{\mathrm{1}} \prec {\mathrm{B}}_{\mathrm{1}}\), then \(T^{\sigma _1} = T_{{\mathrm{B}}_{\mathrm{1}}} \circ T_{{\mathrm{A}}_{\mathrm{1}}}\).) Furthermore, at event D an outcome z is obtained with a probability P(z|λ, f, σ_{1}, σ_{2}). Finally, using assumption (1), we write the probabilities for all outcomes as
A simple Bayesian inversion P(σ_{1}|λ)P(σ_{2}|λ)P(z|λ, f, σ_{1}, σ_{2})P(λ, f) = P(λ, f, σ_{1}, σ_{2}|z)P(z), where we used P(σ_{j}|λ) = P(σ_{j}|λ, f), gives the desired probabilities
where \(\tilde f\) is a shorthand for the variables λ, f, σ_{1}, and σ_{2}. The above probability distribution satisfies the hypothesis of Bell’s theorem and thus cannot violate any Bell inequality.
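The conclusion of the proof can be illustrated for two qubits. The sketch below compares a classical mixture of event orders with a coherent superposition of them, using the Horodecki criterion for the maximal CHSH value of a two-qubit state; that criterion is a standard result brought in here for illustration and is not part of the proof above. The unitaries, the initial states and the anticorrelation of orders between S_1 and S_2 are assumptions.

```python
import numpy as np

PAULIS = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]

def max_chsh(rho):
    """Maximal CHSH value of a two-qubit state rho (Horodecki criterion)."""
    T = np.array([[np.real(np.trace(rho @ np.kron(p, q))) for q in PAULIS]
                  for p in PAULIS])
    eigenvalues = np.linalg.eigvalsh(T.T @ T)   # ascending order
    return 2.0 * np.sqrt(eigenvalues[-1] + eigenvalues[-2])

def projector(vec):
    """Density matrix of the normalised pure state along vec."""
    vec = vec / np.linalg.norm(vec)
    return np.outer(vec, vec.conj())

# Assumed example unitaries and initial local state |0> for both systems.
U_A = np.array([[1, 0], [0, -1]], dtype=complex)
U_B = np.array([[1, 1j], [1j, 1]], dtype=complex) / np.sqrt(2)
psi = np.array([1, 0], dtype=complex)
a_first = U_B @ U_A @ psi   # order A before B
b_first = U_A @ U_B @ psi   # order B before A

# The orders on S_1 and S_2 are taken to be anticorrelated.
branch_1 = np.kron(a_first, b_first)
branch_2 = np.kron(b_first, a_first)

# Classical order: a hidden variable selects one branch with probability 1/2.
rho_classical = 0.5 * (projector(branch_1) + projector(branch_2))
# Entangled order: coherent superposition of the two branches.
rho_entangled = projector(branch_1 + branch_2)

chsh_classical = max_chsh(rho_classical)
chsh_entangled = max_chsh(rho_entangled)
```

For this choice the classical mixture saturates but does not exceed the bound of 2, whereas the superposition of orders reaches 2√2, in line with the statement that a separable decomposition cannot violate a Bell inequality.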
Exemplary scenario realising Bell test for temporal order of events
The protocol allowing for the violation of Bell’s inequalities for temporal order exploits correlations between the clocks of the agents a_{1}, b_{1} and the agents a_{2}, b_{2}, created due to time dilation induced by the mass. It should be noted that the protocol allows maximal violation of the Bell inequality if the joint state of the systems S_{1} and S_{2} is pure (and maximally entangled) when the Bell measurements are realised. Thus, for a maximal violation, the clocks need to decorrelate from the mass after the application of the unitaries. Below we sketch a scenario that can achieve this.
The spacetime arrangement of the mass and the agents in this example is presented in Fig. 4. It can be realised in one spatial dimension: agents acting on the system S_{1} are located at distance h from each other, and the mass is placed at distance r (configuration \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\)) or r + L (configuration \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\)) from agent a_{1}. Agents acting on system S_{2} are placed symmetrically on the opposite side of the mass, such that the mass is at a distance r + L from a_{2} in configuration \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\) and r in configuration \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\). Here, events A_{j} are defined by the local time τ_{a} that differs from the local time τ_{b} defining B_{j}, j = 1, 2. In such a case, even though the mass is always closer to a_{j} than to b_{j}, the two mass configurations can lead to different event orders—as they induce different relative time dilations. (Equivalently, one can introduce an initial offset in the synchronisation of the clocks.) Note that the time orders between the two groups are here “anticorrelated”: \({\mathrm{A}}_{\mathrm{1}} \prec {\mathrm{B}}_{\mathrm{1}}\) and \({\mathrm{B}}_{\mathrm{2}} \prec {\mathrm{A}}_{\mathrm{2}}\) for \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\), and vice versa for \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\). Since otherwise the scenario is the same for S_{1} and S_{2}, we focus on the operations performed on S_{1}. The key observation is that swapping the mass distribution, as depicted in Fig. 4, will eventually disentangle the clocks from the mass, and since the clocks must be suitably time dilated when the operations are performed, the operations must not take place in the future light cone of the swapped mass state.
The proper time τ_{a} that has to elapse for the clock of a_{1} such that the order of events is \({\mathrm{A}}_{\mathrm{1}} \prec {\mathrm{B}}_{\mathrm{1}}\) for \(|{\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\rangle\) and \({\mathrm{B}}_{\mathrm{1}} \prec {\mathrm{A}}_{\mathrm{1}}\) for \(|{\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\rangle\) for the present case reads
where T_{c}(r, L/2) is the coordinate travel time of light between radial distances r and r + L/2 from the mass. The coordinate time corresponding to τ_{a} is \(T_{\mathrm{a}} = \tau _{\mathrm{a}}/\sqrt { - g_{{\mathrm{00}}}(r)}\). The proper time of event B_{1} is then defined as:
It can be checked directly that when the mass is placed in configuration \({\mathrm{K}}_{{\mathrm{A}} \prec {\mathrm{B}}}\), at a distance r + L from a_{1}, the event A_{1} defined by the local clock of a_{1} showing proper time τ_{a} from Eq. (18) is in the past light cone of event B_{1}, which is defined by the local clock of b_{1} showing proper time τ_{b} from Eq. (19). When the mass is placed in configuration \({\mathrm{K}}_{{\mathrm{B}} \prec {\mathrm{A}}}\), event B_{1} ends up in the past of the event A_{1}. The coordinate time required for the application of the operations can be estimated as twice the travel time of light between the agents, T_{o} = 2T_{c}(r + L/2, h).
The world lines of the mass can be arranged such that: (a) the mass moves slowly, so that the two amplitudes of the mass are swapped in a time interval longer than T_{o}; (b) during the application of the operations the distance of each agent to the mass is approximately the same for both mass configurations (as in Fig. 4). The first condition guarantees that there is enough time to apply the operations after the clocks get correlated; the second guarantees that the slowdown of light in curved spacetime, the Shapiro delay^{66,67}, can be neglected.
The coordinate-time duration of the entire protocol can be estimated as T_{p} = 2T_{a} + 4L/2c, where L/2c is the minimal time required to put the mass in superposition of amplitudes separated by the distance L/2. Taking as an example M ~ 0.1 μg, L = h ~ 0.1 μm, r ~ 1 fm, the protocol in Fig. 4 takes T_{p} ~ 10 h. Furthermore, we note that a quantum treatment of the local clocks is central to our protocol since the application of the operations on the target systems is conditioned on the states of the clocks. The time-energy uncertainty^{68,69} thus poses a limitation on the single-shot precision with which spacetime events can be defined with physical clocks. The optimal clock state in this context, the one evolving the fastest, is a balanced superposition of energy eigenstates; for an energy gap ħ ⋅ 2πν_{c}, where ν_{c} is the clock frequency, the smallest time that can be resolved by a single quantum system is the so-called orthogonalisation time^{70,71,72} t_{⊥} = 1/(2ν_{c}). For the values of parameters quoted above, the coordinate-time difference between the superposed locations of the events A_{i}, i = 1, 2 is ~10^{−15} s, and we thus need a system with frequency ν_{c} ≥ 10^{15} Hz such as a clock based on optical transitions in ytterbium^{73} or mercury^{74}, which both give t_{⊥} ~ 10^{−16} s. While this ideal limit is not reached with practical systems, the resolution of current atomic clocks based on such atoms far exceeds this theoretical bound due to averaging over many atoms, with 2.5 × 10^{−19} uncertainty of the clock frequency recently demonstrated in ref. ^{75}. We further note that by using n entangled atoms, the orthogonalisation time of the entire system becomes t_{⊥}/n and can thus be even a few orders of magnitude smaller^{76} than required. Finally, such atoms have masses ~10^{−25} kg and their back action on the metric produced by M ~ 10^{−7} kg would thus be negligible. Since the mass difference between the atom in the two involved energy levels is 2πħν_{c}/c^{2} ~ 10^{−35} kg, quantum effects from the clocks’ mutual gravitational interactions^{58} can also be neglected.
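The order-of-magnitude estimates for the clocks can be reproduced with a few lines of arithmetic. The sketch below evaluates the orthogonalisation time t_⊥ = 1/(2ν_c), its reduction for n entangled atoms, and the mass equivalent of the energy gap; the value n = 100 is a hypothetical example, and the constants are rounded.

```python
import math

HBAR = 1.0546e-34   # reduced Planck constant, J s
C = 2.998e8         # speed of light, m/s

def orthogonalisation_time(nu_c, n_atoms=1):
    """t_perp = 1 / (2 nu_c), reduced to t_perp / n for n entangled atoms."""
    return 1.0 / (2.0 * nu_c * n_atoms)

def mass_difference(nu_c):
    """Mass equivalent of the clock's energy gap, 2 pi hbar nu_c / c^2."""
    return 2.0 * math.pi * HBAR * nu_c / C**2

nu_c = 1e15                                               # Hz, frequency scale quoted in the text
t_single = orthogonalisation_time(nu_c)                   # single atom
t_entangled = orthogonalisation_time(nu_c, n_atoms=100)   # hypothetical n = 100
delta_m = mass_difference(nu_c)                           # kg
```

For ν_c = 10^15 Hz the single-atom value is 5 × 10^−16 s and the mass difference is a few times 10^−36 kg, consistent with the orders of magnitude quoted above.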
We conclude that it is in principle possible to achieve the required entanglement of orders, swap the mass distribution so as to finally disentangle the clocks from the mass, and satisfy the locality conditions on the events. Although a direct experiment in such a regime is not practical, the above example surprisingly shows that the regime where entangled temporal order arises is in no way related to the Planck scale. It is usually assumed that the Planck scale marks the regime where quantum gravity effects become relevant (first discussed in this context by Bronstein^{77}), but this is not the case for the superposition of temporal order. In terms of a potential experiment, one could also take a different (theory-dependent) approach and explore possible witnesses of entangled temporal order^{61}, in analogy to witnesses of entanglement in quantum information theory^{78}. A witness would probe the quantum nature of temporal order indirectly and under further assumptions, but in a relaxed parameter range. Such an approach may lead to more feasible experiments, which will be explored in a future study.
A spatial superposition state of a mass such as used in our protocol is postulated to decohere in various gravity-inspired collapse models^{9,10,11,12,13} (which thus violate assumption (c) in the first section). However, even if endorsed, these models do not immediately preclude realisation of our protocol: the decoherence time scale in those models is the Diosi–Penrose time^{10,11} \(T_{{\mathrm{DP}}} = \frac{{2\delta ^3\hbar }}{{G(ML)^2}}\), where δ is a free parameter. For every value of δ one can find the mass and the relevant distances (M, r, L, h) so that the duration of our entire protocol is shorter than T_{DP}. For example, following the recent ref. ^{79} and taking δ = 10^{−7} m, for r = 10^{10}R_{Sch}, L = 5r, h = r and M = 1 g, where R_{Sch} ≈ 10^{−30} m, the protocol from Fig. 4 takes T_{p} ≈ 7 × 10^{−18} s, while T_{DP} ≈ 0.5 s. Taking instead the originally proposed value^{10} δ = 10^{−15} m, the desired regime is achieved, for example, for M = 10^{−7} kg, r = 10^{7}R_{Sch}, L = 5 × 10^{5}r, h = 10^{5}r, with T_{p} ~ 10^{−23} s and T_{DP} ~ 10^{−13} s. Thus, the above models in principle still allow for events with entangled temporal order, and do not enforce the classicality of the causal structure of spacetime.
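The quoted Diosi–Penrose times follow from the formula above once R_{Sch} is evaluated. The sketch below assumes the usual Schwarzschild radius R_{Sch} = 2GM/c^2 and rounded constants; the function names are hypothetical. It does not evaluate T_{p}, which depends on the protocol timings defined earlier.

```python
G = 6.674e-11       # gravitational constant, m^3 kg^-1 s^-2
HBAR = 1.0546e-34   # reduced Planck constant, J s
C = 2.998e8         # speed of light, m/s

def schwarzschild_radius(mass):
    """R_Sch = 2 G M / c^2 (assumed definition of R_Sch)."""
    return 2.0 * G * mass / C**2

def diosi_penrose_time(delta, mass, separation):
    """T_DP = 2 delta^3 hbar / (G (M L)^2)."""
    return 2.0 * delta**3 * HBAR / (G * (mass * separation) ** 2)

def example(delta, mass, r_in_rsch, l_over_r):
    """T_DP for r given in units of R_Sch and L given in units of r."""
    r = r_in_rsch * schwarzschild_radius(mass)
    return diosi_penrose_time(delta, mass, l_over_r * r)

# delta = 1e-7 m, M = 1 g, r = 1e10 R_Sch, L = 5 r
t_dp_recent = example(delta=1e-7, mass=1e-3, r_in_rsch=1e10, l_over_r=5.0)
# delta = 1e-15 m, M = 1e-7 kg, r = 1e7 R_Sch, L = 5e5 r
t_dp_original = example(delta=1e-15, mass=1e-7, r_in_rsch=1e7, l_over_r=5e5)
```

Both parameter sets reproduce the orders of magnitude given in the text: roughly half a second in the first case and a few times 10^−13 s in the second.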
Data availability
The data that support the plots within this paper and other findings of this study are available from the corresponding author upon reasonable request.
References
 1.
Bell, J. S. On the Einstein–Podolsky–Rosen paradox. Physics 1, 195–200 (1964).
 2.
Clauser, J. F., Horne, M. A., Shimony, A. & Holt, R. A. Proposed experiment to test local hidden-variable theories. Phys. Rev. Lett. 23, 880–884 (1969).
 3.
Freedman, S. J. & Clauser, J. F. Experimental test of local hidden-variable theories. Phys. Rev. Lett. 28, 938–941 (1972).
 4.
Hensen, B. et al. Loophole-free Bell inequality violation using electron spins separated by 1.3 kilometres. Nature 526, 682–686 (2015).
 5.
Giustina, M. et al. Significant-loophole-free test of Bell’s theorem with entangled photons. Phys. Rev. Lett. 115, 250401 (2015).
 6.
Shalm, L. K. et al. Strong loophole-free test of local realism. Phys. Rev. Lett. 115, 250402 (2015).
 7.
Hardy, L. Probability theories with dynamic causal structure: a new framework for quantum gravity. Preprint at https://arxiv.org/abs/gr-qc/0509120 (2005).
 8.
Hardy, L. Towards quantum gravity: a framework for probabilistic theories with nonfixed causal structure. J. Phys. A 40, 3081–3099 (2007).
 9.
Karolyhazy, F. Gravitation and quantum mechanics of macroscopic objects. Il Nuovo Cim. A 42, 390–402 (1966).
 10.
Diosi, L. Models for universal reduction of macroscopic quantum fluctuations. Phys. Rev. A 40, 1165–1174 (1989).
 11.
Penrose, R. On gravity’s role in quantum state reduction. Gen. Relativ. Gravit. 28, 581–600 (1996).
 12.
Stamp, P. C. E. Environmental decoherence versus intrinsic decoherence. Philos. Trans. R. Soc. Ser. A 370, 4429–4453 (2012).
 13.
Penrose, R. On the gravitization of quantum mechanics 1: quantum state reduction. Found. Phys. 44, 557–575 (2014).
 14.
Kiefer, C. Quantum Gravity 3rd edn. International Series of Monographs on Physics (OUP, Oxford, 2012).
 15.
Chiribella, G., D’Ariano, G. M., Perinotti, P. & Valiron, B. Quantum computations without definite causal structure. Phys. Rev. A 88, 022318 (2013).
 16.
Oreshkov, O., Costa, F. M. & Brukner, C. Quantum correlations with no causal order. Nat. Commun. 3, 1092 (2012).
 17.
Hawking, S. W., King, A. R. & McCarthy, P. J. A new topology for curved space-time which incorporates the causal, differential, and conformal structures. J. Math. Phys. 17, 174–181 (1976).
 18.
Malament, D. B. The class of continuous timelike curves determines the topology of spacetime. J. Math. Phys. 18, 1399–1404 (1977).
 19.
Stachel, J. The hole argument and some physical and philosophical implications. Living Rev. Relativ. 17, 1 (2014).
 20.
Rovelli, C. What is observable in classical and quantum gravity? Class. Quantum Gravity 8, 297 (1991).
 21.
Hafele, J. C. & Keating, R. E. Around-the-world atomic clocks: observed relativistic time gains. Science 177, 168–170 (1972).
 22.
Chou, C.W., Hume, D., Rosenband, T. & Wineland, D. Optical clocks and relativity. Science 329, 1630–1633 (2010).
 23.
Pound, R. & Rebka, G. Apparent weight of photons. Phys. Rev. Lett. 4, 337–341 (1960).
 24.
Zych, M., Costa, F., Pikovski, I. & Brukner, C. Quantum interferometric visibility as a witness of general relativistic proper time. Nat. Commun. 2, 505 (2011).
 25.
Zych, M., Costa, F., Pikovski, I., Ralph, T. C. & Brukner, C. General relativistic effects in quantum interference of photons. Class. Quantum Gravity 29, 224010 (2012).
 26.
Zych, M., Pikovski, I., Costa, F. & Brukner, Č. General relativistic effects in quantum interference of “clocks”. J. Phys. 723, 012044 (2016).
 27.
Pikovski, I., Zych, M., Costa, F. & Brukner, Č. Universal decoherence due to gravitational time dilation. Nat. Phys. 11, 668–672 (2015).
 28.
Pikovski, I., Zych, M., Costa, F. & Brukner, Č. Time dilation in quantum systems and decoherence. N. J. Phys. 19, 025011 (2017).
 29.
Zych, M. Quantum Systems under Gravitational Time Dilation. Springer theses (Springer International Publishing, Switzerland, 2017).
 30.
Zych, M., Costa, F. & Ralph, T. C. Relativity of quantum superpositions. Preprint at https://arxiv.org/abs/1809.04999 (2018).
 31.
Weinberg, S. Gravitation and Cosmology: Principle and Applications of General Theory of Relativity (Wiley, New York, 1972).
 32.
Hu, B. L. & Verdaguer, E. Stochastic gravity: a primer with applications. Class. Quantum Gravity 20, R1 (2003).
 33.
Procopio, L. M. et al. Experimental superposition of orders of quantum gates. Nat. Commun. 6, 7913 (2015).
 34.
Rubino, G. et al. Experimental verification of an indefinite causal order. Sci. Adv. 3, https://doi.org/10.1126/sciadv.1602589 (2017).
 35.
Rubino, G. et al. Experimental entanglement of temporal orders. Preprint at http://arxiv.org/abs/1712.06884 (2017).
 36.
Goswami, K. et al. Indefinite causal order in a quantum switch. Phys. Rev. Lett. 121, 090503 (2018).
 37.
Goswami, K., Romero, J. & White, A. Communicating via ignorance. Preprint at https://arxiv.org/abs/1807.07383 (2018).
 38.
Wei, K. et al. Experimental quantum switching for exponentially superior quantum communication complexity. Phys. Rev. Lett. 122, 120504 (2019).
 39.
Guo, Y. et al. Experimental investigating communication in a superposition of causal orders. Preprint at http://arxiv.org/abs/1811.07526 (2018).
 40.
Chiribella, G. Perfect discrimination of no-signalling channels via quantum superposition of causal structures. Phys. Rev. A 86, 040301 (2012).
 41.
Colnaghi, T., D’Ariano, G. M., Facchini, S. & Perinotti, P. Quantum computation with programmable connections between gates. Phys. Lett. A 376, 2940–2943 (2012).
 42.
Araújo, M., Costa, F. & Brukner, C. Computational advantage from quantum-controlled ordering of gates. Phys. Rev. Lett. 113, 250402 (2014).
 43.
Feix, A., Araújo, M. & Brukner, Č. Quantum superposition of the order of parties as a communication resource. Phys. Rev. A 92, 052326 (2015).
 44.
Guérin, P. A., Feix, A., Araújo, M. & Brukner, Č. Exponential communication complexity advantage from quantum superposition of the direction of communication. Phys. Rev. Lett. 117, 100502 (2016).
 45.
Oreshkov, O. & Giarmatzi, C. Causal and causally separable processes. N.J. Phys. 18, 093020 (2016).
 46.
Abbott, A. A., Giarmatzi, C., Costa, F. & Branciard, C. Multipartite causal correlations: polytopes and inequalities. Phys. Rev. A 94, 032131 (2016).
 47.
Bose, S. et al. Spin entanglement witness for quantum gravity. Phys. Rev. Lett. 119, 240401 (2017).
 48.
Marletto, C. & Vedral, V. Gravitationally induced entanglement between two massive particles is sufficient evidence of quantum effects in gravity. Phys. Rev. Lett. 119, 240402 (2017).
 49.
Belenchia, A. et al. Quantum superposition of massive objects and the quantization of gravity. Phys. Rev. D 98, 126009 (2018).
 50.
Bose, S., Jacobs, K. & Knight, P. L. Scheme to probe the decoherence of a macroscopic object. Phys. Rev. A 59, 3204–3210 (1999).
 51.
Marshall, W., Simon, C., Penrose, R. & Bouwmeester, D. Towards quantum superpositions of a mirror. Phys. Rev. Lett. 91, 130401 (2003).
 52.
Kleckner, D. et al. Creating and verifying a quantum superposition in a micro-optomechanical system. N. J. Phys. 10, 095020 (2008).
 53.
Wan, C. et al. Free nano-object Ramsey interferometry for large quantum superpositions. Phys. Rev. Lett. 117, 143003 (2016).
 54.
Schmöle, J., Dragosits, M., Hepach, H. & Aspelmeyer, M. A micromechanical proof-of-principle experiment for measuring the gravitational force of milligram masses. Class. Quantum Gravity 33, 125031 (2016).
 55.
Giddings, S. B., Marolf, D. & Hartle, J. B. Observables in effective gravity. Phys. Rev. D 74, 064018 (2006).
 56.
Donnelly, W. & Giddings, S. B. Diffeomorphism-invariant observables and their nonlocal algebra. Phys. Rev. D 93, 024030 (2016).
 57.
Zych, M. & Brukner, Č. Quantum formulation of the Einstein equivalence principle. Nat. Phys. 14, 1027–1031 (2018).
 58.
Castro Ruiz, E., Giacomini, F. & Brukner, Č. Entanglement of quantum clocks through gravity. Proc. Natl. Acad. Sci. USA 114, E2303–E2309 (2017).
 59.
Zych, M., Rudnicki, L. & Pikovski, I. Gravitational mass of composite systems. Phys. Rev. D 99, 104029 (2019).
 60.
Branciard, C., Araújo, M., Feix, A., Costa, F. & Brukner, Č. The simplest causal inequalities and their violation. N. J. Phys. 18, 013008 (2016).
 61.
Araújo, M. et al. Witnessing causal nonseparability. N. J. Phys. 17, 102001 (2015).
 62.
Hardy, L. Quantum theory from five reasonable axioms. Preprint at https://arxiv.org/abs/quant-ph/0101012 (2001).
 63.
Barrett, J. Information processing in generalized probabilistic theories. Phys. Rev. A 75, 032304 (2007).
 64.
Chiribella, G., D’Ariano, G. M. & Perinotti, P. Probabilistic theories with purification. Phys. Rev. A 81, 062348 (2010).
 65.
Fine, A. Hidden variables, joint probability, and the Bell inequalities. Phys. Rev. Lett. 48, 291–295 (1982).
 66.
Shapiro, I. I. Fourth test of general relativity. Phys. Rev. Lett. 13, 789–791 (1964).
 67.
Shapiro, I. I. et al. Fourth test of general relativity: new radar result. Phys. Rev. Lett. 26, 1132–1135 (1971).
 68.
Fleming, G. A unitarity bound on the evolution of nonstationary states. Il Nuovo Cim. A 16, 232–240 (1973).
 69.
Mandelstam, L. & Tamm, I. The Uncertainty Relation Between Energy and Time in Nonrelativistic Quantum Mechanics, in Selected Papers, 115–123 (Springer, Berlin Heidelberg, 1991).
 70.
Margolus, N. & Levitin, L. B. The maximum speed of dynamical evolution. Phys. D 120, 188–195 (1998).
 71.
Kosiński, P. & Zych, M. Elementary proof of the bound on the speed of quantum evolution. Phys. Rev. A 73, 024303 (2006).
 72.
Zieliński, B. & Zych, M. Generalization of the Margolus-Levitin bound. Phys. Rev. A 74, 034301 (2006).
 73.
Pizzocaro, M. et al. Absolute frequency measurement of the ^{1}S_{0}−^{3}P_{0} transition of ^{171}Yb. Metrologia 54, 102–112 (2017).
 74.
Hoyt, C. W. et al. Observation and absolute frequency measurements of the ^{1}S_{0}−^{3}P_{0} optical clock transition in neutral ytterbium. Phys. Rev. Lett. 95, 083003 (2005).
 75.
Marti, G. E. et al. Imaging optical frequencies with 100 μHz precision and 1.1 μm resolution. Phys. Rev. Lett. 120, 103201 (2018).
 76.
Kómár, P. et al. Quantum network of atom clocks: a possible implementation with neutral atoms. Phys. Rev. Lett. 117, 060506 (2016).
 77.
Gorelik, G. in Studies in the History of General Relativity (eds Eisenstaedt, J. & Kox, A. J.) Vol. 4, 364–379 (Birkhaeuser, Boston, 1992).
 78.
Terhal, B. M. Bell inequalities and the separability criterion. Phys. Lett. A 271, 319–326 (2000).
 79.
Bahrami, M., Smirne, A. & Bassi, A. Role of gravity in the collapse of a wave function: a probe into the Diósi–Penrose model. Phys. Rev. A 90, 062105 (2014).
Acknowledgements
We thank G. Chiribella, G. Milburn, H. Wiseman and M. Vojinovic for feedback. M.Z. and F.C. acknowledge support through the Australian Research Council (ARC) Centre of Excellence for Engineered Quantum Systems (CE 110001013), Discovery Early Career Researcher Awards DE180101443, DE170100712 and the Templeton World Charity Foundation (TWCF 0064/AB38). I.P. acknowledges support of the NSF through a grant to ITAMP and the Branco Weiss Fellowship – Society in Science, administered by the ETH Zürich. Č.B. acknowledges the support of the Austrian Science Fund (FWF) through the Doctoral Programme CoQuS, the projects I2526-N27 and I2906, the research platform TURIS and the ÖAW Innovationsfond “Quantum Regime of Gravitational Source Masses”. This publication was made possible through the support of a grant from the John Templeton Foundation and from the Foundational Questions Institute (FQXi) Fund. The opinions expressed in this publication are those of the authors and do not necessarily reflect the views of the John Templeton Foundation. F.C. and M.Z. acknowledge the traditional owners of the land on which the University of Queensland is situated, the Turrbal and Jagera people.
Author information
Contributions
M.Z., F.C., I.P. and Č.B. contributed to all aspects of the research, with the leading input from M.Z.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information: Nature Communications thanks Rodolfo Gambini and other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zych, M., Costa, F., Pikovski, I. et al. Bell’s theorem for temporal order. Nat Commun 10, 3772 (2019). https://doi.org/10.1038/s41467-019-11579-x