Introduction

Quantum theory and general relativity are the two most fundamental theories of physics; yet we know that one of them, or both, might ultimately have to be modified into a more general theory, which will resolve their clash.

A possible route to that general theory is the quantisation of gravity, in the same spirit as other field theories; yet, whether or not this quantisation is necessary is still controversial. A number of arguments have been proposed in favour of it,1,2,3 to show that, as Feynman put it, ‘We are in trouble if we believe in quantum mechanics, but we do not quantise gravitational theory’.2

These arguments all address the more general problem of whether a hybrid system composed of a quantum and a classical sectors, coupled via some interaction, is possible—which goes well beyond the domain of quantum gravity. It was already considered by Heisenberg in the context of the coupling between electromagnetic fields and quantum charges,4 and it has been an open question ever since.5 In such arguments, the quantum sector is required to obey unitary quantum theory. The classical sector instead is required to satisfy a number of different conditions, capturing some notion of classicality: for instance, in ref. 1 it is required to have observables which can all be sharp simultaneously; in ref. 3 to obey a Liouvillian dynamics in phase space; in ref. 6 to be described by a Wigner function whose evolution is generated by Moyal brackets. In all such cases, the conclusion is that the model describing the hybrid classical-quantum system is inconsistent. In particular, either the quantum sector must spread its features to the classical sector; or the uncertainty relations are violated on the quantum sector.

Now, these arguments have two problems. First, they all rely on adopting a specific hybrid dynamical formalism, capturing both quantum and classical dynamics, thus having limited scope and applicability. Also, some of them assume particular couplings between the classical and the quantum sectors or particular measuring procedures, following Heisenberg’s logic.7, 8

In addition, the dynamics on the composite system is frequently assumed to be unitary, i.e., linear, deterministic and reversible. This is too strong as an assumption, since it assumes the fundamental trait of unitary quantum dynamics on the composite system, thus almost forcing that trait to hold on the classical sector.

In this paper we propose a radically different argument which is free of these shortcomings, formulated exclusively in information-theoretic terms. It is independent of specific details of the dynamical theories or measurement procedures that couple the two sectors; and the quantum sector is modelled in such a way that it includes even systems that obey post-quantum theories. The information-theoretic nature of the argument allows it to be applicable to more general cases, where the classical sector need not be the gravitational field, but a generic non-quantum system, e.g., a simple harmonic oscillator or a classical bit. We shall nonetheless refer to the two sectors as ‘quantum’ and ‘classical’, with the understanding that those notions are generalised as detailed in our model.

In this more general form, the argument has interesting consequences. In particular, it is compatible with the classical sector not obeying the full quantum theory. As an example, we consider a hybrid system made of qubits and rebits.9 A rebit has complementary observables, but its dynamics is restricted to states that have only real amplitudes. Yet, in such hybrid systems superdense coding and teleportation are still possible. This provides a deeper insight into what ‘quantising’ a system really means; it also highlights an interesting difference between ways of implementing the swap operation, which is not manifest when the two systems are of the same kind.

Results

To formulate the argument independently of dynamics, we adopt the approach of the constructor theory of information.10, 11 The advantage is that there is no need to assume any specific underlying dynamical model and the assumptions can be cast in general information-theoretic terms.

In constructor theory physical systems are ‘substrates’—i.e., physical systems on which physical transformations, or ‘tasks’, can be performed. An attribute n is the set of all states where the substrate has a given property. For instance, the set of all quantum states of a qubit where a given projector is sharp with value 1 is an attribute. A task is the specification of a physical transformation, in terms of input/output pairs of attributes. For example, the NOT task on the attributes 0, 1 is written as {0 → 1, 1 → 0}.

A task is ‘impossible’ if the laws of physics impose a limit to how accurately it can be performed. Unitary quantum theory, for instance, requires the task of cloning sets of non-orthogonal quantum states to be impossible.12 Otherwise, the task is ‘possible’: there can be arbitrarily good approximations to a ‘constructor’ for it, which is defined as a substrate that whenever presented with the substrates in any of the input attributes of the task, delivers them in one of the corresponding output attributes, and, crucially, retains the property of doing it again. In quantum information, gates for computational tasks are example of constructors.13

In constructor theory, one can provide a powerful information-theoretic characterisation of the classical and quantum sectors, without specifying particular dynamical laws, by regarding them as different classes of substrates, defined as follows.10 First, one defines an ‘information medium’ as a substrate with a set of attributes X, called ‘information variable’, with the property that the following tasks are possible:

$$\mathop {\bigcup}\limits_{x \in X} \left\{ {\left( {{\bf{x}},{{\bf{x}}_{\bf{0}}}} \right) \to \left( {{\bf{x}},{\bf{x}}} \right)} \right\},$$
(1)
$$\mathop {\bigcup}\limits_{x \in X} \left\{ {{\bf{x}} \to \Pi ({\bf{x}})} \right\}$$
(2)

for all permutation Π on the set of labels of the attributes in X and some blank attribute x 0X.

The former task corresponds to ‘copying’, or cloning, the attributes of the first replica of the substrate onto the second, target, substrate; the latter, for a particular Π, corresponds to a logically reversible computation (which need not require it to be realised in a physically reversible way). So, an information medium is a substrate that can be used for classical information processing (but could, in general, be used for more). For example, a qubit is an information medium with any set of two orthogonal quantum states.

Any two information media (e.g., a photon and an electron) must satisfy an ‘interoperability principle’,10 which expresses elegantly the intuitive property that classical information must be copiable from one information medium to any other, irrespective of their physical details. Specifically, if S 1 and S 2 are information media, respectively with information variable X 1 and X 2, their composite system S 1 S 2 is an information medium with information variable X 1 × X 2, where X denotes the Cartesian product of sets. This requires the task of copying information variables (as in Eq. (1)) from one to the other to be possible.

With these tools one can express information-theoretic concepts, such as measuring and distinguishing, without resorting to formal concepts, such as orthogonality, linearity or unitarity. This is the key feature that will allow our argument to be independent of particular dynamical models. The variable X is ‘distinguishable’ if the task

$$\mathop {\bigcup}\limits_{x \in X} \left\{ {{\bf{x}} \to {{\bf{q}}_{\bf{x}}}} \right\}$$
(3)

is possible, where the variable {q x } is some information variable. If the variable {x 0 , x 1 } is distinguishable, we say that the attribute x 0 is distinguishable from x 1 , denoted as x 0 x 1 . This notion of distinguishability allows one to generalise the orthogonal complement of a vector space: for any attribute n define the attribute \(\overline {\bf{n}} \) as the union of all attributes that are distinguishable from n.

An ‘observable’ is an information variable whose attributes x have the property that \(\overline{\overline {\bf{x}}} = {\bf{x}}\); this notion generalises that of a quantum observable. An observable X is said to be ‘sharp’ on a substrate, with value x, if the substrates is in a state ξ that belongs to one of the attributes x X. A special case of the distinguishing task is the perfect measurement task:

$$\mathop {\bigcup}\limits_{x \in X} \left\{ {\left( {{\bf{x}},{{\bf{x}}_{\bf{0}}}} \right) \to \left( {{\bf{x}},{{\bf{p}}_{\rm{x}}}} \right)} \right\}$$
(4)

where the first substrate is the ‘source’ and the second the ‘target’. From the interoperability principle, it follows that the above task must be possible for any information variable. A measurer of X is any constructor that can perform the above task, for some choice of the ‘output variable’ {p x }. In particular,10 a measurer of an observable X has the property that if the output variable is sharp with value p x on the target, then the observable X must be sharp on the source in input, with value x.

We can now formulate the argument for quantisation using these information-theoretic tools only. One starts by characterising the two sectors, as follows.

The classical sector S C is modelled as an information medium with the property that the union of all of its information observables is an information observable T. This means that its observables can all be simultaneously sharp.

To model the quantum sector S Q we resort to the constructor-theoretic notion of a ‘superinformation medium’—an information medium with at least two ‘disjoint’ information observables X and Z, with the property that X Z is not an information variable. A qubit, for example, is a superinformation medium with observables X = {x 1 , x 2 } and Z = {z 1 , z 2 }, where z i labels the attribute of being in the i-th eigenstate of the Z component of the qubit, and likewise for X. This is because the variable {z 1 , z 2 , x 1 , x 2 } is not copiable perfectly under unitary quantum theory, and therefore it is not an information variable. On the other hand, a system obeying, e.g., Spekkens’ toy model14 is not a superinformation medium, because the reason why attributes in different observables cannot be cloned is that their intersection is non-empty in the space of states (in that they correspond to an uncertainty in preparation).

In addition, we require that S Q has the property that a perfect measurer of Z is also capable of distinguishing X, as in Eq. (3), and vice versa. This is true in quantum theory, where a CNOT in the X basis can be used perfectly to discriminate the states in the Z basis.

We shall now proceed to demonstrate the main result of our paper—namely, that the classical sector S C must itself be a superinformation medium with at least two observables, which cannot be sharp simultaneously. Informally speaking, the generalised ‘complementarity’ feature of the quantum sector must spread to the classical sector.

For simplicity, we assume that the all the information observables are binary: T = {t 1 , t 2 }, Z = {z 1 , z 2 }, X = {x 1 , x 2 }. As shown in ref. 10, the attributes x in X generalise quantum superpositions or mixtures of the eigenstates of Z. In particular, define the attribute \({u_Z} \buildrel\textstyle.\over= \mathop {\bigcup}\nolimits_{z \in Z} {\bf{z}}\); one then has that x, z: xz, x ∩ z = {}, and \({\bf{x}} \subset {\overline{\overline u} _Z}\)—which corresponds to the fact that each x generalises the notion of a quantum state that is in the “span” of the eigenstates of Z, but is not an eigenstate of Z.

A particular realisation of a measurer of Z is a constructor for the following task on the joint substrate S C S Q :

$$\left\{ {\left( {{{\bf{x}}_1},{{\bf{t}}_1}} \right) \to \left( {{{\bf{x}}_1},{{\bf{t}}_1}} \right),\left( {{{\bf{x}}_2},{{\bf{t}}_1}} \right) \to \left( {{{\bf{x}}_2},{{\bf{t}}_2}} \right)} \right\}.$$

This measurer must exist as a consequence of interoperability of information media.

Now, suppose this measurer of Z is presented with x where X is sharp. We have:

$$\left\{ {\left( {{{\bf{x}}_1},{{\bf{t}}_1}} \right) \to {{\bf{p}}_ + },\left( {{{\bf{x}}_2},{{\bf{t}}_2}} \right) \to {{\bf{p}}_ - }} \right\},$$

where, by the requirement that the measurer of Z can also distinguish X, {p +, p } is an information variable.

Let us define an observable S on the composite system, which once again must exist by the interoperability of information, whose physical interpretation is ‘whether the first substrate has the same label z as the second substrate’. S generalises what in quantum theory would be the projector for being in the symmetric subspace. By relabelling t 1 as z 1 and t 2 as z 2, it follows that S must be sharp in both p + and p , with value “Yes”.11

A particular way of measuring S is to apply a measurer of T to S C, with target S Q, defined so that:

$$\begin{array}{l}\\ \left( {{{\bf{z}}_1},{{\bf{t}}_1}} \right) \to \left( {{{\bf{z}}_1},{{\bf{t}}_1}} \right)\\ \\ \left( {{{\bf{z}}_2},{{\bf{t}}_2}} \right) \to \left( {{{\bf{z}}_1},{{\bf{t}}_2}} \right)\\ \\ \left( {{{\bf{z}}_2},{{\bf{t}}_1}} \right) \to \left( {{{\bf{z}}_2},{{\bf{t}}_1}} \right)\\ \\ \left( {{{\bf{z}}_1},{{\bf{t}}_2}} \right) \to \left( {{{\bf{z}}_2},{{\bf{t}}_2}} \right)\\ \end{array}$$
(5)

where Z being sharp on the target S Q with value z 1 means ‘yes’ and with value z 2 means ‘no’. Therefore, a measurer of T applied to S C S Q prepared with the attributes p + or p delivers in output S Q with Z sharp with value z 1 :

$$\begin{array}{l}\\ {{\bf{p}}_ + } \to \left( {{{\bf{z}}_1},{{\bf{r}}_1}} \right)\\ \\ {{\bf{p}}_ - } \to \left( {{{\bf{z}}_1},{{\bf{r}}_2}} \right)\\ \end{array}$$

where r 1 and r 2 are some attributes of the classical sector S C—whose existence is a consequence of this argument.

Now, crucially, the two attributes r 1 and r 2 can be shown to constitute an information observable, which cannot be simultaneously sharp when T is sharp. Specifically, the observable T cannot be sharp in any of the states in either r 1 or r 2,11: tT, r 1  ∩ t and r 2  ∩ t are empty. For suppose, say, r 1  ∩ t 1 included some state s. Then, by definition of measurer of Z, Z should be sharp in input on S Q, with value z 1; but this contradicts the defining property of superinformation media, that z 1 and x 1 have empty intersection. Likewise for the other attributes. In addition, r 1 t or r 2 t, because otherwise, again, x 1 and x 2 should be distinguishable from some of the z’s, contrary to assumption. Also, upon defining u T  =  tT t, both r 1 and r 2 are included in \({\overline{\overline u} _T}\)—i.e., they must belong to the generalisation of what in quantum theory would be called the span of the quantum states in the T basis. Note that this conclusion would follow even without assuming the additional property that a measurer of Z can also distinguish the attributes in X. This would correspond to a weaker result, showing that the classical sector must at least exhibit the equivalent of mixtures—but need not have two complementary observables.

In addition, composing a measurer of T with a measurer of Z as above still gives a measurer of Z (see the Fig. 1). By requiring that any measurer of Z can also distinguish X (i.e., it can perform the task Eq. (3) for X), one has that the r 1 , r 2 must be an information observable, too.

Fig. 1
figure 1

Equivalence of measurers. A measurer M Z of Z acting on S Q composed with a measurer M T of T on S C is equivalent to a different realisation \({M_{\rm{z}}}\) of a measurer of Z on S Q

Hence S C must have two ‘complementary observables’: T = {t 1 , t 2 } and R = {r 1 , r 2 }, thus being a superinformation medium with those two observables. This concludes our argument that S C must also be ‘quantised’.

Discussion

Note that it is impossible for r 1 and r 2 to be statistical mixtures of t 1 and t 2 , because they are perfectly distinguishable from one another. In addition, our argument rules out the case where r 1 and r 2 each correspond to sets of states intersecting with t 1 and t 2 , e.g., because they represent some irreducible limitation to resolution in the space of states, as in ref. 14.

Remarkably, our argument does not assume unitarity or reversibility of the dynamical laws, which was assumed in previous arguments. One only uses the definition of an observable and of a measurer. Specifically, we assume that the measurer acts deterministically on all states of the combined system S C S Q. In other words, any state in input is required to produce always the same state in output whenever the measurer acts upon it. This requires that the measurement can be performed to an arbitrarily high degree of accuracy, i.e., the noise can be reduced to negligible values. Hence, the argument does not apply to stochastic, collapse-based variants of quantum theory, e.g., refs. 15, 16. This is because if that would result in a high level of noise, which would cause the measurer to produced indistinguishable output states when presented with different input states. Note also that a key assumption is that a measurer of Z can also distinguish X: therefore, the argument would not apply to theories claiming that quantum gravity has no observable effects, e.g..17, 18

The same result can be achieved by supposing, as in ref. 2, that the measurement interaction is physically reversible on the composite substrate S C S Q, when S Q is prepared in any of the attributes in the joint variable XZ. This is a stronger assumption, which implies our assumption that the measurer of X is capable of distinguishing Z. Reversibility in this sense holds in quantum information theory, but need not be true in general.

Our argument does not require S C to obey strictly quantum theory, because our assumptions allow for more general cases. However, we do believe that more specific features of bipartite quantum systems, such as the maximum violation of the Bell inequality, can also be derived by adding further information-theoretic constraints,11 in the same spirit as ref. 19. It should also be stressed that under our definition of classicality the argument run the other way around would have no implications for the quantum sector. Let us explain why in some detail. Under our definition of classicality, a system is classical if it has only one observable, i.e., it has no complementary observables. Our argument the other way around would be assuming that the quantum sector is coupled to the classical sector via a measurer of that classical observable. This would have no implications for the quantum sector. This is because that measurement only requires there to be a set of distinguishable states on the quantum side that can be used as outputs of the measurer, and that is always possible for a quantum system of high enough dimensionality. On the other hand, no matter how large the dimension of the classical system is, our argument shows that it cannot be just classical if coupled to the quantum sector via a measurer of one of its quantum observables. The classical side must have at least another observable that does not commute with the one originally assumed to exist. However, a different notion of classicality, e.g., that the classical system induces a collapse or some sort of decoherence on the quantum system, might lead to a different conclusion. As discussed earlier, this does not satisfy the assumptions of our argument.

As an example of a more general hybrid system, we shall consider systems consisting of two different superinformation media: qubits and rebits.

A rebit is a superinformation medium, which is obtained from a qubit by requiring that the information observables (as defined above) are only the physical quantities represented by real-valued operators. We interpret this as meaning that all states in the Block sphere can be prepared, but only a subset of pairs of orthogonal pure states can be discriminated; precisely those that can be represented as two-dimensional vectors on the real Hilbert space, \(\left| \psi \right\rangle = a\left| 0 \right\rangle + b\left| 1 \right\rangle \) where a and b are real amplitudes whose squares add up to 1, and \(\left| 0 \right\rangle \) and \(\left| 1 \right\rangle \) are the eigenvectors of the observable Z. Complex amplitudes, in other words, are not observable. A single rebit can be used to perform interference experiments, because it has two non-commuting observables: X and Z. But what about other hallmark quantum-information processing tasks?

Superdense coding20 can be implemented with just two rebits, given that it involves only measuring X, Z and iY, which are real observables. Remarkably, a hybrid system of qubits and rebits can also be used to perform teleportation.21 Specifically, one can use a qubit, initially prepared in the state \(\left| \psi \right\rangle \) to be teleported; two rebits, prepared in a Bell state, and another ancillary qubit, prepared in some blank state \(\left| 0 \right\rangle \). The crucial step in the teleportation protocol is to teleport the state to the second rebit, by using a Bell measurement on the composite system of the first qubit and the other rebit. When the state \(\left| \psi \right\rangle \) is teleported to the second rebit, its phases are inaccessible, as only real-valued operators can be measured. However, one can swap the state on the rebit with that of the other, ancillary, qubit. This is possible because the swap can be accomplished as a sequence of three CNOT gates in the Z basis operating between the rebit and the qubit. These are precisely the copying operations (i.e., measurements) that we assume to be allowed on the two subsystems. The readout of the teleported state is then performed by measurements on the qubit.

These hybrid systems of qubits and rebits bring out an interesting asymmetry between superdense coding and teleportation. They also display a crucial difference between swapping physically two systems (e.g., moving the two carriers of information in space) and performing the logical swap of their states (which can be realised by a sequence of three CNOT gates, operating on the quantum degrees of freedom of the carriers). These two operations are not the same for a system made of a qubit and a rebit; but they are the same for a system of two qubits. Since the logical swap might require different resources to the physical swap, in a qubit–rebit system the usual assumption about the swap being a trivial operation might not apply, thus leading to a different notion of what the elementary/complex operations are.

Our constructor-information-theoretic proposal radically changes the approach to arguments for quantisation, emancipating them from specific, narrow and overly constrained dynamical models. Our assumptions are far less restrictive, in that they do not rely on unitarity or reversibility. They are also cast in an elegant, exact information-theoretic form—exclusively in terms of the possibility/impossibility of certain copying tasks on the two sectors, which provides a more general notion of being “quantum”. This is achieved via the constructor theory of information, which provides a powerful approach to exploring the interaction of systems obeying different physical theories, even post-quantum. Our argument thus provides radically new foundations, tools and content for the debate about quantisation of a classical system interacting with a quantum system. These are applicable not only in the context of quantum gravity, but also in other fields where the quantum-classical boundaries need to be explored: from quantum information to electrodynamics; from cosmology to quantum thermodynamics.