Abstract
In a Bell experiment, it is natural to seek a causal account of correlations wherein only a common cause acts on the outcomes. For this causal structure, Bell inequality violations can be explained only if causal dependencies are modeled as intrinsically quantum. There also exists a vast landscape of causal structures beyond Bell that can witness nonclassicality, in some cases without even requiring free external inputs. Here, we undertake a photonic experiment realizing one such example: the triangle causal network, consisting of three measurement stations pairwise connected by common causes and no external inputs. To demonstrate the nonclassicality of the data, we adapt and improve three known techniques: (i) a machinelearningbased heuristic test, (ii) a dataseeded inflation technique generating polynomial Belltype inequalities and (iii) entropic inequalities. The demonstrated experimental and data analysis tools are broadly applicable paving the way for future networks of growing complexity.
Introduction
Bell’s theorem^{1}, more than any other result, elucidates the manner in which quantum theory necessitates a departure from a classical worldview^{2,3}. Recently, it has been realized that it can be understood as a nogo result for providing a satisfactory account of quantum correlations using a classical causal model^{4,5,6,7}. Under this reframing, violating a Bell inequality can be understood as attesting to the necessity of using an intrinsically quantum notion of a causal model to achieve a causal account of the correlations^{5,6,8,9,10,11,12,13}, and thus as witnessing nonclassicality. Furthermore, it becomes clear that such an analysis can be generalized to causal structures that are distinct from the Bell scenario^{5,14,15,16,17,18,19,20,21,22,23,24,25,26}.
Such generalizations are highly relevant to the problem of developing quantum technologies. In the context of the Bell scenario alone, the possibility of witnessing nonclassicality has applications ranging from quantum cryptography^{27} to selftesting^{28} and communication complexity problems^{29}, as well as deviceindependent information processing^{30,31}, where the processing can be accomplished while relaxing what needs to be known about the inner workings of the devices. Given that tasks such as these are also of interest in arbitrary quantum networks^{32,33,34}, which can have complex topologies, it is evident that there is a need for new dataanalysis tools appropriate for witnessing nonclassicality in generic causal structures (see review in ref. ^{25}). Moreover, so far, all the demonstrations of quantum nonlocality, in the Bell scenario (Fig. 1(a)) or in complex networks, relied on the use of external inputs, variables whose values can be freely chosen by the experimenter and which serve to switch between different measurement settings^{35,36,37,38,39}. The free choice of measurements lies at the basis of Bell’s theorem^{40} and in experimental demonstrations, this freedom has to be assumed, or at best made be as plausible as possible^{41,42}. By contrast, quantum networks with several independent sources allow the demonstration of nonclassicality without the need for external freely chosen inputs, replacing the freedom of choice assumption with the assumption of independence of the sources^{5,19,43,44,45}.
In spite of its significance, this challenge remains largely unexplored, especially from the experimental perspective. This work is a contribution to this effort. We undertake the experimental investigation of a causal structure that has attracted growing attention^{5,15,16,18,19,22,23,43,46,47,48,49,50,51,52,53,54,55}: the “triangle scenario”, depicted in Fig. 1(b). Here, three distant parties each receives a share from two out of three independent sources, and in stark contrast to the Bell scenario, each party implements a single measurement on the systems in its lab, rather than having the freedom to choose among a set of incompatible measurements.
Using a versatile photonic setup with three independent sources (one sharing entanglement and two sharing classical correlations) and the feedforward of classical information by means of fast optical switches, we provide the first experimental demonstration of classically unrealizable correlations in the triangle structure without the use of external inputs. Importantly, witnessing nonclassicality in this new kind of causal structure goes beyond the standard Bell inequality violation and requires a radically different approach. In the course of doing so, we have enhanced some of the existing tools for testing nonclassicality in generic causal structures both from the experimental and the theoretical perspectives. These enhancements are in the service of making the tools applicable to generic causal structures and arbitrary data, thus paving the way for future experiments in causal networks of growing size and complexity.
Results
Beyond Bell’s theorem
Leveraging Bell’s theorem, Fritz^{5} showed the existence of a distribution in the triangle scenario that is realizable quantumly but not classically. Fritz’s result is best understood as a quantum nogo theorem akin to Bell’s 1964 nogo theorem^{1} or the tripartite GreenbergerHornZeilinger (GHZ) argument^{56}. As with the distributions described in those works, Fritz’s distribution has the feature that certain variables are perfectly correlated, something that is predicted by quantum theory to be possible in principle, but which can never be realized in a real experiment given the unavoidable presence of noise.
It was Clauser, Horne, Shimony, and Holt (CHSH) who first demonstrated how to turn Bell’s argument into an experimental test, by deriving noiserobust inequalities^{57}. Similarly, in the tripartite Bell scenario (Fig. 1(c)), the step from the GHZ argument to the possibility of a noiserobust test was achieved by Mermin’s inequality^{58}. In the case of the triangle scenario, classical causal compatibility inequalities have also been derived^{48} but these unfortunately require a degree of sensitivity higher than can reasonably be achieved in current experimental tests. Note that the inequalities derived in ref. ^{51}, by contrast, are not noiserobust because they apply only to distributions exhibiting perfect correlations between certain variables, analogously to Bell’s 1964 inequality. New techniques are therefore required to witness nonclassicality in the triangle scenario for the sort of experimental data achievable at present.
Developing new dataanalysis techniques is also motivated by considerations of utility. If all one seeks to do is to demonstrate the existence of nonclassicality in a given causal structure, then it is clearly sufficient to implement a dedicated experiment that targets a specific distribution and to test an inequality that is known to be able to witness nonclassicality for the targeted distribution. If, on the other hand, one seeks to use nonclassicality in a given causal structure as a resource for various informationprocessing tasks, then it is clearly of greater utility to have a test that is able to witness nonclassicality for any distribution that is not classically realizable in the given causal structure.
In some cases, this higher bar can be met by determining all of the classical causal compatibility inequalities associated to a given causal structure and testing for violations of any of these^{2}. Unfortunately, however, such a complete characterization soon becomes out of reach, even for seemingly simple scenarios^{2,59}. In order to be able to witness nonclassicality on arbitrary data, therefore, it is better to seek a “satisfiability” algorithm, which takes as its input a concrete example of data, and answers the question of classical realizability for that data alone, and in the case of a negative answer, identifies an inequality that is optimized for witnessing its nonclassicality.
We here propose a dataseeded algorithm of this sort that can be used for a generic causal structure. This is achieved by leveraging the fact that the inflation technique for causal inference^{18} can reduce the satisfiability problem to a linear program. We also pursue a second route to witnessing nonclassicality on generic data. In this approach, one foregoes deriving inequalities altogether and one simply performs a statistical hypothesis test where the hypothesis is the compatibility of the data with a classical causal model for the given causal structure. Specifically, one implements a variation of the parameters of the model—some of which make explicit reference to the hidden (i.e., unobserved) variables—to try and find the best fit to the data, and one considers the hypothesis falsified at some level of confidence when no good fit can be found. We here show that such hypothesis testing on experimental data can be made feasible for causal networks using the machinelearning technique developed in ref. ^{52} where the topology of the causal network is mapped to the topology of a neural network. Finally, suitably mapping the triangle network to a generalization of Bell’s scenario that incorporates the possibility of measurement dependence (i.e., that abandons the free choice assumption), we also witness the nonclassicality of the data by using an entropic approach, recently introduced in ref. ^{44}.
Note that for the triangle scenario, our goal is to witness nonclassicality of the experimentally realized distribution assuming only that the causal relations among the three measurement nodes and sources are those described by the triangle scenario. If one were to avail oneself of additional assumptions, in particular, assumptions regarding the causal relations among variables within a given laboratory, then one could witness nonclassicality of our experimental data using standard Bell inequalities. Since such additional assumptions do not hold for all setups that can realize a distribution exhibiting a quantumclassical gap, an analysis, which leveraged these additional causal assumptions would not achieve the goal of being applicable to arbitrary data.
The causal modeling perspective on Bell’s theorem
Bell’s theorem can be seen as a particular instance of a causal inference problem where for a given hypothesis about the causal structure of the experiment, one inquires whether a classical causal model is able to reproduce the observations^{4,6}. In a Bell experiment, a source distributes physical systems between two distant observers—Alice and Bob—they choose the values of their setting variables, denoted by x and y, respectively (these determine which of a set of incompatible measurements is implemented at each lab), and then they register the outcomes, denoted by a and b, respectively. For simplicity here, we represent the variables and their values with the same letter. The natural causal structure to hypothesize in such an experiment is the one depicted in Fig. 1(a), termed the “Bell scenario”.
The assumption of a classical causal model implies that the observed distribution can be decomposed as
This decomposition is familiar in discussions of Bell’s theorem as what follows from assuming a hidden variable model satisfying local causality and certain other conditions^{60,61}, but it can also be understood as a simple consequence of the causal Markov condition^{62} under the assumption that the causal structure is that of the Bell scenario^{4,6}.
In turn, for a quantum causal model, sources of correlations are not copies of a variable λ that is probabilistically distributed but rather pairs of systems that are in a joint quantum state ρ (potentially entangled). Similarly, dependencies among nodes are not represented by conditional probabilities such as p(a∣x, λ) but by the quantum analogs thereof, completely positive and trace preserving (CPTP) maps, which, in the particular case of a measurement, correspond to a positive operatorvalued measure (POVM). Operationally, the quantum description is given by Born’s rule, implying that
where \({\{{M}_{ax}^{A}\}}_{a}\) and \({\{{M}_{by}^{B}\}}_{b}\) are POVMs on A and B, respectively.
Bell’s theorem^{1} asserts that the quantum description can lead to an observable distribution that fails to have a classical explanation in terms of the causal model (1).
The triangle scenario
Among the simplest quantum networks beyond the paradigmatic Bell causal structure is the triangle scenario of Fig. 1(b). It is distinguished from the tripartite Bell scenario (depicted in Fig. 1(c)) by the fact that the distant parties are not connected by a 3way source, but by three 2way sources.
In the triangle scenario, the correlations that admit a classical realization, i.e., those that are compatible with a classical causal model with the structure of Fig. 1(b), can be written as:
By contrast, the correlations which admit of a quantum realization in the triangle network are given by
where ρ_{AB} denotes the density operator of the state shared between the nodes {A, B} (likewise for ρ_{AC} and ρ_{BC}), while \({\{{M}_{a}^{A}\}}_{a}\) denotes a POVM on the subsystem in station A (similarly for \({\{{M}_{b}^{B}\}}_{b}\) and \({\{{M}_{c}^{C}\}}_{c}\)).
Recently, it has been theoretically and experimentally demonstrated that a quantum triangle network with a setting variable at each station can give rise to nonclassical correlations^{63}. This result, however, employs measurement choices for each of the observers. Here, we go a significant step beyond, showing that nonclassical correlations can emerge even without any freedom of choice.
The Fritz distribution
In Fritz’s example^{64} of a distribution p_{Q}(a, b, c) that is not classically realizable, a, b and c are 4valued variables, each of which is conceptualized as a pair of binary variables, a = (a_{0}, a_{1}), b = (b_{0}, b_{1}) and c = (c_{0}, c_{1}). Moreover, one can decompose the quantum system A as A = (A_{0}, A_{1}), where A_{0} is the subsystem appearing in ρ_{AC} and A_{1} is the subsystem appearing in ρ_{AB}; analogously for B = (B_{0}, B_{1}) and C = (C_{0}, C_{1}). The example is realized by taking the three POVMs in Eq. (4) to have the following form:
where \({\{{M}_{{c}_{0}}^{{C}_{0}}\}}_{{c}_{0}},{\{{M}_{{c}_{1}}^{{C}_{1}}\}}_{{c}_{1}},{\{{M}_{{a}_{0}}^{{A}_{0}}\}}_{{a}_{0}},{\{{M}_{{b}_{0}}^{{B}_{0}}\}}_{{b}_{0}}\) are all measurements of the σ_{z} Pauli observable, \({\{{M}_{{a}_{1}{a}_{0}}^{{A}_{1}}\}}_{{a}_{1}}\) corresponds to one of the two Pauli observables among {σ_{x}, σ_{z}} depending on the value of a_{0}, and \({\{{M}_{{b}_{1}{b}_{0}}^{{B}_{1}}\}}_{{b}_{1}}\) corresponds to one of the two observables among \(\{({\sigma }_{x}+{\sigma }_{z})/\sqrt{2},({\sigma }_{x}{\sigma }_{z})/\sqrt{2}\}\) depending on the value of b_{0}. In Fritz’s description of a genuinely quantum distribution in the triangle scenario, the state ρ_{AB} is taken to be, for example, a singlet state \(\left{\Psi }^{}\right\rangle=(\left01\right\rangle \left10\right\rangle )/\sqrt{2}\); while ρ_{AC} and ρ_{BC} are maximally entangled states \((\left00\right\rangle+\left11\right\rangle )/\sqrt{2}\). However, since all the measurements on ρ_{AC} and ρ_{BC} are of σ_{z}, it is sufficient to take these to be a classically correlated state, namely:
As noted in ref. ^{5}, to see that Fritz’s distribution is not classically realizable, it suffices to make a connection to a Bell scenario between Alice and Bob. Note that the variables a_{0} and b_{0} determine the measurements that are implemented on A_{1} and B_{1}. In this respect, they are akin to measurement settings x and y in the usual scenario. However, because a_{0} and b_{0} are outputs in the triangle scenario, they could in principle depend on the common source between Alice and Bob. In the usual Bell scenario, of course, if the setting variable x (or y) is correlated with λ_{AB}, one cannot derive the Bell inequalities. The assumption that x and y are not correlated with λ_{AB} is termed measurement independence (or freedom of choice) and is a consequence of the hypothesis that the causal structure for the usual Bell scenario is that of Fig. 1(a).
For the Fritz distribution in the triangle scenario, one can still infer that a_{0} and λ_{AB} are uncorrelated, but now this follows from the fact that a_{0} is perfectly correlated with the outcome c_{0}, which is causally disconnected from λ_{AB}. Similarly, the lack of correlation between b_{0} and λ_{AB} is inferred from the perfect correlation between b_{0} and c_{1} and the fact that c_{1} is causally disconnected from λ_{AB}. If one considers the conditional distribution p(a_{1}, b_{1}∣a_{0}, b_{0}) that is obtained by making the appropriate Bayesian inversion on a distribution p(a, b, c) that is classically realizable in the triangle scenario, then given the independence of a_{0} (and b_{0}) from λ_{AB}, this conditional distribution should satisfy the standard Bell inequalities. The fact that the measurements in Fritz’s example have been chosen to ensure that the conditional p_{Q}(a_{1}, b_{1}∣a_{0}, b_{0}) violates a standard Bell inequality implies that the distribution p_{Q}(a, b, c) is not classically realizable in the triangle scenario.
Any experiment that aims to realize the Fritz distribution in the triangle scenario has the goal of realizing the ideal states and measurements specified above, but due to the inevitability of noise, the states and measurements that are actually implemented are necessarily noisy versions of these. This implies that the correlations between a_{0} and c_{0} and between b_{0} and c_{1} will not be perfect, which in turn blocks the inference from the classical realizability of p(a, b, c) in the triangle scenario to the classical realizability of p(a_{1}, b_{1}∣a_{0}, b_{0}) in the standard Bell scenario. As such, to witness nonclassicality in such an experiment, one must go beyond the techniques that witness nonclassicality in a standard Bell experiment.
It is worth reiterating here a point made in the beginning of subsection “Beyond Bell’s theorem", that our goal is to witness nonclassicality using a dataanalysis technique that assumes only the causal structure of the triangle scenario. If we associate a laboratory with each of the nodes in the causal structure, then even though our particular experiment involves specific causal relations between systems within the laboratories, the data analysis cannot make use of this extra structure. In other words, we seek a dataanalysis technique that can witness nonclassicality without assuming any such extra structure. This is the sort of assumption that is appropriate for the deviceindependent paradigm, wherein the experimental devices are presumed to be supplied by an adversary. All that is presumed to be guaranteed is that the causal relations among the laboratories are the ones specified by the triangle scenario. If one could avail oneself of the extra structure that is present in the experiment but not part of the description of the triangle scenario, then standard Bell inequalities would be sufficient to witness nonclassicality. For instance, if one could assume that Alice’s output a_{0} was a faithful copy of the classical randomness she shares with Charlie and that Bob’s output b_{0} was a faithful copy of the classical randomness he shares with Charlie, then one could infer that neither a_{0} nor b_{0} could depend on Λ_{AB} and consequently having p(a_{1}, b_{1}∣a_{0}, b_{0}) violate a Bell inequality would be sufficient to witness nonclassicality. As a second example, if one could assume that the pair of variables c_{0} and c_{1} that are outputs of Charlie’s laboratory are such that c_{0} depends only on the source shared with Alice and c_{1} depends only on the source shared with Bob, then the causal structure being assumed is equivalent to a 4party linelike structure rather than a triangle scenario. In this case, the full set of Bell inequalities for the conditional distribution p(a, b∣c_{0}, c_{1}) (where a = (a_{0}, a_{1}) and b = (b_{0}, b_{1})) are the necessary and sufficient conditions for classicality^{65}.
In order to be able to witness the nonclassicality of our data assuming only the triangle causal structure, therefore, we cannot rely on standard Bell inequalities. This is why we must have recourse to new dataanalysis techniques, such as those presented in subsections “Bounding measurement dependence and violating an entropic inequality for the triangle network”, “Violation of a causal compatibility inequality” and “Bounding measurement dependence and violating an entropic inequality for the triangle network”.
Experimental setup
In our experimental implementation, we used the polarization degrees of freedom of a pair of photons as the two qubits distributed by the source shared between A and B, with the σ_{z} eigenstates corresponding to the \(\{\leftH\right\rangle,\leftV\right\rangle \}\) basis of linear polarization. We investigated quantum correlations arising in the triangle network where we aim to have the source between A and B prepare the singlet state. Meanwhile, for the source shared by A and C and the source shared by B and C, we aim to have these prepare the classically correlated state of Eq. (6).
Recent years have seen the first experimental implementations of causal structures with a number of independent sources^{63,66,67,68}. In our implementation, the pair of photons associated to the source between A and B are at a wavelength of 810 nm, and are generated through spontaneous parametric downconversion in a ppKTP nonlinear crystal pumped with a 405nm UV CWlaser, placed inside a Sagnac interferometric geometry^{69,70}, depicted in the box labeled ρ_{AB} in Fig. 2. To implement the classically correlated sources Λ_{AC} and Λ_{BC}, electrical pulses randomly generated by the shotnoise of distant pairs of singlephoton detectors are locally split (boxes labeled Λ_{AC} and Λ_{BC} in Fig. 2); then they are sent to the stations A, C and B, C, respectively, by means of 20mlong electrical cables. Detection of such signals gives values for the bits a_{0}, b_{0}, c_{0}, c_{1}.
Note that this electrical signal sets up classical correlations (i.e., shared randomness) between Charlie and Alice (Bob), and this is a faithful implementation of the state in Eq.(6).
Owing to the probabilistic nature of photon generation and random shotnoise events from detectors, justifying the independence of different sources turns out to be very demanding. This is the reason why the first experimental realization of quantum networks^{41,71,72} actually involved a single laser source, thereby requiring a devicedependent justification for the supposed independence of the generated quantum states that relies on the knowledge of the inner process of photon generation. Using spatially separated nonsynchronized sources, of different natures, enforces the independence of the sources, also having direct applications in quantum communication protocols. Note, however, that the independence of the sources still remains an assumption, considering that this assumption can always be violated by superdeterministic models^{73}.
To experimentally achieve the implementation of the separable measurement operators as in Eq. (5), the electrical signals arriving at A and B determine the state of ultrafast optical switches (Nano Speed UltraFast 1x2 by company Photonwares with a switching time equal to ~8ns) that affect the measurements on the photons coming from ρ_{AB}. More specifically, based on which one of the two signals arrives in A (B) from Λ_{AC} (Λ_{BC}), the switch will send the photon from ρ_{AB} to two fibers connected to the measurement setups implementing the different polarization measurements. The measurement of the photons is performed by polarization controllers defining the measurement basis followed by infiber polarizing beam splitters (PBS) and singlephoton detectors. Finally, the four detectors in A (B) are electronically connected to a timetodigital converter, located in the measurement station. The signal from the photon counting, together with the signal from source Λ_{AC} (Λ_{BC}) generate the 4valued outcome a (b). Conversely, in station C the 4valued outcome c is given by the two classical signals from Λ_{AC} and Λ_{BC}. Note that the electronic signals generated by the detectors are sent to three separated timetodigital converters, one for each measurement station A, B, C, and the recorded events are sent for data processing to a computer located outside the laboratory.
We record experimental events by first choosing a small window w_{1} ~ 4.1ns, to filter in the signals produced simultaneously from the same source Λ_{i}. This allows us to account mostly for 2fold events, which are due to the same entangled pair, or the same split signal, thus filtering out most of the experimental noise due to the detectors’ dark counts and residual environmental light. The 6fold coincidence events are finally computed by employing a time window equal to w_{2} ~ 20 μs inside which an event is defined by the arrival of three twofold coincidences (see Supplementary Note 1 for more details on data analysis). Such a choice of value for the 6fold coincidence window represents a compromise between two different requirements. On one side, we want to make such a window as narrow as possible to approximately achieve simultaneity, with respect to both the generation and the measurements, which in principle could lead to an implementation directly addressing the locality loophole. On the other, a broader window is necessary to detect a large enough number of 6fold coincidences, enhancing the events’ rate and thus leading to sufficiently small errors on the measured probabilities in smaller measurement times.
In this demonstration, we do not attempt to achieve spacelike separation between the registration of the outcomes a, b, and c. Achieving such a separation would provide the strongest possible justification for the lack of causal influences between the outcomes a, b and c. It is important to note, however, that it would still not justify the lack of a 3way common cause.
Furthermore, due to the low efficiencies of the singlephoton detectors (η ~ 0.5) and the fact that the threshold values required for closing the detector loophole in the triangle scenario are not yet known, we rely on the fairsampling assumption. On this point, we note that even for the much simpler case of the Bell scenario, closing the detector loophole required decades of effort.
Experimental results
As stated above, in order to realize the Fritz distribution, it is sufficient to share entanglement only between Alice and Bob’s measurement stations, since Alice and Charlie as well as Bob and Charlie can merely share classical correlations. Moreover, using such classical sources (in our case, a doubled electronic signal) makes it possible to experimentally achieve correlations between Alice and Charlie and between Bob and Charlie that can be almost perfect for the duration of the experiment. Recall that perfect correlation is required for the logic of Fritz’s argument to go through, but demonstrating perfect correlations can never be done in an experiment and, importantly, demonstrating nonclassicality in the triangle network in the manner described by Fritz would boil down to violating a standard Bell inequality (sometimes referred to as disguised network nonlocality^{74}). So, we did not use this approach here, as it is the goal of our work to introduce and validate dataanalysis techniques that would be applicable for any example of a quantumclassical gap in the triangle scenario, including gaps based on distributions that, unlike Fritz’s, could be noisy. Figure 3 provides a comparison between the theoretical Fritz distribution reported in panel 3a, obtainable with noiseless states and measurement operators, and the experimentally achieved one reported in panel 3b. The latter one was reconstructed from ~1.4 ⋅ 10^{6} events collected in ~10 h of data taking, achieving a 6fold coincidence rate of ~38.7 Hz (see Supplementary Note 2 for the complete distribution).
Even with our approach, employing ultrafast optical switches and classical correlations shared between A and C and between B and C, the measurement outcomes on the state Λ_{AC} are not perfectly correlated, nor those on Λ_{BC}, contrary to the ideal Fritz distribution: specifically, the probability of anticorrelation in each case is found to be p_{anticorr} = 3 ⋅ 10^{−5}. As argued, it is the practical impossibility of achieving perfect correlations, which necessitates implementing a hypothesis test for compatibility or a test of causal compatibility inequalities. In what follows, we will focus on three possible avenues: machinelearning techniques^{52,75,76}, the inflation method^{13,18,48,77} and finally, recently derived entropic inequalities^{44}.
Excluding the hypothesis of classicality with machine learning
We follow the approach in^{52}, the central idea of which is to encode the structure of the causal network under test in the topology of a neural network. Consider the triangle network with quaternary outputs as depicted in Fig. 1(b), where three sources λ_{AB}, λ_{BC}, and λ_{AC} send information to three parties, Alice, Bob, and Charlie, each receiving, respectively, the pairs (λ_{AB}, λ_{AC}), (λ_{AB}, λ_{BC}), and (λ_{AC}, λ_{BC}), as schematically shown in Fig. 4(a). After locally processing the inputs, they flag a number a, b, c ∈ {0, 1, 2, 3}, by sampling the probability distributions p(a∣λ_{AB}, λ_{AC}), p(b∣λ_{AB}, λ_{BC}) and p(c∣λ_{BC}, λ_{AC}), respectively. In the machinelearning algorithm, the input layers to the multilayer perceptrons (MLPs) are composed of the independent uniformly distributed random numbers in the unit interval, i.e., λ_{AB}, λ_{BC}, λ_{AC} ∈ [0, 1], with the restriction in the flow of information mirroring the causal structure of the triangle network: The Ablock of the hidden layer receives random numbers (λ_{AB}, λ_{AC}), the Bblock receives (λ_{AB}, λ_{BC}) and the Cblock receives (λ_{BC}, λ_{AC}). Therefore, individual inputs belong to \({{\mathbb{R}}}^{2}\) (i.e., they have length 2). For the training, we provide batches of (N_{batch}, 2) dimension for the corresponding MLP for each of the three blocks.
If a certain probability distribution p(a, b, c) is compatible with a classical causal model on the triangle causal structure, then a set of three independent neural networks mimicking the topology of the triangle should be able to reproduce the distribution. By numerically sampling over different values of the random numbers λ_{AB}, λ_{BC} and λ_{AC} one can construct the approximation \(\tilde{p}(a,b,c)\) by averaging the Cartesian product of the output conditional probabilities corresponding to each party. See Methods for more details.
In turn, if the distribution under test is nonclassical, the neural network will be unable to mimic the distribution perfectly, producing considerable errors. To quantify how much the machine model can approximate the target/experimental distribution, we employ the elementwise meansquare error (MSE), also termed as L2norm error, between p(a, b, c) and \(\tilde{p}(a,b,c)\). This is given by \({\mathsf{MSE}}=\frac{1}{64}p(a,b,c)\tilde{p}(a,b,c){}_{2}\) and can be understood as a measure of nonclassicality^{76}. By repeated iterations, the neural network can be optimized in order to minimize this distance, since it should be close to zero if the target distribution has a classical model that the machine manages to approximate. Clearly, however, even if the distribution is compatible with the triangle network, due to numerical precision and the finite size of the neural network, the distance will never be exactly zero. To address this issue, we mix our experimental probability p(a, b, c) with the flat distribution p_{I}(a, b, c) = 1/64, which is compatible with the triangle structure, so that the machine is asked to retrieve the best possible model for the mixed distribution \(\tilde{p}=v\,p+(1v){p}_{I}\). If p has no classical explanation, then we expect that, as one increases the weight v of p in the mixed distribution \(\tilde{p}\), there is a range of values wherein a classical model of \(\tilde{p}\) remains possible and MSE is very small, but that there exists a threshold value beyond which MSE begins to increase, and the machine cannot make an almost perfect approximation anymore.
As shown in Fig. 4(b), only below a certain threshold value around \({v}_{{{{{{{{\rm{crit}}}}}}}}}=1/\sqrt{2}\)^{52}, can the machine learn \(\tilde{p}\) while it fails to do so for higher values of v. This analysis gives a strong indication of the nonclassicality of p, but given that there is no guarantee that the machine finds the optimal parameters, it does not guarantee it. To overcome this limitation, in the following we present two alternative techniques.
Violation of a causal compatibility inequality
In order to demonstrate the nonclassicality of the experimental data relative to the triangle causal network, we seek to identify some causal inequalities, which must be satisfied by all distributions compatible with the classical triangle network but which are violated by our experimental statistics. To this end, we turn to the inflation technique for causal inference introduced in ref. ^{18}.
As detailed in the Methods, the inflation technique relates compatibility with a given causal structure \({{{{{{{\mathcal{G}}}}}}}}\) to feasibility of a linear program (LP). If the LP related to an inflation of \({{{{{{{\mathcal{G}}}}}}}}\) (see Fig. 5(a)) is found to be infeasible, then evidently p is incompatible with \({{{{{{{\mathcal{G}}}}}}}}\). In our case, \({{{{{{{\mathcal{G}}}}}}}}\) is taken to be the classical triangle scenario causal structure depicted in Fig. 1(b).
In the case of infeasibility, the algorithm returns an infeasibility witness, in the form of an inequality. In this way, we can find a causal compatibility inequality tailored to the specific experimental data we obtained. Using the second order inflation of the triangle network shown in Fig. 5(a), one can derive causal compatibility inequalities (satisfied by all trianglecompatible p(a, b, c)) of the form
where the y are real coefficients.
As further detailed in the Methods, the LP of the inflation technique may be specially adapted to yield elegant looking causal compatibility inequalities; namely, where sets of monomials are each associated to a single (i.e., uniform) coefficient. Working with such an adapted LP can be orders of magnitude less computationally demanding as compared to the unadapted LP. However, it may be the case that despite a given distribution leading to infeasibility in the unadapted primal LP, there may not exist any inequality with restricted coefficients capable of witnessing that fact. As such, one is motivated to carefully select a coefficient restriction, which matches the specifically targeted distribution: one should only impose that a pair of monomials should share a uniform coefficient in the inequality if the given distribution would lead to both monomials being evaluated to the same numerical value (within a small tolerance). One cannot impose arbitrary coefficient uniformity restrictions. The Methods contains an explanation for why certain special coefficient restrictions may be justifiable. We employed the ideal theoretical Fritz distribution as our guide when selecting our LP adaptation, rendering moot the selection of a numerical tolerance. We stress, however, that a theoretical guide is not a prerequisite for optimally adapting the inflation technique LP to witness the nonclassicality of experimental data: it is perfectly possible to isolate the nearsymmetries in the experimental data without the educated guess provided by a theoretical model.
The infeasibility witness obtained by the program for our data yields an inequality of the form of Eq. (7), which is violated by the experimental data by several standard deviations: in this way, we unambiguously demonstrate the emergence of nonclassicality in the triangle network, without relying on Bell’s theorem. We depict the particular coefficients \({y}_{{a}_{1}{b}_{1}{c}_{1}{a}_{2}{b}_{2}{c}_{2}}\) defining the inequality that we obtained from the adapted LP in Fig. 5(b). Denoting the value that the data gives for the lefthandside of this inequality by \({V}_{\exp }\), we obtain V_{exp} = − 0.02436 ± 0.00016 (using a 6fold coincidence window w_{2} ~ 20 μs), corresponding to a violation of the inequality by 152 standard deviations. In Fig. 6, we plot \({V}_{\exp }\) as a function of the choice of the 6fold coincidence window w_{2}. As expected, by increasing w_{2}, we increase the detection rate of 6fold events, in turn decreasing the statistical error on the computed value of V_{exp}, shown in the figure with the red shadowed area.
Bounding measurement dependence and violating an entropic inequality for the triangle network
Another approach that can be used to robustly demonstrate the nonclassicality of the generated data is to map the triangle network into a modification of the Bell scenario, in a similar way to Fritz’s original proof of nonclassicality in the triangle scenario. In this modification, any amount of measurement dependence is in principle allowed between the hidden variable and the measurement settings. Consequently, even though the scenario is related to Bell’s, the nonclassicality exhibited necessarily goes beyond that which one finds in Bell’s scenario because in the latter measurement dependence allows for a classical account of any correlations. Indeed, in a Bell scenario where causal influences between the source and the measurement settings are allowed, some amount of measurement independence has to be assumed in order to witness nonclassicality from the data^{60,78,79}, otherwise, any violation of a Bell inequality can be explained by classical local models^{80}.
In the modified scenario, one can use entropic inequalities to put an upper bound on the amount of measurement dependence, as demonstrated in ref. ^{44}. The modification can be understood as a twostep departure from Bell’s scenario. In the first step, depicted in Fig. 7(a), one allows there to be a common cause not only on the pair of outcome variables, but on all four of the observed variables, meaning that a given outcome variable shares a common cause with the setting variable at the opposite wing; this is a relaxation of the assumption of freedom of choice^{41,60,61,81}. In the second step, one introduces additional observed variables c_{0}c_{1} and a variable λ_{AC} that is a common cause to Alice’s setting and outcome (a_{0}, a_{1}) and c_{0}c_{1}, as well as a variable λ_{BC} that is a common cause to Bob’s setting and outcome (b_{0}, b_{1}) and c_{0}c_{1} (see Fig. 7(b)).
Referring to the directed acyclic graph (DAG) of the triangle network shown in Fig. 2, we map the measurement settings of the two stations A and B of the Bell scenario to the variables a_{0} and b_{0}, and the measurement outcomes are mapped to the variables a_{1} and b_{1}. It is clear, therefore, that if one lumps a_{0} and a_{1} together, and similarly for b_{0} and b_{!}, the modified Bell scenario can be seen to have the form of the triangle network.
In this modified Bell scenario, shown in Fig. 7(b), one can lower bound the measurement dependence, quantified via the mutual information I(λ_{AB}: a_{0}, b_{0}) between the source λ_{AB} and the measurement settings a_{0} and b_{0}, relating it with the violation of the CHSH inequality^{61,82}. Further, employing the entropic approach^{8,46,83,84}, this mutual information can also be upper bounded by an entropic function that involves only observable variables and so can be extracted directly from the experimental data. Combining both the upper and lower bounds on I(λ_{AB}: a_{0}, b_{0}), one arrives at a Bell inequality blending probabilities and entropies, the violation of which witnesses the nonclassicality of the data, irrespectively of any potential measurement dependence I(λ_{AB} : a_{0}, b_{0}) present in the experiment. This inequality is given by (see ref. ^{44} for the further details)
where S^{CHSH} is the standard CHSH quantity evaluated on \(P({a}_{1}{b}_{1}{a}_{0}{b}_{0})\left.\right)\)^{57}, and
with I(a_{0} : b_{0}: C) : = H(a_{0}, b_{0}, C) − H(a_{0}, b_{0}) − H(a_{0}, C) − H(b_{0}, C) + H(a_{0}) + H(b_{0}) + H(C) the tripartite mutual information and \(H(X)={\sum }_{x}p(x)\log p(x)\) the Shannon entropy relative to the variable X.
Using the experimental data in Fig. 3, we obtain a value \({{{{{{{{\mathcal{E}}}}}}}}}_{exp}=0.340\pm 0.001\), violating the bound of Eq. (8) by 340 standard deviations and thereby demonstrating nonclassicality.
Discussion
The triangle scenario has particular novelty as a means of witnessing nonclassicality insofar as there is no known way to obtain classical causal compatibility inequalities for it from standard Bell inequalities. This is in contrast to the two other causal structures distinct from the Bell scenario that have been experimentally investigated previously, namely, the instrumental scenario^{24} and the bilocality scenario^{66,68,71,72,85,86}. In the case of the instrumental scenario, it suffices to process the Bell inequalities by forcing equality between the value of the setting variable at one wing and the value of the outcome variable at the opposite wing^{87}. In the case of the bilocality scenario, by postselecting on the outcome of the measurement that accesses both sources, the other two measurements can be proven to satisfy the Bell inequalities in a classical causal model (via an analog of entanglementswapping)^{16,17}. Such shortcuts to deriving noiserobust causal compatibility inequalities, however, are not available in the triangle scenario.
Another peculiar aspect of the triangle scenario is the possibility to show new forms of nonclassicality that do not require the use of external inputs freely chosen by the experimenter, but instead rely on the assumption of independence of the sources, as shown by Fritz^{5}. In this work, we realized for the first time a triangle network without external inputs, proving the emergence of nonclassicality in this new regime, up to detection and locality loopholes. This has been possible by employing fast feedforward of measurement in an optical setup comprising an entangled photon source and two sources of classical correlations. In order to demonstrate the nonclassicality of the experimental data, we had to extend preexisting dataanalysis techniques, making them suitable to detect nonclassicality in noisy distributions.
The dataanalysis techniques we have presented here are also distinguished insofar as they have the capacity to witness nonclassicality for any distribution that might arise in an experiment, whereas previous experiments witnessing nonclassicality in causal structures beyond Bell have used tools that can only witness the nonclassicality of limited classes of target distributions. This approach thus extends dataseeded techniques previously limited to the standard bipartite Bell scenario^{2,3,88,89} to the realm of more complex causal networks.
The employed dataanalysis techniques and aspects of our photonic setup provide a scalable platform in which nonclassicality can be witnessed in networks of growing size and of arbitrary topology. In particular, the implemented measurements are based on local wirings, i.e., separable measurements with classical feedbacks, making the approach scalable. Furthermore, it is widely speculated that the triangle scenario may admit distributions, which imply a nogo result whose logic is entirely independent of that of Bell’s theorem^{19,90}. These are likely to require entangled measurements as well as three sources of entanglement, and consequently, integrating such measurements and sources into our setup may open the way to experimentally targeting distributions, which are thought to exhibit these new types of nonclassicality.
Finally, this work can also pave the way for future applications in quantum communications involving several sources and measurement stations.
Methods
Details on the machinelearning implementation
The number of samples we sum over, i.e., the batch size, is N_{batch} = 10,000. We decided to vary the architecture of the neural network using different number of layers (n_{layers} = [3, 4, 5, 6]) and number of neurons (n_{neurons} = [16, 32]), accounting for an assembly of 8 neural networks independently trained, in order to obtain better approximations by taking the minimum or the average of the predictions. The ensemble of networks also reduces the probability of being trapped in optimization local minima and enhances the relative expressive power of the method in comparison to a single architecture; see Fig. 4(a). As pointed out in ref. ^{52}, ideal values for parameters and hyperparameters vary for distinct triangle scenarios, therefore the strength of the ensemble approach also varies. The reader is referred to the Supplementary Note 3 for more specific details.
Details on the inflation technique
At its core, the inflation technique at n^{th} order shows that

IF: A distribution p is compatible with a given classical causal structure \({{{{{{{\mathcal{G}}}}}}}}\)

THEN: For the n^{th} order inflation graph \({{{{{{{\mathcal{G}}}}}}}}^{\prime}\) induced by \({{{{{{{\mathcal{G}}}}}}}}\) there must exist some larger distribution \(p{\prime}\) pertaining to the observable nodes in \({{{{{{{\mathcal{G}}}}}}}}^{\prime}\) such that

1.
\(p^{\prime}\) possesses certain symmetry properties related to automorphisms of \({{{{{{{\mathcal{G}}}}}}}}^{\prime}\), and

2.
the distribution p^{⊗n}—defined as n identical but independently distributed (I.I.D.) copies of p—arises as a marginal distribution of \(p^{\prime}\).

1.
These conditions implicitly define a linear program (LP). In the Supplementary Note 4, we elaborate on the required marginal symmetry properties, which must be satisfied by distributions compatible with the second order inflation graph depicted in Fig. 5(a).
Farkas’ duality lemma tells us how to extract a certificate of infeasibility whenever a LP is infeasible^{91}. Note that Farkas’ lemma applies to convex optimization in general^{92}; linear programming is just a special case. For the primal LP defined by second order inflation, the certificate of infeasibility is a dual vector y such that y ⋅ p^{⊗2} ≥ 0 holds for all instances of p^{⊗2}, which make the primal LP feasible. Given such a dual vector y, one certifies the infeasibility of \({p}_{{{{{{{{\rm{nonclassical}}}}}}}}}^{\otimes 2}\)—i.e., one certifies the incompatibility of p_{nonclassical} with a classical causal model with the structure \({{{{{{{\mathcal{G}}}}}}}}\)—whenever one finds that \({{{{{{{\boldsymbol{y}}}}}}}}\cdot {p}_{{{{{{{{\rm{nonclassical}}}}}}}}}^{\otimes 2} \, < \,0\). Hence, the certificate y yields a quadratic polynomial inequality satisfied by all distributions p, which are compatible with \({{{{{{{\mathcal{G}}}}}}}}\).
We employed the “hierarchy” version of inflation defined in ref. ^{77} due to its computationally efficient and dataagnostic implementation.
The second order inflation graph of the classical triangle network is depicted in Fig. 5(a), and the \(p^{\prime}\), which is posited to exist would pertain to the twelve observable random variables depicted in Fig. 5(a), namely {a^{(1)}, b^{(1)}, c^{(1)}, a^{(2)}, b^{(2)}, c^{(2)}, a^{(3)}, b^{(3)}, c^{(3)}, a^{(4)}, b^{(4)}, c^{(4)}}.
The LP implied by inflation is as follows. The condition for the existence of \(p^{\prime}\) can be understood as a collection of very many inequality constraints (every probability, which makes up \(p^{\prime}\) must be nonnegative) along with one equality constraint (the sum of all probabilities comprising \(p^{\prime}\) totals unity). The symmetry requirements of \(p^{\prime}\) can be understood as equality constraints relating the various probabilities comprising \(p^{\prime}\). Finally, the requirement that p^{⊗2} is a marginal of \(p^{\prime}\) can be understood as equating p^{⊗2} evaluated at a particular set of values for its arguments to a sum over all those probabilities of \(p^{\prime}\), which agree on these values. In other words, if p is compatible with \({{{{{{{\mathcal{G}}}}}}}}\), then some collection of equality and inequality constraints are simultaneously satisfiable; i.e., some LP should be feasible.
The Farkas infeasibility certificate of the LP defined by inflation constitutes quadratic inequalities, which are satisfied by all trianglecompatible distributions but violated by the nonclassical distribution whose triangleincompatibility is witnessed by inflation. See Supplementary Note 4 for an explicit walkthrough of the inflation technique in full detail.
Adapting polytope membership LPs to yield symmetric inequalities
It can be insightful to compare the LP defined by inflation to the more familiar LP associated with Bell nonlocality. In Bell nonlocality, a family of conditional probability distributions (a.k.a. a “correlation”) is said to admit a local hidden variable model (LHVM) if and only if corresponding vector of all conditional probabilities lies within the local polytope. When a correlation does not admit a LHVM explanation, then we can always find a separating hyperplane (typically a facet of the local polytope) such that the vector of conditional probabilities associated with the given correlation lies strictly to one side of the hyperplane whereas all LHVMexplainable correlations correspond to vectors of conditional probabilities in or on the other side of the hyperplane. Thus, hyperplanes that distinguish all LHVMexplainable vectors from some other are equivalent to Bell inequalities; these hyperplanes, which correspond to facets of the local polytope are equivalent to facetdefining Bell inequalities.
The picture is quite similar when thinking about the LP associated with inflation. Instead of vectors of conditional probabilities, however, we are considering vectors whose elements are products of unconditional probabilities, i.e., vectors of probability monomials. The LP of inflation similarly defines a polytope: a vector of monomials is in the polytope iff the primal LP is feasible; the objective of the dual LP is to return a separating hyperplane such that

1.
the given vector of monomials is as far from the hyperplane as possible, and

2.
such that all vectors, which would make the primal LP feasible lie on or on the other side of the hyperplane.
Without loss of generality, a polytope may be defined in terms of its extremal points. Let M^{d,n} be a d × n matrix whose n columns correspond to the extremal points of the polytope, each of which is a vector in dimension d, and where we have introduced a notation of marking an object’s dimension in superscript for pedagogical clarity in what follows. A vector v^{d} lies withing the polytope (technically, the LP formulations here apply to both bounded polytopes and unbounded polycones) if and only if
We can relax the satisfiability LP of Eq. (10) into an optimization problem, which measures the degree of primal infeasibility. One natural measure of the infeasibility of Eq. (10) is defined by the following optimization problem:
Note that if the LP of Eq. (10) can be satisfied, then the objective in Eq. (11) can be reach up to 0; conversely, if the objective in Eq. (11) is strictly negative over all variables, which satisfy that LP’s conditions, then the LP in Eq. (10) is evidently infeasible. The formal dual to the above LP can then be used to extract optimal separating hyperplanes. The astute reader may notice that even the reformulated LP as given in Eq. (11) may not always be feasible; it can only be satisfied if v^{d} is wholly in the linear span of the columns of M^{d,n}. If v^{d} has some component orthogonal to that linear span, then the primal formulation in Eq. (11) is infeasible and the dual formulation in Eq. (12) is unbounded. See Appendix B of ref. ^{93} for alternative relaxations of an LP satisfiability problem into an optimization problem, and the connection therein to distance measures such as robustness and nonlocal fraction. Namely,
Indeed, the weak duality theorem in linear programming ensures that regardless of the feasibility of Eq. (10), it holds that for every y^{d} satisfying the condition of Eq. (12) and every x^{n}, s^{n} satisfying the conditions of Eq. (11), it is always the case that \({y}^{d}\cdot {v}^{d}\ge {{\mathbb{1}}}^{n}\cdot {s}^{n}\). So, if any y^{d} can be found satisfying the condition of Eq. (12) such that y^{d} ⋅ v^{d} ≤ 0, this serves as a certificate of the infeasibility of Eq. (10).
Now, the matrix M^{d,n}, which defines the polytope may exhibit inherent symmetries. An inherent symmetry of a matrix is a pair of permutation operations \({\pi }_{{{{{{{{\rm{row}}}}}}}}}^{d,d}\) and \({\pi }_{{{{{{{{\rm{col}}}}}}}}}^{n,n}\), acting, respectively, on the row space and column space of the matrix, such that if both the row permutation and the column permutation are performed the matrix is invariant. That is,
Whenever such an inherent symmetry can be identified, it can be used to transform feasible solutions of both the primal and dual formulations into new solutions: Suppose we have a collection of vectors v^{d}, y^{d}, x^{n}, s^{n} such that all of the conditions of both Eq. (11) and Eq. (12) are satisfied. Then, acting on all the vectors with the inherent symmetry leads to a new solution pair to both the primal and dual LP formulations, with the same duality gap (if any). Accordingly, we have that the symmetrized inequality \({{y}^{{\prime} }}^{d}\ge 0\) where \({{y}^{{\prime} }}^{d}:=\frac{{y}^{d}+{\pi }_{{{{{{{{\rm{row}}}}}}}}}^{d,d}\cdot {y}^{d}}{2}\) is also a valid inequality. When y^{d} is an optimal solution to the dual LP in Eq. (12), then symmetrized inequality \({{y}^{{\prime} }}^{d}\) is also optimal if v^{d} is invariant under the inherent symmetry operation \({\pi }_{{{{{{{{\rm{row}}}}}}}}}^{d,d}\).
This is what allows us to restrict the coefficients of the separating hyperplanes. Suppose we find a bunch of different inherent symmetries of the matrix, which defines the polytope; these can be used to construct a group with welldefined actions on both the row and column spaces. We can then twirl the matrix with respect to this group: We collect columns that map to each other under the group action, and replace each orbit of columns with a single new column given by the mean of the orbit. We do the same to the rows. This twirling operation thus yields a substantially smaller matrix, say, \({{M}^{{\prime} }}^{{d}^{{\prime} },{n}^{{\prime} }}\). Given a vector v^{d} in the row space of the matrix, we can apply the same twirling to obtain \({{v}^{{\prime} }}^{{d}^{{\prime} }}\), essentially projecting the vector to the symmetric subspace of the group. We now can obtain a separating hyperplane \({{y}^{{\prime} }}^{{d}^{{\prime} }}\) by applying the dual formulation of the LP in this symmetric subspace. To convert this hyperplane in the symmetric subspace to a hyperplane in the full row space we detwirl: namely, each row in a given orbit is uniformly associated with the coefficient of that orbit in the symmetric subspace.
There is no loss of generality whatsoever in using this symmetryadapted version of the LP if the target vector v^{d} is also invariant under the group. So, in general, the most efficient way to exploit inherent symmetries in linear programming is to identify the largest symmetry group (acting on both row and column spaces), which leaves both M^{d,n} and v^{d} invariant.
For more information regarding exploiting symmetry in linear programming see refs. ^{94,95,96,97}.
Robustness to noise added by varying 2fold coincidence window
We study the behavior of the nonlocality tests over the addition of noise due to the enlargement of the twofold coincidence window w_{1}. Increasing such a window causes the increase of accidental counts, affecting both the events from the entangled source and those relative to classically correlated signals. From a practical point of view, such noise acts substantially as a white noise on the correlations, that is event pairs, which are uniformly and randomly distributed. Considering such effects, we do expect that at some point, increasing the noise, our witnesses will not be able to detect a nonclassical behavior anymore. This is, in fact, the case. We show the curve of the violation of the inequality, Eq. (7) and Fig. 5(b) inferred by means of the inflation technique, as a function of the 2fold window w_{1} in Fig. 8. The same study is performed with the value of the violation of the entropic inequality in Eq. (8) as shown in Fig. 9.
Data availability
The data that support the findings of this study are available in the Supplementary Information and from the corresponding author upon request.
Code availability
All the custom code developed for this study is available from the corresponding author upon request.
References
Bell, J. S. On the Einstein Podolsky Rosen paradox. Phys. Phys. Fiz. 1, 195 (1964).
Brunner, N., Cavalcanti, D., Pironio, S., Scarani, V. & Wehner, S. Bell nonlocality. Rev. Mod. Phys. 86, 419 (2014).
Scarani, V. Bell Nonlocality (Oxford University Press, 2019).
Wood, C. J. & Spekkens, R. W. The lesson of causal discovery algorithms for quantum correlations Causal explanations of Bellinequality violations require finetuning. New J. Phys. 17, 033002 (2015).
Fritz, T. Beyond Bell’s theorem II: scenarios with arbitrary causal structure. Comm. Math. Phys. 341, 391 (2016).
Wiseman, H. M. & Cavalcanti, E. G. In Quantum [Un] Speakables II. 119–142 (Springer, 2017).
Schmid, D., Selby, J. H., & Spekkens, R. W. Unscrambling the omelette of causation and inference: the framework of causalinferential theories. https://arxiv.org/abs/2009.03297 (2020).
Chaves, R., Majenz, C. & Gross, D. Information–theoretic implications of quantum causal structures. Nat. Commun. 6, 1 (2015).
Cavalcanti, E. G. & Lal, R. On modifications of Reichenbach’s principle of common cause in light of Bell’s theorem. J. Phys. A. 47, 424018 (2014).
Costa, F. & Shrapnel, S. Quantum causal modelling. New J. Phys. 18, 063032 (2016).
Allen, J.M. A., Barrett, J., Horsman, D. C., Lee, C. M. & Spekkens, R. W. Quantum common causes and quantum causal models. Phys. Rev. X 7, 031021 (2017).
Barrett, J., Lorenz, R. & Oreshkov, O. Quantum causal models. https://arxiv.org/abs/1906.10726 (2019).
Wolfe, E. et al. Quantum inflation: a general approach to quantum causal compatibility. Phys. Rev. X 11, 021043 (2021).
Yurke, B. & Stoler, D. EinsteinPodolskyRosen effects from independent particle sources. Phys. Rev. Lett. 68, 1251 (1992).
Henson, J., Lal, R. & Pusey, M. F. Theoryindependent limits on correlations from generalized Bayesian networks. New J. Phys. 16, 113043 (2014).
Branciard, C., Rosset, D., Gisin, N. & Pironio, S. Bilocal versus nonbilocal correlations in entanglementswapping experiments. Phys. Rev. A 85, 032119 (2012).
Branciard, C., Gisin, N. & Pironio, S. Characterizing the nonlocal correlations created via entanglement swapping. Phys. Rev. Lett. 104, 170401 (2010).
Wolfe, E, Spekkens, R. W. & Fritz, T. The inflation technique for causal inference with latent variables. J. Causal Inference 7 https://arxiv.org/abs/1609.00672 (2019).
Renou, M.O. et al. Genuine quantum nonlocality in the triangle network. Phys. Rev. Lett. 123, 140401 (2019).
PozasKerstjens, A. et al. Bounding the sets of classical and quantum correlations in networks. Phys. Rev. Lett. 123, 140503 (2019).
Navascues, M., Wolfe, E., Rosset, D. & PozasKerstjens, A. Genuine network multipartite entanglement. Phys. Rev. Lett. 125, 240505 (2020).
Kela, A., Von Prillwitz, K., Åberg, J., Chaves, R. & Gross, D. Semidefinite tests for latent causal structures. IEEE Trans. Info. Theo. 66, 339 (2020).
Gisin, N. Entanglement 25 years after quantum teleportation testing joint measurements in quantum networks. Entropy 21, 325 (2019).
Chaves, R. et al. Quantum violation of an instrumental test. Nat. Phys. 14, 291 (2018).
Tavakoli, A. et al. Bell nonlocality in networks. Rep. Prog. Phys. https://arxiv.org/abs/2104.10700 (2021).
Gebhart, V., Pezzè, L. & Smerzi, A. Genuine multipartite nonlocality with causaldiagram postselection. Phys. Rev. Lett. 127, 140401 (2021).
Pirandola, S. et al. Advances in quantum cryptography. Adv. Opt. Photon. 12, 1012–1236 (2020).
Šupić, I. & Bowles, J. Selftesting of quantum systems: a review. Quantum 4, 337 (2020).
Brukner, Č., Żukowski, M., Pan, J.W. & Zeilinger, A. Bell’s inequalities and quantum communication complexity. Phys. Rev. Lett. 92, 127901 (2004).
Acín, A. et al. Deviceindependent security of quantum cryptography against collective attacks. Phys. Rev. Lett. 98, 230501 (2007).
Acín, A. & Masanes, L. Certified randomness in quantum physics. Nature 540, 213 (2016).
Wehner, S., Elkouss, D. & Hanson, R. Quantum internet: a vision for the road ahead. Science 362, eaam9288 (2018).
Kimble, H. J. The quantum internet. Nature 453, 1023 (2008).
Briegel, H.J., Dür, W., Cirac, J. I. & Zoller, P. Quantum repeaters: the role of imperfect local operations in quantum communication. Phys. Rev. Lett. 81, 5932 (1998).
Scheidl, T. et al. Violation of local realism with freedom of choice. Proc. Natl. Acad. Sci. USA 107, 19708 (2010).
Weihs, G., Jennewein, T., Simon, C., Weinfurter, H. & Zeilinger, A. Violation of Bell’s inequality under strict Einstein locality conditions. Phys. Rev. Lett. 81, 5039 (1998).
Shalm, L. K. et al. Strong loopholefree test of local realism. Phys. Rev. Lett. 115, 250402 (2015).
Giustina, M. et al. Significantloopholefree test of Bell’s theorem with entangled photons. Phys. Rev. Lett. 115, 250401 (2015).
Hensen, B. et al. Loopholefree Bell inequality violation using electron spins separated by 1.3 kilometres. Nature 526, 682 (2015).
Hooft, G. The freewill postulate in quantum mechanics. https://doi.org/10.48550/arXiv.quantph/0701097 (2007).
BIG Bell Test Collaboration and others. Challenging local realism with human choices. Nature 557, 212 (2018).
Rauch, D. et al. Cosmic Bell test using random measurement settings from highredshift quasars. Phys. Rev. Lett. 121, 080403 (2018).
Abiuso, P. et al. Singlephoton nonlocality in quantum networks. Phys. Rev. Res. 4, L012041 (2022).
Chaves, R. et al. Causal networks and freedom of choice in bell’s theorem. PRX Quantum 2, 040323 (2021).
Boreiri, S. et al. Towards a minimal example of quantum nonlocality without inputs, https://arxiv.org/abs/2207.08532 (2022).
Chaves, R., Luft, L. & Gross, D. Causal structures from entropic information geometry and novel scenarios. New J. Phys. 16, 043001 (2014).
Steudel, B. & Ay, N. Informationtheoretic inference of common ancestors. Entropy 17, 2304 (2015).
Fraser, T. C. & Wolfe, E. Causal compatibility inequalities admitting quantum violations in the triangle structure. Phys. Rev. A 98, 022113 (2018).
Pusey, M. F. Quantum correlations take a new shape. Physics 12, 113043 (2019).
Kraft, Tristan, et al. Quantum entanglement in the triangle network. Phys. Rev. A 103, L060401 (2021).
Šupić, I., Bancal, J.D. & Brunner, N. Quantum nonlocality in networks can be demonstrated with an arbitrarily small level of independence between the sources. Phys. Rev. Lett. 125, 240403 (2020).
Kriváchy, T. et al. A neural network oracle for quantum nonlocality problems in networks. NPJ Quant. Inf. 6, 70 (2020).
Renou, M.O. et al. Limits on correlations in networks for quantum and nosignaling resources. Phys. Rev. Lett. 123, 070403 (2019).
Bäumer, E., Gisin, N. & Tavakoli, A. Demonstrating the power of quantum computers, certification of highly entangled measurements and scalable quantum nonlocality. npj Quantum Information 7, https://doi.org/10.1038/s4153402100450x (2021).
Sekatski, P, Boreiri, S. & Brunner, N. Partial selftesting and randomness certification in the triangle network. https://arxiv.org/abs/2209.09921 (2022).
Greenberger, D. M., Horne, M. A. & Zeilinger, A. In Bell’s Theorem, Quantum theory and Conceptions of the Universe. 69–72 (Springer, 1989).
Clauser, J. F., Horne, M. A., Shimony, A. & Holt, R. A. Proposed experiment to test local hiddenvariable theories. Phys. Rev. Lett. 23, 880 (1969).
Mermin, N. D. Extreme quantum entanglement in a superposition of macroscopically distinct states. Phys. Rev. Lett. 65, 1838 (1990).
Geiger, D. & Meek, C. Quantifier elimination for statistical problems, in Proc. 15th Conf. on Uncertainty in Artificial Intelligence. 226–235 (Morgan Kaufmann Publishers Inc., 1999).
Hall, M. J. Relaxed bell inequalities and kochenspecker theorems. Phys. Rev. A 84, 022102 (2011).
Chaves, R., Kueng, R., Brask, J. B. & Gross, D. Unifying framework for relaxations of the causal assumptions in Bell’s theorem. Phys. Rev. Lett. 114, 140403 (2015).
Pearl, J. Causality (Cambridge University Press, 2009).
Suprano, A. et al. Experimental genuine tripartite nonlocality in a quantum triangle network. PRX Quantum 3, 030342 (2022).
Fritz, T. Beyond Bell’s theorem: correlation scenarios. New J. Phys. 14, 103001 (2012).
Evans, R. J. Graphs for margins of Bayesian networks. Scand. J. Stat. 43, 625 (2016).
Sun, Q.C. et al. Experimental demonstration of nonbilocality with truly independent sources and strict locality constraints. Nat. Photon. 13, 687 (2019).
Poderini, D. et al. Experimental violation of nlocality in a star quantum network. Nat. Commun. 11, 1 (2020).
Carvacho, G. et al. Quantum violation of local causality in an urban network using hybrid photonic technologies. Optica 9, 572 (2022).
Kim, T., Fiorentino, M. & Wong, F. N. Phasestable source of polarizationentangled photons using a polarization Sagnac interferometer. Phys. Rev. A 73, 012316 (2006).
Fedrizzi, A., Herbst, T., Poppe, A., Jennewein, T. & Zeilinger, A. A wavelengthtunable fibercoupled source of narrowband entangled photons. Opt. Express 15, 15377 (2007).
Carvacho, G. et al. Experimental violation of local causality in a quantum network. Nat. Commun. 8, 1 (2017).
Saunders, D. J., Bennet, A. J., Branciard, C. & Pryde, G. J. Experimental demonstration of nonbilocal quantum correlations. Sci. Adv. 3, e1602743 (2017).
Hossenfelder, S. & Palmer, T. Rethinking superdeterminism. Front. Phys. 8, 139 (2020).
Tavakoli, A., Skrzypczyk, P., Cavalcanti, D. & Acín, A. Nonlocal correlations in the starnetwork configuration. Phys. Rev. A 90, 062109 (2014).
Bharti, K., Haug, T., Vedral, V. & Kwek, L.C. Machine learning meets quantum foundations: a brief survey. AVS Quant. Sci. 2, 034101 (2020).
Canabarro, A., Brito, S. & Chaves, R. Machine learning nonlocal correlations. Phys. Rev. Lett. 122, 200401 (2019).
Navascués, M. & Wolfe, E. The inflation technique completely solves the causal compatibility problem. J. Causal Inference 8, 70 (2020).
Barrett, J. & Gisin, N. How much measurement independence is needed to demonstrate nonlocality? Phys. Rev. Lett. 106, 100406 (2011).
Putz, G., Rosset, D., Barnea, T. J., Liang, Y.C. & Gisin, N. Arbitrarily small amount of measurement independence is sufficient to manifest quantum nonlocality. Phys. Rev. Lett. 113, 190402 (2014).
Brans, C. H. Bell’s theorem does not eliminate fully causal hidden variables. Int. J. Theor. Phys. 27, 219 (1988).
Hall, M. J. Local deterministic model of singlet state correlations based on relaxing measurement independence. Phys. Rev. Lett. 105, 250404 (2010).
Hall, M. J. & Branciard, C. Measurementdependence cost for Bell nonlocality Causal versus retrocausal models. Phys. Rev. A 102, 052228 (2020).
Chaves, R. et al. Inferring latent structures via information inequalities. https://arxiv.org/abs/1407.2256 (2014).
Fritz, T. & Chaves, R. Entropic inequalities and marginal problems. IEEE Trans. Info. Theo. 59, 803 (2012).
Li, Z.D. et al. Testing real quantum theory in an optical quantum network. Phys. Rev. Lett. 128, 040402 (2022).
Wu, D. et al. Experimental refutation of realvalued quantum mechanics under strict locality conditions. Phys. Rev. Lett. 129, 140401 (2022).
Van Himbeeck, T. et al. Quantum violations in the Instrumental scenario and their relations to the Bell scenario. Quantum 3, 186 (2019).
Elliott, M. B. A linear program for testing local realism. https://arxiv.org/abs/0905.2950 (2009).
Zhang, Y., Glancy, S. & Knill, E. Asymptotically optimal data analysis for rejecting local realism. Phys. Rev. A 84, 062118 (2011).
Gisin, N. et al. Constraints on nonlocality in networks from nosignaling and independence. Nat. Commun. 11, 2378 (2020).
Andersen, E. D. Certificates of primal or dual infeasibility in linear programming. Comp. Optim. Appl. 20, 171 (2001).
Dinh, N. & Jeyakumar, V. Farkas’ lemma: three decades of generalizations for mathematical optimization. TOP 22, 1 (2014).
Cao, H. et al. Experimental demonstration that no tripartitenonlocal causal theory explains nature’s correlations. Phys. Rev. Lett. 129, 150402 (2022).
Bancal, J.D., Gisin, N. & Pironio, S. Looking for symmetric bell inequalities. J. Phys. A. 43, 385303 (2010).
Bremner, D., Sikiric, M. D. & Schuermann, A. Polyhedral representation conversion up to symmetries. CRM proceedings. 48 (2009).
Lörwald, S. & Reinelt, G. Panda: a software for polyhedral transformations. EURO J. Comput. Optim. 3, 297–308 (2015).
Ioannou, M. & Rosset, D. Noncommutative polynomial optimization under symmetry. https://arxiv.org/abs/2112.10803 (2021).
Acknowledgements
The authors thank Tamás Kriváchy for discussions about the ML implementation. This work was supported by The John Templeton Foundation via the grant QCAUSAL No 61084, via The Quantum Information Structure of Spacetime (QISS) Project (qiss.fr) (the opinions expressed in this publication are those of the author(s) and do not necessarily reflect the views of the John Templeton Foundation) Grant Agreement No. 61466 and via QISS2 Grant Agreement No. 62312, by MIUR via PRIN 2017 (Progetto di Ricerca di Interesse Nazionale): project QUSHIP (2017SRNBRK), by the Regione Lazio programme “Progetti di Gruppi di ricerca” legge Regionale n. 13/2008 (SINFONIA project, prot. n. 85201715200) via LazioInnova spa and by the ERC Advanced Grant QUBOSS (Grant agreement no. 884676). RC and AC acknowledge the Serrapilheira Institute (Grant No. Serra170815763), the Brazilian National Council for Scientific and Technological Development (CNPq) via the National Institute for Science and Technology on Quantum Information (INCTIQ) and Grants 307295/20206 and No. 311375/20200, the Brazilian agencies MCTIC and MEC. Research at Perimeter Institute is supported in part by the Government of Canada through the Department of Innovation, Science and Industry Canada and by the Province of Ontario through the Ministry of Colleges and Universities.
Author information
Authors and Affiliations
Contributions
E.P., D.P., G.R., I.A., G.C., F.S., E.W., R.S., A.C., and R.C. developed the project, E.P., D.P., I.A., A.S., G. Milani, G.C., and F.S. devised the experiment; E.P., D.P., G.R., I.A., A.S., G. Milani, G.C., and F.S. performed the experiment; E.P., D.P., G.R., I.A., A.S., G.C., F.S., E.W., R.S., A.C., G. Moreno, and R.C. performed dataanalysis and modeling; E.W. and R.S. developed the theoretical tools of the inflation technique; A.C., G. Moreno, and R.C. developed the theoretical tools of machine learning; all authors discussed the results and contributed to the writing of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Yanbao Zhang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Polino, E., Poderini, D., Rodari, G. et al. Experimental nonclassicality in a causal network without assuming freedom of choice. Nat Commun 14, 909 (2023). https://doi.org/10.1038/s4146702336428w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s4146702336428w
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.