## Introduction

Quantum shot noise represents a hard limit for the precision of all sensors that do not harness quantum resources. Photonic quantum metrology1 promises to surpass the shot-noise limit (SNL, $\mathrm{\Delta }\varphi \sim 1/\surd N$) by using quantum states of light that exhibit entanglement,2 discord3 or squeezing4 to suppress statistical fluctuation in optical-phase estimation. This will ultimately enable greater precision in measuring experimental quantities of interest, such as distance, birefringence, angle or sample concentration. Metrology in the low-photon-flux regime is pursued to gain maximal information while minimising detrimental effects from probe light—for example, in biological sensing.5,6 This is complementary to the objectives of gravity-wave astronomy schemes that use squeezed light and require high circulating laser power (watts to kilowatts) in km-scale interferometry for sub-SNL performance.7

A widespread objective of quantum metrology has been to engineer instances of ‘NOON’ states,2,812 which are path-entangled states of N photons across two modes $\frac{1}{\surd 2}\left(|N〉|0〉+|0〉|N〉\right)$. They offer both super-resolution (N-fold decrease in fringe period) and supersensitivity (enhanced precision towards the Heisenberg limit—Δϕ~1/N), and they are the optimal state for low-flux sensing in the lossless regime. The current record in size of NOON-like states is five photons using post-selection13 and four photons using ancillary-photon detection.14 Key components for this architecture have been demonstrated in integrated optics, including state generation and manipulation,15 micro-fluidics5 and photon detection.16 Ultimately, this could enable practical deployment of quantum-enhanced sensors outside of the quantum optics laboratory. However, to date, unavoidable optical loss hampers quantum advantage and can actually lead to worse precision than by just using a bright laser.2 Moreover, loss will be present in any practical scenario, including absorbance in measured samples and non-unit efficiency detectors. Consequently, revised scaling laws of precision with photon flux have been derived,17 along with optimised superposition states of fixed photon number, numerically for small photon number18,19 and analytically for large.17

Here, we demonstrate the underpinning principles of a practical—loss-tolerant—scheme for sub-shot-noise interferometry, illustrated in Figure 1. This scheme is designed to use the full four-mode multi-photon state naturally occurring in a non-linear optical process known as type-II spontaneous parametric downconversion (SPDC), which generates a coherent superposition of correlated photon-number states.2022 This scheme can exhibit a remarkably high tolerance to loss by harnessing all detected photon-number correlations arising from the superposition of all generated photon-number components in the generalised singlet state. We greatly simplify the theoretical analysis of a prior version proposed in ref. 23 using the positive-operator-valued measurement (POVM) formalism; this enables application of arbitrary photon-number-counting methods, including the multiplexed detection scheme used in our experiment here. The scheme can tolerate up to 40% total loss using photon counting without post-selection (Figure 2). To motivate developing such a photon-counting-based system, the type-II SPDC state can be studied in terms of photon-number measurement outcomes in isolation: n-photon detection events can be registered from any photon-number component of the entire multi-photon state n, under all loss. Here, we measure the fourfold detection events arising from all SPDC contributions of four or more photons, which is the smallest non-trivial measurement subset that displays the singlet behaviour with more than one photon in the sensing interferometer of the scheme (and thereby capable of exhibiting quantum advantage). When accounting for all photons passing through the phase shift ϕ in our experiment, the total loss in our experiment of 83% prohibits us from observing precision that surpasses the SNL. However, as evidence of the potential of this scheme to provide a quantum advantage, we investigate experimentally the correlations of the subset of fourfold detection events, which in principle enable a quantum advantage of up to 45% in the mean-squared error for estimating phase for our measured experimental parameters, and we find that the dynamics we observe is in good agreement with theoretical prediction.

Our approach addresses serious challenges faced by discrete-variable photon-counting schemes that are designed to operate with a deterministically generated or heralded fixed number of photons. The only system that has demonstrated quantum interference of more than two photons is SPDC, and complex experimental techniques comprising optical delay, fast switching and auxiliary photon detection are required to generate fixed photon-number states. This includes approaches to loss-tolerant quantum metrology, such as Holland and Burnett states.2427 To perform experiments with n photons, post-selection is commonly used to ignore components of fewer photons (<n), whereas terms associated with higher-photon number (>n) are treated as noise. This is particularly problematic for quantum metrology, where all photons passing through the sample need to be accounted for, and unwanted photon-number components are detrimental to measurement precision. In contrast, for the scheme presented here, higher-photon terms contribute to enhancement of phase sensitivity of the fourfold detection events.

### The Scheme: metrology using type-II SPDC

Our demonstration (Figure 1b) can be treated in three stages: (i) a source of multi-photon entangled light, (ii) unitary rotation with an unknown parameter ϕ to be estimated on the sensing path a, equivalent to interferometry and (iii) correlated photon-counting measurement.

(i) The source is based on pulsed non-colinear type-II SPDC20,21, which generates entanglement across four modes—two spatial paths (a, b) and two polarisations (h,v) (see Materials and methods). In the ideal case, for which all experimental imperfections are neglected, the state generated is a four-mode squeezed state23 that is the superposition of photon-number states that are symmetrical across a and b

$\begin{array}{}\text{(1)}& |\mathrm{P}\mathrm{D}\mathrm{C}〉\propto {\sum }_{n=0}^{\infty }{\left(\mathrm{t}\mathrm{a}\mathrm{n}\mathrm{h}\tau \right)}^{n}\left[{\sum }_{m=0}^{n}{\left(-1\right)}^{m}|n-m,m,m,n-m〉\right]\end{array}$

where τ is an interaction parameter that corresponds to the parametric gain, and the modes are listed in order (ah, av, bh, bv). For example, the four-photon term $|2,0,0,2〉$ comprises two horizontally polarised photons in path a and two vertically polarised photons in path b. Note that we have omitted normalisation. This state has the property that each term indexed by n corresponds to an entangled state having a total of 2n photons, and maps onto the singlet state that represents two spin $-\frac{n}{2}$ systems in the Schwinger representation.28 When τ is small, as is the case for our experiment with τ=0.061, $|\mathrm{P}\mathrm{D}\mathrm{C}〉$ is dominated by the $n=1$ term, which enables post-selection of the two-photon entangled state1 $\frac{1}{\sqrt{2}}\left({|H〉}_{a}{|V〉}_{b}-{|V〉}_{a}{|H〉}_{b}\right)$, which is now common in quantum optics, as ref. 20. For larger τ, the photon intensity grows as 2sinh2 τ (ref. 21). The symmetry and correlation properties of $|\mathrm{P}\mathrm{D}\mathrm{C}〉$ have been the subject of several investigations, with experimental evidence reported for entanglement between 100 photons,29 with possible applications proposed outside of metrology.21,30 However, the potential for increasing temporal indistinguishability when using optical cavities to increase gain τ needs to be considered in future developments of the scheme investigated here.

The four-mode squeezed state (1) that we require is a coherent superposition of multi-photon Fock-state terms; this is distinct to an undesirable statistical mixture of photon pairs. To avoid this mixture in the state generation, it is important to use a pulsed laser pump that has a pulse duration shorter than the coherence length of the photon pairs generated. The detector time bandwidth is longer than the coherence envelope of the generated photons in our experiment. However, with our interference filters (<3 nm bandwidth), we estimate a coherence time of >700 fs, whereas the jitter on the Perkin & Elmer single-photon counting module (SPCM, Excelitas Technologies corp., Waltham, MA, USA) that we use is of the order picosecond. However, the pulses of our pump laser (which we estimate to be <160-fs long, after SHG of 85 fs Ti:sapph) act as an effective gate:11 the pulses are separated by 12 ns, and our detectors and coincidence logic can resolve well within this window. Temporal modes can introduce large amounts of distinguishability between photons generated in SPDC when the pulse length of the pump laser is longer than the coherence length of the photons generated. This is not the case in our experiment. Evidence that we have achieved this is given by the close agreement between theory and experiment of our multi-photon interference fringes in Figure 3. As highlighted by, for example, ref.11, the quantum interference of statistically mixed photon pairs in different temporal modes differs from that of photon-number Fock states.

(ii) The rotation we consider is

$\begin{array}{}\text{(2)}& {U}_{a}\left(\varphi \right)=\left(\begin{array}{cc}\mathrm{cos}\left(\varphi /2\right)\hfill & \mathrm{sin}\left(\varphi /2\right)\hfill \\ \mathrm{cos}\left(\varphi /2\right)\hfill & -\mathrm{cos}\left(\varphi /2\right)\hfill \end{array}\right)\end{array}$

where ϕ is the parameter we wish to estimate with quantum-enhanced precision. This operator maps exactly to rotations of any two-level quantum system, including the relative phase shift in an interferometer. We implement Ua(ϕ) using a half-waveplate in the sensing path a, operating on modes {ah, av}, for which ϕ is four times the waveplate’s rotation angle.

(iii) Finally, photons in each of the four modes (ah, av, bh, bv) are detected with number-resolving photodetection—the original proposal23 assumed fully photon-number-resolving detectors that implement projections onto all Fock states. We approximate number-resolving detection using a multiplexed method25,31 using readily available components. We use four 1×4 optical fibre splitters with 16 ‘bucket detector’ avalanche photodiode SPCM (APDs). Each APD has two possible outcomes: no detection event (‘0’) for a vacuum projection and a detection event (‘1’) for the detection of one or more photons, with nominal 60% efficiency. We use a 16-channel coincidence counting system that records all possible combinations of multi-photon detection events occurring coincidentally across the 16 APDs.

Our approach uses a superposition of all photon-number singlet states and can in principle achieve Heisenberg scaling23 in a similar manner to NOON states. More importantly, this state surpasses the shot-noise limit despite a realistic level of loss that would otherwise preclude any quantum advantage when using NOON states. The original proposal23 predicts a quantum advantage in the presence of up to 50% total system loss using a theoretically optimal (but experimentally challenging) measurement scheme, without post-selection. Figure 2 illustrates new results on the sub-SNL performance of photon counting on the Type-II SPDC scheme in the presence of loss. Intuition for the loss tolerance in this scheme can be gained by considering the effect of losing a single photon from one of the modes: each singlet component transforms into a state that closely approximates another singlet of lower photon number.21 As loss decreases, detection outcomes of equal photon numbers detected in the sensing and reference arm become the dominant contribution (See Figure 4 and the associated discussion below). Experimentally, we observe sub-SNL phase sensitivity in the four-photon coincidence detection subspace of our experiment, without post-selecting zero loss or assuming a fixed photon number in our theoretical analysis. This supports the loss tolerance expected from detecting N>4 from any higher-photon number components of type-II SPDC.23

### POVM theory

A powerful method to simplify calculating measurement outcome probabilities for our experiment is to use the POVM formalism.32,33 All photon-counting operations correspond to POVM elements Er, which are diagonal in the Fock-state basis $\left\{|c〉\right\}$:

$\begin{array}{}\text{(3)}& {E}_{r}={\sum }_{c=0}^{\infty }{w}_{r}\left(c\right)|\mathrm{c}〉〈\mathrm{c}|,\end{array}$

where r and c denote the detection pattern and the photon number, respectively. The weights wr(c) are non-negative and satisfy ∑rwr(c)=1. The probability of r detection events is given by ${\mathit{P}}_{r}=\text{tr}\left(\rho {\stackrel{ˆ}{E}}_{r}\right)$, where ρ is the density matrix of any state input to the measurement setup. For the perfectly number-resolving case, the only non-zero POVM weight is when c=r and wr(r)=1. However, with multiplexed detection, all weights wr(c) with cr can be non-zero. For example, for a two-photon state incident on one of our multiplexed detectors, there is a probability of 1/4 that both photons go to the same APD causing one detection event (w1(2)=1/4) and a probability of 3/4 for two detection events (w2(2)=3/4). The entire table of the POVM weights for our multiplexed system is illustrated in Figure 5. The method to compute these weightings stems from ref. 34 and is explained in the Supplementary Information.

Multiplexed detector POVMs are applied to each of the modes ah, av, bh and bv to compute the probability for a detection outcome r=(rah, rav, rbh, rbv), given a phase rotation ϕ:

$\begin{array}{}\text{(4)}& {P}_{r}\left(\varphi \right)={\sum }_{{c}_{ah},{c}_{av},{c}_{bh},{c}_{bv}\ge 0}{w}_{{r}_{ah}}\left({c}_{ah}\right){w}_{{r}_{av}}\left({c}_{av}\right){w}_{{r}_{bh}}\left({c}_{bh}\right){w}_{{r}_{bv}}\left({c}_{bv}\right){p}_{c}\left(\varphi \right)\end{array}$

where c=(cah, cav, cbh, cbv) is the photon number for each mode and pc(ϕ) corresponds to the probability for a measurement outcome of a perfect projection $|{c}_{ah},{c}_{av},{c}_{bh},{c}_{bv}〉〈{c}_{ah},{c}_{av},{c}_{bh},{c}_{bv}|$. From equation (1), rotation on modes ah and av yields the probability to detect c according to

$\begin{array}{}\text{(5)}& {p}_{c}={\delta }_{{c}_{a},{c}_{b}}\frac{{\mathrm{tanh}}^{2{c}_{a}}\tau }{{\mathrm{cosh}}^{4}\tau }{|〈{c}_{ah},{c}_{av}|\mathrm{U}\left(\varphi \right)|{c}_{bh},{c}_{bv}〉|}^{2}\end{array}$

where photon number for the two paths is denoted by ca=cah+cav and ca=cbh+cbv, and where the Wigner-d matrix element ${d}_{m\prime ,m}^{j}\left(\varphi \right)={|〈j+m\prime ,j-m\prime |U\left(\varphi \right)|j+m,j-m〉|}^{2}$ describes the rotation amplitudes on two separate modes populated by number states,28 and is conveniently represented as a cosine Fourier series.23

We incorporate the total circuit and detector efficiency (η) into the POVM elements via an adjustment of wr(c). We use a standard loss model for which the mode in question is coupled via a hypothetical beamsplitter to an ancillary mode, initially in the vacuum state, which is traced out at the end. We assume that losses are polarisation independent, and therefore all loss that can arise in our setup commutes with Ua(ϕ). We model the single-photon detectors in each multiplexed photon-counting array with the same efficiency ηd, and hence detector loss can be incorporated as a loss channel with efficiency ηd to the combined POVM; this loss commutes with fibre splitters and can be considered as part of the combined system efficiency. The effect of system efficiency η can be incorporated into the multiplexed POVM by the linear transformation:

$\begin{array}{}\text{(6)}& |c〉〈c|↦\sum _{c\prime =c}^{\infty }\left(\begin{array}{c}c\prime \\ c\end{array}\right){\eta }^{c}{\left(1-\eta \right)}^{c\prime -c}|c\prime 〉〈c\prime |\end{array}$

The weights wr(c) are altered correspondingly as illustrated in Figure 5.

## Results

In the reported setup, the typical total rates for detecting two photons in coincidence and four photons in coincidence across the two paths are 17 k per second and 2 per second, respectively, making our setup suitable for observing sub-SNL precision for the subset of fourfold events. Note that, according to our value of τ=0.061, the rate for the generation of four-photon singlets at the sources is less compared with that for two-photon singlets by a factor 3tanh2(0.061)/2~0.006; however, the measured rates of four- and two-photon coincidences are modified from this because of the effects of losses, as well as limited resolution for the multiplexed photon-number counting for discriminating Fock states $|1〉$ and $|2〉$.

We plot in Figure 3 all nine possible four-photon detection patterns r of two photons in the reference path and two photons in the sensing path as a function of ϕ, measured simultaneously by the setup in Figure 1b. For comparison, we plot these data together with theoretical curves Pr(ϕ), normalising to the total counts collected at each ϕ. These theoretical curves use the measured experimental parameters of τ=0.061, and lumped collection/detection efficiencies of ηa=0.23 and ηb=0.12 in the sensing and reference paths, respectively (a geometric average of 83.2% loss), assuming otherwise perfect Ua(ϕ) and photon interference. Parameters τ, ηa and ηb are extracted from our setup as follows. The probability of generating one pair of photons in SPDC is computed via equation (1) and given by

$\begin{array}{}\text{(7)}& p=2{\mathrm{tanh}}^{2}\left(\tau \right){/\mathrm{cosh}}^{4}\left(\tau \right)\end{array}$

We sum the normalised detection rates of pairs of detected single photons that are not part of a coincidence event with 1 other photon event or with no events elsewhere in the detection scheme:

$\begin{array}{}\text{(8)}& {P}_{\left(1,0,0,0\right)}+{P}_{\left(0,1,0,0\right)}=p{\eta }_{b}\left(1-{\eta }_{a}\right)\end{array}$
$\begin{array}{}\text{(9)}& {P}_{\left(0,0,1,0\right)}+{P}_{\left(0,0,0,1\right)}=p\left(1-{\eta }_{b}\right){\eta }_{a}\end{array}$

Summing all four (normalised) twofold coincidences yields

$\begin{array}{}\text{(10)}& {P}_{\left(1,0,0,1\right)}+{P}_{\left(0,1,1,0\right)}+{P}_{\left(1,0,1,0\right)}+{P}_{\left(0,1,0,1\right)}=p{\eta }_{a}{\eta }_{b}\end{array}$

This is then solved for ηb and ηa, then for p, and hence τ via a cubic equation. These relations are assumed to be constant with respect to the phase rotation ϕ and taken as the ϕ-average over experimental data.

The asymmetry in ηa and ηb arises from the different spectral width of the extraordinary and ordinary light on, respectively, paths a and b, passing through identical spectral filtering.35 The setup is robust to this, as the state symmetry is preserved despite ηaηb, provided loss is polarisation insensitive.21 From the data presented in Figure 3, we extract the probability distributions pi(ϕ) as least-square fits from each data set, and normalise such that ∑ipi(ϕ)=1.

Statistical information about ϕ can be extracted from the frequencies of each output detection pattern and quantified using Fisher information $\mathcal{J}\left(\varphi \right)$, (ref. 36). We compute the Fisher information of our demonstration using two methods, both plotted in Figure 6. The first (solid black line) is directly computed using the experimentally extracted pi(ϕ) in the relation $\mathcal{J}\left(\varphi \right)={\sum }_{i=1}^{9}{p}_{i}{\left(\varphi \right)}^{-1}{\left(\mathrm{d}{p}_{i}\left(\varphi \right)/\mathrm{d}\varphi \right)}^{2}$, with error estimated using a Monte-Carlo simulation that assumes Poisson-distributed noise on the four-photon detection rates. The second method is to obtain the variance Δ2ϕj of maximum-likelihood estimates {ϕj}, each using $\mathcal{N}$ photons, and to evaluate the relation ${\mathcal{J}}_{\mathrm{M}\mathrm{L}}=1/\left(N×{\Delta }^{2}{\varphi }_{j}\right)$. Note that maximum-likelihood estimation saturates the Cramér–Rao bound and loses any bias as data are accumulated, and it is practical for characterising an unknown phase when pi(ϕ) are characterised. We simulate $ℳ=10,000$ maximum-likelihood estimates for a discrete set of waveplate settings, and for each estimate we sample $\mathcal{N}=1,000$ times from pi(ϕ). This number of samplings ensures unbiased and efficient estimation.36 Computed values of ${\mathcal{J}}_{\mathrm{M}\mathrm{L}}\left(\varphi \right)$ are then plotted (circles) in Figure 6, showing close agreement with $\mathcal{J}\left(\varphi \right)$.

We also plot in Figure 6 theoretical Fisher information computed from the POVM description of our multiplexed detection system, by taking into account the SPDC gain parameter τ and the total circuit and detector efficiency η of our setup. We find general agreement of the main features between theory and experiment ($\mathcal{J}$ and ${\mathcal{J}}_{\mathrm{M}\mathrm{L}}$), whereas the discrepancy is attributed to imperfect waveplate rotations and imperfect indistinguishability (mode overlap) of multi-photon states across the four modes.

Figure 6 also shows the shot-noise limit for two photons passing through the measured phase, computed on the basis of the average photon number in the sensing path for the fourfold detection events. For our experiments τ<0.1, which bounds the Fisher information for the target path and is computed to lie in the range 2.01±0.01. The shaded region displays the quantum advantage over the shot-noise limit for the post-selected subset of fourfold detection events—the maximum advantage, over detecting the same intensity as for our experiment in arm a but for classical interferometry, achieved in our experiment is 28.2±2.4% at ϕ=3.91±0.06 rad. The theoretical maximum advantage that can be achieved by the scheme with our τ and η parameters is 45%. This is possible because of the low rate of components with six photons or more compared with four photons for the pre-loss SPDC state (0.5%), and a quantum advantage is predicted theoretically for τ0.250 with our experimental values for loss.

An important feature of the theory and experiment curves in Figure 6 is the troughs in $\mathcal{J}$ (similar features were presented elsewhere, e.g., the Supplementary Information for ref. 25), occurring about points where some or all of the fringes in Figure 3 have minima or maxima. In contrast, when all experimental imperfections are absent, $\mathcal{J}$ is predicted to be independent of phase rotation—a common feature of metrology schemes using photon-number-counting measurement.37 The definition of $\mathcal{J}\left(\varphi \right)$ reveals points of instability when the numerator ${\mathrm{d}p}_{i}\left(\varphi \right)/\mathrm{d}\varphi$ vanishes but pi(ϕ) does not—this will arise even with very small experiment imperfections that lead to interference fringes with visibility <1. A solution is to incorporate a reference phase in conjunction with a feedback protocol to optimise the estimation of an unknown phase.25 The symmetry of the generalised singlet state at the heart of this scheme enables a control phase to be placed on the reference path as opposed to the sensing path in the traditional manner. We demonstrate the feasibility of the former by repeating our experiment with a controllable reference phase rotation (θ in Figure 1b) placed in the reference path b that shifts the regions of maximal sensitivity with respect to the phase in the sensing path—see Supplementary Information for four-photon interference fringes and corresponding Fisher information. This may find practical application where the reference phase has to be separated from the sensing path. Furthermore, the reference path could be used for heralding to maximise the Fisher information per photon passing through the unknown sample using fast switching38 of the sensing path conditioned on detection events at the reference path. Using heralding and perfect photon-number-resolving detection, the entire downconversion state can achieve quantum advantage with the τ value from our experiment (see Supplementary Information).

Finally, we return to a detailed theoretical analysis of the contribution to phase sensitivity of subsets of outcomes (ra, rb). The symmetry property of $|\mathrm{P}\mathrm{D}\mathrm{C}〉,$ namely ${U}_{a}\otimes {U}_{b}|\mathrm{P}\mathrm{D}\mathrm{C}〉=|\mathrm{P}\mathrm{D}\mathrm{C}〉$ for arbitrary unitary rotation U of polarisation modes, is preserved under polarisation-insensitive photon loss. This maps onto the requirement of polarisation-insensitive loss in our setup—we denote the loss rates in path a (or b) by $1-{\eta }_{a\left(b\right)}$. The symmetry property dictates the general form of the type-II SPDC state post loss, as well as key features of the detection probabilities (after polarisation rotation of ah, av) when outcomes are grouped according to total detections in each arm (ra, rb) (refs 21,23). Consequently, we consider renormalised probabilities ${P}_{r}^{\prime }\left(\varphi \right)={P}_{r}\left(\varphi \right)/{P}_{\left({r}_{a},{r}_{b}\right)}$, where $r=\left({r}_{ah},{r}_{av},{r}_{bh},{r}_{bv}\right)$ and ${P}_{\left({r}_{a},{r}_{b}\right)}$ denote the total probability for detections in the subset (ra,rb). ${P}_{\left({r}_{a},{r}_{b}\right)}$ is independent of the rotation angle when fully number-resolving detectors are used. Only when the total number of photons detected in each of the sensing and reference paths are equal (ra=rb) does ${P}_{r}^{\prime }\left(\varphi \right)$ have a singlet contribution (the na=ra=rb singlet term in equation (1)), whereas all other contributions arise because of loss from singlets with greater photon number (and reduce the achievable precision). The effects of loss on ${P}_{r}^{\prime }\left(\varphi \right)$ are parameterised by $\Xi =\sqrt{\left(1-{\eta }_{a}\right)\left(1-{\eta }_{b}\right)}\mathrm{tanh}\tau$: Ξ increases with higher losses and higher values for τ (which increase the rate of singlet production at high-photon numbers), and it is reduced if the losses are predominantly in one arm. To achieve sub-shot noise precision without post-selection, values of τ and ηa,b must be chosen to achieve simultaneously a low value of Ξ, high weighting for ${P}_{\left({r}_{a},{r}_{b}\right)}$ with ra=rb>1 and high average photon number. The components with ra=rb provide supersensitive precision when Ξ is small: this enabling principle of our scheme can be demonstrated using post-selection when high values for τ and ηa (b) are not achieved, as in our current setup. We give illustrative examples in Figure 4.

## Discussion

We have demonstrated the key enabling principles of a promising technique for realising practical quantum-enhanced sensors that is robust to loss and designed to use a photon source based on current technology. Our experimental results demonstrate the validity of our theoretical predictions concerning phase supersensitivity for those detection events that correspond to multi-photon singlet states. Our theory in turn predicts that supersensitive precision is achievable without post-selection in an experimental regime that is achievable with near-term and state-of-the-art componentry, whereas detection events that have contributions from singlet components behave as in our experiment. Our approach is amenable to near-term implementation in an integrated architecture with on-chip interference of downconversion,15 on-chip16 detectors and integration with micro-fluidic channels.5 This would enable inherently a stable path encoding to measure very small optical path lengths—other bulk optical techniques could be used to achieve this to convert polarisation to path, such as in ref. 39. We note that our scheme does not require optical delays or GHz switching. This is in contrast to other quantum technology schemes that rely on generating a fixed number of photons and therefore require greater architecture complexity—e.g., reliance on heralded photon sources that comprise fast switching, low-loss optical delay, high-speed detection and micro-electronic integration. Our demonstration now shifts the emphasis for practical quantum metrology onto developing and using low-loss circuitry and high-efficiency photon detection; 95% efficiency transition edge sensor40 and 93% efficient super-conducting nanowire detectors operating in the infrared41 have recently been reported. For a given efficiency η, the gain parameter τ in the downconversion process also dictates the level of precision the scheme can achieve. As circuit loss is reduced, it would be beneficial to increase τ to the values (τ>1) studied in ref. 23; enhancing SPDC with a cavity42 may be a promising approach to achieve this, enabling future experimental work to examine larger photon detection subsets.

## Materials and methods

Horizontally polarised 404-nm laser pulses (estimated to be <160-fs long) are generated by up-conversion (SHG) of a Ti-Sapphire laser system (85 fs pulse length, 80MHz repetition rate). This is focused to a waist of 50 μm within a 2-mm-thick non-linear β barium borate crystal, phase matched for type-II SPDC, to ideally generate the state $|PDC〉$ at the intersection of the ordinary (o) and extraordinary (e) cones of photons20 in paths a′ and b′ of Figure 1b. Spatial and temporal walk-off between e and o light is compensated20 with one half-waveplate (optic axis at 45° to the vertical) and one 1 -mm-thick β barium borate crystal in each of the two paths a and b. The spectral width of ordinary and extraordinary light generated in type-II SPDC differs, leading to spectral correlation of the two polarisations. Setting one waveplate to 90° and aligning the two paths onto a PBS separates the e and o light, sending all e light onto output a and all o light onto output b, ref. 43. This removes spectral-path correlation in the PDC state, leaving only polarisation entanglement across paths a and b, and thus erasing polarisation-dependent loss in the sensing path and the reference path of the setup.