Introduction

Sensitivity in standard optical imaging and sensing, such as those exploiting classical illuminating fields, is fundamentally lower bounded by the shot noise, which is the inverse square root of the number of photons used. Surpassing such a limit is particularly effective when there is a constraint on the usable optical power, for example, a limit determined by the damage threshold of the sample1, the stress of the optical elements2, or alteration of chemical and biological photo sensitive process and, most fundamentally, quantum back-action3. Following the seminal works of Caves4, which have demonstrated how squeezed light can improve the sensitivity in interferometry, non-classical states of light have long been considered to overcome shot noise, thus prompting deep theoretical investigations and many proposed schemes5, 6, 7, 8, 9.

The experimental possibility of generating two-photon entangled states10 (such as NOON states with N=2) and the availability of single-photon detectors have enabled the demonstration of the potential of quantum enhanced sensing, aimed at reaching the fundamental Heisenberg limit in phase-contrast polarization microscopy11, 12, magnetic field sensing13 and solution concentration measurement14.

However, almost all previous experimental results have been proof of principle demonstrations6, because of the difficulties in generating high photon number entangled states and a high photon flux (comparable to that used in classical schemes) and protecting them from decoherence up to the detection, thus limiting the optical and detection losses15.

Some remarkable results have been obtained because of recent progress in the generation of a highly non-classical single mode16 and few modes squeezed states17: these have been successfully implemented in gravitational wave interferometry2, for particle tracking in a biological environment18, 19, and to some extent, for beam displacement measurement20, 21 and optical magnetometry22.

Most quantum-enhanced imaging and sensing protocols have been obtained exploiting single or few spatial modes of the quantum probe beam, such as in the case of squeezing, and with single photon detection in schemes based on two-photon correlated states. In both cases, only one parameter of the system, namely a single point of the sample, can be probed in a single run. The reconstruction of the sample as a whole requires time consuming scanning and accumulation of many detection windows. Instead, the exploitation of a high number, namely thousands, of modes in the same run is the requirement for quantum enhanced wide field imaging. In practice, the number of spatial details of a structure that can be probed at the same time is determined by the number of spatial modes enclosed in the illuminating field. Indeed, one of the challenges in quantum optics and quantum enhanced imaging is to generate and efficiently detect highly non-classical features in a multi-mode regime23, 24, 25, 26, 27, 28. The first proof of principle of a quantum enhanced imaging protocol exploiting this parallelism was reported in Ref. 29, following the proposal of Ref. 30, even if the average enhancement and the poor spatial resolution were not sufficient for any practical purposes, such as in absorption microscopy, in which the technique is naturally addressed. In this paper, we address this point by reporting the realization of a sub-shot-noise (SSN) microscope exploiting thousands of spatial modes, detected independently by the same number of pixels of a charge-coupled-device (CCD) camera operated in the linear (non-amplified) regime. Thousands of photons per pixel are detected in the exposure time of a single shot. Therefore, the microscope operates in a wide-field regime (no scanning is required) and is suitable for dynamic imaging. The microscope is based on the non-classical and spatially multimode correlations of squeezed vacuum that is naturally generated by a traveling wave parametric amplifier in both low and high gain regimes24, 26, 30, 31, 32. The noise of the image, formed by the probe beam interacting with the sample, is locally reduced by pixel-by-pixel subtraction of the correlated noise pattern measured on the other beam30. Moreover, we introduce the concept of a quantum enhanced median filter33: quantum noise reduction at different spatial scales can be naturally combined with the statistical noise smoothing used in every standard image processing algorithm, thus resulting in an impressive overall enhancement in object recognition.

We attain a significant improvement in sensitivity with respect to that of any classical absorption microscopy system at the same illumination level. Our present results outperform the previous proof of principle demonstrations29, 34, substantially improving the resolution by a factor of 10–100 (depending on the sensitivity level), in terms of both pixel count and the size of the imaged details of the sample.

Wide field microscopy is the simplest, fastest, least expensive and oldest imaging modality used, for example, for live cell imaging. It has the advantage of requiring the lowest photon dose, especially for absorbed light imaging. The lowest photon dose that achieves a measurable metric for the experimental question should be used, for instance to ensure that the cellular processes under investigation are not shifted to an alternate pathway due to environmental stress35. Indeed, the results presented here could potentially have immediate applications in many fields, such as biology and biochemistry. Furthermore, the comparison with the reference beam can be used to provide the absolute value of the absorption, thus providing the possibility for quantitative analysis of the related properties. With small modifications (essentially in the data processing), our technique can also serve as a basis for enhancing sensitivity in schemes with different goals: for example, ghost imaging36, 37, 38, 39, 40, 41, detection and imaging in the presence of environmental or electronic noise background42, 43 and accurate characterization of retina rod-cell response to single photon stimulation44. Finally, this new capability can also be used in quantum radiometry, for example, for the absolute calibration of detectors with spatial resolution, as demonstrated in Refs 45, 46, 47, 48.

Materials and methods

In absorption wide-field imaging, as in standard microscopy, a probe illuminates the sample all at once and the transmitted pattern is imaged by the detector, typically as the pixel array of a camera. The intensity measured by each pixel, Nα, here expressed in number of photons, has a mean expectation value , where is the mean number of detected photons per pixel in absence of the sample and α is the absorption coefficient. The photon noise of the measurement can be obtained by modeling the absorption as the action of a beam splitter of transmittance 1–α on the beam with initial variance (Ref. 49), thus resulting in the form , where is the Fano factor in absence of the sample. The value F=1 establishes a bound between classical and non-classical photon statistics. In particular, F is lower bounded by the unity for classical states, whereas specific non-classical states may have sub-Poissonian photon statistics, that is, 0≤F<1. The uncertainty of the absorption estimation in the direct (DR) imaging scheme is therefore

The limit of the sensitivity for a classical probe (F=1) is , representing, for small absorption, the shot-noise limit with the typical scaling of the inverse square root of the number of photons. However, by inspecting Equation (1), it is clear that non-classical optical fields with Fano factor smaller than one allow beating the shot noise limit. We note that the Fano factor, appearing in Equation (1), is usually deteriorated with respect to the value of F0 for the unperturbed (pure) state, due to optical losses, including detector quantum efficiency. In particular, one obtains F=ηF0+1–η where we define the overall detection probability 0≤η≤1. Thus, the non-classical behavior, in terms of noise reduction, has a lower bound of Floss=1–η. It must be emphasized that splitting a single mode beam into n pixels leads to a detection probability of the order of η≤1/n for each pixel, ruling out the possibility of using single modes for SSN wide-field imaging for any reasonable number of pixels. Thus, the necessity of having many non-classical spatial modes is evident, each addressing a single pixel with limited losses. Even if sub-Poissonian light beams were obtained as single or few modes, it is not completely obvious how to generate a beam with a high number of sub-Poissonian modes and simultaneously detect them. On the other side, it is relatively simple to produce a pair of beams that are (individually) spatially incoherent but locally correlated at the quantum level, by means of traveling wave parametric amplifier in the spontaneous regime. Even if the fluctuations of a single spatial mode in one beam are super-Poissonian, these fluctuations are perfectly replicated in the correlated mode of the second beam because of photon number entanglement. This correlation is verified for all of the wide range of localized transverse spatial modes. The degree of correlation and its non-classical features can be quantified on the basis of the noise reduction factor (NRF) (Refs 23, 24, 26, 30, 31, 32, 49, 50, 51, 52), measured for a pair of pixels by collecting correlated spatial modes. The NRF indeed represents the equivalent of the Fano factor for bipartite state, in which the shot noise level becomes . While for classical beams the NRF is lower bounded by 1, quantum correlation can lead to 0≤σ<1. In particular, in the presence of losses, σ=1–η.

Whereas twin beams in a single spatial mode have been used to demonstrate SSN absorption measurements in a double-beam scheme53, 54, parametric down-conversion (PDC) multi-mode quantum correlations can be used for wide-field SSN imaging30.

The object is placed in one beam, and the second beam is used as a reference. Note that the double-beam (or double-path) approach is commonly used in imaging and spectroscopy involving faint absorptions, because it enables the canceling out of classical (super-Poissonian) noise and provides a direct estimation of the absolute transmittance (absorption) by instantaneous comparison with the unperturbed reference beam. We consider the intensity difference between two correlated pixels of the two beams, whose expectation value is (we assume balanced beams, ). In this context, the noise can be expressed in terms of the noise reduction factor in absence of the sample, in the form (see Ref. 30).

Therefore, the absorption uncertainty in the differential (DF), both classical and quantum, imaging scheme is

The performance of the classical differential (DC) imaging scheme is derived from Equation (2) by substituting σ=1. In the situation of interest, the absorption is so small that the term α2(F–1) is negligible even in presence of classical super-Poissonian noise of the source (F>1), and the uncertainty in the differential classical imaging scheme becomes , which is only a factor of larger than shot-noise-limited direct imaging. Under the same condition, the quantum enhancement provided by SSN correlations with σ<1 is also quantified in terms of signal-to-noise ratio, SNR=αα, by using Equations (1) and (2), as

The advantage with respect to the differential classical scheme appears when σ<1, wheras a more strict condition, σ<1/2, is needed to beat the direct (shot-noise-limited) scheme.

The setup for the SSN microscope scheme is sketched in Figure 1a. A CW laser beam (100 mW at λp=405 nm) pumps a 1-cm type-II beta-barium-borate nonlinear crystal, in which two correlated beams are generated. The far field of the emission, where spatial correlation occurs, is realized at the focal plane of a lens with fFF=1 cm focal length. Then, the far field plane is imaged (magnification factor M=7.8) to the detection plane by means of a second lens system with fIM=1.6 cm. The detector is a CCD camera Princeton Inst. Pixis 400BR Excelon, operating in linear mode (no electro-multiplication gain), with high quantum efficiency (>95% at 810 nm), 100% fill factor and low noise [read noise is few e/(pix·frame)]. The size of the physical pixels of the camera is 13 μm, but here we group them by a 3 × 3 hardware binning. Hereafter, if not explicitly indicated, the single elementary pixel is intended to have a linear size of 39 μm, thus causing a sufficient reduction of the effect of the read noise for our purposes. A spectral selection is performed with two identical interferential filters (800±20 nm, with transmission of 99%), one just after the crystal and the other mounted on the camera. A test sample, with absorption α=1% representing the Greek letter Φ (size 300 × 400 μm) is realized by a few nanometers thick titanium deposition on a coated glass slide.

Figure 1
figure 1

Experimental setup. (a) Schematic of the experiment. (b) Image of the single shot acquired by the CCD camera in 100 ms. The pixel size is 39 μm, corresponding to 5-μm resolution in the focal (object) plane. (c) 2D map of the NRF for 5-μm resolution in the focal (object) plane.

When inserted, the letter intercepts one beam at the focal plane of the far field lens whereas the slide extends to the second beam (in the far field, the centers of the two beams are separated by 1.0 mm). It is important to note that the acquisition time of a single shot is typically 1011 times the coherence time of the PDC process (on the order of 10–12 s). Since the number of photons per pixel is ~103, the occupation of the single temporal mode is 10–8 photons/mode. Therefore, the statistics of the noise should be Poissonian, that is, shot-noise limited.

In the crystal, a photon of the pump beam is converted into a pair of photons with lower frequency, thus fulfilling the energy and momentum conservation (phase matching-condition). In particular, if the pump is approximated by a plane wave with transverse momentum q=0, the two photons of a pair must fulfill q1+q2=0, thus being emitted with opposite transverse momenta. The lens maps momentum q at wavelength λ into point x=λfq/2π at the focal plane, thus leading degenerate photons (having λ1=λ2=λd=2λp=810 nm) being found in symmetric positions x1=–x2 with respect to the pump direction, as shown in Figure 1a. Two symmetric pixels of the camera should always detect the same number of photons. Moreover, even if the emission is broadband both in frequency and momentum, phase matching conditions establish a relationship between the wavelength and the photon direction, specifically the modulus of the transverse momentum, so that photons are emitted in concentric cones, each corresponding to a certain wavelength. In Type-II PDC, the correlated photons have orthogonal polarizations, and the emission cones have different centers. The intensity distribution, detected in a 100-ms shot, is shown in Figure 1b. The dotted circumferences approximately represent the wavelength distribution in the two orthogonally polarized beams. By selecting two symmetric regions, A and B, around degeneracy (represented by red squares in the picture), one expects each pixel of A to be non-classically correlated with a corresponding symmetric pixel in B. In practice, there are two important limiting factors to this imaging system that must be considered. One limits the field of view, and the other lower bounds the spatial scale at which the noise subtraction can be efficiently performed:

  • The trade-off between field of view and NRF: Near the degeneracy wavelength for correlated photons, λ1=λdλ and λ2=λd–Δλ; the point-to-point correlation in the far field becomes . Therefore, the center of symmetry of the correlations is x=0 only at the degenerate wavelength. As long as one moves from the degeneracy, the center of symmetry shifts proportionally to . After the pixel grid has been positioned to be symmetric with respect to x=0, only a relatively small spectral bandwidth around degeneracy can be tolerated (in our case, ~40 nm). This reflects on the available angular bandwidth, see Figure 1b, which corresponds to an approximately 500 × 500 μm field of view in the focal plane. Figure 1c shows a map of the NRF obtained by subtracting locally, pixel-by-pixel, the two regions A and B. The best NRF occurs along the degenerate ring, and far from it, the correlation slightly decreases because the corresponding pixels of the two regions no longer perfectly intercept correlated directions. Moreover, right-hand part shows a further increase of the NRF, which our experience suggests can be ascribed to a small aberration in the optical system, and which we are able to decrease but not completely suppress. This technical issue can be solved by a careful analysis and realization of an ad-hoc optical system.

  • The trade-off between spatial resolution and NRF: In practice, rather than a plane wave, the pump is a Gaussian beam propagating along the z direction, with a waist wp, and thus with a transverse momentum distribution centered at q=0 with bandwidth Δq~2/wp. Transverse momentum conservation q1+q2=0±Δq leads to a less strict position correlation in the far field, x1+x2=±Δx, where the relative uncertainty of the photon position is Δx≡2r~λf/πwp. A measurement of this spatial uncertainty is provided in Figure 2, representing the spatial cross correlation function of the noise patterns of two symmetric regions of 40 × 40 physical pixels. The cross section is Gaussian with two slightly different FWHM in the horizontal and vertical axes, estimated to be 2ry=5.6(0.2) μm and 2rx=6.8(0.2) μm, respectively. It is clear that two symmetric pixels detect most of the correlated photons only if their size L is larger than this uncertainty (see Ref. 32).

    Figure 2
    figure 2

    Spatial cross correlation function. The 2D map represents the value of the correlation coefficient between two regions of 40 × 40 pixels, chosen approximately in symmetric positions at the detection plane. Here, we exploited the full resolution of the camera (physical pixel of 13 μm), which corresponds to 1.7 μm in the object plane. The peak represents the position in which the regions are well correlated, pixel-by-pixel. Shifting one of the regions in the pixel grid more than the spatial jitter of the correlated photon makes the correlation coefficient drop to zero. The vertical and horizontal sections are shown in the left-hand-side and bottom graphs, respectively, with their fitting Gaussian functions and the indication of the FWHMs. Corr. coeff, correlation coefficient.

Results and discussion

The NRF can be described by the function σ=1–η0·ηcoll, in which the detection probability η is split into the term η0, representing the transmission-detection efficiency of the optical path, and 0<ηcoll<1, representing the collection efficiency of correlated photons. The efficiency ηcoll is a monotonically increasing function of the ratio L/2r, which reaches the asymptotic value ηcoll~1 for (Supplementary Information). Thus, at different resolution scales, given by the size of pixels or, more generally, by the spatial scale in which the signal is integrated, the noise reduction factor and thus the SNR, according to Equation (3a) and (3b), are different. The spatial information at different scales can be recovered starting from a high resolution image in which the pixels size xpix can be smaller than the correlation area and by averaging the signal in groups of d × d pixels so that L=d·xpix. This is particularly useful, given that it can be interpreted as a quantum-enhanced version of the standard median filter used for noise correction in classical imaging. In the classical median filter, the value of a pixel is replaced with the mean value of its d × d neighborhood, thus enabling a reduction of the noise simply by a statistical cancelation. Of course, this comes at the expense of filter details smoothing. If the median filter is applied to the noise-subtracted image, the two effects, namely the statistical smoothing and the photon noise subtraction, combine, thereby allowing a net improvement of faint object recognition. Figure 3 shows an example of the application of the median filter to a direct image, a classical differential image, and a SSN image. The upper-right panel represents the shape of the object obtained by averaging over 300 shots at full resolution (L=5 μm). The image represents the absorption coefficient, assuming values around zero outside the Φ and value around α=0.01 inside the Φ. The other panels present single shot images at different resolution scales. The SSN images are obtained by the subtraction of the correlated noise from the direct image. The DC images are simulated by subtracting an uncorrelated (but shot-noise-limited) noise pattern, for example, a noise region shifted more than 2r from the correlated position. Notably, the single shot image of the sample is completely hidden by the shot noise at the full resolution, d=1. When the median filters are applied, d>1, the shape of the sample clearly emerges, especially for the SSN case. As discussed before, the advantage of the SSN image increases with the scale d of the median filter. The DC image is the poorest because it contains twice the shot noise. Notably, DC imaging is advantageous compared with the DR imaging if classical super-Poissonian noise is present, in particular when F>2.

Figure 3
figure 3

Application of a median filter to a single shot image for different integration scales d. The direct (DR) image, the differential classical (DC) image, and the SSN image are compared in each panel for the same value of d. Upper-left panel is the image of the object after averaging over 300 shots.

The exposure time of the single shot in our setup is 100 ms, and the read-out time of the pixel matrix is few hundreds of milliseconds. In a few seconds, it is possible to realize a SSN movie (See Supplementary Movie) in which the sample is simply translated by a micrometer stage during the acquisition. Even in this case, the edges of the moving object are easier to follow in the SSN imaging box (right-hand side window) than in the DC imaging (central window) or in the DR imaging box (left-hand side window), thus demonstrating that our technique is suitable for dynamic imaging. With a more powerful or pulsed pump laser and the faster operating modes achievable by commercial cameras, rate of hundreds of frame/s should be reasonably attained.

The statistical analysis of the SNR is performed by acquiring shots with and without the sample. The shots without the sample are used to estimate the NRF and Fano factor at different scales and to eliminate static intensity and efficiency gradients by a standard flat field algorithm. Each shot with the sample contains a DR shot-noise limited image and the correlated noise. The NRF of the nth shot is evaluated experimentally through spatial statistics over the ensemble of the correlated pixel pairs belonging to regions A and B, namely , where is the mean value of the pixel of the region and is the variance. The average NRF over the 300 values is reported in the graph in Figure 4 for different resolution scales in the object plane (see also Supplementary Information). In particular, we note that already for a resolution of 5 μm, comparable with the correlation FWHM of the spatial correlation function, the system reaches an NRF of σ=0.8, thus providing better performance than DC imaging (see Equation (3a)) in a matrix of >8000 pixels. For a resolution of 15 μm the NRF is below 0.5, which is the threshold to surpass the performance of the DR, SSN limited, imaging (see Equation (3b)). Similarly, the Fano factor is evaluated and reported in Figure 4, thus demonstrating the Poissonian character of the light statistics.

Figure 4
figure 4

Experimental noise reduction factor (NRF) and signal-to-noise ratio (SNR) as a function of the resolution in the focal (object) plane L (or equivalently as a function of the binning d of the median filter, upper scale). Red dots represent the average of the NRF in a region of the same size as the Φ object, that is 400 × 300 μm. The black dots are the SNR of the SSN images normalized to that of the direct images. For L≥15 μm, the quantum protocol has advantages. Analogously, the blue series demonstrates that the advantage of the SSN imaging with respect to the differential classical imaging is present at any spatial resolution and reaches values of more than 80%. Solid lines correspond to the quantum enhancement predicted by Equation (3a) and (3b), when the estimated values of the NRF are considered.

The SNR is estimated over a stripe of pixels of the images after the application of the median filter (in particular, we consider a vertical stripe inside the main axis of the letter Φ). We label α(n)(x) as the absorption value of the pixel in position x of the nth shot. First, the SNR for each position is evaluated as , where is the experimental temporal average of the absorption and is the variance. Then, the spatial average of the SNR(x) for x belonging to the vertical stripe is evaluated. The experimental results, demonstrating the advantages of the quantum noise subtraction, are reported in Figure 4. The data are compared with the theoretical predictions obtained by substituting the estimated NRF in the theoretical expression of Equation (3a) and (3b). The SNR improvement of the SSN imaging is slightly higher than expected, with respect to both DR and DC imaging, because the SNR is evaluated on a vertical stripe close to the degenerate wavelength region, where the NRF is slightly lower than the average (see Figure 1c and the related discussion).

Conclusions

In summary, we realized the first sub-shot-noise wide field microscope, demonstrating a noise reduction of 20% below the shot noise for each resolution cell (pixel) of 5 μm in a matrix of ~8000 pixels. This value is sufficient for wide-field imaging of complex structures. Increasing the spatial scale of the details, the noise reduction factor improves accordingly; for example, a resolution of 25 μm is 62% below shot noise and for 50 μm is 72% on average below shot noise level. At this last scale, for example, the SNR is almost twice that of the classical differential technique for the same illumination level, whereas the sensitivity improvement with respect to classical direct imaging (obtained by a single, shot noise limited beam) is ~30%. Equivalently it allows one to maintain the same SNR reducing almost four times the illumination level compared with the DC imaging and almost twice compared with DR imaging. In general, to have the SNR of DC (or of DR) imaging, the photon number of the quantum beam can be decreased by a factor σ (2σ for DR). We showed that the decrease in the quantum noise at different scales is fully compatible with, and actually improves, the standard noise reduction techniques, on the basis of a posteriori elaborations of the image taken at full resolution. We demonstrated this important point with a sample made of a ultra-thin metallic depositions on a glass slide. These performance levels represent a breakthrough, filling the gap between the proof-of-principle of quantum enhanced scheme and a system suitable for applications6, 55.

We believe that our technique has the potential for wide-spread use in absorption microscopy. The spatial resolution can be further improved (nothing prevents reaching the Abbe limit), and the range of applicability can be extended by engineering highly non-classical bright squeezed vacuum sources in a pulsed regime52.

Author contributions

IRB, AM and MG conceived the idea of the experiment, which was discussed and designed with input from all authors. NS, IRB, AM realized the experimental setup and collected the data in INRIM quantum optics labs (coordinated by MG). All authors discussed the results and contributed to the writing of the paper.