Phasor field diffraction based reconstruction for fast non-line-of-sight imaging systems

Liu, Xiaochun; Bauer, Sebastian; Velten, Andreas

doi:10.1038/s41467-020-15157-4

Download PDF

Article
Open access
Published: 02 April 2020

Phasor field diffraction based reconstruction for fast non-line-of-sight imaging systems

Nature Communications volume 11, Article number: 1645 (2020) Cite this article

8483 Accesses
79 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Non-line-of-sight (NLOS) imaging recovers objects using diffusely reflected indirect light using transient illumination devices in combination with a computational inverse method. While capture systems capable of collecting light from the entire NLOS relay surface can be much more light efficient than single pixel point scanning detection, current reconstruction algorithms for such systems have computational and memory requirements that prevent real-time NLOS imaging. Existing real-time demonstrations also use retroreflective targets and reconstruct at resolutions far below the hardware limits. Our method presented here enables the reconstruction of room-sized scenes from non-confocal, parallel multi-pixel measurements in seconds with less memory usage. We anticipate that our method will enable real-time NLOS imaging when used with emerging single-photon avalanche diode array detectors with resolution only limited by the temporal resolution of the sensor.

Imaging 3D chemistry at 1 nm resolution with fused multi-modal electron tomography

Article Open access 26 April 2024

Metasurface-enabled single-shot and complete Mueller matrix imaging

Article 02 May 2024

Neural étendue expander for ultra-wide-angle high-fidelity holographic display

Article Open access 22 April 2024

Introduction

Time of flight Non-line-of-sight (NLOS) imaging uses fast pulsed light sources and detectors combined with computational methods to image scenes from indirect light reflections making it possible to reconstruct images or geometry of the parts of a scene that are occluded from direct view. Due to this unique capability, NLOS imaging is promising for applications in diverse fields such as law enforcement, infrastructure assessment, flood prevention, border control, disaster response, planetary research, geology, volcanology, manufacturing, industrial monitoring, vehicle navigation, collision avoidance, and military intelligence. In a time-resolved NLOS imaging measurement, points on a relay wall are illuminated by a picosecond laser. Light from these points illuminates the hidden scene and a fast detector captures the optical signal returned from the scene at points on the relay wall. A suitable computational method is then used to decode the image around the corner.

Despite recent breakthroughs, obtaining a high resolution real time or near real time NLOS video remains elusive. An algorithm suitable for fast NLOS imaging must fulfill three separate requirements: The ability to use data that can be captured in real time, a computational complexity allowing for execution in a fraction of a second on a conventional CPU or GPU, and a memory complexity suitable for use in the limited memory of such a system.

After theoretical exploration of the problem^1,2, the first experimental demonstration of NLOS imaging used a filtered backprojection (FBP) algorithm^3,4 similar to inverse methods used in computed tomography. Modified FBP algorithms such as error backprojection⁵ and Laplacian of Gaussian (LOG) FBP⁶ can provide high quality reconstructions, but have a high computational complexity and take minutes to hours to execute on a desktop computer. Buttafava et al.⁷ show that it is possible to use a gated Single-Photon Avalanche Diode (SPAD) for NLOS imaging. SPADs can potentially be manufactured at low cost and in large arrays enabling fast parallel NLOS capture.

Among the fastest current reconstruction methods, O’Toole et al.⁸ propose a Light Cone Transform (LCT) method based on co-located illumination and detection points and acquire all measurements through a scanning process of the relay wall (so-called confocal acquisition setup). Lindell et al.⁹ demonstrate another reconstruction method for confocal data transferred from seismic imaging which is called FK Migration. Both algorithms rely on 3D convolutions allowing for fast reconstruction and demonstrate the ability to recover complex scenes from confocal measurements⁹. They require interpolation over irregular 3D grids in order to approximate the data points needed for the convolutions. This requires oversampling the reconstructions and computing nearest neighbors which is associated with significant added memory requirements. The crucial limitation of these methods that we explore in more detail below is, however, that they can only utilize the light returning from the confocal location on the relay wall and thus cannot utilize the vast majority of light available in an NLOS measurement. This is illustrated in our Supplementary Note 4. Lindell et al. also demonstrate a way to approximate non-confocal data as confocal data⁹ for simple planar scenes that allows both LCT and FK Migration algorithms to obtain approximate reconstructions from non-confocal data. Real time reconstruction of low resolution retro-reflective scenes has also been demonstrated in a confocal scanning scenario with both LCT and FK Migration methods. However, the presented confocal real time captures require retroreflective targets that return most reflected light to the moving laser/detection point, while arbitrary diffuse objects require scan times of at least 10 minutes⁹. In this case, the bottleneck of these methods is not the computation, but the acquisition. Furthermore, reconstruction of higher resolution scenes with diffuse surfaces is hindered by the large memory requirements and the inefficient confocal capture process requiring sequential point scanning capture with a single SPAD pixel.

Liu et al.¹⁰ and Reza et al.¹¹ introduce a virtual wave phasor field formalism that is the basis of this work. Using the phasor field method, the NLOS imaging problem can be stated as a line of sight optical imaging problem based on diffraction and solved using existing diffraction theory methods. Recent work also includes further experimental investigation in the propagation of phasor field virtual waves¹², as well as the extension of the phasor field model to scenes with occlusions and specular reflectors¹³ who use the paraxial approximation to obtain an approximate convolution operator to model wave propagation. More insight into the theory of phasor field waves is also provided by Teichman et al.¹⁴.

In this work, we introduce an NLOS reconstruction method using the phasor field formalism along with a convolutional fast Fourier transform (FFT) based Rayleigh Sommerfeld Diffraction (RSD) algorithm to provide fast non-approximative scene reconstructions for general capture setups, in particular including non-confocal setups using a single laser and a sensor array. Our hardware prototype includes a SPAD detector and a picosecond pulse laser which will be mentioned specifically later. When used in the confocal scenario, this new method performs at speed similar to LCT and FK Migration, while requiring significantly less memory. In addition to applying our new algorithm to open source data^9,10, we also perform several additional experiments.

Results

Phasor field NLOS camera

The concept of phasor field NLOS imaging is described in Fig. 1. Data from the scene is collected by illuminating a set of points x_p on a relay surface P and collecting the light returned at points x_c on a relay surface C. This data set represents impulse responses H(x_p → x_c, t) of the scene. Using such an impulse response we can compute the scene response at points x_c to an input signal ${\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)$ as

$${\mathcal{P}}({{\bf{x}}}_{{\rm{c}}},t)=\int _{{\rm{P}}}[{\mathcal{P}}{({{\bf{x}}}_{{\rm{p}}},t)} {\mathop * _{t}} \,H({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},t)]\mathrm{d}{{\bf{x}}}_{{\rm{p}}}$$

(1)

where the ${\mathop {*} \limits_{t}}$ operator indicates a convolution in time. We call the quantities ${\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)$ and ${\mathcal{P}}({{\bf{x}}}_{{\rm{c}}},t)$ phasor field wavefronts. ${\mathcal{P}}({{\bf{x}}}_{{\rm{c}}},t)$ describes the wavefront that would be returned from the scene if it were illuminated by a illumination wave ${\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)$. Reconstructing an image from the wave front of a reflected wave is the fundamental problem solved by a line of sight imaging system. The reconstruction operation

$$I({{\bf{x}}}_{{\rm{v}}},t)=\Phi ({\mathcal{P}}({{\bf{x}}}_{{\rm{c}}},t))$$

(2)

resulting in a 3D image I(x_v) of the scene amounts to propagation of the wavefront at C back into the scene into the points x_v where it has the shape of the scene objects. The Fourier domain version ${\Phi }_{{\mathcal{F}}}(\cdot )$ of the wave propagation operator Φ(⋅) is known as the Rayleigh-Sommerfeld Diffraction (RSD) integral:

$$\Phi ({{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{c}}},\Omega ))={\left|{{\mathcal{R}}}_{{{\bf{x}}}_{{\rm{v}}}}({{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{c}}},\Omega ))\right|}^{2}.$$

(3)

The RSD in the considered context is calculated by

$${\mathcal{R}}_{{\bf{x}}_{\rm{v}}}({\mathcal{P}}_{\mathcal{F}}({{\bf{x}}_{{\rm{c}}}},\Omega))=\alpha ({\bf{x}}_{\rm{v}})\int _{\rm{C}}{\mathcal{P}}_{\mathcal{F}}({\bf{x}}_{\rm{c}},\Omega)\underbrace{\frac{{e}^{-ik\left|{{\bf{x}}}_{\rm{c}}-{\bf{x}}_{\rm{v}}\right|}}{\left|{\bf{x}}_{\rm{c}}-{\bf{x}}_{\rm{v}}\right|}} _{{\rm{RSD}\,{\rm{diffraction}} \, {\rm{kernel}}}}\, {\mathrm{d}}{{\bf{x}}}_{{\rm{c}}}.$$

(4)

In this equation, k = Ω∕c denotes the wavenumber and c across our paper refers to the speed of light. The conventional RSD propagates the electric field, but in this context propagation of an intensity modulation is required. The phasor field RSD differs from the conventional version by the amplitude correction factor α(x_v)¹⁰. This factor depends on the location x_v of the reconstruction point and could be precomputed once the geometry of the relay surface is known. Alternatively, it can be disregarded, as it only causes a slowly varying error in brightness of reconstructed points, but not their location. The RSD in Eq. (4) is a function of each individual monochromatic phasor field component. For this reason, the wavefront ${\mathcal{P}}({{\bf{x}}}_{{\rm{c}}},t)$ received at the aperture has been replaced by its Fourier domain representation ${{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{c}}},\Omega )$. Throughout this paper, frequency domain quantities are denoted by the same variable as the respective time domain quantities, but with the subscript ${\mathcal{F}}$ and the argument angular frequency Ω instead of t. For instance, ${{\mathcal{F}}}_{t}\left({\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)\right)={{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}},\Omega )$ and ${{\mathcal{F}}}_{t}\left(H({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},t)\right)={H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )$, where ${{\mathcal{F}}}_{t}(\cdot )$ denotes the Fourier transform with respect to time. Note that in this paper, the RSD propagation direction is from the camera aperture (i.e., relay surface C) into the reconstruction volume.

**Fig. 1: Illustration of the proposed fast phasor field NLOS imaging method.**

It is important to note that both illumination ${\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)$ and image formation Φ(⋅) are implemented virtually on a computer. For this reason, they can be chosen to mimic any LOS imaging system. For the purpose of NLOS 3D image reconstruction, one option is to choose a transient camera sending a virtual phasor field pulse

$${\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)={e}^{i{\Omega }_{{\rm{C}}}t}\delta ({{\bf{x}}}_{{\rm{p}}}-{{\bf{x}}}_{{\rm{ls}}}){e}^{-\frac{{(t-{t}_{0})}^{2}}{2{\sigma }^{2}}}$$

(5)

from the virtual light source position x_ls into the scene. The center frequency Ω_C has to be chosen according to the spatial relay wall sampling. The smallest achievable wavelength should be larger than twice the largest distance between neighboring points x_p and x_c and larger than the temporal resolution of the imaging hardware¹⁰. For example, given a spatial sampling of 1 cm, the smallest possible modulation wavelength is larger than 2 cm. For the following, we set t₀ = 0. The illumination pulse as a function of time needs to be converted into the frequency domain, so that each corresponding frequency is then propagated separately by the RSD in Eq. (4). The temporal Fourier transform of the illumination phasor field yields

$${{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}},\Omega )= {{\mathcal{F}}}_{t}\left({\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)\right)\\ = \, \delta ({{\bf{x}}}_{{\rm{p}}}-{{\bf{x}}}_{{\rm{ls}}})\left(2\pi \delta {(\Omega -{\Omega }_{{\rm{C}}})} \, {\mathop * _{f}} \, \sigma \sqrt{2\pi }{e}^{-\frac{{\sigma }^{2}{\Omega }^{2}}{2}}\right).$$

(6)

The result ${{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}},\Omega )$ in the frequency domain is a Gaussian centered around the central frequency Ω_C as it is shown in Fig. 1. Figuratively, the RSD propagates the light wave arriving at the aperture (i.e., relay surface C) back into the scene, thereby reconstructing it. Equivalently, one can think of it as a virtual imaging system that forms the image acquired by a virtual sensor behind the relay wall.

After processing all frequency components through space with the RSD, the result at x_v needs to be converted to the time domain again by applying the inverse Fourier transform. The overall reconstruction is therefore calculated by

$$I({{\bf{x}}}_{{\rm{v}}},t)= \left|\int _{-\infty }^{+\infty }{e}^{i\Omega t}{{\mathcal{R}}}_{{{\bf{x}}}_{{\rm{v}}}} \left(\underbrace{\underbrace{{{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}},\Omega )}_{{{\rm{Illumination}} \, {\rm{phasor}} \, {\rm{field}}}}\cdot \, {H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )}_{{{\rm{Phasor}} \, {\rm{field}} \, {\rm{at}} \, {\rm{the}} \, {\rm{camera}} \, {\rm{aperture}} \, ({\rm{relay}} \, {\rm{surface}} \, C)}}\right)\frac{\mathrm{d}\Omega }{2\pi }\, \right|^{2},$$

(7)

where the integral over P has vanished as there is only one virtual illumination point x_ls. Calculating the square is omitted in the actual reconstruction implementation, as it only affects the scene contrast.

Fast phasor field diffraction

The main goal of this paper is to develop a fast 3D NLOS reconstruction method based on the RSD propagator ${{\mathcal{R}}}_{{{\bf{x}}}_{{\rm{v}}}}(\cdot )$ in Eq. (7). Multiple convolutional RSD methods have been introduced in the literature^15,16,17 and form the basis of our approach.

The RSD as defined in Eq. (4) can propagate the wave from an arbitrary surface to any arbitrary point x_v. For fast implementation we constrain the operator to propagate the wave between two parallel planes. This allows us to work with two spatial dimensions. We introduce the scalar coordinates x_c = (x_c, y_c, 0) and x_v = (x_v, y_v, z_v) and rewrite the RSD in Eq. (4) as follows:

$${\mathcal{P}}_{\mathcal{F}}({\bf{x}}_{\rm{v}},\Omega) = \,{\mathcal{R}}_{{z}_{\rm{v}}}({\mathcal{P}}_{\mathcal{F}}({\bf{x}}_{\rm{p}},\Omega )\cdot {H}_{\mathcal{F}}({\bf{x}}_{\rm{p}}\to {\bf{x}}_{\rm{c}},\Omega))\\ = \,{\mathcal{R}}_{{z}_{\rm{v}}}({\mathcal{P}}_{\mathcal{F}}({\bf{x}}_{\rm{c}},\Omega))\\ {\mathcal{P}}_{\mathcal{F}}({x}_{\rm{v}},{y}_{\rm{v}},{z}_{\rm{v}},\Omega )= \,{\mathcal{R}}_{{z}_{\rm{v}}}({\mathcal{P}}_{\mathcal{F}}({x}_{\rm{c}},{y}_{\rm{c}},0,\Omega))\\ = \iint_{\!\!\!\!\!-\infty }^{{+\infty}}{\mathcal{P}}_{\mathcal{F}}({x}_{\rm{c}},{y}_{\rm{c}},0,\Omega)\underbrace{\frac{\alpha ({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}}){e}^{-i\frac{\Omega }{c}\sqrt{{({x}_{{\rm{c}}}-{x}_{{\rm{v}}})}^{2}+{({y}_{{\rm{c}}}-{y}_{{\rm{v}}})}^{2}+{z}_{{\rm{v}}}^{2}}}}{\sqrt{{({x}_{{\rm{c}}}-{x}_{{\rm{v}}})}^{2}+{({y}_{{\rm{c}}}-{y}_{{\rm{v}}})}^{2}+{z}_{{\rm{v}}}^{2}}}}_{{\rm{RSD}} \, {\rm{diffraction}} \, {\rm{kernel}}}\ \mathrm{d}{\it{x}}_{{\rm{c}}}\ \mathrm{d}{\it{y}}_{{\rm{c}}}\\ = \iint _{\!\!\!\!\!-\infty }^{{+\infty}}{{\mathcal{P}}}_{{\mathcal{F}}}({x}_{{\rm{c}}},{y}_{{\rm{c}}},0,\Omega )\cdot \underbrace{G({x}_{{\rm{v}}}-{x}_{{\rm{c}}},{y}_{{\rm{v}}}-{y}_{{\rm{c}}},{z}_{{\rm{v}}},\Omega )}_{2{\rm{D}} \, {\rm{convolution}} \, {\rm{kernel}}}\ \mathrm{d}{\it{x}}_{{\rm{c}}}\ \mathrm{d}{\it{y}}_{{\rm{c}}}\\ = \underbrace{{{\mathcal{P}}}_{{\mathcal{F}}}({x}_{{\rm{c}}},{y}_{{\rm{c}}},0,\Omega )* G({x}_{{\rm{c}}},{y}_{{\rm{c}}},{z}_{{\rm{v}}},\Omega )}_{{{\rm{Spatial}} \, 2{\rm{D}} \, {\rm{convolution}}}},$$

(8)

where the geometrical setup is illustrated in Fig. 2. Equation (8) considers two parallel planes with spacing z_v in a Cartesian coordinate system. For this reason, the RSD notation changed from ${{\mathcal{R}}}_{{{\bf{x}}}_{{\rm{v}}}}(\cdot )$ for the point x_v to ${{\mathcal{R}}}_{{z}_{{\rm{v}}}}(\cdot )$ to indicate that the propagation holds for all points in the plane at distance z_v from the relay wall. For a single frequency component Ω, the relation between the wavefront ${{\mathcal{P}}}_{{\mathcal{F}}}({x}_{{\rm{c}}},{y}_{{\rm{c}}},0,\Omega )$ at the camera aperture plane and the wavefront ${{\mathcal{P}}}_{{\mathcal{F}}}({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}},\Omega )$ at the virtual image plane is a two-dimensional spatial convolution with the 2D convolution kernel defined by $G({x}_{{\rm{c}}},{y}_{{\rm{c}}},{z}_{{\rm{v}}},\Omega )=\frac{\alpha ({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}})\cdot \exp (-i\frac{\Omega }{c}\sqrt{{x}_{{\rm{c}}}^{2}+{y}_{{\rm{c}}}^{2}+{z}_{{\rm{v}}}^{2}})}{\sqrt{{x}_{{\rm{c}}}^{2}+{y}_{{\rm{c}}}^{2}+{z}_{{\rm{v}}}^{2}}}$ where the factor α(x_v, y_v, z_v) will be ignored during reconstruction. Note that the RSD in Eq. (8) needs to be calculated for each individual frequency component ${{\mathcal{P}}}_{{\mathcal{F}}}({x}_{{\rm{c}}},{y}_{{\rm{c}}},0,\Omega )$. Considering the virtual pulse illumination in Eq. (5), the wavefront ${{\mathcal{P}}}_{{\mathcal{F}}}({x}_{{\rm{c}}},{y}_{{\rm{c}}},0,\Omega )$ is a broad-band signal; its spectrum is a Gaussian centered around Ω_C as shown in Eq. (6). For this reason, it is sufficient to consider the frequency range Ω ∈ [Ω_C − ΔΩ, Ω_C + ΔΩ]. Although the magnitude is not completely zero outside this interval, it is very small and can be neglected. The chosen range ΔΩ depends on the virtual illumination pulse bandwidth and thus on the pulse width parameter σ. Thus, applying Eq. (8) for the frequencies Ω ∈ [Ω_C − ΔΩ, Ω_C + ΔΩ] and subsequent inverse Fourier transform with respect to time

$${\mathcal{P}}({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}},t)=\int _{{\Omega }_{{\rm{C}}}-\Delta \Omega }^{{\Omega }_{{\rm{C}}}+\Delta \Omega }{e}^{j\Omega t}\cdot \underbrace{{{\mathcal{R}}}_{{z}_{{\rm{v}}}}\left({{\mathcal{P}}}_{{\mathcal{F}}}({x}_{{\rm{c}}},{y}_{{\rm{c}}},0,\Omega )\right)}_{{{\rm{Monochromatic}} \, {\rm{wavefront}} \, {\rm{at}} \, {\rm{depth}}} \, {z}_{{\rm{v}}}}\ \frac{\mathrm{d}\Omega }{2\pi }$$

(9)

is equivalent to sending the designed modulated virtual illumination pulse wavefront ${\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)={e}^{i{\Omega }_{{\rm{C}}}t}{e}^{-\frac{{t}^{2}}{2{\sigma }^{2}}}$ into the hidden scene, capturing its reflection at the visible relay wall, and propagating it back into the scene or imaging it onto a virtual imaging sensor using a virtual lens. The relay wall functions as a virtual aperture. The output ${\mathcal{P}}({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}},t)$ in Eq. (9) depends on the time t, as each reconstruction point is illuminated only for a short period of time. Taking the absolute value of ${\mathcal{P}}({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}},t)$ in Eq. (9) and squaring it makes us arrive at a 4D reconstruction (cf. Eq. (7)). We can understand this reconstruction as a movie of a virtual pulse traveling through the hidden scene, as shown in Fig. 3. In this figure, a patch shaped as a 4 is being illuminated by a spherical wavefront coming from the illumination point on the relay surface.

**Fig. 2: Rayleigh Sommerfeld Diffraction (RSD) calculation.**

**Fig. 3: Space-time wave propagation using RSD.**

The process of reconstructing a 4D model for achieving a 3D spatial reconstruction is unnecessarily time-consuming. The 3D reconstruction of the scene can be obtained from the movie by freezing the time of arrival corresponding to the peak of the illumination pulse for each voxel. Thus each voxel contains only the direct (3rd) bounce signal from the hidden object. This can be performed by calculating the spherical geometry as a function of point source illumination position (x_ls, y_ls, 0) and replacing t at each voxel (x_v, y_v, z_v):

$$t:= \frac{1}{c}\sqrt{{({x}_{{\rm{v}}}-{x}_{{\rm{ls}}})}^{2}+{({y}_{{\rm{v}}}-{y}_{{\rm{ls}}})}^{2}+{z}_{{\rm{v}}}^{2}}.$$

(10)

In this equation, we have used the scalar representation x_ls = (x_ls, y_ls, 0) of the virtual light source position. Replacing t by the appropriate spatial coordinates as described in this equation leads to a 3D virtual camera that only sees the direct bounce from the hidden object and removes the fourth dimension; the respective voxels are reconstructed at exactly the time when the pulse arrives. This leads to a more time-efficient reconstruction than acquiring the full 4D wavefront.

However, there is one problem that needs to be taken care of. While the RSD in Eq. (8) is calculated at planes parallel to the aperture, the illumination pulse spreads spherically from the light source. Theoretically, this means each point on the plane should be reconstructed with a different time shift as given in Eq. (10) which leads to an integral expression, i.e., the inverse Fourier transform at the plane should be calculated with a different time shift at each voxel. In order to circumvent this tedious process, it is reasonable to split the plane into sections and within each section use the same (artificially corrected) travel time. Since the virtual illumination pulse has a finite temporal width, there can be significant overlap between a RSD reconstruction plane and the piece-wise pulse time shift. The spatial sectioning, i.e., the assignment which spatial region is reconstructed with the same time shift, is illustrated in Fig. 4.

**Fig. 4: Spatial sectioning for determining the piece-wise time offset.**

Our objective thus is to reconstruct each voxel at depth z at a time t when it is actually illuminated by the virtual pulse. We first define the spatial pulse width D = c ⋅ σ∕0.15. In the next step, the radial difference between any voxel on the reconstruction plane and the maximum of the pulse tangential to the reconstruction is calculated by

$$E({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}})=\sqrt{{({x}_{{\rm{v}}}-{x}_{{\rm{ls}}})}^{2}+{({y}_{{\rm{v}}}-{y}_{{\rm{ls}}})}^{2}+{z}_{{\rm{v}}}^{2}}-{z}_{{\rm{v}}}.$$

(11)

The geometry is shown in Fig. 4, where only the 3D cross-section at y_v = 0 is displayed. The spatial sectioning is defined via the functions

$${M}_{1}({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}}) =\left\{\begin{array}{ll}1&0\le E({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}})\le \frac{D}{2}\\ 0&{\rm{else}}\hfill\end{array}\right.,\quad {B}_{1}=\tilde{D},\\ {M}_{2}({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}}) =\left\{\begin{array}{ll}1&\frac{D}{2} \, < \, E({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}})\le \frac{3D}{2}\\ 0&{\rm{else}}\hfill\end{array}\right.,\quad {B}_{2}=\tilde{D}+D, \\ \vdots \\ {M}_{L}({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}}) =\left\{\begin{array}{ll}1&(L-\frac{3}{2})D \, < \, E({x}_{{\rm{v}}},{y}_{{\rm{v}}},{z}_{{\rm{v}}})\le (L-\frac{1}{2})D\\ 0&{\rm{else}}\hfill\end{array}\right.,\quad {B}_{L}=\tilde{D}+(L-1)D,$$

(12)

which also tell us which spatial regions have to use which distance shift B₁, …, B_L. The virtual illumination pulse illuminates a spherical shell of thickness D that moves outward with time. The red and green shells in Fig. 4 illustrate the pulse positions at two different time instances, spatially separated by D. Depending on the distance between reconstruction voxel (x_v, y_v, z_v) and the pulse maxima, this voxel will get assigned the time of the closest pulse maximum. Note that planes at a larger distance z_v will have larger central regions that are treated with the same time shift. For example, a reconstruction plane far away from the relay wall may lie completely inside the shell of a single pulse and is therefore not split into sections. The described arrival time correction therefore accounts for the difference between the z-coordinate of a voxel and its distance to the virtual illumination source that determines the time t when it is illuminated. Allowing for a range D around the pulse means that not the maximum of the Gaussian illumination but a point near the maximum with a somewhat lower magnitude is used. The mismatch between reconstruction planes and illumination spheres therefore only results in differences in reconstruction brightness, but not in reconstructed scene geometry.

Since D is usually on the order of 20–30 cm, most simple scenes considered in this paper can be reconstructed using a single spatial section (L = 1). Larger field of view examples such as the Office Scene in the result section require two spatial sections (L = 2). The reconstruction of a larger field of view scenario using one spatial section will have a vignetting effect as if this virtual imaging system had a poor imaging quality due to a oversimplified lens design. This vignetting effect is shown in Fig. 5: On the left of the figure, one distance shift B₁ is used for reconstructing both spatial regions M₁ and M₂ which is equivalent to using one section. On the right, two different distance shift values B₁ and B₂ (see Eq. (12)) are used for M₁ and M₂.

**Fig. 5: Larger field of view scenario with two versions of the office scene.**

Equation (12) contains the constant offset parameter $\tilde{D}$. This is zero for a perfectly calibrated system, such as a simulated scene, but can be adjusted to a nonzero value to account for hardware calibration in real-world experiments.

Then, the overall scene reconstruction (see Eqs. (2) and (9)) can be written as

$$I({\bf{x}}_{\rm{v}},t)= \, \left| \int_{{\Omega }_{\rm{C}}-\Delta \Omega }^{{\Omega }_{\rm{C}}+\Delta \Omega} \sum _{l = 1}^{L}{M}_{l}({x}_{\rm{v}},{y}_{\rm{v}},{z}_{\rm{v}})\cdot \exp \Bigg(i\frac{\Omega }{c}({z}_{\rm{v}}+{B}_{l})\Bigg)\right.\\ \left. \cdot {\Bigg(\underbrace{{\mathcal{P}}_{\mathcal{F}}({x}_{\rm{c}},{y}_{\rm{c}},0,\Omega )* G({x}_{\rm{c}},{y}_{\rm{c}},{z}_{\rm{v}},\Omega )}_{2{\rm{D}} \, {\rm{convolution}}, \, {\rm{implemented}} \, {\rm{as}} \, 2{\rm{D}} \, {\rm{FFT}}}\Bigg)}\frac{\mathrm{d}\Omega}{2\pi}\right|^{2}.$$

(13)

The functions M_l(x_v, y_v, z_v) cut out spatial regions and tell us where to use which distance shift B_l, l = 1, …, L; wherever M_l(x_v, y_v, z_v) is 1, the corresponding B_l is used.

Performing the NLOS reconstruction with the described RSD operator has some other advantages apart from low time and memory requirements, as shown in the results section (Section 3). The RSD calculation is easily parallelizable, because the reconstructions at different plane depths z_v do not depend on each other, and Eq. (13) can be applied to each plane separately. This is in contrast to the LCT and the FK Migration methods, which perform 3D Fourier transforms of the acquired confocal data. For the RSD, when performing the reconstructions not in parallel, but starting at the relay wall and subsequently proceeding to larger depths, the memory requirement can be drastically reduced. For deriving 3D images of the reconstructed scene, it is sufficient to calculate the maximum of all reconstruction voxels along z_v and store its index. This is a sparse representation of the full 3D data volume. When reconstructing by moving away from the relay wall, only the current maxima and indices of the respective z_v voxel columns need to be stored, and not the full 3D reconstruction results which would require gradually increasing memory.

For all reconstructions, only a certain number of discrete frequencies Ω in the interval [Ω_C − ΔΩ, Ω_C + ΔΩ] is propagated. It is important to point out how the number of Fourier components that are used for reconstruction is defined. The variable β determines the number of wavelengths λ that fit into one pulse; D = βλ. The larger β, the smaller the width of the frequency domain Gaussian. γ is the peak ratio, i.e., only the frequency components with amplitude higher than γ are propagated. The smaller ones are neglected because they hardly contribute to the overall signal. Throughout the paper, we set γ to 0.01, meaning that all frequency components with magnitude smaller than 1% of the maximum magnitude are ignored.

The discrete spacing ${\Omega }_{{\rm{res}}}$ of the considered frequency components is given by the FFT frequency resolution:

$${\Omega }_{{\rm{res}}}=2\pi \frac{{f}_{{\rm{sampling}}}}{{N}_{{\rm{bins}}}}\ ,$$

(14)

where f_sampling is the sampling frequency of the histograms (i.e., 1/bin width) and N_bins the number of time bins.

The number of Fourier components depends on the choice of the virtual illumination pulse. In large scenes, it would also increase with scene depth which would increase computational and memory complexity. To avoid this, large scenes would have to be reconstructed in multiple depth sections. In this work, we reconstruct scenes with depth up to 3 m representing the largest complex scenes for which data exist. For these scenes, a depth sectioning step is not necessary.

Fourier domain histogram (FDH) single photon capture

According to Eq. (7), the virtual wave acquired at the virtual aperture is calculated by ${{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{c}}},\Omega )={{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}},\Omega )\cdot {H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )$. This requires the Fourier domain representation of the impulse response ${H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )$ from x_p to x_c. A new memory efficient direct acquisition method for ${H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )$ is presented in the following.

The SPAD detector uses time-correlated single photon counting (TCSPC) to generate the transient responses H(x_p → x_c, t). After the emission of a laser pulse, a SPAD pixel receives one photon and an electronic signal is transmitted to the TCPSC unit that encodes the time between the emission of the laser pulse and the detection of an associated returning photon. The arrival times of all photons during a measurement interval are transferred to a computer and are arranged in a histogram to obtain the transient scene response H(x_p → x_c, t) for a given x_p and x_c. To obtain ${H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{p}}},\Omega )$ we could collect and store these TCSPC histograms and perform the Fourier transform on it. A more memory efficient way is to build the frequency spectrum directly from the timing data obtained from the hardware. We call this new capturing method a FDH and its creation process is shown in Fig. 6. It can be written as

$${H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )= \int _{-\infty }^{+\infty }H({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},t)\cdot {e}^{-i\Omega t}\ \mathrm{d}{\it{t}}\\ = \int _{-\infty }^{+\infty }\left(\sum _{n = 1}^{N}\delta (t-{T}_{n})\right)\cdot {e}^{-i\Omega t}\ \mathrm{d}{\it{t}}\\ = \sum _{n = 1}^{N}{e}^{-i\Omega {T}_{n}}\ .$$

(15)

**Fig. 6: Illustration of Fourier Domain Histogram.**

The travel times T_n are discrete; the time resolution is determined by the acquisition hardware (in the context of NLOS imaging typically a few to tens of picoseconds). Equation (15) means that the FDH ${H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )$ is acquired by multiplying each of the N photon travel times T_n, n = 1, …, N, by a phase term depending on the considered frequency Ω and adding the result to the previous value for that frequency. As a consequence, instead of a large number of time bins (on the order of thousands), only one value for each Ω (typically dozens, as shown in the results in Section 3) needs to be stored and processed. Figure 6 illustrates the generation of the FDH. Similar to the time domain histogram binning, this FDH performs binning for each captured photon.

We want to remark that the travel times T_n in Eq. (15) are measured from the respective illumination position on the relay wall into the scene and back to the relay wall at the detector focus position. The travel times from the laser setup to the illumination on the relay wall and from the detector focus point on the relay wall to the detector setup have been subtracted and are not part of H. Alternatively, the total travel time from laser to detector can be incorporated and the travel times from laser to wall and wall to detector are combined into Δt. The final result from Eq. (15) is then multiplied by e^iΩΔt to correct for this constant time offset.

Phasor field NLOS camera for confocal measurements

The RSD reconstruction method for NLOS data presented so far only deals with the non-confocal case, which means that the illumination point x_p and the camera point x_c on the relay wall are different. However, a confocal dataset H^c(x_p → x_c, t) as used in LCT and FK migration algorithms^8,9 only contains data with x_p = x_c:

$${H}^{{\rm{c}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},t)=H({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},t)\delta ({{\bf{x}}}_{{\rm{p}}}-{{\bf{x}}}_{{\rm{c}}}).$$

(16)

Such a dataset is not suitable for implementing the virtual point light source described in Eq. (5). Instead, we can model an illumination wavefront that is focused on x_v:

$${\mathcal{P}}({{\bf{x}}}_{{\rm{p}}},t)={e}^{i\Omega (t-\frac{1}{c}| {{\bf{x}}}_{{\rm{v}}}-{{\bf{x}}}_{{\rm{p}}}| )}{e}^{-\frac{{(t-{t}_{0}-\frac{1}{c}| {{\bf{x}}}_{{\rm{v}}}-{{\bf{x}}}_{{\rm{p}}}| )}^{2}}{2{\sigma }^{2}}}.$$

(17)

Setting t₀ to 0 and applying the Fourier transform leads to

$${{\mathcal{P}}}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}},\Omega )=\left(2\pi \delta {(\Omega -{\Omega }_{{\rm{C}}})}{\mathop * _{f}}\sqrt{2\pi }\sigma {e}^{-\frac{{\sigma }^{2}{\Omega }^{2}}{2}}\right){e}^{-i\frac{\Omega }{c}| {{\bf{x}}}_{{\rm{v}}}-{{\bf{x}}}_{{\rm{p}}}| }.$$

(18)

Inserting into Eq. (7) yields

$$ I({{\bf{x}}}_{{\rm{v}}},t)=\\ \,\,\,\,\,\,\,\,{\left|\int _{-\infty }^{+\infty }{e}^{i\Omega t}{{\mathcal{R}}}_{{{\bf{x}}}_{{\rm{v}}}}\left({\int }_{\!\!\!\!P}{(2\pi )}^{\frac{3}{2}}\sigma {e}^{-\frac{{\sigma }^{2}{(\Omega -{\Omega }_{{\rm{C}}})}^{2}}{2}}{e}^{-i\frac{\Omega }{c}| {{\bf{x}}}_{{\rm{v}}}-{{\bf{x}}}_{{\rm{p}}}| }\delta ({{\bf{x}}}_{{\rm{p}}}-{{\bf{x}}}_{{\rm{c}}})\cdot {H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{p}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )\mathrm{d}{{\bf{x}}}_{{\rm{p}}}\right)\frac{\mathrm{d}\Omega }{2\pi }\right|}^{2}\\ ={\left|\int _{-\infty }^{+\infty }{e}^{i\Omega t}{{\mathcal{R}}}_{{{\bf{x}}}_{{\rm{v}}}}\left({(2\pi )}^{\frac{3}{2}}\sigma {e}^{-\frac{{\sigma }^{2}{(\Omega -{\Omega }_{{\rm{C}}})}^{2}}{2}}{e}^{-i\frac{\Omega }{c}| {{\bf{x}}}_{{\rm{v}}}-{{\bf{x}}}_{{\rm{c}}}| }\cdot {H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{c}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega )\right)\frac{\mathrm{d}\Omega }{2\pi }\right|}^{2}\\ ={\left|\int _{-\infty }^{+\infty }{e}^{i\Omega t}{\int }_{\!\!\!\!C}{(2\pi )}^{\frac{3}{2}}\sigma {e}^{-\frac{{\sigma }^{2}{(\Omega -{\Omega }_{{\rm{C}}})}^{2}}{2}}{e}^{-i\frac{\Omega }{c}| {{\bf{x}}}_{{\rm{v}}}-{{\bf{x}}}_{{\rm{c}}}| }\cdot {H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{c}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega ){e}^{-ik| {{\bf{x}}}_{{\rm{v}}}-{{\bf{x}}}_{{\rm{c}}}| }\mathrm{d}{{\bf{x}}}_{{\rm{c}}}\frac{\mathrm{d}\Omega }{2\pi }\right|}^{2}\\ ={\left|\int _{-\infty }^{+\infty }{e}^{i\Omega t}{\int }_{\!\!\!\!C}{(2\pi )}^{\frac{3}{2}}\sigma {e}^{-\frac{{\sigma }^{2}{(\Omega -{\Omega }_{{\rm{C}}})}^{2}}{2}}\cdot {H}_{{\mathcal{F}}}({{\bf{x}}}_{{\rm{c}}}\to {{\bf{x}}}_{{\rm{c}}},\Omega ){e}^{-2ik| {{\bf{x}}}_{{\rm{v}}}-{{\bf{x}}}_{{\rm{c}}}| }\mathrm{d}{{\bf{x}}}_{{\rm{c}}}\frac{\mathrm{d}\Omega }{2\pi }\right|}^{2}.$$

(19)

The reconstruction thus uses an RSD operator with an additional factor of two doubling all distances. We use our fast RSD operator to evaluate this RSD integral.

Both the computational implementation steps and the pseudocode of the presented RSD NLOS reconstruction algorithm are available in Supplementary Note 3.

Acquisition hardware

Most of our results in Figs. 7 and 8 are obtained on a publically available dataset¹⁰. In addition we provide three additional datasets (Fig. 8 rows 1, 2, and 4). The experimental setup used to create all those datasets consists of a gated single-photon avalanche diode (SPAD) with a Time-Correlated Single Photon Counter (TCSPC, PicoQuant HydraHarp) with a time resolution of about 30 ps and a dead time of 100 ns to measure the time response as well as a pico-second laser (Onefive Katana HP amplified diode laser with 1 W at 532 nm, and a pulse width of about 35 ps used at a repetition rate of 10 MHz) as light source. The entire system’s temporal resolution is around 70 ps. We perform several new non-confocal experiments (20 ms exposure time Office Scene in Supplementary Table 1 and two scenes showing patches 4 and 44i, Supplementary Table 2) and use existing experimental non-confocal¹⁰ and confocal⁹ datasets to compare our method against the literature. The experiments are performed using the non-confocal acquisition scheme. The detection aperture on the relay wall is around 1.8 m by 1.3 m with 1 cm spacing between each captured time response. This yields 181 by 131 captured time responses for each scene. Scene descriptions are provided in Supplementary Table 3 including scene depth complexity and target materials.

Fig. 7: Methods comparison on Office Scene: Exposure time per each pixel measurement from first row to last row is 1 ms, 5 ms, 10 ms, 20 ms, 1000 ms (note that the 1000 ms Office Scene dataset was acquired with slight differences in the object location).

**Fig. 8: Methods comparison on simple targets: Exposure time for these scenes are all 1000 ms per each pixel measurement.**

Reconstructions

Reconstructions with maximum intensity projection along the depth direction are shown in Figs. 7 and 8. Results with three-dimensional volume rendering are shown in Supplementary Fig. 2. For the non-confocal dataset, we consider three solvers: our proposed fast RSD based solver, the back-projection solver presented previously¹⁰ (denoted by Direct Integration) and two approximate fast methods: LCT and FK Migration, referred to as approx LCT and approx FK. Both of them cannot operate on the non-confocal data used here, however, Lindell et al.⁹ describe a way to turn a non-confocal dataset into an approximately confocal dataset that allows application of non-confocal methods. We implemented this approximate method based on the description and show the approximate reconstructions in the last two columns of Figs. 7 and 8. We refer to this approximation as midpoint approximation. The Direct Integration solver is slow because of the discrete integration step, but we use it as an accurate theoretical calculation reference for our method. Both approx LCT and approx FK yield blurry results compared to both the proposed RSD and Direct Integration when applied to the single plane targets shown in Fig. 8. Beside the simple plane scenes, we consider a more complex Office Scene with multiple targets and targets outside the scanning aperture with a large field of view. The results are shown in Fig. 7. None of the approximate solutions achieves the imaging quality of the phasor field solution (first three columns in Fig. 7). There are two properties of the approximate solutions: The LCT and FK Migration methods inherently can only recover objects within the aperture, and, to make things worse, the approximation made by converting non-confocal into confocal datasets results in an even smaller aperture. To recover a larger hidden volume with a larger field-of-view of the virtual image, we perform a zero padding step at the aperture to make it larger. Even in this case, none of the approximate solutions provides sharper and well-focused images than the RSD-based reconstruction algorithms.

One thing we would like to point out about Fig. 7 for our proposed method is that as the exposure decreases down to 1 ms, the calculation error is highlighted as a almost constant background. We can reduce this artifact in the short exposure scenario by increasing the number of used Fourier components to mimic a shorter Gaussian envelope for the illumination pulse. This effect of choosing a different number of Fourier components for the final results as well as the corresponding execution time is also shown in Fig. 9 on the 20 ms Office Scene.

**Fig. 9: Virtual illumination function design space and reconstruction speed.**

For both simple scene and complex Office Scene results, our proposed methods are much faster with reconstructions in seconds. The exact run times of the un-optimized solvers discussed above are given in Supplementary Tables 1 and 2. All computational parameters (number of Fourier components for the new RSD method, reconstruction volume size, voxel grid resolution etc.) used for creating the reconstruction results in Figs. 7 and 8 are provided in Supplementary Table 4.

Discussion

To the best of our knowledge, our proposed method is the first to solve the general non-confocal NLOS imaging scenario with a similar time requirement and computational complexity as the fastest existing algorithms. In contrast to them, however, our method has much lower memory requirements. This allows us to reconstruct larger scenes and will enable implementation on embedded systems and GPU units where memory is limited. We believe our method will enable real time NLOS imaging and reconstruction of large room scale scenes at full resolution. In this section, we discuss some related NLOS imaging works which currently fail to support real time NLOS and the computational complexity of our proposed method.

We discussed some related works which currently cannot support real time NLOS imaging scenarios. Such reconstruction methods include a fast GPU backprojection solver¹⁸. This method solves the back-projection method faster than CPU implementations, but is still too slow to operate in real time, partially to high memory bandwidth requirements. The current implementation also does not support negative numbers and double precision, both of which are necessary for more advanced phasor field backprojection applications. First returning photon and Fermat path theory can recover surface geometry^19,20 of simple scenes with single objects. Improved iterative back-projection solutions using a new rendering model and frequency analysis^21,22 can create particularly high quality surface reconstructions. Full color NLOS imaging with single pixel photo-multiplier tube combined with a mask^23,24 has also been demonstrated. Further work includes real-time transient imaging for amplitude modulated continuous wave lidar applications²⁵, analysis of missing features based on time-resolved NLOS measurements²⁶, convolutional approximations to incorporate priors into FBP²⁷, occlusion-aided NLOS imaging using SPADs^28,29, Bayesian statistics reconstruction to account for random errors³⁰, temporal focusing for a hidden volume of interest by altering the time delay profile of the hardware illumination³¹, and a database for NLOS imaging problems with different acquisition schemes³². Reconstruction times for all these methods remain in the minutes to hours range even for small scenes of less than a meter in diameter. To the best of our knowledge, none of the works above have been applied successfully to larger and more complex scenes with the exception of the back-projection based methods. Ahn et al.²⁷ can improve the reconstruction quality after the back-projection via an iterative convolution step. Since the method involved a back-projection as it’s first step it shares the speed and complexity disadvantages of the back-projection based methods mentioned above. In addition, the resolution of an NLOS reconstruction is limited by the time resolution of the detection system⁸. For a SPAD, the time resolution is 30 ps at best leading to a theoretically achievable grid resolution of 1 cm in the hidden scene. Methods that can process scenes of moderate and high volume and complexity include FK Migration, the LCT, and Phasor-Field virtual waves which are discussed in this paper.

There are also several contributions showing that it is possible to do NLOS imaging without picosecond scale time resolution or with non-optical signals: Inexpensive nanosecond time of flight sensors can be used to recover the hidden scene³³, tracking can be performed using intensity based NLOS imaging ³⁴, occlusions are harnessed to recover images around a corner using regular cameras^35,36,37, even describing the occlusion-aided method as a blind deconvolution problem without knowledge of the occluder³⁸. Other approaches decode the hidden object from regular camera images by using a deep neural network trained with simulated data only³⁹, or use acoustic⁴⁰ or long-wave infrared⁴¹ signals to image around the corner. While promising for low cost applications, none of these methods achieve reconstruction qualities comparable to the picosecond time-resolved NLOS imaging approaches.

Our proposed method is computationally bounded by the FFT process. Let N denote the number of pixels along each of the three spatial dimensions of the reconstruction space. Calculating the RSD reconstruction requires a 2D FFT at each of the N depth planes for each Fourier component. The computational complexity of the presented algorithm is then given by

$${\mathcal{O}}({N}^{3}\mathrm{log}\,N)$$

(20)

because the number of Fourier components is just a constant by performing reconstructions in multiple depth sections which is shown in Section “Fast Phasor Field Diffraction”. LCT and FK have the same complexity as described in the respective papers^8,9; all other methods applied to complex scenes published so far have higher complexity. The memory complexity of our algorithm is defined by the need to store the FDH (details provided later in the Methods section) and the resulting 2D image. For the scenes described here this is actually O(N²). In larger scenes we would need to store multiple FDHs for multiple depth sections in order to maintain low computational complexity. In this case memory complexity is O(N³).

The computational complexity for our proposed solution, LCT and FK are the same from the theoretical point of view. In practice, due to the need for oversampling and interpolation the actual memory requirement for each method is several hundred times higher than ours in their current form. Unfortunately there are many different options with different trade-offs and it is not completely clear which is used inside the Matlab interpolation functions used in the current algorithms. Existing papers on FK Migration typically discuss their particular choices and their impact on memory complexity and reconstruction quality in considerable detail^42,43. To get a better understanding of the source of the memory requirements, let us consider the requirements for our method and FK Migration as an example.

Consider a scene with the size of the Office Scene from 0 to 2.5 m away from the relay wall that is used in this paper. The temporal measurements are collected from 150 by 150 spatial sampling points. For our proposed method, storing the FDH requires 150*150*139*4 (139 is the number of used frequency components for the similarly sized Office Scene as shown in Supplementary Table 4) bytes which is around 12.51 MB (or 25 MB to store both real and imaginary parts). Algorithms exist that can compute the FFT without requiring extra working memory. Our reconstructions are computed slice by slice and only the maximum is kept. The only additional memory required is to store the 2D result. If we would like to create a 3D visualization, we have to store the index of the maximum. This requires 150*150*4*2 = 180 kB of additional memory. The total memory required is thus 50.18 MB.

FK Migration needs the histogram in the time domain. Sampling resolution in the histogram and resolution in the Fourier domain are linked through the FFT and cannot be chosen freely. To cover this scene setup with 32 ps temporal sampling rate, at least 512 temporal sampling bins are required for each captured time response to cover the light path round trip in 5 m. Assume each temporal bin is in single-precision using 4 bytes. For LCT/FK, one needs to store 150*150*512*4 bytes which is around 46 MB. This is already significantly more than the memory requirement of our entire algorithm. We assume the 3D FFT can be performed without additional working memory. For FK and LCT, two extra steps are required apart from the 3D FFT. The first extra step is to oversample the DFT by zero padding the data before the Fourier transform. This provides higher resolution in the Fourier domain and makes the following interpolation step easier. The current implementation increases the size of the data by 2 in each dimension by zero padding resulting in a memory need of 0.368 GB (2³*46 MB). This 3D dataset structure is complex-valued and needs an additional second channel to store real and imaginary part. The second extra step is to perform the interpolation to compute the points in Fourier space that are needed as input for the inverse 3D FFT. As is stated directly in the literature, this interpolation step is the complexity bottleneck for FK Migration⁴². The current FK Migration code uses the Matlab function interp⁹. That uses two neighbor points along each dimension to perform a linear 3D interpolation. Without prior assumptions about the structure of the grids, search in nearest neighbors would have a computational complexity for O(N⁶) which is impractical. This can be improved by pre-computing a map of nearest neighbors using a faster algorithm like a k-d tree. To store the six nearest neighbors of each data point requires 2.21 GB (6*2³*46 MB). Then the linear interpolation if implemented in this way would require 2.21 GB of working memory in addition to the size of the data itself. While we can’t be sure that this is what Matlab is doing, the memory load is consistent with our measurements. The memory profile while running both methods on our captured dataset is shown in Supplementary Fig. 5 and its order of magnitude coincides with the estimate. After inverse 3D Fourier transform the final result is a sparse three-dimensional complex matrix of size 150*150*300 or larger. The current method reconstructs a higher resolution matrix as a side effect of the oversampling. This is not actually needed and doesn’t significantly affect the result. We thus have an additional memory need of 150*150*300*2*4 = 54 MB. Note again that just the result takes up more memory than our entire computation. This results in a total peak memory use of 2.21 GB + 2*0.368 GB = 2.946 GB. There are several ways that can likely reduce this memory load.

Knowledge of the relative layouts of the two grids may reduce or eliminate the requirement for working memory in the interpolation. One can also fine tune the trade-off between Fourier domain oversampling, more sophisticated interpolation methods, and reconstruction quality. It might also be possible to perform further down-sampling along the temporal dimension and use single instead of double precision variables to require less memory. These approaches are interesting topics for future research and can draw from considerable prior work on this problem in related FK Migration application areas. At present, however, the method takes several hundred times more memory than our proposed method. The LCT includes a similar re-sampling step that creates large memory requirements. Re-sampling and interpolation problems in this domain are studied in the literature covering planar and spherical inverse radon transforms.

Methods

Discrete phasor field diffraction model and implementation

In this section, the computational implementation for the model derived in the main paper is described. We will explain the discrete RSD model and implementation here; the respective pseudocode is provided in Supplementary Note 3. We introduce the RSD discrete model and link it to physical measurement parameters (scanning aperture size, sensor grid spacing) and then provide the corresponding algorithmic implementation procedure as a guideline.

We provide a description for the FFT based RSD solver implementation for ${{\mathcal{R}}}_{{z}_{{\rm{v}}}}(\cdot )$ in Eq. (8). For an actual algorithm implementation, it is necessary to discretize the continuous model. Considering discrete parameters such as a finite size square aperture sampling both the camera aperture and reconstruction planes C and V at uniform distances δ_in and δ_out, the wavefront is a matrix of size N × N. We use the symbols [nx_v, ny_v] and [nx_c, ny_c] to represent the discrete indices. We consider δ_in = δ_out = δ spatial sampling in both input and output domains where $\delta =\frac{\lambda }{2}$ is the maximum sampling distance⁴⁴. The variable Z denotes the maximum value of z_v. For brevity, all following equations ignore the frequency variable Ω of the input ${{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}}]$ and output ${{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{v}}},n{y}_{{\rm{v}}}]$ wavefronts. Overall, with these discrete parameters, the RSD operator in Eq. (8) can be written as a standard discrete convolution as follows:

$${{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{v}}},n{y}_{{\rm{v}}}]= \mathop{\sum\sum}\limits^ {{N/2-1}\ {N/2-1}}_{n{x}_{{\rm{c}}},n{y}_{{\rm{c}}} = -N/2}{{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}}]\ \cdot G[n{x}_{{\rm{v}}}-n{x}_{{\rm{c}}},n{y}_{{\rm{v}}}-n{y}_{{\rm{c}}},{z}_{{\rm{v}}}]\\ G[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}}]= \, \alpha \cdot {\delta }^{2}\cdot \frac{\exp [-i\frac{\Omega }{c}\sqrt{n{x}_{{\rm{c}}}^{2}{\delta }^{2}+n{y}_{{\rm{c}}}^{2}{\delta }^{2}+{z}_{{\rm{v}}}^{2}}]}{\sqrt{n{x}_{{\rm{c}}}^{2}{\delta }^{2}+n{y}_{{\rm{c}}}^{2}{\delta }^{2}+{z}_{{\rm{v}}}^{2}}}.$$

(21)

Thus, the discrete model in Eq. (21) can be implemented as two-dimensional Fast Fourier Transform (2D FFT) algorithm. Notice that the parameter α(x_v, y_v, z_v) is ignored for the reconstruction in Eq. (21). Then the algorithmic procedure is:

Goal: Given input wavefront ${{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}}]$, spacing between the input and output parallel plane z_v (depth), angular frequency Ω and associated wavelength λ, calculate the output wavefront ${{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{v}}},n{y}_{{\rm{v}}}]$ by

$${{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{v}}},n{y}_{{\rm{v}}},\hat{z}]={{\mathcal{R}}}_{{z}_{{\rm{v}}}}\left[{{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}},0]\right].$$

(22)

Step 1: Discretize depth

$$\hat{z}=\frac{Z}{\delta }$$

Step 2: Zero padding according to the desired reconstruction volume size

$$\begin{array}{rcl}&&N^{\prime} ={\rm{Hidden}}\ {\rm{volume}}\ {\rm{side}}\ {\rm{length}}\\ &&{\rm{pad}}=\frac{N^{\prime} -N}{2}\\ &&{{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}}]={\rm{padarray}}\left({{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}}],[{\rm{pad}},{\rm{pad}}],0\right)\\ &&{\rm{Update}}\ {\rm{discrete}}\ {\rm{size}}:\ N=N^{\prime} \end{array}$$

Step 3: Variable substitution

$${\eta }^{2}=\frac{\lambda Z}{N{\delta }^{2}}=\frac{\lambda \hat{z}}{N\delta }$$

Step 4: Compute convolution kernel

$$G[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}},\hat{z}]= \,\frac{\exp \left[-i2\pi \cdot {\hat{z}}^{2}/({\eta }^{2}N)\cdot r\right]}{r}\\ r= \, \sqrt{n{x}_{{\rm{c}}}^{2}/{\hat{z}}^{2}+n{y}_{{\rm{c}}}^{2}/{\hat{z}}^{2}+1}$$

Step 5: Perform the inverse diffraction

$${{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{v}}},n{y}_{{\rm{v}}},\hat{z}]={\bf{IFFT}}\left\{{\bf{FFT}}\left\{{{\mathcal{P}}}_{{\mathcal{F}}}[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}},0]\right\}\ \bullet \ {\bf{FFT}}\left\{G[n{x}_{{\rm{c}}},n{y}_{{\rm{c}}},\hat{z}]\right\}\right\}$$

Here are some short explanations for the computational algorithm above:

1.
In Step 2, to reconstruct a volume with maximum dimensions x_v and y_v larger than the maximum aperture dimensions x_c and y_c (or x_p and y_p), one needs to increase the spatial dimension (parameter $N^{\prime}$) by zero padding the input wavefront.
2.
Step 5 is based on the standard FFT and IFFT algorithm. The symbol • stands for the point-wise multiplication operation. Step 5 can be done in space as well. However, for two matrices of comparable size, Fourier domain multiplication usually runs faster than spatial convolution.
3.
The inverse focusing step realized by the RSD creates a virtual image on the other side of the relay wall, so the sign of the depth parameter z_v should be chosen negative for the considered reconstruction volume.

Memory usage

We are also interested in the memory usage of the fast algorithms (proposed, approx LCT, approx FK). We acquire the memory usage during reconstructions for the Office Scene in Fig. 7. The memory profile during execution is shown in Supplementary Fig. 5. Our memory testing as well as all our code are running on an Intel Core i7-7700 CPU, 3.6 GHz x 8 with 32 GB memory using Matlab. During testing, the base memory usage for non-GUI Matlab is around 750 MB. Independent of the reconstruction quality, approx LCT and approx FK need much more memory than our proposed method. Neglecting the memory of the operating system etc., our method would require about 5 MB of memory when implemented most efficiently. A more detailed discussion regarding to the memory usage can be found later in the Discussion section.

Confocal and rendered data

As a confocal scanning scenario, we use the open source experimental dataset⁹. Our proposed reconstruction method requires similar time and lower memory usage compared to the LCT⁸ and FK⁹ methods. The reconstruction results of confocal datasets are shown in Supplementary Figs. 3 and 4. In terms of the difference between non-confocal (SPAD array) and confocal (Single SPAD with scanning) capture, we provide a short discussion in Supplementary Note 4.

Reconstructions using a rendered dataset with known ground truth are shown in Supplementary Fig. 1. Our proposed method reconstructs an image of the hidden scene that resembles the image that would be captured with a camera located at the relay wall. In our reconstructions, we recover phasor field irradiance for the hidden object. It is expected that the reconstruction shows spatial distortions similar to the ones seen by a real camera, as it is shown in Supplementary Fig. 1. If an exact depth measurement is desired, these biases would have to be calibrated. This is an interesting subject for future work.

Data availability

The data supporting the findings of this study are available (downloaded) at: figshare repository https://doi.org/10.6084/m9.figshare.8084987, Computational Optics Group https://biostat.wisc.edu/~compoptics/phasornlos20/fastnlos.html and Standard Computational Imaging Lab http://www.computationalimaging.org/publications/nlos-fk/. The source data underlying Supplementary Fig. 5 are provided as a Source Data file.

Code availability

The code used in this work is included in this published article and its supplementary information files.

References

Kirmani, A., Hutchison, T., Davis, J. & Raskar, R. Looking around the corner using ultrafast transient imaging. Int. J. Compu. Vision 95, 13–28 (2011).
Article Google Scholar
Ramesh, R. & Davis, J. 5d time-light transport matrix: What can we reason about scene properties? Engineering (2008).
Velten, A. et al. Recovering three-dimensional shape around a corner using ultrafast time-of-flight imaging. Nat. Commun. 3, 1–8 (2012).
Article Google Scholar
Gupta, O., Willwacher, T., Velten, A., Veeraraghavan, A. & Raskar, R. Reconstruction of hidden 3D shapes using diffuse reflections. Opt. Express 20, 19096–19108 (2012).
Article ADS Google Scholar
LaManna, M. et al. Error backprojection algorithms for non-line-of-sight imaging. IEEE Trans. Pattern Anal. Mach. Intell. 41, 1615–1626 (2018).
Article Google Scholar
Laurenzis, M. & Velten, A. Feature selection and back-projection algorithms for nonline-of-sight laser-gated viewing. J. Electron. Imaging 23, 063003 (2014).
Article ADS Google Scholar
Buttafava, M., Zeman, J., Tosi, A., Eliceiri, K. & Velten, A. Non-line-of-sight imaging using a time-gated single photon avalanche diode. Opt. Express 23, 20997–21011 (2015).
Article CAS ADS Google Scholar
O’Toole, M., Lindell, D. B. & Wetzstein, G. Confocal non-line-of-sight imaging based on the light-cone transform. Nature 555, 338–341 (2018).
Article ADS Google Scholar
Lindell, D. B., Wetzstein, G. & O’Toole, M. Wave-based non-line-of-sight imaging using fast fk migration. ACM T. Graphic. 38, 1–13 (2019).
Article Google Scholar
Liu, X. et al. Non-line-of-sight imaging using phasor-field virtual wave optics. Nature 572, 620–623 (2019).
Article CAS ADS Google Scholar
Reza, S. A., La Manna, M., Bauer, S. & Velten, A. Phasor field waves: a Huygens-like light transport model for non-line-of-sight imaging applications. Opt. Express 27, 29380–29400 (2019).
Article ADS Google Scholar
Reza, S. A., La Manna, M., Bauer, S. & Velten, A. Phasor field waves: experimental demonstrations of wave-like properties. Opt. Express 27, 32587–32608 (2019).
Article ADS Google Scholar
Dove, J. & Shapiro, J. H. Paraxial theory of phasor-field imaging. Opti. Express 27, 18016–18037 (2019).
Article ADS Google Scholar
Teichman, J. A. Phasor field waves: a mathematical treatment. Opt. Express 27, 27500–27506 (2019).
Article ADS Google Scholar
Shen, F. & Wang, A. Fast-Fourier-transform based numerical integration method for the Rayleigh-Sommerfeld diffraction formula. Appl. Opt. 45, 1102–1110 (2006).
Article ADS Google Scholar
Nascov, V. & Logofătu, P. C. Fast computation algorithm for the Rayleigh-Sommerfeld diffraction formula using a type of scaled convolution. Appl. Opt. 48, 4310–4319 (2009).
Article ADS Google Scholar
Astola, J. & Yaroslavsky, L. (Eds.) Advances in Signal Transforms: Theory and Applications (Vol. 7) (Hindawi Publishing Corporation, 2007).
Arellano, V., Gutierrez, D. & Jarabo, A. Fast back-projection for non-line of sight reconstruction. Opt. Express 25, 11574–11583 (2017).
Article ADS Google Scholar
Tsai, C. Y., Kutulakos, K. N., Narasimhan, S. G. & Sankaranarayanan, A. C. The geometry of first-returning photons for non-line-of-sight imaging. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 7216–7224 (Honolulu, HI, USA, 2017).
Xin, S. et al. A theory of fermat paths for non-line-of-sight shape reconstruction. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 6800–6809 (Long Beach, CA, USA, 2019).
Tsai, C. Y., Sankaranarayanan, A. C. & Gkioulekas, I. Beyond Volumetric Albedo–A Surface Optimization Framework for Non-Line-Of-Sight Imaging. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 1545–1555 (Long Beach, CA, USA, 2019).
Wu, D. et al. Frequency analysis of transient light transport with applications in bare sensor imaging. In European Conference on Computer Vision 542–555. (Springer, Berlin, Heidelberg, 2012).
Musarra, G. et al. Non-line-of-sight Three-dimensional imaging with a single-pixel camera. Phys. Rev. Appl. 12, 011002 (2019).
Article CAS ADS Google Scholar
Musarra, G. et al. 3D RGB Non-Line-Of-Sight single-pixel imaging. In Imaging Science and Applications. (pp. IM2B-5) https://doi.org/10.1364/ISA.2019.IM2B.5 (Optical Society of America, 2019).
Peters, C., Klein, J., Hullin, M. B. & Klein, R. Solving trigonometric moment problems for fast transient imaging. ACM Transactions on Graphics (TOG) 34, 1–11 (2015).
Article Google Scholar
Liu, X., Bauer, S. & Velten, A. Analysis of feature visibility in non-line-of-sight measurements. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 10140–10148 (Long Beach, CA, USA, 2019).
Ahn, B., Dave, A., Veeraraghavan, A., Gkioulekas, I. & Sankaranarayanan, A. C. Convolutional Approximations to the General Non-Line-of-Sight Imaging Operator. In Proc. IEEE International Conference on Computer Vision 7889–7899 (Seoul, South Korea Source, 2019).
Xu, F. et al. Revealing hidden scenes by photon-efficient occlusion-based opportunistic active imaging. Opt. Express 26, 9945–9962 (2018).
Article ADS Google Scholar
Heide, F. et al. Non-line-of-sight imaging with partial occluders and surface normals. ACM Trans. Graph. 38, 1–10 (2019).
Article Google Scholar
Huang, L., Wang, X., Yuan, Y., Gu, S. & Shen, Y. Improved algorithm of non-line-of-sight imaging based on the Bayesian statistics. JOSA A 36, 834–838 (2019).
Article ADS Google Scholar
Pediredla, A., Dave, A. & Veeraraghavan, A. Snlos: Non-line-of-sight scanning through temporal focusing. In 2019 IEEE International Conference on Computational Photography (ICCP). 1–13 (IEEE, Tokyo, Japan, 2019).
Galindo, M. A dataset for benchmarking time-resolved non-line-of-sight imaging. In ACM SIGGRAPH 2019 Posters 1–2. https://graphics.unizar.es/nlos (2019).
Heide, F., Xiao, L., Heidrich, W. & Hullin, M. B. Diffuse mirrors: 3D reconstruction from diffuse indirect illumination using inexpensive time-of-flight sensors. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 3222–3229 (Tokyo, Japan, 2014).
Klein, J., Peters, C., Martín, J., Laurenzis, M. & Hullin, M. B. Tracking objects outside the line of sight using 2D intensity images. Sci. Rep. 6, 1–9 (2016).
Article Google Scholar
Saunders, C., Murray-Bruce, J. & Goyal, V. K. Computational periscopy with an ordinary digital camera. Nature 565, 472–475 (2019).
Article CAS ADS Google Scholar
Thrampoulidis, C. et al. Exploiting occlusion in non-line-of-sight active imaging. IEEE Trans. Comput. Imag. 4, 419–431 (2018).
Article Google Scholar
Baradad, M. et al. Inferring light fields from shadows. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 6267–6275 (Salt Lake City, UT, USA, 2018).
Yedidia, A. B., Baradad, M., Thrampoulidis, C., Freeman, W. T. & Wornell, G. W. Using unknown occluders to recover hidden scenes. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 12231–12239 (Long Beach, CA, USA, 2019).
Chen, W., Daneau, S., Mannan, F. & Heide, F. Steady-state non-line-of-sight imaging. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 6790–6799 (Long Beach, CA, USA, 2019).
Lindell, D. B., Wetzstein, G., & Koltun, V. Acoustic non-line-of-sight imaging. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 6780–6789 (Long Beach, CA, USA, 2019).
Maeda, T., Wang, Y., Raskar, R. & Kadambi, A. Thermal non-line-of-sight imaging. In 2019 IEEE International Conference on Computational Photography (ICCP) 1–11 (IEEE, Tokyo, Japan, 2019).
Margrave, G. F. & Lamoureux, M. P. Numerical Methods of Exploration Seismology: With Algorithms in Matlab® (Cambridge University Press, 2019).
Margrave, G. F. Direct Fourier migration for vertical velocity variations. Geophysics 66, 1504–1514 (2001).
Article ADS Google Scholar
Fink, M. Time-reversed acoustics. Sci. Am. 91–97 (1999).

Download references

Acknowledgements

This work was funded by DARPA through the DARPA REVEAL project (HR0011-16-C-0025), and the DURIP program (FA9550-18-1-0409). We thank Jeffrey H. Shapiro for the insights about the phasor field broad-band model. We also appreciate the help of Marco La Manna, Ji-Hyun Nam, Toan Le, and Atul Ingle on the hardware setup and helpful discussion during calibrations. Xiaochun Liu would like to acknowledge the helpful discussion with David B. Lindell about his approximate non-confocal method, with Ibón Guillén and Miguel J. Galindo about volume rendering methods and simulated datasets.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of Wisconsin – Madison, Madison, WI, USA
Xiaochun Liu & Andreas Velten
Department of Biostatistics and Medical Informatics, University of Wisconsin – Madison, Madison, WI, USA
Sebastian Bauer & Andreas Velten

Authors

Xiaochun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Bauer
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Velten
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.L. and A.V. conceived the method. X.L. implemented the computational model for the reconstruction. X.L., S.B. performed experiments and comparison for the paper. A.V. supervised all aspects of the project. All authors contributed to designing the experiments and writing the paper.

Corresponding author

Correspondence to Andreas Velten.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Daniele Faccio, Felix Heide and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Source Data

Source Data - Code and Sample Dataset

Source Data - Supp Fig 5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, X., Bauer, S. & Velten, A. Phasor field diffraction based reconstruction for fast non-line-of-sight imaging systems. Nat Commun 11, 1645 (2020). https://doi.org/10.1038/s41467-020-15157-4

Download citation

Received: 03 July 2019
Accepted: 20 February 2020
Published: 02 April 2020
DOI: https://doi.org/10.1038/s41467-020-15157-4

This article is cited by

Attention-based network for passive non-light-of-sight reconstruction in complex scenes
- Yaqin Zhang
- Meiyu Huang
- Xueshuang Xiang
The Visual Computer (2024)
Research Advances on Non-Line-of-Sight Imaging Technology
- Mengge Liu
- Hao Liu
- Mingliang Xu
Journal of Shanghai Jiaotong University (Science) (2024)
Non-line-of-sight imaging with arbitrary illumination and detection pattern
- Xintong Liu
- Jianyu Wang
- Lingyun Qiu
Nature Communications (2023)
Learning diffractive optical communication around arbitrary opaque occlusions
- Md Sadman Sakib Rahman
- Tianyi Gan
- Aydogan Ozcan
Nature Communications (2023)
Computational imaging of moving objects obscured by a random corridor via speckle correlations
- Tian Shi
- Liangsheng Li
- Ning Zheng
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.