Introduction

Optical microscopy is an essential tool for understanding the physiology of living tissues due to its high spatial resolution, molecular specificity, and minimal invasiveness1. However, these benefits are out of reach when target objects are located either inside curved passages or underneath light-scattering tissues. By visualizing these hard-to-reach areas, endoscopes have revolutionized medical practice for early disease diagnosis. Over the past decade, endoscopes with microscopic resolution have been developed for more accurate and earlier diagnosis2,3. In addition, the demand for ultrathin endoscopes (with a probe diameter well below 1 mm) has been steadily growing to minimize the discomfort and complications accompanied by insertion of the endoscope probe3,4,5,6,7.

Endoscopic microscopy typically employs various optical fibres as thin and flexible light-guiding channels. For example, a single optical fibre has been used by attaching various types of scanning devices and optical elements to the distal side of the fibre facing the sample4,8,9,10. Multiphoton imaging4,9,11,12,13 and optical coherent tomography (OCT)14,15,16 have been implemented in this configuration. However, the scanner attached to the fibre is often too bulky to be ultrathin even though the diameter of the fibre itself is small. Image guiding media such as coherent fibre bundles are used to eliminate the need for distal scanners, thus making the endoscope probe thinner and more robust. Individual fibre cores in the bundle are used as image pixels by either attaching imaging optics to the tip of the fibre bundle or making direct contact of the fibre tip to the sample surface17,18. In this configuration, wide-field fluorescence imaging modality has been implemented for rapid medical diagnosis19,20. And confocal fluorescence imaging has been realized by the high-speed scanning of the focus at the proximal side of the fibre outside the subject either with17,21 or without22 a distal lens. One critical drawback of this configuration is its inability to acquire label-free reflectance images of biological tissues. Strong back-reflection of the illumination occurring at the fibre cores exactly coincides with much weaker reflection signals from the biological tissues. This is one of the main reasons why fluorescence imaging mode is widely used, in which fluorescence emission can be separated from the back-reflection noise by using colour filters. Since most fluorescence imaging requires staining, its use for general clinical diagnosis is limited. A simple solution for intrinsic reflectance imaging is to introduce a separate fibre for the illumination. However, this has only been applicable for macroscopic imaging due to the enlarged probe diameter and low spatial resolving power constrained by the fibre-sample distance required for the separate illumination.

Various types of lensless fibre endoscopes have been developed to minimize the probe size to the diameter of fibre itself. For example, multimode optical fibres have been employed as image guiding medium23,24,25. They provide numerous independent spatial modes that can transfer image information at once. Since their areal mode density is one or two orders of magnitude larger than the fibre core density in the fibre bundle, the probe diameter can be even smaller than the fibre bundle probe. However, complex mode mixing obscures image information. Many recent studies have proved that the prior calibration of a multimode fibre in the form of a transmission matrix (TM) enables the recovery of the object image23,24,26,27. This concept has been applied to fluorescence endoscopic imaging of mouse brain28,29 and wide-field reflectance imaging of biological tissues25. The TM approach has also been applied to a fibre bundle endoscope for either fluorescence27 or transmittance/reflectance imaging30,31,32 for the removal of both pixelation artefacts and distal optics. However, TM-based endoscopes have so far been used as a rigid type rather than a flexible type28,29, mainly because the TM calibration becomes invalid when the fibres are bent or twisted during insertion. Several approaches have been introduced to resolve this critical constraint. For example, speckle correlation due to the memory effect of the fibre bundle has been used to reconstruct object image without prior knowledge of the TM33,34. Feedback optimization of two-photon signals measured at the proximal end allows direct access to the fibre TM35, and a virtual guide star is administrated to the distal side to dynamically compensate for fibre bending36,37. The use of specially designed fibres that have less sensitive to the bending deformation has been proposed38,39,40. While these studies have shown the feasibility to a certain extent, innate limitations such as the lengthy optimization time, enlargement of the probe unit, constraint for the bending configurations, and difficulty in fabricating ideal fibres still preclude the realization of the flexible endoscopes. Essentially, previous lensless fibre endoscopes either require prior calibration or operate in fluorescence imaging mode, and an ideal endoscope satisfying all the critical requirements—flexibility, ultrathin probe diameter, microscopic resolution, and stain-free imaging—has yet to be developed.

Herein, we present a fully flexible and ultrathin fibre endoscope that can perform high-resolution holographic imaging of unstained tissues through a narrow and curved passage. Our endoscope probe is simply a bare fibre bundle with no lens or scanner attached to the tip. Therefore, the diameter of the endoscope is equal to that of the fibre bundle itself, which was either 200 μm or 350 μm in the present study. Unlike a conventional fibre bundle endoscope, the fibre cores are not image pixels. Instead, the fibre is placed at any distance larger than 400 μm from the target sample. In this unique configuration, the illumination is made through each fibre core, one by one, but the detection is made by all the other fibre cores. The reflected waves from the sample that are blurred and spread to the other fibre cores are measured by using a holographic field measurement32. This allows us to selectively block the back-reflection noise from the illumination core. Since the measured holographic field contains the Fourier transform of an object function by means of Fresnel diffraction, we term our method ‘Fourier holographic endoscopy’. However, an object image cannot be reconstructed from the detected field due to the dynamically varying core-to-core phase retardations with respect to the fibre bending and twisting. While the fibre was calibrated before its insertion in previous TM approaches, we developed an algorithm that can identify the complex core-to-core phase retardations and reconstruct the diffraction-limited and pixelation-free object image all at the same time without the need for any prior calibration. This allowed us to perform endoscopic imaging with an arbitrary and varying bending configuration. For an object located behind a narrow and curved passage, we performed the endoscopic reflectance imaging with a microscopic spatial resolution of 0.85 μm for low-contrast resolution targets. Furthermore, we realized the volumetric 3D imaging covering the depth range of 400–1200 μm with an axial resolution as good as 14 μm by exploiting the holographic image reconstruction. To verify the applicability of the proposed method for investigating biological specimens, we conducted Fourier holographic endoscope imaging of unstained rat intestine tissues that are completely invisible to conventional reflectance-mode endoscopes. We demonstrated imaging of individual villi with the spatial resolution and contrast on par with or better than those of conventional confocal reflectance microscopes.

Results

Experimental layout of Fourier holographic endoscopy

Our experimental setup is shown in Fig. 1a. A diode laser (Finesse Pure, Laser Quantum) with a wavelength of λ = 532 nm and a coherence length of 6 mm was used as the light source. The output beam was split into sample and reference waves with a beam splitter (BS1). The sample beam was reflected by a two-axis galvanometer mirror (GM) and focused on the input plane (IP) of a coherent fibre bundle via an objective lens (OL). Two types of fibre bundles were used, one with 350 μm diameter (Fujikura, FIGH-10-350S) and the other with 200 μm diameter (Fujikura, FIGH-10-200S). The 350-μm-diameter fibre bundle (1 m-long, and a minimum bending radius of 35 mm) contains around 10,000 fibre cores, the diameter of each being around 2 μm. We adjusted the GM scanning angle to sequentially focus the illumination beam on each fibre core. This ensured that the illumination beam (indicated as green) travelled through the single fibre core and came out from the output plane (OP) of the fibre bundle via the same fibre core.

Fig. 1: Schematic of the experimental setup and the image formation principle.
figure 1

a Experimental setup. The output beam from a laser is divided into sample and reference beams. The sample beam is delivered to the sample through the fibre bundle. The backscattering signal from the sample, indicated as yellow for clarity although its wavelength is identical to the incident wave, is captured by the fibre bundle and delivered to the camera. The reference beam generates an interferogram together with the signal beam at the camera. GM: 2-axis galvanometer scanning mirror, BS1-3: beam splitters, L1-5: lenses, OL: objective lens. IP and OP: input and output planes of the fibre bundle, respectively. SP: sample plane. b Image formation principle. The illumination and reflection pathways have been unfolded to make their distinction clear. \(\left({u}_{i},{v}_{i}\right)\) and \(\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\): spatial coordinates at the fibre bundle for illumination and detection pathways, respectively; \({\phi }_{{{{{{\rm{i}}}}}}}^{{{{{{\rm{b}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) and \({\phi }_{{{{{{\rm{r}}}}}}}^{{{{{{\rm{b}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\): phase retardations induced by the fibre bundle during the illumination and reflection, respectively; \({E}_{{{{{{\rm{i}}}}}}}\left(x,y\right)\) and \({E}_{{{{{{\rm{r}}}}}}}\left(x,y\right)\): the electric fields incident to and reflected by the sample, respectively; and \({E}_{{{{{{\rm{camera}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\): the electric field detected at the camera.

To separate the back-reflection noise from the fibre bundle, we placed the sample plane (SP) at a standoff distance d from the OP. This allowed the backscattering signals from the sample to be spread in space and collected at cores other than the illumination core. The imaging resolution varies by the standoff distance. However, our system always provides a better spatial resolution than contact-mode imaging whose resolution is set by the core diameter. As verified later, this is because the space-bandwidth product of our system is higher by a factor of 13. Typically, d ranged from 400 to 1200 μm to ensure that the Fresnel approximation is valid (see Supplementary section 4 for minimal working distance)41. In this arrangement, the illumination beam at the OP arrives at the SP as a parabolic wave. The angle of the parabolic illumination wave at the SP is varied depending on the position of the illumination core at the IP. The waves backscattered by the object at SP (indicated as yellow) are captured by multiple fibre cores at the OP and delivered back to the IP. The waves returning through the IP are captured by the OL and delivered to the camera. To obtain the amplitude and phase of the returning waves, a reference wave was introduced at the camera with an off-axis configuration, and the Hilbert transform was applied to the recorded holograms.

Calibration-free image reconstruction principle

The detailed layout of the image formation process is shown in Fig. 1b. The incident wave focused at the fibre core located at \(\left({{{{{{\boldsymbol{u}}}}}}}_{{{{{{\bf{i}}}}}}}{{{{{\boldsymbol{,}}}}}}{{{{{{\boldsymbol{v}}}}}}}_{{{{{{\bf{i}}}}}}}\right)\) of the IP experiences phase retardation \({{{{{{\boldsymbol{\phi }}}}}}}_{{{{{{\bf{i}}}}}}}^{{{{{{\bf{b}}}}}}}\left({{{{{{\boldsymbol{u}}}}}}}_{{{{{{\bf{i}}}}}}}{{{{{\boldsymbol{,}}}}}}{{{{{{\boldsymbol{v}}}}}}}_{{{{{{\bf{i}}}}}}}\right)\) at the corresponding core and exits as a point source at the OP. This point source travels to the SP and meets the target object as a parabolic wave. The incident wave at \(\left({{{{{\boldsymbol{x}}}}}}{{{{{\boldsymbol{,}}}}}}{{{{{\boldsymbol{y}}}}}}\right)\) of the SP can be written as

$${E}_{{{{{{\rm{i}}}}}}}\left(x,y{{{{{\rm{;}}}}}}{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)=\frac{{e}^{{ikd}}}{i\lambda d}{{\exp }}\left\{i\frac{k}{2d}\left[{\left(x-{u}_{{{{{{\rm{i}}}}}}}\right)}^{2}+{\left(y-{v}_{{{{{{\rm{i}}}}}}}\right)}^{2}\right]\right\}{e}^{i{\phi }_{{{{{{\rm{i}}}}}}}^{{{{{{\rm{b}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)}.$$
(1)

Here, \(k=2\pi {\lambda }^{-1}\) is the wavenumber. The wave is reflected by the target sample having object function \(O\left(x,y\right)\), which is the amplitude reflectance of the target. The reflected wave given by \({E}_{{{{{{\rm{r}}}}}}}\left(x,y{{{{{\rm{;}}}}}}{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)=O\left(x,y\right){E}_{{{{{{\rm{i}}}}}}}\left(x,y{{{{{\rm{;}}}}}}{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) is depicted on the transmission side of the sample in Fig. 1b to make it clearly distinct from the incident wave. The reflected wave travels back to \(\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\) of the OP following Fresnel diffraction and returns to the IP after experiencing fibre core-dependent phase retardation \({\phi }_{{{{{{\rm{r}}}}}}}^{{{{{{\rm{b}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\). After incorporating all of these processes, we can obtain the field at the camera, i.e. the field detected at the fibre proximal facet plane, as

$${E}_{{{{{{\rm{camera}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}{{{{{\rm{;}}}}}}{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)=-\frac{{e}^{2{ikd}}}{{\lambda }^{2}{d}^{2}}{e}^{i{\phi }_{{{{{{\rm{r}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)}{\widetilde{O}}_{{{{{{\rm{M}}}}}}}\left(\frac{k}{d}\left({u}_{{{{{{\rm{r}}}}}}}+{u}_{{{{{{\rm{i}}}}}}}\right),\frac{k}{d}\left({v}_{{{{{{\rm{r}}}}}}}+{v}_{{{{{{\rm{i}}}}}}}\right)\right){e}^{i{\phi }_{{{{{{\rm{i}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)}.$$
(2)

Here, \({\widetilde{O}}_{{{{{{\rm{M}}}}}}}\) is the Fourier transform of the modified object function \({O}_{{{{{{\rm{M}}}}}}}\left(x,y\right)=O\left(x,y\right){\exp }\left\{i\frac{k}{d}\left({x}^{2}+{y}^{2}\right)\right\}\), and \({\phi }_{{{{{{\rm{i}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)=\frac{k}{2d}\left({{u}_{{{{{{\rm{i}}}}}}}}^{2}+{{v}_{{{{{{\rm{i}}}}}}}}^{2}\right)+{\phi }_{{{{{{\rm{i}}}}}}}^{{{{{{\rm{b}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) and \({\phi }_{{{{{{\rm{r}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)=\frac{k}{2d}\left({u}_{{{{{{\rm{r}}}}}}}^{2}+{v}_{{{{{{\rm{r}}}}}}}^{2}\right)+{\phi }_{{{{{{\rm{r}}}}}}}^{{{{{{\rm{b}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\) are the phase retardations in the illumination and reflection pathways, respectively (see the Supplementary section 1 for the detailed derivation of Eq. (2)). In essence, the measured field contains the object spectrum with scaling factor k/d and spectral shift by \(\left({-u}_{{{{{{\rm{i}}}}}}},{-v}_{{{{{{\rm{i}}}}}}}\right)\). The quadratic phase terms are introduced because of Fresnel diffraction. Interestingly, they are separately absorbed in \({O}_{{{{{{\rm{M}}}}}}}\left(x,y\right)\), \({\phi }_{{{{{{\rm{i}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\), and \({\phi }_{{{{{{\rm{r}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\).

Finding \({O}_{{{{{{\rm{M}}}}}}}\left(x,y\right)\), \({\phi }_{{{{{{\rm{i}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\), and \({\phi }_{{{{{{\rm{r}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\) from the measured reflection waves is generally a difficult task, especially when the phase retardations among the cores are uncorrelated. However, Eq. (2) is formally equivalent to the imaging configuration where the target object embedded within an aberrating medium is located at the exact focal plane of the objective lens42. In our previous studies42,43,44, we developed a method termed ‘closed-loop accumulation of single scattering (CLASS)’ to separate the object function from complex sample-induced aberrations and recover the ideal diffraction-limited spatial resolution. We specially redesigned this CLASS algorithm to address the present problem by accounting for the pixelated sampling by the fibre bundle and the quadratic phase terms. The working principle is to first construct the reflection matrix R in which the elements are filled with \({E}_{{{{{{\rm{camera}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}};{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) with \(\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) and \(\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\) as column and row indices, respectively. We then iteratively compute the correlations among the columns and rows to identify \({\phi }_{{{{{{\rm{i}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) and \({\phi }_{{{{{{\rm{r}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\), from which we obtain \({O}_{{{{{{\rm{M}}}}}}}\left(x,y\right)\) (see Methods for details of image reconstruction). We can obtain the reflectance map of the target sample by the simple relationship \({\left|{O}_{{{{{{\rm{M}}}}}}}\left(x,y\right)\right|}^{2}={\left|O\left(x,y\right)\right|}^{2}\). This algorithm automatically identifies \(\frac{k}{d}\left({x}^{2}+{y}^{2}\right)\) in \({O}_{{{{{{\rm{M}}}}}}}\left(x,y\right)\) for the objects within working range. Since the fibre-to-sample distance d is automatically obtained from the quadratic phase terms, it is also possible to reconstruct a 3D image from the recording of a single reflection matrix.

Procedures for data acquisition and image reconstruction

Experimental procedures to perform the endoscopic imaging are shown in Fig. 2. We placed a US Air Force (USAF) resolution target (Edmund, 2″ x 2″ Positive, 1951 USAF Hi-Resolution Target #58-198) at the SP. The conventional full-field endoscopic image taken by the same fibre bundle is shown in Fig. 2h as a point of reference (see Methods for details). From the photograph of the fibre bundle surface taken at the IP, we chose which fibre cores to illuminate (the red dots in Fig. 2a). In an arbitrary fibre bending configuration, we adjusted the GM angle to focus the illumination beam on each core at a time. In a typical experiment, illumination was made to 100–3000 different fibre cores. Raw images of the backscattered waves from the sample taken at the IP are shown in Fig. 2b for the illumination of a few representative core fibres. The spatial coordinates of these output images correspond to \(\left({{{{{{\boldsymbol{u}}}}}}}_{{{{{{\bf{r}}}}}}}{{{{{\boldsymbol{,}}}}}}{{{{{{\boldsymbol{v}}}}}}}_{{{{{{\bf{r}}}}}}}\right)\) at the fibre bundle surface. In each image, the bright dot is due to the direct back-reflection from the fibre core where the illumination was focused. They were moving synchronously with the scanning of \(\left({{{{{{\boldsymbol{u}}}}}}}_{{{{{{\bf{i}}}}}}}{{{{{\boldsymbol{,}}}}}}{{{{{{\boldsymbol{v}}}}}}}_{{{{{{\bf{i}}}}}}}\right)\). Backscattered signal waves by the target object were spatially spread over the other cores in the recorded images. We selectively removed the back-reflection noise by dropping the signal at the pixel where the back-reflection noise was focused. We then performed the Hilbert transform on the raw images in Fig. 2b to obtain a complex field map of the backscattered waves (Fig. 2c).

Fig. 2: Data acquisition and image reconstruction.
figure 2

a Bright-field image of the fibre bundle taken at the IP by illuminating the incoherent source from the OP. The fibre cores where the illumination beam was focused are indicated as red dots. b Raw images captured by the camera for the scanning of \(\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\). For better visibility, the images shown here were taken without the reference beam, while the interference images were recorded for the endoscopic imaging (see the Supplementary section 2 for raw interference images). The colour bar indicates normalized intensity excluding the back-reflection noise. c Complex field maps \({E}_{{{{{{\rm{camera}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}};{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) obtained from the raw interference images in b. Circular colour map: real and imaginary values of \({E}_{{{{{{\rm{camera}}}}}}}\). d and e Fibre core-dependent phase retardations \({\phi }_{{{{{{\rm{i}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) and \({\phi }_{{{{{{\rm{r}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\), respectively, identified via the algorithm. The colour bar signifies the phase in radians. Scale bar, 0.1k. f and g The same as d and e, respectively, but for a low-contrast resolution target. h Conventional endoscopic image of a USAF target taken by the incoherent illumination from the IP. The fibre bundle was in contact with the target surface. i Coherent addition of inverse Fourier transformed images of the complex field maps in c before correcting for the fibre core-dependent phase retardations. j The same as i but after the correction. The colour bar signifies normalized amplitude. km The same as hj, respectively, but for a low-contrast resolution target. The 350-μm-diameter fibre bundle was used, and the fibre-to-sample distance d was 500 μm for i, j, l, and m. Scale bars in a to c: 50 μm. Scale bars in j and m: 30 μm.

The detected images in Fig. 2c correspond to \({E}_{{{{{{\rm{camera}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}};{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) in Eq. (2). Since \({\phi }_{{{{{{\rm{i}}}}}}}^{{{{{{\rm{b}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) and \({\phi }_{{{{{{\rm{r}}}}}}}^{{{{{{\rm{b}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\) vary significantly during the insertion of the fibre, the object image cannot be reconstructed directly from the images in Fig. 2c. The inverse Fourier transform of the images in Fig. 2c resulted in speckled patterns, and their coherent summation showed no sign of image information (Fig. 2i). As described earlier, we constructed a reflection matrix from the images in Fig. 2c and applied our algorithm to identify \({\phi }_{{{{{{\rm{i}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) and \({\phi }_{{{{{{\rm{r}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\), which are respectively shown in Fig. 2d, e. They presented complex phase patterns, as was expected. We compensated for these core-dependent phase retardations to the images in Fig. 2c, which were then coherently added after accounting for the spectral shift by (ui, vi) (see Methods for the algorithm and image reconstruction). The final reconstructed image shown in Fig. 2j presents fine details in Groups 8 and 9 that were invisible in the conventional endoscope image in Fig. 2h. Our endoscope could resolve the smallest feature of the USAF 1951 target in the 1st element of group 9 having a distance of 1.56 μm between the neighboring bars. This suggests that lateral resolution is equal to or better than 1.56 μm. In fact, the edge of the USAF target is sharp enough to fully evaluate the endoscope resolution. By analyzing the edge response function, we found that the lateral resolution was as good as 0.85 μm, which is very close to the theoretical resolution (0.78 μm) of our system (see Supplementary section 3). Furthermore, there was no pixelation in the reconstructed image because the acquired images via \({E}_{{{{{{\rm{camera}}}}}}}\) were pixelated in the spatial frequency but not in real space. This led to the increase of space-bandwidth product by a factor of 13 in comparison with the conventional contact-mode endoscopes (see Supplementary section 3).

The image resolution of our endoscope is determined by the diameter of the fibre bundle D, which sets the numerical aperture (NA) by \(\alpha=n\left(D/2\right){d}^{-1}\), where n is the refractive index of the medium between the fibre and the sample. When \(\alpha\) is larger than the NA of the fibre itself (0.4), then the latter limits the achievable spatial resolution. As d is reduced from 1.2 mm to 400 μm, \(\alpha\) is increased from 0.12 to 0.47 for n = 1, and the theoretical spatial resolving power is increased from 1.6 to 0.67 μm. In the experiment, the reflected waves within the diameter of \({D}_{{{{{{\rm{eff}}}}}}}=0.80D\) were used for the sake of the computational time. Therefore, the diffraction-limited resolution at d = 500 μm was 0.78 μm, which is close to the experimental result. The view field diameter was set as \(L=\left(\lambda /n\right)d\Delta {D}^{-1}\), where \(\Delta {D}\) = 3.2 μm is the fibre core-to-core spacing. Therefore, L increased from 66 μm to 170 μm with an increase in d. In the image reconstruction process, the effective core-to-core spacing was reduced to \(\Delta {D}_{{{{{{\rm{eff}}}}}}}\)= 1.5 μm due to the synthesis of multiple images with the spectral shift by \(\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\). As a result, L ranged from 140 to 410 μm with an increase in d. The estimated view field at d = 500 μm was 170 μm, which agrees well with the experimental result (see Supplementary section 3 for the resolving power and view field for 350-μm-diameter and 200-μm-diameter fibre bundles).

To prove the benefit of our method in removing the back-reflection noise, we performed imaging of a low-contrast USAF resolution target (Thorlabs, Positive 1951 USAF test target, R1DS1P). The reflectance of the target was 10% higher than that of the background at 532 nm wavelength. The conventional endoscopic imaging (Fig. 2k) did not show any structures at all because the back-reflection noise was much stronger than the backscattering signal from the resolution target. On the contrary, our method could clearly unveil the target structures (Fig. 2m). Figure 2f, g shows the identified core-dependent phase retardations.

Endoscopic imaging through a narrow and curved passage

We demonstrated label-free microscopic imaging through a narrow and curved passage. A target object was placed inside a box, and an 80-cm-long plastic tube was connected to a hole at the ceiling of the box (Fig. 3a, b). The inner and outer diameters of the tube were 4.3 and 6.5 mm, respectively. While the proximal end of the fibre was fixed at the focal plane of the objective lens (the inset in Fig. 3b), the distal side of the fibre was inserted through the tube until it reached the target (Edmund, 2″ × 2″ Positive, 1951 USAF Hi-Resolution Target #58-198). A conventional endoscopic image was taken when the fibre bundle was in contact with the target (Fig. 3c). Although this image was pixelated and its resolving power was low, it allowed us to position the fibre to the region of interest. Once the target was identified, we pulled back the fibre bundle to set d within the working range and recorded a reflection matrix by scanning the illumination fibre cores. The acquired image is shown in Fig. 3d, in which the reconstructed image is autofocused to the target object and fine details of Group 9 are clearly visible with a spatial resolution of 1.1 μm. The standoff distance was calculated as 800 μm from the quadratic phase of the reconstructed image. This result confirms that our endoscope can take an in-situ microscopic image of a target object through a narrow and curved passage with no prior calibration of the fibre bundle.

Fig. 3: Endomicroscopic imaging through a narrow and curved passage, and 3D imaging capability.
figure 3

a and b Front and top views of the experimental configuration, respectively. The zoomed-in image in a: the distal side of the fibre bundle near the sample. The zoomed-in image in b: the light coupling to the IP of the fibre from the objective lens, respectively. c and d Conventional endoscopic image and the reconstructed image with our endoscope, respectively. Scale bars: 20 μm. The colour bar in c shows normalized intensity and the colour bar in d shows normalized amplitude. e Schematic for endoscopic imaging of stacked targets. Two resolution targets were placed at two different depths, I and II, with fibre-to-sample distances of 0.6 and 1.06 mm, respectively. Ground-truth images of the targets in depths I and II taken by conventional brightfield microscope were shown next to the schematic. f and g Endoscopic images for the depths of I and II, respectively, reconstructed using a single reflection matrix recording. The 350-μm-diameter fibre bundle was used for image acquisition. Scale bars: 30 μm. The colour bar signifies normalized amplitude.

3D image reconstruction from a single matrix recording

Our endoscope has 3D imaging capability since our algorithm works as a holographic image reconstruction. For the two targets separated by 460 μm (Fig. 3e), we took a single reflection matrix. Our algorithm identified the target image from depth I (Fig. 3f) since this depth gave stronger correlations than depth II when computing the core-dependent phase retardations. Once ϕi (ui, vi) and ϕr (ur, vr) had been identified for depth I, we applied the computational propagation of each Ecamera (ui, vi; ui, vi) based on the angular spectrum method41 after applying the correction to obtain the target image at depth II. After numerical refocusing, completely different structures corresponding to depth II came to light (Fig. 3g) (see Supplementary Movie 1 taken while the focal plane was computationally scanned). This result showed the volumetric image range covered by a single matrix recording. We demonstrated volumetric imaging of TiO2 particles dispersed in agarose gel and proved that the depth resolution of our endoscope was as good as 14 μm (see Supplementary section 3 and Supplementary Movie 2).

Microscopic imaging of unstained rat intestine tissues

We demonstrated the endomicroscopic imaging of unstained biological tissues. An intestine tissue excised from a rat was placed on a slide glass (see Methods for the sample preparation). At first, the tip of the fibre bundle was placed close to the surface of the villi in the intestine. The image acquired by the conventional endoscope is shown in Fig. 4a, in which the contrast between the target and background was too low to discern any villus. In fact, the contrast of the villi was so low that they could be vaguely visible even in the transmission image (Fig. 4b) taken by illuminating from the sample side. We then pulled back the fibre bundle about 1000 μm away from the villi and took endoscopic images for different regions. Representative images obtained for the circular areas indicated in Fig. 4b are shown in Fig. 4c–f. The boundary of the villi is sharply resolved with a much better contrast than the conventional endoscope image, and the structures of the relatively transparent inner parts near the boundary are recognizable. The entire map of two villi was reconstructed by stitching multiple images taken for different regions (Fig. 4g). The spatial resolution obtained from the sharpness of the boundary was about 1 μm, far better than typical colonoscopy can provide. In this demonstration, the reflection from the glass surface was dominant over that from the tissue surface. Therefore, the contrast of the reconstructed image was generated by the reflection of glass surface transmission through the intestine tissue.

Fig. 4: Microscopic imaging of villi in a rat intestine.
figure 4

a Conventional reflectance endoscope image taken when the fibre bundle was in contact with the villi. b Conventional transmission image obtained through the fibre bundle. The LED illumination was sent from the villi to the fibre bundle. Scale bar: 100 μm. c-f Label-free reflectance images obtained using our holographic endoscope for the circular areas indicated in b. Scale bar: 25 μm. g Reconstructed image of two villi by stitching multiple images taken over wide region of interest. The 350-μm-diameter fibre bundle was used for image acquisition. Scale bar: 100 μm.

Coherence-gated Fourier holographic endoscope for tissue imaging

To demonstrate tissue imaging in a full reflection-mode condition, we placed a rat intestine tissue on agarose gel and filled the gap between the gel and fibre bundle with water (Fig. 5a). Refractive indices of water and agarose gel were almost identical, making reflection from their interface negligible. Since a slide glass supporting the sample was located far below the excised tissue (> 1 cm), reflection from the glass surface was negligible. Therefore, the reflection signal captured by the fibre bundle was mainly from the surface of intestine tissues as well as the distal side of fibre itself. In this geometry, the conventional contact-mode endoscope couldn’t visualize anything, nor the clue of the existence of the villi (Fig. 5c). This is because the reflection signal from the surface of the villi was completely obscured by that from the fibre surface due to the small index difference between the tissue and water/agarose gel. This shows the difficulty in realizing the reflectance-mode endoscope imaging for biological tissues. On the contrary, our Fourier holographic endoscope could clearly visualize the external boundaries and morphology of the villi with high contrast (Fig. 5d). To cover the wide FOV, we moved the distal tip of the fibre bundle and consecutively imaged multiple sites of the villi. The recorded images were stitched to produce the extended image. From the sharpness of the boundaries of the villus, spatial resolution was estimated to be 2 μm. As ground truth, a confocal reflectance image was taken by a conventional microscope using an air-type 0.4 NA objective lens at a wavelength of 516 nm (Fig. 5b). The overall morphology of the villus and its boundaries were almost identical to those resolved by our endoscope image, validating the capability of our endoscope for imaging unstained tissues. It should be noted that the contrast of our endoscope image is slightly better than the conventional confocal reflectance image due to the addition of coherence gating explained below.

Fig. 5: Imaging of a rat villi in the full reflection-mode condition.
figure 5

a Sample configuration. An excised rat intestine tissue was placed on top of thick agarose gel (thickness > 1 cm) on a slide glass. The distal tip of the fibre bundle was approximately 600 μm above the tissue surface with their gap filled with water. b Conventional reflectance image recorded by a confocal microscope using 0.4 NA objective lens at the wavelength of 516 nm. c Contact-mode reflectance endoscope image. d Fourier holographic endoscope image. Scale bars: 100 μm.

In this demonstration, we specially adopted a low-coherence light source to further attenuate the contribution of the reflection from the distal tip of the fibre bundle. Although single-core illumination with wide-field detection can eliminate most of the back-reflection noise, there still exist stray reflections due to the leak of illumination to the other fibre cores. Even these stray reflections can make a devastating effect on the image reconstruction due to the weak contrast of the intestine tissues in water/agarose gel. The use of a low-coherence source whose coherence length is shorter than the tip-sample distance can effectively reject these contributions as they cannot form an interference with the reference beam, whose path length was set to the tissue sample. Specifically, an output beam from a pulsed laser (custom-made Yb laser, wavelength: 1032 nm) was frequency-doubled by a second harmonic generation process to produce a light source with a wavelength of 516 nm and a bandwidth of 3 nm. This corresponds to the coherence length of 20 μm. A 300 μm-diameter fibre bundle (FIGH-06-300S) with a length of 15 cm and a core number of approximately 4000 was used for this measurement. Due to the dispersion by the imaging optics and fibre cores, the effective coherence-gating was broadened to about 100 μm, which was much shorter than the typical gap of 600 μm between the fibre tip and the tissue surface. It turned out that the addition of the coherence gating was critical in imaging tissues immersed in water and agarose gel. On a condition where the tissue surface is exposed to air, which is normally the case in a typical endoscopic investigation, the demand for coherence gating is expected to be reduced. This result suggests that our endoscope’s label-free microscopic imaging capability can potentially be used for identifying the abnormalities occurring at the surface of the villi at an earlier stage than the current practice.

Discussion

We developed a fully flexible ultrathin lensless endoscope that can perform microscopic imaging of unstained biological tissues through a narrow and curved passage. The proposed method takes the thinnest possible form because the fibre bundle itself is used as the endoscope probe. The diameter of the probe was either 350 μm or 200 μm in the present study, but it can be reduced further just by employing a smaller fibre bundle. Therefore, the diameter can even be comparable to the thinnest acupuncture needle. The achieved lateral and axial resolutions were 0.85 μm and 14 μm, respectively, which are comparable to those of high-resolution microscopes. The recording of holographic images enabled the reconstruction of 3D objects for a volume covering a depth range of 400–1200 μm using a single reflection matrix recording. While retaining all these benefits, our endoscope operates as a flexible type rather than a rigid type because we eliminated the need for prior calibration of the imaging probe. We resolved the complex core-dependent phase retardations directly from the images taken during the imaging session, which allowed us to freely navigate the endoscope probe around the region of interest.

The acquisition speed for the volumetric imaging was 1 Hz with the use of an sCMOS camera because the required complex field maps can be as small as 100 images. The imaging speed could be accelerated up to 20 Hz simply by employing a high-speed CMOS camera with a frame rate of 2,000 Hz (see Supplementary section 8). This implementation will substantially reduce the motion artifacts during the image acquisition. Image reconstruction takes 2 s to process data with 200 raw images using a desktop computer (CPU: Intel Xeon Gold 6130, 2.10 GHz, RAM: 256 GB, GPU: NVIDIA TITAN RTX D6 24 GB). We expect that the complete replacement of Matlab with the compiled programing language such as C/C++ will make the overall process much faster and the real-time visualization possible.

The capability of label-free and flexible endoscopic imaging with a microscopy level of performance could expedite non-invasive or minimally invasive disease diagnosis and industrial inspections. For example, our endoscope is so thin that it can be inserted through narrow cavities such as the minute airways in the lung and narrow microvessels for microscopic investigation. Since the probe can be as thin as an acupuncture needle, it can directly be inserted into the brain tissues to find the neurological disorders with minimal complications. Our endoscope can also facilitate precise phototherapy and efficient light stimulation as the treatment light can be delivered through the fibre bundle while the endoscopic imaging accesses the target lesion through the same imaging probe. The proposed method was combined with the time-gated detection by employing a low-coherence light source for better background noise reduction. Time-gated detection will allow endoscopic imaging deep inside tissues as well as the mapping of the superficial morphology. Since our approach can provide the imaging contrast better than the conventional confocal reflectance microscopy, this method will further expedite disease diagnosis occurring deep within tissues16,45,46. The applicability is not limited to the medical diagnosis but can be extended to industrial inspections. In modern semiconductors and microprocessors, devices are stacked in layers for maximal integration. Our endoscope can be used to monitor each step of fabrication process occurring inside the chambers with minimal interventions. Label-free imaging capability is particularly well-suited because the use of chemical staining is not desirable due to the concern about contaminating the devices.

Methods

Correcting the fibre core-dependent phase retardations

From complex images \({E}_{{{{{{\rm{camera}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}};{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) in Fig. 2c, we constructed reflection matrix R in which the column and row indices correspond to \(\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) and \(\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\), respectively. This was achieved by converting the individual images in Fig. 2c into column vectors and appending them together to form a matrix. We identified \({\phi }_{{{{{{\rm{i}}}}}}}^{{{{{{\rm{c}}}}}}}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) from the correlation among the columns and constructed a corrected matrix in which the matrix elements were \({E}^{{{{{{\rm{c}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}};{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)={e}^{-i{\phi }_{{{{{{\rm{i}}}}}}}^{c}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)}{E}_{{{{{{\rm{camera}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}};{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\). We then identified \({\phi }_{{{{{{\rm{r}}}}}}}^{{{{{{\rm{c}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)\) from the correlation among the rows of the corrected matrix, which was then applied to construct the corrected matrix whose elements were \({E}^{{{{{{\rm{c}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}};{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)={e}^{-i{\phi }_{{{{{{\rm{i}}}}}}}^{c}\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)}{E}_{{{{{{\rm{camera}}}}}}}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}};{u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right){e}^{-i{\phi }_{{{{{{\rm{r}}}}}}}^{c}\left({u}_{{{{{{\rm{r}}}}}}},{v}_{{{{{{\rm{r}}}}}}}\right)}\). These processes were repeated until the object spectrum \({\widetilde{O}}_{{{{{{\rm{M}}}}}}}\left(\frac{k}{d}\left({u}_{{{{{{\rm{r}}}}}}}+{u}_{{{{{{\rm{i}}}}}}}\right),\frac{k}{d}\left({v}_{{{{{{\rm{r}}}}}}}+{v}_{{{{{{\rm{i}}}}}}}\right)\right)\) was identified42. We shifted the object spectrum by \(\left({u}_{{{{{{\rm{i}}}}}}},{v}_{{{{{{\rm{i}}}}}}}\right)\) for individual images, which converted the acquired spectrum to \({\widetilde{O}}_{{{{{{\rm{M}}}}}}}\left(\frac{k}{d}{u}_{{{{{{\rm{r}}}}}}},\frac{k}{d}{v}_{{{{{{\rm{r}}}}}}}\right)\) for all choices of illumination cores. These spectra were added together, and an inverse Fourier transform was taken to obtain coherently accumulated object function \({O}_{M}\left(x,y\right)\). \(\left|{O}_{M}\left(x,y\right)\right \vert=\left|O\left(x,y\right)\right|\) is shown in Figs. 2j and 2m. See the Supplementary section 2 for details of the image processing.

Acquisition of the conventional endoscope images

Conventional endoscope images shown in Figs. 2h, k, 3c, and 4a were taken by bringing the tip of the fibre bundle into contact with the resolution target. This is equivalent to attaching a lens having the magnification of 1x to the tip of a fibre. A light-emitting diode (Model: M530L3, Thorlabs, wavelength: 530 nm) was inserted in the sample beam path in Fig. 1a and its output beam illuminated the entire fibre bundle. Reflected images were recorded with the same camera used for our endoscope imaging.

Preparation of the rat intestines

We extracted intestinal tissues from 1- to 2-day-old Sprague-Dawley rats. Both male and female SD rats were used. The upper part of the small intestine was quickly excised and fixed for 2–4 h at 4 °C in 4% paraformaldehyde. After fixation, the intestinal tissue was washed with phosphate-buffered saline solution and then mounted on a slide glass with immersion oil for imaging. All of the experimental procedures and protocols above were conducted in accordance with the guidelines established by the Committee of the Animal Research Policy at Korea University.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.