Introduction

Neuroimaging in small animals has played an essential role in preclinical research by providing physiological, pathological, and functional insights that are key for understanding and treating neurological diseases. Over the past few decades, there have been significant advances in magnetic resonance imaging (MRI) and optical imaging techniques for structural and functional neuroimaging. For example, MRI can acquire high-resolution images of brain structures over large volumes, 3D connectivity and diffusivity information using diffusion tensor imaging, and brain activity using functional MRI1,2,3. However, MRI has poor temporal resolution and cannot be used to study fast hemodynamic mechanisms and responses. Optical imaging techniques can exploit the diverse biological molecules (e.g. hemoglobin, melanin, and lipids) – each possessing different optical properties – present in biological tissues to provide contrast for structural and functional imaging4,5,6. However, strong optical scattering limits the imaging depth of optical techniques to approximately 1–2 mm into the brain7.

Photoacoustic tomography (PAT) is an emerging non-invasive hybrid technique that has recently seen substantial growth in numerous preclinical biomedical applications and as a powerful clinical diagnostic tool8,9,10,11. In particular, there is a strong interest in PAT for preclinical structural and functional neuroimaging12,13,14,15,16. Given its unique use of light and sound, PAT combines the high contrast and molecular specificity of optical imaging with the high spatial resolution and centimeter-penetration depth of ultrasound imaging17,18,19. PAT has been demonstrated capable of kilohertz volumetric imaging rates, far exceeding the performance of other modalities, which enables new insights into previously obscure biological phenomena20. Diverse contrast agents, such as chemical dyes, fluorescent proteins, and nanoparticles, are also available to further enhance the imaging capabilities of PAT21,22.

PAT involves irradiating the biological tissue with a short-pulsed laser. Optical absorbers within the tissue are excited by the laser and undergo thermoelastic expansion, which results in the generation of acoustic waves23. A sensor array surrounding the tissue is then used to detect the acoustic waves, and an image is formed from the measured sensor data. PAT image reconstruction is a well-studied inverse problem that can be solved using analytical solutions, numerical methods (e.g. time reversal), and model-based iterative methods24,25,26,27,28. In general, a high-quality image can be reconstructed if the sensor array has a sufficiently large number of sensor elements and completely encloses the tissue. However, building an imaging system with these specifications is often prohibitively expensive, and in many in vivo applications such as neuroimaging, the sensor array typically can only partially enclose the tissue29,30. These practical limitations result in sparse spatial sampling and a limited view of the photoacoustic waves emanating from the medium. Reconstructing from sub-optimally acquired data causes streaking artifacts in the reconstructed PAT image that inhibit image interpretation and quantification31.

To address these issues, iterative methods are commonly employed to remove artifacts and improve image quality. These methods use an explicit model of photoacoustic wave propagation and seek to minimize a penalty function that incorporates prior information32,33,34. However, they are computationally expensive due to the need for repeated evaluations of the forward and adjoint operators, and the resulting image quality depends on the constraints imposed35,36.

Given the wide success of deep learning in computer vision, there is a strong interest in applying similar methods for tomographic image reconstruction problems37,38,39. Deep learning has the potential to be an effective and computationally efficient alternative to state-of-the-art iterative methods. Having such a method would enable improved image quality, real-time PAT image rendering, and more accurate image interpretation and quantification.

Among the many deep learning approaches for image reconstruction, post-processing reconstruction (Post-DL) is the most widely used and has been demonstrated for improving image reconstruction quality in CT40,41, MRI42, and PAT43,44,45,46,47,48. It was shown capable of achieving comparable or better performance than iterative methods for limited-view and sparse PAT image reconstruction45,49,50,51. In Post-DL, an initial inversion is used to reconstruct an image with artifacts from the sensor data. A convolutional neural network (CNN) is then applied as a post-processing step to remove artifacts and improve image quality. The main drawback of Post-DL is that the initial inversion does not properly address the issues of limited-view and sparse sampling, which results in an initial image with artifacts. Image features (e.g. small vessels) that are missing or obscured by artifacts are unlikely to be recovered by the CNN.

Previous works attempted to improve upon Post-DL by removing the need for an initial inversion step50,52. One approach termed direct reconstruction (Direct-DL) used a CNN to reconstruct an image directly from the sensor data52. The main challenge in using Direct-DL is the need to carefully select parameters (e.g. stride and kernel size) for each convolutional layer in order to transform the sensor data into the desired image dimensions. Changing either the dimensions of the input (e.g. using a different number of sensors) or output would require a new set of convolution parameters and the CNN architecture to be modified. Direct-DL was shown capable of reconstructing an image but underperformed compared to Post-DL. Interestingly, a hybrid approach using a combination of Post-DL and Direct-DL, where an initial inversion and the sensor data are given as inputs to the CNN, was shown to provide an improvement over using Post-DL alone53,54.

Another approach termed “model-based learning” similarly does not require an initial inversion step and achieves state-of-the-art image reconstruction quality50,55,56,57. This approach is similar to iterative reconstruction in that it uses an explicit model of photoacoustic wave propagation for image reconstruction. However, the prior constraints are not handcrafted and instead are learned by a CNN from training data. The improved performance comes at the cost of requiring more time to train the CNN and reconstruct an image50. Thus, the choice between model-based learning and direct learned approaches (e.g. Post-DL and Direct-DL) depends on whether the application prioritizes image reconstruction speed or quality.

In this work, we propose a novel approach termed pixel-wise deep learning (Pixel-DL) for limited-view and sparse PAT image reconstruction. Pixel-DL is a direct learned approach that employs pixel-wise interpolation to window relevant information, based on the physics of photoacoustic wave propagation, from the sensor data on a pixel-basis. The pixel-interpolated data is provided as an input to the CNN for image reconstruction. This strategy removes the need for an initial inversion and enables the CNN to utilize more information from the sensor data to reconstruct a higher quality image. The pixel-interpolated data has similar dimensions to the desired output image which simplifies CNN implementation. We compare Pixel-DL to conventional PAT image reconstruction methods (time reversal and iterative reconstruction) and direct learned approaches (Post-DL and a modified implementation of Direct-DL) with in silico experiments using several vasculature phantoms for training and testing.

Methods

Photoacoustic signal generation

The photoacoustic signal is generated by irradiating the tissue with a nanosecond laser pulse δ(t). Light absorbing molecules in the tissue undergo thermoelastic expansion and generate photoacoustic pressure waves23. Assuming negligible thermal diffusion and volume expansion during illumination, the initial photoacoustic pressure x can be defined as

$$x(r)=\Gamma (r)A(r)$$
(1)

where A(r) is the spatial absorption function and \(\Gamma (r)\) is the Grüneisen coefficient describing the conversion efficiency from heat to pressure58. The photoacoustic pressure wave p(r, t) at position r and time t can be modeled as an initial value problem for the wave equation, in which \(c_{0}\) is the speed of sound59.

$$(\partial_{tt}-c_{0}^{2}\Delta)p(r,t)=0,\quad p(r,t=0)=x,\quad \partial_{t}p(r,t=0)=0$$
(2)

Sensors located along a measurement surface \({S}_{o}\) measure a time-dependent signal. The linear operator \( {\mathcal M} \) acts on \(p(r,t)\) restricted to the boundary of the computational domain Ω over a finite time T and provides a linear mapping from the initial pressure x to the measured time-dependent signal y.

$$y=\mathcal{M}\,p{|}_{\partial \Omega \times (0,T)}=Ax$$
(3)

Photoacoustic image reconstruction

Time reversal is a robust reconstruction method that works well for homogeneous and heterogeneous media and for arbitrary detection geometries27,28. A PAT image is formed by running a numerical model of the forward problem backwards in time. This involves transmitting the measured sensor data in time-reversed order into the medium. Time reversal can reconstruct a high-quality image if the acoustic properties of the medium are known a priori and if the sensor array has enough detectors and fully encloses the tissue.

In this work, iterative reconstruction is used to recover the PAT image x from the measured signal y by solving the following optimization problem using the isotropic total variation (TV) constraint

$$x=\mathop{\mathrm{argmin}}\limits_{x^{\prime}}\,{\Vert y-Ax^{\prime}\Vert}^{2}+\lambda{|x^{\prime}|}_{TV}$$

where the parameter \(\lambda > 0\) is a regularization parameter32,36,60. The TV constraint is a widely employed regularization functional for reducing noise and preserving edges. Iterative reconstruction with a TV constraint works well in the case of simple numerical or experimental phantoms but often leads to sub-optimal reconstructions for images with more complex structures43.
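
To make the optimization concrete, below is a minimal numpy sketch of solving this problem by gradient descent on a smoothed isotropic TV functional. The forward operator `A`, its adjoint `At`, and the step-size, smoothing, and regularization values are illustrative placeholders, not the implementation used in this work (which follows refs 32,36,60).

```python
import numpy as np

def tv_grad(x, eps=1e-8):
    """Gradient of a smoothed isotropic TV functional |x|_TV."""
    dx = np.diff(x, axis=1, append=x[:, -1:])  # forward differences (horizontal)
    dy = np.diff(x, axis=0, append=x[-1:, :])  # forward differences (vertical)
    mag = np.sqrt(dx**2 + dy**2 + eps)         # smoothed gradient magnitude
    px, py = dx / mag, dy / mag
    # Gradient of TV is the negative divergence of the normalized gradient field
    div = (np.diff(px, axis=1, prepend=px[:, :1])
           + np.diff(py, axis=0, prepend=py[:1, :]))
    return -div

def tv_reconstruct(y, A, At, shape, lam=1e-3, step=1e-2, n_iters=200):
    """Minimize ||y - Ax'||^2 + lam * |x'|_TV by gradient descent.
    A and At are callables implementing the forward and adjoint operators."""
    x = np.zeros(shape)
    for _ in range(n_iters):
        grad = 2 * At(A(x) - y) + lam * tv_grad(x)
        x = np.clip(x - step * grad, 0, None)  # enforce non-negative pressure
    return x
```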

Deep learning

In this work, three different CNN-based deep learning approaches were used for limited-view and sparse PAT image reconstruction (Fig. 1). These direct learned approaches all began by applying an initial processing step to the PAT sensor data and then recovered the final PAT image using a CNN. The primary difference among these approaches was the processing step used to initially transform the PAT sensor data. In Post-DL, the sensor data was initially reconstructed into an image containing artifacts using time reversal, and the CNN was applied as a post-processing step for artifact removal and image enhancement. In Pixel-DL, pixel-wise interpolation was applied to window relevant information in the sensor data and to map that information into the image space. In the modified Direct-DL implementation (mDirect-DL), a combination of linear interpolation and downsampling was applied so that the interpolated sensor data had the same dimensions as the final PAT image.

Figure 1
figure 1

Summary of CNN-based deep learning approaches for PAT image reconstruction. The primary task is to reconstruct an essentially artifact-free PAT image from the acquired PAT sensor data. (a) PAT sensor data acquired using a sensor array with 32 sensors and semi-circle limited-view. (b) Initial image reconstruction with sparse and limited-view artifacts using time reversal for Post-DL. (c) 3D data array acquired after applying pixel-wise interpolation for Pixel-DL. (d) Sensor data interpolated to have matching dimensions as the final PAT image for mDirect-DL. (e) Desired artifact-free PAT image reconstruction from the CNN-based deep learning approaches.

CNN Architecture: fully dense UNet

After the sensor data was transformed, the final PAT image was recovered using the Fully Dense UNet (FD-UNet) CNN architecture (Fig. 2). The FD-UNet builds upon the UNet, a widely used CNN for biomedical imaging tasks, by incorporating dense connectivity into the contracting and expanding paths of the network61. This connectivity pattern enhances information flow between convolutional layers to mitigate learning redundant features and reduce overfitting62. The FD-UNet was demonstrated to be superior to the UNet for artifact removal and image enhancement in 2D sparse PAT47.
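
To illustrate the dense connectivity pattern, below is a minimal Keras-style sketch of a single dense block; the layer count, normalization placement, and function names are illustrative assumptions rather than the exact FD-UNet configuration of ref. 47.

```python
import tensorflow as tf

def dense_block(x, growth_rate, n_layers=4):
    """Dense block: each convolution receives the concatenation of all
    preceding feature-maps, improving information flow between layers."""
    features = [x]
    for _ in range(n_layers):
        h = tf.keras.layers.Concatenate()(features) if len(features) > 1 else x
        h = tf.keras.layers.BatchNormalization()(h)
        h = tf.keras.layers.Activation("relu")(h)
        h = tf.keras.layers.Conv2D(growth_rate, 3, padding="same")(h)
        features.append(h)  # each new feature-map is reused by later layers
    return tf.keras.layers.Concatenate()(features)
```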

Figure 2
figure 2

FD-UNet CNN Architecture. The FD-UNet CNN with hyperparameters of initial growth rate, \({k}_{1}=16\) and initial feature-maps learned, \({f}_{1}=128\) is used for PAT image reconstruction. Essentially the same CNN architecture was used for each deep learning approach except for minor modifications. (a) Inputs into the CNN for each deep learning approach. The Post-DL CNN implementation used residual learning which included a skip connection between the input and final addition operation. The initial Pixel-DL input contains “N” feature-maps corresponding to the number of sensors in the imaging system. (b) The FD-UNet is comprised of a contracting and expanding path with concatenation connections. (c) The output of the CNN is the desired PAT image. In Post-DL, residual learning is used to acquire the final PAT image.

Pixel-Wise interpolation

Pixel-wise interpolation uses a model of photoacoustic wave propagation to map the measured time series pressure in the sensor data to the pixel position within the image reconstruction grid that the signal likely originated from. In this work, we chose to apply pixel-wise interpolation using a linear model of photoacoustic wave propagation since the in silico experiments were performed using a homogeneous medium (e.g. uniform density and speed of sound). The linear model assumes the acoustic waves propagate spherically and travel at a constant speed of sound. Based on these assumptions, the time-of-flight can be easily calculated for a pressure source originating at some position in the medium and traveling to a sensor located on the medium boundary.

Reconstructing an image begins by defining an image reconstruction grid that spans the region of interest in the imaging system (Fig. 3a). The goal of pixel-wise interpolation is to map the time series pressure measurements of each sensor to the defined reconstruction grid on a pixel-basis, which results in a 3D data array with dimensions corresponding to the 2D image space and sensor number (Fig. 3b,c). This is achieved by repeating the following interpolation process for each sensor in the sensor array (Fig. 3d–f). The time-of-flight for a signal originating from each pixel position and traveling to the selected sensor is calculated based on a model of photoacoustic wave propagation. In the case of a linear model, the time-of-flight is proportional to the distance between the selected pixel and sensor (Fig. 3e). Pressure measurements in the sensor data are interpolated onto the reconstruction grid using the calculated time-of-flight for each pixel (Fig. 3f).
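
A minimal numpy sketch of this process, under the stated linear-propagation assumption, is given below; the function and argument names are illustrative rather than the implementation used in this work.

```python
import numpy as np

def pixel_wise_interpolation(sensor_data, sensor_pos, grid_xy, c, dt, t0=0.0):
    """Map each sensor's time series onto the reconstruction grid.

    sensor_data: (n_sensors, n_samples) time series pressure measurements
    sensor_pos:  (n_sensors, 2) sensor coordinates
    grid_xy:     (H, W, 2) pixel coordinates of the reconstruction grid
    Returns an (H, W, n_sensors) pixel-interpolated data array.
    """
    n_sensors, n_samples = sensor_data.shape
    t = t0 + dt * np.arange(n_samples)          # sampling times
    out = np.empty(grid_xy.shape[:2] + (n_sensors,))
    for i in range(n_sensors):
        # Linear model: time-of-flight is distance over speed of sound
        dist = np.linalg.norm(grid_xy - sensor_pos[i], axis=-1)
        tof = dist / c
        # Interpolate the measured pressure at each pixel's time-of-flight
        out[..., i] = np.interp(tof, t, sensor_data[i], left=0.0, right=0.0)
    return out
```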

Figure 3
figure 3

Pixel-Wise Interpolation Process. (a) Schematic of the PAT system for imaging the vasculature phantom. The red semi-circle represents the sensor array, and the gray grid represents the defined reconstruction grid. The first sensor (S1) is circled and used as an example for applying pixel-wise interpolation to a single sensor. (b) The PAT time series pressure sensor data measured by the sensor array. (c) Resulting pixel-interpolated data after applying pixel-wise interpolation to each sensor based on the reconstruction grid. (d) Sensor data for S1. Color represents the time at which a pressure measurement was taken and is included to highlight the use of time-of-flight to map the sensor data to the reconstruction grid. (e) Calculated time-of-flight for a signal originating at each pixel position and traveling to S1. (f) Pressure measurements are mapped from the S1 sensor data to the reconstruction grid based on the calculated time-of-flight for each pixel.

Deep learning implementation

The CNNs were implemented in Python 3.6 with TensorFlow v1.7, an open source library for deep learning63. Training and evaluation of the networks were performed on a GTX 1080Ti NVIDIA GPU. The CNNs were trained using the Adam optimizer to minimize the mean squared error loss with an initial learning rate of 1e-4 and a batch size of three images for 40 epochs. Training each CNN required approximately one hour to complete. Pairs of training data \(\{{x}_{i},\,{y}_{i}\}\) were provided to the CNN during training, where \({x}_{i}\) represents the input data (e.g. initial time reversal reconstruction, pixel-interpolated sensor data, or interpolated sensor data) and \({y}_{i}\) represents the corresponding artifact-free ground truth image. A separate CNN was trained for each CNN-based approach, imaging system configuration, and training dataset.
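
For reference, the stated training settings map onto the following modern Keras-style sketch; the stand-in model and randomly generated training pairs are placeholders for the FD-UNet and datasets described in this work (the original implementation used the lower-level TensorFlow v1.7 API).

```python
import numpy as np
import tensorflow as tf

# Stand-in model; the actual network was the FD-UNet described above.
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(16, 3, padding="same", activation="relu",
                           input_shape=(128, 128, 1)),
    tf.keras.layers.Conv2D(1, 1, padding="same"),
])
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
              loss="mse")  # mean squared error loss, as stated above

# Dummy stand-ins for the training pairs {x_i, y_i}
x_train = np.random.rand(30, 128, 128, 1).astype("float32")
y_train = np.random.rand(30, 128, 128, 1).astype("float32")
model.fit(x_train, y_train, batch_size=3, epochs=40)
```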

Photoacoustic data for training and testing

Training data were procedurally generated using data augmentation, where new images were created based on a 340 × 340 pixel image of a synthetic vasculature phantom generated in MATLAB (Fig. 3a). First, scaling and rotation were applied to the initial phantom image with a randomly chosen scaling factor (0.5 to 2) and rotation angle (0–359 degrees). Then a 128 × 128 pixel sub-image was randomly chosen from the transformed image and translated by a random vertical and horizontal shift (0–10 pixels) via zero-padding. Outputs from multiple iterations (up to five) of the data augmentation process were summed together to create a training image. The synthetic vasculature phantom dataset was comprised of 500 training images. Testing data were generated from a 3D micro-CT mouse brain vasculature volume64 with a size of 260 × 336 × 438 pixels. The Frangi vesselness filter was applied to suppress background noise and enhance vessel-like features65. A new image was created from the filtered volume by generating a maximum-intensity projection of a randomly chosen 128 × 128 × 128 pixel sub-volume. The mouse brain vasculature dataset was comprised of 50 testing images.
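
A sketch of this augmentation pipeline is shown below, assuming scipy-based transforms; the cropping and zero-padded shift details are illustrative choices rather than the exact MATLAB procedure.

```python
import numpy as np
from scipy import ndimage

rng = np.random.default_rng()

def augment(phantom, out_size=128, max_layers=5):
    """Procedurally generate one training image from the base phantom."""
    image = np.zeros((out_size, out_size))
    for _ in range(rng.integers(1, max_layers + 1)):
        scale = rng.uniform(0.5, 2.0)            # random scaling factor
        angle = rng.uniform(0, 360)              # random rotation angle
        t = ndimage.rotate(ndimage.zoom(phantom, scale), angle, reshape=False)
        # Random 128 x 128 crop from the transformed image
        r = rng.integers(0, t.shape[0] - out_size + 1)
        c = rng.integers(0, t.shape[1] - out_size + 1)
        crop = t[r:r + out_size, c:c + out_size]
        # Random 0-10 pixel shift implemented via zero-padding
        sub = np.zeros_like(crop)
        dy, dx = rng.integers(0, 11, size=2)
        sub[dy:, dx:] = crop[:out_size - dy, :out_size - dx]
        image += sub                             # sum the augmented layers
    return image
```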

The “High-Resolution Fundus Image Database” is a public database that contains 45 fundus images from human subjects who were healthy, had glaucoma, or had diabetic retinopathy. The images have corresponding vessel segmentation maps created by a group of experts and clinicians within the field of retinal image analysis66. The 45 fundus images were split into separate training (N = 15) and testing (N = 30) sets. The training dataset was procedurally generated using data augmentation based on the images within the training set and was comprised of 500 training images. The testing dataset was comprised of the original 30 images and 20 additional images, generated using data augmentation based on images from the testing set, for a total of 50 testing images.

The “ELCAP Public Lung Image Database” is a public database that contains 50 low-dose whole-lung CT scans obtained within a single breath hold67. The whole-lung volumes were split into a training (N = 15) and testing set (N = 35). Vessel-like structures were segmented from the whole-lung CT volumes using the Frangi vesselness filter65. The training dataset was then generated by taking maximum-intensity projection (MIP) images of randomly sampled sub-volumes from the filtered volumes in the training set. Data augmentation was also applied to the MIPs to generate a training dataset comprised of 500 training images. With the same procedures, MIPs were taken from the filtered volumes in the testing set to create a testing dataset comprised of 50 images.
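
The filtering-and-projection step could be reproduced with a short scikit-image sketch like the one below; the use of `skimage.filters.frangi` and the sampling details are assumptions for illustration.

```python
import numpy as np
from skimage.filters import frangi

def make_training_mip(volume, rng, size=128):
    """Maximum-intensity projection of a random filtered sub-volume."""
    # Enhance bright, vessel-like structures in the CT volume
    filtered = frangi(volume, black_ridges=False)
    z, y, x = [rng.integers(0, s - size + 1) for s in filtered.shape]
    sub = filtered[z:z + size, y:y + size, x:x + size]
    return sub.max(axis=0)  # project along the depth axis
```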

In all three cases (mouse-brain vasculature, fundus image database, and ELCAP Lung database), training and testing data were completely segregated. In the latter two experiments, significant variations were present between the training and testing datasets due to patient-to-patient variability and innate differences in vascular morphology between healthy subjects and patients with varying degrees of disease.

A MATLAB toolbox, k-Wave, was used to simulate photoacoustic data acquisition using an array of acoustic sensors68. Photoacoustic simulations in the k-Wave toolbox are implemented using a pseudospectral approach69. Each training and testing image was normalized (values between 0 and 1) and treated as a photoacoustic source distribution on a computation grid of 128 × 128 pixels. The medium was assumed to be non-absorbing and homogeneous with a speed of sound of 1500 m/s and density of 1000 kg/m3. The sensor array had 16, 32, or 64 sensor elements equally spaced on a semi-circle with a diameter of 120 pixels. The time reversal method in the k-Wave toolbox was also used for reconstructing an image from the simulated photoacoustic time series data.
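
Although the simulations were run in MATLAB with k-Wave, the stated sensor geometry is straightforward to reproduce; a numpy sketch of the semi-circular array is given below (the grid center and array orientation are assumed details), and its output can be passed directly to the pixel-wise interpolation sketch above.

```python
import numpy as np

def semicircle_sensors(n_sensors, center=(64.0, 64.0), radius=60.0):
    """Sensor coordinates (in pixels) equally spaced on a semi-circle.
    radius=60 corresponds to the stated 120-pixel diameter."""
    angles = np.linspace(0.0, np.pi, n_sensors)
    cx, cy = center
    return np.stack([cx + radius * np.cos(angles),
                     cy + radius * np.sin(angles)], axis=-1)

sensor_pos = semicircle_sensors(32)  # e.g. the 32-sensor configuration
```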

Reconstructed images were compared against the ground truth using the peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM) as metrics for image quality. PSNR provides a global measurement of image quality, while SSIM provides a local measurement that accounts for similarities in contrast, luminance, and structure70.
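
One common way to compute these metrics (the implementation used here is not specified) is with scikit-image:

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

# `truth` and `recon` stand in for a ground truth image and its reconstruction,
# both normalized to [0, 1] as in the experiments.
truth = np.random.rand(128, 128)
recon = truth + 0.05 * np.random.randn(128, 128)

psnr = peak_signal_noise_ratio(truth, recon, data_range=1.0)
ssim = structural_similarity(truth, recon, data_range=1.0)
print(f"PSNR: {psnr:.2f} dB, SSIM: {ssim:.3f}")
```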

Disclaimer

Author S.G.’s affiliation with The MITRE Corporation is provided for identification purposes only and is not intended to convey or imply MITRE’s concurrence with, or support for, the positions, opinions or viewpoints expressed by the author. Approved for Public Release; Distribution Unlimited. Case Number 18-4405. ©2019 The MITRE Corporation. All Rights Reserved.

Results

Conventional PAT image reconstruction techniques (e.g. time reversal and iterative reconstruction) and CNN-based approaches (Post-DL, Pixel-DL, and mDirect-DL) were compared across several in silico experiments in terms of image reconstruction quality and reconstruction time. CNN-based approaches were all implemented using the FD-UNet CNN architecture. Reconstructed images were compared to the ground truth image using PSNR and SSIM as quantitative metrics for image reconstruction quality.

Mouse brain vasculature experiment

In the first experiment, the CNNs were trained on the synthetic vasculature phantom dataset and tested on the mouse brain vasculature dataset. Although both datasets contained images of vasculature, they were non-matched, meaning the testing dataset likely contained image features (e.g. vessel connectivity patterns) not present in the training dataset. In addition to evaluating the CNNs’ performance, this experiment sought to determine whether the CNNs generalized when trained on the synthetic vasculature phantom dataset and tested on the mouse brain dataset.

The time reversal reconstructed images had severe artifacts blurring the image and the lowest average PSNR and SSIM for all sparsity levels (Fig. 4 and Table 1). Images reconstructed with iterative or a CNN-based method had fewer artifacts and a higher average PSNR and SSIM. Vessels obscured by artifacts in the time reversal reconstructed images were more visible in the other reconstructed images. As expected, increasing the number of sensors resulted in fewer artifacts and improved image quality for all PAT image reconstruction methods. Pixel-DL consistently had a higher average PSNR and SSIM than Post-DL for all sparsity levels and similar scores to iterative reconstruction.

Figure 4
figure 4

Limited-view and sparse PAT image reconstruction of mouse brain vasculature. PAT sensor data acquired with a semi-circle limited-view sensor array at varying sparsity levels. (a) Ground truth image used to simulate PAT sensor data. (b) PAT reconstructions with 16 sensors. Vessels are difficult to identify in time reversal reconstruction as a result of artifacts. (c) PAT reconstructions with 32 sensors. Vessels can be clearly seen in CNN-based and iterative reconstructions. (d) PAT reconstructions with 64 sensors. Larger vessels are identifiable in all reconstructed images.

Table 1 Average PSNR and SSIM for Micro-CT Mouse Brain Vasculature Testing Dataset (N = 50 testing images).

In the case of sparse sampling (especially with 16 sensors), Post-DL often introduced additional vessels that were not originally in the ground truth image (Fig. 4a,b). This was likely due to the CNN misinterpreting strong artifacts in the input image as real vessels. Pixel-DL exhibited a similar behavior but typically had fewer false additional vessels. This issue was not as prevalent in images reconstructed using the iterative method. However, images reconstructed using iterative reconstruction had an overly smoothed appearance compared to the deep learning-based reconstructed images, a pattern commonly observed when using the total variation constraint.

Pixel-DL consistently outperformed time reversal in reconstructing images of the synthetic vasculature and mouse brain vasculature (Fig. 5). Interestingly, mDirect-DL only outperformed time reversal in reconstructing the synthetic vasculature images, which were used to train the CNN. The mDirect-DL reconstructed image of mouse brain vasculature resembled the ground truth image but was substantially worse than the time reversal reconstruction. This indicated that the CNN learned a mapping from the PAT-sensor data to the image space but severely overfitted to the training data. During training, the CNNs for Pixel-DL and mDirect-DL converged to a minimum mean squared error, but the Pixel-DL CNN converged to a lower error.

Figure 5
figure 5

Limited-view and sparse Pixel-DL and mDirect-DL PAT image reconstructions. PAT sensor data acquired with 32 sensors and a semi-circle view. (a) CNNs were trained and tested on images of the synthetic vasculature phantom. Both CNN-based approaches successfully reconstructed the example synthetic vasculature phantom image. (b) CNNs were trained on images of the synthetic vasculature phantom but tested on mouse brain vasculature images. mDirect-DL failed to reconstruct the example mouse brain vasculature image and performed worse than time reversal.

Lung and fundus vasculature experiment

In the second experiment, the CNNs were trained and tested on the lung vasculature and fundus vasculature datasets. This experiment represented a scenario in which the training and testing datasets are derived from segregated anatomical image data. There were natural differences between the training and testing datasets since the original images were acquired from healthy patients and those with varying disease severity.

As expected, the time reversal reconstructed images of lung and fundus vasculature had the most artifacts and the lowest average PSNR and SSIM for all sparsity levels (Fig. 6 and Table 2). Images reconstructed with a CNN-based method or iterative reconstruction had fewer artifacts and a higher average PSNR and SSIM. Pixel-DL consistently outperformed Post-DL for both vasculature phantoms at all sparsity levels. Compared to iterative reconstruction, Pixel-DL had similar performance on the fundus vasculature dataset and outperformed it on the lung vasculature dataset. For images reconstructed from PAT sensor data acquired using 16 sensors, Pixel-DL reconstructed images appeared sharper and were qualitatively superior to iteratively reconstructed images despite having similar SSIM and PSNR values.

Figure 6
figure 6

Limited-view and sparse PAT image reconstructions of fundus and lung vasculature. PAT sensor data acquired with 32 sensors and a semi-circle view. (a) CNNs were trained and tested on images of lung vasculature. (b) CNNs were trained and tested on images of fundus vasculature. Testing images were derived from a separate set of patients’ lung and fundus images than the training images.

Table 2 Average PSNR and SSIM for Lung and Fundus Vasculature Testing Dataset (N = 50 testing images).

Image reconstruction times

The average reconstruction times reported for each method are for reconstructing a single image from the PAT sensor data. Time reversal is a robust and computationally inexpensive reconstruction method (~2.57 seconds per image). Iterative reconstruction removed most artifacts and improved image quality but had a much longer average reconstruction time (~491.21 seconds per image). Pixel-DL reconstructed images with similar quality to iterative reconstruction and was faster by over a factor of 1,000 (~7.9 milliseconds per image). The average reconstruction time for Post-DL depends on the initial inversion used, since the computational cost of a forward pass through a CNN is essentially negligible. Since time reversal was used as the initial inversion, Post-DL had a longer average reconstruction time than Pixel-DL (~2.58 seconds per image).

Discussion

In this work, we propose a novel deep learning approach termed Pixel-DL for limited-view and sparse PAT image reconstruction. We performed in silico experiments using training and testing data derived from multiple vasculature phantoms to compare Pixel-DL with conventional PAT image reconstruction methods (time reversal and iterative reconstruction) and direct learned approaches (Post-DL and mDirect-DL). Results showed that Pixel-DL consistently outperformed time reversal, Post-DL, and mDirect-DL across all experiments. Pixel-DL generalized well, as evidenced by its performance comparable to iterative reconstruction on the mouse brain vasculature phantom despite having been trained only on images generated from a synthetic vasculature phantom with data augmentation. Having a more varied training dataset may further improve CNN generalization and performance. When the training and testing data were derived from segregated anatomical data, Pixel-DL had similar performance to iterative reconstruction for the fundus vasculature phantom and outperformed it for the lung vasculature phantom. The total variation constraint used for iterative reconstruction was likely suboptimal for reconstructing lung vasculature images since the lung vessels were small and closely grouped.

Comparison between deep learning approaches

The CNN architecture and hyperparameters used for all deep learning approaches implemented were essentially the same. Thus, discrepancies in performance between the approaches were primarily due to their respective inputs into the CNN (Fig. 4). In Post-DL, the input was an image initially reconstructed from the sensor data using time reversal. The input and output to the CNN are both conveniently images of the same dimensions. This removed the need for the CNN to learn the physics required to map the sensor data into the image space. However, the initial inversion did not properly address the issues of limited-view and sparse sampling which resulted in an initial image with artifacts. Moreover, the CNN no longer had access to the sensor data and was only able to use information contained in the image to remove artifacts. There was likely useful information in the sensor data for more accurately reconstructing the PAT image, which was ignored in this approach.

In Pixel-DL, the initial inversion is replaced with pixel-wise interpolation, which similarly provides a mapping from the sensor data to image space. Relevant sensor data is windowed on a pixel-basis using a linear model of acoustic wave propagation. This enables the CNN to have a richer information source to reconstruct higher quality images. Furthermore, there is no initial inversion introducing artifacts; thus, the CNN does not have an additional task of learning to remove those artifacts.

mDirect-DL similarly did not require an initial inversion and instead used the full sensor data as an input to the CNN to reconstruct an image. The potential advantage of mDirect-DL is that the CNN had full access to the information available in the sensor data to reconstruct a high-quality image. However, reconstructing directly from the sensor data was also a more difficult task because the CNN needed to additionally learn a mapping from the sensor data into the image space. Results showed that the CNN had difficulty in learning a generalizable mapping and overfitted to the training data (Fig. 5). The FD-UNet was likely not an optimal architecture for this task since it was designed assuming the input was an image. A different neural network architecture for a multidimensional time-series input would be better suited.

A limitation of Post-DL and Pixel-DL for sparse and limited-view PAT is that the reconstructed image can contain additional vessels that are not in the ground truth image. This can be problematic depending on the requirements of the application. Large vessels and structures are often reliably reconstructed in the image, but some small vessels could be false additions. This limitation primarily occurred at the sparsest sampling level and could be addressed by increasing the number of sensors used for imaging. The loss function could also be modified to penalize the CNN for reconstructing false additional vessels, but this could lead the CNN to preferentially omit small vessels. Alternatively, a model-based learning approach could be used for better image quality if computational cost is not a limitation.

Deep learning for in vivo imaging

A key challenge in applying deep learning for in vivo PAT image reconstruction is that a large training dataset is required for the CNN to learn to remove artifacts and improve image quality. The training data can be acquired experimentally using a PAT imaging system that has a sufficient number of sensors and a full view of the imaging target. However, this process is often infeasible because it is prohibitively expensive, time-consuming, and must be repeated whenever the imaging system configuration or imaging target is changed. Alternatively, synthetic training data can be generated using numerical phantoms or images from other modalities. In combination with data augmentation techniques, this approach enables arbitrarily large synthetic training datasets to be created. However, CNN image reconstruction quality is largely dependent on the degree to which the simulations used to generate the training data match actual experimental conditions. Properly matching the simulation is a non-trivial task that requires the PAT imaging system to be well characterized and understood. Some factors to be considered when creating the simulations include: sensor properties (e.g. aperture size, sensitivity, and directivity), sensor configuration, laser illumination, and medium heterogeneities. Generally, it is preferable to closely match the simulation to the experimental conditions, but post-processing (e.g. filtering and denoising) can also be applied to the experimental data. It is beyond the scope of this work to discuss the impact of each factor in detail, but the issue of medium heterogeneities, specifically for speed of sound, is examined.

In this work, Pixel-DL was applied using a linear model of acoustic wave propagation that assumes the acoustic waves propagate spherically and travel at a constant speed of sound throughout the medium. Although this model was sufficient for the case of a homogeneous medium, a different model would be needed if the medium were heterogeneous (e.g. in speed of sound and density), such as for in vivo imaging. Naively reconstructing with these assumptions for heterogeneous media would result in additional artifacts that degrade image quality and potentially impact CNN performance. The severity of the artifacts would depend on the degree of mismatch between the heterogeneity and the assumed value. If the distribution of the heterogeneities or acoustically reflective surfaces is known, then it can be accounted for during the time-of-flight calculations when applying pixel-wise interpolation. If it is not known, then the CNN should be trained on data containing examples of heterogeneous media similar to those anticipated during image reconstruction. This would enable the CNN to learn to compensate for the artifacts introduced by applying pixel-wise interpolation with a linear model of acoustic wave propagation when the medium is not actually homogeneous.

Deep learning for fast image reconstruction

The proposed Pixel-DL approach can be used as a computationally efficient method for improving PAT image quality under limited-view and sparse sampling conditions. It can be readily applied to a wide variety of PAT imaging applications and configurations. Pixel-DL enables the development of more efficient data acquisition approaches. For example, PAT imaging systems could be built with fewer sensors without sacrificing image quality, which would make the technology more affordable. Pixel-DL achieved similar or better performance than iterative reconstruction and was faster by over a factor of 1,000. This speed would allow for real-time PAT image rendering, which would provide valuable feedback during image acquisition.

In this work we have demonstrated in silico the feasibility of Pixel-DL for PAT imaging of vasculature-like targets. This approach can also be readily applied to ultrasound imaging. Image reconstruction for both PAT and ultrasound imaging relies largely on time-of-flight calculations to determine where a signal originated. Therefore, a similar linear model of acoustic wave propagation can be used to readily apply Pixel-DL to ultrasound image reconstruction problems. Pixel-DL can also be adapted to other imaging modalities if a model mapping the sensor data to the image space is available.