Introduction

Continued rapid advancements in algorithms and computer hardware have accelerated progress in automated computer vision and natural language processing. Combining these two factors with the availability of large, well-annotated datasets has produced significant advances in automated medical image interpretation for the detection of disease and critical findings1,2,3. The application of deep learning has the potential to increase diagnostic accuracy and reduce delays in diagnosis and treatment, leading to better patient outcomes4. Deep learning techniques are not limited to image analysis; they can also improve image reconstruction for magnetic resonance imaging (MRI)5,6, computed tomography (CT)7,8, and photoacoustic tomography (PAT)9. In particular, deep learning approaches have been used to improve image quality for low-dose CT reconstruction by interpolating sparse CT projection data10,11, denoising sparse-view reconstructed images7,8, or both12. These prior works demonstrated that deep learning is now a feasible alternative to well-established analytic and iterative methods of image reconstruction13,14,15,16,17.

However, most prior work using deep learning algorithms has focused either on the analysis of reconstructed images or on image reconstruction itself. Despite this human-centric approach, there is no reason that deep learning algorithms must operate in image-space. Since all the information in the reconstructed images is present in the raw measurement data, deep learning models could potentially derive features directly from raw data in sinogram-space without intermediary image reconstruction, possibly with even better performance than models trained in image-space.

In this study, we determined the feasibility of analyzing CT projection data (sinograms) with a deep learning approach for human anatomy identification and pathology detection. We proposed a customized convolutional neural network (CNN) called SinoNet, optimized for interpreting sinograms, and demonstrated its potential by comparing its performance to that of systems based on existing CNN architectures using reconstructed CT images. This approach could accelerate edge computing by making it possible to identify critical findings rapidly from raw data without a time-consuming image reconstruction process. It could also enable simplified scanner hardware designed for the direct detection of critical findings through SinoNet alone.

Results

Experimental design

We retrieved 200 consecutive whole-body CT datasets from combined positron emission tomography-computed tomography (PET/CT) examinations for body part recognition, and 720 non-contrast head CT scans for intracranial hemorrhage (ICH) detection, with IRB approval from the picture archiving and communication systems (PACS) at Massachusetts General Hospital. Axial slices in the 200 whole-body scans were annotated as one of sixteen body regions by a physician, and slices of the 720 head scans were annotated for the presence of ICH by consensus of a panel of five neuroradiologists (Methods). We evaluated twelve different classification models developed by training Inception-v318 on reconstructed CT images and SinoNet on sinograms (Table 1, Methods). The reconstructed CT images, containing Hounsfield units (HU), were converted to scaled linear attenuation coefficients (LAC). A two-dimensional (2D) parallel-beam Radon transform was applied to the LAC slices (512 × 512 pixels) to generate fully sampled sinograms with 360 projections and 729 detector pixels (‘sino360x729’). These were then uniformly subsampled in the horizontal direction (projection views) and averaged in the vertical direction (detector pixels) by factors of 3 and 9 to obtain moderately sampled sinograms with 120 views by 240 pixels (‘sino120x240’) and sparsely sampled sinograms with 40 views by 80 pixels (‘sino40x80’), respectively.
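
As a concrete illustration, a minimal Python sketch of the HU-to-LAC conversion and forward projection is given below; scikit-image's radon stands in for Matlab's, and the water attenuation value is an assumed constant (the 729-pixel detector count is specific to Matlab's implementation for 512 × 512 inputs):

```python
import numpy as np
from skimage.transform import radon

MU_WATER = 0.2  # cm^-1; assumed effective linear attenuation coefficient of water

def hu_to_lac(hu_slice):
    """Map Hounsfield units to scaled linear attenuation coefficients.
    Negative LACs are unphysical and are clipped to zero (see Methods)."""
    return np.clip(MU_WATER * (1.0 + hu_slice / 1000.0), 0.0, None)

# 360 projection views uniformly spaced over 180 degrees
theta = np.linspace(0.0, 180.0, num=360, endpoint=False)

def forward_project(ct_slice_hu):
    """Simulate a fully sampled sinogram from a 512 x 512 CT slice in HU.
    Matlab's radon yields 729 detector pixels for this input size; the row
    count from skimage differs slightly, so 729 is implementation-specific."""
    return radon(hu_to_lac(ct_slice_hu), theta=theta, circle=False)
```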

Table 1 Summary of the 12 different models evaluated in this study.

Original CT images were used as the fully sampled reconstructed images (‘recon360x729’), and images reconstructed from the sparser sinograms (‘recon120x240’ and ‘recon40x80’) were generated using a deep learning approach (FBPConvNet8) followed by conversion from LAC back to HU. Reconstructed CT images and sinograms with predefined window-level settings were also created to evaluate the effect of windowing: ‘wrecon360x729’, ‘wrecon120x240’, ‘wrecon40x80’; and ‘wsino360x729’, ‘wsino120x240’, ‘wsino40x80’ (Methods). Based on the scanning geometries and window-level settings described above, 12 CNN models were evaluated: 6 were developed by training Inception-v318 with reconstructed CT images and the other 6 by training SinoNet with sinograms (Table 1, Methods). Data for body part recognition were randomly split into training, validation, and test sets with balanced genders: 140 scans (female: n = 70; male: n = 70) for training, 30 (female: n = 15; male: n = 15) for validation, and 30 (female: n = 15; male: n = 15) for testing. The dataset for ICH detection was similarly split, with 478 scans for training, 121 for validation, and 121 for testing. Details of data preparation, CNN architecture, sinogram generation, and image reconstruction are described in Methods.

Results of body part recognition

Figure 1 shows the test performance of the twelve models for body part recognition. Models trained on fully sampled data had accuracies of 97.4% in image-space (I1), 96.6% in sinogram-space (S1), 97.9% in windowed-image-space (I2), and 97.4% in windowed-sinogram-space (S2). Models trained on moderately sampled data had accuracies of 97.4% in image-space (I3), 96.3% in sinogram-space (S3), 97.9% in windowed-image-space (I4), and 97.4% in windowed-sinogram-space (S4). Models trained on sparsely sampled data had accuracies of 97.1% in image-space (I5), 96.2% in sinogram-space (S5), 97.2% in windowed-image-space (I6), and 97.1% in windowed-sinogram-space (S6). These results indicate that models operating in image-space performed slightly better than sinogram-space (SinoNet) models for body part recognition, regardless of scanning geometry. Additionally, models trained on windowed inputs consistently outperformed those trained on full-range images or sinograms.

Figure 1
figure 1

Performance of 12 different models trained on reconstructed images and sinograms with varying numbers of projections and detectors for body part recognition. 95% confidence intervals (CIs) are indicated by black error bars. The purple and blue bars (I1–I6) compare the test accuracy of Inception-v3 trained with full-dynamic-range reconstructed images and with abdominal-window reconstructed images (window-level = 40 HU, window-width = 400 HU), respectively. The green and red bars (S1–S6) compare the performance of SinoNet models trained with sinograms generated from full-range and windowed reconstructed images, respectively.

Results of intracranial hemorrhage detection

Figure 2 depicts receiver operating characteristic (ROC) curves and the corresponding areas under the ROC curves (AUCs) for the twelve models for ICH detection. Models trained on fully sampled data had AUCs of 0.898 in image-space (I1), 0.918 in sinogram-space (S1), 0.972 in windowed-image-space (I2), and 0.951 in windowed-sinogram-space (S2). Models trained on moderately sampled data had AUCs of 0.893 in image-space (I3), 0.915 in sinogram-space (S3), 0.953 in windowed-image-space (I4), and 0.947 in windowed-sinogram-space (S4). Models trained on sparsely sampled data had AUCs of 0.885 in image-space (I5), 0.899 in sinogram-space (S5), 0.909 in windowed-image-space (I6), and 0.942 in windowed-sinogram-space (S6).

Figure 2
figure 2

ROC curves for the 12 different models trained with reconstructed images and sinograms at various sparsity configurations in numbers of projections and detectors. The purple and blue curves (I1–I6) correspond to the performance of Inception-v3 trained with reconstructed images with a full dynamic range of HU values and with a brain window setting (window-level = 50 HU, window-width = 100 HU), respectively. The green and red curves (S1–S6) show the performance of SinoNet models trained with sinograms generated from full-range and windowed reconstructed images, respectively. The AUCs for the 12 models are presented in the legends with their 95% CIs. Statistical significance of the difference between the AUCs of paired models (Ix - Sx) was evaluated. n.s., p > 0.05; *p < 0.05; **p < 0.01.

Comparison of SinoNet and Inception-v3 for analyzing sinograms

Table 2 details performance comparisons of Inception-v3 and SinoNet for interpreting fully sampled sinograms (360 projection views and 729 detector pixels) for both body part recognition and ICH detection. SinoNet models significantly outperformed Inception-v3 models in both tasks.

Table 2 Comparison of Inception-v3 and SinoNet network performance when both networks are trained on full-range sinograms at varying sampling densities for body part recognition and intracranial hemorrhage (ICH) detection.

Discussion

We have demonstrated that models trained on sinograms can achieve performance similar to that of models using conventional reconstructed images for body part recognition and ICH detection in all three scanning geometries, despite the fact that the raw measurement data are not interpretable by humans. SinoNet trained with sinograms performs comparably to Inception-v3 trained with reconstructed CT images for body part recognition, regardless of the number of projection views or detectors. For ICH detection, SinoNet trained with full-range sinograms outperformed Inception-v3 trained with full-dynamic-range reconstructed images at all three sampling densities, and SinoNet significantly outperformed Inception-v3 on windowed, sparsely sampled data. Applying window settings similar to those a radiologist would use significantly increased network performance, owing to the improved contrast of target to background (Fig. 3) in both image-space and sinogram-space. As depicted in Fig. 3(b), the key features relevant to hemorrhage are enhanced not only in the windowed CT image but also in the windowed sinogram.

Figure 3
figure 3

Examples of reconstructed images and sinograms with different labels for (a) body part recognition and (b) ICH detection. From left to right: original CT images, windowed CT images, sinograms with 360 projections by 729 detector pixels, and windowed sinograms (360 × 729). In the last row, an example CT with hemorrhage is annotated with a dotted circle in image-space, with the region of interest converted into the sinogram domain using the Radon transform. This area is highlighted in red on the sinogram in the fifth column.

SinoNet, our proposed convolutional neural network, was developed for analyzing sinograms using customized Inception modules with multi-scale convolutional and pooling layers18. In SinoNet, the square convolutional filters of the original Inception module were replaced by rectangular convolutional filters of various sizes, including width-wise (projection dominant) and height-wise (detector dominant) filters. This customized architecture significantly improved performance in both body part recognition and ICH detection compared with Inception-v3 models trained on sinograms, regardless of sampling density. These results imply that non-square filters may be effective in enabling models to learn the interplay between projection views and detector pixels from sinusoidal curves and to extract salient features from the sinogram domain for classification, a task considered impossible for human experts. This approach is similar to one proposed for learning temporal and frequency features from spectrograms using rectangular convolution filters19.

SinoNet, by operating in sinogram-space, can accelerate image interpretation for pathology detection because complex computations for image reconstruction are not required. SinoNet also excels when the projection data are moderately or sparsely sampled, maintaining an AUC of 0.942 on the hemorrhage detection task while Inception-v3 dropped from 0.972 to 0.909. The sparsely sampled results suggest that radiation dose could be markedly decreased with only a slight degradation in performance for sinogram-space algorithms. The number of projections correlates linearly with radiation dose, theoretically reducing the dose to one-third (a 67% reduction) for moderately sampled data and to one-ninth (an 89% reduction) for sparsely sampled data. Similarly, by reducing the size and number of detectors required for diagnostic CT data, cheaper and simpler CT scanners could be built. At our institution, the average head CT has a CTDIvol of 50 mGy; moderately and sparsely sampled acquisitions would therefore correspond to CTDIvol values between roughly 6 and 16 mGy. One possible use of this technique would be as a first-line screening tool in the field, without image reconstruction, to prioritize a patient for potential stroke therapy when there is no evidence of intracranial hemorrhage. A subsequent full-dose CT could then confirm the interpretation from the sinogram method. Another possible use would be to create “smart scanners” that adjust the protocol and field of view based on the intended region of the body.
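
Assuming dose scales linearly with the number of projection views, these estimates follow directly:

$$ D(N) = 50\,\mathrm{mGy} \times \frac{N}{360}, \qquad D(120) \approx 16.7\,\mathrm{mGy}, \qquad D(40) \approx 5.6\,\mathrm{mGy}. $$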

Although these results demonstrate the power of the sinogram-based approach, several important areas of future investigation remain. Because raw measurement data from CT scanners were unavailable, the sinograms used in this study were simulated by applying the 2D parallel-beam Radon transform to reconstructed CT images. Improved simulation data could be generated by accounting for more advanced projection geometries, such as cone-beam or fan-beam, and by adding Poisson noise to the projection data. Although SinoNet trained with windowed sinograms achieved comparable or better performance than models trained on windowed reconstructed images, the windowed sinograms were generated from reconstructed images that had been postprocessed with predefined window settings; generating windowed sinograms directly from CT measurement data is not straightforward, but it could be implemented using energy-resolving, photon-counting detectors from multi-energy CT imaging to acquire measurements in multiple energy bins20. Our work will need to be further validated with raw data from clinical scanners, including actual low-dose acquisitions, to determine whether performance remains robust despite increased image noise.

In conclusion, sinogram-space deep learning with our proposed CNN, SinoNet, is feasible for human anatomy identification and pathology detection (presence of ICH) on sinograms acquired with different scanning geometries, in terms of projections and detectors, even though such data are virtually uninterpretable to human experts. In particular, this study showed that SinoNet detected pathology directly from sparse sinograms better than models using reconstructed images, indicating the potential of deep learning to identify critical findings from raw data without expensive image reconstruction, for example for triage in field settings, especially where low radiation dose is required.

Methods

All the images were fully de-identified in compliance with the Health Insurance Portability and Accountability Act (HIPAA). This retrospective study was conducted with the approval of the Institutional Review Board (IRB) of Massachusetts General Hospital and under a waiver of informed consent. All experiments were performed in accordance with relevant guidelines and regulations.

Data collection and annotation

Body part recognition

A total of 200 contrast-enhanced PET/CT examinations of the head, neck, chest, abdomen, and pelvis from 100 female and 100 male patients, acquired between May 2012 and July 2012, were retrieved from our institutional PACS. A total of 56,334 axial slices in the CT scans were annotated as one of sixteen body regions by a physician (see Supplementary Fig. S1). Thirty cases (female: n = 15; male: n = 15) were randomly selected as validation data for hyperparameter tuning and model selection, another 30 cases (female: n = 15; male: n = 15) as test data for performance evaluation, and the remaining 140 cases (female: n = 70; male: n = 70) as training data for model development (Table 3).

Table 3 Distribution of training, validation, and test datasets for body part recognition.

Intracranial hemorrhage (ICH) detection

A total of 720 5-mm non-contrast head CT scans acquired between June 2013 and July 2017 were identified and retrieved from our PACS. Every 5-mm-thick axial slice (3,151 slices without ICH and 2,895 slices with ICH) was annotated for the presence of ICH by consensus of five board-certified neuroradiologists (blinded for review, 9 to 34 years of experience). The examinations comprised 201 cases without ICH and 519 cases with ICH, which were randomly split into training (478 cases), validation (121 cases), and test (121 cases) datasets at the case level to ensure slices from the same case were not split across different datasets (Table 4).

Table 4 Distribution of training, validation, and test datasets for ICH detection.

Sinogram generation

Simulated sinograms were used in this study instead of raw data obtained by commercial CT scanners because this was a retrospective analysis and raw projection data from patient CT scans could not be retrieved. To generate simulated sinograms, the pixel values of the 512 × 512 CT images stored in DICOM files were first converted into scaled linear attenuation coefficients (LACs). Any negative calculated LAC was set to zero, on the assumption that negative LACs are physically impossible and must represent random noise. Three different sinograms were then generated from the scaled LAC images. First, we computed sinograms with 360 projection views over 180 degrees and 729 detectors (‘sino360x729’) using the 2D parallel-beam Radon transform. ‘sino360x729’ was then used to produce sparser sinograms by uniformly subsampling projection views (in the horizontal direction) and averaging projection data from adjacent detectors (in the vertical direction) by factors of 3 and 9, yielding sinograms with 120 projection views and 240 detectors (‘sino120x240’) and sinograms with 40 projection views and 80 detectors (‘sino40x80’), respectively (Fig. 4). The sparser sinograms (‘sino40x80’, ‘sino120x240’) were resized to 360 × 729 pixels using bilinear interpolation to match the resolution of the corresponding full-view sinograms (‘sino360x729’).
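
A minimal sketch of the subsampling and detector-averaging step is shown below. It assumes the radon output layout (rows = detector pixels, columns = projection views) and assumes that edge detector rows are trimmed before averaging so the counts match the stated 240 and 80:

```python
import numpy as np
from skimage.transform import resize

def sparsify_sinogram(sino_full, factor, n_det_keep=720):
    """Subsample projection views and average adjacent detector pixels.

    sino_full: e.g. 729 x 360 for 'sino360x729' (detectors x views).
    factor=3 yields 'sino120x240'; factor=9 yields 'sino40x80'. Trimming
    729 detector rows to 720 before averaging is our assumption.
    """
    sparse = sino_full[:n_det_keep, ::factor]  # keep every factor-th view
    # average groups of `factor` adjacent detector pixels
    sparse = sparse.reshape(-1, factor, sparse.shape[1]).mean(axis=1)
    # resize back to the full-view resolution with bilinear interpolation
    return resize(sparse, sino_full.shape, order=1, mode="edge")

# sino120x240 = sparsify_sinogram(sino360x729, factor=3)
# sino40x80   = sparsify_sinogram(sino360x729, factor=9)
```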

Figure 4
figure 4

(a) Schematic of sinogram generation with 360 projection views and 729 detectors (‘sino360x729’) from original CT images (converted into linear attenuation coefficients). (b) Sparse sinograms were created from ‘sino360x729’ by downsampling in the horizontal dimension and signal averaging in the vertical dimension to simulate the effect of acquiring an image with 120 projection views and 240 detectors (‘sino120x240’) or an image with 40 projection views and 80 detectors (‘sino40x80’). R, Radon transform.

Image reconstruction

Reconstructed images were generated from the synthetic sinograms for models I1–I6. Original CT images were used as the reconstructed images for ‘recon360x729’, since fully sampled sinogram data can be completely reconstructed into images using filtered back projection (FBP). However, more complex algorithms, such as model-based iterative reconstruction, are needed to reconstruct high-quality images from sparser datasets. Rather than employing complex iterative algorithms, we implemented a deep learning approach to reconstruct images from sparsely sampled sinograms, as this technique has been shown to compare favorably with state-of-the-art iterative algorithms for sparse-view image reconstruction7,8. We implemented FBPConvNet, a modified U-net21 with multiresolution decomposition and residual learning, as proposed in prior work8. FBPConvNet takes FBP images reconstructed from the sparser sinograms (‘sino120x240’ or ‘sino40x80’) as inputs and is trained for regression between the input and the original CT image (converted into LACs) with mean square error (MSE) as the loss function (see Supplementary Fig. S2). Since the output images of FBPConvNet were LACs, they were converted into HU to produce the final reconstructed images. The sparser sinograms were resized to 360 × 729 pixels using bilinear interpolation so that the corresponding FBP images had a uniform resolution of 512 × 512 pixels, resulting in final reconstructed images of 512 × 512 pixels. The best FBPConvNet models, selected based on root mean square error (RMSE) on the validation data, were applied to ‘sino120x240’ and ‘sino40x80’ to generate ‘recon120x240’ and ‘recon40x80’, respectively. The RMSEs of images reconstructed by FBPConvNet on the validation dataset were much smaller than those of conventional FBP images (see Supplementary Table S1).
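
For illustration only, a deliberately shallow two-scale Keras sketch of the FBPConvNet idea follows; the actual network has five resolution levels and far more channels, so this is not the paper's implementation:

```python
from keras import layers
from keras.models import Model

def conv_block(x, filters):
    # two conv + batch-norm stages per resolution level
    for _ in range(2):
        x = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
        x = layers.BatchNormalization()(x)
    return x

def mini_fbpconvnet(input_shape=(512, 512, 1)):
    """Input: FBP image from a sparse sinogram. Output: regression of the
    fully sampled LAC image. The Add() skip from input to output implements
    residual learning (the net predicts only the correction to the FBP)."""
    inp = layers.Input(shape=input_shape)
    e1 = conv_block(inp, 32)
    e2 = conv_block(layers.MaxPooling2D(2)(e1), 64)
    b = conv_block(layers.MaxPooling2D(2)(e2), 128)
    d2 = conv_block(layers.Concatenate()(
        [layers.Conv2DTranspose(64, 2, strides=2, padding="same")(b), e2]), 64)
    d1 = conv_block(layers.Concatenate()(
        [layers.Conv2DTranspose(32, 2, strides=2, padding="same")(d2), e1]), 32)
    out = layers.Add()([layers.Conv2D(1, 1, padding="same")(d1), inp])
    model = Model(inp, out)
    model.compile(optimizer="adam", loss="mse")  # MSE regression, as in the paper
    return model
```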

Figure 5
figure 5

(a) Overall network architecture of SinoNet. (b) Detailed network diagram of the Inception modules, which include rectangular convolutional filters and pooling layers. The modified Inception module contains multiple rectangular convolution filters of varying sizes: height-wise rectangular filters (detector dominant) in red and width-wise rectangular filters (projection dominant) in orange; “Conv3x3/s2” indicates a convolutional layer with 3 × 3 filters and a stride of 2, and “Conv3x2” a convolution layer with 3 × 2 filters and a stride of 1. (c) Dense-Inception layers contain two densely connected Inception modules. (d) Transition modules situated between Dense-Inception modules reduce the size of the feature maps. Conv = convolution layer, MaxPool = max pooling layer, AvgPool = average pooling layer.

Windowed images and sinograms

We utilized full-range 12-bit grayscale images and windowed 8-bit grayscale images with window-levels (WL) and window-widths (WW) suitable for each task: an abdominal window (WL = 40 HU, WW = 400 HU) for body part recognition and a brain window (WL = 50 HU, WW = 100 HU) for ICH detection. Windowed sinograms were generated from the corresponding windowed CT images. Examples of windowed images and sinograms are shown in Supplementary Fig. S3.
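
Windowing amounts to clipping the HU values to the window range and rescaling to 8 bits; a minimal sketch:

```python
import numpy as np

def window_image(hu_img, level, width):
    """Apply a CT display window and quantize to 8-bit grayscale, e.g.
    brain window (level=50, width=100) clips HU to [0, 100];
    abdominal window (level=40, width=400) clips HU to [-160, 240]."""
    lo, hi = level - width / 2.0, level + width / 2.0
    clipped = np.clip(hu_img, lo, hi)
    return np.round(255.0 * (clipped - lo) / (hi - lo)).astype(np.uint8)
```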

Convolutional neural network for sinograms: SinoNet

A customized convolutional neural network, SinoNet, was designed for analyzing sinograms, using customized Inception modules with multiple convolutional and pooling layers and dense connections for efficient use of model parameters18,22. As shown in Fig. 5, the Inception module was modified with rectangular convolutional filters of various sizes. The non-square filters include height-wise (detector dominant) and width-wise (projection dominant) filters to enable efficient extraction of features from sinusoidal curves. Two Inception modules were densely connected to form a Dense-Inception block, which was followed by a Transition block that reduces the number and dimension of feature maps for computational efficiency, as suggested in the original report22. In this study, SinoNet was used only for interpreting sinograms.
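
A Keras sketch of such a module is shown below; the 1 × 7 and 7 × 1 filter shapes are illustrative placeholders rather than the exact SinoNet sizes, which are given in Fig. 5:

```python
from keras import layers

def rect_inception_module(x, filters):
    """Inception-style block with rectangular filters. With the radon layout
    (rows = detectors, columns = views), width-wise filters span projection
    views and height-wise filters span detector pixels."""
    b1 = layers.Conv2D(filters, (1, 1), padding="same", activation="relu")(x)
    b2 = layers.Conv2D(filters, (1, 7), padding="same", activation="relu")(x)  # width-wise
    b3 = layers.Conv2D(filters, (7, 1), padding="same", activation="relu")(x)  # height-wise
    b4 = layers.MaxPooling2D((3, 3), strides=1, padding="same")(x)
    b4 = layers.Conv2D(filters, (1, 1), padding="same", activation="relu")(b4)
    # a Dense-Inception block would additionally concatenate the module input x
    return layers.Concatenate()([b1, b2, b3, b4])
```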

Baseline convolutional neural network: Inception-v3

Inception-v318, a CNN validated for object recognition in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC)23, was selected as the network architecture for the classification models trained on reconstructed images. We modified Inception-v3 by replacing the last fully-connected layers with a sequence of a global average pooling (GAP) layer, a fully-connected layer, and a softmax layer with one output per category: 16 multi-class outputs for body part recognition and a binary output for ICH detection. Inception-v3 was also trained directly on sinograms as a baseline for evaluating SinoNet's performance on body part recognition and ICH detection.
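
In Keras, the modified head can be sketched as follows (the 3-channel input shape is an assumption, since keras.applications backbones default to 3-channel images; grayscale slices can be replicated across channels):

```python
from keras import layers
from keras.models import Model
from keras.applications import InceptionV3

def build_classifier(n_classes, input_shape=(512, 512, 3)):
    """Inception-v3 backbone with the replacement head described above:
    global average pooling -> fully connected -> softmax."""
    base = InceptionV3(include_top=False, weights=None, input_shape=input_shape)
    x = layers.GlobalAveragePooling2D()(base.output)
    out = layers.Dense(n_classes, activation="softmax")(x)
    return Model(base.input, out)

# body part recognition: build_classifier(16); ICH detection: build_classifier(2)
```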

Weight initialization

All models developed using Inception-v3 and SinoNet for the body part recognition task were initialized with He normal initialization24. For the ICH detection task, models were initialized with the corresponding weights pre-trained on body part recognition with the full-view scanning geometry. For example, the Inception-v3 model trained with ‘recon360x729’ for body part recognition provided the initial weights for all Inception-v3 ICH detection models trained with reconstructed images, across all scanning geometries and window settings. Similarly, SinoNet ICH detection models were initialized with the weights of the body part recognition SinoNet model trained with ‘sino360x729’.
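
Both initialization strategies can be sketched in Keras as follows (the helper name and weights file below are hypothetical):

```python
from keras import layers
from keras.initializers import he_normal

# He normal initialization, as used for all body part recognition models:
conv = layers.Conv2D(64, (3, 3), padding="same", activation="relu",
                     kernel_initializer=he_normal())

def warm_start(ich_model, weights_path):
    """Initialize an ICH model from the body-part model trained on the
    full-view geometry. by_name=True matches layers by name, so the
    mismatched classification head (16 outputs vs. 2) must be renamed or
    rebuilt before loading."""
    ich_model.load_weights(weights_path, by_name=True)
    return ich_model

# warm_start(sinonet_ich, "sinonet_bodypart_sino360x729.h5")  # hypothetical file
```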

Performance evaluation and statistical analysis

Test accuracy was used as the performance metric for comparing body part recognition models, and ROC curves with AUCs were used to evaluate the ICH detection models. All performance metrics were calculated using scikit-learn 0.19.2 in Python 2.7.12. A non-parametric approach (DeLong25) was used to assess the statistical significance of the difference between the AUCs of ICH detection models trained with reconstructed images and with sinograms, using Stata version 15.1 (StataCorp, College Station, Texas, USA). We employed a non-parametric bootstrap approach with 2,000 iterations to compute 95% CIs for the metrics, including test accuracy and AUC26.
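
A minimal sketch of the bootstrap CI computation with scikit-learn (resampling details beyond the 2,000 iterations are our assumptions):

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def bootstrap_auc_ci(y_true, y_score, n_boot=2000, alpha=0.05, seed=0):
    """Non-parametric bootstrap 95% CI for the AUC (2,000 resamples)."""
    rng = np.random.RandomState(seed)
    y_true, y_score = np.asarray(y_true), np.asarray(y_score)
    aucs = []
    for _ in range(n_boot):
        idx = rng.randint(0, len(y_true), len(y_true))
        if len(np.unique(y_true[idx])) < 2:  # resample must contain both classes
            continue
        aucs.append(roc_auc_score(y_true[idx], y_score[idx]))
    lo, hi = np.percentile(aucs, [100 * alpha / 2, 100 * (1 - alpha / 2)])
    return roc_auc_score(y_true, y_score), (lo, hi)
```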

Network training

Classification models for body part recognition and ICH detection were trained for 45 epochs using the Adam optimizer with default settings27 and a mini-batch size of 80. FBPConvNet models were trained for 100 epochs using the Adam optimizer with default settings and a mini-batch size of 20. The base learning rate of 0.001 was decayed by a factor of 10 every 15 epochs for the classification models and every 33 epochs for FBPConvNet. The best classification and FBPConvNet models were selected based on the validation loss.
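
The step decay schedule can be written as a Keras callback, for example:

```python
from keras.callbacks import LearningRateScheduler

def step_decay(base_lr=1e-3, drop=10.0, epochs_per_drop=15):
    """Base learning rate 0.001, divided by 10 every 15 epochs
    (use epochs_per_drop=33 for FBPConvNet)."""
    return LearningRateScheduler(
        lambda epoch: base_lr / drop ** (epoch // epochs_per_drop))

# classifier.fit(x, y, epochs=45, batch_size=80, callbacks=[step_decay()])
```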

Infrastructure

We used the radon and iradon functions in Matlab 2018a for generating sinograms and obtaining FBP reconstructed images, respectively. We used Keras (version 2.1.1) with a TensorFlow backend (version 1.3.0) as the framework for developing deep learning models, and performed experiments on an NVIDIA DevBox (Santa Clara, CA) equipped with four TITAN X GPUs with 12 GB of memory each.