Deep learning synthesis of cone-beam computed tomography from zero echo time magnetic resonance imaging

Choi, Hyeyeon; Yun, Jong Pil; Lee, Ari; Han, Sang-Sun; Kim, Sang Woo; Lee, Chena

doi:10.1038/s41598-023-33288-8

Download PDF

Article
Open access
Published: 13 April 2023

Deep learning synthesis of cone-beam computed tomography from zero echo time magnetic resonance imaging

Hyeyeon Choi¹^na1,
Jong Pil Yun²^na1,
Ari Lee³,
Sang-Sun Han³,
Sang Woo Kim¹ &
…
Chena Lee^3,4

Scientific Reports volume 13, Article number: 6031 (2023) Cite this article

1559 Accesses
3 Citations
Metrics details

Subjects

Abstract

Cone-beam computed tomography (CBCT) produces high-resolution of hard tissue even in small voxel size, but the process is associated with radiation exposure and poor soft tissue imaging. Thus, we synthesized a CBCT image from the magnetic resonance imaging (MRI), using deep learning and to assess its clinical accuracy. We collected patients who underwent both CBCT and MRI simultaneously in our institution (Seoul). MRI data were registered with CBCT data, and both data were prepared into 512 slices of axial, sagittal, and coronal sections. A deep learning-based synthesis model was trained and the output data were evaluated by comparing the original and synthetic CBCT (syCBCT). According to expert evaluation, syCBCT images showed better performance in terms of artifacts and noise criteria but had poor resolution compared to the original CBCT images. In syCBCT, hard tissue showed better clarity with significantly different MAE and SSIM. This study result would be a basis for replacing CBCT with non-radiation imaging that would be helpful for patients planning to undergo both MRI and CBCT.

Physics-informed Deep Learning for Dual-Energy Computed Tomography Image Processing

Article Open access 27 November 2019

Cycloidal CT with CNN-based sinogram completion and in-scan generation of training data

Article Open access 18 January 2022

Competitive performance of a modularized deep neural network compared to commercial algorithms for low-dose CT image reconstruction

Article 10 June 2019

Introduction

The innovation of cone-beam computed tomography (CBCT) has revolutionized the aspect of treatment in dentistry. CBCT is preferred over multi-slice CT (MSCT) in the dental field because it uses micro-unit-sized isotropic voxels, minimizing image distortion when reconstructed in non-orthogonal directions, such as the tooth axis or cross-section of the dental arch¹. The high-resolution aspect of CBCT, compared to MSCT, also facilitates the assessment of minute structures in the desired direction^2,3. However, CBCT images cannot analyze soft tissue density due to poor contrast with artifacts and noise. In cases when it is necessary to diagnose both soft and hard tissue pathology, such as in temporomandibular disease, magnetic resonance imaging (MRI) should be acquired for soft tissue diagnosis and CBCT.

Radiation-free imaging, especially magnetic resonance imaging (MRI), is advancing rapidly with the recent introduction of bone MRI sequences. Hilgenfield et al.^4,5 reported that MRI-based implant planning was reliable and sufficiently accurate. Although the diagnostic accuracy and reliability of bone MRI has been shown to be equivalent to those of CT, the unfamiliar contrast and a lot of imaging noise are the limitations for the immediate application of this technique for clinical conditions⁶. Due to such limitations, there have been several studies to convert head and neck MRI into CT images using deep learning^7,7,8,9,11. Yet, the current literature suggests MRI transformation models based on MSCT images and there is no study based on CBCT as an imaging source.

In previous studies for the MRI to MSCT conversion, the U-Net was widely used as a baseline for synthesis networks; Wang et al.¹¹ and Bahrami et al.¹² showed the capability of U-Net on MSCT synthesis. Han et al.⁸ and Massa et al.⁹ used modified U-Net architecture inspired by the VGG network and Inception module, respectively, which have shown excellent feature extraction capabilities in computer vision tasks. Generative model-based methods were also utilized in previous studies¹³. Gholamiankhah et al.⁷ compared the quality of samples generated from ResNet and generative adversarial networks. Qi et al.¹⁰ utilized conditional GAN with multi-channel inputs from head and neck MRI. Regarding CBCT images, none attempted to use MRI as a source image, but a few studies used MSCT images as a source. Yuan et al.¹⁴ utilized MSCT images as a source to synthesize CBCT using U-Net to reduce the artifact distortion of CBCT. Conversely, Chen et al.¹⁵ utilized CBCT as an input of cycleGAN to generate MSCT images with enhanced scattering artifacts. Unlike existing studies, the synthesis of CBCT from MRI requires additional consideration of registration errors and different patterns of each modality. In this article, we proposed a novel method for CBCT synthesis based on previous studies.

Therefore, we hypothesized that a deep learning model would synthesize CBCT images accurately from MRI images, specifically adjusted to describe hard tissue, including tooth and alveolar and maxillofacial bones. We attempted to synthesize accurate CBCT images using various pre-processing MRI methods. In addition, it was assumed that the synthesized images could be utilized for 3D modeling, similar to the conventional CBCT images used in clinics.

Materials and methods

Ethics

This study was approved by the institutional review board (IRB No. 2-2021-0027) of Yonsei University Dental Hospital and was conducted and completed in accordance with the ethical regulations. Due to the retrospective nature of the study, the requirement for informed consent was waived and this was approved by Yonsei University Dental Hospital, IRB. All imaging data were anonymized before export.

Data collection

For this study, 21 patients who underwent both CBCT and MRI for temporomandibular joint (TMJ) disease in our institution were randomly selected and examined. The CBCT-MRI paired data set was randomly divided into training (n = 16) and test (n = 5) sets.

All CBCT images were obtained with an Alphard 3030 unit (Asahi Roentgen, Kyoto, Japan) using the following parameters: tube voltage, 90 kVp; tube current, 8 mA; exposure time, 17 s; field of view (FOV), 150 × 150 mm; and voxel size, 0.3 mm. There was no modification on the reconstruction filter from the projection into the axial image data and the default parameters provided by the manufacturer were used in this study.

MRI was performed using the 3.0 T scanner (Pioneer; GE Healthcare, Waukesha, WI, USA) with a 21-channel head coil. Isotropic three-dimensional zero echo time (ZTE) sequences were acquired with the following parameters: TE/TR, 0/785 ms; flip angle, 4°; receiver bandwidth, 31.25 kHz; number of excitations (NEX), 2; FOV, 180 × 180 mm (supra-orbital rim to upper neck region); acquisition matrix, 260 × 260; voxel size, 0.35 mm; slice thickness, 1.0 mm; and scan time, ~ 5 min.

Data preparation

Paired image data were registered due to differences in patient orientation during image acquisition. The entire registration process was conducted via ITK-snap (ver. 3.0, www.itksnap.org). The gross orientation of MRI (anterior–posterior position) was matched with CBCT orientation (superior-inferior position) manually. Then, based on the mutual information, geometrical rigid registration was conducted until the mutual information between the two images reached its maximum¹⁶.

Then, the MRI image was resliced into the same thickness, 300 $\upmu$m, as the CBCT image. Five hundred and twelve CBCT and MRI axial slides were prepared. For data augmentation, axial image data were reconstructed into 512 coronal and sagittal slides each, and all images were prepared in the BMP format. In total, 64,512 images (3,072 images per data pair) of CBCT and MRI data were prepared.

Deep learning network training

A modified U-Net structure was used for our synthesis model. U-Net is commonly applied to biomedical imaging tasks, as it shows relatively higher accuracy than existing networks with a small number of source images¹⁷. To enhance the result performance by extracting more hierarchical features than those of the original U-Net, we modified several parts of the network structure as illustrated in Fig. 1. First, the encoder structure was substituted with the Bottleneck blocks of ResNet-50¹⁸, and all 2-dimensional convolution layers were changed into 3-dimensional convolution layers. Second, the last skip connection of the U-Net was removed because the minute registration error between MRI and CBCT makes the morphology of synthesized prediction confusing, and the different patterns of the input MRI can affect the results. Lastly, to prevent the model capacity from exceeding our hardware memory size, the number of convolution kernels was changed, as described in Fig. 1a. The ablation studies for each proposed component were performed.

Sixteen sets of MRI-CBCT pairs were used for training the synthesis model, and five sets were evaluated as test sets. For pre-processing, we multiplied the MRI-CBCT pair by a circle binary mask with a radius of 256 pixels to remove the background noise (Fig. 1b). Then, the masked images were stacked in the vertical direction to reconstruct a 3-D image of size 512 × 512 × 512. Due to different field of view size, peripheral area loss occurs in specific images of MRI and CBCT sequences. The noisy sequences were excluded in the training step to ensure stable network training. We used only 21–490, 1–360, and 41–380 sequences for the x, y, and z axes of the entire image, respectively. To overcome the limitation of the hardware (memory size) and execute the data augmentation, we randomly extracted patches from the whole image. The experiments were conducted with two different sizes of patches; a large patch of size 128 × 128 × 16 and a small patch of size 64 × 64 × 16, as illustrated in Fig. 1c.

The network was trained by Adam optimizer with an initial learning rate of 2.5 × ${10}^{-4}$, that was exponentially decayed by 0.8 every 200 iterations, and the weight decay was 10^–5. The smooth L1 loss and the early stopping method were used with a stopping factor of 5. The mini-batch sizes were 32 and 8 for the small and large patches, respectively. The input patches were normalized to [− 1, 1].

In the inference phase, an input MRI image was partitioned into the patches using a sliding window method, with the step size being half the patch size (Fig. 1d). The trained synthesis model predicted the CBCT patches. Then, each patch was weighted by the Gaussian filter to generate a smooth cross-section of 3-D synthetic CBCT (syCBCT). Finally, the syCBCT was merged by overlaying weighted patches with the same stride of the sliding window.

Accuracy assessment and clinical validation

Three-dimensional model surface deviation

A three-dimensional maxillofacial model was generated in the STL format based on both original CBCT and syCBCT. The two models were superimposed for measurement using Geomagic Control X (3D Systems, Cary, NC, USA). Then, the overall surface deviation was acquired (Fig. 2a) for both large and small patches based on syCBCT. The surface deviation of syCBCT was also obtained for anatomical regions, maxilla, and mandible in the axial and anterior–posterior coronal planes (Fig. 2b). The reference planes were determined by following a previous study¹⁹. The axial plane was determined by the cement o-enamel junction of the upper and lower teeth. Anatomical landmarks, including mental foramen (anterior) and mandibular foramen (posterior), were used to determine the coronal plane. All measured deviation values were obtained in root mean square (RMS, mm).

Expert image quality evaluation

Two radiologists with more than 10 years of experience conducted a subjective evaluation using the modified version of the clinical image evaluation chart of CBCT provided by the Korean Academy of Oral and Maxillofacial Radiology (Table 1). The clinical image evaluation chart comprises 4 sections: artifact, noise, resolution, and overall image. In the artifact, noise, and resolution sections, the evaluator graded image series as poor, moderate, or good. For overall grade, the possible outcomes were: no diagnostic value, poor, moderate, or good.

Table 1 Clinical image evaluation chart of CBCT.

Full size table

Image quality evaluation metrics

For five sets of test data, the image quality of the syCBCT in axial series was compared to that of the original CBCT image using three indices, mean absolute error (MAE), peak signal-to-noise ratio (PSNR), and structural similarity indexing method (SSIM), that are frequently used to evaluate synthetic images²⁰. MAE suggests a correlation with the image noise level, PSNR is closely related to the clarity and resolution of the image, and SSIM is comprehensively correlated with the structural similarity of the synthetic image. The definition and ideal reference value¹⁸ of each index were as follows:

$$MAE = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left| {syCT_{i} - CT_{i} } \right|,\;{\text{reference}}\;{\text{value}} = 0\;{\text{HU}}$$

(1)

$$PSNR = 10 \times {\text{log}}\left( {\frac{{f_{max}^{2} }}{{rmse^{2} }}} \right),\;{\text{reference}}\;{\text{value}} > {25}\;{\text{dB}}$$

(2)

$$SSIM = \frac{{\left( {2\mu_{{\hat{y}_{i} }} \mu_{{y_{i,k} }} + c_{1} } \right)\left( {2\sigma_{{\hat{y}_{i} y_{i,k} }} + c_{2} } \right)}}{{\left( {\mu_{{\hat{y}_{i} }}^{2} + \mu_{{y_{i,k} }}^{2} + c_{1} } \right)\left( {\sigma_{{\hat{y}_{i} }}^{2} + \sigma_{{y_{i,k} }}^{2} + c_{2} } \right)}},\;{\text{reference}}\;{\text{value}} = {1}$$

(3)

All metrics were obtained according to the ability to present hard tissue, soft tissue, and air in syCBCT compared to the original CBCT²⁰.

Statistical analysis and comparisons

To measure the surface deviation of large and small patch-based 3-D models, RMS values were compared using the Mann–Whitney test. The deviation at the anatomical regions (maxilla, mandible, posterior, and anterior) was compared using the Kruskal–Wallis test and Dunn’s multiple comparison post-hoc test. The number of grades from the clinical CBCT image evaluation chart according to each criterion (artifact, noise, resolution, and overall) was also assessed for original CBCT and syCBCT images. Inter-observer agreement was obtained by interclass correlation coefficient (ICC). The image quality metrics, MAE, PSNR, and SSIM, were compared for hard and soft tissue as well as air in individual syCBCT using one-way ANOVA. Statistical analysis was conducted with GraphPad Prism version 9.4.1 (GraphPad Software, La Jolla, CA, USA, www.graphpad.com) and a confidential interval of 95%.

Results

The mean surface deviation was 2.95 $\pm \hspace{0.17em}$0.35 and 2.93 $\pm \hspace{0.17em}$0.39 mm for large and small patch-based syCBCT, respectively, and there was no statistical difference. Four small patch-based 3D models showed less surface deviation than large patch-based models, while one small patch-based 3D model (syCBCT2) showed more surface deviation than large patch-based models (Table 2, Fig. 3). In deviation measured at different anatomical regions, the anterior region showed larger deviation (large patch, 3.76 mm; small patch, 4.01 mm), and the maxilla showed smaller deviation (large patch, 3.09 mm; small patch, 2.81 mm) (Table 2). The mean surface deviation between the maxilla and anterior region in small patch-based models was significantly different.

Table 2 Mean value of surface deviation in overall three-dimensional models and the respective anatomical region.

Full size table

Expert image quality evaluation showed that syCBCT provided better performance in terms of artifact and noise criteria than the original CBCT. On the contrary, the original CBCT obtained a ‘good’ grade for the resolution criterion (Figs. 4, 5). All original CBCTs showed a ‘good’ grade for the overall image, while only one syCBCT based on small patch models showed a ‘good’ grade (Fig. 4d). The ICC between the evaluators was 0.85.

The proposed network introduced structural changes based on U-Net and applied a Gaussian filter at post-processing. The ablation studies for each proposed component were performed for the small patch, and the corresponding results are listed in Supplementary Table S1. Among the image quality metrics, MAE and SSIM showed significantly better performance in evaluating hard tissue structures (Table 3). However, PSNR showed the best performance in describing air. All three types of tissues showed significantly different level of image quality according to all indices. Additionally, all indices (except SSIM) showed better performance in small patch-based-syCBCT than in the large patch-based image for hard tissue.

Table 3 Image quality evaluation metrics based on hard tissue, soft tissue, and air evaluation of the image.

Full size table

Discussion

This study was the first approach to synthesizing dental CBCT images based on ZTEMRI images using deep learning. It is considered an important attempt at this point intime when the need for radiation-free and low-dose dental imaging is increasing. As a result of this study, syCBCT images comparable to CBCT images used at present were achieved. The image quality indices, MAE, PSNR, and SSIM, showed acceptable values in the current study compared to the previous medical image synthetic studies. It was significant that the syCBCT image was superior to the original CBCT image in terms of artifacts and noise, though the resolution was insufficient. In addition, 3D model manipulation, which was challenging based on MRI, showed feasibility through this study.

It was significant that the syCBCT showed improvement in the artifacts and noise of the image compared to the original CBCT. These unexpected results have not been reported in any previous studies on CT image synthesis based on MRI data, probably because all studies focused on multichannel CT rather than CBCT^8,9,11. Traditionally, compared to multichannel CT, CBCT is known to produce images with extensive noise and artifacts due to a low radiation dose and cone-shaped beam. Many researchers have tried to reduce scattering noise and artifacts in CBCT since its introduction in dentistry^21,22. Although due to a different phenomenon, MRI also produces highly noisy images with artifacts. Thus, we did not expect to obtain improved syCBCT from MRI in terms of noise and artifacts. This result would have significant potential for research on artifact and noise elimination in CBCT, which has been an unsolved problem up to the present time.

Clinical imaging evaluation depicted that the resolution of syCBCT was unsatisfactory in this study. The original CBCT showed a good to moderate grade of resolution, while syCBCT showed poor to moderate grade resolution. This was consistent with imaging quality metrics. The value of PSNR, which represents the clarity of the image, was less than that in previous similar studies²³. Among several suspected reasons, relatively low sharpness of the hard tissue structure in MRI could be considered primarily. Although the voxel size and slice thickness of the original MRI data was within the range of clinically used CBCT unit, the relatively low sharpness of the bone margin was considered to be an insurmountable problem of the imaging modality itself. This part needs to be supplemented with the development of additional advanced image post-processing techniques.

Meanwhile, the image noise and artifacts level showed enhanced quality, showing lower MAE values, compared to those in the previous studies²³. Also, the value of SSIM in our study, which indicates overall image quality, was comparable to that of the previous studies²³. Although the clarity of the syCBCT image was low in the current study, the overall image quality was comparable to that of previous studies due to reduced noise and artifacts.

The blurred margins and low sharpness of anatomic structures in synthetic CT images have been an issue in deep-learning-based CT image synthesis^7,14,24, and a similar tendency was shown in our study. Leynes et al.²⁴ mentioned that gross bone depiction in syCBCT was comparable to that in the original CT image, whereas it was difficult to depict finer bone structures. Han⁷ also reported that the error in syCBCT mainly occurred at the border of bone tissue. Yuan et al.¹⁴ studied the production of synthetic CT from fast-scan CBCT based on deep-learning models and stated that small fine details were not preserved in synthetic CT images. The overall resolution of synthetic CT was poorer than that of the original CBCT image. To overcome such a problem, Chen et al.¹⁵ pre-processed multichannel CT using the up-sampling method. Through this pre-processing, multichannel CT images were turned into images with higher resolution. Accordingly, the synthetic image output was expected to show improved sharpness and clarity. It is mentioned that, despite their efforts, deformation still tends to appear in the output image¹⁵.

In the training step, we adopted two different-sized patches as input. In the case of small patches, we expected more precise results with less distortion than those in large patches, enabling us to concentrate more on the delicate morphology of the small region. As a result, improved performances were obtained in the image quality metrics, surface deviation, and expert image quality evaluation. However, the statistical differences were not significant. Thus, advanced research about image pre-processing that enhances the sharpness of input images is needed. In addition, we suggest that excluding patches that contain registration errors due to postural differences in the training step will help to improve the quality of syCBCT. Further, one of the issues with comparing surface deviation in the 3D maxillofacial model, was that the model file contains errors due to the conversion of the file type from the original image format. Therefore, the few millimeters deviations should be considered as due to comparing the relative error according to the input data types and different facial regions, and so it is difficult to view as an absolute error.

Chen et al.¹⁵ mentioned misregistration of the image sets as a possible reason for the synthetic image deformation. The current study included the registration between MRI and CBCT. In particular, the MRI images used in this study could not be completely registered with CBCT images owing to differences in the patient posture during both imaging procedures. Additionally, the MRI used in this study was for TMJ evaluation, and the image signal of the lower submental area, which was relatively far below the TMJ, was not satisfactorily sensitive for accurate model training. Hence, a prospective study design should be established to develop deep-learning models that can synthesize more accurate CBCT images.

Here, a modified U-Net structure with a backbone of ResNet was used. Gholamiankhah et al. and Bahrami et al. compared GAN, eCNN, U-Net, and V-net with ResNet and concluded that ResNet showed the best performance in CT synthesis from MRI^7,12. We also adapted the ResNet, to take advantage of the feature extraction capability, and removed the last skip connection in U-Net to reduce the disturbance of inevitable registration errors in our dataset. We confirmed that each component of the proposed method improved the quality of syCBCT by conducting the ablation studies (see Supplementary Table S1). Although, all indices did not show best performance, the SSIM, which is known as close to the human visual perception, showed highest values in the proposed model of the current study. MAE and PSNR of hard tissue was degraded quality in the proposed model compared to the previous studies^8,17, however, the difference was minute that cannot be detected by naked eye of human (Supplementary Figure S1).

Additionally, comparing our proposed method with the existing methods of U-Net¹⁷ and Han et al.⁸, the proposed method generally showed superior performance in image quality indices (see Supplementary Table S2). Han et al.⁸ avoided using 3-dimensional convolution filters by warring about of GPU memory limit. We changed the number of kernels in each convolution layer to handle this issue. This modification increased the efficiency of the network capacity by reducing the number of model parameters from 31 million (U-Net) to 10 million (ours).

Previous studies utilized the adversarial learning strategy^7,25, which trains a synthesis model with a discriminator that tries to distinguish target images as either real or synthesized. However, adversarial learning is known to be challenging to optimize due to “mode collapse,” in which a synthesis model keeps generating identical samples²⁶. To prevent mode collapse, we used smooth L1 loss instead of adversarial loss. The smooth L1 loss computes pixel-wise differences between original and synthesized images and is relatively robust to recognize outliers rather than mean squared error loss. In our experiment, the artifact of CBCT is considered the outlier, which shows a larger value than other areas. Therefore, it was thought that the utilization of the smooth L1 loss results in reducing syCBCT artifacts would be effective in this study.

There are several limitations to this study. First, although the sample size used in this study was comparable to that of the previous studies, the more enhanced performance of the model can be achieved with more samples due to the nature of deep learning research. Further research with additional MRI and CBCT data sets would help to increase the accuracy of the synthetic image. Additionally, as mentioned above, due to the difference in the patient's position in MRI and CBCT, perfect registration could not be achieved, leading to errors in CBCT image output. In this study, the registration process was conducted using commercial software, while a more sophisticated approach to the registration procedure is required. Lastly, obtaining MRI source data with high image quality, especially in the mandible area, would show a more improved result than that of the current study. Thus, a solid prospective study design would be required to develop more advanced CBCT synthetic models.

Conclusion

This study provided the first approach to CBCT synthesis from ZTE MRI, a non-ionizing radiation imaging. Compared to the conventional CBCT image, the generated CBCT image showed a clinically applicable level in dentistry with improved image quality in terms of noise and artifact. The study results would be expected to provide a basis for non-ionizing radiation imaging with improved quality for replacing CBCT for patients planning to undergo both MRI and CBCT simultaneously.

Data availability

The data generated and analyzed during the current study are not publicly available due to privacy laws and policies in Korea, but are available from the corresponding author on reasonable request.

References

Tayman, M.A.. et al. Effect of different voxel sizes on the accuracy of CBCT measurements of trabecular bone microstructure: A comparative micro-CT study. Imaging Sci Dent. 52, 171–179 (2022).
Google Scholar
Nardi, C. et al. Head and neck effective dose and quantitative assessment of image quality: A study to compare cone beam CT and multislice spiral CT. Dentomaxillofac. Radiol. 46, 20170030 (2017).
Article PubMed PubMed Central Google Scholar
Pauwels, R. et al. Comparison of spatial and contrast resolution for cone-beam computed tomography scanners. Oral Surg. Oral Med. Oral Pathol. Oral. Radiol. 114, 127–135 (2012).
Article PubMed Google Scholar
Hilgenfeld, T. et al. Use of dental MRI for radiation-free guided dental implant planning: A prospective, in vivo study of accuracy and reliability. Eur. Radiol. 30, 6392–6401 (2020).
Article PubMed PubMed Central Google Scholar
Hilgenfeld, T. et al. High-resolution single tooth MRI with an inductively coupled intraoral coil-can MRI compete with CBCT. Invest. Radiol. 57, 720–727 (2022).
Article CAS PubMed Google Scholar
Lee, C. et al. CT-like MRI using the zero-TE technique for osseous changes of the TMJ. Dentomaxillofac. Radiol. 49, 20190272 (2020).
Article PubMed PubMed Central Google Scholar
Gholamiankhah, F., Mostafapour, S., & Arabi, H. Deep learning-based synthetic CT generation from MR images: Comparison of generative adversarial and residual neural networks. arXiv. https://doi.org/10.48550/arXiv.2103.01609 (2021).
Han, X. MR-based synthetic CT generation using a deep convolutional neural network method. Med. Phys. 44, 1408–1419 (2017).
Article CAS PubMed Google Scholar
Massa, H. A., Johnson, J. M. & McMillan, A. B. Comparison of deep learning synthesis of synthetic CTs using clinical MRI inputs. Phys. Med. Biol. 65, 23NT03 (2020).
Article CAS PubMed PubMed Central Google Scholar
Qi, M. et al. Multi-sequence MR image-based synthetic CT generation using a generative adversarial network for head and neck MRI-only radiotherapy. Med. Phys. 47, 1880–1894 (2020).
Article PubMed Google Scholar
Wang, Y., Liu, C., Zhang, X. & Deng, W. Synthetic CT generation based on T2 weighted MRI of nasopharyngeal carcinoma (NPC) using a deep convolutional neural network (DCNN). Front. Oncol. 9, 1333 (2019).
Article PubMed PubMed Central Google Scholar
Bahrami, A., Karimian, A. & Arabi, H. Comparison of different deep learning architectures for synthetic CT generation from MR images. Phys. Med. 90, 99–107 (2021).
Article PubMed Google Scholar
Lee, C., et al. Synthesis of T2-weighted images from proton density images using a generative adversarial network in a temporomandibular joint magnetic resonance imaging protocol. Imaging Sci Dent. 52, 393–398 (2022).
Article PubMed PubMed Central Google Scholar
Yuan, N. et al. Convolutional neural network enhancement of fast-scan low-dose cone-beam CT images for head and neck radiotherapy. Phys. Med. Biol. 65, 035003 (2020).
Article PubMed PubMed Central Google Scholar
Chen, L. et al. Synthetic CT generation from CBCT images via unsupervised deep learning. Phys. Med. Biol. 66, 115019 (2021).
Article Google Scholar
Chen, H.-M. Mutual information: A similarity measure for intensity based image registration. In Advanced Image Processing Techniques for Remotely Sensed Hyperspectral Data (eds Varshney, P. K. & Arora, M. K.) 89–108 (Springer, 2004).
Chapter Google Scholar
Ronneberger, O., Fischer, P., & Brox T. U-net: Convolutional networks for biomedical image segmentation in Proceedings of the 18th International Conference on Medical Image Computing and Computer-Assisted Intervention (eds Navab, N., Hornegger, J., Wells, W. M., & Frangi, A. F.) Munich, Germany, (Springer, 2015).
He, K., Zhang, X., & Ren, S. Deep residual learning for image recognition in Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 770–778 (Las Vegas, 2016). https://doi.org/10.1109/CVPR.2016.90.
Lee, C. et al. Accuracy of digital model generated from CT data with metal artifact reduction algorithm. Sci. Rep. 11, 10332 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Mason, A. et al. Comparison of objective image quality metrics to expert radiologists’ scoring of diagnostic quality of MR Images. IEEE Trans. Med. Imaging 39, 1064–1072 (2020).
Article PubMed Google Scholar
Jin, J. Y. et al. Combining scatter reduction and correction to improve image quality in cone-beam computed tomography (CBCT). Med. Phys. 37, 5634–5644 (2010).
Article PubMed Google Scholar
Kim, Y. H. et al. Quantitative analysis of metal artifact reduction using the auto-edge counting method in cone-beam computed tomography. Sci. Rep. 10, 8872 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Boulanger, M. et al. Deep learning methods to generate synthetic CT from MRI in radiotherapy: A literature review. Phys. Med. 89, 265–281 (2021).
Article CAS PubMed Google Scholar
Leynes, A. P. et al. Zero-echo-time and dixon deep pseudo-CT (ZEDD CT): Direct generation of pseudo-CT images for pelvic PET/MRI attenuation correction using deep convolutional neural networks with multiparametric MRI. J. Nucl. Med. 59, 852–858 (2018).
Article PubMed PubMed Central Google Scholar
Park, Y. S. et al. Deep learning-based prediction of the 3D postorthodontic facial changes. J. Dent. Res. 101, 1372–1379 (2022).
Article CAS PubMed Google Scholar
Srivastava, A., Valkov, L., Russell, C., Gutmann, M. U., & Sutton, C. VEEGAN: Reducing mode collapse in GANs using implicit variational learning. arXiv. https://doi.org/10.48550/arXiv.1705.07761 (2017).

Download references

Acknowledgements

This study has been conducted with the support of the Korea Institute of Industrial Technology (JA230007). This study was supported by a faculty research grant of Yonsei University College of Medicine for (6-2021-0036). No conflicts of interest are declared.

Funding

Chena Lee was supported by the Yonsei University College of Dentistry (6-2021-0036).

Author information

These authors contributed equally: Hyeyeon Choi and Jong Pil Yun.

Authors and Affiliations

Department of Electrical Engineering, Pohang University of Science and Technology, 77 Cheongam-ro Nam-gu, Pohang, 37673, Republic of Korea
Hyeyeon Choi & Sang Woo Kim
Daegyeong Division, Korea Institute of Industrial Technology, Daegu, Republic of Korea
Jong Pil Yun
Department of Oral and Maxillofacial Radiology, Yonsei University College of Dentistry, 50-1 Yonsei-ro Seodaemun-gu, Seoul, 03722, Republic of Korea
Ari Lee, Sang-Sun Han & Chena Lee
Institute for Innovative in Digital Healthcare, Yonsei University, Seoul, Republic of Korea
Chena Lee

Authors

Hyeyeon Choi
View author publications
You can also search for this author in PubMed Google Scholar
Jong Pil Yun
View author publications
You can also search for this author in PubMed Google Scholar
Ari Lee
View author publications
You can also search for this author in PubMed Google Scholar
Sang-Sun Han
View author publications
You can also search for this author in PubMed Google Scholar
Sang Woo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Chena Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.C., C.L., and A.L. proposed the ideas; A.L. and C.L. collected data; H.C., J.P., A.L. and C.L. analyzed and interpreted data; H.C., J.P., A.L., S.H., S.K. and C.L. critically reviewed the contents; and H.C., J.P., S.W., and C.L. drafted the article; H.C., J.P., A.L., S.H., S.W., and C.L. critically revised the article.

Corresponding authors

Correspondence to Sang Woo Kim or Chena Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Choi, H., Yun, J.P., Lee, A. et al. Deep learning synthesis of cone-beam computed tomography from zero echo time magnetic resonance imaging. Sci Rep 13, 6031 (2023). https://doi.org/10.1038/s41598-023-33288-8

Download citation

Received: 30 December 2022
Accepted: 11 April 2023
Published: 13 April 2023
DOI: https://doi.org/10.1038/s41598-023-33288-8

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Physics-informed Deep Learning for Dual-Energy Computed Tomography Image Processing

Cycloidal CT with CNN-based sinogram completion and in-scan generation of training data

Competitive performance of a modularized deep neural network compared to commercial algorithms for low-dose CT image reconstruction

Introduction

Materials and methods

Ethics

Data collection

Data preparation

Deep learning network training

Accuracy assessment and clinical validation

Three-dimensional model surface deviation

Expert image quality evaluation

Image quality evaluation metrics

Statistical analysis and comparisons

Results

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links