Coronary artery segmentation under class imbalance using a U-Net based architecture on computed tomography angiography images

Pan, Li-Syuan; Li, Chia-Wei; Su, Shun-Feng; Tay, Shee-Yen; Tran, Quoc-Viet; Chan, Wing P.

doi:10.1038/s41598-021-93889-z

Download PDF

Article
Open access
Published: 14 July 2021

Coronary artery segmentation under class imbalance using a U-Net based architecture on computed tomography angiography images

Li-Syuan Pan¹,
Chia-Wei Li²,
Shun-Feng Su¹,
Shee-Yen Tay^2,3,
Quoc-Viet Tran¹ &
…
Wing P. Chan^2,3,4

Scientific Reports volume 11, Article number: 14493 (2021) Cite this article

5333 Accesses
22 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Coronary artery disease is caused primarily by vessel narrowing. Extraction of the coronary artery area from images is the preferred procedure for diagnosing coronary diseases. In this study, a U-Net-based network architecture, 3D Dense-U-Net, was adopted to perform fully automatic segmentation of the coronary artery. The network was applied to 474 coronary computed tomography (CT) angiography scans performed at Wanfang Hospital, Taiwan. Of these, 10% were used for testing. The CT scans were divided into patches of 16 original high-resolution slices. The slices were overlapped between patches to take advantage of surrounding imaging information. However, an imbalance between the foreground and background presents a challenge in smaller-object segmentation such as with coronary arteries. The network was optimized and achieved a promising result when the focal loss concept was adopted. To evaluate the accuracy of the automatic segmentation approach, the dice similarity coefficient (DSC) was calculated, and an existing clinical tool was used. The subjective ratings of three experienced radiologists were used to compare the two ratings. The results show that the proposed approach can achieve a DSC of 0.9691, which is significantly higher than other studies using a deep learning approach. In the main trunk, the results of automatic segmentation agree with those of the clinical tool; they were significantly better in some small branches. In our study, automatic segmentation tool shows high-performance detection in coronary lumen vessels, thereby providing potential power in assisting clinical diagnosis.

Deep learning segmentation of major vessels in X-ray coronary angiography

Article Open access 15 November 2019

Su Yang, Jihoon Kweon, … Seung-Jung Park

Optimizing ensemble U-Net architectures for robust coronary vessel segmentation in angiographic images

Article Open access 19 March 2024

Shih-Sheng Chang, Ching-Ting Lin, … Yang C. Fann

AngioNet: a convolutional neural network for vessel segmentation in X-ray angiography

Article Open access 10 September 2021

Kritika Iyer, Cyrus P. Najarian, … C. Alberto Figueroa

Introduction

Coronary CT angiography (CCTA) provides detailed imaging and can deliver precise analyses and prognostic information for diagnoses (Leipsic et al.¹). Extracting coronary arteries from CCTA scans is a critical step for analyzing those images. A typical first step is extracting the coronary artery centerline allowing reconstruction of the coronary artery area based on its position and the predicted radius. Several automatic approaches have been proposed for extracting the coronary artery centerline^2,3,4,5. In contrast to centerline extraction, segmentation of the coronary arteries improves visualization and quantification of the vessels. Manual segmentation of CCTA scans by radiologists is time-consuming because only 2D slices of the CT images are available after image acquisition. Extracting the coronary arteries from 2D images relies on expert knowledge, but even the experts make errors because of the tedious nature of the work. Diagnosing coronary artery diseases depends primarily on computed tomography (CT) images, and therefore a high-quality fully automatic segmentation approach for defining the coronary arteries is essential.

Deep learning, especially the convolutional neural network (CNN), has recently shown capabilities in an extensive range of medical image analyses (Litjens et al.⁶, 2014, 2017). A deep CNN network architecture, U-Net⁷, has shown promising results in automatic segmentation for a variety of medical applications^{8,9,10,11,12,13,14}. In recent years, researchers have studied coronary artery reconstruction based on deep learning approaches. For example, Wolterink et al.⁵ proposed using a 3D CNN classifier to predict the direction and radius of an artery at any given point in a CCTA image. Kjerland et al.¹⁵ proposed generating volume data as a training set from centerline data and considered a two-stream CNN architecture that could use different scales of input to perform a fully automatic segmentation task. Huang et al.¹⁶ employed a basic 3D U-Net convolutional network for coronary artery segmentation using a small amount of real-world data. Chen et al.¹⁷ incorporated a vessel map into CT angiography images using the proposed 3D multi-channel U-Net. However, these studies have relatively limited performance and are evaluated using a small dataset for testing [e.g., the dice similarity coefficient (DSC) has been shown to be 0.5975¹⁵ in 1 case, 0.7146¹⁶ in 4 cases, and 0.8060¹⁷ in 4 cases (2 for validation, 2 for testing)]. The differences could be due to class imbalance, which was not considered.

The focus of this study was to perform high-quality fully automatic artery segmentation in a deep learning approach that can be used clinically. Here, a U-Net-based architecture, 3D Dense-U-Net¹⁸, was adopted with the focal loss¹⁹ function instead of the typical DSC loss to address class imbalance. Furthermore, rather than cropping images to accommodate hardware limitations faced in other deep learning approaches, all data were processed to maintain the original high resolution¹⁸ thus ensuring that the global information was sufficient. To validate the performance of this approach, the test data were evaluated not only using intersection over union (IoU) and DSC, but the results were compared with the clinical tool; subjective ratings of their performances were provided by three experienced radiologists.

Materials and methods

Data description

This study was performed in accordance with relevant guidelines and was conducted after approval by the Taipei Medical University-Joint Institutional Review Board (N201710005). Informed consent was waived because of its retrospective nature.

CCTA scans were acquired using a dual-source CT scanner at the Department of Radiology, Wanfang Hospital, Taipei Medical University, Taiwan (Somatom Definition; Siemens Medical Solutions, Forchheim, Germany). Those obtained from 2013 to 2019 were retrospectively compared against our inclusion criteria: (1) full information about the entire coronary tree was collected and (2) the patient was aged more than 20 years but less than 85 years yielding 474 qualified scans out of 480 scans (six unqualified scans were excluded). All scans were collected using a tube voltage of 120 kVp and a maximum tube current of 900 mA. Images were reconstructed to a mean voxel size of 0.32 × 0.32 × 0.7 mm³, and the margins of two major coronary arteries [the left coronary artery (LCA) and the right coronary artery (RCA)] and all their branches were manually annotated by consensus between three radiology technologists and approved by another radiologist experienced in cardiovascular imaging. These approved annotated images were used as the ground truth. In this study, only non-stenotic CCTA dataset were included for developing segmentation algorithm. The scans were divided into a training set (432 scans) and a test set (42 scans). From the latter, 42 were used to compare against the existing semi-automatic clinical method (Syngo.via, Siemens).

Preprocessing

Each scan was a 3D medical image of the coronary arteries at a resolution of 512 × 512 spanning multiple slices (usually between 140 and 220 slices). The memory in the graphics processing unit (GPU) could not accommodate all the slices at this resolution; therefore, each scan was divided into patches of 16 slices each maintaining the original resolution. This preprocessing approach assured that sufficient global information was retained in each slice and is a clear advantage versus transforming the data to a lower resolution. Slices between patches were overlapped, and correlated information between them was utilized so that every voxel of every image could be analyzed and compared with surrounding slices during training. Figure 1 shows four overlapped slices in the training data, but there was no need in test data.

Network architecture

U-Net is widely used for medical image segmentation by employing its convolutional encoder-decoder architecture. We used the improved version (3D Dense-U-Net¹⁸) to perform the residual²⁰ and dense²¹ interconnections. This provides better results. A detailed network architecture is shown in Fig. 2. We believe that the Dense-U-Net architecture can provide high-quality results in coronary artery segmentation tasks.

Weighted focal loss

Dense-U-Net was originally designed for segmentation of brain and spine data. The objects in the coronary artery area are much smaller and require a greater focus on the learning process. Focal loss concept¹⁹ is proposed for this kind of sparse data. Our research shows that this indeed achieves better performance.

In CCTA images, a huge imbalance between the coronary artery and the background area is seen, and most labels contain useless information. In most cases, the ratio between the area of the coronary artery and the area of the background is up to 1:400. This can confound the results when 3D Dense-U-Net is employed directly. The focal loss concept was first designed for object detection tasks where an extreme imbalance between the foreground and background classes is present. The class imbalance can cause inefficient network training because most locations are negative samples (background) and usually cannot contribute useful learning information. Additionally, negative samples can overwhelm the positive ones (arteries) during training because most loss values originate from negatives. This can be improved by employing the focal loss concept:

$$ FL(p,\;~y) = ~ - \alpha (1~ - ~p)^{\gamma} ~\;ylog(p) - ~(1~ - ~\alpha )p^{\gamma} ~(1~ - ~y)log(1~ - ~p) $$

(1)

where y is the ground truth (1 for arteries and 0 for background), p ∈ [0, 1] is the predicted probability that the network belongs to the positive class, α is a class weight for the positive class, and γ is a focusing parameter for adjusting the weight of well-classified samples. In this study, the class imbalance is resolved when α = 0.6 and γ = 2; misclassified samples are nearly eliminated.

Results

In this study of 474 coronary CT images collected at our institution, the test set of 42 scans was used to evaluate our results employing IoU and DSC; both are widely used in qualifying image detection and medical volume segmentation tasks. These metrics are defined as:

$$ IoU({\text{Y}},\;\hat{{\text{Y}}}) = \frac{{\left| {Y \cap \hat{Y}} \right|}}{{\left| {Y \cup \hat{Y}} \right|}} $$

(2)

$$ DSC({\text{Y}},~\;\hat{{\text{Y}}}) = \frac{{2\left| {Y \cap \hat{Y}} \right|}}{{\left| {Y\left| + \right|\hat{Y}} \right|}} $$

(3)

where Y denotes the expert’s annotation, and $\hat{{\text{Y}}}$ denotes the model’s prediction. In this study, Y and $\hat{{\text{Y}}}$ ∈ {0, 1}. The predicted results of the model were evaluated using 3D IoU and 3D DSC on our test set. The results using basic 3D U-Net were compared to those using 3D Dense-U-Net. Figure 3 shows the accuracy based on the test set demonstrating that IoU exceeds 94% and DSC is around 97% in more than half of the scans when 3D Dense-U-Net is employed; this performance is significantly better than that of the basic version (by paired t-test; p = 4.11 × 10^–13). The averages for IoU and DSC using 3D Dense-U-Net were 94.03% and 96.91% compared to 92.40% and 96.03% when using basic U-Net (significance; p = 2.62 × 10^–12).

The trained network was tested using the test set, and the experimental results were visualized in both 2D and 3D (Figs. 4, 5). Figure 4 shows part of the slices from the original CCTA images—both the ground truth and the network-predicted results. The images were sampled every 20 slices. Most of the predicted results were quite similar to the ground truth even when the regions of the coronary artery were small. Figure 5 shows the 3D visualizations. To clearly compare the results, the ground truth and the network prediction, each in its own color, were overlapped into a single image showing overlapping areas in a different color and areas not in agreement in their original colors. Most areas of the arteries are the same between the two. This shows that using the 3D Dense-U-Net architecture in combination with the focal loss concept can achieve a promising result when automatically segmenting CCTA images.

These results were compared to those from the existing semi-automatic clinical tool (Syngo.via, Siemens) based on coronary arterial centerline detection and which was applied to 42 clinical scans from the test set. The time taken for segmentation by our method (about 10–15 s) is almost equal to the time taken for segmentation by Siemens Syngo.via (about 12 s). The accuracy of arterial segmentation using the two approaches were subjectively scored by three experienced radiologists. They were asked to rate the outcomes of the segmentation images produced by our approach and via the use of the coronary arterial centerlines by a clinical tool on an 8-point scale (7, entirely correct; 6, 1 error; 5, 2 errors; 4, 3 errors; 3, 4 errors; 2, 5 errors; 1, 6 errors; and 0, more than 7 errors). Statistical analyses were performed using the R software program with paired t-test functionality version 3.5.1. A difference was considered statistically significant when the Bonferroni-corrected p-value was less than 0.05. Figure 6 shows the subjective ratings of the segmentation results by our approach and by the clinical tool. The results show that our approach is correct by the clinical tool in the main trunks and the main branches of the coronary arteries. Furthermore, the detection accuracy of our approach is significantly better for some small branches of the coronary arteries (Bonferroni-corrected p = 0.05).

Discussion

We demonstrated that combining the U-Net based network with the focal loss concept indeed can achieve an excellent result when applied to scans from our CCTA database. Furthermore, the network was validated on a relatively larger test set to ensure that it can be generalized to unseen real-world data. To confirm that our research can be used clinically, the network prediction was compared with an existing clinical tool (Syngo.via, Siemens). The segmentation scores show that this new approach is significantly better than using the existing clinical tool in the automatic separation of the coronary lumen arteries.

To date, coronary artery segmentation studies using deep learning approaches have given relatively limited performances^15,16,17. One reason could be that the class imbalance was not considered in most of these studies. Chen et al.¹⁷ attempted to balance the number of samples between the vessel areas and the background regions by data argumentation. The result was relatively better than those of the two approaches not considering the imbalance^15,16. It is clear that the network cannot provide good performance without considering the class imbalance. Buda et al.²² demonstrated that the impact of class imbalance on classification tasks of CNNs is detrimental. In practice, the small volumes of the arteries are the primary challenge because the network cannot learn when an abundance of useless information is in the learning dataset. Using the loss function concept is more critical for improving segmentation of the coronary artery region compared to other methods. It has been shown that oversampling from minor classes of data can lead to overfitting^23,24. Another reason could be that input images have been reduced to a low resolution (i.e., 32 × 32 × 32) to accommodate the limitations of the GPU memory possibly leading to a low receptive field from the prediction and robbing the network of sufficient global information.

Semi-automatic tools have been widely used to analyze CCTA images in diagnosing coronary diseases, but several minutes, sometimes more, are necessary for detecting vascular structures because clinical radiologists/technologists must adjust the extracted structure manually. After the manual modification of coronary structures by the semi-automatic tool, clinical radiologists could make a diagnosis from the findings of arterial plaques. Here, an efficient automatic approach was introduced to track individual coronary artery trees. The resulting subjective rating shows that its accuracy coincides with the clinical tool in the main trunk of coronary arteries; it is significantly better in some small branches. The automatic segmentation results based on our approach do not miss small branches such as the conus artery, marginal artery, obtuse marginal two artery, and septal artery while the clinical tool always detects those small branches in the wrong way. Moreover, the automatic segmentation approach proposed here allows rapid extraction of a coronary artery (within 20 s using a 2.7 GHz 4-core CPU with 2 × 8-GB 2133 MHz RAM and one 500-GB SSD). In contrast to the clinical tool, this fully automatic approach not only gives high performance but also great speed in detecting coronary lumen arteries making it a convenience in the clinical diagnosis of coronary diseases.

Though these results are promising, the required computing power and training time are great given that a 3D network is employed. Training requires almost two hours per epoch converging after 40 epochs using a NVIDIA RTX 2080 Ti GPU. Therefore, the tenfold cross validation method was not implemented in this research though it can achieve a slightly better performance. Future research should be focused on reducing computational expense without significantly decreasing performance. Because we focused primarily on addressing the foreground/background imbalance, it would be interesting to apply long short-term memory with a CNN architecture to better understanding the sequential information in CCTA data. Since our method development for automatic coronary artery segmentation is the first step for coronary artery diagnosis, future work should include some processes of centerline detection. MPR will be combined with automatic segmentation for the whole picture of automatic segmentation of calcification and stenosis for clinical diagnosis.

In summary, we developed automatic segmentation of coronary lumen arteries using 3D Dense-U-Net. This is the first step to use AI in diagnosing coronary artery disease. Future work will combine centerline detection and MPR with automatic segmentation of coronary arteries to create completely automatic segmentation of arterial calcifications and stenoses to assist effective clinical diagnoses.

References

Leipsic, J. et al. SCCT guidelines for the interpretation and reporting of coronary CT angiography: A report of the society of cardiovascular computed tomography guidelines committee. J. Cardiovasc. Comput. Tomogr. 8, 342–358 (2014).
Article Google Scholar
Schneider, M., Hirsch, S., Weber, B., Székely, G. & Menze, B. H. Joint 3-D vessel segmentation and centerline extraction using oblique Hough forests with steerable filters. Med. Image Anal. 19, 220–249 (2015).
Article Google Scholar
Gülsün, M. A., Funka-Lea, G., Sharma, P., Rapaka, S. & Zheng, Y. Coronary centerline extraction via optimal flow paths and cnn path pruning. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 317–325 (Springer, 2016).
Lesage, D., Angelini, E. D., Funka-Lea, G. & Bloch, I. Adaptive particle filtering for coronary artery segmentation from 3D CT angiograms. Comput. Vis. Image Underst. 151, 29–46 (2016).
Article Google Scholar
Wolterink, J. M., van Hamersvelt, R. W., Viergever, M. A., Leiner, T. & Išgum, I. Coronary artery centerline extraction in cardiac CT angiography using a cnn-based orientation classifier. Med. Image Anal. 51, 46–60 (2019).
Article Google Scholar
Litjens, G. et al. A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88 (2017).
Article Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 234–241 (Springer, 2015).
Milletari, F., Navab, N. & Ahmadi, S.-A. V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 2016 Fourth International Conference on 3D Vision (3DV), 565–571 (IEEE, 2016).
Men, K., Dai, J. & Li, Y. Automatic segmentation of the clinical target volume and organs at risk in the planning CT for rectal cancer using deep dilated convolutional neural networks. Med. Phys. 44, 6377–6389 (2017).
Article CAS Google Scholar
Li, X. et al. H-DenseUNet: Hybrid densely connected UNet for liver and tumor segmentation from CT volumes. IEEE Trans. Med. Imaging 37, 2663–2674 (2018).
Article Google Scholar
Kazemifar, S. et al. Segmentation of the prostate and organs at risk in male pelvic CT images using deep learning. Biomed. Phys. Eng. Express 4, 055003 (2018).
Article Google Scholar
Zhu, W. et al. AnatomyNet: Deep learning for fast and fully automated whole-volume segmentation of head and neck anatomy. Med. Phys. 46, 576–589 (2019).
Article Google Scholar
Daniel, M. C. et al. Automated segmentation of the corneal endothelium in a large set of ‘real-world’ specular microscopy images using the U-Net architecture. Sci. Rep. 9, 1–7 (2019).
Article ADS Google Scholar
Kwak, G. H. et al. Automatic mandibular canal detection using a deep convolutional neural network. Sci. Rep. 10, 1–8 (2020).
Article Google Scholar
Kjerland, Ø. Segmentation of Coronary Arteries from CT-scans of the Heart Using Deep Learning. Master’s thesis, NTNU (2017).
Huang, W. et al. Coronary artery segmentation by deep learning neural networks on computed tomographic coronary angiographic images. In Conf. Proc. IEEE Eng. Med. Biol. Soc., 608–611 (IEEE, 2018).
Chen, Y.-C. et al. Coronary artery segmentation in cardiac CT angiography using 3D multi-channel u-net. arXiv preprint arXiv:1907.12246 (2019).
Kolarík, M., Burget, R., Uher, V., Ríha, K. & Dutta, M. K. Optimized high resolution 3D Dense-U-Net network for brain and spine segmentation. Appl. Sci. 9, 404 (2019).
Article Google Scholar
Lin, T.-Y., Goyal, P., Girshick, R., He, K. & Dollár, P. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision, 2980–2988 (2017).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778 (2016).
Huang, G., Liu, Z., Van Der Maaten, L. & Weinberger, K. Q. Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4700–4708 (2017).
Buda, M., Maki, A. & Mazurowski, M. A. A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw. 106, 249–259 (2018).
Article Google Scholar
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. Smote: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).
Article Google Scholar
Wang, K.-J., Makond, B., Chen, K.-H. & Wang, K.-M. A hybrid classifier combining smote with PSO to estimate 5-year survivability of breast cancer patients. Appl. Soft Comput. 20, 15–24 (2014).
Article CAS Google Scholar

Download references

Acknowledgements

This research was funded by a grant (MOST 107-2634-F-038-001) from the Ministry of Science and Technology, Taiwan. The authors are grateful to Dr. Yi-Chien Hsieh, Dr. Chin-Wei Chien, and Dr. Wilson T. Lao for their assistance in evaluating our approach and clinical tools.

Author information

Authors and Affiliations

Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei, 10607, Taiwan
Li-Syuan Pan, Shun-Feng Su & Quoc-Viet Tran
Department of Radiology, Wan Fang Hospital, Taipei Medical University, Taipei, 10607, Taiwan
Chia-Wei Li, Shee-Yen Tay & Wing P. Chan
Department of RadiologySchool of MedicineCollege of Medicine, Taipei Medical University, Taipei, 10607, Taiwan
Shee-Yen Tay & Wing P. Chan
Medical Innovation Development Center, Wan Fang Hospital, Taipei Medical University, Taipei, 10607, Taiwan
Wing P. Chan

Authors

Li-Syuan Pan
View author publications
You can also search for this author in PubMed Google Scholar
Chia-Wei Li
View author publications
You can also search for this author in PubMed Google Scholar
Shun-Feng Su
View author publications
You can also search for this author in PubMed Google Scholar
Shee-Yen Tay
View author publications
You can also search for this author in PubMed Google Scholar
Quoc-Viet Tran
View author publications
You can also search for this author in PubMed Google Scholar
Wing P. Chan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.-S.P., C.-W.L., Q.-V.T., W.-P.C., and S.-F.S. contributed to the concept. L.-S.P. analyzed the data, performed the deep learning approaches, and evaluated the results. L.-S.P. and C.-W.L. authored the manuscript. C.-W.L., S.-Y.T., and W.-P.C. prepared and interpreted the clinical data. W.-P.C. and S.-F.S. contributed to preparing and revising the manuscript. All authors reviewed and approved the final version of the manuscript.

Corresponding author

Correspondence to Wing P. Chan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pan, LS., Li, CW., Su, SF. et al. Coronary artery segmentation under class imbalance using a U-Net based architecture on computed tomography angiography images. Sci Rep 11, 14493 (2021). https://doi.org/10.1038/s41598-021-93889-z

Download citation

Received: 07 June 2020
Accepted: 27 May 2021
Published: 14 July 2021
DOI: https://doi.org/10.1038/s41598-021-93889-z

This article is cited by

Deep learning–based atherosclerotic coronary plaque segmentation on coronary CT angiography
- Natasa Jávorszky
- Bálint Homonnay
- Márton Kolossváry
European Radiology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.