Machine learning for endoleak detection after endovascular aortic repair

Talebi, Salmonn; Madani, Mohammad H.; Madani, Ali; Chien, Ashley; Shen, Jody; Mastrodicasa, Domenico; Fleischmann, Dominik; Chan, Frandics P.; Mofrad, Mohammad R. K.

doi:10.1038/s41598-020-74936-7

Download PDF

Article
Open access
Published: 27 October 2020

Machine learning for endoleak detection after endovascular aortic repair

Salmonn Talebi¹^na1,
Mohammad H. Madani²^na1,
Ali Madani^1,3,
Ashley Chien¹,
Jody Shen²,
Domenico Mastrodicasa²,
Dominik Fleischmann²,
Frandics P. Chan² &
…
Mohammad R. K. Mofrad^1,4

Scientific Reports volume 10, Article number: 18343 (2020) Cite this article

2488 Accesses
11 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Diagnosis of endoleak following endovascular aortic repair (EVAR) relies on manual review of multi-slice CT angiography (CTA) by physicians which is a tedious and time-consuming process that is susceptible to error. We evaluate the use of a deep neural network for the detection of endoleak on CTA for post-EVAR patients using a novel data efficient training approach. 50 CTAs and 20 CTAs with and without endoleak respectively were identified based on gold standard interpretation by a cardiovascular subspecialty radiologist. The Endoleak Augmentor, a custom designed augmentation method, provided robust training for the machine learning (ML) model. Predicted segmentation maps underwent post-processing to determine the presence of endoleak. The model was tested against 3 blinded general radiologists and 1 blinded subspecialist using a held-out subset (10 positive endoleak CTAs, 10 control CTAs). Model accuracy, precision and recall for endoleak diagnosis were 95%, 90% and 100% relative to reference subspecialist interpretation (AUC = 0.99). Accuracy, precision and recall was 70/70/70% for generalist1, 50/50/90% for generalist2, and 90/83/100% for generalist3. The blinded subspecialist had concordant interpretations for all test cases compared with the reference. In conclusion, our ML-based approach has similar performance for endoleak diagnosis relative to subspecialists and superior performance compared with generalists.

CathAI: fully automated coronary angiography interpretation and stenosis estimation

Article Open access 11 August 2023

Computed tomography-based automated measurement of abdominal aortic aneurysm using semantic segmentation with active learning

Article Open access 18 April 2024

Prediction of stent under-expansion in calcified coronary arteries using machine learning on intravascular optical coherence tomography images

Article Open access 23 October 2023

Introduction

Endovascular aortic repair (EVAR) is the primary treatment for many patients with aortic pathology particularly in the setting of abdominal aortic aneurysm. The procedure has largely replaced the traditional open surgical approach employed in the past which is often associated with increased morbidity and mortality in the peri-operative period^1,2,3,4,5. Lifelong surveillance imaging is typically performed to evaluate for postoperative EVAR complications which may be asymptomatic and potentially fatal⁶. Endoleak is one of the main and recognized complications associated with EVAR⁷. Endoleak is defined as persistence of blood flow outside the stent graft and within the aneurysm which may lead to growth and subsequent rupture of the aneurysm sac^{8, 9}. Computed tomography angiography (CTA) is the standard imaging technique for postoperative surveillance following EVAR^{10, 11}. Currently in the routine clinical setting, detection of endoleak requires manual review of multi-slice CTA scans. The interpretation process by humans is tedious, may be subject to error and/or demonstrate variability between human readers.

Machine learning (ML) is an emerging technique which has been increasingly applied to various fields in medicine such as cardiology¹², radiology^13,14,15, ophthalmology^{16, 17}, dermatology^{18, 19}, and pathology^{20, 21}. Machine learning algorithms may learn from examples and respond to new inputs based on their prior training²². Machine learning may provide a means to facilitate human CTA endoleak detection in various ways including efficiency, accuracy and standardization of interpretation. The objective of our study is to develop and test a machine learning based model for endoleak detection as well as compare its performance with that of both subspecialist and general diagnostic radiologists.

A substantial amount of labeled segmentation maps is needed to produce a state-of-the-art supervised deep learning segmentation model. Obtaining manual segmentation maps for medical images is a tedious and costly process. Data augmentation is an important and effective method to help capture the complete distribution of possible data. Advanced data augmentation techniques have been shown to provide substantial model performances increases²⁶. Studies have also shown the feasibility of getting state-of-the-art performance by augmenting just one image²⁷. Data augmentation techniques such as random image rotations and introduction of nonlinear deformations have been shown effective in improving medical segmentation model accuracies²⁸. Other augmentation methods involving adding or subtracting regions of images have not been commonly used for natural images because the outcomes appear artificial. However, the addition or subtraction of regions in CT images may be more feasible due to the constraints of CT images compared to natural images. In this study, we present a novel data augmentation method using segmentation maps to augment CT slices by adding and removing regions of the CT slice containing an endoleak.

Methods

Data preprocessing

This retrospective study was conducted with the approval of the Stanford Institutional Review Board (IRB) and under a waiver of informed consent. All methods were performed in accordance with relevant guidelines and regulations. Fifty CTA scans from 50 post-EVAR patients with endoleak and 20 CTA scans from 20 post-EVAR patients without endoleak were retrospectively identified. The presence or absence of endoleak in each patient was determined based on the corresponding clinical CTA radiology report dictated by a cardiovascular imaging subspecialty trained diagnostic radiologist. Post-EVAR patients without endoleak are referred to as controls in this study.

The CTA imaging from the positive endoleak cases and negative controls were sent from the picture archiving and communication system (PACS) to TeraRecon (TeraRecon Inc., Foster City, CA) for obtaining de-anonymized DICOM images. The deanonymized CT images included noncontrast, arterial phase, and delayed phase images for each positive endoleak case and control. The de-anonymized images were subsequently transferred to a secure encrypted password-protected server. The DICOM files were then processed into a machine-readable format using Pydicom, flattened, and stored in a storage-efficient manner using HDF5 standards.

CTA scans were split into sets used for training (32 positive endoleak CTAs, 8 control CTAs), validation (8 positive endoleak CTAs, 2 control CTAs), as well as a held-out test subset (10 positive endoleak CTAs, 10 control CTAs). Endoleak regions were labeled for training purposes by manual contouring of individual CT images on all positive endoleak CTAs which was performed by a diagnostic radiologist with cardiovascular imaging subspecialization. Manual segmentation of the endovascular stent lumen and aneurysm sac was also performed for a subset of the controls (10 CTAs). Image pre-processing was performed by thresholding the raw pixel values, mean-centering data, and resizing images.

Data augmentation

Standard data augmentation techniques included image rotation and introduction of pixel noise to enhance the training process for the ML model. Additional data augmentation was also performed with a custom developed endoleak augmentation technique referred to as Endoleak Augmentor. Endoleak Augmentor involves the addition and removal of endoleaks on individual CTA images using aneurysm sac and endoleak masks derived from segmentation (Fig. 1) which were then inputted to the ML model for training. Output from the Endoleak Augmentor was verified for adequate addition and removal of endoleaks by a cardiovascular imaging subspecialist radiologist prior to supplying the images to the algorithm for training.

The Endoleak Augmentor was integrated into Keras as a custom data generator class (Algorithm 1). During data generation a batch of labeled data X are selected. For each x_b in the batch we generate transformed version x’_b depending on the augment ID l_b (algorithm 1, lines 5, 8 and 12).

Endoleaks are inserted into CT slices using x’_b, y’_b = augment_adder(x_b, x_ab, u_e, σ_e) (algorithm 1, line 9). Using the aneurysm sac segmentation map x_ab as a boundary, an endoleak of random shape is generated and inserted into the aneurysm sac. Pixel values for the endoleak are calculated using a Gaussian distribution of endoleak pixel values calculated from u_e, σ_e. A new CT slice x’_b is generated that contains an endoleak.

Endoleaks are removed from CT slices using x’_b, y’_b = augment_remover(x_b, x_eb, u_a, σ_a) (algorithm 1, line 13). Using the endoleak segmentation map x_eb the endoleak pixels are replaced using a Gaussian distribution of aneurysm sac pixel values calculated from u_a, σ_a. A new CT slice x’_b is generated with the endoleak removed.

Algorithm 1

Endoleak Augmentor takes a batch of labeled CT slices X and corresponding augmentation labels l. Depending on the augmentation label, a CT slice will either have an endoleak added, an endoleak removed, or no augmentation. The algorithm produces a collection of updated labeled CT slices X’.

Model training and evaluation

Our training set contains a total of 10,539 CTA slices of which 4190 slices contain endoleak segmentation maps and 1532 contain stent lumen and aneurysm sac segmentation maps. A separate validation set containing a total of 1940 CTA slices of which 746 were positive for endoleaks. The training slices are input into the Endoleak Augmentor which transforms the slices and inputs them into our deep learning model. A U-net style model with a convolutional encoder–decoder architecture was used to generate predicted endoleak segmentation maps. Model overview with architecture is depicted in Figs. 2, 3. During training, various hyperparameters were evaluated to improve the model’s prediction performance. Next, the threshold to determine whether a slice is considered yes for endoleak is optimized by picking the probability threshold that maximized the precision, recall and F1 scores of the validation set. Additional post processing logic is implemented to remove false positive predictions that contain very small or very large endoleaks. A final post processing step is performed on all of a patient’s CTA slices to determine if a patient has an endoleak. Testing was performed at a per slice and per case level based on a subset of 10 positive endoleak cases and 10 controls. The same test subset was also reviewed by three additional blinded general diagnostic radiologists and one blinded cardiovascular imaging subspecialty trained radiologist for comparison against the ML model at a per case level.

Results

Table 1 summarizes the clinical characteristics of the study’s patients. Patients were predominately male and elderly with elevated systolic blood pressures. The most frequent comorbidity among patients was hypertension followed by coronary artery disease with no statistically significant differences between post-EVAR patients with and without endoleak. The majority of patients with endoleak as well as most of the controls underwent EVAR for abdominal aortic or aortoiliac aneurysm (48/50 for the endoleak group, 18/20 for the control group). Isolated iliac artery aneurysm or abdominal aortic dissection was the EVAR indication in 2/50 endoleak group patients and 2/20 control group patients. Prevalence of endoleak types are as follows: type 1 (5/50), type 2 (28/50), type 3 (8/50). Nine of the remaining 50 endoleak group patients had either multiple or indeterminate endoleak types. Embolization material in conjunction with EVAR was present in 10/50 endoleak group patients and 2/20 control group patients. Three of the twenty (15%) of the CTA scans in the test subset had embolization material associated with the endovascular stent graft.

Table 1 Patient clinical characteristics.

Full size table

The area under the curve (AUC) for individual CT slice prediction at a per slice level was 0.89 for a set of 600 CT slices randomly selected from the test subset (Fig. 4). A patient level prediction is then made using an ensemble of all of a patient’s individual CT slice predictions. Performances of the machine learning model and 3 blinded general diagnostic radiologists on a per patient or case level relative to the gold standard interpretation by a cardiovascular imaging subspecialty trained radiologist are shown in Table 2. Accuracy, precision and recall of the ML model was 95%, 90%, and 100% with an AUC of 0.99. The accuracy confidence intervals— as determined by the standard deviation of 1000 bootstrapped sets sampled with replacement—was 11%. Accuracy, precision and recall was 70%, 70% and 70% for the blinded general radiologist 1. Blinded general radiologist 2 had an accuracy of 50%, precision of 50% and recall of 90%. Accuracy, precision and recall for the blinded general radiologist 3 was 90%, 83% and 100%. The blinded subspecialist had interpretations for presence or absence of endoleak that were concordant with the reference clinical radiology report dictated by a cardiovascular imaging subspecialist for all test cases. Examples of masks predicted by the ML model are shown in Fig. 5.

Table 2 Performance metrics for the machine learning (ML) model and general radiologists.

Full size table

Discussion

The detection of endoleak following EVAR requires meticulous review of multi-slice CTA scans by humans which can be time-consuming and potentially inaccurate. There has been sparse prior investigation into the application of machine learning in the setting of post-EVAR CTA imaging particularly evaluation for endoleak. One study has demonstrated use of a computer vision algorithm for the segmentation of the inner and outer boundaries of abdominal aortic aneurysms²³. A recent study by Hahn et al. evaluated the use of a deep learning method for endoleak identification²⁴. Our machine learning model accuracy exceeded their study (95% vs. 89%) with an AUC of 0.99 versus 0.94. Hahn et al. reported using the radiology report as the gold standard for endoleak detection with only a subset of 100 CTA images independently read by two human readers consisting of one interventional radiologist and one vascular surgeon. However, the performance metrics of their ML model relative to each of the individual human reader performances for endoleak diagnosis were not stated. In our study in addition to using the radiology report interpreted by cardiovascular imaging subspecialized diagnostic radiologists as the gold standard, we report our machine learning endoleak detection performance along with the individual performances of 3 additional blinded general diagnostic radiologists and 1 additional blinded cardiovascular imaging subspecialty trained radiologist. The purpose of our study design is to provide for further direct comparison at both a machine versus human level and at a human versus human level. Our study raises the possibility that there may be variability even among human readers for endoleak diagnosis, a topic which has not been extensively investigated in the literature²⁵. The prior study also excluded CT images with prior embolization although fifteen percent of our test subset CTAs had embolization material which can be encountered in the clinical setting.

Data augmentation is critical to the success of machine learning model performance across domains from computer vision to natural language processing. In this work, we introduce a novel data augmentation technique that can be broadly applied for recognition tasks in medical imaging. This augmentation method provides multiple benefits for training a deep learning model. One benefit is the training set now contains anatomically identical CT slices, one with an endoleak and one without an endoleak. This helps the deep learning model focus on the variation representing an endoleak and not other anatomical differences. Another benefit is the amount of new endoleak CT slices we can generate. For each CTA slice containing an aneurysm sac segmentation map we are able insert many uniquely shaped endoleaks. This increases both the quantity and variance of the training data—ideally improving overall performance and reducing overfitting effects.

Our study has several limitations. Since our data came from a single medical center, the generalizability of the machine learning model may have been limited. Furthermore, our sample sizes including the test subset were small and further studies with more patients are needed for validation of the deep neural network. Lastly, our model uses prediction masks to detect only the existence of an endoleak. However, the endoleak prediction masks can serve as interpretable predictions of the endoleak type and aneurysm size. In future work the quantification of endoleak type and aneurysm size from the prediction masks will be evaluated.

The introduction of machine learning based systems into the clinical setting for post-EVAR surveillance may potentially lead to increased efficiency, accuracy, and greater consistency among readers for post procedural complication detection which could in turn result in improved management of these patients. In summary, this study demonstrates that our machine learning method performance is comparable to that of cardiovascular imaging subspecialist radiologists and superior to that of general radiologists. This raises the possibility that machine learning may eventually assist humans in the interpretation of post-EVAR scans in routine clinical practice.

References

Elkouri, S. et al. Perioperative complications and early outcome after endovascular and open surgical repair of abdominal aortic aneurysms. J. Vasc. Surg. 39, 497–505 (2004).
Article Google Scholar
Daye, D. & Walker, T. G. Complications of endovascular aneurysm repair of the thoracic and abdominal aorta: evaluation and management. Cardiovas. Diagn. Ther. 8, S138–S156 (2018).
Article Google Scholar
Greenhalgh, R. M. et al. Comparison of endovascular aneurysm repair with open repair in patients with abdominal aortic aneurysm (EVAR trial 1), 30-day operative mortality results: randomised controlled trial. Lancet 364, 843–848 (2004).
Article CAS Google Scholar
Prinssen, M. et al. Dutch randomized endovascular aneurysm management (DREAM) trial group. A randomized trial comparing conventional and endovascular repair of abdominal aortic aneurysms. N. Engl. J. Med. 351, 1607–1618 (2004).
Article CAS Google Scholar
Paravastu, S. C. et al. Endovascular repair of abdominal aortic aneurysm. Cochrane Database Syst. Rev. 1, 4178 (2014).
Google Scholar
Picel, A. C. & Kansal, N. Essentials of endovascular abdominal aortic aneurysm repair imaging: postprocedure surveillance and complications. Am. J. Roentgenol. 203, 358–372 (2014).
Article Google Scholar
Golzarian, J. & Struyven, J. Imaging of complications after endoluminal treatment of abdominal aortic aneurysms. Eur. Radiol. 11, 2244–2251 (2001).
Article CAS Google Scholar
Heye, S. Diagnosis and treatment of endoleaks after endovascular repair of thoracic and abdominal aortic aneurysms. JBR-BTR 4, 189–195 (2013).
Google Scholar
Bashir, M. R., Ferral, H., Jacobs, C., Mccarthy, W. & Goldin, M. Endoleaks after endovascular abdominal aortic repair: management strategies according to CT findings. AJR 192, 178–186 (2009).
Article Google Scholar
Pandey, N. & Litt, H. I. Surveillance imaging following endovascular aneurysm repair. Sem. Interv. Radiol. 32, 239–248 (2015).
Article Google Scholar
Uthoff, H. et al. Current clinical practice in postoperative endovascular aneurysm repair imaging surveillance. J. Vasc. Interv. Radiol. 23, 1152–1159 (2012).
Article Google Scholar
Madani, A., Ong, J. R., Tibrewal, A. & Mofrad, M. R. K. Deep echocardiography: data efficient supervised and semi-supervised deep learning towards automated diagnosis of cardiac disease. npj Digital Med. 1, 1–11 (2018).
Article Google Scholar
Ribli, D., Horvath, A., Unger, Z., Pollner, P. & Csabai, I. Detecting and classifying lesions in mammograms with deep learning. Sci. Rep. 8, 4165 (2018).
Article ADS Google Scholar
Lindsey, R. et al. Deep neural network improves fracture detection by clinicians. Proc. Natl. Acad. Sci. 115, 11591–11596 (2018).
Article MathSciNet CAS Google Scholar
Lehman, C. D. et al. Mammographic breast density assessment using deep learning: clinical implementation. Radiology 290, 52–58 (2019).
Article Google Scholar
Gulshan, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 2402–2410 (2016).
Article Google Scholar
Lee, C. S., Baughman, D. M. & Lee, A. Y. Deep learning is effective for classifying normal versus age-related macular degeneration optical coherence tomography images. Ophthalmol. Retina 1, 322–327 (2016).
Article Google Scholar
Phillips, M. et al. Assessment of accuracy of an artificial intelligence algorithm to detect melanoma in images of skin lesions. JAMA Netw. Open 2, e1913436 (2019).
Article Google Scholar
Han, S. S. et al. Classification of the clinical images for benign and malignant cutaneous tumors using a deep learning algorithm. J. Invest. Dermatol. 138, 1529–1538 (2018).
Article CAS Google Scholar
Steiner, D. F. et al. Impact of deep learning assistance on the histopathologic review of lymph nodes for metastatic breast cancer. Am J. Surg. Pathol. 42, 1636–1646 (2018).
Article Google Scholar
Fuyong Xing, F., Hai, Su. H., Neltner, J. & Lin, Y. L. Automatic Ki-67 counting using robust cell detection and online dictionary learning. IEEE Trans. Biomed. Eng. 61, 859–870 (2014).
Article Google Scholar
Rajkomar, A., Dean, J. & Kohane, I. Machine learning in medicine. N. Engl. J. Med. 380, 1347–1358 (2019).
Article Google Scholar
Lu, J., Egger, J., Wimmer, A., Großkopf, S. & Freisleben, B. Detection and visualization of endoleaks in CT data for monitoring of thoracic and abdominal aortic aneurysm stents. SPIE Medical Imaging. 69181F (2008).
Hahn, S., Perry, M., Morris, C., Wshah, S. & Bertges, D. Machine deep learning accurately detects endoleak after endovascular abdominal aortic aneurysm repair. Vasc. Sci. 1, 5–12 (2020).
Google Scholar
Nolz, R. et al. Type 2 endoleaks: the diagnostic performance of non-specialized readers on arterial and venous phase multi-slice CT angiography. PLoS ONE 11, e0149725 (2016).
Article Google Scholar
Xie, Q., Dai, Z., Hovy, E., Luong, M., & Le, Q. Unsupervised data augmentation for consistency training. arXiv:1904.12848 (2019).
Asano, Y., Rupprecht, C. & Vedaldi, A. A critical analysis of self-supervision, or what we can learn from a single image. arXiv:1904.13132 (2019).
Pereira, S., Pinto, V., Alves, V. & Silva, C. A. Brain tumor segmentation using convolutional neural networks in MRI images. IEEE Trans. Med. Imaging 35, 1240–1251 (2016).
Article Google Scholar

Download references

Author information

These authors contributed equally: Salmonn Talebi and Mohammad H. Madani.

Authors and Affiliations

Molecular Cell Biomechanics Laboratory, Departments of Bioengineering and Mechanical Engineering, University of California, 208A Stanley Hall #1762, Berkeley, CA, 94720-1762, USA
Salmonn Talebi, Ali Madani, Ashley Chien & Mohammad R. K. Mofrad
Department of Radiology, School of Medicine, Stanford University, Stanford, CA, USA
Mohammad H. Madani, Jody Shen, Domenico Mastrodicasa, Dominik Fleischmann & Frandics P. Chan
Salesforce Research, Palo Alto, CA, USA
Ali Madani
Molecular Biophysics and Integrative Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
Mohammad R. K. Mofrad

Authors

Salmonn Talebi
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad H. Madani
View author publications
You can also search for this author in PubMed Google Scholar
Ali Madani
View author publications
You can also search for this author in PubMed Google Scholar
Ashley Chien
View author publications
You can also search for this author in PubMed Google Scholar
Jody Shen
View author publications
You can also search for this author in PubMed Google Scholar
Domenico Mastrodicasa
View author publications
You can also search for this author in PubMed Google Scholar
Dominik Fleischmann
View author publications
You can also search for this author in PubMed Google Scholar
Frandics P. Chan
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad R. K. Mofrad
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.M., M.M., F.P.C. and M.R.K.M. conceived of the research study. A.C., A.M., S.T., M.M. contributed toward the design, implementation, and evaluation of machine learning and data processing techniques. M.M. was responsible for annotating all patient CT slices and creating segmentation maps of endoleaks, aneurysm sacs and stents. J.S. and D.M. were involved with evaluation of the test set. A.M., D.F., F.P.C., M.M., M.R.K.M., S.T. managed the project vision and implementation along with writing of the manuscript.

Corresponding authors

Correspondence to Frandics P. Chan or Mohammad R. K. Mofrad.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Talebi, S., Madani, M.H., Madani, A. et al. Machine learning for endoleak detection after endovascular aortic repair. Sci Rep 10, 18343 (2020). https://doi.org/10.1038/s41598-020-74936-7

Download citation

Received: 04 May 2020
Accepted: 30 September 2020
Published: 27 October 2020
DOI: https://doi.org/10.1038/s41598-020-74936-7

This article is cited by

Machine learning in vascular surgery: a systematic review and critical appraisal
- Ben Li
- Tiam Feridooni
- Mohammed Al-Omran
npj Digital Medicine (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.