Predicting mortality from AI cardiac volumes mass and coronary calcium on chest computed tomography

Miller, Robert J. H.; Killekar, Aditya; Shanbhag, Aakash; Bednarski, Bryan; Michalowska, Anna M.; Ruddy, Terrence D.; Einstein, Andrew J.; Newby, David E.; Lemley, Mark; Pieszko, Konrad; Van Kriekinge, Serge D.; Kavanagh, Paul B.; Liang, Joanna X.; Huang, Cathleen; Dey, Damini; Berman, Daniel S.; Slomka, Piotr J.

doi:10.1038/s41467-024-46977-3

Download PDF

Article
Open access
Published: 29 March 2024

Predicting mortality from AI cardiac volumes mass and coronary calcium on chest computed tomography

Nature Communications volume 15, Article number: 2747 (2024) Cite this article

1827 Accesses
45 Altmetric
Metrics details

Subjects

Abstract

Chest computed tomography is one of the most common diagnostic tests, with 15 million scans performed annually in the United States. Coronary calcium can be visualized on these scans, but other measures of cardiac risk such as atrial and ventricular volumes have classically required administration of contrast. Here we show that a fully automated pipeline, incorporating two artificial intelligence models, automatically quantifies coronary calcium, left atrial volume, left ventricular mass, and other cardiac chamber volumes in 29,687 patients from three cohorts. The model processes chamber volumes and coronary artery calcium with an end-to-end time of ~18 s, while failing to segment only 0.1% of cases. Coronary calcium, left atrial volume, and left ventricular mass index are independently associated with all-cause and cardiovascular mortality and significantly improve risk classification compared to identification of abnormalities by a radiologist. This automated approach can be integrated into clinical workflows to improve identification of abnormalities and risk stratification, allowing physicians to improve clinical decision-making.

Opportunistic assessment of ischemic heart disease risk using abdominopelvic computed tomography and medical record data: a multimodal explainable artificial intelligence approach

Article Open access 29 November 2023

Automated coronary calcium scoring using deep learning with multicenter external validation

Article Open access 01 June 2021

Deep convolutional neural networks to predict cardiovascular risk from computed tomography

Article Open access 29 January 2021

Introduction

Coronary artery disease (CAD) is a leading cause of morbidity and mortality^1,2. Coronary artery calcium (CAC) scores obtained from non-contrast ECG-gated computed tomography (CT) has emerged as a method for evaluation of asymptomatic patients^1,2. CAC scores are a robust marker of cardiovascular risk^{3,4,5,6,7,8,9,10}, and may even help improve patient compliance with medical therapies and lifestyle interventions^11,12. Contrast-enhanced cardiac CT also provides information regarding cardiac chamber volumes and left ventricular (LV) mass which are predictive of mortality¹³, and cardiovascular events¹⁴. Importantly, CAC is not routinely evaluated on non-cardiac CT. Additionally, cardiac chamber volumes and left ventricular mass classically could not be evaluated on non-contrast CT, since contrast is required to differentiate myocardium from blood pool and to identify the valve planes which separate cardiac chambers. However, these non-cardiac, non-contrast CT scans make up the vast majority of the over 15 million chest CT scans performed annually in the United States alone¹⁵.

Recent advancements in artificial intelligence (AI) have potentially enabled quantification of these measures from non-gated CT imaging. CAC can be manually measured from non-gated CT imaging¹⁶, with excellent correlations CAC scores from gated examinations¹⁷. However, manual annotation of CAC is time consuming, particularly for the lower radiation dose scans, which are used for cancer screening. AI has been applied to automate quantification of CAC from lung cancer screening CT scans^18,19, and was associated with cardiovascular mortality in a selected cohort of patients from the National Lung Screening Trial (NLST)²⁰. A recently developed AI model may also facilitate quantification of left and right atrial and ventricular volumes and LV mass from non-contrast CT²¹, but these estimates from ungated, non-contrast CT have never been validated as markers of risk.

We integrated our convolutional AI model which automatically measures CAC^22,23,24, with another AI model (TotalSegmentator) which automatically segments cardiac chamber volumes²¹. The aim of our study was to evaluate the clinical potential of a fully automated AI pipeline that estimates CAC, cardiac chamber volumes, LV mass, and shape index when applied to low-dose (non-contrast and ungated) lung CT with respect to predicting clinical outcomes in three external populations.

Results

Population characteristics—NLST

We included a total of 24354 patients with median age 61 (IQR 57–65), of whom and 14441 (59.3%) were males. The overall study design is shown in Fig. 1. The model was able to process chamber volumes and coronary artery calcium with an end-to-end processing time of ~18 s, while failing to segment only 0.1% of cases. In total, 4618 (19.0%) had CAC 0, 9006 (37.0%) had CAC 1–100, 4816 (19.8%) had CAC 101–400, and 5914 (24.3%) had CAC > 400. Population characteristics in patients categorized by extent of CAC are presented in Table 1.

Table 1 Population characteristics stratified by extent of coronary artery calcification (CAC)

Full size table

Histograms outlining the distribution of CAC, LV, left atrial (LA), right ventricular (RV), and right atrial (RA) volumes and LV mass index are shown in Supplemental Fig. 1 and the correlation between values as shown in Supplemental Fig. 2. Correlation between gated CT measurements and ungated CT estimates are summarized in Supplemental Fig. 3. All Spearman correlations were excellent (LV myocardium r = 0.947, LA volume r = 0.926, RA volume r = 0.893, LV volume r = 0.793, and RV volume r = 0.922). Comparisons between baseline CT estimates and one-year CT estimates in 22292 patients are shown in Fig. 2. Correlations were excellent (LV myocardium r = 0.917, LA volume r = 0.866, RA volume r = 0.864, LV volume r = 0.892, and RV volume r = 0.899) with no significant bias.

**Fig. 2: Correlation between baseline and follow-up values.**

Case examples showing segmentation of CAC and chamber volumes are shown in Fig. 3. The current clinical standard, radiologist identified cardiovascular abnormalities, were noted in a minority of scans, with reported abnormalities on 62 (1.3%) patients with CAC 0, 311 (3.5%) patients with CAC 1–100, 360 (7.5%) patients with CAC 101–400, and 828 (14.0%) patients with CAC > 400 and in only 303 (9.3%) of patients with abnormal LV mass index.

Associations with all-cause mortality—NLST

During median follow-up of 6.7 years (IQR 6.3–7.0), 1795 (7.4%) patients died. Of those deaths, 459 (25.6%) were adjudicated as cardiovascular deaths. Kaplan-Meier curves for all-cause mortality stratified by CAC categories are displayed in Supplemental Fig. 4. Increasing CAC category was associated with an increasing risk of all-cause mortality as shown in Supplemental Table 1. Identification of cardiovascular abnormality by the radiologist was associated with less risk (unadjusted HR 1.84, 95% CI 1.58–2.13) than the presence of CAC > 100. Quartiles of LV, LA, RV, and RV volume provided risk stratification for all-cause mortality as shown in Supplemental Fig. 5. Abnormal LV mass index was also associated with an increased risk of mortality (unadjusted HR 1.76, 95% CI 1.57–1.97).

Associations with all-cause mortality in the multivariable model are presented in Supplemental Table 2. Patients with CAC 1–100 (adjusted HR 1.24, 95% CI 1.04 –1.47), CAC 101–400 (adjusted HR 1.56, 95% CI 1.30–1.87), and CAC > 400 (adjusted HR 1.88, 95% CI 1.57–2.24) were at increased risk of all-cause mortality. Increasing LA volume (adjusted HR 1.11, 95% CI 1.06–1.16), LV mass index (adjusted HR 1.34, 95% CI 1.22–1.47), and shape index (adjusted HR 1.31, 95% CI 1.02–1.66) were also associated with increased risk of death.

Associations with Cardiovascular Mortality - NLST

Kaplan-Meier curves for cardiovascular mortality stratified by CAC are shown in Fig. 4, and further detailed in Supplemental Table 1. Quartiles of LV, LA, RV, and RA volume provided risk stratification for cardiovascular mortality as shown in Supplemental Fig. 6. Incidences of cardiovascular mortality in patients with normal compared to abnormal chamber volumes are shown in Supplemental Fig. 7.

**Fig. 4: Kaplan-Meier survival curves for cardiovascular mortality.**

Associations with cardiovascular mortality in the multivariable model are shown in Table 2. Patients with CAC 101–400 (adjusted subHR 2.59, 95% CI 1.67–4.03, p < 0.001) and CAC > 400 (adjusted subHR 3.57, 95% CI 2.31–5.54, p < 0.001) were at significantly increased risk of cardiovascular mortality. Increasing LA volume (adjusted HR 1.14, 95% CI 1.05–1.24, p = 0.001), and LV mass index (adjusted HR 1.26, 95% CI 1.05–1.51, p = 0.012) were also associated with increased risk of cardiovascular death.

Table 2 Associations with cardiovascular mortality

Full size table

Adjusted associations with cardiovascular mortality were similar when limited to patients without a history of heart disease (Supplemental Table 3). In patients with a history of heart disease CAC was not associated with cardiovascular death, but LV mass index was (adjusted subHR 1.37 per 10 g/m², 95% CI 1.04–1.80, p = 0.025). Associations with cardiovascular mortality in patients without reported cardiovascular abnormalities were similar to the primary analysis (Supplemental Table 4). Results stratified by tube voltage and slice thickness are shown in Supplemental Tables 5 and 6.

Categorical NRI results for cardiovascular mortality are shown in Supplemental Tables 7-8. Compared to radiologist identified cardiovascular abnormalities, all groups of imaging variables significantly improved categorical risk classification, with overall improvement 2.8%−20.3%. However, the combination of all imaging variables led to the greatest improvement, with overall categorical reclassification improvement of 25.9% (95% CI 20.6%–31.2%). Similar results were seen when assessing improvement in classification compared to a multivariable model incorporating age, sex, smoking history, and past medical history.

ROCs for all-cause mortality and cardiovascular mortality are shown in Fig. 5. The AUC for all-cause mortality of the combination of all quantitative imaging variables (AUC 0.657, 95% CI 0.644 – 0.671) was higher than for CAC (AUC 0.638, 95% CI 0.625–0.652), LV mass index (AUC 0.586, 95% CI 0.572–0.600), LA volume (AUC 0.574, 95% CI 0.560–0.588), shape index (AUC 0.538, 95% CI 0.524–0.553, p < 0.001), or radiologist identification of abnormalities (AUC 0.523, 95% CI 0.516–0.531, p < 0.001). Similarly, AUC for cardiovascular mortality of the combination of all imaging variables (AUC 0.752, 95% CI 0.729–0.775) was higher than for CAC (AUC 0.706, 95% CI 0.683–0.729), LV mass index (AUC 0.674, 95% CI 0.649–0.700), LA volume (AUC 0.633, 95% CI 0.606–0.660), shape index (AUC 0.572, 95% CI 0.544–0.600, p < 0.001), or radiologist identification of abnormalities (AUC 0.530, 95% CI 0.514–0.545, p < 0.001). Comparison of prediction performance for clinical, imaging, and combined models are shown in Fig. 6.

**Fig. 5: Receiver operating characteristic curves for all-cause mortality and cardiovascular mortality using features.**

**Fig. 6: Receiver operating characteristic curves for all-cause mortality and cardiovascular mortality using models.**

EISNER population

We included 2014 patients who underwent CAC scanning as part of the Early Identification of Subclinical Atherosclerosis by Noninvasive Imaging Research (EISNER) trial to provide external validation in a healthier population. During median follow-up 14.6 years (IQR 1.9–17.4) cardiac death or MI occurred in 74 (3.7%) patients. Population characteristics stratified by occurrence of cardiac death or MI are shown in Supplemental Table 9. Median LA volume was higher (67.9 ml vs 60.6 ml, p < 0.001) and prevalence of CAC > 400 was higher (27.0% vs 6.4%, p < 0.001) in patients who experienced cardiac death or MI. Associations with cardiac death or MI are shown in Supplemental Table 10. Models combining clinical and imaging data (AUC 0.804, 95% CI 0.759–0.849, p < 0.001) and imaging data alone (AUC 0.792, 95% CI 0.746–0.838, p = 0.012) had higher AUC for cardiac death or MI compared to a clinical model incorporating age, sex, and medical history (AUC 0.715, 95% CI 0.653–0.776) as shown in Supplemental Fig. 8. Including DL-imaging features also significantly improved categorical and continuous NRI (Supplemental Table 11). Risk stratification for cardiac death or MI in young (age <60 years) non-smokers is shown in Supplemental Table 12.

Low-dose CT population

We included 3319 patients referred for myocardial perfusion imaging who underwent low-dose, ungated CT for attenuation correction of the perfusion scan to provide further external validation. During median follow-up 2.9 years (IQR 1.6–5.0) death or MI occurred in 177 (5.3%) patients. Population characteristics stratified by occurrence of cardiac death or MI are shown in Supplemental Table 13. Median LA volume (85.0 ml vs 76.4 ml, p < 0.001) and median LV volume (129.4 ml vs 118.4 ml, p < 0.001) were higher in patients who experienced death or MI. Associations with death or MI are shown in Supplemental Table 14. Receiver operating characteristic curves for death or MI using clinical, imaging, and combined models are shown in Supplemental Fig. 9. Including DL-imaging features also significantly improved categorical and continuous NRI (Supplemental Table 15).

Discussion

We evaluated whether quantifying CAC, cardiac volumes, LV mass, and ventricular morphology using two previously validated AI models could improve risk stratification of patients undergoing non-cardiac lung CT scans in a large external population from the NLST trial. We demonstrated that LV mass index and LA volume from non-contrast, ungated CT scans are associated with all-cause and cardiovascular mortality. We also found that higher deep learning (DL)-derived CAC was associated with an increased risk of both all-cause mortality and cardiovascular mortality. Importantly, we also demonstrated that a combined model incorporating CAC, cardiac volumes, LV mass index, and ventricular morphology had higher prediction performance than any measure in isolation. Furthermore, the combined model improved categorical risk classification of over 25% of patients compared to the current clinical standard, radiologist identification of cardiovascular abnormality. We went on to show associations between CAC, cardiac volumes, LV mass index and clinical outcomes in two additional external populations with different risk profiles. Given that in the United States, 428 CT scans are performed per 1000 adults each year²⁵, this approach could be used to improve identification of cardiovascular abnormalities and estimation of cardiovascular risk for a substantial number of patients.

Chest imaging is one of the most frequently performed CT examinations¹⁵, with dedicated cardiac imaging representing a small fraction of those. While incidental cardiac findings may be seen on over half of those studies, it is only reported on 3-31% of studies²⁶. It is possible that Radiologists could identify additional abnormalities if specifically focused on cardiac incidentals, but this is not the case in typical clinical practice. This is consistent with our finding that only 14% of patients with CAC > 400 and less than 10% of patients with abnormal LV mass index had abnormalities reported. This care gap exists despite guidelines suggesting that CAC be routinely assessed on all non-cardiac chest CT scans¹⁶. The proposed approach could potentially simplify this process by providing automated estimates of chamber volumes, LV mass index, and CAC for radiologists to incorporate during reporting, leading to substantially improved prediction of cardiovascular mortality and improved risk categorization in over 25% of patients. For example, AI-based identification of CAC (with radiologist oversight) improves adoption of medical therapy when coupled with automated notifications²⁷. The CAC model is computationally efficient, providing results in ~6 s (faster than a standard U-Net model)²⁸. We paired this model with a recently developed AI model for automated segmentation of structures from CT, with all results available in ~18 s. The combined workflow is fully automated and therefore could be readily incorporated into most clinical workflows without significant disruptions.

Our study demonstrates that cardiac chambers and LV myocardium can be estimated from non-contrast chest CT to improve risk classification. Patients with higher atrial or ventricular volumes and abnormal LV mass index were more likely to experience all-cause or cardiovascular mortality. Additionally, left atrial volume and LV mass index were associated with increased risk of both all-cause and cardiovascular mortality after adjusting for relevant confounding factors and all other imaging variables. We applied thresholds for abnormal cardiac volumes, which were based on a study of healthy individuals undergoing cardiac CT. While we did identify significant associations with cardiac outcomes, our results highlight the need for age and sex-specific normal values. While previous studies have demonstrated that CT-derived left ventricular volumes¹³ and left ventricular hypertrophy²⁹ are associated with adverse cardiovascular events, these studies were performed using contrast-enhanced studies. It may be possible for radiologists to provide chamber volume estimates from non-contrast scans, but this is not currently routinely performed and would likely have high inter-reader variability and be too time-consuming for routine clinical use. Lastly, we evaluated shape index and eccentricity index, which are measures of ventricular morphology. We demonstrated that higher shape index, representing a more spherical LV cavity, was independently associated with both all-cause and cardiovascular mortality. Similar volume measurements (but not CAC) can also be performed using cardiovascular magnetic resonance³⁰, but capacity is limited at most centers and our proposed DL-based estimates can be performed on any non-contrast CT. Importantly, associations were similar in patients without reported cardiac abnormalities. Left atrial volume and LV mass index were associated with cardiovascular mortality in patients with a history of cardiac disease (but not CAC, potentially due to inclusion of patients with previous stents or bypass grafts), providing a valuable method for risk stratification in this population. Similar results were demonstrated in a younger population of patients undergoing CAC scanning as part of the EISNER trial as well as in a third external cohort of patients undergoing low-dose, ungated CT with myocardial perfusion imaging. Lastly, we demonstrated that incorporating all of the imaging variables had the highest prediction performance for all-cause and cardiovascular mortality while also leading to the greatest improvement in categorical risk classification compared to radiologist identification of abnormalities.

In three large, external cohorts we demonstrated that CAC scores obtained in a fully automated manner using DL were associated with all-cause and cardiovascular mortality. Chiles et al. demonstrated that expert physician evaluation of CAC, with formal scoring or estimates, were associated with cardiovascular death with HRs of 6.10 – 6.95 for the highest CAC groups in a group of 1575 patients³¹. These risks are similar to that seen for CAC > 400 in our analysis in a much larger sample. Zeleznik et al. developed a U-net based DL model which automatically quantified CAC and showed that in a subset of 14959 patients from the NLST, subjects with CAC > 400 had an unadjusted HR 5.98 compared to CAC 0 for cardiovascular death²⁰. The higher risk demonstrated in our study (unadjusted subHR 7.07) could be explained by improved classification of patients with CAC 0 which are used as the reference risk group. We previously demonstrated favorable results for the cLSTM model compared to a U-net based model²⁸. Additionally, it is notable that the associations in our study were also present between CAC and all-cause mortality, suggesting that targeted interventions could potentially influence overall survival.

Our study has a few important limitations. We evaluated associations with cardiovascular death, but it is possible the cause of death is misclassified in some patients. However, results were similar when looking at associations with all-cause death. We have limited information regarding the exact nature of cardiovascular abnormalities which were identified. Therefore, we are not able to determine how frequently the identified abnormality was significant coronary calcification compared to other identifiable abnormalities, such as chamber enlargement or valve calcification. Similarly, we do not have further classification of history of heart disease. Additionally, we do not know if physicians initiated medical therapy in response to CT findings, which may decrease the associations between imaging findings and outcomes. While we did not assess the correlation between DL measurements and expert segmentations of CAC, we have previously demonstrated excellent intraclass correlation between the DL and expert CAC measurements^22,28. We performed several analyses in all three populations and some associations may be related to chance alone. However, the associations with DL-based imaging features were consistent across analyses and the likelihood of making multiple type 1 errors for the same variable would be minimal and applying corrections for multiple testing can increase the rate of type 2 error³². We did not incorporate race or ethnicity into our analyses. The majority of patients in the NLST trial were white (91% in our cohort); future studies should evaluate methods to incorporate more diverse populations. Lastly, we utilized DL models to extract known anatomic features; using DL to directly predict outcomes may lead to identification of latent features associated with outcomes. However, explanatory mechanisms would need to be implemented and validated in such an approach to warrant clinical use.

Imaging biomarkers—CAC, cardiac chamber volumes, LV mass, and shape index—can be automatically and rapidly quantified using DL from non-cardiac CT scans. These estimates were predictive of all-cause and cardiovascular mortality. DL-derived CAC scores improved classification of patients compared to expert identification of cardiovascular abnormalities. Routine measurements of all these parameters can potentially enhance risk stratification and improve clinical decision-making in the management of patients at risk of cardiovascular disease.

Methods

Study populations

The overall design of this retrospective study is shown in Fig. 1. This study utilized de-identified images from 3 separate external testing cohorts including patients from the NLST (NCT00047385), a multicenter randomized controlled trial of patients randomized to low-dose chest CT for lung cancer screening³³, asymptomatic patients from the EISNER trial (NCT00927693) who underwent CAC scanning³⁴, and patients from two centers who underwent myocardial perfusion imaging with low-dose, ungated, chest CT for attenuation correction. The NLST trial included current or former heavy smokers between the ages of 55 and 74. Patients were randomly assigned to non-contrast, non-ECG-gated chest CT imaging between 2002 and 2007. We included 24805 subjects from this external cohort (previously unseen by the DL models) with available baseline CT imaging and follow-up for mortality. Of those, we excluded cases where image files were corrupt (n = 133, 0.5%), the scan length was less than 12 cm or did not include the heart (n = 292, 1.2%), and cases where segmentation failed (n = 26, 0.1%), leaving 24354 subjects. In cases where segmentation failed, neither the CAC nor the cardiac volume model was able to process the scan. The baseline CT was used to assess associations with outcomes. We also compared estimates from baseline CT scans with estimates from CT scans performed at 1 year in 22292 patients as a measure of stability. Lastly, we compared cardiac volumes and left ventricular mass in a cohort of 80 patients from a clinical trial. These patients underwent low-dose, ungated CT and contrast-enhanced, ECG-gated, cardiac CT angiography on the same day, during a single imaging session, minimizing potential differences between scans (NCT02110303). Cardiac volumes and mass from ECG-gated, contrast-enhanced CT were annotated manually by experienced clinicians using dedicated software (Syngo.Via, Siemens Healthineers, Erlangen, Germany). The study protocol complied with the Declaration of Helsinki and was approved by the institutional review boards at participating institutions. The study used de-identified image sets and did not collect new data, therefore the research is considered non-human subject research.

Clinical data

For the NLST population, past medical history and smoking history were collected during the course of the trial³³. Additionally, clinical interpretation from each scan was recorded as part of the NLST trial including the presence or absence of clinically relevant cardiovascular abnormalities such as CAC or cardiomegaly. Patients had follow-up for all-cause mortality and information from death certificates regarding underlying cause. Cardiovascular mortality was determined for the ICD-10 codes using established definitions for cardiovascular mortality³⁵, and validated ICD-10 codes³⁶. For the EISNER population, medical history was determined at baseline and patients were followed prospectively for occurrence of cardiovascular death or myocardial infarction³⁴. For the third external population, demographics and medical history were determined at the time of CT scanning and incidence of all-cause mortality of myocardial infarction was determined from administrative databases.

CT image acquisition and reconstruction

Images were acquired at each participating site with site-specific protocols. Anonymized image datasets were used for the present analysis. In the NLST population, CT scans were acquired using 17 different camera systems, including systems manufactured by GE Healthcare, Philips, Siemens, and Toshiba. Most patients were imaged with tube voltage of 120 kVp (n = 21287, 87.5%) followed by 140 kVp (2313, 10.5%), and a small number of patients were imaged with other tube voltages (80 kVp n = 377, 90 kVp n = 141, 100 kVp n = 21, 130 kVp n = 4, 135 kVp n = 211). Median tube current was 60 mA (interquartile range 45–72). Mean pixel size was 0.67 mm and ranged from 0.44–0.98 mm. Reconstruction thickness ranged from 1–6 mm, with most patients having slice thickness of 2.5 mm (46.9%) or 2 mm (30.1%). For the EISNER population, patients underwent standard CAC scans including a single scan of ~30–40 slices which were 3 mm or 2.5 mm in thickness. For the remaining external cohort, CT scans were performed with a helical acquisition with a tube voltage of 120 kVp and slice thickness of 3.0 mm (n = 1878) or 5.0 mm (n = 1441).

Model architectures and training

We utilized our previously validated DL model for CAC segmentation²⁸. In brief, the system consists of two networks, the first of which is trained for segmentation of the heart silhouette and the second network was trained to segment the CAC. A supervised learning regimen was used for both segmentation networks. The heart mask was applied to the final CAC prediction to reduce bone overcalling or calcification in non-cardiac regions. For training, internal validation, and internal testing, we used data from 3 centers that included 9543 scans (1827 ECG-gated CAC scans and 7716 CT attenuation scans)^22,23. The model includes a correction factor for slice thickness, ensuring consistent scoring in spite of differences in slice thickness. CAC scores are automatically obtained from the DL segmentations using established methods³.

Cardiac chamber volumes and LV myocardium were segmented using TotalSegmentator²¹. The model utilizes the no new-net UNet (nnU-Net) architecture³⁷ to automatically segment a variety of anatomic structures from images. Expert annotations from contrast images were transferred to registered non-contrast images sets to train the model to segment the same structures on non-contrast image sets. During model validation, the Dice score for LV myocardium, LV, left atrium (LA), right ventricle (RV) and right atrium (RA) ranged from 0.95–0.97²¹. Three-dimensional segmentations for one patient are shown in Supplemental Figure 10. LV myocardial volume was used to calculate LV mass, using a density factor of 1.055³⁸. Abnormal LV mass was defined as volume >97.5th percentile using sex-specific normal limits indexed to body surface area (>80 g/m² for men, >65 g/m² for women)³⁹. Similarly, we defined abnormal cardiac volumes for women (LV volume > 147 mL, RV volume > 180 mL, LA volume > 99 mL and RA volume >126 mL) and men (LV volume > 195 mL, RV volume > 240 mL, LA volume > 121 mL and RA volume > 162 mL) based on >97.5th percentile of normal volumes using sex-specific normal limits³⁹. A comparison of patient classifications at baseline compared to follow-up imaging at 1 year is shown in Supplemental Table 16. Lastly, we quantified the major axis (length) and the two minor axis measurements for the LV volume segmentations. Shape index was calculated as the ratio of the maximal minor axis dimension to the major axis dimension, similar to the method applied in myocardial perfusion imaging⁴⁰. Eccentricity index was calculated as: 1-(minor axis*minor axis/length²). Lower values for shape index signify relative elongation of the LV, while higher values are seen in more spherical remodeling patterns.

Statistical analysis

Continuous variables were summarized as mean (standard deviation [SD]) if normally distributed and compared using a Student’s t-test. Continuous variables that were not normally distributed were summarized as median (interquartile range [IQR]) and compared using a Mann-Whitney U-test or Kruskal–Wallis test. Agreement between estimates from low-dose, ungated CT and measurements from contrast-enhanced, ECG-gated, cardiac CT scans, and agreement between estimates from baseline and 1 year CT scans was assessed with Spearman’s correlation and visualized with Bland-Altman plots.

Associations with all-cause mortality were assessed with univariable and multivariable Cox proportional hazards analyses. The multivariable model included the variables of interest (LA, LV, RA, and RV volume, LV mass, shape index, eccentricity index, and CAC) as well as potential confounders including age⁴¹, sex⁴¹, history of COPD⁴², diabetes⁴³, hypertension⁴⁴, heart disease⁴⁵, and stroke⁴⁶ as well as smoking history⁴⁷. The suspected relationships between these variables are outlined in Supplemental Figure 11. Associations with cardiovascular mortality were evaluated with Fine-Gray competing risk analyses, with non-cardiovascular mortality as a competing risk. For the EISNER and low-dose CT populations there were insufficient events to simultaneously evaluate all variables, so multivariable models were created using stepwise backward elimination. In the NLST population, associations were separately assessed in patients with and without a history of heart disease as well as in patients without radiologist identified cardiovascular abnormalities. Lastly, we evaluated for differences in the associations with clinical outcomes according to tube voltage and slice thickness. These analyses were limited to unadjusted analyses due to a low number of events in some groups, resulting in wide confidence intervals. However, we also assessed for differences in associations between tube voltage and slice thickness categories using interaction analyses. The proportional hazards assumption was evaluated with Schoenfeld residuals⁴⁸, and found to be valid in all analyses.

In the NLST population, we evaluated prediction performance, using area under the receiver operating characteristic curve (AUC), for all-cause mortality and cardiovascular mortality of CAC, cardiac volume, shape index and a combination of the three measures (from regression models including log CAC as a continuous measure, abnormal cardiac volume as a categorical variable, and shape index as a continuous variable). We also evaluated AUC for a clinical model (age, sex, smoking history and medical history [hypertension, diabetes, heart disease, COPD, stroke]), an imaging model with DL-derived variables (CAC, cardiac volumes, shape and eccentricity index, and LV mass), and a combined model incorporating all variables. Variables were combined using logistic regression. Categorical net reclassification index (NRI) was used to assess the additive prognostic utility of DL CAC, cardiac volumes, shape index, and the combined model⁴⁹. NRI was calculated when added to either radiologist identification of cardiovascular abnormality or all other components of the multivariable model including age, sex, smoking history, and past medical history.

All statistical tests were two-sided, and a p-value < 0.05 was considered statistically significant. All analyses were performed using Stata/IC version 13.1 (StataCorp, College Station, Texas, USA) and R (version 4.1.2) including the “DAGitty” package⁵⁰.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All derived data supporting the findings of this study are available within the paper, in the supplementary information file, and in the source data file. Original data from the NLST can be requested through the National Cancer Institute. Restricted access for the deidentified EISNER, and low-dose CT populations can be obtained via requests to the corresponding author Dr. Piotr Slomka (Piotr.Slomka@cshs.org). Requests should include the name and contact details of the person requesting the data, which data and clinical variables are requested and the purpose of requesting the data. Requests will be subject to consideration by the steering committees of the cohorts and the investigational review board of Cedars-Sinai Medical Center and investigational review boards from other centers if applicable. Time frame for a response will be within 3 months. Data requests under agreement will be considered for the purpose of reproducing the data and subject to appropriate confidentiality obligations and restrictions. Source data are provided with this paper.

Code availability

The TotalSegmentator code is publicly available²¹ at https://github.com/wasserth/TotalSegmentator, and the cLSTM code is available at https://doi.org/10.5281/zenodo.10632288⁵¹.

References

Gulati, M. et al. 2021 AHA/ACC/ASE/CHEST/SAEM/SCCT/SCMR guideline for the evaluation and diagnosis of chest pain: a report of the American college of cardiology/american heart association joint committee on clinical practice guidelines. Circulation 144, e368–e454 (2021).
PubMed Google Scholar
Knuuti, J. et al. 2019 ESC guidelines for the diagnosis and management of chronic coronary syndromes: the task force for the diagnosis and management of chronic coronary syndromes of the European society of cardiology (ESC). Eur. Heart J. 41, 407–477 (2020).
Article PubMed Google Scholar
Agatston, A. S. et al. Quantification of coronary artery calcium using ultrafast computed tomography. J. Am. Coll. Cardiol. 15, 827–832 (1990).
Article CAS PubMed Google Scholar
Budoff, M. J., Blankstein, R., Nasir, K. & Blaha, M. J. Power of zero stronger than “soft” plaque. J. Cardiovasc. Comp. Tomogr. 14, 279 (2020).
Article Google Scholar
Margolis, J. R. et al. The diagnostic and prognostic significance of coronary artery calcification. a report of 800 cases. Radiology 137, 609–616 (1980).
Article CAS PubMed Google Scholar
Greenland, P., Blaha, M. J., Budoff, M. J., Erbel, R. & Watson, K. E. Coronary calcium score and cardiovascular risk. J. Am. Coll. Cardiol. 72, 434–447 (2018).
Article CAS PubMed PubMed Central Google Scholar
Simons, D. B. et al. Noninvasive definition of anatomic coronary artery disease by ultrafast computed tomographic scanning: a quantitative pathologic comparison study. J. Am. Coll. Cardiol. 20, 1118–1126 (1992).
Article CAS PubMed Google Scholar
Blaha, M. J., Blankstein, R. & Nasir, K. Coronary artery calcium scores of zero and establishing the concept of negative risk factors. J. Am. Coll. Cardiol. 74, 12–14 (2019).
Article PubMed Google Scholar
Budoff, M. J. et al. Ten-year association of coronary artery calcium with atherosclerotic cardiovascular disease (ASCVD) events: the multi-ethnic study of atherosclerosis (MESA). Eur. heart J. 39, 2401–2408 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mitchell, J. D., Paisley, R., Moon, P., Novak, E. & Villines, T. C. Coronary artery calcium and long-term risk of death, myocardial infarction, and stroke: the walter reed cohort study. JACC Cardiovasc Imaging 11, 1799–1806 (2018).
Article PubMed Google Scholar
Rozanski, A. et al. Impact of coronary artery calcium scanning on coronary risk factors and downstream testing. J. Am. Coll. Cardiol. 57, 1622–1632 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gupta, A. et al. The identification of calcified coronary plaque is associated with initiation and continuation of pharmacological and lifestyle preventive therapies: a systematic review and meta-analysis. JACC Cardiovasc. Imaging. 10, 833–842 (2017).
Article PubMed PubMed Central Google Scholar
Arsanjani, R. et al. Left ventricular function and volume with coronary CT angiography improves risk stratification and identification of patients at risk for incident mortality: results from 7758 patients in the prospective multinational CONFIRM observational cohort study. Radiology 273, 70–77 (2014).
Article PubMed Google Scholar
Kawel-Boehm, N. et al. Left ventricular mass at MRI and long-term risk of cardiovascular events: the multi-ethnic study of atherosclerosis (MESA). Radiology 293, 107–114 (2019).
Article PubMed Google Scholar
Berrington de González, A. et al. Projected cancer risks from computed tomographic scans performed in the United States in 2007. Arch. Int. Med. 169, 2071–2077 (2009).
Article Google Scholar
Hecht, H. S. et al. 2016 SCCT/STR guidelines for coronary artery calcium scoring of noncontrast noncardiac chest CT scans: a report of the Society of cardiovascular computed tomography and society of thoracic radiology. J. cardiovasc. comp. Tomogr. 11, 74–84 (2017).
Article Google Scholar
Budoff, M. J. et al. Coronary artery and thoracic calcium on noncontrast thoracic CT scans: comparison of ungated and gated examinations in patients from the COPD gene cohort. J. Cardiovasc. Comp. Tomogr. 5, 113–118 (2011).
Article Google Scholar
de Vos, B. D., Lessmann, N., de Jong, P. A. & Išgum, I. Deep learning-quantified calcium scores for automatic cardiovascular mortality prediction at lung screening low-dose CT. Radio. Cardiothorac. Imaging 3, e190219–e190219 (2021).
Article Google Scholar
van Velzen, S. G. M. et al. Deep learning for automatic calcium scoring in CT: validation using multiple cardiac CT and chest CT protocols. Radiology 295, 66–79 (2020).
Article PubMed Google Scholar
Zeleznik, R. et al. Deep convolutional neural networks to predict cardiovascular risk from computed tomography. Nat. Commun. 12, 715 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Wasserthal, J. et al. TotalSegmentator: robust segmentation of 104 anatomical structures in CT images. Radiol. Artif. Intell. 5, 1–9 (2023).
Miller, R. J., et al. Deep learning coronary artery calcium scores from SPECT/CT attenuation maps improves prediction of major adverse cardiac events. J. Nucl. Med. 64, 652–658 (2023).
Pieszko, K., et al. Deep learning of coronary calcium scores from PET/CT attenuation maps accurately predicts adverse cardiovascular events. JACC Cardiovasc. Imaging. 16, 675–687 (2023).
Pieszko, K. et al. Reproducibility of quantitative coronary calcium scoring from PET/CT attenuation maps: comparison to ECG-gated CT scans. Eur. J. Nucl. Med Mol. Imaging. 49, 4122–4132 (2022).
Article CAS PubMed PubMed Central Google Scholar
Smith-Bindman, R. et al. Trends in use of medical Imaging in US health care systems and in Ontario, Canada, 2000-2016. JAMA 322, 843–856 (2019).
Article PubMed PubMed Central Google Scholar
Dempster, E., Cartlidge, T., Rofe, R. & Neary, P. Clinical implications of incidental cardiac findings on non-cardiac CT thorax. Clin. Radiol. 75, e2 (2020).
Article Google Scholar
Sandhu, A. T. et al. Incidental coronary artery calcium: opportunistic screening of previous nongated chest computed tomography scans to improve statin rates (NOTIFY-1 Project). Circulation 147, 703–714 (2023).
Article PubMed Google Scholar
Pieszko, K., et al. Calcium scoring in low-dose ungated chest CT scans using convolutional long-short term memory networks. Proc. SPIE Int. Soc. Opt. 12032 https://doi.org/10.1117/12.2613147 (2022).
Klein, R., Ametepe, E. S., Yam, Y., Dwivedi, G. & Chow, B. J. Cardiac CT assessment of left ventricular mass in mid-diastasis and its prognostic value. Eur. Heart J. Cardiovasc. Imaging .18, 95–102 (2016).
Article PubMed Google Scholar
Abdi-Ali, A. et al. LV mass independently predicts mortality and need for future revascularization in patients undergoing diagnostic coronary angiography. JACC Cardiovasc. Imaging. 11, 423–433 (2018).
Article PubMed Google Scholar
Chiles, C. et al. Association of coronary artery calcification and mortality in the national lung screening trial: a comparison of three scoring methods. Radiology 276, 82–90 (2015).
Article PubMed Google Scholar
Perneger, T. V. What’s wrong with bonferroni adjustments. BMJ 316, 1236–1238 (1998).
Article CAS PubMed PubMed Central Google Scholar
National Lung Screening Trial Research, T. et al. The national lung screening trial: overview and study design. Radiology 258, 243–253 (2011).
Article Google Scholar
Rozanski, A. et al. Impact of coronary artery calcium scanning on coronary risk factors and downstream testing the EISNER (Early Identification of subclinical atherosclerosis by noninvasive imaging research) prospective randomized trial. J. Am. Coll. Cardiol. 57, 1622–1632 (2011).
Article CAS PubMed PubMed Central Google Scholar
Hicks, K. A. et al. 2017 cardiovascular and stroke endpoint definitions for clinical trials. Circulation 137, 961–972 (2018).
Article PubMed Google Scholar
Southern, D. A. et al. An administrative data merging solution for dealing with missing data in a clinical registry: adaptation from ICD-9 to ICD-10. BMC Med. Res. Methodol. 8, 1 (2008).
Article PubMed PubMed Central Google Scholar
Isensee, F., Jaeger, P. F., Kohl, S. A. A., Petersen, J. & Maier-Hein, K. H. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 18, 203–211 (2021).
Article CAS PubMed Google Scholar
Gheorghe, A. G. et al. Cardiac left ventricular myocardial tissue density, evaluated by computed tomography and autopsy. BMC Med. Imaging. 19, 29 (2019).
Article PubMed PubMed Central Google Scholar
Fuchs, A. et al. Normal values of left ventricular mass and cardiac chamber volumes assessed by 320-detector computed tomography angiography in the copenhagen general population study. Eur. Heart J. Cardiovasc. Imaging. 17, 1009–1017 (2016).
Article PubMed Google Scholar
Miller, R. J. H. et al. Quantitation of poststress change in ventricular morphology improves risk stratification. J. Nucl. Med. 62, 1582–1590 (2021).
Article PubMed PubMed Central Google Scholar
Mikkola, T. S., Gissler, M., Merikukka, M., Tuomikoski, P. & Ylikorkala, O. Sex differences in age-related cardiovascular mortality. PloS ONE 8, e63347 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Andre, S. et al. COPD and cardiovascular disease. Pulmonology 25, 168–176 (2019).
Article CAS PubMed Google Scholar
Tancredi, M. et al. Excess mortality among persons with type 2 diabetes. N. Engl. J. Med. 373, 1720–1732 (2015).
Article CAS PubMed Google Scholar
Fuchs, F. D. & Whelton, P. K. High blood pressure and cardiovascular disease. Hypertension 75, 285–292 (2020).
Article CAS PubMed Google Scholar
Miller, R. J. H. et al. Prognostic significance of previous myocardial infarction and previous revascularization in patients undergoing SPECT MPI. Int J. Cardiol. 313, 9–15 (2020).
Article PubMed Google Scholar
Rincon, F. et al. Stroke location and association with fatal cardiac outcomes: northern manhattan Study (NOMAS). Stroke 39, 2425–2431 (2008).
Article PubMed PubMed Central Google Scholar
Gallucci, G., Tartarone, A., Lerose, R., Lalinga, A. V. & Capobianco, A. M. Cardiovascular risk of smoking and benefits of smoking cessation. J. Thorac. Dis. 12, 3866–3876 (2020).
Article PubMed PubMed Central Google Scholar
Lee, M. & Han, J. Statistical methods and models in the analysis of time to event data. Ann. Transl. Med. 8, 73 (2020).
Article PubMed PubMed Central Google Scholar
Pencina, M. J., D’Agostino, R. B., D’Agostino, R. B. & Vasan, R. S. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat. Med. 27, 157–172 (2008).
Article MathSciNet PubMed Google Scholar
Textor, J., van der Zander, B., Gilthorpe, M. S., Liskiewicz, M. & Ellison, G. T. Robust causal inference using directed acyclic graphs: the R package ‘dagitty’. Int J. Epidemiol. 45, 1887–1894 (2016).
PubMed Google Scholar
Killekar, A., Shanbhag, A. & Slomka Piotr, J. Predicting mortality from AI-based cardiac volumes, mass, and coronary calcium on chest CT. Zenodo https://doi.org/10.3390/diagnostics14020125 (2024).

Download references

Acknowledgements

This research was supported in part by grant R35HL161195 from the National Heart, Lung, and Blood Institute/ National Institutes of Health (NHLBI/NIH) (PI: PS) as well as R01EB034586 from the National Institute of Biomedical Imaging and Bioengineering (PI: PS). The authors thank the National Cancer Institute for access to NCI’s data collected by the National Lung Screening Trial (NLST) accessed under project number NLST-981. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

These authors contributed equally: Robert J. H. Miller, Aditya Killekar.

Authors and Affiliations

Departments of Medicine (Division of Artificial Intelligence in Medicine), Imaging and Biomedical Sciences Cedars-Sinai Medical Center, Los Angeles, CA, USA
Robert J. H. Miller, Aditya Killekar, Aakash Shanbhag, Bryan Bednarski, Anna M. Michalowska, Mark Lemley, Konrad Pieszko, Serge D. Van Kriekinge, Paul B. Kavanagh, Joanna X. Liang, Cathleen Huang, Damini Dey, Daniel S. Berman & Piotr J. Slomka
Department of Cardiac Sciences, University of Calgary, Calgary, AB, Canada
Robert J. H. Miller
Division of Cardiology, University of Ottawa Heart Institute, Ottawa, Ontario, Canada
Terrence D. Ruddy
Division of Cardiology, Department of Medicine, Columbia University Irving Medical Center and New York-Presbyterian Hospital, New York, New York, NY, USA
Andrew J. Einstein
Department of Radiology, Columbia University Irving Medical Center and New York-Presbyterian Hospital, New York, New York, NY, USA
Andrew J. Einstein
British Heart Foundation Centre for Cardiovascular Science, University of Edinburgh, Edinburgh, UK
David E. Newby
Department of Interventional Cardiology and Cardiac Surgery, University of Zielona Gora, Gora, Poland
Konrad Pieszko

Authors

Robert J. H. Miller
View author publications
You can also search for this author in PubMed Google Scholar
Aditya Killekar
View author publications
You can also search for this author in PubMed Google Scholar
Aakash Shanbhag
View author publications
You can also search for this author in PubMed Google Scholar
Bryan Bednarski
View author publications
You can also search for this author in PubMed Google Scholar
Anna M. Michalowska
View author publications
You can also search for this author in PubMed Google Scholar
Terrence D. Ruddy
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Einstein
View author publications
You can also search for this author in PubMed Google Scholar
David E. Newby
View author publications
You can also search for this author in PubMed Google Scholar
Mark Lemley
View author publications
You can also search for this author in PubMed Google Scholar
Konrad Pieszko
View author publications
You can also search for this author in PubMed Google Scholar
Serge D. Van Kriekinge
View author publications
You can also search for this author in PubMed Google Scholar
Paul B. Kavanagh
View author publications
You can also search for this author in PubMed Google Scholar
Joanna X. Liang
View author publications
You can also search for this author in PubMed Google Scholar
Cathleen Huang
View author publications
You can also search for this author in PubMed Google Scholar
Damini Dey
View author publications
You can also search for this author in PubMed Google Scholar
Daniel S. Berman
View author publications
You can also search for this author in PubMed Google Scholar
Piotr J. Slomka
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.J.H.M. participated in study design, data analysis, manuscript drafting and manuscript revisions. A.K. participated in study design, data analysis, and manuscript revisions. A.S. participated in study design, data analysis, and manuscript revisions. B.B. participated in study design, data analysis, and manuscript revisions. A.M.M. participated in data analysis, and manuscript revisions. T.D.R. participated in study design, data collection, and manuscript revisions. A.J.E. participated in study design, data collection, and manuscript revisions. D.E.N. participated in study design, data collection, and manuscript revisions. M.L. participated in data analysis, and manuscript revisions. K.P. participated in study design, data analysis, and manuscript revisions. S.D.V.K. participated in data analysis and manuscript revisions. P.B.K. participated in data analysis, and manuscript revisions. J.X.L. participated in study design and manuscript revisions. C.H. participated in study design and manuscript revisions. D.D. participated in study design, and manuscript revisions. D.S.B participated in study design and manuscript revisions. P.J.S. participated in study design, data analysis, and manuscript revisions.

Corresponding author

Correspondence to Piotr J. Slomka.

Ethics declarations

Competing interests

R.J.H.M. has received consulting fees and research support from Pfizer. D.S.B. and Slomka and P.B.K. participate in software royalties for QPS software at Cedars-Sinai Medical Center. P.J.S. has received research grant support from Siemens Medical Systems and has served as a consultant for Synektik. D.S.B. has served as a consultant for GE Healthcare. A.M.M. is supported by a research scholarship from the Polish National Agency for Academic Exchange. A.J.E. reports receiving a speaker’s fee from Ionetix, consulting fees from W. L. Gore & Associates, authorship fees from Wolters Kluwer Healthcare—UpToDate, and serving on a scientific advisory board for Canon Medical Systems USA; his institution has grants/grants pending from Attralus, Bruker, Canon Medical Systems USA, Eidos Therapeutics, GE HealthCare, Intellia Therapeutics, Ionis Pharmaceuticals, Neovasc, Pfizer, Roche Medical Systems, and W. L. Gore & Associates. T.D.R. has received research grant support from GE HealthCare and Pfizer. D.E.N. reports a grant from the British Heart Foundation and Wellcome Trust. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Miller, R.J.H., Killekar, A., Shanbhag, A. et al. Predicting mortality from AI cardiac volumes mass and coronary calcium on chest computed tomography. Nat Commun 15, 2747 (2024). https://doi.org/10.1038/s41467-024-46977-3

Download citation

Received: 25 August 2023
Accepted: 12 March 2024
Published: 29 March 2024
DOI: https://doi.org/10.1038/s41467-024-46977-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.