Introduction

Individuals eligible for lung cancer screening have a high risk of dying from cardiovascular (CV) causes, even higher than lung cancer1,2,3. While low-dose screening chest computed tomography (CT) is recommended to detect lung cancer, the heart is also imaged but not the focus of the evaluation. Thus, screening CTs offer a unique opportunity to assess imaging markers to predict CV disease (CVD) risk without additional radiation or cost.

Although prior studies have shown a relationship between lung screening CT-derived coronary artery calcium (CAC) and mortality4, data on the prognostic value of other imaging features is limited. Epicardial adipose tissue (EAT) is a metabolically active organ directly adjacent to coronary arteries5,6 that has paracrine proatherogenic effects on coronary artery walls7,8. A growing body of evidence links increased EAT volume and density to coronary artery disease (CAD) and adverse CV events in asymptomatic9,10,11, stable chest pain12, and acute chest pain13,14 individuals. Most prognostic data is based on manual, time-consuming EAT measurements using specialized electrocardiogram (ECG)-synchronized cardiac CT scans. The prognostic value of automated EAT assessment from non-gated low-dose chest CTs is unknown but of substantial clinical relevance, given the high number of individuals undergoing lung cancer screening and their increased CVD risk. Hence, an automated EAT assessment has the potential of becoming a new, low-cost, low-effort, and high-yield tool to identify individuals at increased CVD risk, guide their primary prevention, and improve their CV health.

The primary objective of this study was to investigate whether an automated deep-learning algorithm to measure EAT volume and density on non-ECG-gated low-dose lung screening chest CTs helps predicting all-cause and CV mortality in a high-risk population of heavy smokers. This study shows that automatically assessed EAT volume and density relate to all-cause and CV mortality, independent of traditional CV risk factors, including level of education, body habitus, and CAC score.

Methods

Study setting and participants

Our study retrospectively analyzed prospectively acquired data of 24,090 individuals enrolled in the CT arm of the National Lung Screening Trial (NLST; ClinicalTrials.gov identifier: NCT00047385), a community-based randomized controlled trial of chest X-rays vs. low-dose chest CT imaging for lung cancer screening1,15. The NLST trial included current/recent (smoking cessation within the past 15 years) and former heavy smokers with a 30-pack-year or more smoking history aged 55 to 74 years from 33 U.S. sites between August 2002 and April 200415. The study size was driven by the available NLST data and included all participants randomized to the CT arm. Participants without available or interpretable CTs were excluded. Flow diagram Supplementary Fig. 1 provides specific study cohort details. All individuals were prospectively followed over 12 years for incident adverse events, including all-cause and CV mortality (fatal myocardial infarction or stroke)1. In addition, self-reported sex, race, and ethnicity were acquired as a part of the parent trial.

The secondary use of NLST data was approved by the National Cancer Institute (Bethesda, Maryland) and the local Mass General Brigham (Boston, MA) institutional review board (IRB#: 2017P002844). All participants provided written consent to the parent trial, and informed consent was waived for this retrospective study.

CT image acquisition and coronary artery calcium assessment

All scans were performed on multi-detector (≥4) CT scanners using standard non-ECG-synchronized, non-contrast, low-dose chest CT protocols15. Each participant’s first chest CT was considered. For 13,996/24,090 (58%) participants, CAC scores (Agatston score), calculated from a deep learning algorithm, were available as previously published4. The continuous CAC score was divided into four clinically relevant categories: CAC = 0; CAC = 1–100, CAC = 101–300, and CAC > 30016.

Epicardial adipose tissue measurement

We measured EAT volume (cm3) and density (Hounsfield Units, HU) using a dedicated deep-learning algorithm in the non-contrast non-gated NLST CT data. EAT was defined as fat tissue inside the pericardial sac14,17 (Fig. 1). Since body fat depots strongly correlated with obesity and body habitus, we indexed the absolute EAT volume by body surface area (BSA; m2)18 and used the BSA-indexed EAT volume (cm3/m2) for all analyses. Furthermore, we adjusted all multivariable models for body mass index (BMI). EAT density was defined as the average attenuation of all segmented EAT voxels within the pericardial sac.

Fig. 1: Overview of the deep-learning framework and EAT segmentation steps implemented on lung cancer screening CT.
figure 1

EAT is segmented from non-contrast low-dose chest CTs by localizing the heart, identifying the pericardium (red line), and rendering the fatty components (blue) within the pericardial sac. EAT was rendered using fixed attenuation thresholds of −190 to −30 HU. CT computed tomography, EAT epicardial adipose tissue, HU Hounsfield Units. Created with BioRender.com.

Deep-learning algorithm for epicardial adipose tissue segmentation

We developed and open-sourced a deep learning system for fully automated EAT segmentation in cardiac ECG-gated and non-ECG-gated chest CT utilizing data from the large Framingham Heart Study (FHS)19, Prospective Multicenter Imaging Study for Evaluation of Chest Pain (PROMISE)20, and NLST1 cohorts; this algorithm was used in a prior PROMISE analysis12. The deep-learning algorithm comprised three stages: (1) heart localization, identifying and isolating/cropping the heart region from the input chest CT scans; (2) heart segmentation by identifying the pericardial sac; (3) EAT rendering within the pericardial sac (Fig. 1). EAT was defined as all voxels within the segmented pericardial sac and attenuation between −190 to −30 HU21.

Training and testing data sets

A dataset of 2164 randomly selected CT scans from FHS (n = 628), PROMISE (n = 1,140), and NLST (n = 396) was used to develop the system. Four experienced CV radiologists provided standard manual segmentations for all 2164 cases.

We used 858 cases (FHS, n = 628; PROMISE, n = 130; NLST, n = 100) for training and tuning of the algorithm, while the remaining 1306 cases (PROMISE, n = 1010; NLST, n = 296) were reserved for testing.

Deep-learning algorithm development

The heart localization step was framed as a coarse segmentation problem on down-sampled scans and segmentation masks. We trained the heart localization model on resampled 3 mm isotropic images cropped to 112 × 112 × 112 voxel cubes. The localization models predicted heart masks were used to crop the hearts by resampling them to the original voxel spacing, calculating the center of the bounding box around the mask, and finally cropping a 384 × 384 × 80 voxel cube around it. We added an 11-voxel safety margin to ensure that the whole heart is captured in the cropped region.

A heart segmentation model was then trained on the cropped scans from the previous stage, with data resampled to the voxel spacing of 2.0 × 2.0 × 2.5 mm and cropped to 128 × 128 × 80 voxel cubes to fit the graphics processing unit (GPU) memory. Compared to the localization model, the chosen higher spatial resolution allowed for precise segmentation and improved performance.

Finally, to obtain EAT segmentations, the system automatically rendered voxels between −190 and −30 HU from the original high-resolution image in the area of the heart mask generated by the segmentation model. The same rendering step is used in manual EAT segmentation.

Both heart localization and segmentation stages used separately trained 3D U-Net architectures22 with four down-sampling steps and a dropout rate of 0.5. To reduce the network’s memory requirements, we introduced a feature reduction step in the bottleneck part of the architecture. We trained the models using Dice loss function23 and Adam optimizer24 with an initial learning rate of 0.0001 that we reduced by 50% (localization) or 70% (segmentation) every 100 epochs. The batch size was 3 (localization) or 4 (segmentation). Data augmentation strategies differed for the two stages. We applied a random ±10 voxels axial plane translation and ±4 degrees rotation in any axis for the localization model. We used a random ±20 voxels axial plane translation and ±35 degrees rotation in any axis for the segmentation model. Both networks were trained for 1200 epochs. While the localization stage model was trained on a single 70/30 training and tuning set split, the segmentation model used a variation of cross-validation to take advantage of all the data without overfitting. We achieved that by reselecting the training (70%) and tuning (30%), which were split randomly every 100 epochs throughout the training.

Development and evaluation were done on a Linux workstation equipped with one Nvidia RTX A6000 GPU using Tensorflow-GPU (v1.14), Keras (v2.3.1), and NVIDIA CUDA (v10.2). Using this setup, the system needed less than 2 s on average to segment EAT in a given scan. This system is the latest version of our heart segmentation system developed by our group4,12,25,26. Unlike the previous versions that required four GPUs, the newest algorithm can be run on single GPU or CPU-only systems.

Deep-learning algorithm testing

To determine the algorithm’s accuracy, we compared 1306 automatically vs. manually segmented hearts in an independent data set using the original high-resolution CT images. Our system demonstrated a strong performance with a median Dice coefficient of 0.95 (Interquartile Range, IQR = 0.02; Spearman’s correlation of 0.96; p < 0.0001), indicating a high degree of overlap between the automated and manual heart segmentations. The median surface distance was 1.6 mm (IQR = 0.71), suggesting a close match between the automated and manually segmented pericardial sac. Supplementary Fig. 2 provides images of optimally and suboptimally segmented hearts.

Statistical methods

Continuous variables were reported as mean ± SD or median (IQR), and categorical variables as absolute and relative frequencies. Clinical characteristics were stratified by survival/all-cause death, and the differences were tested with the log-rank test. We also provide EAT volume and density values across categories of clinical features, testing the differences by the Kruskal–Wallis test. Spearman’s correlation was used to calculate the correlation between EAT volume and density.

Uni- and multivariable Cox proportional hazard regression models were used to calculate Hazard ratios (HR) and 95% confidence intervals (CI) to evaluate associations between the continuous EAT measures and incident all-cause and CV death. Multivariable models were adjusted for age, sex, race, ethnicity, smoking (former vs. current), number of pack years, history of heart disease, history of stroke, hypertension, diabetes, education status, BMI, and CAC score. Furthermore, we included EAT volume and density in all main multivariable regressions to test their independent prognostic value. We also performed sensitivity analyses in participants (A) without known CAD, heart failure, or stroke at baseline and (B) in participants without CAC at baseline. Cumulative event rates between EAT volume and density quartiles were compared using the Kaplan-Meier method and tested for significance using log-rank tests. We also performed C-statistic tests to assess the predictive power of the EAT measures and compared the discriminatory capacity between models using the Likelihood-ratio test for nested models. We added a cNRI analysis to assess the models’ incremental value using the Stata NRI package by Lunt27,28. We used the continuous NRI as recommended by Pencina et al.27,28, as continuous NRI offers the broadest and most standardized application and is not affected by different event rates and should thus be used when comparing NRIs across studies. Stata 17.1 (College Station, TX, USA) was used for all analyses, and two-sided P Values of <0.05 were considered statistically significant.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Results

Study population

The study analyzed 24,090 NLST participants with an average age of 61 ± 5 years and 59% (14,204/24,090) of them being men. Over a median follow-up time of 12.3 (11.9–12.8) years, 19.5% (4,690/24,090) of the participants died, with 23.2% (1,089/4,690) CV deaths. Participants who died were generally older, more likely males, African American, Non-Hispanic, active smokers with more pack years and had a history of CVD (i.e., CAD, heart failure, or stroke), more CV risk factors, higher CAC score, slightly lower BMI, and lower level of education. Table 1 details key baseline demographics and clinical characteristics.

Table 1 Baseline demographic and clinical characteristics stratified by all-cause death

Association of epicardial adipose tissue with CV risk factors and CAC

The BSA-indexed mean EAT volume was 70.3 ± 24.6 cm3/m2. It increased with age, was higher in men vs. women, and was substantially higher in Asians, followed by Caucasians and African Americans. EAT volume also increased with rising BMI, was higher in former vs. current smokers, and increased with the number of pack years. Also, participants with prior heart disease or stroke, lower level of education, or higher CAC scores presented high EAT volumes (P ≤ 0.007 for all; Supplementary Data 1). The association between EAT volume and CAC (ln-transformed CAC due to right skewness of the CAC score) remained significant in a multivariable regression analysis adjusting for age, sex, race, ethnicity, smoking status (current vs. former), number of pack years, hx of myocardial infarction, hx of stroke, diabetes mellitus, hypertension, education status, and BMI (beta per 10 cm3/m2: 0.04, 95%CI: 0.02–0.05, P < 0.001).

The mean EAT density was −77.7 ± 5.2 HU and decreased with increasing EAT volume, reflected in a strong negative correlation (Spearman’s rho: −0.72; P < 0.001; Fig. 2). Accordingly, the associations between EAT density and clinical characteristics were similar but inverse compared to EAT volume (Supplementary Data 1).

Fig. 2: Association between EAT volume and density.
figure 2

The scatterplot shows a strong negative correlation between EAT volume and density, as shown by the fitted red regression line. Blue dots represent individual observations. EAT = epicardial adipose tissue, HU = Hounsfield Units.

Association of epicardial adipose tissue with mortality

Epicardial adipose tissue volume

EAT volume was approximately 11% higher in participants who died (76.0 ± 28.0 vs. 69.0 ± 23.5 cm3/m2; P < 0.001, Table 1), and the mortality rates increased with rising EAT volume (quartiles of EAT volume—all-cause mortality: Q1 ( < 53 cm3/m2): 16.3%, Q2 (53–67 cm3/m2): 16.0%, Q3 (67–85 cm3/m2): 19.6%, Q4 (>85 cm3/m2): 25.9%; CV mortality: Q1: 3.7%, Q2: 3.3%, Q3: 4.7%, Q4: 6.4%; log-rank: P < 0.001 for both, Supplementary Table 1). Figure 3 provides the corresponding survival curves.

Fig. 3: Kaplan-Meier survival analysis of EAT volume and density for all-cause and cardiovascular mortality.
figure 3

A two-sided log-rank test was used to calculate the P values. EAT = epicardial adipose tissue, HU = Hounsfield Units.

In the Cox regression analysis, an EAT volume increase by 10 cm3/m2 was associated with 19% and 27% higher hazards of all-cause and CV mortality, respectively, independent of EAT density (all-cause mortality: HR = 1.19; 95%CI: 1.17–1.21; P < 0.001, and CV mortality: HR = 1.27; 95%CI: 1.23–1.30; P < 0.001). This association remained significant after further adjustment for age, sex, race, ethnicity, smoking status (current vs. former), number of pack years, history of CV disease, diabetes, hypertension, education, BMI, and categorical CAC score (HR = 1.10; 95%CI: 1.08–1.13; P < 0.001 and HR = 1.14; 95%CI: 1.10–1.19; P < 0.001, respectively) (Table 2).

Table 2 Association of EAT volume and density with all-cause and cardiovascular mortality

Epicardial adipose tissue volume in participants without known heart disease or CAC at baseline

The association between EAT volume and events was significant in a subgroup of 20,454 participants without known CVD at baseline (i.e., primary prevention cohort without prior CAD, heart failure, or stroke) in the fully adjusted model (all-cause mortality: HR = 1.11; 95%CI: 1.08–1.14; P < 0.001, CV mortality: HR = 1.14; 95%CI: 1.08–1.20; P < 0.001, Supplementary Table 2).

In a subgroup of participants without CAC (3,355/13,966, 24.0%), EAT volume was also related to all-cause and CV mortality in the fully adjusted analyses (all-cause mortality: HR = 1.10; 95%CI: 1.03–1.17; P = 0.003, CV mortality: HR = 1.21; 95%CI: 1.05–1.40; P = 0.01; Supplementary Table 3).

Epicardial adipose tissue density

Overall, EAT density was −77.7 ± 5.2 HU and slightly lower in those who died vs. survivors (−78.0 ± 5.5 vs. −77.6 ± 5.1 HU; P < 0.001), reflected in a negative univariable association between EAT density and adverse events (all-cause mortality (per 10 HU) HR: 0.87, 95%CI: 0.82–0.92, P < 0.001; CV mortality (per 10 HU) HR: 0.88, 95%CI: 0.79–0.99, p = 0.032). Nevertheless, in a multivariable regression analysis accounting for EAT volume, an increase of EAT density by 10 HU was associated with a 66% higher hazard of all-cause mortality and more than a twofold risk in CV mortality (all-cause mortality: HR = 1.66; 95%CI: 1.53–1.80; P < 0.001, CV mortality: HR = 2.14; 95%CI: 1.81–2.52; P < 0.001). This association remained significant after additional adjustment for age, sex, race, ethnicity, smoking status (current vs. former), number of pack years, history of CV disease, diabetes, hypertension, education status, BMI, and categorical CAC score (all-cause death: HR = 1.38; 95%CI: 1.24–1.54; P < 0.001, CV death: HR = 1.78; 95%CI: 1.42–2.22; P < 0.001; Table 2).

Epicardial adipose tissue density in those without prior heart disease or CAC at baseline

Supplemental subgroup analyses in participants without known CV disease revealed a strong and independent association between EAT density and events (Supplementary Table 2). In participants without CAC (n = 3,355), EAT density was associated with CV mortality in a model adjusted for EAT volume (HR = 2.05; 95%CI: 1.03–4.09; P = 0.042). In addition, there was a trend towards an association between EAT density and CV death in the fully adjusted model correcting for CV risk factors, education, EAT volume, and BMI (HR = 1.83; 95%CI: 0.90–3.71; P = 0.096; Supplementary Table 3). Of note, there were only 63 CV deaths in the relatively low-risk subgroup with CAC = 0, and the analysis may have been underpowered.

Incremental value of epicardial adipose tissue volume and density

The combination of clinical parameters with CAC and EAT was most predictive of CV mortality (Harrel’s C concordance: 0.78), while being less predictive of all-cause mortality (Harrel’s C concordance: 0.71). Adding EAT to clinical risk factors and clinical risk factors + CAC score increased the discriminatory capacity significantly from 0.70 to 0.71 (all-cause mortality) and 0.75 to 0.78 (CV-mortality) (P < 0.001 for all). The continuous net reclassification improvement (cNRI) test revealed that adding EAT measures to CV risk factors improved the prediction of all-cause (cNRI: 0.12; P < 0.001) and CV death (cNRI: 0.16; P < 0.001). We found the greatest improvement when adding both EAT and CAC to clinical CV risk factors with cNRI of 0.27 and 0.49 for all-cause and CV mortality, respectively (both P < 0.001) (Table 3).

Table 3 Incremental prognostic value of EAT volume and density

Discussion

People undergoing lung cancer screening are at high CV risk. However, up until now, the full potential of routine low-dose chest CTs has not been fully utilized to determine a person’s mortality risk. Our study demonstrates that an automated deep learning-based epicardial fat quantification accurately stratifies the risk for all-cause and CV mortality across 24,090 heavy smokers enrolled in the NLST. In addition, our results demonstrate that automated EAT assessment is a feasible and reliable way of predicting mortality independent of and incremental to persons’ demographics, clinical characteristics, CV risk factors, body habitus, and CAC score.

Heavy smokers are at increased CVD risk and need better risk stratification. Multiple large clinical trials have shown that low-dose chest CT is an effective screening method for lung cancer, leading to reduced lung cancer and all-cause mortality1,2,3,29,30. However, these trials also revealed that more screened individuals died of CV causes than cancer. For example, in NLST, 356 participants died of lung cancer over six years of follow-up, while 486 died of CV disease1. The recent Nederlands–Leuvens Longkanker Screenings Onderzoek (NELSON) study indicated the effectiveness of low-dose chest CT for lung cancer screening with a cancer-related mortality rate decrease but similarly high CV mortality compared to NLST, emphasizing the importance of CV risk estimation to reduce overall mortality2. CAC scoring can be measured on lung cancer screening CTs and reclassifies CV risk in heavy smokers4. However, a study of 3,110 individuals showed that despite significant CAC being a frequent finding (26%) on lung-screening CT, only a small portion (31%) of those with CAC had been diagnosed with CAD before the CT31. These results emphasize the unmet need for improved CV risk stratification and the opportunity to leverage CT images in this high-risk population better. Our results indicate that EAT provides prognostic value beyond CAC scoring and is associated with mortality even in people without CAC.

Lung cancer and CVD have overlapping clinical risk factors such as obesity, diabetes mellitus, hypertension, tobacco smoking, poor diet, and sedentary lifestyle32. Several of these risk factors, including smoking status, diabetes, hypertension, and obesity, were associated with higher EAT volume and density in our study. These findings suggest that changes in EAT might be part of a shared mechanistic pathway for both cancer and CVD.

While inflammation is a well-established pathway that drives cancer and atherogenesis32,33, EAT has been tied to local perivascular inflammation. EAT, a metabolically active visceral fat depot, encases the coronary arteries directly without a basal membrane functioning as a barrier. This arrangement facilitates a direct exchange of inflammatory mediators and promotes atherogenesis6,7. For instance, in a study of patients with critical CAD who received coronary artery bypass grafts, harvested EAT samples showed higher levels of proinflammatory IL-1β, IL-6, TNFα, and MCP-1 than paired subcutaneous fat controls7. EAT inflammation can be measured on CT, because inflamed EAT changes its morphology through increased lipolysis, inhibited lipogenesis, and increased perivascular edema, resulting in a lower lipid-to-connective tissue and water ratio and a higher EAT density on CT6,34. A recent substudy from the Early Identification of Subclinical Atherosclerosis by Non-invasive Imaging Research (EISNER) study reported correlations of EAT volume and density with systemic inflammatory biomarkers (e.g., CRP, IL-6, PAI-1, and MMP-9)10. Our findings support these mechanistic insights, indicating that inflammation, represented by EAT volume and density alterations, may hold prognostic significance in heavy smokers.

Our study corroborates growing evidence linking increased EAT volume and density with adverse events. For instance, a large meta-analysis including over 20,000 individuals (mainly from the FHS, Multi-Ethnic Study of Atherosclerosis, Heinz Nixdorf Recall Study, EISNER study, and the Rotterdam study) reported a strong relationship between EAT volume, CV risk factors, and CAD severity35. Furthermore, Rajani et al. reported an increased EAT volume in patients with high-risk plaque phenotype beyond traditional CV risk (Odds ratio 1.7, P = 0.04)36, results also seen in a large cohort of stable chest pain patients in the PROMISE study12. In our study, EAT was associated with adverse events incremental to CAC score and in a subgroup of participants without CAC or known CVD at baseline (i.e., primary prevention cohort). These results show that EAT, as an index of inflammation, may be a precursor of CAD rather than just an imaging marker of prevalent disease. To the best of our knowledge, our study is the first to show prognostic value of EAT volume and density in a large prospectively collected sample of heavy smokers, a group at markedly increased CV risk. Therefore, increased EAT might be a valuable new parameter for identifying high-risk patients in this vulnerable cohort requiring preventive care.

There is an unmet need for automated EAT measurement tools. EAT assessment is relatively new and usually requires ECG-synchronized cardiac CT images, specialized software, and manual measurement by expert readers. Consequently, EAT volume and density are not routinely reported on non-cardiac chest CTs. We address this issue and provide a robust and accurate automated algorithm to assess EAT volume and density. In addition, the algorithm offers EAT measurements in under 2 s without human input, making it an “end-to-end” solution for CV risk assessment in clinical settings. Although other research groups have developed deep learning algorithms for automated EAT quantification10,37,38,39, these groups used only dedicated ECG-gated cardiac CT scans, smaller cohorts, or proprietary technologies.

Our deep learning algorithm has several strengths. First, it has been developed in collaboration with a core laboratory that assessed EAT in a standardized fashion using data from multiple large, well-phenotyped cohorts and randomized controlled trials (e.g., FHS, PROMISE). Finally, we share the deep-learning system with the community and believe that sharing the algorithm will contribute to scientific advancement, foster collaboration, and accelerate the adoption of this new technology by academic and commercial entities.

We acknowledge several limitations of the current study. First, some standard CV risk factors (e.g., lipids) were not recorded in the NLST trial, and adjustment for the standard 10-year atherosclerotic CVD risk was not possible. However, most of the strongest predictors (e.g., age and sex) were available and included in the study among other variables influencing outcomes and EAT values (e.g., BMI). Second, nonfatal CV events (e.g., nonfatal myocardial infarction or stroke) were not recorded; hence, our study provides data only on all-cause and CV mortality. Third, the NLST cohort included only heavy smokers (≥30 pack years), and differences in EAT’s prognostic value could not be calculated between heavy smokers, non-smokers, and lower-risk smokers with ≥20–30 pack years, a group currently also recommended for lung cancer screening. Future work is necessary to understand the implications for non-smokers and lesser-smoking populations. Furthermore, CAC scores were available in a representative random sample (N = 13,966) of the NLST data set. Finally, most NLST participants were white non-Hispanics; thus, future studies in more diverse populations are needed.

Conclusions

EAT volume and density are independently associated with all-cause and CV mortality in the high-risk group of lung cancer screening participants. Deep-learning-based EAT assessment stratifies mortality risk beyond traditional CV risk factors and CAC score and, therefore, may guide primary CVD prevention in heavy smokers and improve their outcomes. EAT assessment is a unique method that serves a large group of high-risk individuals without a need for additional imaging or manual measurements, avoiding radiation while saving time and costs. Future prospective studies are warranted to investigate the impact of adding EAT measurements to routine lung cancer screening reports.