Introduction

After its emergence in late 2019, SARS-CoV-2 quickly developed into a global pandemic with nearly 450 million infections and over 6 million deaths (03/2022)1,2. Whether an infection leads to a severe course of disease resulting in mechanical ventilation or death depends on numerous parameters and conditions. Several recent studies identified clinical parameters associated with a higher vulnerability to a severe clinical outcome3,4,5. These studies used general characteristics like age, gender, and Body Mass Index (BMI) combined with different clinical laboratory features to better understand the importance of such features when predicting the clinical course severity of individual SARS-CoV-2 patients3,6,7,8,9,10.

Recently published studies focused primarily on BMI as a predictive feature for clinical outcomes3,6,7. Sattar et al. showed that younger SARS-CoV-2 patients with an increased BMI were at higher risk of hospitalization and ICU transfer3. Within a systematic review, Huang et al. investigated studies that analyzed the impact of BMI on the clinical course and showed that multiple studies suggested BMI is a relevant predictive feature11. One reason behind the frequent use of BMI is its convenient accessibility. However, a mentionable drawback is that the BMI can be understood as a very shallow approximation of body composition, as its interpretability can be negatively affected by physical anomalies or unusual body proportions regarding muscle and fat volumes12.

Based on this knowledge, the focus of research has shifted to the usage of more detailed features describing the body composition of patients. Chandarana et al. extracted visceral, subcutaneous and total adipose tissue features based on L3 axial slices and investigated the impact of those features on the severity13. Phan et al. investigated the impact of cardiac adipose tissues on the severity and mortality on diabetic patients14. Yang et al. measured BCA features on L3 axial slices and showed that a high visceral to subcutaneous adipose tissue ratio, skeletal muscle attenuation and a high intramuscular fat are decisive features regarding the severity15. These findings align with research results from other medical areas, such as oncology, in which body composition features are dominant predictors for clinical outcomes such as mortality16,17,18,19,20.

The major disadvantage of body composition analysis (BCA) in clinical routine is the time-consuming and impracticable manual feature extraction process21. Therefore, many common BCA methods are semi-automated and/or use only reference regions for assessment, like the lumbar vertebra (L3)13,15,20,22,23. However, a 2D reference image at the level of L3 is also known to be only a rough estimation of the tissue composition, which may differ throughout the volume24. Furthermore, the L3 region is often not captured on a regular chest CT, rendering this method unsuitable for assessing patients with SARS-CoV-2 pneumonia in clinical routine.

We have leveraged a fully-automatic 3D semantic segmentation convolutional neural network (CNN) to overcome these limitations. This approach enables us to precisely quantify relevant body tissues like bone, muscle and multiple adipose tissues and combine them as potential predictive biomarkers to determine the clinical outcome of SARS-CoV-2 patients. This study aims to characterize the relationship between different tissue volumes automatically extracted from CT-thorax scans and the clinical outcome for admitted SARS-CoV-2 patients. Additionally, other relevant BCA biomarkers that can help predict the severity of a SARS-CoV-2 infection will be uncovered.

Materials and methods

Ethics declarations

This study was conducted in compliance with the guidelines of the Institutional Review Board of the University Hospital Essen (approval number 21-10029-BO). Due to the study's retrospective nature, the requirement of written informed consent was waived by the Institutional Review Board. The data were completely anonymized before being included in the study.

Data

The retrospective data used for this study were collected at the University Hospital Essen. All patients with a positive SARS-CoV-2 diagnosis and an admission within the central emergency department between March 1st, 2020, and March 13th, 2022 were initially included. Subsequently, only patients with a valid CT thorax scan were considered. In the context of this study, a valid CT is defined as a CT-thorax scan taken as close to the timestamp of the admission as possible with a max difference of ± 5 days. The clinical outcome is divided into two endpoints “severity” and “mortality”. The first endpoint, “severity”, consists of the classes “mild” and “severe”. The class severe contains all patients who required mechanical ventilation and/or did not survive. The class “mild” encompasses all patients who survived without mechanical ventilation. The second endpoint, “mortality”, consists of the classes “survived” and “deceased” patients during their hospital stay. In addition to the imaging data, the following clinical parameters were included: age, gender, c-reactive protein (CRP)(mg/dl), leukocytes (/nl), hemoglobin (g/dl), and alanine aminotransferase (ALAT) (U/l). Furthermore, all CT scans were controlled manually, and scans with a low image quality or image artifacts were excluded. The final cohort contains all patients who meet the CT criteria and have all defined laboratory features available. The complete selection process for the cohort is visualized in Fig. 1. For further cohort characterization, a list of all comorbidities of included patients was extracted and enclosed within the supplemental material (Table S3).

Figure 1
figure 1

The data flow based on all identified and collected SARS-CoV-2 patients until March 2022. Overall, 3590 unique Patients were identified, but only 918 were used for further univariate and multivariate analysis as not all patients identified had a valid CT scan and/or the defined laboratory features available.

As depicted in the visualized selection process, a dataset with 918 patients was assembled. CT scans taken both before and closest to the hospital admission date were selected for each patient. This resulted in a mean difference of 1.05 ± 0.82 days between the initial diagnosis date and the selected chest CT scan. The mean difference in days between the collected laboratory values and the patient admission was 0.71 ± 3.38 days.

CT acquisition

The extracted 918 CT scans were examined with Siemens Somatom CT scanners which uses syngo CT VA50A (619), syngo CT VB20A (257), syngo VA48A (37) and syngo CT 2012B (5) software. All patients were scanned using the head-first-supine position and voxel spacings ranged from 0.54 to 0.97. In addition, 460 CT scans were performed using a pulmonary protocol. For the usage within the body composition network, all scans were resampled to a slice thickness of 5 mm.

Body composition analysis

The extraction of body composition features is based on a pre-trained convolutional neural network (CNN) published by Koitka et al.21. For this study, an enhanced approach of the proposed network to cover more body regions was used to generate the BCA segmentations which contain the following classes: bone, muscle, subcutaneous adipose tissue (SAT), visceral adipose tissue (VAT), intra- and intermuscular adipose tissue (IMAT), epicardial adipose tissue (EAT) and pericardial adipose tissue (PAT). Adipose tissues were derived using HU-based thresholding (− 190 to − 30 HU) from surrounding and detected body regions such as the dermis, abdomen, thorax, mediastinum, and pericardium. Muscle tissue was derived using a threshold of − 29 to 150 HU21. PAT and EAT, which are jointly referred to as pericardial fat, were subtracted from the latter two regions. Pericardial fat is located in the mediastinum adjacent to the parietal pericardium and surrounds the perivascular space of the coronary arteries. Epicardial fat is defined as a visceral fat deposit below the parietal pericardium. It directly surrounds and shares microcirculation with the myocardium. EAT and PAT, therefore, must be viewed separately and can also be distinguished embryologically25,26,27,28. Figure 2 visualizes a collage of coronal visualized segmentation results for randomly selected CT scans from the proposed cohort.

Figure 2
figure 2

(A) Exemplary outputs of the BCA model in coronal view obtained from randomly selected patients. (B) Visualization of the BCA segmentations in the axial view for randomly selected patients. The tissue color codes are defined as follows: orange: subcutaneous adipose tissue, yellow: muscle tissue, cyan: intra- and intermuscular adipose tissue, pink: bone, light blue: pericardial adipose tissue, purple: epicardial adipose tissue, green: visceral adipose tissue.

The model divides CT-thorax scans into four anatomical subgroups: whole scan, thoracic cavity, mediastinum, and pericardium. The tissue volumes were calculated using the thoracic cavity region from the provided CT-thorax scans. This enables overall comparability between all scans used for feature extraction. The BCA network defines the thoracic cavity as the chamber enclosed by the rib cage ranging from the superior thoracic aperture to the thoracic diaphragm and includes the trachea up to the cricoid cartilage. The raw BCA features were further normalized using the number of detected slices belonging to the thoracic cavity region to counteract differences in patient size or scan range. The normalized BCA features were further used to create two biomarkers: sarcopenia marker (SM) and Cardiac Marker (CM). The markers are defined as follows:

$$Sarcopenia \; Marker = \frac{Muscle}{Intra- and\;Intermuscular \; Adipose \; Tissue + Bone}$$
$$Cardiac \; Marker = \frac{Epicardial \; Adipose \; Tissue + Pericardial \; Adipose \; Tissue}{Muscle}$$

The SM uses the raw quantification of muscle divided by IMAT plus bone. As such, regarding patients mortality, it reflects the important ratio of muscle mass to skeletal volume29 and includes intramuscular adipose tissue as an additional risk factor30. A low score corresponds to a low muscle volume in relation to the skeletal volume and the volume of IMAT. The CM uses the adipose tissues of the heart (EAT and PAT) divided by the muscle volume. EAT and PAT are important risk factors for cardiac health31,32,33 and provide independent additional information to the calcium score25. By normalizing EAT and PAT to the muscle volume, on the one hand, the inter-individual comparability is increased, and on the other hand, the muscular habitus is included as a positive determinant for cardiovascular health34. For a visual interpretation of the markers, exemplary axial slices from patients with the highest and lowest marker values are visualized within the supplemental material (Fig. S3).

Statistical analysis

Initially, all relevant BCA raw features for the marker creation (muscle, bone, IMAT, EAT, and PAT) were analysed univariately. In addition, for all features, the normality assumption was checked using the Shapiro–Wilk-Test35. For general reporting, the median and interquartile range (IQR) were calculated for all non-normally distributed continuous variables (median [IQR]). Mean and standard deviation (SD) were used to report normal distributed continuous variables (mean ± SD). Categorical variables were also reported as total count and percentages. For univariate testing of the BCA features, markers, and laboratory values a two-sided Mann–Whitney-U test36 was conducted.

Furthermore, a binary logistic regression was performed to estimate the effect of selected features on the severity and mortality. In the beginning, the skewness of the BCA and laboratory features were calculated. Since all features were highly skewed, all features were log2 transformed to counteract a skewed distribution. In addition, the correlation between the resulting sarcopenia and cardiac marker was calculated using Pearson's correlation index. Due to the high negative correlation of both markers, the logistic regression was conducted for each marker and adjusted to the following clinical features: CRP, leukocytes, hemoglobin, ALAT, age, and sex. It must be stated that the logistic regression was only used for statistical assessment of the defined BCA markers and not in a predictive way. The logistic regression results are described in adjusted odds ratios (OR), corresponding confidence intervals (95% CI), and resulting p-values. All calculated p-values within this study were rounded to three decimal points and values smaller than 0.001 or 0.0001 were reported as ≤ 0.001 and ≤ 0.0001.

The univariate and multivariate analysis was conducted based on the groups “mild” and “severe” for the severity and the classes “survived” and “deceased” for the mortality endpoint. The statistical analysis was performed using the python package statsmodels37 (logistic regression) and scipy38 (Shapiro–Wilk, Mann–Whitney-U).

Approval for human experiments

The experiments of the study were approved by the Institutional Review Board of the University Hospital Essen (approval number 21-10029-BO). The experiments were performed in accordance with the declaration of Helsinki and with all guidelines set forth by the approving institutional review board. Due to the study's retrospective nature, the requirement of written informed consent was waived by the Institutional Review Board. The data were completely anonymized before being included in the study.

Results

The division within the gender groups resulted in a subcohort of 354 female patients encompassing 239 mild and 115 severe cases (77 deceased) and 564 male patients encompassing 360 mild and 204 severe cases (109 deceased). Table 1 shows the baseline statistics of the demographic parameters and the laboratory features concerning the severity classes and mortality.

Table 1 Baseline patient characteristics, including the gender distribution and age statistics with respect to the severity and mortality endpoint within the cohort of 918 patients.

The first analysis univariately compares the normalized BCA features bone, muscle, IMAT, EAT, and PAT, which will be used to obtain markers for each gender using a Mann–Whitney-U test. The obtained results are depicted in Table 2.

Table 2 Overview of the median volume [IQR] and p-values of the normalized BCA-Features (per slice) within both endpoint categories severity and mortality.

As the results of the univariate raw BCA feature analysis indicate, the selected features are mainly significant in the male patient group (muscle, bone, IMAT, EAT, and PAT). In contrast to that, muscle, bone, and IMAT were also significant for the female group. In addition to the results presented in Table 1, the corresponding boxplot visualizations are enclosed within the supplemental material (Figs. S1, S2). Furthermore, the univariate setting was also conducted for the laboratory features and is enclosed within the supplemental material (Table S1). Subsequently, BCA features were combined to generate the sarcopenia and cardiac marker which were tested univariately as well. The resulting significance of the tests are depicted in Fig. 3.

Figure 3
figure 3

A box plot visualization including Mann–Whitney-U tests of the sarcopenia and cardiac marker for the female patients (first row) and male patients in the cohort (second row). (A) Univariate analysis regarding severity (B) Univariate analysis regarding mortality.

Based on the results of the Mann–Whitney-U Test, the SM (pfemale ≤ 0.0001, pmale ≤ 0.0001) as well as the CM (pfemale=0.014, pmale ≤ 0.0001) show significance within the univariate setting regarding the severity endpoint. Furthermore, the SM (pfemale ≤ 0.0001, pmale ≤ 0.0001) and CM (pfemale ≤ 0.0001, pmale ≤ 0.0001) also showed significance regarding the mortality endpoint. In comparison to the BCA markers, the BMI was only available for 349 patients (122 female, 227 male) from which 64 were severe and 21 were deceased. Within the proposed univariate Setting it could be shown that the BMI was not significant within both groups. The complete results for the univariate analysis of the BMI are depicted in the supplemental material (Table S2).

Since the SM and CM are negatively correlated (rseverity = − 0.76, rmortality = − 0.79) and share the muscle volume per definition, both markers were adjusted to the following clinical parameters for each gender group: CRP, leukocytes, hemoglobin, ALAT, and age. The resulting adjusted ORs and the corresponding p-values of the logistic regression are presented in two sections within Table 3, with section A referring to SM and section B referring to CM.

Table 3 Adjusted odds ratios including the 95% CI and the p-value for the severity and mortality endpoint for (A) sarcopenia marker and (B) cardiac marker.

Given the results of logistic regression, the sarcopenia (OR = 0.42, p = 0.006, 95% CI 0.23–0.78) and the cardiac marker (OR = 1.42, p = 0.018, 95% CI 1.06–1.90) are both significant regarding the severity endpoint. However, only the sarcopenia marker (OR = 0.34, p = 0.002, 95% CI: 0.17–0.67) is significant for the mortality endpoint. The ORs and the corresponding confidence intervals are presented in Fig. 4.

Figure 4
figure 4

Odds ratio plot for the multivariate logistic regression conducted for both endpoints. The horizontal line is set to an odds ratio of 1, and the odds ratios are displayed with circles. Additionally, the confidence intervals (CI 5%, CI 95%) are presented with the left bars (5%) and right bars (95%).

Discussion

We investigated the significance of two defined markers, which were automatically extracted from tissues depicted in CT thoracic scans, for the clinical outcome of SARS CoV-2-patients. The main aim was to derive relevant BCA-based biomarkers that could be used as potential features for the risk stratification of SARS CoV-2 patients regarding two endpoints disease severity and mortality.

Our results indicate that BCA biomarkers are statistically related to the clinical course for SARS-CoV-2 patients. As indicated by the p-values from the univariate statistical test, the raw BCA features muscle and bone were substantial for both gender groups and endpoints. In addition, IMAT, PAT, and EAT were considerable for male patients for both endpoints. Furthermore, for female patients, IMAT and PAT were substantial regarding the severity endpoint. The combination of those raw BCA features into the defined sarcopenia and cardiac marker showed that those markers were univariately significant for both genders and both endpoints. Furthermore, the multivariate logistic regression showed that the sarcopenia marker is statistically significant in addition to proven clinical laboratory features for both endpoints. The cardiac marker was only significant regarding the severity endpoint. Based on the ORs we delineate the interpretation of both markers as follows: Patients exhibiting a high sarcopenia marker have lower odds of developing a severe course of disease or death. In contrast, patients with a high cardiac marker are associated with higher odds of developing a severe course or death. In the development of a new risk score for outcome prediction in patients with SARS-CoV-2, these biomarkers should be taken into account since a CT of the thorax is a standard procedure for SARS-CoV-2 patients and the extraction of tissue volumes can be performed conveniently and fully automated within the clinical routine, as shown by Koitka et al.21. In addition, the presented method and associated BCA features provide a much more accurate and diverse analysis of the patient's body than, for example, BMI. This was also shown by the univariate analysis performed (Supplemental Material Table S2).

These results tie well with other studies wherein the relation of different adipose tissues and/or the muscle volume with the clinical course of SARS-CoV-2 patients was shown. Chandarana et al. demonstrated that the VAT volume measured on an L3 region axial slice is a valuable feature for identifying SARS-CoV-2 patients in need of hospitalization13. Grodecki et al. outlined that EAT volume and attenuation seem to be associated with the quantitative burden of SARS-CoV-2 pneumonia and a larger EAT volume or attenuation might independently predict clinical deterioration or death39. Schiaffino et al. showed that patients with lower paravertebral muscle areas and attenuation have a higher risk of ICU admission40. Chandarana et al. used BCA features aggregated from an L3 axial slice to calculate muscle adipose tissue and muscle mass ratio and VAT to total adipose tissue ratios and showed that those features have a predictive value for the identification of patients with a need for hospitalization22.

Our results generally support these findings, which indicate that the adipose tissues located in the cardiac region and muscle volume are decisive features that correlate with the clinical outcome. By contrast, the sarcopenia marker performs well for both endpoints but best in context of the patient mortality. Unlike Grodecki et al. and Schiaffino et al., we deliberately did not include tissue density in our analysis because it depends on many technical parameters, such as tube voltage, hardening artifacts, reconstructed slice thickness and also contrast administration. A very high degree of standardization in image acquisition is therefore required, which is often not available in routine clinical practice and emergency examinations. In view of a practically applicable and robust method, we, therefore, restricted our study to tissue volumes as a foundation for the defined markers.

In contrast to the above-mentioned studies, our work presents a novel approach to quantify all body tissues within seconds, thus making BCA features readily available for use in clinical routine. To the best of our knowledge, none of the presented studies could quantify all BCA features for the complete thoracic cavity. Instead, most studies used reference axial L3 slices for the feature aggregation which is not applicable within the clinical routine since this region is not present in a thoracic CT. Our study presents a novel approach to quantifying all relevant tissues and using them for BCA-based biomarker aggregation. The results of our study suggest that sarcopenia and cardiac marker can also be added to the list of significant parameters. To access these non-invasive features, only a suitable CT scan that the BCA algorithm can automatically process is needed. The presented approach is equally suitable for both standard and emergency cases. From the medical point of view, the clinical application of the approach presented here depends on the existence of a CT thorax scan. In contrast, from a technical point of view, the application requires integrating the BCA network21. Consequently, only cases in which a CT scan is not performed cannot be assesed with this approach. These will most likely be mild cases rather than severe ones.

A notable limitation of this study is that within our dataset, information on SARS-CoV-2 mutations was unavailable at the time. Because the data underlying this study was collected from multiple SARS-CoV-2 waves (2020–2022), the effect of SARS-Cov-2 mutations could have influenced the likelihood of developing a severe disease course or mortality. Future studies should investigate the significance and importance of BCA features in the context of SARS-CoV-2 variants. A further limitation is the monocentric design of our study. Future studies would benefit from a multicentric approach, in which patients with diverse geographical regions of origin are included. This would strengthen the results and indicate the overall applicability of BCA feature extraction. It would also minimize potential biases and counteract the statistical influence of different SARS-CoV-2 mutations on the analysis.

In future studies, large patient cohorts should be used to investigate the applicability of BCA feature extraction in the determination of newly-admitted SARS-CoV-2 patients’ clinical courses. The applicability of BCA feature extraction to the clinical course predictions of other forms of pneumonia, influenza, and ARDS also warrants further investigation. For example, in the case of pneumonia, performing a fully automated BCA on hospitalized patients shortly after admission could potentially enable the early detection of severe disease courses. Implementing BCA in the clinical workflow could offer additional prognostic value by utilizing already gathered data (e.g., CT scans) without any additional expenditure of time or resources.

Conclusion

The results of the study show that BCA markers based on raw BCA parameters extracted automatically from CT scans have the potential to improve risk stratification in patients with acute SARS-Cov-2 infection. Due to the robust and fully automatable methodology, these parameters should be considered in developing new risk scores. Future studies should investigate the predictive value of these features in relation to different SARS-CoV-2 mutation strands, as well as the applicability of BCA feature extraction in other acute respiratory conditions.