Introduction

Pneumonia is among the leading causes of death globally1, and each year, approximately 100,000 patients with pneumonia require intensive care unit (ICU) admission for mechanical ventilation (MV)2,3. Approximately half of the patients with respiratory failure in the ICU have pneumonia, and a quarter of them are re-admitted to the ICU4. Approximately 20–50% of patients with pneumonia who are admitted to the ICU die4,5,6,7. Clinical scoring systems for predicting in-ICU mortality in critically ill patients have been introduced, but none have been developed specifically for patients admitted to the ICU with severe pneumonia. Therefore, there is an urgent need for severity assessment tools to predict in-ICU mortality in patients with severe pneumonia8.

There has been recent interest in identifying the factors related to the outcomes of ICU patients. Vital signs9,10,11; laboratory markers such as lactate, thrombocytopenia, troponin, and bicarbonate levels12,13,14; and intervention strategies such as the use of vasopressors, MV, and continuous renal replacement therapy have been identified as closely related to the prognoses of ICU patients15,16. However, previously validated predictive models, including CURB-65, CRB-65, the Pneumonia Severity Index (PSI), the Quick Sequential Organ Failure Assessment (qSOFA), the Simplified Acute Physiology Score 3 (SAPS-3), and the Acute Physiology and Chronic Health Evaluation II (APACHE II), have shown suboptimal predictive performance in patients with severe pneumonia at ICU admission or emergency department visits17,18,19. Therefore, a new model for predicting the in-ICU mortality of patients admitted to the ICU for severe pneumonia needs to be developed.

Machine learning (ML) has demonstrated great potential in various medical fields, including surgery20, cardiology21,22 (e.g., early detection of heart failure)23, cancer research24,25, and intensive care medicine (diagnosis or prognosis prediction of critical illness)26,27,28. Although the integration of ML into the ICU setting is still in its early stages, several studies have already explored its application in managing critically ill patients26. Some studies have used large population datasets to predict length of stay (LOS)29, ICU readmission30, mortality rates31,32,33, and the risks of sepsis34 or acute respiratory distress syndrome35. In particular, there is increasing evidence that ML models are superior to conventional severity-of-illness scoring systems for predicting mortality in ICU patients36,37 or patients with pneumonia38. In addition, ML models may have good predictive performance for mortality, especially in patients with severe pneumonia39,40,41. One study demonstrated that an artificial neural network model had strong predictive performance for 14-day hospital readmission in patients with pneumonia42. Another study demonstrated that an ML model had good performance for predicting 30-day mortality in patients with sepsis28. However, evidence on whether ML models can accurately predict in-ICU mortality in patients with severe pneumonia who require ICU admission is scarce.

Thus, the present study aimed to elucidate the performance of ML models in predicting mortality in patients with severe pneumonia who require ICU admission, using information available at the time of ICU admission. Our findings will aid in selecting individual ML models for predicting in-ICU mortality in different subgroups.

Methods

Ethics

This study complied with the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD) reporting guidelines43. This study was approved by the Institutional Review Board (IRB) Committee of Seoul National University Seoul Metropolitan Government (SNU-SMG) Boramae Medical Center (IRB Number: 10-2021-110) on November 16, 2022 and was conducted in accordance with the 1975 ethical guidelines of the Declaration of Helsinki. The requirement for informed consent from the study subjects was waived by the same IRB committee owing to the retrospective study design.

Data collection, study design, and population

This observational study retrospectively evaluated patients admitted to the ICU for pneumonia at the SNU-SMG Boramae Medical Center between January 2016 and December 2021. The inclusion criteria were: (1) age ≥ 18 years; (2) ICU admission; (3) an International Classification of Diseases, 10th Revision (ICD-10) code for pneumonia as a major diagnosis or detection of pneumonia on chest computed tomography within 1 week of ICU admission; (4) C-reactive protein (CRP) level ≥ 4 mg/dL; and (5) use of antibiotics for pneumonia. The exclusion criteria were: (1) no oxygen requirement; (2) transfer to the general ward within 3 days; and (3) ICU admission due to a medical condition more serious than pneumonia.

Baseline data, including age, sex, body mass index, smoking history, underlying diseases, and respiratory comorbidities, as well as clinical features, were collected. The clinical features included prognostic scores, vital signs, laboratory examination results, and treatment with antibiotics or steroids.

Main outcome measures

The primary outcome measure was the prognostic accuracy of ML models compared with that of conventional severity-of-illness scoring systems for predicting in-ICU mortality in patients who required ICU admission for severe pneumonia. The secondary outcome measures were as follows: (1) the prognostic accuracy of the ML models; (2) the prognostic accuracy of the ML models in different subgroups; and (3) the clinical factors contributing to the prediction of in-ICU mortality in patients admitted to the ICU for severe pneumonia. ICU admission due to severe pneumonia was defined as the presence of at least one major criterion or three minor criteria of the Infectious Diseases Society of America/American Thoracic Society guidelines3.

The conventional severity-of-illness scoring systems included the SOFA, SAPS II, and APACHE II scores. Among these scoring systems, the one with the strongest performance was used as the baseline comparator. For the statistical and ML models, we tested three popular models: (1) logistic regression with L2 regularization, (2) a gradient-boosted decision tree (LightGBM), and (3) a multilayer perceptron (MLP).
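As a rough illustration (not the authors' exact configuration), the three model families can be instantiated in Python with scikit-learn and LightGBM as sketched below; all hyperparameter values shown are placeholders rather than the tuned settings used in the study.

```python
# Illustrative instantiation of the three model families; the hyperparameters
# below are placeholders, not the study's tuned values.
from sklearn.linear_model import LogisticRegression
from sklearn.neural_network import MLPClassifier
from lightgbm import LGBMClassifier

models = {
    # L2-regularized logistic regression (penalty="l2" is the scikit-learn default)
    "logistic_regression": LogisticRegression(penalty="l2", C=1.0, max_iter=1000),
    # Gradient-boosted decision tree ensemble
    "lightgbm": LGBMClassifier(n_estimators=300, learning_rate=0.05),
    # Multilayer perceptron with fully connected hidden layers
    "mlp": MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=500),
}
```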

Data splitting and preprocessing

Variables with more than a 20% missing rate were excluded to generate the available dataset. Approximately 40% of the data were randomly separated with stratification by the outcome and subgrouping variables. The held-out data were used as a test set only for internal validation of the models. The remaining data were used to develop the models as a training set in a tenfold cross-validation scheme.
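A minimal sketch of this splitting scheme is shown below, assuming a hypothetical flat file (icu_pneumonia.csv) with an outcome column death_icu and a combined subgrouping column subgroup.

```python
# Sketch of the 60/40 stratified hold-out split and the tenfold
# cross-validation scheme on the training portion.
import pandas as pd
from sklearn.model_selection import train_test_split, StratifiedKFold

df = pd.read_csv("icu_pneumonia.csv")  # hypothetical file and column names
strata = df["death_icu"].astype(str) + "_" + df["subgroup"].astype(str)

# Hold out ~40% of the data, stratified by outcome and subgrouping variable.
train_df, test_df = train_test_split(df, test_size=0.4, stratify=strata, random_state=42)

# Tenfold cross-validation is run on the training set only; the held-out
# test set is reserved for internal validation.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=42)
for fold, (tr_idx, va_idx) in enumerate(cv.split(train_df, train_df["death_icu"])):
    fold_train, fold_valid = train_df.iloc[tr_idx], train_df.iloc[va_idx]
```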

Missing values were imputed using multivariate imputation by chained equations44. Outliers were detected using an isolation forest45 and subsequently replaced with the closest normal value of the training set. All the variables included in the analysis and their missing rates are listed in Supplementary Table S1.
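The following sketch illustrates these preprocessing steps with scikit-learn, continuing from the split above. Replacing outliers with the closest normal value of the training set is approximated here by clipping to the inlier range identified by the isolation forest; this is an assumption, not the authors' exact rule, and the features are assumed to be numerically encoded.

```python
# MICE-style imputation (IterativeImputer) and isolation-forest outlier
# handling; X/y matrices are derived from the train_df/test_df split above.
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer
from sklearn.ensemble import IsolationForest

# Assumes all feature columns are already numerically encoded.
feature_cols = [c for c in train_df.columns if c not in ("death_icu", "subgroup")]
X_train, y_train = train_df[feature_cols].to_numpy(dtype=float), train_df["death_icu"].to_numpy()
X_test, y_test = test_df[feature_cols].to_numpy(dtype=float), test_df["death_icu"].to_numpy()

# Multivariate imputation by chained equations (fit on the training set only).
imputer = IterativeImputer(max_iter=10, random_state=42)
X_train_imp = imputer.fit_transform(X_train)
X_test_imp = imputer.transform(X_test)

# Flag outliers with an isolation forest and clip values to the inlier range
# of the training set (a simple stand-in for "closest normal value").
iso = IsolationForest(random_state=42).fit(X_train_imp)
inliers = X_train_imp[iso.predict(X_train_imp) == 1]
lo, hi = inliers.min(axis=0), inliers.max(axis=0)
X_train_clean = np.clip(X_train_imp, lo, hi)
X_test_clean = np.clip(X_test_imp, lo, hi)
```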

Variable importance and feature selection

The influence of each variable on the predictive ability of the model was evaluated using the SHapley Additive exPlanations (SHAP) method46. To rank the variables, the mean absolute SHAP values were calculated as the relative importance of the variables. A LightGBM model was used for the SHAP evaluation47. The guiding metric for cross-validation performance was the area under the receiver operating characteristic curve (AU-ROC).
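A hedged sketch of this ranking step is shown below; it reuses the preprocessed matrices from the previous sketch and assumes the shap package is available.

```python
# Rank variables by mean absolute SHAP value using a LightGBM model.
import numpy as np
import shap
from lightgbm import LGBMClassifier

gbm = LGBMClassifier(n_estimators=300, random_state=42).fit(X_train_clean, y_train)
explainer = shap.TreeExplainer(gbm)
shap_values = explainer.shap_values(X_train_clean)

# For binary classifiers, some shap versions return a list [class 0, class 1].
sv = shap_values[1] if isinstance(shap_values, list) else shap_values
mean_abs_shap = np.abs(sv).mean(axis=0)
ranking = sorted(zip(feature_cols, mean_abs_shap), key=lambda t: -t[1])
```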

Model development

Supplementary Fig. S1 presents the workflow diagram for model development. LightGBM is a gradient-boosted tree-based ensemble model, whereas MLP is a feedforward neural network with a basic architecture comprising fully connected layers. The hyperparameters were tuned using Bayesian optimization to maximize the cross-validation performance. Details of the hyperparameter tuning, with package information for all tested models, are described in the Supplementary Methods and Supplementary Table S2. The models were calibrated using isotonic regression on the validation data obtained during cross-validation. The methods in the present study were implemented in Python version 3.9.7 (Python Software Foundation, Wilmington, DE, USA) with scikit-learn (version 1.1.2).
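As an illustration of the calibration step only (the Bayesian hyperparameter search is omitted here and could be driven by any optimization library), isotonic calibration over cross-validation folds can be sketched as follows; the LightGBM parameters are placeholders.

```python
# Isotonic calibration fitted on cross-validation folds of the training set.
from sklearn.calibration import CalibratedClassifierCV
from lightgbm import LGBMClassifier

base = LGBMClassifier(n_estimators=300, learning_rate=0.05, random_state=42)
calibrated = CalibratedClassifierCV(base, method="isotonic", cv=10)
calibrated.fit(X_train_clean, y_train)

# Calibrated predicted probabilities on the held-out test set.
test_probs = calibrated.predict_proba(X_test_clean)[:, 1]
```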

Internal validation in different subgroups

The model performance in the different patient subgroups was evaluated using the test set. We prespecified subgroups based on the clinically important phenotypes of pneumonia in the ICU48. Subgroup analyses were performed according to (1) the period from hospital admission to ICU admission, (2) age, (3) APACHE II scores, (4) PaO2/FiO2 ratio, (5) history of chronic respiratory disease, (6) history of cerebrovascular accident (CVA) or dementia, (7) MV, and (8) use of vasopressors.
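A brief sketch of this subgroup evaluation on the test set is shown below; "age_group" is a hypothetical precomputed subgrouping column, and y_test and test_probs carry over from the sketches above.

```python
# Evaluate test-set AU-ROC within one prespecified subgrouping variable.
import numpy as np
from sklearn.metrics import roc_auc_score

for level in test_df["age_group"].unique():
    mask = (test_df["age_group"] == level).to_numpy()
    y_sub, p_sub = y_test[mask], test_probs[mask]
    if len(np.unique(y_sub)) == 2:  # AU-ROC requires both outcomes in the subgroup
        print(f"age_group={level}: AU-ROC={roc_auc_score(y_sub, p_sub):.3f} (n={mask.sum()})")
```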

Statistical analysis

Categorical variables were analyzed using the chi-squared test, while continuous variables were analyzed using the independent t-test or Mann–Whitney U test. We evaluated the AU-ROC as an overall performance measure and compared the models using the DeLong method49. Owing to the imbalance in outcome prevalence, the area under the precision-recall curve (AU-PRC) was also evaluated as an overall performance measure50. Furthermore, the performance of the models was evaluated in detail according to the sensitivity, positive predictive value, negative predictive value, diagnostic odds ratio, and net reclassification improvement (NRI)51 at three low false-positive rate (FPR) levels of 10%, 20%, and 30%. The recalibration effects were also evaluated using decision curves, which present the net benefit across different decision thresholds52. The sensitivity at fixed FPR levels was evaluated in the subgroups using the model with the best overall performance. Calibration errors were evaluated before and after calibration using the Brier score and calibration curves. The Brier score was calculated as the mean squared error of the predicted probabilities53. P values were adjusted using the Benjamini–Hochberg method for multiple comparisons, and the significance level was set at P < 0.05.
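As a hedged illustration of the overall and threshold-specific metrics (DeLong comparisons and NRI require dedicated implementations and are omitted), the AU-ROC, an AU-PRC analogue (average precision), the Brier score, and the sensitivity at fixed FPR levels can be computed as follows, reusing y_test and test_probs from the sketches above.

```python
# Overall performance metrics and sensitivity at fixed false-positive rates.
import numpy as np
from sklearn.metrics import (average_precision_score, brier_score_loss,
                             roc_auc_score, roc_curve)

au_roc = roc_auc_score(y_test, test_probs)
au_prc = average_precision_score(y_test, test_probs)  # average precision as AU-PRC analogue
brier = brier_score_loss(y_test, test_probs)          # mean squared error of probabilities

fpr, tpr, thresholds = roc_curve(y_test, test_probs)
for target_fpr in (0.10, 0.20, 0.30):
    i = np.searchsorted(fpr, target_fpr, side="right") - 1  # largest index with FPR <= target
    print(f"FPR <= {target_fpr:.0%}: sensitivity = {tpr[i]:.3f}, threshold = {thresholds[i]:.3f}")
```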

Results

Baseline characteristics and clinical features

In total, 816 patients with pneumonia admitted to the ICU were included in the analysis (Fig. 1). The median patient age was 77 years, and 588 patients (72.1%) were male. Overall, 223 (27.3%) patients died. The median LOS was 6 days (IQR, 2–13 days). The baseline characteristics are summarized in Table 1. There were significant differences in age and smoking history between the survivor and non-survivor groups. The non-survivor group was older, included more current smokers, and had more smoking pack-years among ever-smokers (9.9). Regarding comorbidities, the non-survivor group also included more patients with interstitial lung disease, pulmonary tuberculosis, lung cancer, chronic kidney disease, cerebrovascular disease, cardiovascular disease, chronic heart failure, and metastatic cancer. Further, this group had higher illness severity scores, including the APACHE II, SOFA, and SAPS II scores.

Figure 1 Patient selection flowchart.

Table 1 Baseline patient characteristics.

The clinical characteristics are presented in Table 2. Regarding vital signs, the survivor group had higher systolic, diastolic, and mean blood pressures, while the non-survivor group had faster heart and respiratory rates and lower urine output. The non-survivor group also had a lower partial pressure of oxygen (PaO2), lower bicarbonate (HCO3−) levels, lower oxygen saturation (SpO2), and a lower PaO2/FiO2 ratio. Regarding laboratory findings, the levels of urea nitrogen, creatinine, alanine aminotransferase, total bilirubin, and alkaline phosphatase (ALP), as well as the prothrombin time-international normalized ratio (PT-INR), were higher in the non-survivor group. Meanwhile, the survivor group was more likely to be treated with steroids and vasopressors. The comparisons between the training and test sets for the baseline characteristics and clinical features are presented in Supplementary Tables S3 and S4, respectively.

Table 2 Clinical patient characteristics.

Overall performance of the prediction models

The prognostic performance and accuracy of the prediction models for in-ICU mortality are summarized in Fig. 2. Among the conventional severity-of-illness scoring systems, the AU-ROC and AU-PRC were numerically (but not statistically significantly) higher for SAPS II than for the other scoring systems: SAPS II, AU-ROC 0.650 (95% CI, 0.584–0.761) and AU-PRC 0.406 (95% CI, 0.288–0.518); SOFA, AU-ROC 0.619 (95% CI, 0.550–0.689) and AU-PRC 0.357 (95% CI, 0.251–0.467); and APACHE II, AU-ROC 0.616 (95% CI, 0.546–0.685) and AU-PRC 0.377 (95% CI, 0.262–0.495) (Supplementary Table S5).

Figure 2 Overall performance of the models. (A) Receiver operating characteristic curves and (B) precision-recall curves. Solid lines and shaded areas indicate the mean curves and 95% confidence interval areas, respectively. SAPS II is the baseline model, and its confidence intervals are represented with a polka dot pattern. An asterisk (*) indicates a significantly higher area under the receiver operating characteristic curve than the baseline model (P < 0.05, Benjamini–Hochberg corrected).

All the proposed ML models significantly outperformed all scoring systems with respect to AU-ROC (P < 0.001). The MLP model showed better performance than SAPS II, with an AU-ROC of 0.838 (95% CI, 0.791–0.884), although there was no significant difference compared with the logistic regression (0.820 [95% CI, 0.771–0.869]) and LightGBM (0.827 [95% CI, 0.777–0.876]) models. Furthermore, all tested ML models showed numerically higher AU-PRC values for predicting in-ICU mortality than SAPS II. The MLP model showed the highest AU-PRC of 0.670 (95% CI, 0.552–0.770), which was numerically higher than that of the logistic regression (0.640 [95% CI, 0.522–0.739]) and LightGBM (0.653 [95% CI, 0.552–0.770]) models and of SAPS II (0.406). The cross-validation performance results are presented in Supplementary Table S6. The calibration and decision curves are shown in Supplementary Figs. S2 and S3, respectively. The detailed performance and NRI at the 10%, 20%, and 30% FPR levels are presented in Supplementary Table S7.

Overall performance of ML models according to subgroups

Figure 3 shows the overall performance of the ML models for in-ICU mortality in the different subgroups. The models performed consistently in most of the subgroups. Despite differences in the in-ICU mortality rate, there was no significant difference in AU-ROC between the models in most subgroups, except in the age 65–74 years subgroup (logistic regression vs MLP, P = 0.023), the APACHE II score ≤ 19 subgroup (logistic regression vs MLP, P = 0.045; LightGBM vs MLP, P = 0.031), and the MV treatment subgroup (logistic regression vs MLP, P = 0.044).

Figure 3 Overall performance by subgroup. (A) Area under the receiver operating characteristic curves and (B) area under the precision-recall curves. The notation (n = a, b%) under each subgroup name indicates the number of samples in the test set (a) and the prevalence of the outcome in the subgroup (b%). Box plots are plotted with whiskers of 1.5 times the interquartile range. The subgrouping variables are (A) the period from hospital admission to ICU admission, (B) age, (C) Acute Physiology and Chronic Health Evaluation (APACHE) II score, (D) PaO2/FiO2 ratio, (E) history of chronic respiratory disease, (F) history of cerebrovascular accident or dementia, (G) mechanical ventilation, and (H) vasopressor use. AU-PRC, area under the precision-recall curve; AU-ROC, area under the receiver operating characteristic curve; CVA, cerebrovascular accident; Dz, disease; Hadm, hospital admission; ICU, intensive care unit; yrs, years.

In the NRI analysis, the LightGBM and MLP models showed superior reclassification compared with the logistic regression model for predicting in-ICU mortality in all subgroups of the period from hospital admission to ICU admission; all age subgroups; all APACHE II score subgroups; the subgroup with a PaO2/FiO2 ratio < 200; the subgroups with or without a history of respiratory disease; the subgroups with or without a history of CVA or dementia; the subgroup treated with MV; and the subgroup treated with inotropic agents (Supplementary Table S8). In most of these subgroups, the LightGBM model showed higher NRI values than the MLP model, except for the subgroups aged 65–74 years, with an APACHE II score ≤ 19, and with a history of CVA or dementia.

Attributable variables with importance plots and SHAP values

The selected predictors of in-ICU mortality are shown in Fig. 4. Of a total of 55 variables, 16 were selected; these included the PaO2/FiO2 ratio, CRP level, lactate level, urine output, initial systolic blood pressure (SBP), and white blood cell (WBC) count. Among these variables, the partial SHAP dependence plots for the top six variables by mean absolute SHAP value are illustrated in Supplementary Fig. S4; those for the other variables are shown in Supplementary Fig. S5. The local interpretability of the LightGBM model is demonstrated in Supplementary Fig. S6, which shows how the model predicts representative true-positive, true-negative, false-positive, and false-negative cases.

Figure 4 Importance of the selected variables. The individual influences of each value and the overall contribution of each variable to the model prediction are represented as dots on the right and bars on the left, respectively. In the plot on the right, red dots indicate high feature values for continuous/ordinal variables or affirmative responses for binary variables. Positive SHAP values indicate contributions that increase the prediction score, whereas negative SHAP values indicate contributions that decrease it. ALP, alkaline phosphatase; CRP, C-reactive protein; DBP, diastolic blood pressure; PF ratio, PaO2/FiO2 ratio; PR, pulse rate; PT-INR, international normalized ratio of prothrombin time; SBP, systolic blood pressure; WBC, white blood cell.

Discussion

Our study evaluated the prognostic accuracy of ML models compared with that of conventional severity-of-illness scoring systems for predicting in-ICU mortality in patients with severe pneumonia. All ML models showed excellent performance in predicting in-ICU mortality and were superior to SAPS II. In addition, when applied in the different subgroups, the LightGBM and MLP models showed superior reclassification compared with the logistic regression model in all subgroups of the period from hospital admission to ICU admission; all age subgroups; all APACHE II score subgroups; the subgroup with a PaO2/FiO2 ratio < 200; the subgroups with or without a history of respiratory disease; the subgroups with or without a history of CVA or dementia; the subgroup treated with MV; and the subgroup treated with inotropic agents. Furthermore, the LightGBM model showed higher NRI values than the MLP model in most of these subgroups, except for the subgroups aged 65–74 years, with an APACHE II score ≤ 19, and with a history of CVA or dementia. Therefore, ML models have the potential to improve in-ICU mortality prediction in patients with severe pneumonia admitted to the ICU. Moreover, this study shows the potential advantages of selecting individual ML models for predicting in-ICU mortality in different subgroups of these patients.

CURB-65 and PSI are the most commonly used clinical severity-of-illness scoring systems for patients with community-acquired pneumonia. However, these tools have limitations: CURB-65 has low sensitivity, while PSI has low specificity, for predicting mortality54. The Clinical Pulmonary Infection Score is a well-validated prediction model for the development of ventilator-associated pneumonia, but its predictive performance for mortality is inferior to that of the APACHE II score55. Given the limitations of clinical severity-of-illness scoring systems, the usefulness of ML models for predicting mortality in patients with pneumonia has recently been studied. The results show that various ML models outperform CURB-6538,39,56 and PSI57 for predicting mortality in patients with severe community- or hospital-acquired pneumonia. However, few studies have used ML models to predict the prognosis of patients with severe pneumonia admitted to the ICU. One small study reported that an ML approach performed better than APACHE II and PSI for predicting mortality in critically ill patients with influenza58. In our study, SAPS II showed a numerically higher AU-ROC than APACHE II and SOFA for predicting in-ICU mortality on the first day of ICU admission. Importantly, the ML models outperformed SAPS II, suggesting that ML models can provide more accurate information for optimal decision-making based on the estimated probability of mortality.

We selected logistic regression, LightGBM, and MLP as the ML predictive models for mortality in patients with severe pneumonia based on several considerations. Logistic regression is a well-established and widely used ML model for binary classification tasks. It offers a simple and interpretable approach to modeling the relationship between predictor variables and the outcome while often achieving competitive predictive performance59,60,61. LightGBM is a gradient-boosted decision tree algorithm that has gained popularity for its high performance and efficiency62. Several studies have demonstrated the favorable predictive value of LightGBM in the field of medicine63,64,65. Another study found that LightGBM had the best predictive ability among several ML models, including XGBoost, logistic regression, and naïve Bayes66. MLP-based models are effective in capturing nonlinear relationships, making them suitable candidates for complex and multifactorial disease classification, including stroke67,68, compared with conventional statistical modeling. Moreover, both LightGBM and MLP have been used in many clinical studies69,70,71, demonstrating their extensive applicability and promising predictive performance. In the present study, the ML models outperformed the conventional severity-of-illness scoring systems. Scoring systems use a limited number of variables, which might restrict their predictive power in individual patients72. In contrast, ML models are capable of utilizing high-dimensional data, which could account for their superior performance.

In this study, the LightGBM model showed the highest predictive performance with respect to NRI at the 10% FPR level. This result suggests that decision-tree-based models could be more beneficial than logistic regression models for predicting in-ICU mortality in patients with pneumonia at a high cut-off point of 90% specificity. A strength of ML models is their ability to capture nonlinear relationships between features and predicted outcomes. We found notable nonlinear relationships between in-ICU mortality and several selected variables, including PaO2, WBC count, pH, initial pulse rate, lymphocyte count, HCO3−, and PaCO2. This could explain the lower performance of the logistic regression model, a generalized linear model, in predicting in-ICU mortality in patients with severe pneumonia. Although the cross-validation results were not evaluated at fixed FPR levels, the MLP model had the largest difference in AU-ROC between internal validation and cross-validation, indicating that, compared with the other models, the MLP model might be relatively more overfitted to the training set.

Contrary to the performance in the different subgroups at the 10% FPR level, our results in the entire test set demonstrated no significant difference in AU-ROC between the ML models and the logistic regression model. In the partial dependence plots of the variables that contributed the most to the model predictions, linear relationships with in-ICU mortality were observed, especially for the PaO2/FiO2 ratio, urine output, and initial SBP. This might explain the lack of statistical significance in these differences. The SHAP method was used to determine the factors that most influenced in-ICU mortality in the ML models, and the results were similar to those of previous studies: PaO2/FiO2 ratio73,74, CRP level75, urine output76, initial SBP73, PaO273,77, and leukopenia78,79 or leukocytosis80,81.

In addition, using the SHAP method, we found that higher ALP levels and a prolonged PT-INR were associated with a higher risk of in-ICU mortality. ALP can be elevated as an acute-phase reactant in acute infections82. In community-acquired pneumonia, elevated ALP levels were not associated with mortality83; however, in critically ill patients with septic acute kidney injury (AKI), elevated ALP levels are associated with mortality84. In our study, septic shock was found in > 40% of the critically ill patients with pneumonia, and AKI was a common condition. It therefore appears that ALP is related to the severity of impaired renal function resulting from the systemic inflammation caused by pneumonia rather than to the severity of pneumonia itself. With respect to prolonged PT-INR, substantial coagulation abnormalities are commonly observed in patients with sepsis or pneumonia85. The excessive production of thrombogenic tissue factor in sepsis and pneumonia, compared with its low levels under normal conditions86, leads to the development of systemic coagulopathy during pneumonia87.

Our study had some limitations. First, as the models were developed using data retrospectively collected at a single center and were not externally validated, the results have limited generalizability. Although our study provides valuable insights into the performance of the ML models, further studies are needed to assess the generalizability and real-time applicability of these models in predicting in-ICU mortality in patients with severe pneumonia; such studies should include robust external validation using independent datasets and evaluation of model performance in prospective clinical practice. Second, owing to the small sample size and the inclusion of patients admitted over a study period of 6 years, there is a possibility of heterogeneity in patient characteristics and treatment measures, as well as potential biases. Thus, it might be challenging to clearly establish the usefulness of ML approaches, which are largely non-parametric. Furthermore, eight subgrouping variables were adopted, and a large number of stratification variables combined with a small sample size could lead to optimistic results in the internal validation of the models.

Conclusion

Compared with conventional severity-of-illness scoring systems, the LightGBM, MLP, and logistic regression ML models showed better predictive performance for in-ICU mortality in patients with severe pneumonia. Moreover, this study highlights the potential advantages of selecting individual ML models for predicting in-ICU mortality in different subgroups of patients with severe pneumonia.