Introduction

Sepsis is one of the common causes of death in patients who are admitted to the ICU1,2. Many clinical investigations have mentioned that liver failure in patients with sepsis is rapidly progressive and has high morbidity and mortality rates3,4,5,6. The pathophysiological mechanism of sepsis-associated liver injury (SALI) has not been clearly elucidated, and it may be relevant to the inflammatory cellular response, direct damage by endotoxin, hepatic microcirculatory dysfunction and disturbance of bilirubin-bile acid metabolism7. Sepsis-induced ischemic-hypoxic liver injury and sepsis-associated cholestasis are common manifestations of SALI. Although there is no unified SALI standard, the current recommended standard is largely based on total bilirubin (TBIL) > 2 mg/dL and the development of international normalized ratio (INR) > 1.58,9. The albumin-bilirubin (ALBI) score is a novel liver function evaluation score that has excellent liver function assessment and prognosis prediction performance and has also been widely used to assess other liver diseases in recent years10,11,12.

The ALBI score had better performance than the Child‒Pugh scale and the MELD score in forecasting morbidity and mortality rates in patients with liver disease13,14. Meanwhile, the ALBI score has not been used to assess liver injury in sepsis to date. The SOFA score, LODS score and SAPS II score often have a considerable ability to predict sepsis in patients but rarely in SALI patients15. The high mortality of SALI patients may be related to the lack of available diagnostic tools, especially those capable of accounting for the predictive factors of sepsis and early-onset liver injury15. The nomogram has been widely used as a visualization tool for clinical prognostic studies in survival studies of critically ill patients16,17,18. Accordingly, it is necessary to positively explore a nomogram that can effectively predict liver injury in sepsis.

Materials and methods

Data origin

The Medical Information Mart for Intensive Care IV (MIMIC-IV) database is a public database that incorporates detailed patient information from patients hospitalized at Beth Israel Deaconess Hospital (Bowers, Massachusetts, USA) between 2008 and 2019. A retrospective study was performed using the MIMIC-IV (v 2.21) database. The database is constantly updated with the latest version (v2.21) released on 12 July 2022, and more data have been added to increase the comprehensiveness of the data. Patient consent is not required because the identifying information in this platform is hidden. The investigators are enrolled in a learning program offered by the National Institutes of Health (NIH) and are granted the freedom to access the MIMIC-IV database after passing the requisite assessments (certificate number 40655812). All methods were carried out in accordance with relevant guidelines and regulations. All experimental protocols were approved by the Institutional Review Boards of both Beth Israel Deaconess Medical Center (Boston, MA, USA) and the Massachusetts Institute of Technology (Cambridge, MA, USA). Since all data de-identified in this database to remove patients’ information, the requirement for individual patient consent is not indispensable.

Research population and data extraction

Navicat Premium (version 15.0.12, Hong Kong, China) was developed as an intermediate software to connect with the MIMIC IV database, and the requested information was obtained by structure query language (SQL). We included patients who were above the age of 17 and had been diagnosed with sepsis according to internationally recognized Sepsis-3 criteria (ICD-9 code 99591 and ICD-9 code 99592): (1) Patients with pathogenically positive infections and Sequential Organ Failure Assessment (SOFA) scores ≥ 21.

The study extracted relevant variables from the MIMIC-IV database: (1) basic information about the patient, including age and gender; (2) comorbidity; (3) severity score, including Sequential Organ Failure Assessment (SOFA), Logistic Organ Dysfunction System (LODS), and Simplified Acute Physiology II (SAPSII) score; (4) mean vital signs and worst laboratory results within 24 h in the ICU; (5) infection site; and (6) outcomes, including ICU stay time, hospital stay time, the ratio of 90-day mortality, and the first day of ICU usage. Patients were excluded for: (1) age < 18 years; (2) all types of liver diseases; (3) patients with HIV; (4) pregnant women; (5) patients with malignant tumor; (6) patients with deprived trauma; (7) no biochemical and coagulation test with 24 h after ICU admission; (8) death or discharge from hospital with 24 h after ICU admission. A total of 13,129 eligible patients were ultimately included in our study cohort (Fig. 1).

Figure 1
figure 1

Flowchart of data extraction and study design.

Sepsis-associated liver injury

Based on all eligible patients with sepsis, SALI patients were defined as admission to the ICU for the worst 24-h value total bilirubin (TBIL) > 2 mg/dL and the development of an international normalized ratio (INR) > 1.58,9. Logistic regression was performed on the parameters of eligible SALI patients to generate the predictive model. Data extraction and research concepts are shown in Fig. 1.

Statistical analysis

Continuous variables were represented using the median (IQR) and were measured by the Mann‒Whitney U test. Categorical information was described in terms of numbers (percentages) and was tested by the χ2 test. Parameters that were valid for univariate analysis were taken into account in the multivariate logistic analysis, and their odds ratios (ORs) were calculated. When the VIF was ≤ 2, then a noncollinear relationship between the two groups was present. The 90-day survival rates between the SALI and non-SALI groups were assessed by Kaplan‒Meier analysis, and the log rank test was performed to determine the difference between the two groups, with P < 0.05 considered statistically significant. To address the baseline imbalance, we conducted a sample analysis using propensity score matching (PSM) and calculated the differences in 90-day survival times between SALI patients and non-SALI patients.

SALI patients were randomized into training and validation cohorts at a ratio of 7:3. Statistically significant clinical variables from the training cohort were included in a multivariate logistic regression analysis with reverse stepwise selection using the likelihood ratio test and Akaike information criterion (AIC) as a stopping rule19. The multivariate analysis regression results from the training cohort were used to develop a nomogram to assess 90-day mortality according to Occam’s Razor, which means that a successful model should have a minimum number of metrics to illustrate the most research targets in the study20. The calibration of 1000 bootstrap resamples was assessed for the predictability of the nomogram, and Harrell’s concordance index (C-index) was used to determine the predictive performance of the model.

The study compared the performance of the SOFA, LODS, SAPS II and ALBI scores with the expression of the nomogram using an area under the curve of the receiver operating characteristic (AUROC) and decision curve analysis (DCA). In addition, integrated discrimination improvement (IDI) and net reclassification improvement (NRI) indices were applied to enhance the predictive model and the accuracy of the predictive model.

Stata software (Stata/MP V.17, Texas, USA) and R software (V 4.2.1, Vienna, Austria) were used for statistical analysis. All parameters with ≥ 30% missing data were excluded. For missing data, we used the mice package to obtain 5 packets by multiple interpolation and merged them. In addition, we also used packages such as MatchIt, pROC, survival, survminer, Formula, ggplot2, Hmisc, glmnet, forestplot and IswR in R software. All analyses were performed according to the Transparent Reporting of Multivariate Predictive Models for Individual Prognosis or Diagnosis (TRIPOD) guidelines, which refer to the principles and rationale for analysis21.

Ethics statements

There is not any human participants who are involve in the study. Our experiment was performed in accordance with relevant guidelines and regulations.

Ethics approval and consent to participate

Patient consent is not required because the identifying information in this platform is hidden. The investigators are enrolled in a learning program offered by the National Institutes of Health (NIH) and are granted the freedom to access the MIMIC-IV database after passing the requisite assessments (certificate number 40655812).

Results

Clinical characteristics

We enrolled a total of 34,329 patients with sepsis from the MIMIC-IV clinical database, and subsequently screened 13,129 patients with sepsis based on inclusion and exclusion criteria. In this study, 1039 SALI patients were finally enrolled. Age, baseline information, mean value of test results, worst value of vital signs and results during the first day in the ICU between SALI patients and non-SALI patients are shown in Table 1. SALI patients have a median age of 65 (51, 77) years, and non-SALI patients have a higher median age of 68 (56, 79) years. Regarding comorbidities, we observed that SALI patients had lower rates of myocardial infarction, chronic pulmonary disease, cerebrovascular disease, paraplegia, and renal disease than non-SALI patients but were more likely to have congestive heart failure, peripheral vascular disease, rheumatic disease, and peptic ulcer than non-SALI patients. Additionally, the overall severity level of diseases was worse in SALI patients than in non-SALI patients, and the differences were statistically significant (P < 0.001), including the SOFA score, LODS score and SAPS II score.

Table 1 The characteristics of patients included at first ICU admission.

We also observed a considerable increase in the use of continuous renal replacement therapy (CRRT), vasopressors and mechanical ventilation (MV) during the first 24 h in the ICU in SALI patients, all P < 0.001. In addition, the majority of SALI patients had worse vital signs and laboratory parameters. The proportion of lung and urinary tract infections was high in patients with non-SALI, but the proportion of bacteremia infections was higher in SALI patients than in non-SALI patients. Last, in the study endpoints, outcomes, such as ICU stay time and hospital stay time, were longer in SALI patients, and the number of deaths in 90 days for patients who used vasopressors, CRRT and MV was higher.

SALI is an independent risk factor for 90-day mortality in septic patients

After the correction, SALI was detected as an independent risk indicator for 90-day mortality in patients with sepsis by logistic multivariate analysis [odds ratio (OR) 1.158, 95% CI 0.970–1.240, P = 0.034] (Supplementary Table 1). Our data were matched using propensity score matching (PSM) because the SALI patient group and the non-SALI patient group had significant discrepancies. After matching, the number of patients in the matched and unmatched groups was 1039. The 90-day Kaplan–Meier survival analysis was performed before and after matching, with log-rank P < 0.001 before PSM and P = 0.038 after PSM. Consequently, there was a substantial distinction in 90-day mortality between the SALI patients and non-SALI patients, whether matching was performed or not (Supplementary Fig. 1).

Construction of the nomogram in the training set

After a series of screenings, resulting in 1039 SALI patients, a random allocation of 70% was given as the training set and the other 30% as the validation set, 727 and 312, respectively. There was no noticeable difference between the training and validation sets with the exception of the skin and soft tissue infection (SSTI) parameter (Supplementary Table 2). Our research was performed based on a training set of 727 patients. In univariate analysis, basic patient information, comorbidities, first day of ICU use, mean vital signs during 24 h in the ICU, worst laboratory results and site of infection were screened for statistically significant parameters that were included in the logistic multivariate analysis. And parameters finally were added to the multivariate logistic regression analysis, including age, the use of vasopressors, mean arterial pressure, mean oxygen saturation (spo2), lactate maximum, blood urea nitrogen (BUN) maximum, total bilirubin (TBIL) maximum, albumin minimum, red blood cell distribution width (RDW) maximum and activated partial thromboplastin time (APTT) maximum, on which the forest plots were charted (Table 2, Fig. 2). A total of 10 variables illustrated in the forest plot above all followed the AIC principle, and all of the continuous variables had a VIF ≤ 2, so that there were covariance variables in Fig. 2. The nomogram is based on multivariate logistic regression analysis in which the 10 indicators, and are then plotted on the same plane with scaled lines to represent the interrelationships between the variables in the prediction model. The predictive value of the outcome event is calculated by converting the total score into a function of the probability of the outcome event. Nomogram was generated to determine the propensity of 90-day mortality in SALI patients (Fig. 3).

Table 2 Factors independently associated with 90-day mortality of patients with SALI by univariate logistic regression analysis in the training cohort.
Figure 2
figure 2

Factors independently associated with 90-day mortality in SALI patients in the training cohort by multivariate logistic regression analysis with forest plots.

Figure 3
figure 3

Nomogram for estimating the risk of 90-day mortality in SALI patients. By developing the scale score, the change in each variable is described by the forest plot, and then the total score is computed to predict the possibility that an event will happen.

Verification of nomogram predictability

The nomogram was compared with SOFA, LODS, SAPS II and ALBI to monitor 90-day mortality in SALI patients. The AUROC for the training and validation groups of the nomogram was 0.778 (95% CI 0.730–0.799) and 0.804 (95% CI 0.713–0.820), respectively, which were higher than several other scores (Table 3, Fig. 4). In the training set, R2 score = 0.298, Brier score = 0.176, and in the validation set, R2 score = 0.356, Brier score = 0.190. There was also no linear correlation between the continuous variables in the model (Supplementary Fig. 2). The nomogram generally showed excellent consistency with the actual 90-day death rates depending on the calibration curves in the predicted and realistic death rates (Fig. 5). In addition, the IDI and NRI index for the nomogram were also superior to the SOFA, LODS, SAPS II and ALBI scores in the training and validation groups, suggesting that the nomogram had a superior predictive probability for 90-day mortality prediction (Table 3).

Table 3 Comparison of models for predicting the 90-day mortality of SALI.
Figure 4
figure 4

The ROC curve of the nomogram. (a) Training set; (b) validation set.

Figure 5
figure 5

The calibration curve of the nomogram. (a) Training set; (b) validation set.

Clinical use of the nomogram

In the training set, nomogram-directed medical interventions could add more net benefit than the SOFA, LODS, SAPS II and ALBI scores when the threshold probability (PT) was 0.2–0.6 (Fig. 6a). In the validation set, obtaining more net benefit from nomogram-directed treatment than others was possible when the PT was between 0.3 and 0.8 (Fig. 6b). Simultaneously, the clinical benefit curves are given (Supplementary Fig. 3). The red line (number of high-risk people) shows the number of patients at high risk at each of the risk thresholds for 1000 patients, and the blue line (number of high-risk people with outcomes) shows truthful positive patient counts below the risk threshold.

Figure 6
figure 6

The decision curve analysis of the nomogram. (a) Training set; (b) validation set.

Ninety-day mortality risk from the nomogram

The nomogram is a predictive model with excellent sensitivity, specificity and predictive value in identifying 90-day mortality in SALI patients (Supplementary Table 3).

Discussion

Sepsis in the health care field has remained a problematic subject in general health services, as its mortality rate is high, ranging from 20 to 50%22,23,24. Hepatic failure is a component of MODS in sepsis and is often associated with a poor prognosis, although its exact incidence is unknown25,26. Liver injury can be associated with an exacerbation of multiple organ failure27.

The results of our research reported a morbidity rate of 7.91% (1039 of 12129 patients) and mortality rate of 44.10% (458 of 1039 patients) in patients with SALI, suggesting a lower incidence and higher death rate than the Liu et al.15 and Kobashi et al.27 clinical reports. We found that Liu’s team used the older data version 1.0 of MIMIC-IV, which only included liver function indicators in the SALI diagnostic criteria and did not include the more novel combined liver function and coagulation criteria, while that allowed the inclusion of SALI diagnosed any time within 48 h of hospitalization, but our study only included the first 24 h of ICU MIMIC-IV in version 2.0. Therefore, we hypothesized that these differences resulted in a lower patient rate but a higher mortality rate than the study by Liu’s team. Again, in the 2013 investigation by Professor Kobashi, a wider scale of sepsis 2.0 criteria was used, and the standards for inclusion in the SALI group only covered liver function, without coagulation. The standards failed to assess more comprehensively the outcome of the inherent injury and the subsequent injury caused by SALI and therefore also reached the same conclusion as Liu’s team. However, these teams are all aware that a tool is not enough to predict prognosis in sepsis patients and liver injury patients, consistent with our idea.

In this article, the 10 indicators are age, the use of vasopressors, mean arterial pressure, mean SpO2, lactate maximum, BUN maximum, TBIL maximum, albumin minimum, RDW maximum and APTT maximum, from which we found that the BUN maximum has the largest weight. In accordance with its contribution to the nomogram, BUN level was the maximum factor of 90-day mortality in SALI patients. BUN levels are strongly correlated with mortality in patients with sepsis, and sepsis patients have a higher rate of death when their BUN levels increase28,29,30,31. A large study31 found that this association disappeared at the 41.1 mg/dL turning point. For every 10 mg/dL increase in BUN level, sepsis patients had a 29.8% increase in 30-day mortality, while patients with a BUN level ≥ 41.1 mg/dL experienced only a 4.5% increase in 30-day mortality. The BUN level can be utilized as an easy-to-use and rapid measure for early identification in sepsis patients because early and effective management is crucial in this condition. In a study of 2917 patients with sepsis, BUN level (1.08 [1.07–1.09]) was a strong factor of the incidence of AKI with sepsis, and the most sensitive indicator of AKI occurrence was BUN level32. Although the mechanism by which elevated BUN levels can contribute to the poor prognosis associated with sepsis is not clearly defined, there are still several possible reasons that can be explained. Patients in the severe stage are in a hyperproteolytic metabolic state33, and BUN levels increase when protein is excessively catabolic or the renal filtration rate is reduced. Therefore, BUN levels can play an important role in body protein catabolism34 and are a sign of renal damage. The rate of protein catabolism is significantly increased in sepsis patients35, and sepsis is usually accompanied by acute kidney injury36. These factors can contribute to elevated BUN levels in patients with sepsis. The BUN level is more widely used in sepsis and other areas of sepsis and is being used for the first time in patients with SALI and was first reported in adult SALI patients in our study.

There are several other variables in the nomogram that play an important role. A cohort study showed that age was an independent factor in the incidence of SALI patients. Among subjects not over 39 years old, the proportions of “cholestatic”, “hepatocellular”, and “shock liver” disease were 22.2%, 66.6%, and 11.1%, respectively. Additionally, among subjects who were at or over 40 years of age, these percentages were 51.5%, 15.9%, and 32.6%, respectively27. In a recent study, a scholar referred to age as an important component of mortality involved in SALI patients15. With the increasing severity of sepsis disease, a large proportion of patients have circulatory instability and require ascending agents to maintain hemodynamic stability37,38,39, yet the inclusion of vasopressors and mean arterial pressure were first reported in sepsis-associated liver injury in our study. Blood lactate, an indicator of tissue perfusion, is often used to provide feedback on survival in patients with sepsis40,41, and some high-quality studies on the effect of lactate on mortality in SALI patients have been identified only in children42,43; this is the first report in adult SALI patients. Some of the other variables, such as TBIL, albumin, RDW and APTT, which are frequently used in research to predict sepsis and sepsis-associated injury14,17,18,44,45, are rarely mentioned in studies of sepsis-associated liver injury, especially in adult reports15. Our findings suggest that all of these variables independently predict mortality from sepsis-associated liver injury.

Initially, clinical prediction models were used for oncology patients46,47,48, and as an increasing number of researchers continue to develop their understanding of clinical prediction models, they are commonly used in critical illness49,50. There is no authoritative standard for the prognostic assessment of patients with sepsis-associated liver injury to date, and there are few large data studies with a very strong evidence-based medical basis. Our study contains a visual nomogram based on 10 clinically readily available and commonly used parameters that were extracted from a large database called MIMIC-IV, and the efficiency of the current nomogram underwent thorough comprehensive evaluation and internal validation.

Professor Fragaki stated that the ALBI score is more appropriate for the identification and prognostic evaluation of early-onset liver dysfunction in a recent study51; therefore, we chose the ALBI score to assess liver function in this article. The SOFA score, LODS score and SAPS II score are widely used in mortality risk analysis and prognosis assessment of patients with sepsis17,45,52,53. It is rare to use a model to predict both sepsis and liver injury at the same time, except in the few reports where the level of evidence is not high15. Nevertheless, the validity of these scores in prescribing the risk of 90-day mortality in SALI patients still remains unknown. To compare these scores, we evaluated the hypothesized nomogram’s predictive accuracy with various widely used clinical scores, including the SOFA, LODS, SAPS II, and ALBI scores, based on the AUROC. The nomogram performed best in all of the tools. The result that the nomogram could successfully distinguish between true positive patients at high risk of 90-day mortality in both the training and validation sets was further evidenced by DCA curves, IDI, and NRI indices. The nomogram in this case performed better in differentiating the risk of 90-day mortality, as supported by the high C-index (0.778) in the training set and the C-index (0.804) in the validation set, as well as by acceptable calibration. By developing the scale score, the change in each variable is described from the forest plot, and then the total score is computed to predict the possibility that an event will occur.

Several limitations include the following: (1) There is a lack of a definitive definition of SALI. (2) The confounding factors can occur with the inclusion of each variable, which can affect the results because this study is retrospective. (3) According to the definition of sepsis, the addition of specific pathogenic culture results might have improved the predictive strength of the model. Our study only contains parameters related to the first day in the ICU, and it might have been better to have dynamic, continuous observational analysis data on indicators during the ICU stay. (4) When we imported data from the MIMIC-IV database, we found that many variables were missing, and even some missing data were greater than 50%, and these parameters might have an impact on our findings. (5) The MIMIC-IV database is only a single-center study. In the future, multicenter research can be performed in different countries and regions with different economic levels, and external databases can be used for validation. These are endeavors we will pursue in the future.

Conclusion

The nomogram is a comprehensive performance of 10 indicators that have been tested for meaningfulness, including age, usage of vasopressor on the first day of ICU stay, mean arterial pressure, mean spo2, lactate maximum, BUN maximum, TBIL maximum, albumin minimum, RDW maximum and APTT maximum, which can be easily used for accurate measurement of 90-day mortality in SALI patients. This nomogram may be extremely valuable in controlling the progression of SALI when usual measures have been conducted and in ultimately improving the prognosis of SALI patients.