Introduction

Acute-on-chronic liver failure (ACLF) is characterized by an acute deterioration in liver function with pre-existing chronic liver disease. It could happen due to a hepatic or extrahepatic injury or an unknown reason. ACLF may result in multisystem organ failure and is associated with significant morbidity and mortality. High mortality (>ā€‰50%) has persisted for this condition over the past 20 years1.

There are multiple definitions for ACLF proposed by various organizations and studies, including the Asiaā€“Pacific Association for the Study of the Liver (APASL)2, the European Association for the Study of the Liver (EASL) and the American Association for the Study of Liver Diseases (AASLD)3, the European Association for the Study of the Liver-chronic liver failure (EASL-CLIF) Consortium Acute oN chrONIC liver failure study (CANONIC study)4, and the North American Consortium for the Study of End-Stage Liver Disease (NACSELD study)5. The most recent definition of ACLF was defined in the CANONIC study4. They also proposed a theory to divide ACLF into three categories depending on the underlying liver diseases: type A (without cirrhosis), type B (with compensated cirrhosis), and type C (with decompensated cirrhosis)1.

Scoring systems have been used for prediction of prognosis of patients with ACLF, such as the Child-Turcotte-Pugh score6, the Model for End-stage Liver Disease score (MELDs)7, the Sequential Organ Failure Assessment score (SOFAs)8, the Chronic Liver Failure-SOFA score (CLIF-SOFAs)4, the CLIF Consortium Organ Failure score (CLIF-C OFs)9, and the Chronic Liver Failure Consortium ACLF score (CLIF-C ACLFs)9. Jalan et al., developers of CLIF-C ACLFs, found that CLIF-C ACLFs outperformed other scoring systems in prognostic accuracy9. However, CLIF-C ACLFs was developed based on the CANONIC study, a multiple site study conducted in 29 European hospitals where patients had an extremely low rate of hepatitis B virus (HBV) infection4. It raises a concern whether these scoring systems will have prognostic accuracy in ACLF patients with HBV infection, which accounts for a majority of the ACLF patients in eastern countries2.

PIRO concept (Predisposition, Injury, Response, Organ), being used as a scoring system in patients with sepsis10,11, could comprehensively examine the situation of ACLF. Among ACLF patients, predisposition (P) could be defined as severity of chronic liver disease evaluated through demographics, and ACLF type and MELDs; injury (I) could be explained by the precipitating event including hepatic (i.e., virus, drugs use, alcohol, etc.) and/or extrahepatic events (i.e., infection and variceal bleeding); response (R) was the body reaction to the injuries (i.e., inflammation); and organ (O) was defined as organ failure1.

Here, we conducted a study (1) to develop a new scoring system based on the PIRO concept among ACLF patients with HBV infection, (2) to stratify the patients with different mortality into different clusters using classification tree according to the above scoring system, and (3) to set up a calculator for patients to calculate their own estimated death probability using the above scoring system.

Methods

Study design

Patients were screened and enrolled from January 2012 to October 2017 in the Fifth Medical Center of Chinese PLA Hospital (Beijing, China) after the appropriate approvals were obtained. A total of 1,887 consecutive ACLF patients aged 18 to 64 were admitted to the hospital during the study period. After exclusion of patients (Nā€‰=ā€‰673) who had one or more of the following conditions: HCV infection (Nā€‰=ā€‰30), alcohol liver disease (Nā€‰=ā€‰133), autoimmune liver disease (Nā€‰=ā€‰21), drug-induced liver injury (Nā€‰=ā€‰27), Wilson disease (Nā€‰=ā€‰1), cryptogenic liver disease (Nā€‰=ā€‰113), hepatocellular carcinoma (Nā€‰=ā€‰185), other site carcinoma (Nā€‰=ā€‰11), refused to participate the study (Nā€‰=ā€‰158), and liver transplant (Nā€‰=ā€‰39), a total of 1214 patients were included according to the APASL criteria for ACLF2. 774 patients were finally enrolled in this study according to the diagnostic criteria for ACLF in the CANONIC study4. Causes of exclusion are summarized in Figure S1. We randomly selected 619 patients as the training cohort and 155 patients as the validation cohort. All study procedures were approved by the Institutional Review Board at Fifth Medical Center of Chinese PLA Hospital. Written informed consent was obtained from each patient or his/her legal surrogate before enrollment.

Definitions

ACLF was diagnosed according to the CLIF Consortium Organ Function score (CLIF-C OFs)9. We also categorized ACLF patients into three groups1: type A (without cirrhosis), type B (with compensated cirrhosis), and type C (with decompensated cirrhosis).

Chronic HBV infection12 was defined as hepatitis B surface antigen (HBsAg) positive and antibody to hepatitis B core antigen (anti-HBc) positive. Chronicity was defined by the presence of HBsAg positive forā€‰>ā€‰6Ā months. Cirrhosis was diagnosed by liver biopsy, endoscopic signs of portal hypertension, radiological evidence of liver nodularity, or clinical evidence of previous hepatic decompensation (including ascites, hepatic encephalopathy (HE), and acute variceal bleeding) in patients with chronic liver diseases5.

Organ failure was defined according to the CLIF-C OF scoring system9 (Table S1). Ascites was diagnosed by clinical examination and confirmed by ultrasonography13. HE was defined and graded by West-Haven criteria14. Acute variceal bleeding was diagnosed by Baveno V endoscopic criteria15. We defined infections according to standard criteria16 as follows: (1) positive blood cultures in the absence of any recognized source of infection (spontaneous bacteremia); (2) spontaneous bacterial peritonitis (SBP) was diagnosed in presence of ascitic fluid absolute neutrophil countā€‰>ā€‰250 cells/mm317; (3) radiographic evidence of pulmonary infiltration associated with purulent sputum (pneumonia); (4) urinary WBC countā€‰>ā€‰15 cells per high-power field and positive urine culture (urinary tract infection). The systemic inflammation response syndrome (SIRS) was assessed according to the recommendations of the American College of Chest Physicians/Society of Critical Care Medicine Consensus Conference18. Patients were considered to have SIRS if they fulfilled at least 2 of the following criteria: (1) a core temperature ofā€‰>ā€‰38Ā Ā°C (100.4 Ā°F) orā€‰<ā€‰36Ā Ā°C (96.8 Ā°F); (2) a heart rate ofā€‰ā‰„ā€‰90 Ā beats/min; (3) a respiratory rate ofā€‰ā‰„ā€‰20 Ā breaths/min; or (4) a white blood cell (WBC) count ofā€‰ā‰„ā€‰12,000/mm3 orā€‰ā‰¤ā€‰ 4000/mm3, or a differential count showingā€‰ā‰„ā€‰10% immature polymorph nuclear neutrophil cells (PMNC). Patients with SIRS and infection were considered to have sepsis.

Data collection

We collected the following clinical and demographic information: age, gender, underlying liver disease, clinical presentation, laboratory parameters, and outcomes. Baseline clinical characteristics were obtained within 48Ā h of admission to the hospital or, when indicated, at the time of diagnosis of ACLF. For patients discharged from the hospital, prognostic information was obtained from medical records and telephone contact.

Prognostic scores of ACLF

MELD score was calculated as follows: MELDsā€‰=ā€‰9.6ā€‰Ć—ā€‰ln [creatinine (mg/dl)]ā€‰+ā€‰3.8ā€‰Ć—ā€‰ln [bilirubin (mg/dl)]ā€‰+ā€‰11.2ā€‰Ć—ā€‰ln (INR)ā€‰+ā€‰6.43ā€‰Ć—ā€‰(etiology: 0 if cholestatic or alcoholic, 1 otherwise)7. CLIF-C OFs was summarized based on criteria listed in Table S19. CLIF-C ACLFs was calculated as follows: CLIF-C ACLFsā€‰=ā€‰10ā€‰Ć—ā€‰[0.33ā€‰Ć—ā€‰CLIF-C OFsā€‰+ā€‰0.04ā€‰Ć—ā€‰Ageā€‰+ā€‰0.63ā€‰Ć—ā€‰ln (WBC count)-2]9.

We used the PIRO concept to build up the scoring system. Variables were grouped according to each PIRO component. ā€œPā€ was evaluated by age, gender, ACLF type. ā€œIā€ was characterized by hepatic injury (superimposed on other viruses, alcohol, and drug) and/or extrahepatic injury (surgery, infection, and variceal bleeding). ā€œRā€ was defined by SIRS, SBP, bacteremia, urinary tract infection, and pneumonia. ā€œOā€ was assessed by CLIF-C OFs.

Statistical analysis

Continuous variables were described as meanā€‰Ā±ā€‰standard deviation (SD). Comparisons between two groups were conducted using Studentā€™s t-test or ANOVA for continuous variables when homogeneity of variance is valid, or else using correction t-test or ANOVA. Chi-square test was used for categorical variables when homogeneity of variance is valid, or else using continuity adjusted Chi-Square.

We randomly selected the training cohort (80%) and validation (20%) cohort from the total population. Firstly, we set up the scoring system and prognostic model using the training cohort. Secondly, we used the validation cohort to validate the prognostic accuracy of the scoring system and prognostic model obtained from the training cohort. Finally, we set up the scoring system and prognostic model using the total population in order to increase the prognostic power.

Variables that were associated with mortality in the univariate analysis (p valueā€‰<ā€‰0.5) were screened for the multivariate models. Four separate multivariate logistic regression models, one for each PIRO component (P, I, R and O) with 90-day mortality, were built using stepwise selection on the variables screened in the univariate analysis. Once the models were fitted, the set of four scores from regression coefficients were generated for each patient corresponding to 90-day mortality. Then the set of four scores were included in the final combined PIRO model. We used this logistic regression model to generate the calculator for each patient to predict his/her 90-day death probability. Odds ratio (OR) with 95% confidence interval (CI) were calculated to estimate the associations between various predictive variables and mortality. The area under receiver operating characteristics curve (AU-ROC) was employed to assess the accuracy of the models. Calibration was tested using the Hosmerā€“Lemeshow goodness-of-fit test.

To simplify the computation of the scoring system, the regression coefficients were uniformly rescaled to make the maximum total score 50. The AU-ROCs of the models generated by the simplified scoring systems were identical to those derived from the original regression coefficients.

A classification and regression tree was used to define cut-offs for each PIRO component using the new scores. Each node split decision in the tree was chosen from the possible cut-offs for all components according to Giniā€™s coefficient impurity measure. Each node contained at least 5 alive or dead patients. One sample Z test was used for the comparison of mortality rates between validation cohort and estimated mortality rates obtained from training cohort.

All analysis used a two-sided P-value of 0.05 as statistical significance. Statistical analyses were performed using SAS 9.4 (SAS Institute Inc. Cary. NC. USA).

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of Fifth Medical Center of Chinese PLA General Hospital research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Results

Characteristics of patients with HBV related ACLF in training and validation cohorts

Among patients in training cohort (Table 1), the mean (SD) age was 46.98 (9.65) years and most patients (84.98%) were male. The means (SD) of MELDs, CLIF-C OFs and CLIF-C ACLFs were 26.72 (6.25), 9.25 (1.43) and 41.63 (7.27), respectively. The mortality rate of 90-day was 73.51%. The validation cohort patients showed similar characteristics as the training cohort (Pā€‰>ā€‰0.05).

Table 1 Comparison of characteristics between training cohort and validation cohort.

We set up a clinical scoring system based on PIRO concept and a prediction model for 90-day death probability using the training cohort. The clinical scoring system obtained from the training cohort was validated in an independent validation cohort, which provided similar patterns of death probability distribution in the tree nodes. To increase the prognostic power, we used the total population to build the clinical scoring system based on PIRO concept and a prediction model for 90-day death probability.

Scoring system based on PIRO concept using total population

Distributions of characteristics in each PIRO component by 90-day mortality and ACLF type were shown in Tables 2 and 3, respectively. In the total population, 601 (77.65%) patients were treated with nucleic acid analogs. The reasons some patients did not use nucleic acid analogs were as follows: 1. Patients who met the indications for anti-HBV therapy failed to see a doctor in time for standardized treatment, especially in the areas with poor medical standards. 2. Patients did not meet the indications for anti-HBV therapy before their acute exacerbation of hepatitis B due to injuries, such as infection, surgery, abuse of alcohol or drugs. Age, gender, ACLF type, drug, infection, surgery, variceal bleeding, SIRS, SBP, pneumonia, CLIF-C OFs were associated with 90-day mortality.

Table 2 Characteristics in PIRO components with 90-day mortality.
Table 3 Characteristics in PIRO components with different ACLF type.

Multivariate analysis showed that these variables were independently associated with mortality (Table 4). Variables retained in the final models included age, gender and ACLF type for P, drug, infection, surgery, and variceal bleeding for I, SIRS, SBP, and pneumonia for R, and CLIF-C OFs for O. ACLF type showed a higher performance in predicting 90-mortality (Pā€‰<ā€‰0.0001).

Table 4 Selected variables in multivariate analysis.

Performance of PIRO model for predicting 90-day mortality

The combined PIRO model for 90-day mortality had an AU-ROC (95% CI) of 0.77 (0.73ā€“0.80) (Table 5 and Fig.Ā 1). We also compared the predictive ability of combined PIRO model to current traditional MELDs, CLIF-C OFs, and CLIF-C-ACLFs. The AU-ROC (95% CI) of combined PIRO was higher than MELDs of 0.66 (0.62ā€“0.70), CLIF-C OFs of 0.69 (0.64ā€“0.73), and CLIF-C ACLFs of 0.68 (0.64ā€“0.72) for 90-day mortality (Pā€‰<ā€‰0.0001) (Fig.Ā 2). The Hosmer and Lemeshow test did not show evidence for lack of fit in MELDs, CLIF-C OFs, CLIF-C ACLFs, all four PIRO components, and the combined PIRO models in the training or validation cohorts (pā€‰>ā€‰0.05).

Table 5 Area under the receiver operating characteristics curve (AU-ROC) (95% confidence interval, CI) for predicting 90-day mortality.
Figure 1
figure 1

Accuracy of the PIRO score as compared to each component of PIRO in predicting 90-day mortality of patients with HBV-ACLF.

Figure 2
figure 2

Accuracy of the combined PIRO score as compared to other scoring systems in predicting 90-day mortality of ACLF patients with HBV infection.

Classification and regression tree for 90-day death probability

Rescaled classification scores for each PIRO component was presented in (Table 6). The classification and regression tree was applied to define cut-offs for each score and identify clusters of risk of death, allowing patientsā€™ classification in different risk stages for each variable. Per the cut-offs in the regression tree, each component of the PIRO was categorized into 2 or 3 stages.

Table 6 Scoring system of each PIRO component with 90-day mortality.

The classification and regression tree showed different patterns with different cut-offs for each combination of PIRO components. The estimated probabilities of death with each combination of cut-offs of PIRO could obtained from the trees. The sensitivity and specificity of this tree for predicting 90-day death probability were 89% and 46%, respectively (Fig.Ā 3). The positive and negative predictive values of 90-day death probability were 82% and 59%.

Figure 3
figure 3

Classification tree for 90-day mortalities. The numbers in the leaves represents for the estimated 90-day mortality. P, Predisposition point calculated from Table 6. I, Injury point calculated from Table 6. R, Response point calculated from Table 6. O, Organ failure point calculated from Table 6.

Calculator for each patient calculating the 90-day death probability

Based on the scoring system above, the set of four scores were included in the final combined PIRO model. The sensitivity and specificity of this model for predicting 90-day death probability were 79% and 61% at the biggest Youden index (0.4) with cut-off value of 65%. The positive and negative predictive values of 90-day death probability were 85% and 52%. We used this PIRO model to generate the calculator for each patient to predicting his/her 90-day death probability. Each patient could calculate their estimated 90-day death probability according to the PIRO scoring system using the following formula:

$$90\; \text{day} \; \text{death} \; \text{probability}=\frac{{\text{e}}^{-\left(0.1744\times \text{Ppoint}+0.1746\times \text{Ipoint}+0.1266\times \text{Rpoint}+0.1868\times \text{Opoint}-1.0281\right)}}{1+{\text{e}}^{-\left(0.1744\times \text{Ppoint}+0.1746\times \text{Ipoint}+0.1266\times \text{Rpoint}+0.1868\times \text{Opoint}-1.0281\right)}}$$

Discussion

The PIRO concept has previously been introduced to describe severity of ACLF1,19,20. However, no study had taken a further step to use PIRO concept to establish a prediction model for prediction of ACLF prognosis. This study proposes a prediction model for ACLF patients with HBV infection based on the PIRO concept.

The PIRO concept breakdowns the physiopathology of ACLF, explaining the whole situation of the entity, which could describe more details of the characteristics of patients with ACLF compared to other scoring systems. In our study, we found that the combined PIRO model showed better performance than each PIRO component models, and greater performance than other scoring systems (i.e., MELDs, CLIF-C OFs and CLIF-C ACLFs).

A classification and regression tree presents a convenient way to discriminate patients into different nodes (or clusters) based on their scores of each PIRO component. There were different nodes generated in the trees with different expected mortality rates. According to the PIRO scoring system, each component score of the patient could be calculated, and then the patient could find the corresponding node in the tree according to the scores to estimate his/her probability of death.

Except for the classification tree for predicting 90-day death probability, we also generated a prediction calculator for each patient to calculate his/her own estimated mortality rate according to the four scores in the PIRO scoring system. This calculator was more accurate for predicting each patientā€™s probability of death, compared to the classification tree which was for a group of patients.

The PIRO prediction model can be easily adopted by clinicians because the parameters estimating the scores are available in standard hospital settings. The model provides computed figures, estimates risk of mortality, and allows easy stratification of patients with clinical and prognostic significance. Meanwhile this study validated the value of the proposal of ACLF type. This model included the novel scoring system of CLIF-C OFs. It clearly improved the predictive ability of the main prognostic scores currently available (i.e., MELDs, CLIF-C OFs, and CLIF-C ACLFs). This new classification tree and calculator have the potential to facilitate HBV related ACLF patient care by making an appropriate management plan based on more accurate risk stratification of patients.

Our study represents the first effort to use the PIRO concept to develop a new prediction model for patients with HBV related ACLF using data from a large prospective cohort. The proposal of ACLF type had not previously been validated. In our study, we validated the higher power of using ACLF type in predicting mortality. Not only did we built a classification tree for stratifying patients with different levels of risk, we also presented a calculator for each patient to calculate his/her own outcome probability.

While the PIRO prediction model generated through the training cohort was validated in the validation cohort, its generalizability to other populations warrants further investigation, particularly for ACLF patients without HBV infection. Nonetheless, it represents a major step towards the clinical application of the PIRO concept, expanding its applicability to all patients with ACLF.