Introduction

Paraquat (1,1′-dimethyl-4,4′-bipyridinium dichloride [PQ]) is a non-selective contact herbicide widely used in many countries since the 1960s1. Acute PQ poisoning (APP) has been associated with suicides and accidents, with PQ causing sequential organ damage/failure through oxidative stress and systemic inflammatory responses. The most prominent manifestations include acute lung and kidney injuries2. The mortality rate of patients with APP remains tremendously high (50–90%)3,4,5,6. Consequently, PQ has been banned in most countries, including China. However, PQ is still legally used in some areas7,8.

Early assessment of the severity and prognosis of patients with APP is crucial to guide treatment (hemodilution, immunosuppressive therapy)9,10. Currently, no standardized method exists for predicting the prognosis of APP. Previous studies have demonstrated the utility of the Severity Index of PQ Poisoning (SIPP) as a specific scoring system for prognosticating APP11. However, calculating the SIPP score necessitates the measurement of serum PQ concentration, which demands costly equipment. Consequently, its usage is limited in most countries, particularly in developing nations12,13. Furthermore, research indicates that the SIPP score might overestimate the survival rate of patients with APP14, especially those admitted to the hospital more than 24 h after poisoning15. Prior research has identified complete blood cell count16, liver and kidney function indicators17, serum anion gap18,19, and chest computed tomography20 as independent prognostic indicators for APP. Nevertheless, these indicators are relatively individualistic and demonstrate limited predictive power. The currently available scoring systems for predicting patient prognosis, including the Acute Physiology and Chronic Health Evaluation II (APACHE II) score21, Sequential Organ Failure Assessment (SOFA) score22, and Poisoning Severity Score (PSS)23, are designed for critically ill patients rather than individuals with low exposure or mild symptoms.

Furthermore, due to the intricate calculation process, these scoring systems are incapable of promptly predicting the mortality rate or conducting risk assessments for PQ poisoning patients24. Accordingly, it is imperative to develop an efficient, straightforward, and universally applicable predictive model based on commonly employed laboratory indicators. A nomogram is a simple multivariate prediction model that incorporates multiple variables affecting prognosis to calculate an individual’s survival probability25. Therefore, this study aims to develop a nomogram model based on clinical data collected within 24 h of admission that can identify, at an early stage, patients who are at high risk of in-hospital mortality due to APP.

Methods

Setting

We performed a retrospective, observational cohort study in an emergency department (ED) with more than 3000 beds in an extensive tertiary care teaching hospital in Chengdu City, Sichuan Province, China, per the amended Declaration of Helsinki. The West China Hospital approved the study of the Sichuan University Biomedical Research Ethics Committee (No. 2022-1591). Due to the retrospective nature of the study, the need of informed consent was waived by the Sichuan University Biomedical Research Ethics Committee. All analyses were performed following the Transparent Reporting of a multivariate prediction model for Individual Prognosis or Diagnosis statement26.

Patients

We conducted a study involving patients diagnosed with APP who were admitted to the ED of West China Hospital, Sichuan University, between September 1, 2010, and January 31, 2022. Inclusion criteria comprised of patients aged ≥ 14 years, those with poisoning via gastrointestinal intake, and individuals with PQ plasma concentrations ≥ 0.01 mg/L by high-performance liquid chromatography. Exclusion criteria encompassed patients whose PQ concentrations were not detected in the plasma, those with incomplete clinical data, those with other concomitant poisoning (such as alcohol poisoning), and pregnant women.

For patients who developed symptoms within 6 h of onset, gastric lavage was recommended. Those who developed symtoms within 12 h of onset were administered activated charcoal via oral or nasogastric route. Patients with positive concentrations of PQ in blood tests were advised to undergo blood purification treatment, including blood perfusion, hemodialysis, or continuous venovenous hemofiltration. After excluding contraindications such as gastrointestinal bleeding, all patients should be given methylprednisolone sodium succinate at a dosage of 80 mg/day for anti-inflammatory treatment through intravenous infusion27.

Predictors

We collected anonymous clinical data within 24 h of admission to the Emergency Department (ED) from electronic medical records. The collected data comprised the following variables: time from poisoning to treatment at the study site, sex, age, level of consciousness (LOC), heart rate (HR), respiration rate (RR), systolic blood pressure (SBP), diastolic blood pressure (DBP), white blood cell (WBC) count, neutrophil-to-lymphocyte ratio (NLR), monocyte-to-lymphocyte ratio (MLR), platelet count (PLT), total bilirubin, alanine aminotransferase (ALT), aspartate aminotransferase (AST), alkaline phosphatase (ALP), blood urea nitrogen (BUN), creatinine, cystatin C, carbon dioxide combining power (CO2CP), blood potassium, plasma PQ concentrations, and SIPP. Carbon dioxide combining power (CO2CP) measures a substance or solution's ability to react with and bind carbon dioxide (CO2) molecules. It quantifies the substance's capacity to combine with CO2 chemically. The normal range of CO2CP typically falls between 22 and 30 milliequivalents per liter (mEq/L)28. Decreased LOC is defined as a Glasgow Coma Scale (GCS) score below 1529. The severity of PQ poisoning was estimated quantitatively by using the SIPP, which was calculated by multiplying the elapsed time (hours) from ingestion to arrival by the serum PQ level(μg/ml)11. All patient data were anonymized and de-identified. Variables were treated as continuous variables, except for sex and disturbance of consciousness, which were binary variables. Finally, the continuous variables used to construct the nomogram were transformed into categorical variables based on the actual data. This was achieved by categorizing them based on the normal upper and lower bounds, median, tertiles, and quartiles.

Clinical outcome

The patients were divided into two groups: survivor and non-survivor group, according to the occurrence of in-hospital death. The in-hospital mortality discussed in this study refers to all-cause mortality of APP patients, including deaths in the ED.

Statistical analysis

Data were analyzed using R version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria). A two-sided P value < 0.05 indicated statistical significance. Data are presented as medians (interquartile ranges) for continuous variables and as numbers (percentages) for categorical variables, as appropriate. The non-parametric Mann–Whitney U test, chi-square analysis, and Fisher’s exact test were used to test for differences between groups, as appropriate. In this study, grouping was achieved through simple random non-relaxation sampling using the simple_ra() function in the R software. All the baseline characteristics were comparable between the training and validation cohorts.

In the training cohort, the least absolute shrinkage and selection operator (LASSO) regression was used to decrease the potential collinearity of variables assessed from the same patient and the overfitting of variables. The latent variables in the LASSO regression analysis were included in the multivariate analysis of stepwise forward selection to determine independent risk factors affecting in-hospital mortality. The results are reported as odds ratios (ORs) and 95% confidence intervals (95% CI). A simple nomogram based on the independent risk factors was developed to predict the probability of mortality. The prediction model was assessed using the concordance index (C-index), area under the receiver operating characteristic (ROC) curve (AUC), calibration curves, decision curve analysis (DCA), and clinical impact curves for both the training and validation cohorts30,31,32.

Results

Patients

A total of 1708 patients with APP presented to our ED, of whom 984 were excluded. Finally, 724 patients who met the inclusion criteria in this study were randomly divided into two groups: training and validation cohorts group using a 7:3 ratio (Fig. 1); among them, 360/724 patients (49.7%) died. In terms of demographics, there were 325/724 (44.9%) males, with a median age of 33 (23–42) years. Among 506/724 patients in the training group, 44.1% were male, with a median age of 29 (22–42) years, and 253/506 (50%) patients died. Of the 218/724 patients in the validation group, 46.8% were male, with a median age of 31 (23–42) years, and 107/218 (49.1%) patients died. No statistical significant differences were observed between the two groups (P > 0.05) (Table 1).

Figure 1
figure 1

Flowchart of the study.

Table 1 Comparison of baseline characteristics between the training and validation cohorts.

Model development

In the demographic and clinical characteristics analysis of 506 patients, 22 features were reduced to nine potential predictive features. These features had non-zero coefficients in the LASSO regression model, which included age, LOC, WBC count, NLR, MLR, ALT, creatinine, CO2CP, and plasma PQ concentrations (Fig. 2a,b).

Figure 2
figure 2

Demographic and clinical feature selection using the LASSO binary logistic regression model. (a) Optimal parameter (λ) selection in the LASSO model used 10-fold cross-validation via the minimum criteria. A partial likelihood deviance (binomial deviance) curve was plotted against log(λ). Dotted vertical lines are drawn at the optimal values using the minimum criteria and 1SE of the minimum criteria (1-SE criteria). (b) LASSO coefficient profiles of nine features. A coefficient profile plot was constructed against log(λ) sequence. A vertical line was drawn at the value selected using 10-fold cross-validation, where the optimal lambda resulted in eight features with non-zero coefficients. LASSO least absolute shrinkage and selection operator, SE standard error.

Construction of nomogram

Multivariate analysis revealed several independent risk factors for in-hospital mortality upon admission, including decreased LOC, NLR, ALT, creatinine, CO2CP, and plasma PQ concentrations. The detailed ORs and 95% CI of the multivariate analysis are summarized in Table 2.

Table 2 Risk factors for in-hospital mortality in patients with acute paraquat poisoning.

Therefore, these six factors were used to construct the prediction model, as shown in Table 2. A simple nomogram was developed to predict the probability of death based on the independent risk factors (Fig. 3a). The NLR values were categorized into quartiles as follows: category 1 for NLR < 6.2, category 2 for NLR 6.2–10.94, category 3 for NLR 10.94–18.02, and category 4 for NLR > 18.02. ALT values were categorized based on the upper limit of normal as category 1 for ALT 0–39 IU/L and category 2 for ALT ≥ 40 IU/L. Additionally, categorical variables were created based on the median value of creatinine, resulting in category 1 for creatinine ≤ 110 μmol/L and category 2 for creatinine > 110 μmol/L. Finally, PQ values were categorized into quartiles: category 1 for PQ ≤ 0.33 g/mL, category 2 for PQ 0.33–1.66 g/mL, category 3 for PQ 1.66–11.15 g/mL, and category 4 for PQ > 11.15 g/mL. As shown in the nomogram, each factor was assigned a point, and the total nomogram points were calculated by summing the individual points of all predictors. The relationship between the total points and the probability of death is presented at the bottom of the nomogram. As an example, CO2CP was considered as a continuous variable. Where a decrease of 5 units corresponded to an approximate 35 points increase in the risk score. The nomogram depicts the predicted probability of in-hospital death resulting from APP, measured on a scale of 0 to 300. Draw a vertical line upward and assign labels to signify the respective points for each covariate. Repeated this procedure for each covariate to ascertain the cumulative.

Figure 3
figure 3

(a) Nomogram for in-hospital mortality in patients with acute paraquat poisoning. Decreased LOC (GCS score < 15) (0, no; 1, yes), NLR (1, < 6.2; 2, 6.2–10.94; 3, 10.94–18.02; 4, > 18.02), ALT (1, 0–39; 2, ≥ 40 IU/L), CREA (1, ≤ 110; 2, > 110 µmol/L), PQ (1, ≤ 0.33; 2, 0.33–1.66; 3, 1.66–11.15; 4, ≥ 11.15 g/mL). (b) Calibration curve of the nomogram in the training set. (c) Calibration curve in the validation set. LOC level of consciousness, GCS glasgow coma scale, NLR neutrophil-to-lymphocyte ratio, ALT alanine aminotransferase, CREA creatinine, CO2CP carbon dioxide combining power, PQ paraquat.

Assessment of nomogram

The calibration curve in Fig. 3b,c suggests high consistency, demonstrating that the model’s predicted probabilities are close to the observed actual probabilities. The bias-corrected C-indices for the training and validation cohorts were 0.933 and 0.947, respectively.

In the training and validation cohorts, the C-indices were 0.953 (95% CI 0.936–0.970) and 0.947 (95% CI 0.920–0.974), respectively, indicating excellent accuracy. The ROC curves and AUC for the training and validation cohorts are shown in Fig. 4a,b. The DCA demonstrated that the nomogram had a superior overall net benefit across a wide range of practical threshold probabilities (Fig. 5a,b). In addition, we plotted clinical impact curves to predict improved probability stratification for a population size of 1000. The predicted probability coincided with the actual probability in the training and validation cohorts (Fig. 5c,d).

Figure 4
figure 4

ROC curve of the nomogram for the training (a) and validation (b) cohorts. ROC receiver operating characteristic, AUC area under the ROC curve.

Figure 5
figure 5

Nomogram decision curve analysis for the training (a) and validation (b) cohorts, and clinical impact curves for the training (c) and validation (d) cohorts.

Construction and comparison of other predictive models

Since plasma PQ concentrations accounted for much of the model and some primary hospitals could not detect them, we used LASSO and multivariate logistic regression to construct another model without PQ plasma concentrations. Finally, we determined the following independent risk factors: LOC, age, WBC count, MLR, ALT Plasma PQ concentrations, and CO2CP. Concurrently, we included the SIPP to compare the effectiveness of the three prediction models and construct the ROC curve (Fig. 6).

Figure 6
figure 6

Comparison of other predictive models. m1, nomogram of predictive models, AUC: 0.953 (95% CI 0.936–0.970); m2, predictive model excluding plasma PQ concentrations, AUC: 0.882 (95% CI 0.853–0.911); m3, SIPP, AUC: 0.909 (95% CI 0.883–0.934); AUC area under the receiver operating characteristic curve, SIPP severity index of paraquat.

Discussion

Timely identification of the severity of patients with PQ poisoning is of utmost importance in clinical practice. It assists in formulating targeted treatment plans, allocating medical resources efficiently, and potentially enhancing patient prognosis. Nevertheless, currently, no universally accepted method exists to comprehensively evaluate the correlation between clinical indicators and the survival outcomes of patients with APP. This study bridges this gap by developing a prediction model that evaluates the risk of in-hospital mortality using clinical data collected during admission. The created nomogram exhibited exceptional performance in both the training and validation cohorts, with an AUC exceeding 0.9. The included indicators can be readily acquired upon admission, and the nomogram score can be calculated with ease. These characteristics render the model appropriate for swift and uncomplicated clinical implementation, facilitating early-stage evaluation and prognosis prediction. Its convenience further encourages widespread adoption. Additionally, based on our comprehension, the sample size employed in this study is one of the largest worldwide, and the constructed model has achieved near-perfect prediction performance. Moreover, after comparing the nomogram to SIPP, we have ascertained the superiority of our prediction model.

Prior nomogram models developed to predict the prognosis of APP patients primarily originated from the research conducted by Lu Shan. They retrospectively included a total of 80 cases of patients with APP, utilizing serological markers and imaging examinations to construct their nomogram. The AUC in their training set and validation set were 0.953 (95% CI 0.936–0.970) and 0.947 (95% CI 0.920–0.974), respectively. In comparison to their study, our current study encompasses an expanded sample size, a wider array of accessible indicators and demonstrates exceptional predictive capacity.

Our study revealed a high mortality rate in patients with decreased LOC due to APP, possibly related to toxic encephalopathy or hyperemia. Previous research has reported that PQ can stimulate glutamate efflux, leading to excitotoxicity33. Animal studies have demonstrated PQ’s ability to induce α-synuclein upregulation, promote aggregate formation, and activate microglial34.

The NLR is a readily detectable inflammation marker using routine blood tests. It has been used to assess inflammatory-related lesions such as tumors, ischemic stroke, and coronary artery disease35,36,37. Neutrophils are important members of the human immune system that have gradually gained attention for their role in APP. Studies have shown that PQ can induce rapid expression of neutrophil chemoattractant proteins in bone marrow mesenchymal stem cells. Consequently, neutrophils rapidly increase in the blood and accumulate in the lungs, promoting the production of chemokines and pro-inflammatory factors (such as tumor necrosis factor α and interleukin 8) and activating the inflammatory response. Simultaneously, alveolar macrophages generated by neutrophils initiate immune responses and generate reactive oxygen species, leading to cellular nicotinamide adenine dinucleotide phosphate (reduced coenzyme II) depletion and cell membrane lipid peroxidation, thereby promoting the expression of pro-fibrotic genes in fibroblasts and resulting in pulmonary fibrosis38,39. Lymphocytes play a central role in regulating inflammatory responses in the human body. However, the specific mechanism through which PQ induces lymphocyte degradation remains unclear. Whether PQ inhibits the cellular immune function requires further investigation.

Our study showed that liver injury was an independent risk factor for death in patients with APP, which contradicts the findings of previous studies showing that toxic hepatitis is common after PQ exposure. Toxic hepatitis appears mild and transient in scope but is associated with higher complication rates, including respiratory and renal failure40.

PQ penetration into tissues and organs can cause a series of oxidative stress reactions, generating a large amount of reactive oxygen species and leading to organ damage or failure41. The kidney is the main excretory organ for PQ, eliminating 90% of PQ within 12–24 h. Consequently, patients with APP often present with acute kidney injury characterized by elevated creatinine42. In addition, renal dysfunction can significantly reduce the PQ excretion rate and increase its accumulation in the body, forming a vicious circle43,44. Further studies are needed to confirm whether correcting kidney injury improves prognosis.

CO2CP refers to the plasma CO2 content measured after isolating plasma from venous blood samples at room temperature and balancing it with the alveolar air of healthy people45. Two major acid–base disturbances, respiratory acidosis and metabolic alkalosis, both of which can result in increased CO2CP, are common in patients with respiratory diseases. If respiratory diseases such as chronic obstructive pulmonary diseases are excluded, the impact of the respiratory acid–base balance on CO2CP can be minimized46. In contrast, reduced CO2CP concentrations suggest metabolic acidosis47 or respiratory alkalosis48, both of which are indicators of poor outcomes. Most often, decreased CO2CP indicates the presence of metabolic acidosis; however, it could also reflect a decline in bicarbonate concentration as compensation for respiratory alkalosis. Therefore, it is considered a broad indicator of lung and kidney damage. In this study, the CO2CP of the mortality group was significantly lower than that of the survival group (P < 0.05). As the lungs and kidneys are important target organs of APP, this index can be regarded as a comprehensive evaluation index.

Considering that some primary hospitals cannot detect plasma PQ concentrations, we constructed a predictive model for plasma concentrations without PQ and compared it with SIPP. The second model showed the AUC of 0.882 (95% CI 0.853–0.911). Although the performance of the second model is not as good as that of the first, it is still worth promoting.

The current study has some limitations. First, it was a single-center retrospective study with selection bias; furthermore, only 724 patients were included because of missing data, which may limit the scalability of the model. Second, this study only included the clinical data of patients with APP at the time of enrollment and did not combine the clinical data after treatment to evaluate patient survival after discharge, which may have affected the results. Owing to limitations in clinical data, the GCS classification in this study was restricted to < 15/15. However, in future studies, we aim to collect more comprehensive data to improve nomogram precision. Moreover, because of the severe lack of clinical data and absence of indicators of lung injury, this study was performed retrospectively. The model must be updated when more multicenter data become available.

Conclusion

We developed a predictive model for in-hospital mortality in patients with APP. The nomogram, which includes six risk factors with favorable predictive accuracy, discrimination, and clinical utility, allows for simple and rapid individual patient risk estimates. It can be used on admission to the ED to predict mortality and facilitate early risk stratification and actionable measures in clinical practice after further external validation.