Introduction

Trauma is the primary cause of mortality in children, and extensive research has been conducted to identify effective strategies for reducing the resulting morbidity and mortality rates1,2. Accurately measuring the severity of injuries is crucial for assessing the quality of care provided to children with trauma and for conducting research on their outcomes3. Despite the guidelines put forth by the American College of Surgeons (ACS) Committee on Trauma (COT), the triage of children still exhibits considerable inconsistency. At present, there is a lack of consensus on a universally accepted pediatric triage scoring system4. To address this issue and to facilitate the prompt assessment and effective allocation of resources for pediatric trauma patients, scoring systems have been employed to ensure efficient and accurate decision-making in trauma patients4.

Scoring systems have been classically classified as anatomical, physiological, or combined scoring systems5,6. The AIS, ISS, and NISS are anatomical scoring systems that employ anatomical variables, including the location and severity of injury7. The GCS, RTS, and PHI are among the physiological scoring systems used and can be calculated through values retrieved from physical examination data7,8. Finally, the TRISS, NTRISS, and aTRISS are combined scoring systems that utilize both anatomical and physiological features of trauma9. The GCS is primarily utilized to evaluate the level of consciousness impairment in patients, achieved through the assessment of ocular, motor, and verbal responses10. The abbreviated injury AIS has also been employed in pediatric trauma by appointing a value of one to six to the injury10. The two main derivates of the AIS, ISS and NISS, were also utilized to predict pediatric outcomes following trauma based on the three injuries with the highest scores10,11. Similarly, the TRISS showed acceptable predictive performance for pediatric trauma outcomes11. RTS and PHI play significant roles in improving pediatric trauma triage4,8,12.

The AIS was first developed in 196913. After that, in 1974, simultaneously, the GCS was announced by Teasdale and Jennett in Scotland14, and the ISS was established by Baker to address the cumulative effect of injury, which was missed in conventional AIS15. Seven years after that, in 1981, the first version of the TRISS was introduced, claiming that the combined use of physiological and anatomical indices in addition to the age range is a very powerful tool for survival prediction16. In 1986, the PHI was developed primarily based on four items as a triage-based trauma scoring system17. In 1989, a revision of the trauma score named the RTS, was introduced18. In 1997, Osler et al. provided a new version of the ISS called the NISS19. Eventually, in 2018, a novel modified version of the TRISS, called the NTRISS, was introduced by Domingues et al.9.

During the initial assessment of a trauma patient, it is crucial to predict various factors, including the need for intensive care and the potential for morbidity and mortality. The objective of this study was to ascertain the prognostic significance of commonly utilized trauma scores, specifically in pediatric patients. We used the National Trauma Registry of Iran (NTRI) to perform a comprehensive comparison among the AIS, ISS, NISS, RTS, PHI, TRISS, and two of its variants (aTRISS and NTRISS) scoring systems for pediatric trauma and evaluated their ability to predict in-hospital mortality and ICU admission.

Methods and materials

Study design and population

This retrospective registry-based study was conducted at eight renowned trauma care hospitals in Iran. The current study encompassed a comprehensive dataset of 6709 pediatric trauma patients aged ≤ 18 years from July 2016 to September 2023. The traumatic events referred to in this context are those specifically outlined in Chapter XX of the International Statistical Classification of Diseases and Health-Related Problems (ICD-10). All the patients in the study were evaluated in an emergency department at one of the hospitals. Patients who were discharged home from the emergency department were excluded from this study. Individuals who were transferred out were excluded from this study due to the potential lack of comprehensive evaluations before their transfer, and their subsequent survival status remains uncertain. Mode of transport was defined as the way the victim was transported from the accident site to the hospital and was categorized as a private vehicle, Emergency Medical services (EMS), and others. In this study, multiple trauma was defined as injury to more than one site, regardless of their severity. The main cause of trauma was divided into transportation accidents, falls, and others. The primary outcomes of this study were ICU admission and in-hospital mortality, defined as occurring either in the emergency department (ED) or during the patient's hospital stay.

Scoring system calculations

The details of the scoring system calculations are described in Supplementary Table 1.

Determination of cutoff points for each scoring system

After conducting a receiver operating characteristic (ROC) curve analysis for each scoring system and thoroughly reviewing the relevant literature, the following cutoff points were selected: AIS ≥ 3 compared to AIS < 3, RTS < 7 (vs. ≥ 7), GCS ≤ 8 (vs. > 8), PHI ≥ 3 (vs. < 3), TRISS, aTRISS, and NTRISS < 0.9 (vs. ≥ 0.9). Due to the multitude of different cutoff points found in the literature, we utilized two cutoff points for the ISS and NISS (≥ 16 vs < 16 and ≥ 25 vs. <  25).

Statistical analysis

Numbers and percentages were used to describe nominal and categorical variables, respectively. Continuous variables were described as mean ± standard deviation (SD) in the case of the normal distribution; otherwise, they were presented as median and interquartile range (IQR). Univariate and multiple logistic regression models were employed to evaluate the factors influencing hospital mortality and ICU admission. To ascertain the predictive efficacy of the scoring systems, the AUC was calculated. To examine the association between the identified cutoff points and the two main outcomes, crude binary logistic regression was initially conducted. Subsequently, adjusted binary logistic regression was performed after controlling for confounding factors encompassing age, sex, and mechanism of trauma. The equality of the AUC was tested using Stata's roccomp command. It provides summary statistics such as the AUC, standard errors, confidence intervals, and p-values for the DeLong test. The DeLong test, provided by roccomp, assesses the null hypothesis that the AUCs of the two models are equal. Data analysis was conducted using Stata software version 14.0 (Stata Corp., College Station, TX, USA; Available on https://www.stata.com/stata14/).

Results

Baseline and clinical characteristics of patients

Among the 6709 pediatric trauma patients, 5309 (79.3%) were male. The mean (SD) age of the included patients was 11.85 (5.88). The main cause of trauma was transportation accidents, accounting for 3401 patients (50.7%), followed by falls (1623 patients, 24.2%). Blunt trauma was the most prevalent type of injury, affecting 5888 patients (87.8%). In terms of transportation to the hospital, 3383 patients (50.4%) were transported via EMS, while 3194 patients (47.6%) relied on private vehicles.

A total of 2527 patients, accounting for 37.7% of the total patients, were identified as having multiple traumatic events. Among the reported cases, injuries to the extremities were the most common, affecting 2993 individuals or 44.6%. This was followed by isolated injuries to the head, face, and neck, which were reported in 768 patients (11.4%).

Among the total patients, 720 (10.7%) required admission to the ICU. A total of 72 patients (1.1%) died during hospitalization. The mortality rate in the ICU was 7.2% (n = 52). The median length of stay (LOS) for all patients was 2 days, with an IQR of 1 to 4 days. However, for those admitted to the ICU, the median LOS was 4 days, with an IQR of 2 to 7 days.

Scoring systems

The median (IQR) of all scoring systems is presented in Table 1. As demonstrated in Table 1, all nine measured scores were significantly different among in-hospital deceased and nondeceased patients and between ICU-admitted and non-ICU-admitted pediatric trauma patients (p < 0.001).

Table 1 Comparison of median (IQR) values for each scoring system based on two main outcomes.

Predictive performance of the scoring systems for pediatric in-hospital mortality

According to the findings presented in Table 2, the system that demonstrated the highest predictive performance for in-hospital mortality was aTRISS, with an AUC value of 0.982. The next highest predictive performances belong to TRISS (AUC = 0.979), NTRISS (AUC = 0.972), GCS (AUC = 0.955), RTS (AUC = 0.944), PHI (AUC = 0.936), ISS (AUC = 0.901), NISS (AUC = 0.900), and AIS (AUC = 0.735). Figure 1 shows the ROC curves for all nine scoring systems for the prediction of in-hospital mortality.

Table 2 Performance of scoring systems in prediction of in-hospital mortality and ICU admission.
Figure 1
figure 1

ROC curves for all nine scoring systems in the prediction of in-hospital mortality.

Predictive performance of the scoring systems for pediatric ICU admission

As unveiled in Table 2, the NTRISS had the highest predictive performance for ICU admission, with an AUC of 0.844. Closely followed by aTRISS, with an AUC of 0.836, TRISS at 0.823, ISS at 0.807, NISS at 0.805, GCS at 0.735, RTS at 0.698, PHI at 0.662, and AIS at 0.651. Figure 2 shows the ROC curves for all nine scoring systems for the prediction of ICU admission.

Figure 2
figure 2

ROC curves for all nine scoring systems in the prediction of ICU admission.

Crude logistic regression of the association between the categorized scoring systems and the two main outcomes

As illustrated in Table 3, the odds of death and ICU admission in patients with AIS ≥ 3 were 7.19 and 3.82 times greater than those in patients with AIS < 3. The odds of death in patients with ISS and NISS ≥ 16 were 7.80 and 7.20 times greater than those in patients with ISS and NISS < 16. The highest odds ratio (OR) of in-hospital mortality were 216.2 in cases with GCS ≤ 8, followed by OR:106.9 in NTRISS < 0.9 and OR: 104.40 in RTS < 7. The highest odds of ICU admission were attributed to patients with aTRISS < 0.9 (OR = 27.4), followed by those with a GCS ≤ 8 (OR = 26.7) and those with a RTS < 7 (OR = 25.3).

Table 3 Crude logistic regression of determined cutoffs for each pediatric scoring system for two main outcomes.

Multiple logistic regression of the association between the categorized scoring systems and the two main outcomes

According to the data presented in Table 4, after controlling for age, sex, and mechanism of trauma, patients with TRISS, aTRISS, or NTRISS values less than 0.9 had significantly increased odds of in-hospital mortality (Adjusted ORs: 219.9, 173.7, and 107.2, respectively). Additionally, these patients also had greater odds of requiring admission to the ICU (aORs: 28.6, 38.6, and 19.6, respectively) (p < 0.001).

Table 4 Multiple logistic regression of determined cutoffs for each pediatric scoring system for two main outcomes (after adjustment for age, gender, and mechanism of trauma).

Furthermore, patients with a GCS ≤ 8 had significantly greater odds of in-hospital mortality (aOR: 199.8) and ICU admission (aOR: 61.7) (p < 0.001). Moreover, in pediatric trauma patients, those with ISS and NISS values equal to or greater than 25 had 49.7 and 33.2 times greater odds of death, respectively, than did their counterparts with ISS and NISS values less than 25. Similarly, these patients also had 20.2 and 22.5 times greater odds of requiring ICU admission, respectively.

Comparison of paired samples area difference under the ROC among all scoring systems

As displayed in Table 5, it is worth noting that the AIS exhibited considerably poorer predictive performance than did the other eight scoring systems, with p values indicating statistical significance (p < 0.001). Compared with the NTRISS, TRISS, and aTRISS, the RTS exhibited significantly poorer predictive performance (p < 0.05). Eventually, the NTISS had a poorer predictive performance compared to aTRISS (p < 0.05).

Table 5 Comparison of paired samples area difference under the ROC curves among all scoring systems.

Besides, as presented in Table 5, the AIS falls short of all its counterpartying scoring systems in terms of predictive performance (p < 0.05). The ISS and NISS both had poorer predictive performance than did the aTRISS and NTRISS (p < 0.001). The TRISS had considerably less predictive performance than did the aTRISS or NTRISS (p < 0.05), and the aTRISS had significantly less predictive performance than the NTRISS (p < 0.05).

Ethical approval and consent to participate

All ethical and moral issues were considered in this study. Informed consent was obtained from the patients or their next of kin. This study was approved by the ethics committee of Sina Hospital, Tehran University of Medical Sciences (Approval ID:IR.TUMS.SINAHOSPITAL.REC.1399.090). We confirm that all methods were performed in accordance with the relevant guidelines and regulations.

Consent for publication

Due to the registry-based nature of this study, verbal consent was obtained from the patients of their next of kin, at the time of the interview.

Discussion

Trauma continues to be the leading cause of death in children, and the progress made in the field of pediatric trauma has closely paralleled advancements in adult trauma care. However, there is considerable disparity among trauma centers in regard to prehospital triage systems20,21. In this study, we analyzed various trauma scores of 6709 pediatric trauma patients, namely, AIS, ISS, NISS, RTS, GCS, PHI, TRISS, aTRISS, and NTRISS scores, to establish correlations between these scores and their respective clinical outcomes. The ROC curves demonstrated that the TRISS and its two derivatives, the aTRISS and NTRISS, respectively, possess the inherent capability to accurately distinguish patients who face an elevated likelihood of encountering unfavorable consequences, encompassing in-hospital mortality and ICU admission. These compelling findings underscore the utmost importance of incorporating these measures into clinical practice, thereby elevating the standard of patient care and prognosis. Furthermore, considering anatomical and physiological factors alone for the prediction of pediatric trauma cases may not seem sufficient; however, a combinatorial approach of utilizing both factors simultaneously may increase outcome prediction.

The Abbreviated Injury Scale (AIS) is a scoring system that categorizes injuries based on their severity. A six-point scale ranging from one (minor) to six (maximal) was used to classify injuries according to their relative severity in different body regions. Derived from expert consensus, this system has achieved global recognition22. Although intended for all age groups, the AIS does not encompass any specific AIS scores applicable solely to pediatric patients23.

The ISS, which was created by Susan Baker in 1974, has been widely employed in the pediatric literature as a measure of injury severity for the past four decades. The ISS (ranging from 1 to 75) was calculated by adding up the squares of the highest AIS severity scores from the three most severely injured body regions. However, if the AIS score is 6, the ISS is automatically set to 7515,24. According to a study of 545,015 pediatric blunt trauma patients, the AUC of the ISS for in-hospital mortality prediction was 0.86. The difference between the current study AUC for the ISS (AUC = 0.9006) and the aforementioned study can be explained by the inclusion of penetrating trauma in this study3. The conventional ISS assigns equal weight to various body regions. The NISS was developed to address this issue. In addition, the impact of age differences on the severity of disease experienced by patients is neglected in ISS3. The TRISS (and its derivatives) were developed to address this.

In 1997. Osler et al. introduced the NISS, which comprises the cumulative value of the squared AIS score pertaining to the three most critical injuries sustained by a patient, irrespective of the affected anatomical region19. On the basis of retrospective examination of 6585 adult individuals affected by trauma, Osler et al. revealed that the NISS was better able to predict patient prognosis than the ISS. They used data from two datasets, 3136 patients from Albuquerque Center and 3449 patients from Emanuel Center. The AUC for ISS and NISS was 0.869 and 0.896 for the first center and 0.907 and 0.896 for the second center. They also stated that the difference between these two scoring systems was statistically significant despite overlapping confidence intervals. Our findings implied that there was no significant difference in the prediction of poorer outcomes between the NISS and ISS. Like our findings, in a study by Grisoni et al., which was conducted on 9151 pediatric trauma patients in four different regional trauma centers in the U.S., the disparities in the predictive capabilities of the two scoring systems were deemed inconsequential, and they concluded that in pediatric trauma patients, there are no notable disparities in the predictive capabilities between the ISS and NISS, as reported in studies involving adult trauma patients25.

The PHI components are SBP, PR, RR, level of consciousness, and nature of the injury (blunt or penetrating). The concept of the PHI was initially introduced by Koehler et al. and has since gained widespread use in medical triage17. The PHI previously exhibited an AUC of 0.9268 for the prediction of death following adult trauma.

According to a meta-analysis of 11 relevant studies published in 2019 on the performance of the RTS in predicting in-hospital mortality in a mixed population of adults and children, the overall AUC of the RTS was 0.9326, which was similar to our finding (AUC = 0.9444). In another multicenter study of 814 pediatric trauma patients, the AUC of the RTS was 0.949.

In a multicenter study of 45,377 pediatric trauma patients, 2579 were deceased. The AUC for the GCS for the prediction of in-hospital mortality was 0.8927. In a study of 104,035 records of pediatric trauma aged 1 to 18 years with 3946 deaths, the AUC of the GCS for mortality prediction was 0.94628, which was much closer to that of our study (AUC of GCS for mortality: 0.954)29.

The TRISS is extensively employed as the predominant tool for determining the likelihood of patient survival following traumatic injuries30. The coefficients of the currently utilized TRISS model were computed using the dataset obtained from the Major Trauma Outcome Study, which was coordinated by the American College of Surgeons Committee on Trauma between 1982 and 198731. Several studies have indicated that the TRISS model may not be appropriate for assessing survival outcomes due to its inadequacy in accurately considering factors such as the area, time period, and age range of the study population30,32,33. Many studies have performed statistical analysis to extract their own sets of coefficients for TRISS based on the local context30. In the present study, first, we used a conventional coefficient for calculating the probability of survival, named TRISS; second, we utilized another set of coefficients, employed in another interesting study by Domingues et al. and called it aTRISS9. Furthermore, we evaluated another scoring system, named the NTRISS, which was developed by Domingues in 2018 and establishes the SBP, NISS, and best motor response, instead of the RTS and ISS, in the conventional TRISS9. There were several concerns, given the use of the TRISS with conventional coefficients in children; nevertheless, in a recently published study in 2023 in which 11 models of the TRISS were evaluated for pediatric trauma, the authors concluded that the proposed models are not superior to conventional models30.

A study conducted in Turkey examined 1510 patients with an average age of 7.8 years and a mortality rate of 4.2%. That study evaluated the performance of the NISS, ISS, GCS, and RTS in predicting in-hospital mortality, with each scoring system having an impressive AUC of 0.993, 0.992, 0.987, and 0.976, respectively. Additionally, these measures achieved AUCs of 0.936, 0.934, 0.913, and 0.903 for predicting ICU admission. However, it is important to note that the study was conducted in a single center, which may limit the generalizability of the findings. Furthermore, the higher mortality rate observed in this particular center should be taken into consideration when interpreting the results34.

In the present study, as shown in Table 5, the predictive performance of the GCS for in-hospital mortality was significantly greater than that of the ISS and NISS. In parallel with our findings, in a single-center study of 588 pediatric trauma patients, the GCS had the best predictive performance compared to the ISS35. However, these findings were reversed in our study when considering ICU admission as the outcome. The GCS exhibited poorer predictive performance than did the ISS and the NISS.

In a single-center study comprising 74 pediatric Turkish patients with a mean age of 7 years and nearly 90% of whom the injuries were blunt in nature, the RTS, TRISS, and ISS were found to be independent predictors of ICU admission10.

In a study conducted at a single center involving 938 pediatric patients with an average age of 3.1 years and a mortality rate of 0.9% in 2019, ISS demonstrated the highest AUC compared to RTS and GCS (AUC = 0.975, 0.899, and 0.864, respectively)36. Interestingly, our study showed contrasting results in terms of predictive performance among these three scoring systems. Specifically, we observed that the GCS provided better mortality prediction than did the RTS or ISS (AUC = 0.955, 0.944, and 0.901, respectively). This discrepancy may be attributed to the higher mortality rate in their study and the fact that it was conducted at a single center. Additionally, the inclusion of only children younger than six years in their research is a crucial factor. This strongly suggested that the age of patients significantly influences the use of scoring systems in a pediatric context. Eventually, after choosing cutoff values of 15 for ISS, 11 for GCS, and 7 for RTS in their study, the crude ORs for mortality were 109.25 for patients with ISS ≥ 15, 136.50 for patients with GCS ≤ 11, and 94.72 for patients with RTS ≤ 736. In our study, the crude ORs for patients with RTS < 7 were 104.40 and 216.21 for GCS score ≤ 8.

In contrast to many other studies that simply utilize predetermined cut-offs from existing literature to calculate the AUC for scoring systems, our current study took a different approach. Initially, we treated all scoring systems as continuous variables and constructed ROC curves for each. Subsequently, we utilized our own findings in conjunction with those from existing literature to determine cut-off values for the scoring systems that effectively categorized patients with both specificity and sensitivity in relation to measured outcomes. Our study successfully demonstrated that all identified cut-off values for the scoring systems were able to significantly differentiate patients based on their outcomes.

An effective scoring system that can accurately assess the severity and type of injury is crucial for prioritizing patient treatment, forecasting patient recovery, assessing trauma care, and distributing therapeutic resources. Selecting an appropriate trauma scoring system is crucial for its effective application based on the context. The application of these trauma scoring systems to guarantee their effectiveness and proper utilization should be comprehended37.

Based on the literature, an AUC ≥ 0.80 is considered as being an acceptable diagnostic test, and an AUC ≥ 0.90 is being excellent one38. Regarding the in-hospital mortality as the main outcome of this study, there might be some degrees of statistical difference among scoring systems, which can be explained by the large number of patients included in this study. Nonetheless, except for the AIS, all scoring systems had excellent results in predicting mortality (AUC ≥ 0.90). In other words, these scoring systems are clinically excellent scoring systems in the prediction of in-hospital mortality. Given the ICU admission as the second outcome in this study, TRISS and its two variants, as well as ISS and NISS had an acceptable performance in its prediction (AUC ≥ 0.80), although none of them had an excellent performance (AUC ≥ 0.90). Despite of statistical difference among them, these five scoring systems are acceptable systems for ICU admission prediction.

In summary, it is imperative to acknowledge that relying solely on anatomical or physiological factors for predicting pediatric trauma cases may be inadequate. However, adopting a combinatorial approach that incorporates both of these factors simultaneously can enhance the accuracy of outcome prediction. This is precisely why TRISS, along with its two other variants, yielded the most appealing outcomes.

Study limitations and considerations

The AIS scores utilized in this study were generated manually. Previous researchers have reported discrepancies between AIS scores assigned by machines and those assigned manually39. The considerable number of included pediatric trauma patients and the multicenter nature of this study increase the generalizability of the findings.

Conclusion

In this study, we demonstrated that TRISS and its two derivatives, namely, aTRISS and NTRISS, have the potential to effectively identify patients who are at a greater risk of experiencing adverse outcomes. In addition, we showed that anatomical-based scoring systems have better predictive performance for ICU admission than their physiological counterparts; however, physiological-based scoring systems are better predictors of in-hospital mortality. These findings highlight the importance of utilizing these measures in clinical practice to improve patient care and patient prognosis.