Development of a deep learning model that predicts critical events of pediatric patients admitted to general wards

Jeon, Yonghyuk; Kim, You Sun; Jang, Wonjin; Park, June Dong; Lee, Bongjin

doi:10.1038/s41598-024-55528-1

Article
Open access
Published: 27 February 2024

Scientific Reports volume 14, Article number: 4707 (2024)

Abstract

Early detection of deteriorating patients is important to prevent life-threatening events and improve clinical outcomes. Efforts have been made to detect or prevent major events such as cardiopulmonary resuscitation, but previously developed tools are often complicated and time-consuming, rendering them impractical. To overcome this problem, we designed this study to create a deep learning prediction model that predicts critical events with simplified variables. This retrospective observational study included patients under the age of 18 who were admitted to the general ward of a tertiary children’s hospital between 2020 and 2022. A critical event was defined as cardiopulmonary resuscitation, unplanned transfer to the intensive care unit, or mortality. The vital signs measured during hospitalization, their measurement intervals, sex, and age were used to train a critical event prediction model. Age-specific z-scores were used to normalize the variability of the normal range by age. The entire dataset was classified into a training dataset and a test dataset at an 8:2 ratio, and model learning and testing were performed on each dataset. The predictive performance of the developed model showed excellent results, with an area under the receiver operating characteristics curve of 0.986 and an area under the precision-recall curve of 0.896. We developed a deep learning model with outstanding predictive power using simplified variables to effectively predict critical events while reducing the workload of medical staff. Nevertheless, because this was a single-center trial, no external validation was carried out, prompting further investigation.

Introduction

Early detection of deteriorating patients is crucial in order to provide timely intervention before critical events, such as cardiopulmonary resuscitation (CPR), take place. Cardiac arrest due to respiratory failure is known to be more common in children compared to adults, whereas cardiac arrest of cardiac origin is relatively rare in children^1,2,3. For this reason, pediatric patients may have a higher chance of receiving intervention before cardiac arrest occurs. The pediatric early warning score (PEWS) is one of the means that has been developed in an effort to recognize deteriorating patients as early as possible^4,5,6.

PEWS determines a patient’s risk level by measuring and scoring several vital signs, such as blood pressure (BP) and heart rate (HR), according to age. A few examples of PEWS include the Bedside PEWS, the Brighton PEWS, the Melbourne Activation Criteria, and the Bristol PEWS^5,6,7,8,9,10,11. In initial studies, these methods demonstrated very high predictive performance, with an area under the receiver operating characteristic curve (AUROC) of around 0.9^4,6. However, numerous validation studies on different types of PEWS carried out by multiple institutions were unable to replicate these outcomes and showed relatively low performance (AUROC 0.62–0.86)^9,10,12,13,14. In addition, obtaining the necessary parameters and calculating these scores demands considerable time and effort from medical staff, and the qualitative data required for scoring PEWS, such as capillary refill time and respiratory effort, are often not readily available, in contrast to easily obtainable values such as BP, HR, and respiratory rate (RR)^4,6.

As a way to compensate for these shortcomings, the application of machine learning, particularly deep learning, in constructing predictive models has been drawing attention in the research field. Most studies were conducted on adults, and only a few examined pediatric models. One retrospective study assessed a model that used 29 variables to predict the likelihood of transfer to the intensive care unit (ICU) within 24 h, with an AUROC of 0.912 (95% confidence interval [CI] 0.905–0.919). Although the accuracy of the predictions was excellent, it might be impractical to collect and analyze 29 variables¹⁵. In another retrospective study of pediatric subjects, a long short-term memory (LSTM) model with a promising AUROC of 0.923 was developed using fewer parameters. The LSTM model, however, could only be used when more than 20 consecutive time-stamped vital sign data points were available. Consequently, early prediction in general wards can be challenging because vital signs are not typically recorded as frequently as in the ICU¹⁶.

Therefore, the authors designed this study to create a deep learning model that can predict critical events using simplified variables without long-term continuous measurement values.

Methods

Study setting and data source

This retrospective cross-sectional observational study was conducted at a tertiary children’s hospital with about 350 beds. The subjects were patients under the age of 18 who had been admitted to the general ward of the children’s hospital between January 2020 and December 2022. The pseudonymized data used for analysis were collected from the clinical data warehouse of the hospital information system. Measurements of systolic BP (SBP), diastolic BP (DBP), HR, RR, body temperature (BT), and oxygen saturation measured with pulse oximetry (SpO₂) that were recorded in the general ward were collected. Measurements from the emergency department or ICU were excluded. The recorded time, sex, age (in months), admission date, discharge date, and pseudonymized study-specific identification code were also collected.

This study was approved for exemption from review by the Institutional Review Board of Seoul National University Hospital because it used only pseudonymized information and did not collect personally identifiable information (H-2209-001-1032). Since only information that could not identify the research subjects was used, the above committee confirmed that it was impossible to obtain consent from specific subjects. Moreover, the study was conducted in accordance with the principles of the Declaration of Helsinki.

Data preprocessing

The pseudonymized identification code and hospitalization date were combined to create a unique code for each individual hospitalization, which was defined as the individual hospitalization identification code (IHID). The collected data were grouped by IHID and sorted in ascending order of vital sign measurement time, and missing values among SBP, DBP, HR, RR, BT, and SpO₂ were replaced with the immediately preceding values. In addition, the interval between vital sign measurements was calculated within the same IHID (each vital sign measurement time minus the previous measurement time, in minutes), and this was defined as the measurement interval. Since the normal ranges of BP, HR, and RR in children differ according to age, age-specific z-scores were calculated and used for analysis. Centile charts of vital signs for each age developed in a previous study were used for z-score conversion¹⁷.
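The grouping, forward-filling, and interval steps described above can be sketched as follows. This is a minimal illustration in plain Python, not the R pipeline used in the study. The field names (`ihid`, `time_min`, `hr`, `rr`) are hypothetical, and the `z_score` helper with its reference mean and SD is a simplification: the study derived z-scores from published centile charts, which is not reproduced here.

```python
from collections import defaultdict

VITALS = ("sbp", "dbp", "hr", "rr", "bt", "spo2")


def preprocess(records):
    """Group by IHID, sort by time, forward-fill vitals, add interval (minutes)."""
    by_ihid = defaultdict(list)
    for rec in records:
        by_ihid[rec["ihid"]].append(dict(rec))  # copy so the input is not mutated

    cleaned = []
    for rows in by_ihid.values():
        rows.sort(key=lambda r: r["time_min"])
        last_seen = {}    # most recent non-missing value of each vital sign
        prev_time = None  # time of the previous record in this hospitalization
        for row in rows:
            for vital in VITALS:
                if row.get(vital) is None:
                    # Carry the preceding value forward; stays None if none exists.
                    row[vital] = last_seen.get(vital)
                else:
                    last_seen[vital] = row[vital]
            # The first record of a hospitalization has no preceding measurement.
            row["interval_min"] = (
                None if prev_time is None else row["time_min"] - prev_time
            )
            prev_time = row["time_min"]
            cleaned.append(row)
    return cleaned


def z_score(value, ref_mean, ref_sd):
    """Simplified z-score; ref_mean and ref_sd are hypothetical age-specific values."""
    return (value - ref_mean) / ref_sd


# Hypothetical records for one hospitalization, deliberately out of order.
records = [
    {"ihid": "P001_2021-03-01", "time_min": 300, "hr": 150, "rr": None},
    {"ihid": "P001_2021-03-01", "time_min": 0, "hr": 120, "rr": 30},
    {"ihid": "P001_2021-03-01", "time_min": 240, "hr": None, "rr": 28},
]
cleaned = preprocess(records)
```

In this sketch the first record of each hospitalization keeps a missing interval; the study filled that gap later with the mean of all measurement intervals.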

Critical events were defined as CPR in the general ward, unexpected transfer to the ICU, and mortality (as a result of CPR or discontinuation of life-sustaining treatment)^18,19. Critical records were defined as the data measured from 6 h before a critical event to the time of its occurrence in the case of unexpected ICU transfer or mortality; in the case of CPR, they were defined as the data measured from 6 h before the occurrence to 30 min after the occurrence (from 6 h before CPR until death in the case of mortality after CPR). To perform deep learning on critical records, all records were divided into two groups: a critical group and a non-critical group. Since the records of individuals who experienced a critical event contain a mixture of critical and non-critical records, the non-critical records of IHIDs with critical events were excluded from the non-critical group. In addition, because the dataset was expected to be imbalanced, with the non-critical group substantially larger than the critical group, only the last record of each IHID in the non-critical group was used for deep learning. Each IHID typically has numerous vital sign records measured during hospitalization rather than just one. Therefore, we anticipated that retaining only the last record per IHID in the non-critical group while using all records in the critical group would partly alleviate the imbalance between the two groups. R version 4.3.1 (R Foundation for Statistical Computing, Vienna, Austria; https://www.r-project.org) was used for data preprocessing, and open packages such as the generalized additive models for location, scale and shape (GAMLSS) and sitar were used in this process^20,21,22.
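The labelling rule above can be illustrated with a short sketch. It assumes at most one event per IHID, a hypothetical encoding of event types (`"cpr"`, `"icu"`, `"death"`), and times expressed in minutes; the special case of mortality after CPR (window extended until death) is left out for brevity.

```python
WINDOW_BEFORE_MIN = 6 * 60   # critical records start 6 h before the event
CPR_AFTER_MIN = 30           # for CPR, records up to 30 min after also count


def build_dataset(records, events):
    """Label critical records and keep one non-critical record per IHID.

    records: list of dicts with "ihid" and "time_min"
    events:  dict mapping ihid -> (kind, event_time_min); kind is
             "cpr", "icu", or "death" (hypothetical encoding)
    """
    critical = []
    last_noncritical = {}
    for rec in sorted(records, key=lambda r: (r["ihid"], r["time_min"])):
        event = events.get(rec["ihid"])
        if event is None:
            # No event in this hospitalization: later records overwrite
            # earlier ones, so only the last record survives.
            last_noncritical[rec["ihid"]] = rec
            continue
        kind, t_event = event
        end = t_event + CPR_AFTER_MIN if kind == "cpr" else t_event
        if t_event - WINDOW_BEFORE_MIN <= rec["time_min"] <= end:
            critical.append({**rec, "label": 1})
        # Records outside the window of an IHID with an event are dropped.
    noncritical = [{**rec, "label": 0} for rec in last_noncritical.values()]
    return critical + noncritical


# Hypothetical hospitalizations: A (ICU transfer), B (no event), C (CPR).
records = [
    {"ihid": ihid, "time_min": t}
    for ihid, t in [
        ("A", 0), ("A", 500), ("A", 900),
        ("B", 0), ("B", 200), ("B", 400),
        ("C", 100), ("C", 300), ("C", 620), ("C", 700),
    ]
]
events = {"A": ("icu", 1000), "C": ("cpr", 600)}
dataset = build_dataset(records, events)
```

Here hospitalization B contributes only its last record to the non-critical group, while the out-of-window records of A and C are discarded rather than relabelled as non-critical.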

Deep learning and data analysis

The preprocessed dataset was divided into a training set and a test set at a ratio of 8:2, which were used for model training and testing, respectively. A simple artificial neural network (ANN) based on the multilayer perceptron was used for deep learning. The nine parameters used for learning were age, sex, z-score of SBP, z-score of DBP, z-score of HR, z-score of RR, BT, SpO₂, and the measurement interval. These features were normalized to values between 0 and 1. The ANN model was composed of 3 hidden layers (with 128, 128, and 64 nodes, respectively), and a 30% dropout was applied after each layer. The Adam optimizer and the rectified linear unit activation function were used²³. The model was trained for 10,000 epochs with a learning rate of 0.0001 using Python version 3.8.10 (Python Software Foundation, Beaverton, OR, USA; https://www.python.org). The scikit-learn library was used for normalization²⁴, PyTorch was used for model training and testing²⁵, and matplotlib and the Shapley additive explanation (SHAP) library were used for visualization²⁶. Since the measurement interval of the first record for each IHID cannot be calculated (missing value), the average of all measurement intervals was imputed. Continuous variables were described as median (interquartile range) and categorical variables as number (%).
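As a rough illustration of the mean imputation, the 0 to 1 normalization, and the size of the network described above, consider the sketch below. It uses plain Python instead of scikit-learn and PyTorch; the helper names and example intervals are hypothetical, the source does not name the exact scaler, and the single output unit in the layer list is an assumption because the output layer is not described.

```python
def impute_first_interval(intervals):
    """Replace missing (None) intervals with the mean of the observed ones."""
    observed = [x for x in intervals if x is not None]
    mean = sum(observed) / len(observed)
    return [mean if x is None else x for x in intervals]


def min_max_scale(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant feature: avoid division by zero
    return [(v - lo) / (hi - lo) for v in values]


def count_parameters(layer_sizes):
    """Weights plus biases of a fully connected network with these layer sizes."""
    return sum(
        n_in * n_out + n_out
        for n_in, n_out in zip(layer_sizes, layer_sizes[1:])
    )


intervals = [None, 240, 120, 360]           # first record has no interval
imputed = impute_first_interval(intervals)  # None -> mean of 240, 120, 360
scaled = min_max_scale(imputed)

# 9 inputs, hidden layers of 128, 128, and 64 nodes; the single output unit
# is an assumption, since the source does not describe the output layer.
n_params = count_parameters([9, 128, 128, 64, 1])
```

Under these assumptions the network has 26,113 trainable parameters, which is small relative to the 14,227 records and helps explain why dropout was applied after each layer.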

Outcomes

The primary outcome of this study was the overall predictive performance of the developed model. Accuracy, AUROC, and area under the precision-recall curve (AUPRC) were used to evaluate the predictive performance of the model. As secondary outcomes, critical events were subdivided into CPR occurrence, unexpected ICU transfer, and mortality, and the performance of the developed model was evaluated for each. Additionally, based on the time remaining before a critical event occurred, measurements were divided into six subgroups (0–1 h, 1–2 h, 2–3 h, 3–4 h, 4–5 h, and 5–6 h), and the predictive performance of the model was evaluated for each subgroup. The secondary outcomes also included an assessment of the importance of each feature used in learning to the prediction and the correlation between features.
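For readers less familiar with the three metrics, the following sketch computes them from scratch on a small, made-up set of labels and scores. The 0.5 decision threshold and the use of average precision as the AUPRC estimate are assumptions; the source does not state how the threshold or the area under the precision-recall curve was computed.

```python
def accuracy(labels, scores, threshold=0.5):
    """Fraction of records whose thresholded score matches the label."""
    predicted = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(predicted, labels)) / len(labels)


def auroc(labels, scores):
    """Probability that a random positive scores higher than a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Mean of the precision values at the rank of each positive (ties ignored).

    Requires at least one positive label.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits


# Made-up example: 3 critical (1) and 5 non-critical (0) records.
labels = [1, 0, 1, 0, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.6, 0.2, 0.1]

acc = accuracy(labels, scores)
roc = auroc(labels, scores)
ap = average_precision(labels, scores)
```

AUROC depends only on how positives rank against negatives, whereas average precision also depends on the share of positives, which is why the two can diverge on imbalanced data.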

Results

Baseline characteristics

During the study period, 13,787 patients were hospitalized a total of 22,184 times, and 1,039,070 vital sign records were analyzed. When analyzed by IHID, the age at admission was 69.0 (23.0–135.0) months, and 9,485 (42.8%) were girls. The duration of hospitalization was 3.0 (2.0–7.0) days.

Of the total vital sign records, 632 (0.1%) were critical records, and the median measurement interval was 161.0 min. Detailed descriptions of SBP, DBP, HR, RR, BT, and SpO₂ are summarized in Table 1. There were 14,227 records remaining after data preprocessing; the age was 74.0 (22.0–139.0) months, and 6,041 (42.5%) were girls. The critical group included 632 (4.4%) of these records; among the critical records, 261 involved CPR, 238 involved unplanned ICU transfers, and 141 involved fatalities. There were 8 records of patients who died as a result of CPR. Additional information is provided in Table 2. The mean value used to impute the missing first measurement interval for each IHID was 276.17 min.
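The counts above can be cross-checked with a few lines of arithmetic. The sketch assumes that the 8 records of patients who died as a result of CPR are counted under both the CPR and the fatality tallies, which is one reading that makes the subtotals add up to 632; the variable names are illustrative.

```python
total_raw_records = 1_039_070   # all vital sign records
records_after_prep = 14_227     # records remaining after preprocessing
cpr, icu_transfer, deaths = 261, 238, 141
died_after_cpr = 8              # assumed to be counted under both CPR and deaths

# Remove the assumed double counting to recover the number of critical records.
critical = cpr + icu_transfer + deaths - died_after_cpr
non_critical = records_after_prep - critical

# Share of critical records before and after preprocessing, in percent.
pct_raw = round(100 * critical / total_raw_records, 1)
pct_prep = round(100 * critical / records_after_prep, 1)

# Remaining class imbalance (non-critical records per critical record).
ratio = non_critical / critical
```

Under this reading, keeping only the last non-critical record per IHID raises the share of critical records from about 0.1% to 4.4%, leaving roughly 21 to 22 non-critical records per critical record.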

Table 1 Baseline characteristics of all vital sign records.

Table 2 Characteristics of datasets used to develop deep learning models.

Main outcomes

The accuracy of the developed model was 0.988, the AUROC (95% CI) was 0.986 (0.972–0.995), and the AUPRC (95% CI) was 0.896 (0.848–0.938) (Fig. 1). For each type of critical event, the AUROC (95% CI) and AUPRC (95% CI) were as follows: CPR occurrence, 0.967 (0.928–0.988) and 0.451 (0.322–0.585) (Supplementary Fig. S1); unexpected ICU transfer, 0.964 (0.951–0.975) and 0.203 (0.139–0.276) (Supplementary Fig. S2); and mortality, 0.995 (0.993–0.997) and 0.683 (0.551–0.809) (Supplementary Fig. S3). In the subgroup evaluation by time interval, the AUROC and AUPRC of each subgroup were as follows: 0–1 h, 0.998 and 0.982; 1–2 h, 0.997 and 0.963; 2–3 h, 0.996 and 0.966; 3–4 h, 0.990 and 0.949; 4–5 h, 0.997 and 0.976; and 5–6 h, 0.997 and 0.971. The respective 95% CIs and graphical illustrations are shown in Fig. 2.

Among the features used to predict the outcomes, the measurement interval had the highest impact, followed by SpO₂ and the z-score of RR (Fig. 3). Figure 4 shows how the impact on the model prediction changes with high and low values of each feature. The lower the measurement interval (blue), the higher the impact on the model output, and the higher the measurement interval (red), the lower the impact. SpO₂ showed the same pattern as the measurement interval. On the other hand, greater z-scores for RR and HR had a greater impact on outcomes, while lower z-scores for RR and HR had a lesser effect on outcomes (Fig. 4).
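The idea behind these feature attributions can be illustrated with exact Shapley values on a toy model. The sketch below is not the SHAP library used in the study, which approximates such values for the trained network; `toy_risk`, its coefficients, and the baseline values are all hypothetical and chosen only so that a short measurement interval and a low SpO₂ raise the score.

```python
from itertools import permutations


def shapley_values(model, x, baseline):
    """Exact Shapley values by averaging marginal contributions over all orders.

    Features start at their baseline value and are switched to the value in x
    one at a time; feasible only for a handful of features.
    """
    n = len(x)
    phi = [0.0] * n
    orders = list(permutations(range(n)))
    for order in orders:
        current = list(baseline)
        prev = model(current)
        for i in order:
            current[i] = x[i]
            new = model(current)
            phi[i] += new - prev  # marginal contribution of feature i
            prev = new
    return [p / len(orders) for p in phi]


def toy_risk(features):
    """Hypothetical risk score, not the trained model from the study."""
    interval, spo2, rr_z = features  # interval and SpO2 scaled to 0..1
    return (
        0.6 * (1 - interval)
        + 0.3 * (1 - spo2)
        + 0.1 * rr_z
        + 0.2 * (1 - interval) * (1 - spo2)  # interaction term
    )


baseline = [1.0, 1.0, 0.0]  # long interval, high SpO2, average RR
x = [0.1, 0.8, 1.5]         # short interval, lower SpO2, elevated RR
phi = shapley_values(toy_risk, x, baseline)
```

The attributions sum to the difference between the model output at `x` and at the baseline, and the interaction term is split equally between the two features involved, which is the property that makes per-feature impacts comparable.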

The correlation between the features was studied to further characterize the model. As in the prior results, the SHAP value (the impact on the model output) increased with a smaller measurement interval, whereas the z-score of HR had no discernible impact on the value (Supplementary Fig. S4). Regardless of whether the measurement interval was high or low, SpO₂ and SHAP values consistently had an inverse correlation, and this tendency was more pronounced when the measurement interval was smaller (Supplementary Fig. S5). The supplementary figures provide a summary of the inter-feature influence of parameters that are not mentioned above (z-score of RR, Supplementary Fig. S6; z-score of HR, Supplementary Fig. S7; sex, Supplementary Fig. S8; age, Supplementary Fig. S9; z-score of SBP, Supplementary Fig. S10; z-score of DBP, Supplementary Fig. S11; and body temperature, Supplementary Fig. S12).

Discussion

Through this study, we created a deep learning model that uses simplified variables, including vital signs, age, sex, and measurement interval, to predict the need for intervention in pediatric patients who are deteriorating. Our approach, in contrast to earlier studies, predicts the probability of transfer to the ICU using only a handful of variables without the need for accumulated measurements. Furthermore, the model demonstrated an AUROC of 0.986 and an AUPRC of 0.896, which were higher than those reported in earlier studies^15,16.

Numerous studies on previously developed PEWS have reported outstanding AUROC values of around 0.9, but the process of collecting and calculating the parameters for the scoring system is complex and time-consuming, which can significantly increase the workload of the medical staff. Even when the efficacy of the prediction model is high, its impracticality can become an obstacle in clinical settings. It is important to evaluate the workload of medical staff, especially in an environment with limited medical resources^27,28,29. The prediction model created in this study can decrease such workload for the medical staff because it utilizes vital signs, sex, and age as parameters that are expressed in plain values and are easy to access because they are collected in the hospital electronic medical record system. Moreover, predictions with a deep learning model can be generated automatically without manually entering values into a computer, which can eliminate the workload of the medical staff entirely.

In this investigation, the measurement interval was used as a learning parameter as opposed to the LSTM model study, which needs consecutive measurement results. Vital signs are typically not monitored as regularly in general wards as they are in ICUs, but the frequency increases if a patient’s condition deteriorates. We were able to create a prediction model without the necessity of 20 consecutive observations because our prediction model was built to reflect this idea. As a result, predictions can be made before a collection of subsequent measurements is complete.

In the detailed analysis of critical events, AUROC consistently exceeded 0.96 for all CPR occurrences, ICU transfers, and deaths, mirroring the performance in predicting overall critical events. However, AUPRC exhibited a notable decline, possibly stemming from the model's lack of specialized training for individual events. Subsequent subgroup analysis by time interval yielded unexpected results. Contrary to expectations, proximity to critical events did not necessarily enhance prediction performance. Remarkably, the model demonstrated superior results across all time periods compared to the overall critical events prediction. The black box nature of deep learning made it challenging for the authors to provide a definitive explanation for these results. Yet, upon reflection, it was noted that the model was developed without the intention of making predictions based on a series of continuous measurements; instead, it analyzed only measurements from a single timestamp. Another crucial point to consider was that the parameters used for learning did not incorporate information capable of estimating the time from measurement to event occurrence, which was deemed a significant explanatory factor.

The persisting question surrounded the superior results observed in the time-specific subgroups compared to the overall performance. It was hypothesized that as measurements corresponding to critical events were divided into subgroups, the imbalance between the non-critical group and critical subgroups increased, thus maintaining an excellent AUROC. Additionally, to explain the enhanced AUPRC, the authors considered the homogeneity of the data. The non-critical group in the study comprised the last vital sign measurements taken before discharge from patients without a critical event, making it a relatively stable and homogeneous group. Conversely, the critical group, subject to medical interventions, naturally exhibited diversity in collected measurement values. It was reasoned that the longer the collection time, the greater the diversity, and narrowing the collection time window would decrease this diversity. Therefore, as the time window for measurement value collection narrowed, the homogeneity of the collected measurement values increased. Even if measurements at 5–6 h were relatively stable compared to those at 0–1 h, the existence of characteristics clearly distinguishable from the non-critical group just before discharge could contribute to elevated AUROC and AUPRC levels. Still, it is crucial to acknowledge that this explanation is rooted in assumptions and hypotheses, lacking concrete, objective evidence. Therefore, the interpretation and judgment of these findings are ultimately left to the readers.

This study has several limitations. The first is that no external validation was done, as the study was conducted at only one center. During the early stages of development, the PEWS performed outstandingly, but validation tests conducted in diverse settings had mixed results. Although the AUROC and AUPRC of our predictive model were high, we cannot ensure that the performance can be reproduced in other hospitals or in other target populations, as in the case of PEWS. Although a 30% dropout was applied after each layer to reduce overfitting, the possibility of overfitting to the dataset in this study cannot be ruled out. Therefore, it is necessary to conduct follow-up studies for external validation in collaboration with other hospitals. Another limitation is that the measurement interval is inevitably missing for the first measurement of each IHID, and in this case, the average of all measurement intervals was substituted. Considering that the factor with the most influence on our predictive model is the measurement interval (Fig. 3), it may be difficult to guarantee predictive performance with only the first measurement. However, the total measurement interval was 240 (162.0–480.0) minutes (Table 2), and the SHAP value changed rapidly when the measurement interval was low (Fig. 4). Therefore, the possibility of significantly changing the risk can be considered sufficiently low even if the average value of the interval was used for the first measurement taken in the general ward.

In addition, even though deep learning models exhibit proficiency in predicting critical events, it is imperative to closely monitor a patient's organ function preceding major occurrences such as CPR or mortality. Despite the strong predictive capabilities of these models, the meticulous monitoring of a patient's organ function by medical staff remains indispensable for gaining insights into the patient's dynamic health status, allowing timely interventions and personalized care. We believe that the synergistic use of predictive models and continuous monitoring can ensure a comprehensive and proactive approach to patient care in critical situations.

Conclusion

Herein, we developed a deep learning model that predicts critical events using simplified variables. The model performed excellently and did not require consecutive serial measurements. A well-designed follow-up multicenter study is needed for external validation.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Nadkarni, V. M. et al. First documented rhythm and clinical outcome from in-hospital cardiac arrest among children and adults. JAMA 295, 50–57. https://doi.org/10.1001/jama.295.1.50 (2006).
Olotu, A. et al. Characteristics and outcome of cardiopulmonary resuscitation in hospitalised African children. Resuscitation 80, 69–72. https://doi.org/10.1016/j.resuscitation.2008.09.019 (2009).
Shimoda-Sakano, T. M., Paiva, E. F., Schvartsman, C. & Reis, A. G. Factors associated with survival and neurologic outcome after in-hospital cardiac arrest in children: A cohort study. Resusc. Plus 13, 100354. https://doi.org/10.1016/j.resplu.2022.100354 (2023).
Duncan, H., Hutchison, J. & Parshuram, C. S. The pediatric early warning system score: A severity of illness score to predict urgent medical need in hospitalized children. J. Crit. Care 21, 271–278. https://doi.org/10.1016/j.jcrc.2006.06.007 (2006).
Lambert, V., Matthews, A., MacDonell, R. & Fitzsimons, J. Paediatric early warning systems for detecting and responding to clinical deterioration in children: A systematic review. BMJ Open 7, e014497. https://doi.org/10.1136/bmjopen-2016-014497 (2017).
Parshuram, C. S., Hutchison, J. & Middaugh, K. Development and initial validation of the bedside paediatric early warning system score. Crit. Care 13, R135. https://doi.org/10.1186/cc7998 (2009).
Edwards, E. D., Mason, B. W., Oliver, A. & Powell, C. V. Cohort study to test the predictability of the Melbourne criteria for activation of the medical emergency team. Arch. Dis. Child 96, 174–179. https://doi.org/10.1136/adc.2010.187617 (2011).
Elencwajg, M. et al. Usefulness of an early warning score as an early predictor of clinical deterioration in hospitalized children. Arch. Argent Pediatr. 118, 399–404. https://doi.org/10.5546/aap.2020.eng.399 (2020).
Chapman, S. M. et al. ‘The score matters’: Wide variations in predictive performance of 18 paediatric track and trigger systems. Arch. Dis. Child 102, 487–495. https://doi.org/10.1136/archdischild-2016-311088 (2017).
Trubey, R. et al. Validity and effectiveness of paediatric early warning systems and track and trigger tools for identifying and reducing clinical deterioration in hospitalised children: A systematic review. BMJ Open 9, e022105. https://doi.org/10.1136/bmjopen-2018-022105 (2019).
McLellan, M. C., Gauvreau, K. & Connor, J. A. Validation of the children’s hospital early warning system for critical deterioration recognition. J. Pediatr. Nurs. 32, 52–58. https://doi.org/10.1016/j.pedn.2016.10.005 (2017).
Robson, M. A., Cooper, C. L., Medicus, L. A., Quintero, M. J. & Zuniga, S. A. Comparison of three acute care pediatric early warning scoring tools. J. Pediatr. Nurs. 28, e33-41. https://doi.org/10.1016/j.pedn.2012.12.002 (2013).
Mandell, I. M. et al. Pediatric early warning score and unplanned readmission to the pediatric intensive care unit. J. Crit. Care 30, 1090–1095. https://doi.org/10.1016/j.jcrc.2015.06.019 (2015).
Gold, D. L., Mihalov, L. K. & Cohen, D. M. Evaluating the pediatric early warning score (PEWS) system for admitted patients in the pediatric emergency department. Acad. Emerg. Med. 21, 1249–1256. https://doi.org/10.1111/acem.12514 (2014).
Zhai, H. et al. Developing and evaluating a machine learning based algorithm to predict the need of pediatric intensive care unit transfer for newly hospitalized children. Resuscitation 85, 1065–1071. https://doi.org/10.1016/j.resuscitation.2014.04.009 (2014).
Park, S. J. et al. Development and validation of a deep-learning-based pediatric early warning system: A single-center study. Biomed. J. 45, 155–168. https://doi.org/10.1016/j.bj.2021.01.003 (2022).
Hwang, S. & Lee, B. Machine learning-based prediction of critical illness in children visiting the emergency department. PLoS One 17, e0264184. https://doi.org/10.1371/journal.pone.0264184 (2022).
Edwards, E. D., Powell, C. V., Mason, B. W. & Oliver, A. Prospective cohort study to test the predictability of the Cardiff and Vale paediatric early warning system. Arch. Dis. Child 94, 602–606. https://doi.org/10.1136/adc.2008.142026 (2009).
Parshuram, C. S., Bayliss, A., Reimer, J., Middaugh, K. & Blanchard, N. Implementing the bedside paediatric early warning system in a community hospital: A prospective observational study. Paediatr. Child Health 16, e18-22. https://doi.org/10.1093/pch/16.3.e18 (2011).
Rigby, R. A. & Stasinopoulos, D. M. Smooth centile curves for skew and kurtotic data modelled using the Box-Cox power exponential distribution. Stat. Med. 23, 3053–3076. https://doi.org/10.1002/sim.1861 (2004).
Rigby, R. A. & Stasinopoulos, D. M. Automatic smoothing parameter selection in GAMLSS with an application to centile estimation. Stat. Methods Med. Res. 23, 318–332. https://doi.org/10.1177/0962280212473302 (2014).
Cole, T. J., Donaldson, M. D. & Ben-Shlomo, Y. SITAR—A useful instrument for growth curve analysis. Int. J. Epidemiol. 39, 1558–1566. https://doi.org/10.1093/ije/dyq115 (2010).
Schmidt-Hieber, J. Nonparametric regression using deep neural networks with ReLU activation function. Ann. Stat. 48, 1875–1897 (2020).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Paszke, A. et al. PyTorch: An imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 32, 32 (2019).
Rodriguez-Perez, R. & Bajorath, J. Interpretation of machine learning models using Shapley values: Application to compound potency and multi-target activity predictions. J. Comput. Aided Mol. Des. 34, 1013–1026. https://doi.org/10.1007/s10822-020-00314-0 (2020).
Le Lagadec, M. D. & Dwyer, T. Scoping review: The use of early warning systems for the identification of in-hospital patients at risk of deterioration. Aust. Crit. Care 30, 211–218. https://doi.org/10.1016/j.aucc.2016.10.003 (2017).
Balwi, M. K. M., Yee, D. W., Thukiman, K. & Haziqah, A. The relationship between workload and burnout among the medical staff in hospital. Sains Humanika 13, 2 (2021).
Ullah, E. et al. Workload involved in vital signs-based monitoring & responding to deteriorating patients: A single site experience from a regional New Zealand hospital. Heliyon 8, e10955. https://doi.org/10.1016/j.heliyon.2022.e10955 (2022).

Acknowledgements

This study was conducted with funding from the Department of Pediatrics Designated Fund at Seoul National University College of Medicine (assignment number 800-20220390).

Author information

These authors contributed equally: Yonghyuk Jeon and You Sun Kim.

Authors and Affiliations

Department of Pediatrics, Seoul National University College of Medicine, Seoul National University Hospital, 101, Daehak-ro, Jongno-gu, Seoul, 03080, Korea
Yonghyuk Jeon, Wonjin Jang, June Dong Park & Bongjin Lee
Department of Pediatrics, National Medical Center, Seoul, Republic of Korea
You Sun Kim
Innovative Medical Technology Research Institute, Seoul National University Hospital, Seoul, Republic of Korea
Bongjin Lee

Contributions

Study concept and design: B.L. Data collection, analysis and cleaning: Y.J. and B.L. Interpretation of data: Y.J. and B.L. Drafting and revising the manuscript: Y.S.K., W.J., J.D.P., and B.L. Critical revision: W.J. and B.L. Supervision of the original draft and article: B.L. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Bongjin Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Jeon, Y., Kim, Y.S., Jang, W. et al. Development of a deep learning model that predicts critical events of pediatric patients admitted to general wards. Sci Rep 14, 4707 (2024). https://doi.org/10.1038/s41598-024-55528-1

Received: 31 August 2023
Accepted: 24 February 2024
Published: 27 February 2024
DOI: https://doi.org/10.1038/s41598-024-55528-1
