Predicting early mortality and severe intraventricular hemorrhage in very-low birth weight preterm infants: a nationwide, multicenter study using machine learning

Yang, Yun-Hsiang; Wang, Ts-Ting; Su, Yi-Han; Chu, Wei-Ying; Lin, Wei-Ting; Chen, Yen-Ju; Chang, Yu-Shan; Lin, Yung-Chieh; Lin, Chyi-Her; Lin, Yuh-Jyh

doi:10.1038/s41598-024-61749-1

Download PDF

Article
Open access
Published: 12 May 2024

Predicting early mortality and severe intraventricular hemorrhage in very-low birth weight preterm infants: a nationwide, multicenter study using machine learning

Yun-Hsiang Yang¹,
Ts-Ting Wang^1,2,
Yi-Han Su¹,
Wei-Ying Chu¹,
Wei-Ting Lin¹,
Yen-Ju Chen¹,
Yu-Shan Chang³,
Yung-Chieh Lin¹,
Chyi-Her Lin^1,4 &
…
Yuh-Jyh Lin¹

Scientific Reports volume 14, Article number: 10833 (2024) Cite this article

571 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Our aim was to develop a machine learning-based predictor for early mortality and severe intraventricular hemorrhage (IVH) in very-low birth weight (VLBW) preterm infants in Taiwan. We collected retrospective data from VLBW infants, dividing them into two cohorts: one for model development and internal validation (Cohort 1, 2016–2021), and another for external validation (Cohort 2, 2022). Primary outcomes included early mortality, severe IVH, and early poor outcomes (a combination of both). Data preprocessing involved 23 variables, with the top four predictors identified as gestational age, birth body weight, 5-min Apgar score, and endotracheal tube ventilation. Six machine learning algorithms were employed. Among 7471 infants analyzed, the selected predictors consistently performed well across all outcomes. Logistic regression and neural network models showed the highest predictive performance (AUC 0.81–0.90 in both internal and external validation) and were well-calibrated, confirmed by calibration plots and the lowest two mean Brier scores (0.0685 and 0.0691). We developed a robust machine learning-based outcome predictor using only four accessible variables, offering valuable prognostic information for parents and aiding healthcare providers in decision-making.

Predicting severe intraventricular hemorrhage or early death using machine learning algorithms in VLBWI of the Korean Neonatal Network Database

Article Open access 15 May 2024

Predicting clinical outcomes using artificial intelligence and machine learning in neonatal intensive care units: a systematic review

Article 13 May 2022

Prediction of extubation failure among low birthweight neonates using machine learning

Article 07 January 2023

Introduction

The first week of life is considered the most vulnerable period for newborns in terms of mortality, particularly among very low birth weight (VLBW) preterm infants¹. Despite surviving these critical initial weeks following birth, VLBW preterm infants remain at a heightened risk for adverse long-term neurodevelopmental outcomes². This risk can be primarily attributed to intraventricular hemorrhage (IVH), with approximately 95% of IVH cases occurring during this period³.

The significance of a dependable and timely risk assessment tool for early mortality and incidence of severe IVH cannot be overstated. This tool could not only provide a structured framework for parents and healthcare providers during the decision-making process but also offer valuable insights into recommending appropriate levels of care based on estimations of mortality and poor outcomes. For example, patients with a high probability of severe IVH may require tailored circulatory management strategies⁴. Moreover, the early identification of infants at the highest risk of developing severe IVH holds promise for enhancing the design of future clinical studies and optimizing the selection of participants for trials⁵.

In Taiwan, the incidence of preterm births has gradually increased from 8.85% in 2004 to 10.73% in 2014, a trend observed on a global scale⁶ Notably, the preterm birth rate in Taiwan has surpassed that in most OECD countries⁷ However, to the best of our knowledge, the existing literature has only identified certain risk factors associated with mortality and severe IVH in Taiwan^8,9,10 The establishment of a nationwide outcome predictor applicable for the Taiwanese population remains an unmet need.

Therefore, this study aimed to develop and validate a straightforward machine learning (ML)-based outcome estimator, utilizing readily available data shortly after birth, to predict the probability of early mortality and development of severe IVH in VLBW preterm infants.

Methods

Study design and population cohorts

In this retrospective observational study, cohort data of VLBW preterm infants was obtained from the Taiwan Neonatal Network, established in 2016 to compile nationwide clinical data of preterm infants delivered in Taiwan from 33 medical centers. The enrollment criteria outlined by the Taiwan Neonatal Network include live-born infants born in Taiwan, with birth weights ranging from 401 to 1500 g or gestational ages ranging from 22 weeks 0 days to 29 weeks 6 days. This data was then used to establish and investigate two cohorts.

Cohort 1 comprised infants born between 2016 and 2021. Their data were collected for subsequent model development, internal validation and model comparison. Cohort 2 comprised infants born in 2022 and was included in the external validation.

The inclusion criteria were gestational age (GA) between 22 weeks and 0 days to 36 weeks and 6 days and a birth body weight (BBW) of less than 1500 g. Infants with missing data were excluded.

This study has been approved by the National Cheng Kung University Hospital Institutional Review Board (A-ER-111–115). The need of informed consent was waived by the National Cheng Kung University Hospital Institutional Review Board due to the fact that data were anonymized and de-identified. All methods were performed in accordance with the relevant guidelines and regulations.

Outcomes

The primary outcomes of the study included: early mortality, severe IVH, and early poor outcomes (early mortality or severe IVH). Early mortality was defined as death within the first week of life and severe IVH was defined as IVH grade III or IV on cranial ultrasound, graded using Volpe’s grading system¹¹

Data preprocessing

We collected essential data as variables for each enrolled infant, resulting in a total of 23 variables. These variables included the following: antenatal steroid use; prenatal magnesium sulphate (MgSO4) use; pregnancy-induced hypertension; chorioamnionitis; GA; BBW; multiple births; Cesarean section; small for GA (defined as birth weight below the 10th percentile for GA, referencing values for birth weight distributions from a previous study of the Taiwanese population)¹²; sex; 1-min Apgar score; 5-min Apgar score; body temperature (defined as the rectal temperature measured for the first time within the first hour of birth); early-onset sepsis (defined as culture-proven sepsis occurring within 72 h of birth); respiratory distress syndrome; congenital anomalies (including chromosomal anomalies, skeletal dysplasia, inborn errors of metabolism, lethal or life-threatening anomalies in the cardiovascular, gastrointestinal, genitourinary, or pulmonary system, and other lethal or life-threatening anomalies); and seven delivery room resuscitation managements, including, neonatal resuscitation, oxygen supplementation, delivery room continuous positive airway pressure ventilation, positive pressure ventilation, endotracheal tube ventilation, chest compressions, and epinephrine administration. RapidMiner software version 10.0 (Altair Engineering, Troy, MI, USA; www.rapidminer.com) was used for data input and the cleaning of missing data.

Selection of variables

To facilitate practical applicability, we conducted variable selection using the information gain attribute evaluator provided by Weka software version 3.8.6 (Waikato Environment for Knowledge Analysis, Hamilton, New Zealand). After measuring the entropy gain in relation to the outcomes, an information gain attribute evaluator was used to evaluate the significance of each of the 23 variables¹³ Additionally, we conducted an evaluation of collinearity between each variable. In the interest of establishing a more streamlined model, we selected the top-ranked variables based on their ranking.

ML algorithms and model building

The flow chart for building models using ML algorithms via Orange software version 3.34.0 (University of Ljubljana, Ljubljana, Slovenia)¹⁴ is shown in Fig. 1.

These models were developed using six algorithms: k-nearest neighbor (kNN), decision tree, random forest, neural network, logistic regression, and gradient boosting.

Brief descriptions of the six ML models are as follows:

The kNN algorithm¹⁵ is an ML instance-based model that stores all instances of the training dataset and makes predictions based on neighborhood proximity, as defined by a similarity metric.
The decision tree algorithm¹⁶ is a tree-structured prediction model that starts with a root node and progresses to a leaf node. Each internal node represented a predictor variable, each internal node connection represented a choice, and each leaf node represented the outcome variable.
The random forest algorithm¹⁷ is an ML ensemble model that combines multiple decision trees to achieve increased prediction accuracy. Each uncorrelated decision tree in the random forest makes a prediction, and the prediction with the largest number of votes is used as the final prediction for the algorithm.
The neural network algorithm¹⁸ is an ML model that mimics the signal transmission through neurons in the human brain. The algorithm comprises multiple layers of nodes: an input layer, multiple hidden layers, and an output layer. Each node functions as a neuron, with a threshold value. If the collected signal reaches this threshold, the nodes are activated and the signal is transferred to the next layer in the network. Predictions were continuously generated until the signal reached the output layer.
The logistic regression algorithm¹⁹ was used for binary and multiclass classifications. It utilizes a cost function, often known as a sigmoid function, to provide an estimate of probability values ranging from zero to one.
The gradient boosting algorithm²⁰ is another ensemble model that incorporates a large number of ML models to provide strong predictors. The algorithm uses a gradient boosting technique to calculate the residual error by training a simple base learner on all the training datasets. A new learner is then created to forecast the prior residual error and increase the accuracy of the prediction model.

Internal evaluation

A tenfold cross-validation approach was employed for internal model validation. The dataset was randomly divided into 10 groups, with nine groups used for training and one for testing in each iteration. The average performance of the test results was subsequently used to assess the overall performance of the model across all the groups.

Model comparison

The performance of all prediction models was assessed by comparing the area under the curve (AUC) using the Orange software. Additionally, calibration plots and mean Brier scores, calculated with the assistance of Python, were employed to evaluate the predictive ability and goodness of fit of the models. This facilitated the observation of agreement between the actual and predicted probabilities.

External validation

The predictive models that exhibited outstanding performance, developed using the Cohort 1 dataset, were subsequently applied to the Cohort 2 dataset for external validation. Furthermore, the AUCs were computed to assess their performance in this independent dataset.

Equation development

The intercepts and coefficients for the selected attributes across different outcomes were calculated using Orange software. Subsequently, we formulated the corresponding equations and developed estimators to predict the probabilities of various target outcomes.

Results

Study population and patient characteristics

A total of 8531 newborns were enrolled during the study period. However, 711 newborns were excluded due to missing data and 349 were excluded because they died within 12 h of delivery. Consequently, 7471 newborns with complete records were included in the final study. Cohort 1 and 2 included 6558 and 913 infants, respectively.

In Cohort 1 (Table 1), there was a significant difference (p < 0.05) between each variable and target outcome, except for: the use of prenatal MgSO4 between the group with and without severe IVH (p = 0.157); multiple births, across all outcomes (p = 0.671 in early mortality, p = 0.32 severe IVH, and p = 0.22 early poor outcomes); and congenital anomalies between the group with and without severe IVH (p = 0.76).

Table 1 Demographic data of the participants.

Full size table

In Cohort 2 (Table 1), there were no significant differences in antenatal steroid use, prenatal MgSO4 use, pregnancy-induced hypertension, multiple births, Cesarean section, small for GA, sex, early onset sepsis, congenital anomalies, neonatal resuscitation, oxygen supplement, chest compression, or epinephrine administration between infants with and without early mortality (p = 0.17, 0.19, 0.76, 0.38, 0.49, 0.38, 0.97, 0.29, 0.57, 0.64, 0.45, 0.16, 0.60, respectively). Similarly, there were no significant differences in multiple births between the group with and without severe IVH (p = 0.29) and with and without early poor outcomes (p = 0.20). The discrepancy observed, wherein significant differences were found between each variable and the target outcome in Cohort 1, whereas such differences were not apparent in Cohort 2, could potentially be attributed to the limited sample size of Cohort 2.

Selection of predictors

Attribute selection, based on the Weka information gain attribute evaluator, enabled the condensed and generic application of the prediction models. The actual values generated by the evaluator for each variable were listed in Fig. 2 and Supplementary Table S1, revealing notable distinctions between the top five ranked variables and those ranked sixth and beyond. Furthermore, variables ranked second to fifth exhibited similar scores. Consequently, the initial selection included the top five variables: gestational age (GA), birth body weight (BBW), 1-min Apgar score, 5-min Apgar score, and endotracheal tube ventilation during initial resuscitation, for model development.

Additionally, considering collinearity concerns, further analysis was conducted using Variance Inflation Factor (VIF) values²¹ as presented in the Supplementary Table S2. This analysis indicated significant collinearity between the 1-min Apgar score and the 5-min Apgar score. Based on prior research²² The 5-min Apgar score is regarded as a more reliable predictor of neonatal outcomes compared to the 1-min Apgar score. Therefore, we opted to exclude the 1-min Apgar score from our prediction variables during model development.

Model development and comparison

The four most crucial variables, which were top-ranked and showed no significant collinearity, were utilized in the development of prediction models using Orange software. The internally validated receiver operating characteristic (ROC) curve results (Fig. 3) indicated that the neural network, logistic regression, and gradient boosting models were the most optimal predictive models for all target outcomes, with AUC values of 0.87, 0.86, and 0.86, respectively, for the prediction of early mortality; 0.82, 0.82, and 0.81, respectively, for severe IVH; and 0.84, 0.84, and 0.83, respectively, for early poor outcomes. The calibration plot illustrates the consistency between predictions and observations across different percentiles of predicted values. Comparing the calibration of all models through a scatter plot reveals the agreement between predictions and observations. According to Fig. 4, both logistic regression and neural network models demonstrated superior calibration performance, as depicted in the calibration plot. Furthermore, the logistic regression model achieved the best mean Brier score across three predictive outcomes, with a score of 0.0685, followed closely by the neural network model, which attained the lowest mean Brier Score of 0.06906. In contrast, the kNN and decision tree models exhibited less favorable calibration performance, with the highest mean Brier scores recorded at 0.0811 and 0.08123, respectively.

For external validation by Cohort 2, we utilized the most powerful prediction models, namely logistic regression and neural network models. The results of the ROC curve analysis (Fig. 5) indicated exceptional predictive capabilities across all outcomes. Specifically, the AUC values were 0.90 and 0.89, respectively, for early mortality prediction; 0.84 and 0.83, respectively, for severe IVH prediction; and 0.86 and 0.84 for early poor outcome prediction for the logistic regression and neural network models, respectively.

Equation development

We used Orange software to calculate the intercepts and coefficients necessary for constructing the prediction models through logistic regression. The results are summarized in Table 2. An equation was formulated for each target outcome as follows: outcome estimators suitable for clinical applications were developed using Microsoft Excel 2016.

Table 2 Intercept and coefficient values of the attributes in various models developed using logistic regression.

Full size table

As an illustrative example, consider a premature male infant born with a GA of 24 weeks and birth weight within the range of 601–700 g. The 5-min Apgar scores were 6, respectively. Importantly, intubation was not required during initial neonatal resuscitation in the delivery room. By inputting these parameters into the outcome estimator, we ascertained the following probabilities: 20% likelihood of early mortality, 35% likelihood of severe IVH, and 44% likelihood of early poor outcomes (Table 3).

Table 3 A table of the early poor outcomes estimator.

Full size table

Discussion

In this study, we used a nationwide retrospective database comprising data on VLBW preterm infants and their associated variables collected immediately after their initial management in the delivery room. Our objective was to develop a predictive model for early mortality, severe IVH, and early poor outcomes using an -ML approach. Following the application of this approach, we identified GA, BBW, 5-min Apgar score, and intubation in the delivery room as the top four most crucial factors for constructing prediction models. Notably, we found that both the logistic regression and neural network models demonstrate superior performance, as indicated by their higher AUROC values. This suggests that they have better discriminative ability in distinguishing between different outcomes. Additionally, these models are well-calibrated, meaning that the predicted probabilities align closely with the observed frequencies of outcomes. Moreover, they have been effectively validated across different cohorts within this study, highlighting their robustness and generalizability across diverse populations or settings. Overall, the logistic regression and neural network models excel in terms of their high AUROC values, good calibration, and successful validation across various cohorts, making them reliable predictors of outcomes in this study.

Currently available scoring systems for predicting early mortality in neonates include: the Clinical Risk Index for Babies (CRIB) II²³ Score for Neonatal Acute Physiology Perinatal Extension II (SNAPPE-II)²⁴ and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) calculator²⁵ for neonatal conditions or outcomes. These prediction models have been widely employed and subjected to external validation in multiple studies²⁶

In our research, similar to CRIB II and NICHD, we identified GA and BBW as significant risk factors. A systematic review underscored the significance of these risk factors in neonatal mortality in neonatal intensive care units, with GA and BBW emerging as the most frequently cited contributors to neonatal mortality²⁷ Additionally, an investigation conducted on the Taiwanese population, using data from birth certificates and death registries, established a robust correlation between GA, BBW, and the incidence of early mortality²⁸

In 1952, Dr. Virginia Apgar pioneered the development of a scoring system designed to evaluate the physical condition of newborns and gauge their need for resuscitative interventions. Her groundbreaking work revealed a significant correlation between neonatal survival up to 28 days of age and the infant’s condition at delivery²⁹ Notably, contemporary research has substantiated the enduring relevance of the Apgar score, reaffirming its significance nearly five decades later³⁰

Although the Apgar score was initially conceived to assess term infants during an era characterized by high neonatal mortality rates among preterm infants, a recent investigation showed that the relative risk of neonatal mortality consistently escalates as the Apgar score diminishes across all GA categories³¹ Similarly, we included the Apgar score as a pivotal variable for outcome prediction in our study.

In our study, intubation emerged as the most important variable among all initial management procedures conducted in the delivery room. Notably, corroborative research conducted in countries such as Korea³² Iran³³ Thailand³⁴ and Brazil³⁵ has similarly identified intubation as a pivotal risk factor for neonatal outcomes.

In our study, antenatal steroid administration and multiple births did not demonstrate statistical significance as variables for outcome prediction despite their inclusion in the NICHD calculator. This discrepancy may be attributed to the high prevalence of antenatal steroid administration in Taiwan, where 85% of the patients in our study received this treatment, in contrast to the population encompassed by the NICHD calculation, where approximately 70% received antenatal steroids. These demographic differences within the study population may have attenuated the influence of these variables on study outcomes.

In contrast, Boghossian et al.³⁶ reported that the beneficial effects of antenatal steroids on mortality were statistically significant, primarily in infants born between 24 and 25 weeks of gestation. This observation suggests that the efficacy of antenatal steroids in reducing mortality may be contingent on GA.

Multiple births were associated with a notably elevated risk of mortality, particularly among extremely premature infants born at 26 weeks of gestation or earlier, as indicated in prior research³⁷ In our study cohort, where the mean GA of the infants was 28.7 weeks, this characteristic may explain why antenatal steroid administration and multiple births were not significant factors in our analysis.

ML is a subset of artificial intelligence that has been extensively used in healthcare³⁸ According to a recent systematic review³⁹ concerning the deployment of ML models for forecasting neonatal mortality, prominent ML algorithms include neural networks, logistic regression, and random forests. The reviewed articles collectively reported a mean AUC range spanning from 58.3 to 97.0%, with the average exceeding 70%. These findings underscore the ability of ML models to predict neonatal mortality. In our ML -based predictive models, the AUC values demonstrated a comparable and laudable level of performance when juxtaposed with other ML-based models.

In the context of predicting IVH, it is noteworthy that all four variables incorporated into our predictor previously demonstrated strong predictive capabilities for IVH, with particular emphasis on GA. Furthermore, the significance of endotracheal tube ventilation has been underscored in the literature. Additionally, when comparing our IVH predictor to previous models (AUC 0.67–0.85 for severe IVH prediction), our predictor exhibits an outstanding performance⁴⁰

Notably, despite external validation of the CRIB II, SNAPPE-II, and NICHD prediction models in diverse study populations, none of these models incorporated data from the Taiwanese population into their assessments. Predictive methodologies rely heavily on epidemiological population data to predict specific outcomes⁴¹ It is important to emphasize that the utility of a predictive model may be compromised by the possibility that the model is built upon data that could become outdated by the time it undergoes validation.

To the best of our knowledge, our predictive model represents a pioneering endeavor in the development of outcome-predictive models. This was the first initiative to construct such models based on the most current and comprehensive datasets available in Taiwan. Moreover, our model can predict early mortality, severe IVH, and early poor outcomes in VLBW preterm infants immediately following their initial management in the delivery room. Remarkably, this predictive capability was achieved using only four factors, eliminating the need for time-consuming blood sampling; however, these inherent advantages may facilitate widespread application in the Taiwanese population.

Limitations

This study had several limitations. First, restrictions imposed by the available databases impeded the collection of precise clinical data such as blood pressure, oxygen demand, and comprehensive laboratory data encompassing hemograms, biochemical markers, and blood gas analyses. The inclusion of these clinical parameters could potentially enhance the predictive performance of the model^26,39. Second, for privacy protection, the Taiwan neonatal network database recorded anonymous information, with gestational age rounded down and birth body weight recorded in ranges. These unavoidable limitations may impact the collinearity between variables. Third, while our prediction models demonstrated a high degree of accuracy in forecasting outcomes, they lack adaptability over time. As clinical dynamics evolve, these models may experience a decline in predictive accuracy. Fourth, variations in management and procedures across institutions may introduce potential biases that could be unavoidable in our study. Fifth, it is important to acknowledge that ML models may inadvertently manifest bias and discriminatory tendencies. Therefore, additional external validations across diverse population groups are required. This validation should explore whether the model generated can be applied with equal efficacy to populations other than Taiwanese cohorts to ensure a broader range of applicability.

Conclusions

In this study, we developed an outcome predictor designed to predict early mortality, severe IVH, and early poor outcomes in preterm VLBW infants. This predictive model relied on the assessment of four readily available factors immediately after birth: GA, BBW, 5-min Apgar score, and endotracheal tube ventilation during initial resuscitation. Our analysis has yielded a formula that demonstrates exceptional performance, as evidenced by the high AUC values in both the internal validation cohort and the independent external validation population. Furthermore, it is well-calibrated, as evaluated by calibration plots and mean Brier scores. This prediction formula may prove to be a valuable tool and provide essential prognostic information for parents, aiding them in making informed decisions regarding the care and future of VLBW preterm infants. Furthermore, it may offer healthcare providers valuable guidance and facilitates the formulation of effective decision-making strategies for the clinical management of vulnerable infants. However, further validation across diverse populations is required to ensure broader applicability. Moreover, the inclusion of clinical parameters may further improve model accuracy.

Data availability

According to the Taiwan Neonatal Network (TNN) Database Availability and Application Policy, although being anonymized and de-identified, the data are confidential. The data from TNN must only be available to individuals who have access for the authorized research. The data from this study are available from the corresponding author upon reasonable request.

References

Brankovic, S., Hadziomerovic, A. M., Rama, A. & Segalo, M. Incidence of morbidity and mortality in premature infants at the Department of Neonatal Intensive Care of Pediatric Clinic, Clinical Center of Sarajevo University. Med. Arch. 67, 286–288 (2013).
Article PubMed Google Scholar
Mukerji, A., Shah, V. & Shah, P. S. Periventricular/intraventricular hemorrhage and neurodevelopmental outcomes: A meta-analysis. Pediatrics 136, 1132–1143 (2015).
Article PubMed Google Scholar
Guillot, M., Chau, V. & Lemyre, B. Routine imaging of the preterm neonatal brain. Paediatr. Child Health 25, 249–262 (2020).
Article PubMed PubMed Central Google Scholar
Toyoshima, K. et al. Tailor-made circulatory management based on the stress-velocity relationship in preterm infants. J. Formos. Med. Assoc. 112, 510–517 (2013).
Article PubMed Google Scholar
Ahn, S. Y., Chang, Y. S., Sung, S. I. & Park, W. S. Mesenchymal stem cells for severe intraventricular hemorrhage in preterm infants: Phase I dose-escalation ClinicalTrial. Stem Cells Transl. Med. 7, 847–856 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wu, S. T. et al. Maternal risk factors for preterm birth in Taiwan, a nationwide population-based cohort study. Pediatr. Neonatol. 65, 38 (2023).
Article PubMed Google Scholar
Chang, J. Y. et al. Decreasing trends of neonatal and infant mortality rates in Korea: Compared with Japan, USA, and OECD nations. J. Korean Med. Sci. 26, 1115–1123 (2011).
Article ADS PubMed PubMed Central Google Scholar
Tsou, K. I. & Tsao, P. N. The morbidity and survival of very-low-birth-weight infants in Taiwan. Acta Paediatr. Taiwan 44, 349–355 (2003).
PubMed Google Scholar
Liao, M. F., Chaou, W. T. & Tsao, L. Y. Periventricular hemorrhage/intraventricular hemorrhage in premature infants. Acta Paediatr. Sin. 26, 135–142 (1985).
ADS Google Scholar
Chen, C. H., Wang, T. M., Wu, K. H. & Chi, C. S. Intraventricular hemorrhage in preterm neonates—A two year experience. Zhonghua Min Guo Xiao Er Ke Yi Xue Hui Za Zhi 34, 343–348 (1993).
CAS PubMed Google Scholar
Volpe, J. J. Intraventricular hemorrhage in the premature infant—Current concepts. Part I. Ann. Neurol. 25, 3–11 (1989).
Article CAS PubMed Google Scholar
Hsieh, W. S. et al. Nationwide singleton birth weight percentiles by gestational age in Taiwan, 1998–2002. Acta Paediatr. Taiwan 47, 25–33 (2006).
PubMed Google Scholar
Ramasamy, M. & Meena Kowshalya, A. Information gain based feature selection for improved textual sentiment analysis. Wirel. Pers. Commun. 125, 1203–1219 (2022).
Article Google Scholar
Demsar, J. et al. Orange: Data mining toolbox in python. J. Mach. Learn. Res. 14, 2349–2353 (2013).
Google Scholar
Zhang, Z. Introduction to machine learning: k-nearest neighbors. Ann. Transl. Med. 4, 218 (2016).
Article PubMed PubMed Central Google Scholar
Song, Y. Y. & Lu, Y. Decision tree methods: Applications for classification and prediction. Shanghai Arch. Psychiatry 27, 130–135 (2015).
PubMed PubMed Central Google Scholar
Rigatti, S. J. Random forest. J. Insur. Med. 47, 31–39 (2017).
Article PubMed Google Scholar
Choi, R. Y., Coyner, A. S., Kalpathy-Cramer, J., Chiang, M. F. & Campbell, J. P. Introduction to machine learning, neural networks, and deep learning. Transl. Vis. Sci. Technol. 9, 14 (2020).
PubMed PubMed Central Google Scholar
Nick, T. G. & Campbell, K. M. Logistic regression. Methods Mol. Biol. 404, 273–301 (2007).
Article CAS PubMed Google Scholar
Natekin, A. & Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 7, 21 (2013).
Article PubMed PubMed Central Google Scholar
Kock, N. Lateral collinearity and misleading results in variance-based SEM: An illustration and recommendations. J. Assoc. Inf. Syst. 13, 546–580 (2012).
Google Scholar
Li, F. et al. The Apgar score and infant mortality. PLoS ONE 8(7), e69072 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Parry, G., Tucker, J., Tarnow-Mordi, W., UK Neonatal Staffing Study Collaborative Group. CRIB II: An update of the clinical risk index for babies score. Lancet 361, 1789–1791 (2003).
Article PubMed Google Scholar
Richardson, D. K., Corcoran, J. D., Escobar, G. J. & Lee, S. K. SNAP-II and SNAPPE-II: Simplified newborn illness severity and mortality risk scores. J. Pediatr. 138, 92–100 (2001).
Article CAS PubMed Google Scholar
Tyson, J. E. et al. Intensive care for extreme prematurity—Moving beyond gestational age. N. Engl. J. Med. 358, 1672–1681 (2008).
Article CAS PubMed PubMed Central Google Scholar
Medlock, S., Ravelli, A. C. J., Tamminga, P., Mol, B. W. M. & Abu-Hanna, A. Prediction of mortality in very premature infants: A systematic review of prediction models. PLoS ONE 6, e23441 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Kermani, F., Sheikhtaheri, A., Zarkesh, M. R. & Tahmasebian, Sh. Risk factors for neonatal mortality in neonatal intensive care units (NICUs): A systematic literature review and comparison with scoring systems. J. Pediatr. Neonatal Individ. Med. 9, 1–15 (2020).
Google Scholar
Hsu, S. T. et al. Nationwide birth weight and gestational age-specific neonatal mortality rate in Taiwan. Pediatr. Neonatol. 56, 149–158 (2015).
Article PubMed Google Scholar
Apgar, V. A proposal for a new method of evaluation of the newborn infant. Curr. Res. Anesth. Analg. 32, 260–267 (1953).
Article CAS PubMed Google Scholar
Casey, B. M., McIntire, D. D. & Leveno, K. J. The continuing value of the Apgar score for the assessment of newborn infants. N. Engl. J. Med. 344, 467–471 (2001).
Article CAS PubMed Google Scholar
Cnattingius, S., Johansson, S. & Razaz, N. Apgar score and risk of neonatal death among preterm infants. N. Engl. J. Med. 383, 49–57 (2020).
Article PubMed Google Scholar
Park, H. W., Park, S. Y. & Kim, E. A. Prediction of in-hospital mortality after 24 hours in very low birth weight infants. Pediatrics 147, e2020004812 (2021).
Article PubMed Google Scholar
Sheikhtaheri, A., Zarkesh, M. R., Moradi, R. & Kermani, F. Prediction of neonatal deaths in NICUs: Development and validation of machine learning models. BMC Med. Inform. Decis. Mak. 21, 131 (2021).
Article PubMed PubMed Central Google Scholar
Sritipsukho, S., Suarod, T. & Sritipsukho, P. Survival and outcome of very low birth weight infants born in a university hospital with level II NICU. J. Med. Assoc. Thai. 90, 1323–1329 (2007).
PubMed Google Scholar
Risso, P. & Nascimento, L. F. Risk factors for neonatal death in neonatal intensive care unit according to survival analysis. Rev. Bras. Ter. Intens. 22, 19–26 (2010).
Article Google Scholar
Boghossian, N. S. et al. Association of antenatal corticosteroids with mortality, morbidity, and neurodevelopmental outcomes in extremely preterm multiple gestation infants. JAMA Pediatr. 170, 593–601 (2016).
Article PubMed PubMed Central Google Scholar
Porta, R. et al. Morbidity and mortality of very low birth weight multiples compared with singletons. J. Matern. Fetal Neonatal Med. 32, 389–397 (2019).
Article PubMed Google Scholar
Lisboa, P. J. G. A review of evidence of health benefit from artificial neural networks in medical intervention. Neural Netw. 15, 11–39 (2002).
Article CAS PubMed Google Scholar
Mangold, C. et al. Machine learning models for predicting neonatal mortality: A systematic review. Neonatology 118, 394–405 (2021).
Article PubMed Google Scholar
Kumar, P. & Polavarapu, M. A simple scoring system for prediction of IVH in very-low-birth-weight infants. Pediatr. Res. 94, 2033–2039 (2023).
Article PubMed Google Scholar
Janota, J. et al. Characterization of multiple organ dysfunction syndrome in very low birthweight infants: A new sequential scoring system. Shock 15, 348–352 (2001).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank all the parents who participated in this study as well as the 33 team members and administrator of the TNN for their assistance with data collection and registration. They are grateful for the statistical consulting services provided by the Biostatistics Consulting Center at National Cheng Kung University Hospital. They would like to thank Editage (www.editage.com) for their writing support on the manuscript.

Author information

Authors and Affiliations

Department of Pediatrics, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, No.138, Sheng Li Road, Tainan, Taiwan
Yun-Hsiang Yang, Ts-Ting Wang, Yi-Han Su, Wei-Ying Chu, Wei-Ting Lin, Yen-Ju Chen, Yung-Chieh Lin, Chyi-Her Lin & Yuh-Jyh Lin
Department of Pediatrics, Ditmanson Medical Foundation Chia-Yi Christian Hospital, Chia-Yi, Taiwan
Ts-Ting Wang
Department of Emergency Medicine, Chi Mei Medical Center, Tainan, Taiwan
Yu-Shan Chang
Department of Pediatrics, E-Da Hospital, I-Shou University, Kaohsiung, Taiwan
Chyi-Her Lin

Authors

Yun-Hsiang Yang
View author publications
You can also search for this author in PubMed Google Scholar
Ts-Ting Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Han Su
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Ying Chu
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Ting Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yen-Ju Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Shan Chang
View author publications
You can also search for this author in PubMed Google Scholar
Yung-Chieh Lin
View author publications
You can also search for this author in PubMed Google Scholar
Chyi-Her Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yuh-Jyh Lin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.H.Y. and Y.J.L. conceptualized and designed the study, collected and analyzed the data, drafted the initial manuscript, and reviewed and revised the manuscript critically. T.T.W., Y.H.S., W.Y.C., W.T.L., Y.J.C., Y.S.C., Y.C.L., and C.H.L. assisted with the study design and data analysis, and critically revised the manuscript. All authors approved the final manuscript as submitted and agreed to be accountable for all aspects of the work.

Corresponding author

Correspondence to Yuh-Jyh Lin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yang, YH., Wang, TT., Su, YH. et al. Predicting early mortality and severe intraventricular hemorrhage in very-low birth weight preterm infants: a nationwide, multicenter study using machine learning. Sci Rep 14, 10833 (2024). https://doi.org/10.1038/s41598-024-61749-1

Download citation

Received: 05 January 2024
Accepted: 09 May 2024
Published: 12 May 2024
DOI: https://doi.org/10.1038/s41598-024-61749-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Predicting severe intraventricular hemorrhage or early death using machine learning algorithms in VLBWI of the Korean Neonatal Network Database

Predicting clinical outcomes using artificial intelligence and machine learning in neonatal intensive care units: a systematic review

Prediction of extubation failure among low birthweight neonates using machine learning

Introduction

Methods

Study design and population cohorts

Outcomes

Data preprocessing

Selection of variables

ML algorithms and model building

Internal evaluation

Model comparison

External validation

Equation development

Results

Study population and patient characteristics

Selection of predictors

Model development and comparison

Equation development

Discussion

Limitations

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links