Introduction

Intraventricular Hemorrhage (IVH), especially severe IVH (SIVH) classified using Papile's classification as Grade III and IV, can lead to significant long term neuro-developmental impairment in preterm infants1,2,3,4. The incidence of severe IVH varies inter-institutionally from three to twelve percent, while for any IVH it ranges from ten to thirty percent5.

Low dose indomethacin therapy has been shown to decrease the occurrence of IVH and white matter injury in the preterm infant population as well as improve long-term neurodevelopmental outcomes6,7,8,9. Additional data have demonstrated that early low dose indomethacin aids in early closure of patent ductus arteriosus (PDA), with lower need for surgical ligation of symptomatic PDA and less white matter injury on neuro-imaging10,11,12. Despite these benefits it is not widely used because of the potential risks from impaired platelet function and decreased systemic blood flow, resulting in necrotizing enterocolitis, spontaneous intestinal perforation and transient renal impairment. The use of indomethacin for IVH prophylaxis varies widely amongst NICU's, from no use to limited use. When using gestational age (GA) as a cut off point for treatment, infants less than 26 weeks are most often treated. This excludes preterm infants born at higher gestational age but who may remain at risk for SIVH.

Our primary aim was to develop a predictive model using clinical parameters available within the first six hours of life that would help target indomethacin therapy to preterm infants at highest risk of SIVH, while minimizing exposure to infants at lower risk.

Results

Observations from the four participating hospitals were restricted to the 2917 infants with complete data and excluded patients with any indomethacin exposure, as this is a prophylactic treatment for IVH. Infants excluded from the analysis because of missing ultrasound (n = 253) were significantly more likely to be outborn (34.0% vs. 19.4%. p<0.001), had a slightly greater mean gestational age [29.9 wks (±3.2) vs. 28.9 wks (±2.4), p< 0.001] and had higher mean birth weight [1359.7 grams (±491.8) vs. 1163.7 (±299.1), p<0.001]. No significant differences between infants with missing and non-missing ultrasound data emerged for sex, Apgar score, steroid use and mode of delivery.

Results of the univariable analyses for the 2917 infants with complete data are given in Table 1. Infants in the SIVH group had lower mean gestational age, birth weight, 5-minute APGAR score, antenatal steroid administration rate, were frequently outborn and delivered via C-Section.

Table 1 Distributions of the seven predictor clinical variables by IVH

Training and evaluating the models: Table 2 shows the results of model training and validation. These models were developed on infants who did not receive any indomethacin exposure (n = 2917). The areas under the curve (AUC) for the four separate training datasets were consistent, ranging from 0·84 to 0·86. Regarding the external validation, the AUCs for test data were also relatively consistent, ranging from 0·82 to 0·88.

Table 2 Model training and validation test results

In view of the consistency of the AUCs above, a final model was generated using data from all centers combined. The AUC for the final model is 0·85 (95% CI: 0·81 –0·88). Figure 1 shows a box-plot of predicted probabilities for cases and non-cases derived from the final model. Model calibration appears satisfactory, as demonstrated by the Hosmer-Lemeshow goodness-of-fit test, (p = 0·22) and the graph of the observed vs. predicted deciles (Figure 2).

Figure 1
figure 1

Box-plot of predicted probabilities for cases and non-cases derived from the final model.

Figure 2
figure 2

The graph of the observed vs. predicted SIVH risk by risk deciles.

Using the model equation we also developed a calculator available online which estimates the individual infant's risk of developing SIVH, after providing the seven clinical predictor variables (Figure 3).

Figure 3
figure 3

Snapshot of the online calculator which estimates the individual infant's risk of developing SIVH, after providing the seven clinical predictor variables.

Available online via open access at www.neoqic.org.

Final model applied to the indomethacin group: We also wanted to examine the effect of any indomethacin exposure on the incidence of severe IVH. We reasoned that if indomethacin prevents severe IVH, the observed risk of severe IVH in the sample of patients who had indomethacin exposure would be less than the risk predicted by our model, because the model was generated using only patients with no indomethacin exposure. Thus, we hypothesized that using the predictive model in the indomethacin group would result in discernibly poor fit. To this end, we applied the prediction model to the group of patients (n = 944) who had any indomethacin exposure. Table 3 shows the distribution of the seven predictor factors by whether indomethacin was given or not. On average, those receiving indomethacin were born earlier, had lower birth weights, were slightly more likely to have received antenatal steroids, were more frequently transferred from another institution and were slightly more likely to be delivered by C-section.

Table 3 Comparison of infants receiving indomethacin versus not receiving indomethacin for selected clinical prediction factors

Although those receiving indomethacin were more than twice as likely to have severe IVH (RR = 2·26, 95% CI: 1·80–2·82), applying the risk equation to those receiving indomethacin does suggest a beneficial effect. As anticipated because the initial model construction excluded patients with indomethacin exposure, the test of model fit indicates that the predictive model does not summarize well the IVH risk profile of those exposed to indomethacin (H-L goodness-of-fit: p < 0·001). Figure 4 shows the plot of the observed vs. predicted risk of severe IVH by risk decile. The total predicted risk among those receiving indomethacin is 14·9%, as compared to the actual risk of 12·6%. Comparing the predicted versus observed risk of severe IVH by risk decile demonstrated that the greatest level of discrepancy occurs in the highest risk decile. In this decile, the level of risk predicted by the model is 44·9%, whereas the observed risk is 23·4%. This difference may be attributable to the effect of indomethacin.

Figure 4
figure 4

The plot of the observed vs. predicted risk of severe IVH by risk decile for the Indomethacin exposed group.

Discussion

In preterm infants, the occurrence of severe IVH is associated with poor neurodevelopmental outcomes, such as severe motor and cognitive delay, as well as cerebral palsy. About thirty-five to forty percent of infants with SIVH will suffer from neurocognitive delays20. The scoring systems such as Clinical Risk Index for Babies (CRIB) and Score for Neonatal Acute Physiology (SNAP, SNAP- II, SNAPPE- II) predict illness severity, mortality and long term outcomes21,22,23. However, these have the limitation of requiring post natal data beyond the first few hours of life and inability to predict individual patient risk. This precludes their use for the early intervention strategies that need to be initiated within the first hours of life. Likewise Heuchan et al. for the Australian and New Zealand Neonatal Network, developed a predictive model to allow for risk adjustments for variation of IVH occurrence in different NICUs in their network24. They also used antenatal and perinatal factors most commonly associated with IVH to create their model but acknowledged the limitation of applying it to individual infants. Thus, at this time, we are unaware of any predictive model that can calculate the risk of intraventricular hemorrhage for any given infant early in life for counseling purposes as well as to assist the clinician in targeting neuroprotective strategies based on individual risk.

Primarily, we have created a simple, clinician-friendly, validated predictive model utilizing perinatal clinical parameters that are available at birth for preterm infants born ≤34 weeks gestational age, especially, for the subset at the highest risk decile for SIVH. Although any IVH increases the risk of neurodevelopmental impairment, the higher grades are more likely to affect long term outcomes. Thus, this is the reason we chose to direct our model specifically towards SIVH. In our model we found that low gestational age, birth weight, five minute Apgar score and being outborn had a strong association with occurrence of SIVH, which is in agreement with the current literature. Interestingly in our dataset cesarean delivery was associated with higher incidence of severe IVH. This differs from some reports which have noted that cesarean delivery may either be protective25, or have no difference against IVH as compared to vaginal delivery26. This difference may be due to our focus on severe IVH as opposed to any IVH reported in the other studies. We did find that maternal antenatal steroid administration had a protective effect which also agrees with previously published reports. We did not find gender to be an important contributor, despite overall morbidity and mortality data suggesting that female gender appears to provide benefit27. Additional clinical values, such as maternal magnesium therapy, admission temperature and blood pressure, may also further improve the predictive accuracy of the model. Future iterations of the model will incorporate this data.

As compared to the model by Heuchen et al. that had an ROC area under the curve of 0·76, our predictive model showed better discrimination (AUC = 0·85), as well as substantial agreement between observed and predicted risk24. This provides reassurance regarding the generalizability of the model to other patient populations with different clinical practices.

The model may provide clinical utility, by identifying preterm infants at highest risk of developing severe intraventricular hemorrhage and may allow targeted prophylactic indomethacin treatment, while minimizing adverse effects by sparing exposure to those at lower risk. Clinicians wishing to apply decision-making algorithms to individual patients, as suggested by Sinclair, require an estimate of the patient's baseline risk29. Up to now, the estimates of baseline risk were crude, as well as subjective, at best. The predictive model described herein provides the clinician an objective and validated estimate of the baseline risk necessary to apply therapeutic decision tools. We caution, though, that the model should be used to support the clinician's interpretation and decision-making – not replace it.

The majority of IVH occurs either prenatally or within the first seventy-two hours of birth, with a small percentage of IVH occurring later. Since multiple perinatal factors from pre-natal to post-natal are associated with IVH, there may be multiple opportunities to prevent SIVH at different times. Preventing or delaying preterm birth, delivery at a tertiary care perinatal center, maternal antenatal steroids and maternal magnesium sulfate for neuroprotection are some prenatal interventions. Decreasing the severity of respiratory disease by judicious use of surfactant therapy and prophylactic use of indomethacin therapy within first six hours of birth are examples of postnatal interventions. Low-dose indomethacin therapy may reduce the risk of severe IVH by 35%, as shown in the TIPP trial as well as additional studies6,9. Additionally, follow-up studies demonstrated no adverse outcomes at 36 months and at school age infants with IVH had worse outcomes than no IVH7,8,30. Of note though the long term follow-up studies did not adjust for other morbidities such as late onset sepsis, NEC, or BPD which may affect long term neurodevelopmental outcomes in the absence of severe IVH. Additionally Miller et al have shown in their prospective observational study that prophylactic indomethacin exposure was independently associated with a lower risk of white matter injury or brain abnormality, even when adjusted for other clinical parameters12. A recent Cochrane meta-analysis also noted significant positive effects of prophylactic indomethacin therapy, including decreased incidence of periventricular leukomalacia or PDA and need for surgical ligation10. Although there are concerns related to the vasoconstrictive mechanism of indomethacin, none of the studies documented a higher incidence of necrotizing enterocolitis or renal complications. Despite this there has not been wide acceptance of this potentially beneficial therapy. Clyman et al have suggested that indomethacin prophylaxis may be a reasonable choice in intensive care nurseries where IVH and PDA are frequent management problems28.

Further support for the validity of the model is seen when the model is applied to infants receiving indomethacin. Assuming that any exposure to indomethacin would confer some protection against SIVH, we would expect our model to over-predict the risk of SIVH among exposed infants and thus, fit poorly. This is in fact what we found. Overall, the model predicted 14·9% incident SIVHs among indomethacin-exposed infants, compared to the actual incidence of 12·6%.

It is clear, however, that the model requires further development and validation. While the overall prediction characteristics of the model are good, there were some risk deciles where observed and expected estimates differed. One limitation of the data was that we were unable to distinguish early (≤72 hrs) vs. later onset of the SIVH. Although the majority of lesions occur early, the small proportion of late onset lesions may compromise model fit, since other clinical or patient factors may mediate their occurrence. Also, additional clinical values, such as maternal magnesium therapy, admission temperature and blood pressure, may further improve the predictive accuracy of the model. Future iterations of the model will incorporate this data.

Although we also wanted to determine the risk of developing gastrointestinal pathology secondary to indomethacin exposure, we did not have sufficient data available for calculating a hazard ratio – this analysis will be addressed in future work. However, it is reassuring to note that in the Cochrane review, there was no increase in the risk of necrotizing enterocolitis or renal complications10.

In conclusion we have developed a predictive model for calculating the risk for developing SIVH for individual VLBW infants using seven clinical variables readily available at birth, using an open access online calculator. This model may be useful for parental counseling and as an aid in clinical decision-making regarding IVH prophylaxis.

Methods

Study design and participants

This was a retrospective cohort analysis of preterm infants ≤34 weeks gestational age at birth admitted to four participating Level III Massachusetts Neonatal Intensive Care Units, belonging to the Neonatal Quality Improvement Collaborative of Massachusetts (NeoQIC). The data was abstracted from January 2000 - December 2010 for three of the participating NICUs and from 2006 to 2010 for one NICU. The study utilized de-identified patient level data as reported to the Vermont Oxford Network (VON) Database.

IRB approval was obtained by each participating NICU from their respective IRB prior to starting the study. Data sharing agreements were also approved by the participating institutions. Parental consent was not deemed necessary since the analysis used a retrospective de-identified cohort.

Study Population

All infants born at ≤34 weeks gestational age admitted to the four NICUs with available data during the study period were eligible for the study. Infants had their first head ultrasound (HUS) usually within or close to 72 hr of life and then serially per each unit's protocol. Regarding early deaths, in general, infants who received an ultrasound and had the data on the clinical predictors were included in the model. Thus, any infant who died prior to receiving or otherwise did not receive an ultrasound was excluded from the study (i.e., they did not have a determination of the outcome). Infants who received an ultrasound and died at a later date were included in the model. Additionally, all infants < 23 weeks gestational age, or who had congenital or chromosomal anomalies were excluded from model creation and validation, as were patients with any postnatal indomethacin exposure. However, data from patients with indomethacin exposure were subsequently used to deduce the impact of indomethacin exposure on the observed versus the predicted SIVH incidence.

The primary goal of the study was to create and validate a predictive model to assess an individual infant's risk for IVH based on clinical variables available within the first six hours of life. The second goal was to compare the observed versus predicted risk between the any indomethacin exposed group versus the non-exposed group.

Data Collection

The four participating NICUs currently report de-identified patient level data to the VON database. Each site provided the requisite fields, as available, for the study period. The VON database clearly defines the clinical variables reported to the database13. Data collection included the following variables: completed gestational age at the time of birth as documented by early fetal US and maternal LMP, birth weight, gender, maternal antenatal steroid administration, mode of delivery, location of delivery (inborn versus outborn), Apgar score at 5 minutes, admission temperature (available for only a subset of infants), any indomethacin therapy and grades of IVH. IVH was classified as Grades I to IV as defined by the modified Papile's classification for IVH, determined by cranial ultrasound. Two out of the four centers used indomethacin for IVH prophylaxis and all centers used either indomethacin or ibuprofen for treatment of patent ductus arteriosus (PDA). The above mentioned variables were chosen as they are readily available by 6 hours of life when indomethacin is given for prophylaxis. The biggest confounder would be blood pressure data at this time but this is not available through the VON database.

Statistical analysis

Preliminary analysis

The distributions of all variables were examined. Continuous data are reported with means and standard deviations. Categorical data are reported with frequency distributions. For univariable analyses of predictor variables and severe IVH, independent sample t-tests and Pearson's chi-square were used, as appropriate.

Developing and Validating the Predictive Models: For model development and validation, the sample was restricted to those patients who did not receive indomethacin (n = 2917). Multivariable logistic models were developed using seven clinical predictors: gestational age, birth weight, gender, any maternal antenatal steroid use, mode of delivery, location of delivery (inborn vs. outborn) and Apgar score at five minutes. Since admission temperature was available for only a subset of infants it was not used in the analysis. For ordinal and continuous predictors, we first examined whether departure from linearity in the logit was present. This was accomplished using statistical and graphical methods described by Hosmer and Lemeshow14 and Vittinghof15. Based on these evaluations, these predictors satisfied linearity assumptions and, as such, were analyzed in their original scales of measurement. In developing the model, we also assessed whether clinically relevant interactions were present among the seven variables. No significant interactions among predictors were found; thus, none were included in the predictive model.

Training models were developed using data from three centers, while validation was conducted on the excluded center. This procedure was run for each unique combination of sites for a total of four training/validation models. The approach not only allowed us to conduct an external validation of the predictive model, but also enabled us to examine the stability of the validation across the four models.

Model discrimination (for training, validation and the final predictive model) was determined by examining the area under the curve (AUC) of the receiver operating characteristic curve (ROC curve). Associated 95% confidence intervals were computed using standard errors as defined by DeLong, DeLong and Clarke-Pearson16,17,18,19.

For the final model, the Hosmer and Lemeshow goodness-of-fit test (H-L test) was used to assess how well the model summarized the data (i.e., model calibration). For the H-L test, the null hypothesis states that the model adequately summarizes the data. As such, a non-significant test suggests adequate model fit.