Introduction

Underdeveloped cerebral circulatory systems of preterm infants have limited cerebral autoregulation that increase their risk of severe complications such as intraventricular hemorrhage (IVH) or death. Current standard monitoring in the delivery room and NICU include heart rate (HR) and peripheral arterial oxygen saturation (SpO2), but neither provide a direct measure of cerebral oxygenation. Non-invasive monitoring of cerebral tissue oxygen saturation (StO2) using near-infrared spectroscopy (NIRS) during resuscitation and neonatal transition may present a way to identify patients at the highest risk of impairment in the early days of life.

A previous single center European study has identified an association between cerebral oxygenation and developmental outcomes but this result has not been replicated [1]. A few small studies have explored NIRS value ranges for the prediction of neurodevelopmental outcomes in the context of neonatal morbidities like hypoxic ischemic encephalopathy (HIE), late-onset sepsis, and congenital heart disease. However, these studies were unable to show correlation between NIRS and developmental outcomes [2,3,4]. Additionally, 83 preterm infants in the SafeBoosC II trial followed to 2 years of age showed no developmental benefit on the Bayley 2 or 3 examination when cerebral oximetry was included in the treatment of extremely preterm infants [5].

Our prospective observational cohort study of neuromonitoring preterm infants during the first 72 h of life (The Neu-Prem Trial) demonstrated that NIRS monitoring during resuscitation held potential for predicting IVH or death [6]. The study found that differences in StO2 within the first 24 h of life were predictive of the development of any grade IVH and death. Given the relationship between cerebral oxygenation and IVH or death, we hypothesized that cerebral oxygenation in the first 72 h of life could predict neurodevelopmental impairment (NDI) at 2 years of age. A better understanding of the relationship between neonatal cerebral oxygenation and neurodevelopmental outcomes may provide guidelines for improved monitoring of preterm newborns, potentially improving neurodevelopmental outcomes. Different devices have varied thresholds for cerebral hypoxia. The suggested threshold for the Fore-Site Elite monitor used in our study calibrated using a phantom model is 67% when using a neonatal probe [7]. We hypothesized that there were differences in duration of cerebral hypoxia in newborns that developed NDI or death when compared to those that did not show any signs of impairment at 2 years of age.

Methods

The original prospective cohort trial was conducted from October 2015 through December 2016 and has been previously published [6]. Informed consent was obtained for the study and subsequent neurodevelopmental follow-up. As part of the original trial a NIRS sensor and electroencephalography leads were placed on 127 newborns <32 weeks of gestational age at birth. Data recorded in the delivery room included ten minutes of continuous NIRS and aEEG along with heart rate, peripheral arterial oxygen saturation (SpO2), fraction of inspired oxygen, and mean airway pressure measurements. Data were recorded in the neonatal intensive care unit for 72 h and included, NIRS, aEEG, and vital signs (heart rate, blood pressure, and SpO2). Within the first 12 h of life and again at 72 h of life an ultrasound scan of the head was performed.

Survivors of the original cohort were followed out to 22–26 months corrected age in our high-risk infant follow up clinic to assess for our primary outcome of NDI or death. Due to this being a convenience sample we were unable to calculate a sample size. Criteria for inclusion in this analysis matched those used for inclusion in the original trial- included infants born <32 weeks GA and did not have congenital anomalies or moderate-to-severe head bruising at birth.

The 22- to 26-month corrected age follow-up visit included a health history, physical examination, and cognitive, language, and motor assessment using the Bayley Scales of Infant and Toddler Development, third edition (Bayley-III), a standardized neurologic examination, and measurements (weight, height, and head circumference). The Gross Motor Function Classification System (GMFCS) was used to evaluate functional motor outcomes. Neurodevelopmental assessment was carried out by examiners who were trained in administration of the Bayley-III, had excellent inter-rater reliability (0.90), and masked to the NIRS values. The Bayley-III includes cognitive, language (receptive and expressive subscales), and motor (fine and gross motor subscales) composite scores with a mean and SD of 100 ± 15. The GMFCS, as modified by the National Institute of Child Health and Human Development Neonatal Research Network [8], is a validated system that can be used for children between 22 and 26 months CA to describe the severity of motor dysfunction. GMFCS levels range from 0 to 5 with a level of 1 indicating mild impairment and a level of 5 indicating the most severe motor impairment. Parental interviews conducted at 2 years of age were used to obtained data regarding visual (e.g., eye surgery, strabismus, myopia) or hearing impairments, pulmonary morbidity including the need for oral or inhaled steroids, oxygen, and bronchodilators, and rehospitalizations.

Moderate to severe NDI defined as having ≥1 of the following: a Cognitive composite score of <70, GMFCS of ≥2, blindness (vision of <20/200), or hearing impairment interfering with the ability to communicate with amplification. Mild NDI was defined as having a Cognitive score of <85, GMFCS score of 1 or more, at least unilateral blindness (vision of 20/200 in only 1 eye), hearing impairment, or need for hearing aids.

Statistical analyses

The sample consisted of infants with StO2 data for the first 72 h and two year follow up data available as well as those who died during hospitalization. Data were averaged hourly for each subject and generalized estimating equations were then used to control for repeated measures. We used an hourly StO2 average of 67 as the threshold cutoff. We performed a secondary analysis of delivery room StO2 (first 10 min of life) and each of the individual outcomes of cognitive, language and motor impairment or death. All data were analyzed using the hypoxic threshold of 67 identified during the original trial [6], and calibration data from Kleiser et al. [7].

Analysis was conducted using SPSS (IBM Corp. Released 2016. SPSS Statistics for Windows, Version 24, Armonk, New York, USA) and Epi-Info™ 7.0 (CDC, Atlanta, GA, USA). Before analysis, data was first verified for accurate data entry, formats, coding, and missing observations. The largest and smallest values for each variable were reviewed for accuracy and plausibility. Each variable was examined for variability and frequency distribution, skewness, and kurtosis. Subsequent data analyses were conducted using descriptive, univariable, and adjusted analyses. Repeated measures analyses were conducted related to specific study aims. All significance tests were 2-sided with critical alpha level of 0.05. Comparison of demographic and subject characteristics were made using Chi-Square, Fisher’s exact test, or logistic regression for dichotomous and categorical variables. Student’s t tests for comparison of means were used for continuous variables, and Mann–Whitney U or Wilcoxon rank-sum test was used as a nonparametric test for ordered categorical variables or for continuous variables failing to meet normal distribution requirements.

Univariable and adjusted analyses between the dependent variable and each independent variable were examined to explore the crude associations between variables using Chi-Square, Fisher’s exact test, ANOVA, Mann–Whitney U or Wilcoxon rank-sum test, linear regression, or logistic regression. Repeated measures analyses were performed via generalized estimating equation modeling (GEE).

Results

From the original cohort of 127 infants, 79 children were able to return for their neurodevelopmental follow-up exam at 2 years corrected age between June 2017 and February 2019 (range 20–26 months corrected). There were 4 infants who died prior to follow-up examination and were included in the composite outcome of any disability or death at 2-year follow up. Of the total sample of 83 infants, 33 were born between 23 and 27 weeks GA and 50 were born between 28 and 31 weeks gestational age at birth (see Fig. 1).

Fig. 1: Consort diagram.
figure 1

Progression of Subjects through screening, randomization and followup.

For the primary outcome, using the StO2 threshold of 67%, infants with NDI or death had increased duration of cerebral hypoxia (4 vs 2.3%, p < 0.001). For the individual components of the neurodevelopmental exam, there was an increased duration of cerebral hypoxia in subjects with language impairment (3.4 vs 2.4%, p = 0.042), death 4.3 vs 3.7% p = 0.049, but not cognitive or motor impairment, or death. The overall cohort neonatal demographics are shown in Table 1.

Table 1 Overall cohort neonatal demographics.

A subgroup analysis of infants 23–27 weeks GA with NDI or death demonstrated a greater duration and difference of cerebral hypoxia (7.6 vs 3.2 (p < 0.001, OR 2.470, 95% CI 1.644, 3.712) compared to those without NDI or death. There was no statistically significant difference between the impaired and non-impaired groups in neonatal or maternal characteristics (Table 2).

Table 2 Maternal and neonatal characteristics.

There were no differences in other vital signs in the first 72 h of life (Fig. 2) in the 23–27 week cohort. Individual generalized estimating equations to adjust for repeated measures were modeled in this subgroup for the physiologic parameters StO2 < 67, heart rate, Sp02, IVH and mean arterial pressure (Table 3) over the first 72 h of life. StO2 < 67 was the only predictor for death or neurodevelopmental impairment (OR 2.75, 95% CI 1.006, 7.5132, p = 0.049).

Fig. 2: Physiological parameters presented over time in the 23–27 week cohort.
figure 2

Cerebral oximetry (StO2), mean blood pressure (BP), peripheral oxygen saturation (SpO2) and heart rate (beats per minute) over 72 h (Panel A) and from 4 to 24 h of age (Panel B).

Table 3 Individual generalized estimating equations to predict any impairment or death, 23–27 weeks GA, 72 hours, adjusting for any IVH.

A secondary analysis of delivery room StO2 in the first 10 min of life for each of the individual outcomes and/or the combined outcomes of cognitive, language and motor impairment or death did not demonstrate any association (data not shown).

Discussion

Several cohort studies in Europe that used the INVOS monitor have found an association between low cerebral oxygenation in the first few hours/days of birth and poor long-term outcomes. Some have used a pediatric sensor or an adult sensor with different cutoff values (65% for the neonatal sensors and 55% for the adult sensor). Three cohort studies to date have demonstrated impairment with abnormal cerebral oxygenation. Verhagen et al. followed 67 infants (mean gestational age 30 weeks) with cerebral oximetry daily (days 1, 2, 3, 4, 5, 8 and 15) and found poorer cognitive outcome with lowest (25–50) and highest quartile (75–100) StO2 [9]. In addition, the lowest quartile (25–50) was also associated with poorer fine motor outcome. Alderliesten et al. compared a cohort of 66 preterm infants <32 weeks treated for hypotension to a set of matched controls that were not treated [10]. While the receipt of blood pressure treatment did not affect neurodevelopmental outcomes, 26 infants with a StO2 of <50% for >10% of the time had a lower neurodevelopmental scores at 18 months (median 99 vs 104, P = 0.02). Recently the same group published a larger cohort (N = 221 for 24–27 week CGA infants seen at 2 years) demonstrating that low StO2 was associated with an unfavorable cognitive outcome [1]. They recommended the use of a threshold of StO2 < 55% for future clinical studies when using the adult near-infrared sensors (StO2 < 65% for neonatal sensor). All three studies used the INVOS devices.

Different near-infrared spectroscopy monitors use different technologies and algorithms to obtain results, and consequently show readings which are not the same for an identical physiologic circumstance [11]. Investigators have attempted to use both in-vivo [12] and in-vitro [7, 13] methods to validate these results. Based on the use of a single device, the SafeBoosc trial used a normal range of clinical values to be 55–85% using a single device [14]. When calibrated to the INVOS device using a phantom, the suggested threshold for the Fore-Site Elite monitor used in this study is 67 when using a neonatal probe [7]. We chose to analyze this data by using a cutoff value of 67 as this best represented a clinical comparison useful to clinicians.

Limitations to this study include a small sample size due to a number of infants being lost to follow-up. The 2-year cognitive assessment was not the primary outcome of the original trial and therefore extensive retention measures were not in place at the onset of the trial. Only 64% of infants eligible for neurodevelopmental follow-up completed a cognitive assessment at 2 years of age. Other limitations include an inability to correct for multiplicity due our small sample size. Furthermore, selection bias may be present in the analyzed group because infants that did not complete cognitive assessment may have been assessed by parents as not in need of support services. There is a concern for missed cognitive assessment for infants of older gestations (>30 weeks) who are more likely to be assessed by parents as normally developing.

It remains unclear whether we can improve neurodevelopmental outcomes of preterm infants by intervening on a low StO2. Current efforts to intervene on low StO2 include the SafeBoosc III trial, which plans to determine whether intervening on a low NIRS can reduce risk of death or severe brain injury [15]. The study was designed to use saturation targets based on values from the INVOS device but will allow for the use of the FORESITE elite at the cutoff level of 67 for cerebral hypoxia. Ideally, the SafeBoosc III trial will have enough power to determine whether intervening on a low NIRS using each device can successfully reduce both the short-term morbidities (death and severe brain injury) as well as long-term neurodevelopmental impairment. Our follow-up of the original Neuprem cohort is the first to demonstrate an association between the increased time spent below a threshold of 67 using the FORESITE device and an abnormal neurodevelopmental outcome or death in preterm infants.