Main

Two large trials have evaluated whether phototherapy for newborns with elevated bilirubin levels reduces the risk of adverse neurodevelopmental outcomes. There was no improvement in neurodevelopmental outcomes 6 years after phototherapy (1) in a trial conducted by Brown and colleagues (2). The National Institutes of Health and Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) Neonatal Research Network trial evaluated the effects of aggressive vs. conservative phototherapy in extremely-low-birth-weight (≤1,000 g, ELBW) infants (3). Death rate did not differ with the two treatments (relative risk (RR) = 1.05, 95% confidence interval (CI) = 0.90, 1.22); however, neurodevelopmental impairment was significantly reduced among survivors with aggressive phototherapy (RR = 0.86, 95% CI = 0.74, 0.99) including better performance on the Bayley Mental Developmental Index and less hearing loss and athetosis.

Bilirubin neurotoxicity prolongs neuronal transmission in the auditory brainstem as measured by brainstem auditory evoked responses (BAERs) (4). We recorded BAERs in infants with birth weights of 501–750 g or 751–1,000 g who were enrolled in two of the centers participating in the NICHD Neonatal Research Network trial.

The primary hypothesis tested was as follows:

  1. 1

    Aggressive phototherapy reduces neuronal transmission times in ELBW infants at 35 weeks postmenstrual age (PMA) as measured by BAER wave V latency.

Secondary hypotheses tested were as follows:

  1. 1

    Aggressive phototherapy reduces other measures of neuronal transmission times in ELBW infants at 35 weeks PMA as measured by the latencies of BAER waves I and III and BAER interwave intervals (V–I, III–I, V–IV, IV–III, II–I).

  2. 2

    Higher total serum bilirubin (TSB) levels in the first 2 weeks after birth in ELBW infants are associated with prolonged neuronal transmission times at 35 weeks PMA as measured by BAER latencies.

Tests of the primary and secondary hypotheses included whether treatment effects differ by center and birth-weight stratum.

Results

Study Sample

Figure 1 presents the study cohort. Six of the 117 (5.1%) eligible study infants who were alive at 35 weeks PMA were not successfully tested; 4/36 (11.1%) were from center A and 2/81 (2.5%) were from center B. Five infants were in the aggressive phototherapy treatment arm (three at center A and two at center B). Four infants (two from each site), including an infant in the conservative phototherapy treatment arm, were discharged or transferred prior to their test date without informing the personnel doing the BAER testing. An administrative error and unsuccessful testing due to poor recording conditions after two attempts accounted for the other two missed tests.

Figure 1
figure 1

The number of study infants who were eligible, consented, randomized, died before the target test age (35 weeks postmenstrual age), and successfully tested.

Of the 111 eligible infants successfully tested according to protocol, the median age at testing was 35 weeks and 1 day PMA (interquartile range = 3 days); 106 (95.5%) infants were tested within 1 week of 35 weeks. Three infants were tested 14, 12, and 10 days prior to 35 weeks because they were going to be transferred to another hospital. The other two infants were retested at 36 weeks 1 day because of poor recording conditions at 35 weeks.

Demographic and baseline characteristics of the successfully tested study infants are listed in Tables 1 and 2 . There were few imbalances in the baseline variables across the two treatment conditions within the two centers. Gender was somewhat of an exception. Males were more prevalent in the conservative (11/19 = 58%) compared to the aggressive (4/13 = 31%) phototherapy treatment conditions at center A. Males were more equally distributed in the conservative (18/41 = 44%) and aggressive (19/38 = 50%) conditions at center B.

Table 1 Demographic and baseline characteristics of the study infants from center A
Table 2 Demographic and baseline characteristics of the study infants from center B

There were center differences in demographic and baseline characteristics ( Tables 1 and 2 ). Center A infants were mostly black (88%), with a small percentage (12%) of white infants. In contrast, 44% of center B infants were black, 30% were white, 24% were Hispanic, and one was Asian. Center A infants were significantly older at birth (27.1 ± 0.4 weeks vs. 26.2 ± 0.2 weeks), less likely to be outborn (6% vs. 25%), more likely to be ventilated (81% vs. 62%), and less likely to be on continuous positive airway pressure at 24 h (0% vs. 34%).

Effectiveness of the Intervention

Table 3 presents bilirubin exposure and phototherapy treatment duration means and 95% CIs. As expected from the study design, infants in the conservative phototherapy condition had significantly higher average TSB levels for the first 14 days after initiating phototherapy (P < 0.001), higher peak TSB levels (P < 0.001), and shorter duration of phototherapy (P < 0.001) than infants in the aggressive phototherapy treatment arm. The 751–1,000 g birth-weight infants had significantly higher average TSB levels for the first 14 days after initiating phototherapy (P = 0.002) and higher peak TSB levels (P = 0.002) than the 501–750 g birth-weight infants (P < 0.001), although the latter result was moderated by treatment condition and center (i.e., there was a phototherapy by center by birth-weight stratum interaction, P < 0.001). Average TSB levels for the first 14 days after initiating phototherapy (P < 0.001) and peak TSB levels (P < 0.001) were significantly higher at center A than at center B. The duration of phototherapy in both treatment conditions was significantly longer at center A (P < 0.001).

Table 3 Bilirubin measurements as a function of phototherapy condition, center, and birth-weight stratum

Reliability of the BAER Scoring

The two scorers agreed that waves III and V from each ear were present for nearly the entire sample (95.5–96.4% of the 111 infants recorded). Those measurements were scored within 0.2 ms of each other 89.6%–94.4% of recordings. Most of the few times that one scorer scored wave III or V as absent, the other did as well (4/4 times for the left ear and 3/5 times for the right ear for wave III and 4/5 times for both ears for wave V).

Wave I was the only measurement scored more reliably at one site than the other (100% agreement that wave I was present for both ears at center A vs. 51% for the left and 48% for the right ear at center B). The noninverting electrode was placed higher on the head at center B than center A to avoid CPAP tubes and nasal cannulae negatively affecting the recording of wave I. As a consequence, wave I was less prominent and often confused with wave II. For nearly 40% of the recordings, either both scorers or at least one scored II and also IV as absent.

Treatment Effects on the BAER

Table 4 presents mean (95% CI) differences in BAER latencies calculated from regression models for infants in the aggressive and conservative treatment conditions by center and birth-weight stratum. The results presented in Table 4 are averaged over the two ears.

Table 4 Mean (95% CI) differences in BAER latencies (ms) between the aggressive and conservative phototherapy treatment conditions

The 751–1,000 g birth-weight center A infants in the aggressive compared to the conservative phototherapy treatment condition had significantly shorter wave V latencies and also significantly shorter wave I and III latencies. Similar nonsignificant trends were recorded for the 501–750 g birth-weight center A infants. The small number of infants in the 501–750 g birth-weight stratum at center A limits the interpretation of those results. At center B, there were no significant differences in BAER latencies between treatment conditions.

There was little support for different phototherapy treatment effects in 501–750 g and 751–1,000 g infants. None of the three-way or two-way phototherapy treatment by birth-weight stratum interactions were significant. In contrast, there was some statistical support for different phototherapy treatment effects in the two centers (i.e., treatment by center interactions). Center differences in the phototherapy treatment effects were robust for wave III latencies (P = 0.032) but not for wave V (P = 0.239) and wave I (P = 0.174) latencies. Given the small sample size, nonsignificant interactions are not surprising.

Aggressive phototherapy treatment did not reduce brainstem transmission times as measured by the V–I, V–III, and III–I interwave latencies (see Table 4 ). These results are consistent with a peripheral (i.e., the distal auditory nerve and/or the inner, middle, or outer ears) and not a brainstem site of lesion. Axonal vs. synaptic transmission was not significantly different in aggressive and conservative phototherapy-treated infants, with one exception. There was a significant phototherapy treatment by birth-weight stratum interaction (P = 0.036) for the IV–III interwave latencies, indicating that aggressive phototherapy was associated with faster axonal transmission in the 501–750 g birth-weight infants. The last row of Table 4 presents the II–I interwave latencies for the 94 infants whose latencies could be scored in at least one ear. The II–I latency interval also measures axonal transmission (5). There was no support for phototherapy treatment differences for wave II–I interwave latencies, suggesting that the phototherapy treatment effect for the 501–750 g birth-weight infants for the IV–III interwave latencies may be a chance result.

Adjusting for the age at testing and each of the baseline and demographic variables listed in Table 3 did not significantly alter the phototherapy treatment effects reported in Table 4 . In addition, adjusting for possible mediators having their onset after the initiation of phototherapy but prior to the 35-week assessment (intraventricular hemorrhage grades 3 and 4, necrotizing enterocolitis, retinopathy of prematurity, patent ductus arteriosus, bronchopulmonary dysplasia, and late-onset sepsis) did not alter the recorded phototherapy treatment effects.

18- to 22-Month Outcomes

To further evaluate center differences, post hoc analyses were conducted comparing 18- to 22-month outcomes. Fewer aggressively treated center A infants (6/13 or 46%) died or had neurodevelopmental impairment at 18–22 months than conservatively treated center A infants (11/18 or 61%). In contrast, more aggressively treated center B infants (22/34 or 65%) died or had neurodevelopmental impairment at 18–22 months than conservatively treated center B infants (18/36 or 50%). These center differences were also observed for deaths and neurodevelopmental impairment analyzed separately.

Four infants had no scorable BAER waves despite valid recording conditions. All four of these infants were from center B: one in the 751–1,000 g stratum receiving conservative phototherapy and the other three in the 501–750 g stratum, one receiving conservative phototherapy and the other two receiving aggressive phototherapy. Three of these infants in this study were the only infants recorded as being deaf at the 18- to 22-month assessment. The other infant with no BAER died before the 18- to 22-month assessment.

Bilirubin Effects

Each of the four bilirubin variables was positively correlated with the BAER V, III, and I latencies but not the interwave intervals for the 751–1,000 g center A infants in the aggressive treatment arm. The regression coefficients were of similar magnitude for all three waves for these infants (V, III, and I latencies are highly correlated) and 8/12 (three waves by four bilirubin variables) were statistically significant at the 0.05 level. For wave V, there was a 0.05-ms (95% CI = −0.05, 0.16) latency increase for every mg/dl increase in 14-day TSB average, a 0.06-ms (95% CI = −0.04, 0.16) latency increase for every mg/dl increase in peak TSB over the 14-day treatment interval, a 0.09-ms (95% CI = 0.01, 0.18) latency increase for every 24-h increase in the age at peak TSB, and a 0.15-ms (95% CI = 0.02, 0.27) latency increase for every 24-h increase in the duration of phototherapy. For the 751–1,000 g center A infants, 11/12 correlations were positive in the conservative treatment arm, although only one was significant at the 0.05 level. There were only three significant positive correlations for infants in the 72 other combinations of center, birth-weight stratum, and phototherapy treatment.

Discussion

Weeks after aggressive phototherapy that only modestly reduced bilirubin exposures (see Table 3 ); BAER wave V, III, and I latencies were significantly shorter for 751–1,000 g birth-weight center A newborns (see Table 4 ). Similar nonsignificant trends were recorded for the 501–750 g birth-weight infants at center A. Neuronal transmission in the brainstem and axonal and synaptic transmission did not differ in the aggressive and conservative treatment conditions. We do not know whether these effects persist past the newborn period, although aggressive phototherapy was associated with a significant reduction in hearing impairment in the parent trial (type of loss did not differ in the aggressive and conservatively treated infants) (3).

The lesion site responsible for altered BAER latencies in center A could be anywhere in the auditory pathway peripheral to the distal auditory nerve. Prior to testing, the ear canal was visually examined to ensure that obstruction of the outer ear canal did not affect the BAER recordings. Middle ear and cochlear testing were not conducted and cannot be ruled out as lesion sites. These results do not support the hypothesis that bilirubin at these levels of exposure alters neural transmission in the brainstem of ELBW newborns, contrasting with results from laboratory studies on animal models and observational studies largely on late-preterm and term newborns with very different bilirubin exposures (4).

Discrepancies between the results we report and studies in the literature may be explained by differences in the newborns sampled, bilirubin exposures, time of BAER assessment, medical care, and study methods. This is the only study randomly allocating ELBW newborns to treatment conditions resulting in different bilirubin exposures. Random allocation minimizes imbalances in measured and unmeasured confounders, increasing the likelihood of evaluating the true relationship between bilirubin exposure and BAER. We conducted adjusted analyses to further confirm the likelihood of exchangeable treatment groups. Our study was powered to detect a meaningful effect size, enrolled all eligible newborns, minimized missing data, and standardized and blinded measurements and analyses to reduce bias. Secondary studies of clinical trials provide randomly allocated group comparisons to research otherwise largely limited to observational studies.

Animal models of hyperbilirubinemia have been used to evaluate neurotoxic effects in the auditory brainstem (4,6,7). The animal models developed may not be appropriate for the phototherapy-induced effects reported in this study because neural transmission in the brainstem was not altered.

No significant phototherapy treatment effects were recorded for center B infants. Differences in newborn baseline characteristics, neurodevelopmental outcomes, phototherapy, and BAER methodology may explain the center differences in the treatment effects we recorded. Adjusting for measured demographic and clinical differences did not account for center differences, although unmeasured confounders may explain center differences. Aggressively treated infants at center A but not center B were more likely to survive without neurodevelopmental impairment. This result is consistent with better outcomes for infants from center A but at odds with no effect on neural transmission in the brainstem.

The duration of phototherapy was longer and the 14-day average and peak TSB levels higher at center A than center B. The 14-day average and peak TSB levels in the center B conservative treatment conditions were comparable to the center A aggressive treatment condition levels. Bilirubin exposures in the center B conservative treatment condition may not have been high enough or long enough to prolong BAERs at 35 weeks PMA. In addition, the fiber-optic blankets used in center A and the phototherapy lights used in center B may change bilirubin into different photoisomers (8,9) with different effects on the auditory pathway.

The vagaries of sampling may also explain center differences. We cannot determine from this study whether rejecting the null hypothesis of no association between aggressive phototherapy and BAER latencies based on center A results commits a type I error or whether failing to reject the null hypothesis based on center B results commits a type II error.

We report that reducing bilirubin exposures in some ELBW infants by aggressive use of phototherapy reduced BAER latencies without altering brainstem transmission. We also report that BAERs of ELBW newborns with slightly different exposures and clinical characteristics were not differentially affected by more aggressive phototherapy. Given the complexity of the biology, it should not be surprising that bilirubin’s effects are variable depending on bilirubin exposures, the newborns evaluated and their clinical conditions, and study methodologies. A more complete understanding of the effects of bilirubin on newborns requires rigorous research methodologies and a better understanding of bilirubin metabolism at different stages of development and in different clinical conditions.

Methods

Trial Design

This study was a two-center, stratified (center and 501–750 g/751–1,000 g birth-weight strata), parallel-group randomized trial. Eligible infants were randomly allocated (1:1 ratio) to the conservative or aggressive phototherapy treatment conditions within center and birth-weight stratum. See Morris et al. for details concerning study design and methodology (3).

Participants

All 501–1,000 g birth-weight infants participating in the NICHD Neonatal Research Network Phototherapy trial from two network centers who were born within defined enrollment periods were eligible for this substudy. Enrollment periods for center A (infants who were 35 weeks PMA after 17 January 2003 but born no later than 1 September 2004) and center B (infants who were 35 weeks PMA after 11 August 2003 but born no later than 3 February 2005) depended on demonstrated adherence to the substudy protocol. Consent for this substudy was nested within the parent-study consent, approved by each center’s internal review board, and obtained from each study newborn’s parent or custodian.

Intervention

Protocol-stipulated phototherapy use was for the first 14 postnatal days. Phototherapy was administered almost entirely by fiber-optic blankets at center A and entirely by phototherapy lights at center B. Bilirubin levels were measured by TSB averaged over the 14 measurement days, the peak TSB during those 14 days, and the age at that peak TSB. The total duration of phototherapy was also measured.

BAER Recording Protocol

Study infants were tested during their 35th week PMA or within 3 days of discharge if discharged prior to 35 weeks. If testing could not be completed according to protocol because the baby was not in a testable state, the technical aspects of the recordings were unacceptable, or the medical staff refused to allow testing due to the unstable medical condition of the infant, the infant was tested as soon as possible to obtain an acceptable recording.

Each ear was assessed separately by presenting 80-dB normal hearing level (nHL), 100-µs clicks at a rate of 20.1 clicks per second. The BAER responses were differential recordings from scalp electrodes pasted on the forehead at midline (noninverting electrode), the ipsilateral mastoid (inverting electrode), and the contralateral mastoid (the ground). The responses were amplified 100,000 times, filtered from 100 Hz to 3 kHz, and averaged for 2,000 repetitions. Four averages were recorded from each ear, the first three to rarefaction clicks and the last to a condensation click. By adding rarefaction and condensation responses, polarity-sensitive cochlear responses cancel while polarity-insensitive neural responses add, providing a way to distinguish cochlear from neural responses (critical for the diagnosis of auditory neuropathy). No cases of auditory neuropathy were identified. The outcome measurements were made from the two best responses to the rarefaction clicks.

The electrode impedances were required to be <10 kΩ and were measured before and after testing each ear. The noise levels in the room, the infant’s testability (state, movement, etc.), and technical adequacy (electrical artifact) were assessed on 5-point Likert scales by the recording audiologists.

Scoring the BAER Waveforms

BAERs record sound-induced neural activity from successive generators along the auditory pathway. The generators for waves I, II, III, IV, and V are the distal and proximal auditory nerves, the cochlear nucleus, the superior olive, and the lateral lemniscus, respectively (10). The primary outcome was wave V latency at 35 weeks PMA. Because the primary site of bilirubin-induced toxicity was hypothesized to be the auditory brainstem, the V–I interwave interval might have been selected as the primary outcome. It was not selected because, prior to the study, we anticipated the difficulty in recording wave I reliably from sick infants in neonatal intensive care units.

Secondary outcomes included wave I and III latencies. In addition, secondary analyses were conducted on infants with reliably scored waves I, III, and V in both ears (n = 81 with complete data of the 111 infants tested) to evaluate site-of-lesion effects. The V–I, V–III, and III–I interwave latency intervals measure transmission times between the respective generators of those waves. A second set of secondary analyses were conducted on infants with reliably scored waves III, IV, and V in both ears (n = 83 with complete data) to distinguish axonal transmission from transmission involving a synapse. Ponton and Eggermont hypothesized that the V–III and III–I interwave latency intervals measure both axonal and synaptic transmission, whereas the IV–III and II–I intervals measure only axonal and not synaptic transmission (5). The II–I latency interval was not included in our secondary analyses a priori because we anticipated a small number of newborns with reliably scored II–I intervals. We recorded II–I intervals reliably in both ears for 55 infants, although they could be scored in at least one ear for 94 infants.

Data files, de-identified with the center number and network ID for each infant, were electronically transferred to the NICHD Neonatal Network Data Center (RTI International) in North Carolina. At the end of the study, separate data files for independent scoring were provided to the two scorers (M.S.O, R.E.L) with more than 20 years experience, who scored all the BAER study waveforms. “Reliable” measurements were identified as within 0.20 ms of each other. The means of the reliable measurements from the two scorers were used as the data analyzed for this study. If a measurement was not reliable, the two scorers reviewed the discrepancy and generated a consensus measurement used in the analyses.

Sample Size

This study was powered a priori to detect an effect size (0.25-ms reduction in wave V latency with SD = 0.50 ms) comparable to a decrease in wave V latency in preterm newborns over a 2-week interval (11). Detecting that effect size with a type I error of 0.05 and a type II error of 0.20 required 64 infants in each treatment condition. Although 174 infants were enrolled, only 111 infants (51 in the aggressive and 60 in the conservative treatment conditions) were tested according to protocol with interpretable waveforms. Most of the “missed” infants (57/63 = 90.5%) died before 35 weeks. Given the type I (0.05) and II (0.20) error rates assumed and 111 infants sampled, we could expect to reliably detect a 0.27-ms rather than a 0.25-ms reduction in wave V latency. We had 74% power to detect the 0.25-ms reduction.

Randomization

Newborns were stratified by center and birth weight (501–750 g and 751–1,000 g) and then randomized to one of the two treatment conditions by calling a centralized computer system at RTI International.

Blinding

The study personnel recording the BAERs did not know the treatment assignments. The scoring of the BAER waveforms was also done blinded to treatment assignment.

Statistical Methods

The analyses were updated prior to analyzing the data to be consistent with the analyses of the NICHD Neonatal Research Network parent trial (3). That trial was designed to evaluate two different phototherapy regimens. In addition, the parent trial evaluated whether birth weight and also center (unpublished analyses) moderated treatment effects. Analyses were intention to treat. P values < 0.05 were considered statistically significant for all analyses, with no adjustment for multiple comparisons. Analyses were conducted using Stata (version 10.1, College Station, TX).

Linear regression models including phototherapy treatment condition, the two stratification variables (center and birth-weight category), all two-way interactions, and the three-way interaction were calculated to evaluate treatment differences in BAER outcomes. A clustered sandwich estimator was used to estimate standard errors for the BAER outcomes because of the correlation between measurements from the two ears of each newborn (12). The referents for the independent variables in these models were the conservative phototherapy treatment arm, center A, and the 751–1,000 g birth-weight stratum. The interaction terms indicated whether the effect of aggressive phototherapy was moderated by center and birth-weight stratum as evaluated in the NICHD Neonatal Research Network trial and specified prior to conducting the analyses.

Additional linear regression models were calculated to adjust for variables that might confound phototherapy treatment effects because of an imbalance in those variables despite random allocation. We evaluated the baseline and demographic variables evaluated in the NICHD Neonatal Research Network trial as potential confounders by adding each baseline and demographic variable to the linear regression models described above. In addition, a forward stepwise procedure was calculated to evaluate multiple confounders simultaneously (P = 0.20 to enter).

In four infants, no BAERs could be identified, consistent with profound hearing losses. Those responses were censored and Tobit regressions were calculated to include the censored responses in the analyses (13). The results of those analyses were similar to the results of the linear regressions described above and are not reported.

The effects of bilirubin levels (average levels for the first 14 days after initiation of therapy, the peak bilirubin levels during those first 14 days, and the age at the peak measurement) and the duration of phototherapy on the BAER were evaluated by linear regression models including main effects for each bilirubin variable, the two stratification variables (center and birth-weight category), all two-way interactions, and the three-way interaction. A clustered sandwich estimator was used to estimate standard errors accounting for the correlation between measurements from the two ears. In addition, simple regressions (BAER variable regressed on bilirubin variable) were calculated for infants from the eight combinations of center, birth-weight category, and phototherapy condition.

Statement of Financial Support

The National Institutes of Health and the Eunice Kennedy Shriver National Institute of Child Health and Human Development provided grant support for the Neonatal Research Network’s Phototherapy Study (U10 HD21373, U10 HD21385, U10 HD27853, U10 HD27880, U10 HD27904, U10 HD36790, U10 HD40498, U10 HD40521, KL2 RR24149, M01 RR44, M01 RR70, M01 RR7122, M01 RR8084, UL1 RR24148).