Introduction

Despite advancements in neonatal nutrition strategies, up to 50% of very low birth weight infants plot at less than 10th percentile for their corrected age on the weight curve at time of discharge from the neonatal intensive care unit (NICU)1,2,3. Optimizing nutrition to promote growth during NICU hospitalization is a key area of research and clinical interest as greater gains in somatic anthropometric measurements (weight, length, head circumference, body mass index (BMI), and lean mass) have all been linked to better neurodevelopmental outcomes in early childhood, including decreased incidence of cerebral palsy, improved cognitive, motor, and language scores, and higher intelligence quotients4,5,6,7,8,9,10. In addition, weight, BMI, lean mass, and increased early macronutrient intake in preterm infants are associated with improved brain global and regional volumes and cross-sectional diameters on magnetic resonance imaging (MRI) at term equivalent age11,12,13. Lower gestational age is also associated with smaller brain volumes in preterm infants and greater risk for neurodevelopmental impairment14. However, it is not known if somatic growth in the NICU is associated with brain volumes and neurodevelopmental outcomes at early school age.

The relationship between NICU somatic growth and cortical metrics also has not been examined. Cortical thickness is considered to be a measurement of the radial columns of pyramidal cells and interneurons that comprise the six layers of the cerebral cortex15. It increases as neuronal populations grow prenatally and postnatally, then starts to subside due to pruning of synapses and efficiency of brain networks during childhood16,17,18. However, the relationship of cortical thickness and outcomes in premature infants is not robust. One group did not find significant regional differences in cortical thickness between preterm and term children19. Some studies report increased cortical thickness in preterm children versus term children20,21,22,23,24, while others report decreased cortical thickness in preterm children20,21,25,26.

Furthermore, preterm infants with poor linear growth specifically during their NICU course have demonstrated worse language scores at age 25,6. While direct head growth can be noted by occipitofrontal circumference, linear growth is also thought to be a proxy for protein and lean mass accretion and, consequently, organ development and brain growth27,28,29. In our pilot study of extremely preterm (EPT, born less than 28 weeks gestation) children at 4–6 years, increased cortical thickness in right temporal areas and structural connectivity in a bitemporal pathway involving the cerebellum were positively correlated with language scores at the same age24,30. Here we evaluate the relationship of postnatal NICU growth, a potentially modifiable factor, with brain and language outcomes in this cohort.

Methods

Participants

This is an analysis of 42 EPT children who were previously recruited for a larger ongoing multimodal neuroimaging investigation examining brain-based markers of language development in EPT children and their term counterparts. Children were recruited from Cincinnati-area NICUs if they were born at < 28 weeks gestation in the years 2009–2012 and had no grade 3–4 intraventricular hemorrhage or periventricular leukomalacia on neonatal cranial ultrasound. Medical chart reviews were performed for EPT participants to verify clinical data (gestational age, cranial ultrasound results) obtained from parents. Children with cerebral palsy, seizures, migraines, history of learning or speech disability, or history of neurologic disorder were excluded. Full details of the parent study, including family surveys of demographic data, have been published elsewhere24,30,31,32,33. The study was approved by the Cincinnati Children’s Hospital Institutional Review Board and conforms to the United States Federal Policy for the Protection of Human Subjects. Written informed consent was obtained from parents. Methods were carried out in accordance with the Declaration of Helsinki.

Neuroimaging acquisition and analysis

Three-dimensional T1-weighted MRI was obtained for each subject on a 3.0 T Phillips Achieva scanner with a T1 turbo field echo sequence (TR/TE = 8.055/3.68 ms, 1.0 × 1.0 × 1.0 mm voxels, Matrix = 256 × 256 × 160). All scans were read by a board-certified radiologist with certificates of added qualifications in pediatric radiology and neuroradiology who was blinded to clinical history. Images were subjected to volumetric and cortical thickness analyses on a whole-brain level using FreeSurfer (Version 6.0.0 http://surfer.nmr.mgh.harvard.edu/ running in Mac OSX operating system version 10.12.6). As we and others have described previously, an automated pipeline in FreeSurfer was used for cortical reconstruction, segmentation, and non-linear surface-based registration24,34,35,36,37. Cortical thickness was calculated as the closest distance from the gray/white boundary to the gray/CSF boundary per vertex34. Regional volumes and cortical thickness measurements were normalized for estimated total intracranial volume (eTIV) by dividing each value by eTIV, then multiplying by 106 for scale. Mean cortical thickness and brain volumetrics were calculated by hemisphere and by region.

Infant data from NICU hospitalization

Weight, length, and head circumference (HC) at birth and at 36 weeks post-menstrual age (PMA) from NICU were obtained by medical chart review. For infants discharged prior to 36 weeks PMA, their discharge measurements were used. For each measurement, z-scores accounting for PMA and sex were calculated using the Fenton reference growth curve38. Changes in z-score were defined as the difference from birth to 36 weeks PMA for each anthropometric z-score. BMI was calculated per the Olsen intrauterine reference39.

Incidences of common comorbidities were also recorded. Severe bronchopulmonary dysplasia (BPD) was defined as needing greater than 2 L/min of nasal cannula support at 36 weeks PMA (grade 2 or higher)40; severe retinopathy of prematurity (ROP) was defined as ROP requiring treatment (either surgery or intravitreal injection of anti-vascular endothelial growth factor agents) or evidence of retinal detachment. Cases of necrotizing enterocolitis (NEC) included stage 2 and stage 3 NEC per modified Bell’s criteria, and late onset sepsis was defined as having a positive blood culture after the first 72 h of life.

Neuropsychological assessments

Children underwent assessment with the Peabody Picture Vocabulary Test 4th Edition (PPVT4)41; Expressive Vocabulary Test 2nd edition (EVT2)42; and Wechsler Nonverbal Scale of Ability (WNV)43, and age-standardized scores were reported. The EVT2 and PPVT4 were used to assess expressive and receptive vocabulary, respectively. They were chosen for their brevity and their high correlation with verbal intelligence, especially in children44,45. They are also widely used in studies of prematurity, including our prior work24,30,31,32,33,46,47,48. Infants had the Bayley Scales of Infant Development 3rd edition (BSID-III)49 performed at 22–26 months corrected age by a certified examiner as part of our infant follow-up clinic protocol. These results were obtained from medical chart review.

Statistical analysis

Pearson correlations were first performed to screen for linear relationships. Analyses were completed using each anthropometric z-score and BMI at birth or 36 weeks PMA or the change between those time points as the predictor. Multiple univariate regression analyses were performed using analysis of maximum likelihood. Dependent outcomes of interest were BSID-III scores at 2 years (cognitive, language, and motor composites), neuropsychological assessments at 4–6 years (PPVT4, EVT2, WNV), global brain volume (eTIV), global cortical thickness (left and right hemispheres), regional brain volumes (corpus callosum, left and right cerebellum, left and right temporal), and regional cortical thickness (left and right temporal). Both normalized and non-normalized brain metrics were evaluated. The relationships between brain metrics and PPVT4/EVT2/WNV assessments as well as between BSID-III and PPVT4/EVT2/WNV scores were also assessed. We did not account for multiple comparisons given the pilot nature of this study and to reduce the likelihood for type II error50.

For the multivariable linear regression models, five covariates of interest were selected a priori: gestational age (GA); sex; highest parental education reported at time of assessment; family income reported at time of assessment; and corrected age at time of assessment/MRI51. These measures were selected as potential confounding factors to adjust for in the multivariable model. Covariates were included in models if there was evidence of potential confounding, based on significance observed in univariate analyses (Supplementary Table 1) or assessing whether the parameter estimate changed by at least 10% with or without inclusion of each covariate (not shown). Furthermore, for each growth metric, multicollinearity was assessed with our covariates using a variance inflation factor threshold of 10, and no collinearity was detected.

A table depicting which covariates were included in each multivariable model is shown in Supplementary Table 2. All models adjusted for GA, sex, highest parental education, and family income (insurance status used as a proxy for models evaluating BSID-III). Regression models with neuroimaging markers included corrected at age MRI. Again, due to the pilot nature of our analysis, we did not include additional corrections for multiple comparisons at this step. Missing data was not imputed for each analysis. Statistical analyses were completed using SAS software version 9.4 (SAS Institute Inc., Cary, NC). Results were considered statistically significant for p < 0.05.

Results

Table 1 shows the patient demographics and NICU clinical outcomes of the 42 EPT children, of which 33 had high quality brain imaging sufficient for analysis. Table 2 presents the results of their neurodevelopmental testing and family demographics at the time of each assessment. 37 had completed BSID-III scores. One participant was unable to complete PPVT4 testing, and a different participant was unable to complete WNV testing.

Table 1 Participant characteristics and neonatal outcomes.
Table 2 Participant assessment results and socioeconomic characteristics at follow-up.

Growth and developmental outcomes

No NICU growth metric was associated with PPVT4, EVT2, or WNV language scores at 4–6 years in either correlation analysis or multivariable linear regression models.

With respect to BSID-III composite scores at 2 years, length z-score at 36 weeks PMA was strongly correlated with language (r = 0.42, p = 0.009). Change in length z-score was also positively associated with cognitive composite score (r = 0.36, p = 0.027), whereas change in BMI was negatively associated (r =  − 0.35, p = 0.035). Of these associations, only the relationship between length z-score at 36 weeks PMA and language remained significant after controlling for sex, GA, highest parental education, and insurance (Table 3). The multivariable model identified additional direct associations between birth length z-score with language composite score and between birth BMI with motor composite score (Table 3).

Table 3 Multivariable regression model comparing growth with BSID-III assessments.

Growth and neuroimaging markers

Pearson correlations comparing growth metrics with normalized global and regional brain MRI markers are shown in Table 4. The strongest correlation was HC z-score at 36 weeks PMA with eTIV (r = 0.45, p = 0.009). Several growth metrics at birth and 36 weeks PMA were negatively associated with normalized measurements, both global and regional, but this was not observed when utilizing non-normalized brain MRI measurements (Supplementary Table 3). Change in growth z-scores from birth to 36 weeks PMA was not associated with any global or regional brain metrics. In the multivariable model, after adjusting for all five covariates of interest, the relationship between HC z-score at 36 weeks PMA and eTIV remained significant, as did the negative associations with normalized left and right hemisphere cortical thickness (Table 5). Again, these negative parameter estimates were not noted when using non-normalized MRI measurements (Supplementary Table 4).

Table 4 Pearson correlations comparing growth with brain MRI metrics.
Table 5 Multivariable regression model comparing growth with brain MRI metrics.

Neuroimaging markers and neurodevelopmental assessments

Pearson correlations comparing normalized global and regional brain imaging metrics to 4–6 year language outcomes found no statistically significant relationships. After controlling for all five covariates of interest, cortical thickness in the right temporal lobe was noted to be positively associated with PPVT4 score (parameter estimate 23.25, 95% CI 3.77 to 42.74).

Neurodevelopmental assessments over time

In Pearson correlations of BSID-III language scores and 4–6 year language outcomes (Supplementary Table 5), PPVT4 and EVT2 were each positively related to language composite and both receptive and expressive scaled scores, with the most robust associations between EVT2 and language composite (r = 0.50, p = 0.002) and receptive scaled score (r = 0.56, p < 0.001). After controlling for sex, GA, highest parental education, and family income, the relationships between EVT2 and both BSID-III language composite and receptive language remained significant (Supplementary Table 5).

Discussion

In this study investigating the relationship between growth in the NICU in EPT infants and their brain and language outcomes at early school age, we have provided the first evidence that growth anthropometrics at 36 weeks PMA are significantly associated with global brain volume and relative cortical thickness at 4 to 6 years of age in children with no known brain injury or neurologic abnormalities. These associations have potential implications for clinical practice and provide further evidence for the third trimester of gestation (experienced outside the womb for our participants) being a vital period of brain growth.

While no other study to our knowledge has related growth in the neonatal period to early school-age brain imaging, our findings are congruent with the work of other investigators who have previously evaluated the association of neonatal anthropometry with various neuroimaging metrics obtained at term equivalent age10,12,52,53. HC at term is strongly related to total brain volume at term10,53, and here we have shown that this relationship persists to school-age. In our unadjusted model, we also found that weight z-score at 36 weeks PMA to be positively associated with total intracranial volume at school age. Similarly, Coviello et al. demonstrated that weight gain in the first 4 weeks of life was correlated with total brain volume at term. In a study that included brain imaging in older school-age children, including those born both at term and preterm, Silva et al. tracked fetal and postnatal growth at multiple intervals, from 20 weeks gestation to 24 months of age, and found that greater weight gain during each age window was independently associated with larger brain volumes at 10 year of age54. Thus, achieving greater growth during the NICU hospitalization for preterm infants may have lasting effects in overall brain size.

Furthermore, we identified a novel negative association between HC at 36 weeks PMA and normalized cortical thickness in both hemispheres. While a negative relationship could be a sign of neuronal pruning, we did not observe the same findings using non-normalized metrics. We posit that this directionality instead could be explained by increases in global brain volume relative to the changes in cortical thickness. Although there were no significant findings relating growth and regional brain metrics in multivariable modeling, PPVT4 and right temporal cortical thickness were positively associated, consistent with our pilot study that previously reported on a subset of the patients in this cohort24.

Some studies have also examined long-term growth and neurodevelopment outcomes. Belfort et al. calculated the slopes of growth from birth to term equivalent age, from term to 4 months, and from 4 months to 1 year of age in an attempt to identify sensitive periods of postnatal growth. They concluded growth in the first two periods were predictive of higher BSID-II scores at 18 months4. In a separate cohort, improved linear growth from term to 4 months was more predictive of lower odds of IQ < 8529. Others have investigated relationships between growth in infancy and early childhood in very low birth weight and preterm children and have found some positive correlations between linear growth, weight gain, or increase in head circumference versus motor and cognitive outcomes at preschool and early school age55,56. There has been wide variation in the growth metrics related to outcome and the nature of the relationship, however. While we did not detect an association between NICU growth and language assessments at 4–6 years, for our sample, birth length z-score was positively associated with BSID-III language composite scores at 2 years corrected age. We also noted a strong association between length z-score at 36 weeks PMA and BSID-III language composite scores at 2 years, consistent with findings from Ramel et al.6, and we observed relationships between both composite and receptive language at 2 years and EVT2 scores at 4–6 years of age. Though it is unknown whether these results could be summative, it is possible that our small sample size prohibited us from discerning a direct relationship between growth and EVT2.

Although we were not able to obtain granular nutritional intake data, others have investigated nutrient intake and its association with specific brain imaging markers. In two different cohorts of preterm infants, patients who received increased macronutrient intakes had larger caudate size at term57 or at adolescence58. Early fat and caloric intake have been associated with regional brain volumetrics in preterm infants12,13, while another study found no relationship59. Protein and total energy intake via human milk specifically has also been related to greater total and regional brain volumes60. Again, these studies have primarily utilized term equivalent age brain imaging.

Taken together, our results and those reported by others suggest that optimizing intake and growth in the NICU has direct impact on the developing brain, including areas known to support language development. The cerebral hemispheres and cerebellum are undergoing a remarkable rate of growth during the period that children born EPT are hospitalized in the NICU61,62. Thus, providing the developing preterm brain with optimal nutritional substrate and closely monitoring and supporting somatic growth are logical targets for interventions aimed at decreasing morbidity from preterm birth. We speculate that these positive associations are due to increased growth of the neuronal cell populations in the cortex through infancy and childhood and not due to decreased pruning later in childhood. The children included in this report have not yet attained maximal cortical thickness (typically the peak occurs around 7 to 10 years for girls and 9 to 11 years for boys)16,17. Furthermore, other investigators have shown that, while EPT children in general have decreased intracranial volume, decreased grey and white matter volume, and decreased cortical surface area versus term children, the trajectory of cortical thinning is not significantly different between EPT children and term children17,20,21,63,64,65. Unfortunately, our findings in this report are consistent with those of other investigators in that the relationship between cortical morphometry and neurodevelopmental outcomes in EPT often do not survive multivariable adjustment, potentially due to limited sample size19,21. However, despite our relatively small sample size, these experiments did find an association between normalized cortical thickness in the right temporal region and language scores, which is congruent with a previous report from our group24. Future work could include an investigation of other markers of cortical morphometry, such as surface area and curvature, which have been shown to relate to language scores in preterm infants66.

Our study has a few relative limitations. We did not have body composition data for our participants, as this is a retrospective secondary analysis of a larger ongoing study. Newer studies evaluating fat-free (lean) mass have found associations with motor and cognitive outcomes at 1–2 years7,8 and brain size at term11,52,67. While we do include length and weight using standard growth curves, it remains unclear if weight or length is a better marker for lean mass and brain growth. Additionally, length and HC measurements are subject to error. At the time of the NICU hospitalizations of these infants, length boards were not routinely used. HC measurements could also be affected by inaccuracies related to delivery. Similarly, we did not prospectively assess nutritional intake, feeding composition, or feeding volumes in the NICU. However, given conflicting evidence, a robust relationship between intake in the NICU and brain growth (as indexed by volumetric analysis of MRI at term equivalent age) is still being established12,13,59,60. This could be due to difficulty reliably assessing enteral intake in the NICU, with some investigators finding nutritional intake in their studies failed to meet recommended amounts and others estimating enteral intake based on an assumed composition of maternal breast milk12,13,59, though accurate human milk analysis has been demonstrated to be capably incorporated in these studies60. Finally, we are relying on growth metrics assessed at birth and at 36 weeks PMA only. Future studies will increase the number of participants and the granularity of data collection, enabling a better assessment of nonlinear variables such as growth curves through higher-order techniques, such as functional data analysis.

In summary, we have reported significant positive associations between neonatal growth in the NICU and total and regional brain volumes at early school age for children born extremely preterm. Somatic growth in the NICU did not correlate with language scores at school-age for this cohort. Our hypothesis-generating pilot study results suggest that growth in the NICU has significant and lasting effects on brain development in children born EPT and that this is an exciting avenue for future research.