Introduction

Survivors of preterm birth are at increased risk of long-term neurodevelopmental impairment,1,2 particularly those born at the earliest gestations.3,4,5 Later moderate to severe neurodevelopmental impairment is present in 32% of extremely low birthweight (ELBW) infants2 and is associated with poor postnatal growth during the neonatal period.6 Growth monitoring is recommended to detect faltering growth and allow appropriate intervention to optimize later development, but the most appropriate growth reference to do this is uncertain.

Widely used fetal growth charts such as Fenton,7 Olsen8 and UK-WHO,9 developed from cross-sectional birth data, do not appropriately represent intrauterine growth or postnatal growth of preterm infants,10 neither do they allow for factors contributing to preterm birth that have consequences for birthweight, or the differences between the intrauterine and neonatal intensive care (NICU) environment.11 Consequently, some experts recommend the use of INTERGROWTH- 21st Very Preterm Size at Birth References12 and INTERGROWTH- 21st Preterm Postnatal Growth standards,13 which were developed from longitudinal measures of normally- growing fetuses and infants subsequently born to healthy low risk mothers,14 for monitoring postnatal growth of preterm infants. However, owing to the very small numbers of infants born ≤33 weeks’ gestation included in INTERGROWTH- 21st, there are concerns that the INTERGROWTH- 21st charts are not appropriate for monitoring the postnatal growth of infants below this gestation.15,16

We examined the associations between postnatal growth faltering using four growth references (Fenton, UK-WHO, Olsen and INTERGROWTH-21st) and neurodevelopment at 2 years’ corrected age (CA) in infants born ELBW to determine how well-faltering growth using each of these references predicted developmental delay at 2 years’ CA, and the period of faltering growth that was the best predictor.

Methods

Study design and population

We undertook a secondary analysis of data from the Protein Intravenous Nutrition on Development (ProVIDe) trial, a multicenter (New Zealand and Australia), double-blinded, randomized controlled trial which recruited 434 ELBW babies from 2014 to 2018.17 Infants were eligible if birthweight was <1000 g and an umbilical arterial catheter (UAC) had been inserted. Infants admitted to NICU >24 h after birth, multiple birth >2 infants, genetic and other known abnormality, a congenital disorder impairing growth or in danger of imminent death were excluded. Participants were randomized to receive 1 g/d amino acid solution or placebo (saline) via the UAC for the first 5 postnatal days in addition to standard intravenous nutrition. The primary outcome was survival free of neurodevelopmental disability at 2 years’ CA. The ProVIDe trial was registered (Australian New Zealand Clinical Trials Registry: ACTRN12612001084875) and approved by the Northern B Health and Disability Committee (No 13/NTB/84) and the Children’s Health Queensland Hospital and Health Service Human Research Ethics Committee (HREC/16/QRCH/224) in New Zealand and Australia respectively. Parents or caregivers of each participant provided written informed consent.

For this study, we included all ProVIDe trial participants who underwent neurodevelopmental assessment at 2 years’ CA.

Anthropometry

Head circumference, weight and height were measured at birth, 4 weeks’ and 36 weeks’ postmenstrual age (PMA) and 2 years’ CA. Head circumference was measured with a non-distensible tape measure. From birth until discharge, babies were weighed naked to the nearest 10 g using digital scales and their crown-to-heel length was measured by trained staff to the nearest 1 mm using a Harpenden or similar neonatometer (Holtain Ltd, Dyfed, Wales). At 2 years, children were weighed, without footwear and in minimal clothing, on a digital scale to the nearest 100 g. Their height was measured with a wall-mounted stadiometer to the nearest 1 mm.

Neurodevelopmental outcomes assessment

Cognitive, language and motor skills were assessed using Bayley Scales of Infant and Toddler Development Edition 3 (Bayley-III)18 at 2 years’ CA. The assessment also included a neurological examination for the diagnosis of cerebral palsy (loss of motor function and abnormalities of muscle tone and power), deafness (required use of hearing aid(s) or worse) and blindness (visual acuity in both eyes worse than 6/60). Severity of cerebral palsy was classified using the Gross Motor Function Classification System (GMFCS).19

A child was classified as having neurodisability if they had at least one of: cerebral palsy; deafness; blindness, or developmental delay (defined as a standardized score of <−1 SD on any of the Bayley- III cognitive, language or motor composite scores). The severity of neurodisability was categorized as:20

Mild

Mild cerebral palsy (walking at 2 years with only minimal movement limitations; GMFCS level 1), or suspected developmental delay (standardized score −2 SD to <−1  SD on any one of Bayley- III motor, cognitive or language composite scores)

Moderate

Moderate cerebral palsy (non-ambulant at 2 years but likely to ambulate later; GMFCS level 2 or 3), or deafness, or moderate developmental delay (standardized score from −3 SD to <−2 SD on any one of the Bayley- III composite scores)

Severe

Severe cerebral palsy (non-ambulant; GMFCS level 4 or 5), severe developmental delay (standardized score < 3 SD on any one of the Bayley- III composite scores) or blindness.

Statistical analyses

We calculated weight, length/ height and head circumference z- scores at birth, 4 weeks’ and 36 weeks’ PMA using the four growth charts (Fenton, UK-WHO, Olsen and INTERGROWTH-21st). For INTERGROWTH-21st, we used the Very Preterm Size at Birth References at birth and the Preterm Postnatal Growth standards at other time points. At 2 years’ CA, we used the WHO Child Growth Standards. The INTERGROWTH-21st Very Preterm Size at Birth reference begins at 24 weeks. Therefore, for babies born at 23 weeks’ gestation, birth z-scores using INTERGROWTH-21st were extrapolated as follows. The reference z-score for weight, length and head circumference were extracted separately for boys and girls from the INTERGROWTH-21st website.21 Each outcome (weight, length, height) then was plotted as a function of gestational age and the likely function determined. For length and head circumference (linear relation), ordinary least squares regression was used to estimate the slope and intercept for each z-score and then extrapolated for lower gestational ages. The same approach was used for birthweight (exponential relation) using nonlinear regression to fit the exponential function Vo*exp (K*gestational age) to each of the z-score data tables. The goodness of each model fit was verified by inspection of residuals. The UK-WHO length z-scores were not available below 27 weeks’ gestation and were treated as missing data not extrapolated because this was unlikely to be accurate over the missing 4 weeks.

Faltering growth was defined as follows: faltering weight gain as weight z-score decline ≥0.8, further classified as mild (0.8–1.2), moderate (>1.2–2) or severe (≥2);10,22 faltering linear growth as length z-score decline ≥0.8,10 and faltering head growth as head circumference z- score decline >1.23

We compared the prevalence of neurodisability between infants with and without faltering growth using Chi-square and Fisher’s exact tests. We examined the association between growth (change in z-scores over time) and Bayley III composite scores using Pearson’s correlations. We estimated the odds of developmental delay at 2 years’ CA among infants with and without faltering growth using binary logistic regression. We examined faltering growth as a predictor of developmental delay by calculating sensitivity, specificity and area under the receiver operating characteristic curve (AUC).

We used IBM SPSS Statistics version 28 and SAS (version 9.4 SAS Institute Inc, Cary NC) for statistical analyses. Continuous variables are presented as means with standard deviations (SD) or median and interquartile range and categorical variables as frequencies and percentages. We measured inter-observer agreement of faltering growth (categorical classification) between the growth references using Cohen’s kappa coefficient with 95% confidence intervals (CI). We undertook a sensitivity analyses excluding infants born before 24 weeks’ whose z-scores were calculated by extrapolation, and excluding small-for-gestational-age (SGA) infants (birthweight <10th centile using each of the four growth charts). A p value ≤ 0.05 was considered statistically significant for all analyses.

Results

Of the 434 infants recruited to the ProVIDe trial, 352 were eligible for assessment at 2 years’ CA and 330/352 (94%) were assessed. Our study included 327/352 (93%) participants who had data available for the primary outcome of neurodisability (Fig. 1). The median gestational age was 26 weeks and median birthweight was 797 g (Table 1). European was the most common ethnicity (49%) and almost a third of mothers (33%) had a university degree. At 2 years’ CA, neurodisability was present in 128/327 (39%), with language the most common domain affected (32%).

Fig. 1: Flowchart for participants included in this study.
figure 1

Data presented as n (%). ProVIDe Protein Intravenous Nutrition on Development.

Table 1 Sociodemographic and clinical characteristics of infants and their mothers.

The proportions of participants classified as having faltering growth were different between growth references at all time periods (Table 2). Similarly, mean z-scores and mean change in z-scores were significantly different (Supplementary Table 1). The highest prevalence of faltering growth from birth to 4 weeks and from birth to 36 weeks’ PMA was recorded using the Olsen growth reference. The INTERGROWTH- 21st growth standard classified more participants as having faltering growth from 4 weeks to 36 weeks’ PMA and from 36 weeks’ PMA to 2 years.

Table 2 Prevalence of faltering growth at various time intervals using Fenton 2013, UK-WHO, Olsen 2010, and INTERGROWTH-21st.

The proportion of agreement (kappa coefficient) in the classification of faltering growth was higher (moderate to excellent) among the three fetal references (Fenton 2013, Olsen 2010 and UK-WHO) than between the INTERGROWTH- 21st healthy preterm growth standard and the fetal references (poor to good) for each period of growth assessed (Table 3). The proportion of agreement between INTERGROWTH- 21st and the fetal references was lowest for faltering weight gain and highest for faltering head growth.

Table 3 Agreement in classification of faltering growth between growth charts.

The prevalence of any neurodisability was 37% higher in participants with faltering head growth than in participants with adequate head growth from birth to 4 weeks using both Fenton (p = 0.031) and INTERGROWTH-21st (p = 0.026) (Table 4). From birth to 36 weeks’ PMA, participants classified by INTERGROWTH-21st, but not the fetal references, as having faltering length and head growth had 55% (p = 0.002) and 42% (p = 0.012) higher prevalence of any neurodisability compared to those who had adequate length and head growth. Similarly, participants classified as having faltering growth in length from 4 weeks to 36 weeks by INTERGROWTH-21st had 46% (p = 0.008) higher prevalence of any neurodisability compared to those who had adequate length growth, whereas the differences in prevalence were smaller (0−26% for length and 18−25% for head growth) and not statistically significant for the fetal references. Generally, these increased strengths of associations were seen for both mild and moderate/severe disability. In contrast, the prevalence of any neurodisability was not significantly increased for infants who had any faltering weight gain. However, moderate/ severe faltering weight gain was associated with 19% to 75% higher incidence of moderate/ severe neurodisability compared to mild faltering weight gain.

Table 4 Prevalence of neurodisability at 2 years’ corrected age in infants with and without faltering growth, classified using Fenton 2013, UK-WHO, Olsen 2010, and INTERGROWTH-21st.

Changes in head circumference z-scores between birth and 36 weeks’ PMA and also between 4 weeks and 36 weeks’ PMA for all four growth references were weakly positively correlated with cognitive, language and motor scores at 2 years (Supplementary Table 2). Faltering weight, length and head growth were more strongly associated with motor delay than cognitive or language delay (OR 1.9−4.7 vs OR 1.6−2.1). Faltering head growth between 4 and 36 weeks’ PMA by the fetal references was associated with the highest odds of motor delay compared to INTERGROWTH-21st (OR 3.4−4.7 vs 2.3). Generally, faltering head growth was more strongly associated with developmental delay than other measurements (OR 1.7−4.7 vs OR 1.6−2.4). Similarly, faltering growth between 4- 36 weeks’ PMA was more strongly associated with developmental delay than other time periods (OR 1.6−4.7 vs OR 1.7−2.9) (Table 5).

Table 5 Odds of developmental delay at 2 years’ corrected age in infants with faltering growth, classified using Fenton 2013, UK-WHO, Olsen 2010 and INTERGROWTH-21st.

The AUC for prediction of developmental delay were all <0.63. Faltering growth from birth to 4 weeks and birth to 36 weeks’ PMA as assessed using INTERGROWTH-21st was more specific but less sensitive for predicting developmental delay than fetal references. From birth to 36 weeks’ PMA, faltering head growth using INTERGROWTH-21st but not the fetal references predicted motor delay at 2 years (AUC = 0.62, 95%CI 0.54−0.71, p = 0.007) and faltering length growth also using INTERGROWTH-21st predicted language delay (AUC = 0.60, 95%CI 0.53–0.67, p = 0.007). Faltering length growth from 4 to 36 weeks’ PMA using INTERGROWTH-21st (AUC = 0.60, 95%CI 0.51–0.68, p = 0.034) and Fenton (AUC = 0.60, 95%CI 0.51–0.69, p = 0.029) predicted motor delay. Faltering in any growth measure did not significantly predict cognitive delay using any of the growth references. (Table 6).

Table 6 Faltering growth categorized by Fenton 2013, UK-WHO, Olsen 2010 and INTERGROWTH-21st as a predictor of developmental delay at 2 years’ corrected age.

Sensitivity analysis excluding the 20 infants born at 23 weeks whose z-scores for INTERGROWTH-21st were calculated by extrapolation did not change our findings (Supplementary Tables 37).

Sensitivity analysis excluding SGA infants also did not change any of our findings about the relationships between faltering growth and developmental delay at 2 years (Supplementary Tables 8 and 9).

Discussion

We examined the differences in classification of postnatal growth using fetal references (Fenton, UK- WHO and Olsen) and a healthy preterm growth standard (INTERGROWTH-21st) and their association with neurodevelopment at 2 years’ CA. Although faltering early growth was poorly predictive of developmental delay using any of the growth charts, our results suggest that infants with faltering postnatal growth classified by INTERGROWTH-21st were at highest risk of developmental delay compared to those classified using fetal references, and that faltering growth in head circumference and length, particularly from 4 to 36 weeks’ PMA, were most strongly associated with developmental delay at 2 years.

We found that there was high agreement between the fetal references in the classification of faltering growth. In general, infants categorized as having faltering growth by Fenton, were also categorized as having faltering growth by UK- WHO and Olsen. However, there was less agreement between INTERGROWTH-21st, the healthy preterm standard, and the fetal references. These findings are in agreement with other reports of higher agreement in faltering growth classification between fetal references than between INTERGROWTH-21st and fetal references24. This is likely attributable to the differences in methods used to develop the different growth curves. The fetal references were developed from cross-sectional birth data of infants born preterm8,15 whereas the INTERGROWTH-21st preterm growth curves were developed from longitudinal growth measures of fetuses who were subsequently born preterm.14 Thus, the fetal growth curves are similar to each other but different from the preterm growth standards.

Olsen charts identified the highest prevalence of faltering growth. However, faltering growth using INTERGROWTH-21st was more strongly associated with developmental delay than faltering growth using the fetal references, although with some overlap of CI. The fetal charts, unlike the INTERGROWTH-21st, are based on in utero growth that does not account for the early physiological weight loss and slightly longer time required by preterm infants to regain birthweight, thus preterm infants are more likely to be classified as having faltering growth using these references than by the preterm postnatal growth standards of INTERGROWTH-21st, particularly within the first four postnatal weeks. Therefore INTERGROWTH-21st chart detects more severe faltering growth, possibly accounting for the observed higher association with developmental delay using the preterm growth standard. We also found that from 36 weeks’ PMA to 2 years, faltering growth using any of the growth charts was not associated with increased risk of developmental delay. This suggests there is no advantage in using any particular one of these growth charts at 36 weeks’ PMA (approximately NICU discharge) for identifying infants at risk of developmental delay.

An inherent limitation of the INTERGROWTH-21st healthy preterm growth standards is the inclusion of very few infants born extremely or very preterm. Thus, for our study, we could not directly calculate birth z-scores of weight, length and head circumference from the INTERGROWTH-21st Very Preterm Size at Birth References for 20 (6.1%) participants born at 23 weeks and had to extrapolate the z-scores from later gestations. However, sensitivity analysis excluding these extrapolated values did not change our findings.

We found that faltering growth using any of the four growth references was not a good predictor of developmental delay at 2 years’ CA, with AUC all <0.63. This finding is in agreement with and extends the findings of a previous study of infants born <33 weeks’ gestation that growth measured using Fenton, Olsen and INTERGROWTH-21st were not strong predictors of developmental delay at 18 months and 7 years, and that the AUC were all <0.69.24 A retrospective study of infants born <33 weeks’ gestation that assessed faltering growth using Fenton and INTERGROWTH-21st as predictors of developmental delay at 12 and 24 months also reported that the AUC were all <0.66.25 Although being born SGA is associated with poorer neurodevelopmental outcomes,26,27 adjusting for SGA did not improve the predictive ability of faltering weight growth for poor neurodevelopmental outcomes in a previous study.25 Our sensitivity analysis excluding SGA infants also did not change our findings, with the odds of developmental delay largely unchanged and all AUC < 0.63. Many factors other than growth, including illness, infection and brain injury, contribute to neurodevelopmental outcomes, and these factors may also contribute to faltering growth in affected infants.28,29 Thus, faltering growth may serve as a marker for identifying infants at risk, rather than being a good predictor of developmental delay by itself. Nevertheless, prevention of faltering growth in NICU remains a key goal, as faltering growth is associated with later developmental delay and could also exacerbate other morbidities.

Many authors have reported that faltering growth during NICU stay, typically up to term-equivalent age, is related to adverse neurodevelopmental outcomes,23,24,30,31 but there is less certainty about which period is most important. Others have noted the frequency of faltering growth in the first few postnatal weeks owing to early weight loss from postnatal diuresis and delayed initiation of nutrition support, among other reasons.10,32,33 Our findings suggest that this early faltering growth in the first 4 weeks is not strongly related to later developmental delay. Rather, it is continuing faltering growth from 4 to 36 weeks’ PMA, particularly if severe, that is associated with increased risk of later developmental delay. This may be consistent with findings of a recent study that faltering head growth calculated from the end of physiological weight loss (14–21 postnatal days) rather than from birth, to discharge, was associated with an increased risk of neurodevelopmental impairment at 24 months’ CA in very preterm infants (aOR = 3.94, 1.19–13.03 vs aOR = 0.69, 0.33–1.45).34 It remains to be seen whether a focus on improving growth in the NICU, particularly between 4 and 36 weeks’ PMA, may help attenuate risk of later developmental delay. It is also possible infants taking part in the ProVIDe trial of early nutrition received better early nutrition than may be the case in other centers, so the generalizability of this finding needs further assessment.

We found that faltering head growth was more strongly associated with developmental delay than faltering weight or length growth. Other studies similarly found that faltering head growth in infancy was associated with higher odds of developmental delay compared to faltering weight or length growth.24,34,35 Head growth is an indicator of brain growth, thus faltering head growth likely reflects poor brain growth,36 which is an independent predictor of faltering neurodevelopmental outcomes.37

Faltering growth in all growth measures was more strongly associated with motor delay than cognitive or language delay. This may be because at 2 years, motor skills are more easily assessed and delays more readily identified than delays in higher order cognitive or language skills,38 or that development of motor function is more affected by early nutrition. Nutritional supplements are reported to have a greater effect on motor outcomes than other aspects of development in infants (<3 years) who were born preterm or SGA.39 It remains to be seen whether this association persists at early school-age follow-up of this cohort. Other authors have reported among infants born extremely preterm, that neurodevelopmental outcomes in mid-childhood are poorly predicted by outcomes at 2 years.38

Our study has many strengths. Firstly, our study focuses on ELBW infants who are at the highest risk of poor nutritional and neurodevelopmental outcomes compared to other preterm and term-born children Second, anthropometric measurements were conducted prospectively according to standardized methods, and we were able to analyze longitudinal changes in growth over time as well as cross-sectional data. The cohort size was large and the follow-up at 2 years’ CA was very high (94%), so the results of this study are generalizable to the ProVIDe trial cohort and also to ELBW of similar characteristics to this RCT cohort.

There are also some limitations. Because infants enrolled in the ProVIDe Trial are likely to have been the most unwell and smallest for whom clinicians had chosen to insert a UAC, the generalizability of our study’s findings may be more limited for ELBW infants who are less sick, and sample size was limited by the size of the inception cohort. Although our cohort was of similar size to others reporting relationships between early growth and later development in preterm infants,25,40,41 validation of our findings with other and ideally larger cohorts of ELBW infants is warranted. It is also likely that as part of routine care, clinicians would have intervened to mitigate any detected faltering growth, thus potentially reducing the strength of any observed associations between faltering early growth and developmental delay. We also acknowledge that neurodevelopmental assessment at 2 years has limited predictive value for later childhood neurodevelopmental outcomes, and thus further follow-up of this cohort at school age is currently underway.

In summary, the highest prevalence of faltering growth was recorded using Olsen growth charts. Faltering head growth and faltering growth from 4 weeks to 36 weeks’ PMA were associated with developmental delay at 2 years, particularly motor delay. Although faltering growth detected with growth charts may help identify infants at risk, it is poorly predictive of developmental delay at 2 years’ CA.