Introduction

Infants born very preterm are at risk of adverse motor, cognitive, language and behavioural outcomes which persist to school age and beyond.1 They are four times more likely to experience motor delays,2 six times more likely to have developmental coordination disorder3 and 40 times more likely to be later diagnosed with cerebral palsy compared to term born peers.4 Cognition is suggested to be affected more frequently than motor function with preterm born infants scoring on average 12 points lower on IQ testing than term born children.5 Early, accurate identification of preterm infants at risk of adverse outcomes is critical to enable structured surveillance, early commencement of targeted interventions and provision of family supports.

The search for early prognostic biomarkers has led to developments in neuroimaging techniques and clinical measures such as assessment of neurological, motor and neurobehavioural function. Currently the combination of magnetic resonance imaging (MRI) at term equivalent age (TEA) and a General Movements assessment (GMs) at 3 months corrected age (CA) has the best prediction for motor and cognitive outcomes following very preterm birth.6,7,8,9,10 Earlier MRI prior to term has been found to be safe and feasible.11 Structural MRI at 30–32 weeks postmenstrual age (PMA) has demonstrated good predictive validity for later motor and cognitive outcomes12 and associations with concurrent clinical measures both early in the neonatal period and at TEA.13 It is not yet known if earlier MRI and earlier clinical assessments strengthen the associations with later outcomes when compared with MRI and clinical assessment at TEA and 3 months CA. Our prospective cohort study, ‘PPREMO: Prediction of preterm motor outcome', aimed to develop a toolbox of structural and advanced diffusion MRI and clinical biomarkers that can predict outcomes prior to TEA and 3-month GMs.11 The first step in the process is to examine the early variables individually and in combination to determine their associations with motor and cognitive outcomes at 2 years CA.

In a cohort of very preterm infants with early neonatal (32 weeks PMA) and TEA MRI and comprehensive clinical assessment, clinical assessment at 3 months CA and outcomes measured at 2 years CA, this study sought to determine (i) which combination of MRI and clinical measures were most strongly associated with motor outcomes at 2 years CA, and (ii) which combination of MRI and clinical measures were most strongly associated with cognitive outcomes at 2 years CA.

Methods

Study design and participants

This prospective cohort study was conducted at the Royal Brisbane and Women’s Hospital (RBWH), a specialist tertiary neonatal centre in Brisbane, Australia. Infants were eligible for inclusion between February 2013 and February 2016 if they were born <31 weeks gestational age, their parents/carers lived within a 200 km radius of the hospital and were English speaking. Infants were excluded if they had a known congenital or chromosomal abnormality likely to affect their neurodevelopmental outcome. Informed parental consent was obtained for all participants. This study was nested within a broader study, and sample size calculations are detailed in the study protocol.11 Ethical approval was obtained from the RBWH Human Research Ethics Committee (HREC/12/QRBW/245) and The University of Queensland (2012001060). Trial registration: Australian New Zealand Clinical Trials Registry (ACTRN12613000280707).

Social demographic data were collected via a questionnaire and a scoring system, previously validated in cohorts of preterm infants, was applied to classify social risk status.14,15,16 Six aspects were considered, including family structure, education of primary caregiver, occupation of primary income earner, employment status of primary income earner, language spoken at home and maternal age. A score of 0 to 2 was applied for each of the six items for a maximum total score of 12. Higher scores indicate a higher level of social risk and a total score of 2 or above was defined as high social risk. Full details of items and scoring are available in Appendix 1.

MRI acquisition

Infants underwent brain MRI at 30–32 weeks PMA or when medically stable (‘Early’), and again at 40–42 weeks PMA (‘Term’). An MR compatible incubator equipped with a dedicated neonatal head coil (LMT Lammers Medical Technology, Lübeck, Germany) was used in conjunction with a 3T MRI Siemens Tim Trio (Erlangen, Germany) scanner. Ear protection attenuated noise, no sedation was used and MRI was performed during natural sleep. Coronal, axial and sagittal T2-weighted HASTE (TR/TE 2000/90 ms, flip angle 150°, field of view 200 × 160 mm, matrix 320 × 256, slice thickness 4 mm) were acquired. Axial T1 TSE (TR/TE 1490/90 ms, flip angle 150°, field of view 200 × 160 mm, matrix 256 × 180, slice thickness 2 mm) and an axial multi-echo T2 TSE (TR/TE1/TE2/TE3 10580/27/122/189 ms, flip angle 150°, field of view 144 × 180 mm, matrix 204 × 256, slice thickness 2 mm) were also acquired.

MRI scoring

An independent neurologist with training in brain MRI (SF) and blinded to history and clinical assessment findings applied a standardised scoring system to MRIs.12,17 The scoring system generates four subscales which can be summed to give a global total score; white matter, cortical grey matter, deep grey matter and cerebellum.12,17 Both T1 and T2 images were evaluated during scoring and T1 hyperintensities and T2 hypointensities were recorded and considered as signal abnormalities. Sagittal T2-weighted images were used to score the corpus callosum as it is clearly visualised as low signal intensity prior to myelination. Inter- and intrarater reproducibility of the scale have been demonstrated in two studies, one of which included SF as a rater.12,17

Clinical measures

Clinical assessments were combined to remove duplicate items and reduce handling. They were completed within a week of MRI by an experienced assessor, blinded to birth history and brain imaging findings (J.G.). Full details of assessment tools and administration are available in the study protocol, including detailed summaries of their psychometric properties,11 see also Fig. 1.

Fig. 1: Flowchart of assessment timepoints and measures.
figure 1

Early, 30–32 weeks postmenstrual age (PMA); Term, 40–42 weeks PMA; blue (dotted line) represents analyses examining relationships between Early measures and 2-year outcomes; yellow (solid line) represents analyses examining relationships between Term measures and 2-year outcomes; orange (dashed line) represents analyses examining relationships between 3-month measures and 2-year outcomes; green (bold lines) represents analyses examining relationships between all three timepoints combined and 2-year outcomes.

The GMs is a neuromotor assessment of observed spontaneous movements. Assessment at Early and Term timepoints are in the ‘writhing period’ (sensitivity 75–100%, specificity 40–48% for CP)18,19 and at 3 months CA is in the ‘fidgety period’ (sensitivity 98%, specificity 91% for CP).20 Scoring was performed by two advanced GMs raters (J.G. and B.S.), with cases of non-agreement reviewed until consensus reached and advice sought from a third blinded rater.

The Hammersmith Neonatal Neurological Examination (HNNE) is a neurological assessment evaluating posture, tone, reflexes, spontaneous movements, orientation and behaviour.21 At the early assessment, all items except placing were administered. In the preterm period, sensitivity and specificity for predicting CP are 57–86% and 45–83%, respectively, increasing to 68–96% and 52–97% at TEA.20 Inter-rater reliability between the clinical assessor (J.G.) and an observer (P.C.) for the HNNE total optimality score was tested with intra-class correlation coefficients of 0.94 Early, and 0.99 at Term.

The NICU Neonatal Neurobehavioural Scale (NNNS) evaluates an infant’s response to stimuli and handling, state regulation, motor performance, and neurological status.22 The NNNS at TEA has been shown to predict motor and cognitive outcomes at 18 months CA, motor outcomes at 24 months CA and cognitive outcomes at 4.5 years.23,24,25 The Premie-Neuro (PN) is a neurological examination which consists of three categories: neurological, movement and responsiveness.26 It has been designed for use from 23 to 37 weeks PMA and has scoring based on expected performance at each week of PMA.26

At TEA and 3 months CA, a visual assessment developed by Ricci et al.27 was used to examine visual function. The test examines ocular motility, acuity and the ability to fix and follow and has demonstrated predictive validity for neurodevelopmental outcomes in preterm cohorts.28 The Test of Infant Motor Performance (TIMP), performed at TEA and 3 months CA, is a standardised, discriminative and evaluative assessment of gross motor development.29,30 Construct validity enabling discrimination between infants at high and low risk of adverse motor outcomes has been demonstrated.29

Outcomes

An experienced paediatric physiotherapist, blinded to all earlier clinical and MRI findings, assessed participants at 2 years CA using the Bayley Scales of Infant and Toddler Development 3rd edition (Bayley III).31 The Bayley III is a discriminative and norm-referenced tool. The motor and cognitive composite scores were utilised in this study. They have a mean of 100 and a SD of 15 and higher scores reflect better development.

Children were assessed by a paediatrician for signs of cerebral palsy as per the definition by Rosenbaum et al.32 to allow description of the cohort. Any level of severity and distribution were included and recorded using the Gross Motor Function Classification System.33

Statistical analysis

Pearson correlations and univariable regression analyses were performed to compare the relationship between individual variables from Early, Term and 3 months CA with 2-year outcomes. Clinical and MRI scores were used as dependent variables without categorisation. The 3-month GMs score was strongly associated with both motor and cognitive outcomes, so we calculated the associations between each of the variables and outcomes, adjusting for 3-month GMs score. Multivariable models of MRI and clinical variables at all three timepoints were constructed using forward stepwise selection of next variable. The variable with the smallest p value was added to the model if p < 0.01. For multivariable analyses we considered: (i) use of maximum possible data available for each model without imputing any data; (ii) as our dataset contained 10 sets of twins and 1 set of triplets, we conducted mixed effects linear regression models with family included as a random effect, (iii) GMs featured as an ordinal variable (0 = normal, 1 = poor repertoire/abnormal fidgety, 2 = cramped synchronised/absent fidgety). Results are presented as regression coefficients with 95% confidence intervals. Analysis was performed using Stata statistical software, v16 (StataCorp, College Station, TX).

Results

Early MRI was available for 119 preterm infants, N = 92 and N = 98 of which had motor and cognitive outcome data for Bayley III. Full demographic, perinatal and clinical assessment data have been published previously13 and are summarised in Table 1. MRI scoring and clinical assessment results are summarised in Table 2. Differences in perinatal data for participants with and without a 2-year outcome are summarised in Supplementary Table 1. Those children who failed to return at 2 years had a slightly greater birthweight (1225 vs 1065 g) and underwent Term MRI at a slightly younger PMA (40 + 0 vs 40 + 4) which are unlikely of clinical significance. They also had a lower rate of receiving antenatal steroids (43% vs 76%).

Table 1 Characteristics of the study sample.
Table 2 Summary of MRI and clinical assessment scores.

Motor outcome

Univariable analyses between MRI/clinical scores and 2-year Bayley III motor outcome

Univariable associations and correlation results are presented in Supplementary Table 2. The 3-month GMs score demonstrated the strongest associations with motor outcome (regression coefficient β = −13.2; 95% confidence interval CI = −18.5 to −7.9, p < 0.001). Early MRI global score and Term MRI white matter, deep grey matter, cerebellar and global scores were all associated with motor outcome. Early GMs, Early HNNE reflexes and abnormal signs, Term HNNE total, and the domain of NNNS regulation at both Early and Term timepoints were associated with motor outcome. The overall TIMP z-score and the visual assessment score at 3 months CA were both significantly associated with motor outcome.

Associations after adjustment for 3-month GMs score

Results are presented in Supplementary Table 2. After adjustment for 3-month GMs, Term MRI deep grey matter score had the strongest association with motor outcome (β = −3.8; 95% confidence interval CI = −6.7 to −1.0, p < 0.01). Term MRI cerebellar and global total scores, and Early GMs and early HNNE reflexes score were significantly associated with motor outcome.

Multivariable models for 2-year Bayley III motor outcome

Multivariable model results are presented in Table 3. When motor outcome multivariable models were developed for each timepoint individually the results are as follows:

  1. i.

    32 weeks PMA model included early HNNE reflexes subscale score (β = 4.30; 95% CI = 1.47 to 7.12; p = 0.003) and the early HNNE abnormal signs subscale score (β = 6.15; 95% CI = 1.64 to 10.65; p = 0.007).

  2. ii.

    TEA model included TEA deep grey matter MRI score (β = −5.45; 95% CI = −8.25 to −2.65; p < 0.001) and the TEA NNNS regulation subscale (β = 6.28; 95% CI = 1.90 to 10.66; p = 0.005).

  3. iii.

    3 months CA included 3-month CA GMs score (β = −13.19; 95% CI = −18.52 to 7.86; p < 0.001).

Table 3 Multivariable models for motor outcome on the Bayley III.

When the three timepoints were combined, the best motor outcome multivariable model included 3-month CA GMs score (β = −11.41; 95% CI = −17.34 to −5.49; p < 0.001), TEA deep grey matter MRI score (β = −6.23; 95% CI = −9.47 to −2.99; p < 0.001) and the early HNNE reflexes subscale score (β = 3.51; 95% CI = 0.86 to 6.16; p = 0.009). The model was constructed from data of 83 participants who had data available for all three variables.

Cognitive outcome

Univariable analyses between MRI/clinical scores and 2-year Bayley III cognitive outcome

Univariable associations and correlation results are presented in Supplementary Table 3. Term HNNE score demonstrated the strongest association with cognitive outcome (β = 1.6; 95% CI = 0.8 to 2.4; p = 0.000047) followed closely by 3-month GMs (β = −12.2; 95% CI = −18.2 to −6.3; p = 0.000054). Early MRI cerebellar score, Term MRI cortical grey matter, deep grey matter, cerebellar and global score were associated with cognitive outcome. Early HNNE reflexes, NNNS stress, NNNS arousal and the neurological subscale of the Premie-Neuro were associated with cognitive outcome. Term HNNE posture and tone, tone patterns, reflexes, spontaneous movements and total score, term NNNS regulation, TIMP and visual assessment, as well as 3-month TIMP and visual assessment were associated with 2-year cognitive outcome.

Associations after adjustment for 3-month GMs score

A very similar pattern of associations between the clinical assessment scores and cognitive outcome was seen after adjusting for 3-month GMs. Only Term NNNS regulation and the TIMP z-score were no longer significant. Fewer MRI scores remained associated with the outcome, with only the Term MRI deep grey matter score maintaining significance (β = −3.5; 95% CI = −6.4 to −0.7; p = 0.02).

Multivariable models for Bayley III cognitive outcome

When cognitive outcome multivariable models were developed for each timepoint individually the results are as follows:

  1. (i)

    32 weeks PMA model included early HNNE reflexes subscale score (β = 4.14; 95% CI = 1.43 to 6.84; p = 0.003).

  2. (ii)

    TEA model included TEA HNNE total score (β = 1.15; 95% CI = 0.35 to 1.94; p = 0.005), TEA deep grey matter MRI score (β = −4.01; 95% CI = −6.84 to −1.17; p = 0.006) and the TEA GMs score (β = −6.24; 95% CI = −10.85 to −1.63; p = 0.008).

  3. (iii)

    3 months CA included 3-month CA GMs score (β = −9.39; 95% CI = −15.54 to −3.23; p = 0.003) and the 3-month TIMP total z-score (β = 4.64; 95% CI = 1.16 to 8.11; p = 0.009).

When the three timepoints were combined, the best overall model for cognitive outcome included the 3-month CA GMs score (β = −10.01; 95% CI = −15.90 to −4.12; p = 0.001) and TEA HNNE score (β = 1.33; 95% CI = 0.57 to 2.08; p = 0.001). Results are presented in Table 4. The model was constructed from data of 95 participants who had TEA HNNE, 3-month GMs and 2-year cognitive data available.

Table 4 Multivariable models for cognitive outcome on the Bayley III.

Discussion

This study analysed a large cohort (n = 98) of very preterm born infants using Early MRI (32 weeks PMA) and TEA with concurrent clinical measures and clinical assessment at 3 months CA to determine associations with motor and cognitive outcomes at 2 years CA. Our study identified that 3-month GMs, Term MRI deep grey matter score and the Early HNNE reflexes score was the strongest model associated with motor outcome when data from all timepoints was included. The TEA HNNE total score and 3-month GMs were most strongly associated with cognitive outcome when data from all timepoints were included. Assessment before TEA strengthens associations for motor outcomes, but less so for cognitive outcomes. Clinical measures alone can be used for associations with cognitive outcomes, a very important finding given the limited availability of MRI in the majority of clinical settings.

This study evaluated whether the addition of data from earlier in the neonatal period, to the traditional TEA and 3 months CA assessment timepoints, could improve associations with outcomes. Our data suggests that addition of neurological assessment data (HNNE) from the earlier timepoint improves associations with motor outcome. Addition of neurological assessment at TEA, improves associations for cognitive outcome. The clinical implications are that there is value in adding earlier neurological testing, to clinical management of very preterm infants, for the purposes of screening and planning further monitoring and surveillance.

Our data support previous studies of 3-month GMs to be strongly associated with both motor6,20,34 and cognitive8,9 outcomes and that the combination of TEA MRI and 3-month GMs has the greatest predictive accuracy for motor outcomes for preterm infants.20 For the first time, we have demonstrated that addition of the HNNE reflexes score at 32 weeks PMA strengthens the associations with later motor outcomes.

It is an interesting and novel finding from this study that the early HNNE reflexes score demonstrates such strong associations with both motor and cognitive outcomes. The reflexes subscale at TEA is also strongly associated with cognitive outcome, and it is the elements of posture and tone, tone patterns and spontaneous movements which are significantly associated with cognitive outcome at TEA but not early on, which strengthens the total HNNE score at TEA. The HNNE reflex subscale assesses primitive reflexes of sucking, hand and toe grasp, Moro reflex, deep tendon reflexes and pacing. Other elements of the HNNE such as posture and tone, movements and behaviour are assessing functions that likely evolve over the first months of life in line with cortical development, becoming more discriminative by TEA and therefore more strongly associated with cognitive outcomes. Our data suggest that neonatal reflexes both early and at term are an important marker of outcomes and further analyses to tease out predictive accuracy and which of the reflex items contribute the most valuable information is warranted.

Diagnostic accuracy statistics are required to determine the true predictive accuracy of individual and combined variables for neurodevelopmental outcomes. This manuscript presents data from the first half of our preterm cohort. We have subsequently completed a total of n = 262 early MRI and clinical assessment, data collection is still in progress for 2-year outcomes and will be complete in February 2022. With the full dataset, we will undertake diagnostic accuracy analyses including sensitivity, specificity, positive predictive value, negative predictive value and accuracy for each of the individual variables and the combined models. The work completed and presented here is an important first step which has identified which of the measures, individual variables and combined models should be investigated further and tested with diagnostic accuracy analyses, both in our larger cohort when data collection is complete, and to guide others working the field as to which measures and variables warrant further investigation. In addition, future work will need to evaluate the predictive contributions of the measures and combined models, independent of other known risk factors such as GA at birth, presence of intraventricular haemorrhage or bronchopulmonary dysplasia, etc.

We have identified which measures and variables from a single timepoint could be selected by clinical or research teams, as well as which combination of variables over time can be combined to assist with risk stratification for follow-up screening and intervention requirements. Further work will further assist in understanding if the associations found are clinically meaningful as despite having statistically significant associations, the actual correlations were relatively low. GMs at all timepoints appear to make a valuable contribution. The HNNE certainly warrants further investigation, having been found in this work to contribute to associations with both motor and cognitive outcomes. The NNNS is complex to administer and score, and is unlikely to confer clinical benefit over and above simple clinical tools such as the GMs and HNNE. The Premie-Neuro has good clinical utility and the neurological subscale was associated with cognitive outcomes. Given the HNNE was so much more strongly associated with cognitive outcomes, it is unlikely that it would be justifiable to select the Premie-Neuro as a clinical tool over the HNNE which also evaluates neurological function. The TIMP warrants further investigation to determine if the associations, especially at 3 months CA, are clinically meaningful and truly predictive, and whether it adds value beyond the administration of the 3-month GMs. The assessment of visual function is very simple and quick to administer and if found to be equivalent to the TIMP in terms of predictive accuracy, may be more efficient to administer in preterm follow-up services. Both early and TEA MRI variables were significantly associated with neurodevelopmental outcomes. Diagnostic accuracy analyses will be able to definitively answer the question as to whether early MRI really adds value to predicting long-term outcome given the expense and challenges of doing this compared with TEA MRI. While our study purposely completed multiple different assessments to compare their relative associations with outcomes, ultimately the long-term aim of this project is to determine the minimum assessment schedule that offers the best cost–benefit balance for both families and clinical services.

In the landscape previously of a ‘wait and see' approach to outcomes from prematurity, our data and that of others provide strong evidence for assessments prior to 3 months CA. In the cases for those children with the most serious adverse outcomes of cerebral palsy, clinicians are encouraged by recent International Clinical Practise Guidelines for the early detection of Cerebral Palsy, to be using these tools and providing clear early diagnosis and prognosis to families as early as 3–4 months CA.6 Early interventions are being offered earlier than previously possible, starting from 3 to 6 months CA.35,36 It could be argued then that little is added by assessment earlier than term, and the resource cost may not be justified. Many neuroprotective, neurorestorative and rehabilitative treatment options are however being developed for application shortly after birth and while the infant is still in the neonatal intensive care unit. These include treatments with creatine, melatonin, amnion cells and developmental care. In that context, both clinical assessment and MRI, early in the neonatal period, are critical to enable selection of infants for eligibility and participation in these research trials. Our data provide evidence for tools for very early identification of participants who may warrant and benefit from new early interventions aimed at improving outcomes. Items from the early timepoint with significant associations with motor outcome include the MRI global score, GMs, HNNE reflexes, HNNE abnormal signs and NNNS regulation. Early MRI cerebellar score, HNNE reflexes, NNNS stress, NNNS arousal and the neurological subscale of the Premie-Neuro were significantly associated with cognitive outcome. This very early timepoint of assessment therefore offers a critical new window for early interventions to be implemented and evaluated.

Our finding that TEA HNNE total score was as strongly associated with cognitive outcome as the 3-month GMs is an important finding. The HNNE is a simple, clinically accessible tool that can be used anytime from birth until 2 months post term age in very preterm born infants.21 Together with Early HNNE reflexes being strongly associated with both motor and cognitive outcomes, and Early HNNE abnormal signs being associated with motor outcomes, inclusion of this tool in clinical care of these high-risk infants is warranted. Cognitive outcome is often overlooked with a tendency to focus on motor outcomes of preterm survivors. A systematic review and meta-analysis of >64,000 preterm infants found that while risk for adverse cognitive outcomes was highest for infants with the lowest gestational ages, children with gestational ages at birth of 28–34 weeks performed almost as poorly as those born <28 weeks.1 Traditional follow-up and early intervention programs that focus predominantly on extremely or very preterm infants are therefore likely missing many children at risk of adverse cognitive outcomes. Targeted early interventions need to incorporate and evaluate cognitive development strategies.35 The fact that the HNNE can be completed in the neonatal intensive care unit prior to discharge can assist clinicians in planning future screening, enabling targeted surveillance and service requirements for the infant and family before they return home.

Our cohort had 4 children out of 98 with an outcome of CP and it is tempting to consider this a low-risk cohort. This prevalence of 4% is however aligned with other contemporary cohorts with documented reductions in CP prevalence compared to earlier cohorts.37,38 A prevalence rate of 3.8% in extremely low birthweight infants (<1000 g) and 3.6% in very low birthweight infants (1000–1499 g) has been reported from data of 20 population-based CP registers of infants born in 2003.37 Outcomes on the Bayley III motor composite score in the present study were a mean of 96 and SD 15, similar to the standardised Mean of 100 and SD 15.31 There has however been criticism of the Bayley III in underestimating adverse motor outcomes.39,40 Data of healthy term born infants in Australia, assessed at 2 years, reported a mean motor composite score of 118 and a standard deviation of 17.39 Our preterm cohort therefore scored lower by 1 SD on average when compared to typically developing children in Australia and is in line with other cohorts of very preterm infants assessed at 2 years CA.39 These data confirm that our cohort is representative of contemporary populations of very preterm born infants.

Strengths of the current study include the large sample of Early MRI data with concurrent clinical data of infants born very preterm in a contemporaneous study cohort with masked analysis of both clinical and MRI assessments and good participant retention at 2 years CA (>80%). MRI was acquired at 3T and we chose a clinically accessible scoring system for structural MRI rather than more complex volumetric or diffusion analyses. Limitations exist for multivariable modelling. Different statistical methodologies produce slightly different models, and it is a great challenge to decide which methodology truly produces the ‘correct’ or ‘best’ model. There is a tendency to give abundant attention to the measures that are included in the final model, and disregard all others. To mitigate this, we have presented the results in multiple ways (univariable, adjusted for 3-month GMs and full multivariable models). The variables that narrowly missed out inclusion in the full models, all feature in the univariable analyses. We did not correct for multiple comparisons, however set our methodology for the multivariable model building to use p < 0.01 for selection of addition to the model.

Conclusion

Our study findings support previous work highlighting the value of term MRI and 3-month GMs in identifying motor outcomes in very preterm born infants. The clinical implications of our study findings suggest that early and term neurological assessment, in particular the HNNE, adds value in understanding motor and cognitive outcomes. These findings lay the groundwork for our PPREMO toolbox and future work will examine predictive accuracy of each of the biomarkers and also include our advanced diffusion imaging biomarkers. Until diagnostic accuracy analyses definitively compare the predictive accuracy of early MRI compared with TEA, TEA MRI remains recommended over early MRI for prediction of longer term outcomes.