Introduction

Transient hypothyroxinaemia of prematurity (THOP), characterised by low thyroxine (T4) levels with normal or low thyroid stimulating hormone levels, occurs in more than half of the extremely low birth weight infants (ELBWIs) born at <30 weeks1,2. The low T4 levels reach a nadir at 7–10 days postnatal and could remain low for the first 3–6 weeks of the postnatal period, depending on the extent of prematurity3,4. Although the aetiologies of THOP are multifactorial, including the loss of maternal placental transfer of T4 after birth, immaturity of the hypothalamus-pituitary-thyroid axis, limited thyroid capacity to increase synthesis, and metabolism and drugs affecting thyroid function5, recent studies have demonstrated that THOP is associated with increases in perinatal mortality and morbidity6; prolonged mechanical ventilation7; and non-thyroidal illnesses such as respiratory distress syndrome (RDS)3,8, intraventricular haemorrhage (IVH)9,10, and cerebral white matter damage11. Furthermore, in concordance with the adult data on non-thyroidal illness, the illness severity in premature infants seemed to be an important determinant of low serum T4 levels12,13. Collectively, these findings suggest that THOP might just be the epiphenomenon reflecting the severity of the non-thyroidal illness in ELBWIs.

Recently, we reported improved survival rate and increased survival rates without composite morbidities such as bronchopulmonary dysplasia (BPD), severe IVH, necrotizing enterocolitis (NEC), and retinopathy of prematurity (ROP) with better perinatal and neonatal intensive care for the ELBWIs14,15,16. However, the temporal alterations in the incidence of THOP associated with the recently improved survival rates without composite morbidities of ELBWIs have not been studied yet. Furthermore, although prediction of mortality and morbidities in ELBWIs prior to discharge is of the utmost clinical importance, the predictability of THOP severity status for mortality, morbidities and survival without composite morbidities has not been tested yet. Therefore, in the present study, we investigated whether survival without composite morbidities was associated with the altered incidence of THOP, and if applicable, the potential of THOP severity to predict outcome for ELBWIs. Specifically, we compared the predictive power of the initial T4 levels for estimating mortality, composite morbidities, and survival without major morbidities in ELBWIs with that of the Apgar score at 5 min and clinical risk index for babies II (CRIB II) score. The Apgar and CRIB II scores were found to be good predictors of mortality in our previous study17.

Results

Clinical characteristics

Table 1 shows the demographic and clinical findings of the enrolled infants in each subgroup and study period. The total GA and birth weight were significantly lower and antenatal steroid use was significantly higher, especially in the infants at 23–24 weeks’ gestation, during period II than in period I. The total Apgar scores at 1 min and 5 min of infants with a gestational age of 23–28 weeks were significantly higher during period II than during period I. Other variables including sex, SGA, and caesarean section did not differ significantly between infants enrolled in the 2 study periods.

Table 1 Clinical characteristics of enrolled infants in each subgroup and study period.

Changes in mortality, morbidities, and survival without major morbidity rates

Table 2 demonstrates the clinical outcomes for enrolled infants in each subgroup and study period. While the total mortality rate did not differ significantly during the study periods, the total BPD rate was significantly lower, the IVH rate was significantly higher and the survival without composite morbidities rate in the infants at 25–26 and 27–28 weeks’ gestation was significantly higher during period II than during period I.

Table 2 Clinical outcomes of enrolled infants in each subgroup and study period.

Changes in the initial thyroxine levels

Table 3 shows the alterations in the total and fT4 levels between enrolled infants in each subgroup and study period. While the incidence of THOP in each subgroup did not differ significantly between the 2 study periods, the incidence of normal initial thyroxine levels (T4 ≥ 4.5 ng/dl and/or fT4 ≥ 0.9 ng/dl) in the infants at 25–26 weeks’ and 27–28 weeks’ gestation during period II was significantly higher than during period I.

Table 3 Initial total and/or free thyroxine levels between enrolled infants in each subgroup and study period.

Clinical outcomes according to the initial thyroxine level

The clinical outcomes of the infants were dependent on the initial T4 levels, with the highest mortality and composite morbidities rates and lowest survival without morbidities rates observed when initial T4 levels of <2.5 ng/dl and/or f T4 < 0.5 ng/dl, and the lowest mortality and composite morbidities rates and highest survival without morbidities rates observed when the initial T4 levels were ≥4.5 ng/dl and/or fT4 ≥ 0.9 ng/dl (Table 4). In binary logistic regression analyses with adjustment of gestational age, birth weight, sex and antenatal steroid use, the odds ratios of composite morbidities including BPD (≥moderate) and IVH (Gr ≥ 3) were significantly increased, and the odds ratio of without composite morbidities was significantly reduced in initial T4 levels of <4.5 ng/dl and/or f T4 < 0.9 ng/dl, but the odds ratios of mortality and ROP requiring laser therapy was significantly increased only in initial T4 levels of <2.5 ng/dl and/or f T4 < 0.5 ng/dl (Table 5).

Table 4 Clinical outcomes according to the initial total and/or free thyroxine levels
Table 5 Results of binary logistic regressions on clinical outcomes according to the initial total and/or free thyroxine levels.

ROC curves of variables for predicting mortality, composite morbidities, and intact survival

Figure 1 and Table 6 demonstrate the ROC curves of the initial T4 level, Apgar, and CRIB II scores for predicting mortality, composite morbidities, and survival without morbidities, respectively. The initial T4 level was a better predictor for mortality (cut-off value 2.55, AUC 0.769, sensitivity 76%, specificity 70%), composite morbidities (cut-off value 3.62, AUC 0.721, sensitivity 43%, specificity 83%), and survival without morbidities (cut-off value 3.62, AUC 0.727, sensitivity 16%, specificity 55%) than the CRIB II and Apgar scores.

Figure 1
figure 1

ROC curves of the initial thyroxine level, Apgar score at 5 min, and clinical risk index for babies II score for predicting mortality (a), composite morbidity (b), and intact survival (c).

Table 6 Screening analysis and values of the area under the ROC curve of initial thyroxine level, Apgar score at 5 min and clinical risk index for babies II score for predicting mortality, composite morbidity, and intact survival.

Discussion

THOP has been known to be associated with increases in perinatal mortality and morbidities10,12,18. Recently, we reported significantly improved mortality and morbidities in extremely preterm (EPT) infants with better perinatal and neonatal intensive care14,15,16. In this study, while the total GA and birth weight were significantly lower, the Apgar scores at 1 and 5 min were significantly higher, and the incidence of normal TFTs in ELBWIs at 25–26 and 27–28 weeks’ gestation significantly increased from 22% and 40% during period I to 37% and 61% during period II respectively. This increase was in strict concordance with the significantly increased survival without composite morbidities from 21% and 38% during period I to 38% and 58% during period II, respectively. To the best of our knowledge, this is the first study that studied temporal changes in the incidence of THOP. The change may have been linked to the continuing improvement in perinatal and neonatal intensive care resulting in a better outcome for EPT infants. Collectively, these findings suggest that THOP might be an epiphenomenon of non-thyroidal illness in ELBWIs, and its incidence could thus be reduced by attenuating non-thyroidal illness with better clinical management in ELBWIs.

In the present study, the mortality, the prevalence of morbidities, composite morbidities and survival without morbidities was dependent on the initial T4 level, with the highest mortality and lowest survival without morbidities observed with the lowest T4 levels of less than 2.5 ng/dl. In concordance with our data, several studies have noted that the severity of the non-thyroidal illness is an important determinant of low serum T4 levels in premature infants12,13,19,20,21,22. Overall, these findings suggest that the initial T4 levels reflect the severity of the non-thyroidal illness and thus could predict the outcome in ELBWIs.

Thyroid hormone has been known to play a critical role in perinatal neural development including brain myelination and production and maturation of oligodendrocytes23. Therefore, despite its transient nature and the epiphenomenon of severe illness, hypothyroxinaemia in ELBWIs could be a risk factor by itself for both acute mortality and morbidities10 and later neurodevelopmental deficits in ELBWIs24,25,26. However, a recent study reported that GA, but not THOP, played a key role in neuromotor dysfunction in preterm babies27. Moreover, thyroid hormone treatment did not attenuate the severity of RDS28,29, reduce mortality up to the point of discharge29, or improve long-term neurodevelopmental outcome28,29. Overall, these findings suggest that other factors associated with acute non-thyroidal illness in premature infants, rather than hypothyroxinaemia itself, might play a pivotal role in negatively impacting brain development and subsequent neurodevelopmental outcome. Further studies strictly limiting the study cohort to ELBWIs with THOP would be necessary to determine the potential beneficial effects of T4 supplementation10.

Predicting the outcome of ELBWIs at the highest risk of mortality and morbidities would be very helpful for the difficult decision making regarding treatment30. In our previous study, the Apgar score at 5 min and the lowest serum albumin level were the best predictors of mortality during ≤7th and >7th postnatal days, respectively in ELBWIs17. In this study using the same data set, the initial T4 level showed the best predictability of mortality, composite morbidities, and survival without major morbidities compared with Apgar score at 5 min and the CRIB II score which was shown to be a good predictor for ELBWI mortality in our previous study17 As the initial T4 level reflects the severity of non-thyroidal illness, and is associated with the ensuing mortality and morbidities in ELBWIs in the present and other studies12,13,19,20,21,22, these findings suggest that an increased initial T4 level might best reflect recent improvements in clinical management and the resultant better outcome for ELBWIs, and thus could be most effective for predicting their prognosis compared with the currently used CRIB II and Apgar scores. Accordingly, further studies will be necessary to develop new and effective models for better prediction of prognosis in ELBWIs.

The present study had several limitations including its retrospective and uncontrolled observational study design. We determined mortality and composite morbidities only at hospital discharge. Therefore, it was not apparent whether there was any causal relationship between the initial T4 levels and mortality and/or morbidities in these infants. Another limitation was that as our data were obtained from a single institution, the results obtained in this study might not be generalizable to other institutions. Nonetheless, we used a relatively large sample size of 546 ELBWIs born and admitted to a single centre with similar baseline clinical characteristics. Furthermore, although the consistency of clinical management was not measured, our data showing more antenatal steroid use and significantly higher Apgar scores at 1 and 5 min despite significantly lower GA and birth weight suggest that better perinatal and neonatal intensive care was provided during period II than during period I. Therefore, these findings imply that this single-centre study appropriately determined that the temporal alterations in the incidence and severity of THOP were associated with better clinical treatment and the ensuing improved outcomes in ELBWIs. These factors might also compensate for the limitations of the study.

In summary, the incidence of THOP was significantly reduced in ELBWIs at 25–28 weeks’ gestation during period II compared to period I, in parallel with their significantly higher Apgar scores, decreased morbidities and increased survival without composite morbidities. In ELBWIs, the initial T4 level was the best predictor of mortality, composite morbidities, and survival without composite morbidities in comparison to CRIB II and Apgar scores.

Methods

Ethics statement

The data collection procedure was approved by the Institutional Review Board of Samsung Medical Center (2018-09-024), and the Institutional Review Board waived the requirement for informed consent in this retrospective chart review.

Primary outcome

Medical records of 546 ELBWIs (birth weights < 1,000 g) with gestational ages (GA) between 23 and 28 weeks who were born at and admitted to the Samsung Medical Center neonatal intensive care unit and had undergone initial thyroid function tests (TFTs) within the first 2 postnatal weeks, between January 2000 and July 2013 were retrospectively reviewed. Among them, 156 were born between January 1, 2000 and December 31, 2005 (period I), while 390 were born between January 1, 2006 and July 31, 2013 (period II). We arbitrarily divided the study period according to the survival rate of these ELBWIs as estimated in our previous studies14,15,16,17, and stratified them into the 23–24, 25–26 and 27–28 weeks’ gestation subgroups. We compared maternal and neonatal variables, survival and morbidity rates and incidence of thyroid dysfunction between the infants belonging to the 2 time periods.

Data collection

Clinical characteristics, including GA, birth weight, Apgar scores at 1 and 5 min, sex, delivery mode, small for GA (SGA) (birth weight below the 10th percentile), pregnancy-induced hypertension, gestational diabetes mellitus and antenatal steroid use, were analysed. GA was determined based on the maternal last menstrual period and the modified Ballard test. We analysed outcome measures, including death before discharge, BPD (≥moderate)31, IVH (≥grade 3)32, NEC (≥Bell’s stage 3b)33 and ROP requiring laser treatment.

For TFTs, we defined THOP as a temporarily low initial T4 level of <4.5 ng/dl and/or free T4 (fT4) level of <0.9 ng/dl with a TSH level of <20.0 µIU/ml and considered severe THOP to be having an initial T4 level of <2.5 ng/dl and/or fT4 level of <0.5 ng/dl. TFTs were followed up every 2–6 weeks until hospital discharge.

We calculated the CRIB II score for each infant using the following variables: sex, GA, birth weight, and base excess.

Statistical analysis

Continuous variables were presented as means ± standard deviation and compared using the Student’s t-test or Mann-Whitney U test. Categorical variables were presented as percentages and frequencies and compared using the chi-square or Fisher’s exact test. We carried out binary logistic regression analyses with adjustment of gestational age, birth weight, sex and antenatal steroid use to obtain odds ratios of clinical outcomes according to the initial thyroxine level. The analyses of specificity and sensitivity of the Apgar score at 5 min, CRIB II score and initial T4 level were performed through receiver operating characteristic (ROC) curves, and the area under the curve (AUC) was conducted in order to define the differences with statistical significance in predicting mortality rate with selection of the most suitable cut-off point. SPSS version 19.09 (SPSS Inc., Chicago, IL, USA) was used for all statistical analyses, and a P < 0.05 was considered statistically significant.