Main

Newborn screening for congenital hypothyroidism (CH) and early substitution treatment has eliminated mental retardation in children with CH (1). In the late 1970s and early 1980s, children with CH were treated with a levothyroxine (L-T4) dose of 5–10 μg/kg/d and with an age at onset of treatment later than 14 d. Intellectual outcome of these children was significantly lower than of healthy controls (24). Thus, concerns were raised about the adequacy of treatment as well as the importance of earlier (1) and higher initial thyroid replacement therapy (5) was stressed. Furthermore, the severity of CH was recognized as an important predictor of adverse outcome (6). As a result, treatment guidelines included earlier treatment initiation (within 14 d) and higher initial doses (10–15 μg/kg/d) with the goal to improve outcome, in particular, for those children with severe CH (7,8).

Several recent studies have examined the long-term intellectual outcome after CH. They reported persistent intellectual deficits for adolescents and young adults (913). However, these patients were not treated according to the treatment guidelines issued in the 1990s (7,8). It remains unclear whether children treated according to these guidelines remain at risk for long-term intellectual deficits and to what extent disease severity and treatment course are related to long-term outcome.

When newborn screening for CH was introduced in Switzerland in the 1970s, early high-dose substitution treatment in accordance with the guidelines issued in the 1990s (7,8) was already implemented at the University Children's Hospital Zurich. Therefore, we have the unique opportunity to report long-term intellectual outcome and its predictors in a cohort of prospectively enrolled children with CH with early high-dose initial treatment.

METHODS

Design.

Screening for CH was introduced in Switzerland in the mid-1970s (14,15). Since then, the standard therapy has been early high-dose substitution with L-T4. To investigate the long-term effects of this treatment strategy, a regional cohort of children diagnosed with CH living in the greater area of Zurich was prospectively enrolled and followed until adolescence. The study was approved by the Hospital internal review board and the ethic committee of the canton Zurich and informed consent was obtained by the parents.

Neonatal screening and therapy.

In the Swiss neonatal screening program, thyroid-stimulating hormone (TSH) is measured in blood spots dried on filter paper taken by heel puncture at days 3–4 after birth. If the TSH value is higher than 50 mU/L blood, therapy is started immediately after taking a second venous blood sample of both child and mother to confirm the diagnosis. Therapy is started without etiological diagnosis. Hormonal substitution usually consists of an oral dose of 50 μg LT-4 per day administered as tablets irrespective of weight. This is equivalent to an initial dose of 10–15 μg/kg/d. In the studied cohort, only one child had a TSH level lower than 50 mU/L blood (30 mU/L). In this child, treatment was initiated on day 6 after birth with 25 μg L-T4 until the diagnosis was confirmed with the second venous blood sample (drawn on day 6) and then elevated to 50 μg L-T4 per day.

The therapeutic regimen was regularly monitored by pediatricians in private practice, initially every 2–4 wk, then every 3–6 mo, by capillary blood on filter paper. However, no predefined protocol for these intervals was established. Dosage adjustments were made by the treating pediatrician after consulting with the team of the Division of Endocrinology and Diabetology of the University Children's Hospital. Dosages of L-T4, thyroxine (T4), and TSH values were recorded on a synoptic card. In addition, regular endocrinological examinations by venous blood samples were performed at the Division of Endocrinology of the University Children's Hospital Zurich at ages 1, 2, 4, 7, 10, 12, 14, and 18 y. TSH values served to adjust dosage and to evaluate compliance and adequate treatment regimen.

Population.

The study initially included 89 prospectively enrolled children with CH, detected by neonatal screening. The children were born between January 1978 and January 1991 and living in the area of Zurich. One child died at the age of 3 mo because of a congenital echovirus infection leading to severe bronchopneumopathy. Five children could not be enrolled because the parents declined participation. Another one was excluded because of pseudohypoparathyroidism and dysmorphic features suspecting fetal alcohol syndrome. Eleven children did not return for the 14-y examination, although they all had completed the neurodevelopmental assessment at 7 y. They did not differ with respect to intelligence quotient (IQ) level, birth weight, pretreatment T4 and TSH, age at onset of treatment, and initial dose of L-T4 compared with those who were followed until 14 y. A total of 71 (80%) adolescents returned for the 14-y examination. Seven children were excluded because the start of therapy was later than 20 d and they did not fulfill including criteria of early treatment. Six of those children had a dysgenesis with age at onset of treatment between 40 and 170 d. One child had an athyreosis, but substitution therapy started only at 90 d because the child's blood sample was mistaken for that of a healthy child. One additional adolescent could not complete the intelligence test because of foreign language. Thus, 63 adolescents (49 girls, 14 boys) were included in the final analysis. They were compared with a healthy control group of 175 (86 girls, 89 boys) 14-y-old adolescents. Controls were selected from the Zurich Generational Study (16). They were born at term and had no neonatal risk factors for adverse outcome. All were prospectively followed from birth until adolescence. The response rate in the Zurich Generational Study was 90% for the 14-y follow-up. They were of the same geographic area and were tested at the same age intervals as the CH children. In addition, the developmental pediatricians who examined the CH children performed the neurodevelopmental examinations of the controls.

In both cohorts, socioeconomic status (SES) was estimated based on paternal occupation and maternal education (17). The scores ranged from 2 (lowest SES) to 12 (highest SES). Mean SES was comparable between the two groups (CH group 7.8 versus control group 8.3; Mann–Whitney U test, p = 0.07).

Neurodevelopmental assessment.

At age 14 y (range 13–14.5 y), CH adolescents and controls were examined with the German version of the Wechsler Intelligence Scale for Children-Revised (18). This test consists of verbal IQ (VIQ) and performance IQ (PIQ), which form the full-scale IQ (FSIQ). The examinations were performed by experienced developmental pediatricians from the Child Development Center at the same day as the endocrinological evaluation. Examiners were not involved in the endocrinological follow-up and thus were independent. The pediatricians were aware of the diagnosis, but not of current endocrinological values.

Etiology, neonatal and pretreatment endocrinological variables.

CH children were grouped into three etiological categories on the basis of scintigraphic and sonographic findings carried out during childhood: dysgenesis (ectopy or hypoplasia), athyreosis, and dyshormonogenesis.

Table 1 shows the demographic, neonatal and pretreatment endocrinological variables of the children with CH. TSH and T4 values are pretreatment values from a venous sample taken for confirmation of the screening result (usually 1–2 d after CH screening). In children with a dysgenesis, 31 had an ectopy and two had a hypoplasia. The children with a dysgenesis had significantly higher T4 levels and lower TSH levels than children with an athyreosis. Substitution therapy with L-T4 was started for all CH children at a median age of 9 d with a median initial dose of 14.7 μg/kg/d.

Table 1 Demographic, neonatal and pretreatment endocrinological variables of children with CH stratified for etiology

Treatment.

Treatment was monitored throughout the childhood. The following variables were obtained: the total number of blood tests received by the treating pediatricians (capillary blood on filter paper), median T4 and TSH values per each child per time period, which was further summarized into an overall median T4 and TSH value per age period, the percentage of T4 and TSH values outside the normal range (65–165 nmol/L for T4 and 1–3.7 mU/L for TSH) (Table 2), the L-T4 dosage per weight (Fig. 1), and free venous blood T4 at 14 y (reference values for free T4: 10.3–29.7 pmol/L). Furthermore, the percentage of T4 and TSH values outside the new recommendations (19) were also calculated (130–206 nmol/L for T4 and 0.5–2.0 mU/L for TSH) (Table 2 and Fig. 1). Defined age periods were formed (2 mo to 1 y, 1–4, 4–7, and 7–14 y).

Table 2 Treatment variables for specific age periods
Figure 1
figure 1

Box plots for L-T4 per weight, TSH (log transformation), and T4 values per age periods. Box length: interquartile range; width: correlates with number of observations; Box line: median; upper/lower extreme: 1.5 × interquartile range; discontinuous horizontal lines: T4/TSH reference levels (T4 < 65 nmol/L; TSH > 3.7 mU/L) and treatment recommendations (T4 130–206 nmol/L; TSH 0.5–2.0 mU/L) (19).

Compliance or inadequate treatment may be of concern in the treatment of CH. Thus, the individual T4 and TSH courses as a function of age were calculated. The assumption was that large fluctuations of T4 and TSH suggest poor compliance or inadequacy of therapy. According to the new recommendations (19), we defined four episodes or more of TSH levels >5 mU/L as an indicator of poor compliance.

Statistics.

Data were analyzed with S-PLUS 7 for Windows (Insightful Corporation, Seattle, WA). Significance level was defined as ≤0.05. Because most variables were non-normally distributed, nonparametric descriptive analysis was performed, except for IQ, where parametric tests were used. Differences between groups were analyzed with the Mann–Whitney U test or with χ2 analysis. Because of the increase of IQ scores in the population over the past decades [Flynn effect (20)], the control group was used as the reference group to express abnormal intellectual performance (1 or 2 SD below the mean of the control group).

To determine the independent effect of CH on intellectual outcome, an analysis of covariance was performed, including the variables “group” (CH versus control), gender, and SES. Subgroup analysis for etiology (dysgenesis and athyreosis) compared with controls was carried out with multiple comparisons after Tukey. Because the number of children with a dyshormonogenesis was small (n = 4), they were excluded from this analysis. Potential interactions were explored. Because the interaction was significant for PIQ, mean IQ differences were expressed for a mean SES of 8.2.

The influence of pretreatment and treatment variables on intellectual outcome at 14 y was analyzed also using analysis of covariance. IQ at 14 y was the dependent variable, SES and gender were covariates for which each analysis was corrected. Standardized regression coefficients are presented to express the independent effect of the analyzed variables (Tables 4 and 5).

Table 4 Standardized regression coefficients for pretreatment and treatment variables in relation to intellectual outcome at 14 y adjusted for socioeconomic status and gender
Table 5 Standardized regression coefficients for the treatment variables during specific age periods in relation to intellectual outcome at 14 y adjusted for socioeconomic status and gender

RESULTS

Intellectual outcome at age 14 y.

In the CH group, mean unadjusted IQs were within the normal range, but lower than in the control group (Table 3). After adjusting for SES and gender, the mean difference was 9.7 IQ scores for the FSIQ. The athyreosis group had the lowest IQ scores compared with controls. Mean differences between the dysgenesis and the control group were significant, but smaller. Differences between the athyreosis group and the dysgenesis group were only significant for the PIQ. FSIQ in the dyshormogenesis group (n = 4) ranged from 85 to 119.

Table 3 Intellectual outcome (WISC-R) adjusted for socioeconomic status and gender

We used the control group as the reference for intellectual performance below the norm. In the CH group, 52% (n = 33/63) compared with 14% (n = 25/175) in the control group (χ2; p < 0.0001, χ2 = 35.9) had IQ scores lower than 1 SD of the mean. Moreover, 21% (n = 13/63) in the CH group compared with 4% (n = 7/175) in the control group had IQ scores lower than 2 SD below the mean (χ2; p = 0.0001, χ2 = 14.6). The majority (n = 9, 70%) of CH children who scored more than 2 SD below the control mean were in need of either special educational services during regular school or were in special schools.

We found an interaction between group (CH versus control) and SES for IQ; however, this was only significant (p = 0.03) for the PIQ (Fig. 2). The interaction indicates a steeper IQ decline with decreasing social class for the CH group than for the control group (the regression lines significantly differed in their slopes: 0.88 for the controls and 3.25 for the CH group, respectively; r2 = 0.189). There was a trend toward stronger interaction in the athyreosis group (slope 4.3, SE 1.7) compared with the dysgenesis group (slope 1.6, SE 1.4). No interaction was found between group (CH versus control) and gender or between gender and SES.

Figure 2
figure 2

Relation between performance IQ (PIQ) and socioeconomic status (SES) for congenital hypothyroidism (CH) and control group. The triangles and circles are group averages of PIQ (CH versus controls) for each SES score. Continuous line represents the regression line for the control group, dashed line for the CH group. Significant interaction (p = 0.03) between group (CH versus control) and SES.

In our cohort, seven children had initial L-T4 doses higher than 18 μg/kg/d, three with an athyreosis and four with a dysgenesis. Intellectual outcome of these children was not different from those with initial doses below 18 μg/kg/d (FSIQ Mann–Whitney p = 0.9).

Neonatal and pretreatment endocrinological variables.

Only pretreatment T4 levels were significantly correlated with FSIQ and PIQ, but not with VIQ after adjusting for SES and gender (Table 4). We could not detect any threshold effect of pretreatment T4 on IQ, neither at 42.8 nmol/L (21) (FSIQ Mann–Whitney p = 0.2), nor with other pretreatment T4-values (data not shown).

Pretreatment TSH levels, age at onset of therapy, initial dose of L-T4, and birth weight were not significantly correlated with IQ (Table 4).

Treatment.

The majority of endocrinological T4 and TSH values were within the ranges of the new recommendations, in particular in the first two age periods (2 mo to 1 y and 1–4 y). In addition, the number of children with equal or more than four episodes of TSH values >5 mU/L was small (Table 2). Endocrinological variables were not related to IQ at 14 y (Table 5). In addition, under- or over-treatment (see Table 2) was not related to low IQ at 14 y. The L-T4 dosage per weight was not correlated with IQ at 14 y (Table 4). At 14 y, median free T4 was 20.4 pmol/L (range 6.3–46.8 pmol/L) and median TSH level was 2.05 mU/L (range 0.003–197mU/L). Both parameters were not related to IQ at 14 y (Table 4).

Furthermore, none of the endocrinological variables throughout childhood significantly correlated with SES (data not shown). Thirteen children in the CH group had four or more consecutive episodes of insufficiently suppressed TSH (>5 mU/L) (19). Their mean IQ at 14 y was not significantly different from the mean IQ of CH children whose TSH values were lower. Children who had endocrinological values outside the newly recommended guidelines (T4 <130 nmol/L and TSH >5 mU/L) had similar mean IQs at 14 y compared with children whose levels were within that range.

DISCUSSION

Intellectual outcome at age 14 y.

We evaluated intellectual outcome of children with CH after early high-dose treatment. Mean IQ at 14 y was in the normal range, but significantly lower than in controls with a mean difference of almost 10 IQ scores after adjusting for SES and gender. Adolescents with an athyreosis were particularly affected. Accordingly, a high proportion (21%) of children with CH manifested significant intellectual deficits (IQ < −2SD below mean of control group). Most of these children were in need of special educational services during regular school or in special schools. Our findings are in agreement with a study using a comparable treatment strategy (initial dose 10–15 μg/kg/d, onset of treatment 8–9 d) where IQ scores were reported to lie in the normal range for children at ages 10–16 y (22). However, the results of the above-mentioned study were not compared with a control group. Interestingly, two studies on intellectual outcome in adolescence after a late and/or low-dose initial treatment reported comparable intellectual outcome to our findings (FSIQ 102.4 and 101.6, respectively) (11,12). Another study reported considerably lower mean IQ levels both in the CH and in the sibling control group (FSIQ 88.4 for CH group, 91.8 for controls, respectively) with similar initial treatment (13). IQ deficits in all three studies ranged between 6 and 9 IQ scores. We are aware that VIQ and PIQ are summary scores of multiple abilities in two major functional domains and that subtest results are not presented here.

In conclusion, despite early high initial treatment according to the guidelines issued in the early 1990s, intellectual deficits compared with controls persist. Thus, other factors potentially affecting intellectual performance must be taken into consideration.

Socioeconomic status.

In our study, SES was an independent predictor of intellectual outcome. This finding is a well-known phenomenon in normal and at-risk populations and has also been reported for children with CH (10,21,22). We found an interaction between group and intellectual performance for SES. It was a general trend, however, only significant for the PIQ: IQ of children with CH became disproportionately lower with decreasing SES than in controls. Two hypotheses have to be considered for this finding. First, children from families with lower SES may not be sufficiently supplemented because of poorer compliance. However, we found no association between SES and any of the endocrinological parameters measured throughout childhood, rendering this hypothesis unlikely. Another explanation could be that children from lower SES are more vulnerable because of suboptimal environmental input than children from a more stimulating social environment. This view is supported by studies on early intervention programs for children at risk, demonstrating that intensive programs may be particularly effective in low-income families (23). We only found a significant interaction for the PIQ and not for the VIQ. Because PIQ is particularly affected in children with athyreosis, which may reflect a specific injury pattern, it is conceivable that also the interaction between SES and group is stronger within PIQ compared with VIQ.

Etiology, pretreatment endocrinological and neonatal variables.

We found that initial disease severity was the only predictor of later intellectual outcome. Children with athyreosis had lower IQ than those with dysgenesis. This is in line with earlier studies (10,13). In addition, among all neonatal variables, only pretreatment T4 levels were associated with outcome. In contrast, other studies using similar treatment regimens reported no differences between etiology (22) or severity groups (24,25). However, comparability to our findings is limited because of differing study designs. In addition, study samples are small in some of these studies (24,25). Our finding supports the notion that cerebral injury occurs prenatally.

The correlation between pretreatment T4 and outcome was strongest for the PIQ. We think that this finding may be related to the visuomotor deficits reported for children with CH (26,27). This explanation is in line with our findings where children with an athyreosis presented particular deficits in the PIQ. This may be due to differential effects of prenatal thyroid hormone loss affecting selective brain regions at different times in development.

Other variables, such as age at onset of treatment, initial dose of L-T4, pretreatment TSH, and birth weight, were not related to intellectual outcome at 14 y. This is in accordance with the findings of other studies (22,28). This lack of association can be attributed to the early high-dose treatment regimen, which may eliminate the effects of initial dose or onset of treatment as reported by studies using either low dose (<10 μg/kg/d) or late onset of treatment or both (11,29,30).

Treatment.

Recently, emphasis has been placed on postinitial treatment regimen. A recent study reported that mild overtreatment (TSH ≤0.5 mU/L) was associated with better IQ in school-aged children (26). New recommendations have been published (19) stressing a narrower treatment range in early childhood, which includes keeping the total T4 in the upper half of the reference range (130–206 nmol/L) and maintaining low TSH levels (0.5–2.0 mU/L). Interestingly, we could not detect an association between median endocrinological levels throughout childhood or between undertreatment as well as over-treatment and intellectual impairment. This may be explained by the high proportion of endocrinological values that were within the newly recommended treatment range.

In contrast to our findings, some studies have reported an association between endocrinological parameters throughout childhood and intellectual outcome (10,13) and language performance (11). However, in these studies initial treatment regimen consisted of late and low-dose substitution therapy (10,11,13). This suggests that cerebral injury occurs not only prenatally, but may continue postnatally if initial therapy regimen consists of late and low-dose treatment. Optimal substitution treatment during early childhood may ameliorate this effect.

We could show that early high-dose treatment and optimal substitution treatment throughout childhood cannot fully normalize intellectual performance in children with CH. Thus, the question arises whether substitution therapy throughout childhood should even be further elevated. Fisher et al.(31) suggested that in adults higher T4 replacement therapy for hypothyroidism is needed than in the normal euthyroid state to maintain normal serum TSH concentrations. However, an increase of the substitution doses may raise concerns about potential adverse effects, especially on behavior and attention (12).

Strengths and limitations.

The strength of this study was the introduction of early high-dose initial treatment from the beginning of CH screening in the mid-1970s. Outcome was compared with an optimal control group. It is important to note that intellectual outcome must be compared with control children, because test references are lower than those of the classmates because of a steady IQ increase over the years [Flynn effect (20)]. Some studies have used bone age to classify disease severity (10,24). In our study, bone age at birth could not be assessed because of incomplete data. A recent study has shown that TSH normalization time also impacts on neurodevelopmental outcome (32). We could not examine this question because TSH and T4 values were not assessed according to a predefined protocol in the first few weeks after the beginning of the substitution treatment. In addition, behavior was not assessed in a systematic way and thus we were not able to examine the effects of early substitution treatment and endocrinological variables throughout childhood on behavior. In addition, the summary scores of the WISC-R exclude relevant abilities, such as attention and memory, which are sensitive to thyroid hormone levels (33).

We conclude that despite early high initial treatment and optimal substitution during childhood, a significant proportion of children with CH manifest intellectual deficits in adolescence, in particular those with an athyreosis. Thus, this ideal treatment regimen is still not sufficient to fully overcome the cerebral injury caused by CH before onset of therapy and to complete normalization of hormone lines. Nevertheless, initial early high-dose therapy must constitute the first component of an adequate substitution therapy. In addition, more emphasis should be placed on the postinitial treatment strategy achieving endocrinological values throughout childhood according to new recommendations (19). SES is an additional risk factor for adverse outcome. Intellectual functioning of children with CH from low-income families is disproportionately affected. Thus, close surveillance, early detection of developmental delay, and intervention programs are particularly important for this specific subgroup of children with CH.