Introduction

Childhood family background strongly influences the formation of adult social position,1 and education plays the most important mediating role when parental social resources are transmitted to the next generation.2 However, the effect of the childhood environment on social position can already have prenatal origins. Birth weight (BW), the most common indicator of the prenatal environment, is positively associated with educational performance in childhood3 and earnings in adulthood.4 Cognitive ability is most likely an important contributor to these associations since low BW is associated with lower IQ from childhood to adulthood5 and even with a smaller brain volume when compared to average BW children.6 However, the neurocognitive consequences of BW are not limited only to low BW since the association between BW and IQ both in childhood7 and adulthood8 can also be found across the total variation of BW. This suggests that BW can provide information on the prenatal environment relevant for further cognitive development across the full range of BW values. This interpretation is supported by studies showing that genetic variants of BW are not associated with IQ9 or academic performance10 in Mendelian randomization studies, thus suggesting that these associations are not causal but rather reflect the role of the prenatal environment.

When evaluating the role of BW as an indicator of the prenatal environment, it needs to be recognized that BW can reflect prenatal risk factors differently according to the social characteristics of the family. The variation of birth size is affected by both genetic and environmental factors.11 It is thus possible that BW more robustly reflects the intrauterine environment in families with low socio-economic position (SEP), whereas in families with higher SEP, it may be more affected by genetic factors not associated with cognitive development of offspring. It is also noteworthy that BW is a combination of birth length (BL) and body fatness, typically measured in neonates using the ponderal index, which can differently reflect the intrauterine environment. BL is not as widely used as BW in large-scale studies, but there is still convincing evidence that BL is positively associated with neurocognitive outcomes.12 Studies analyzing birth ponderal index (BPI) and later cognitive performance are rare. However, there is evidence that high maternal pre-pregnancy body mass index (BMI) is associated with increased BW and body fatness13 as well as delayed neurobehavioral development of offspring.14 BPI is also positively associated with adult BMI,15 which further shows a negative association with IQ in early adulthood.12 Thus, we can speculate that BL and BPI are associated with later cognitive performance in opposite directions, which can explain the leveling off or even decrease of the association between BW and cognitive performance at the highest level of BW.16 All these associations may also differ according to family SEP.

Since studies analyzing how different indicators of birth size are associated with school performance according to family SEP are rare, we studied this issue using a large population-based longitudinal cohort of children. We used three indicators of body size at birth (BW, BL and BPI) and three family SEP indicators (maternal education, paternal education and household income). We present the following study hypotheses: (i) BL shows a positive and BPI a negative association with school performance, contributing to an expected curvilinear association between BW and school performance; (ii) adjusting the results for other risk factors measured directly or indirectly by comparing full siblings and twins diminishes these associations; and (iii) these associations are stronger in low SEP families as compared to high SEP families, indicating a stronger influence of the prenatal environment.

Data and methods

Supplementary fig. 1 presents the flow diagram of our study cohort. Our baseline data included all children born in Finland during the years 1987–1997 (N = 692,058). BW (kg) and BL (cm), as well as Apgar score at 1 min after the delivery, were measured in the hospital and made available for us through the National Medical Birth Register covering all live births in Finland. Based on this register, we also obtained information on gestational age, previous pregnancies and maternally reported smoking during the pregnancy (did not smoke, quit smoking during the first trimester, smoked and missing). We excluded children having missing information on BW or BL (N = 7017) or other birth-related measures (N = 2649) from the analyses. We used BPI calculated by dividing BW in kilograms (kg) by the cube of body length in meters (m3) as a measure of relative weight. Information on maternal and paternal education (basic, secondary and tertiary), the mother’s birth year and child’s birth year was obtained from the National Population Register. Household income after tax based on information on different income sources of all household members was obtained from the tax officials and classified in yearly quintiles. Children without information on family SEP were excluded (N = 3186).

In Finland, all children start the compulsory 9-year primary school at the age of 7, and thus are at the age of 16 at the end of primary school. The school grade in the final class of primary school was available from the Admission Register for Higher Education for all children who applied to secondary education. We used the mean of grades of all school subjects, which can range from 4 to 10. Children who did not start secondary education, had not finished primary school or died before their 16th birthday were excluded from the data (N = 32,781). All registers were linked using unique personal identification numbers by Statistics Finland and then delivered to the research group after removing personal identification codes. Our final study cohort included 646,425 children (49% girls; 93% of the original cohort). However, we had outliers for BL (<40 cm or >60 cm; N = 2098) and BPI (<20 kg/m3 or >40 kg/m3; N = 1049) which were removed from these analyses.

First, we analyzed the associations between birth size and school performance at the individual level using population-averaged regression models. In these analyses, we removed 16,161 twins or higher order multiples since they had, on average, lower BW, BL and BPI, but better school performance than singletons (Supplementary Table 1). We started these analyses by testing the linearity of these associations by fitting a squared term for each birth size indicator into a model that already included a linear term. For BW, the squared term was statistically significant for boys and girls (p < 0.00001). This was because the association leveled off at the higher end of BW distribution (Supplementary Table 2). For BL, the squared term was not statistically significant in boys (p = 0.6936) or girls (p = 0.6265), whereas for BPI, the squared term was statistically significant for girls (p = 0.0211) but not for boys (p = 0.1625). However, since even for girls the association did not strongly diverge from a linear association (Supplementary Table 2 and visual inspection) and the squared term was statistically significant mainly because of our very large sample size, we used only linear terms for BL and BPI for both sexes in the further analyses. In our statistical models, we standardized the indicators of birth size (mean of 0 and standard deviation of 1) and added the number 10 to guarantee that all values were positive, allowing us to correctly estimate the quadratic effects. Because of the strong correlations between BL, BW and BPI, we estimated all models separately for these indicators to avoid multicollinearity.

We started the individual-level analyses by estimating the associations between the three birth size indicators and school performance and studying how these associations were attenuated by other birth- and family-related factors. We adjusted the models firstly for demographic factors (birth year, maternal age at the time of delivery and parity), secondly for gestational-related risk factors (maternal smoking during pregnancy, gestational age, square of gestational age and Apgar score), and finally for the family SEP (maternal and paternal education and household income). We then continued these analyses by studying whether the associations between the birth size indicators and school performance differed by family SEP by stratifying the models by each indicator of family SEP. In these interaction analyses, we used the original classification for maternal and paternal education, but for family income we pooled quintiles 2–4 as the middle category.

After the individual-level analyses, we conducted analyses within full same-sex sibships and co-twins. Previous research has demonstrated that both family environment and genetic factors have an influence on education.17 Since full-siblings share their childhood family and 50% of genetic variation, using within-family methods we can control for the influence of early family environment and also genetic influences, partially. We assumed that if the family environment or genetic factors affect the association between birth size and school performance, the associations should be reduced or disappear in these within-family analyses. Co-twins optimally share their family postnatal environment, and further monozygotic twins are genetically identical. There can also be considerable variation in the prenatal environment between twin siblings in terms of sharing or not sharing the placenta or amniotic sac.18 However, because only register-based information was available, we could not distinguish between monozygotic and same-sex dizygotic twins. Together, we had 55,063 families with at least two brothers and 51,915 families with at least two sisters. For the twin-pair analyses, we had 2352 male and 2271 female complete same-sex pairs. These within- family analyses were conducted using fixed-effects linear regression models. In practice, this model creates a dummy variable for each family, thus removing all between-family variation from the associations between birth size and school performance.19 We adjusted these models first for demographic factors and then gestational-related risk factors to test whether within-family variation in them can modify the associations found. All analyses were conducted by the Stata/MP 17.0 for Windows statistical software (StataCorp, College Station, TX). Since the nonlinear associations are difficult to interpret only based on parameter estimates (the linear term and the squared term), we calculated marginal effects based on estimated parameters for the most important BW results and presented them as figures.

The study has been approved by Statistics Finland Board of Statistical Ethics (TK-53–1490–18) and the Social and Health Data Authority Findata (THL/2180/14.02.00/2020), which deemed exempt from informed consent when awarding the permission to use the data. The register data were originally collected for administrative and statistical purposes. The legal basis for processing this kind of personal information is scientific research as stated in the Finnish Personal Data Act (523/1999) and the EU General Data Protection Regulation. Use of the data was regulated and permitted by the Act on Secondary Use of Social and Healthcare Data (552/2019) and the Finnish Statistics Act (280/2004).

Results

Table 1 presents the descriptive statistics of school performance and birth size indicators by the categories of family SEP and sex. Girls had better school performance and higher BPI but lower BW and BL than boys. For school performance, the largest SEP gradient was found for maternal education and smallest for household income. BW and BL showed positive gradients for all SEP indicators. For BPI, the SEP gradients were negative but generally weak.

Table 1 Percentages of participants and means and standard deviations (SD) of school performance at 16 years of age and weight, length and ponderal index at birth according to childhood family socio-economic position.

We started the statistical modeling by studying the associations between birth size and school performance at the individual level (Table 2). Greater BW and BL were generally associated with better school performance and greater BPI with lower school performance (Model 1). BL explained a larger proportion of variation of school performance (0.08% in boys and 0.15% in girls) than BW (0.02% and 0.05%, respectively) or BPI (0.07% and 0.04%, respectively). Adjusting the results for demographic risk factors strengthened the association for BL but practically eliminated the association for BPI (Model 2). This was mainly because parity was negatively associated with school performance but positively associated with birth size (Supplementary Table 3). For BW, adjusting the results for demographic and gestational risk factors (Model 3) strengthened the association between smaller BWs and school performance. This made the association between BW and school performance more curvilinear as demonstrated in Fig. 1. For BL, adjusting the results for gestational risk factors and family SEP (Model 4) explained a part of the association (32% in boys and 22% in girls) when compared to the results adjusted only for demographic factors (Model 2). For the other birth size indicators, the effect of adjustment was less systematic: for BW, the adjustment for family SEP weakened the association with smaller BWs (i.e., decreased the linear effect) whereas for BPI, the negative association was seen after the adjustment for gestational risk factors, but the further adjustment for family SEP explained a part of this negative association.

Table 2 Regression coefficients (β) for school performance with 95% confidence intervals (CI) of standardized birth weight, length and ponderal index in boys and girls.
Fig. 1
figure 1

The association between birth weight and school performance at 16 years of age by sex.

Next, we analyzed how the association between BW and school performance was modified by family SEP (Table 3; the interaction parameter estimates are available in Supplementary Table 4). Generally, the effect sizes for both the positive linear and the negative quadratic BW effects were stronger in higher SEP categories in boys and girls when adjusting only for demographic factors (Model 1). This suggests that in higher SEP families, the association between BW and school performance was seen in particular with smaller BWs whereas in lower SEP families the association was seen over a wider range of the BW distribution. This is demonstrated in Supplementary Figs 24: where in lower SEP families the association between BW and school performance was nearly linear, in higher SEP families the association was weak and slightly lower school performance was seen only at the smaller end of the BW distribution. Adjusting the results for gestational risk factors strengthened both linear and quadratic effects (Model 2): this suggests that these adjustments made the association between BW and school performance more curvilinear.

Table 3 Regression coefficients (β) with 95% confidence intervals (CI) of standardized birth weight according to family socio-economic position in boys and girls.

We then repeated similar SEP interaction analyses for BL and BPI (Table 4; the interaction parameter estimates are available in Supplementary Table 5). When adjusting the results for the demographic factors, BL systematically showed the weakest positive association with school performance in high SEP families (Model 1). Adjusting the results for the gestational risk factors decreased these SEP gradients, but they were still seen in particular in girls (Model 2). For BPI, no association with school performance was seen in any SEP category when adjusting the results for the demographic factors (Model 1). After adjusting the results for gestational risk factors, weak negative associations with school performance were seen (Model 2). However, these associations did not systematically differ between the SEP categories.

Table 4 Regression coefficients (β) with 95% confidence intervals (CI) of standardized birth length and ponderal index according to family socio-economic position in boys and girls.

Finally, we analyzed the association between body size and school performance within families to take into account unmeasured shared family characteristics (Table 5). Within full siblings, the associations between birth size indicators and school performance were weak in the model adjusted only for the demographic factors (Model 1). However, when we adjusted the results for gestational risk factors, we found that BW and BL were associated with school performance in a similar direction as in the individual-level analyses (Model 2). However, the effect sizes were weaker than in the individual-level analyses in boys and girls. When we analyzed these associations within twin pairs, the associations were again in the same direction as in the individual-level and full-sibling analyses. However, because of the more limited sample size, the confidence intervals were wide thus making it difficult to evaluate the effect sizes. As in the individual-level analyses, the association between BPI and school performance was weak, both within full siblings and twins.

Table 5 Within family regression coefficients (β) with 95% confidence intervals (CI) standardized birth weight, length and ponderal index in full same sex siblings and twins.

Discussion

In this large longitudinal study of children, we found that BL showed a linear association with school performance in late adolescence, whereas the association between BW and school performance was curvilinear. BL also explained more of the variation of school performance than BW. Even though some previous research has shown a positive association between BL and cognitive performance,12 the studies are still rare when compared to the large number of studies analyzing BW.7,8 Our results suggest that BL is a better indicator of the prenatal environment than BW within the normal variation of birth size. According to our initial hypothesis, BPI was negatively associated with school performance. However, this negative association was explained by birth order whereas the adjustment for birth order made the association between BW and school performance more curvilinear. This was because higher birth order was associated with lower school performance, as is also well demonstrated in previous studies,20,21 but larger birth size. It is possible that there are different mechanisms behind these associations if, for example, family dynamics or social resources explain the negative association between birth order and school performance but physiological changes of the uterus during consequent pregnancies explain the positive association between birth order and birth size. Thus, in contrast to our hypothesis, BPI does not contribute to the curvilinear association between BW and school performance.

Even when BL showed a robust association with school performance, the absolute effect size was modest and BL variation explained only a small proportion of school performance variation (0.08% in boys and 0.15% in girls). This translates to a 0.03 increase of mean grade in boys and 0.04 in girls per 1 SD change in BL. However, even when small, the explained part of school performance variation in our study was still considerably larger than that found between BL and several IQ measures in adulthood (0.02% or less) in a Danish study,22 and in both of these studies it was larger than that found for BW. A limitation in the previous Danish study22 is that it pooled males and females and thus cannot study sex differences, which was also the case in a previous Norwegian study studying BL that only included males.12 Thus, our study suggesting that BL has a stronger effect on further cognitive performance in girls than in boys is unique. The small effect size is expected when considering that BL and BW are only indicators of the prenatal environment rather than causal risk factors of further school performance, as also supported by previous Mendelian randomization studies.9,10 This observation is also consistent with our results that twins had lower BL and BW than singletons, but their school performance was nevertheless better. This result is not surprising considering that there are likely to be different mechanisms behind growth restriction in twin and singleton pregnancies.23

We obtained more robust evidence on the role of BL as an indicator of environmental risk factors in analyses of how family-level factors modified these associations. Adjusting the results for gestational risk factors and family SEP explained around a third of the association between BL and school performance. Further, the associations of BL and BW with school performance were weaker within families than between individuals supporting the idea that the birth size indicators partly reflect family-level risk factors. However, it is noteworthy that in particular after taking into account gestational risk factors, the association of BL and BW with school performance was also seen within families. This is consistent with previous findings that lower BW is associated with shorter education24 and lower IQ25 within twin pairs. These results support the idea that the association between birth size and later school performance is not only explained by family background but can also reflect unique risks related to each pregnancy. Our results stratified by family SEP are also consistent with the hypothesis that BL and BW reflect familial risk factors. The association between BL and school performance was stronger in low and intermediate SEP families as compared to high SEP families. Family SEP also modified the shape of the association between BW and school performance: where in low SEP families the association was nearly linear, in high SEP families it was weak and curvilinear, with associations observed only with smaller BWs. Thus, population heterogeneity according to SEP can contribute to the curvilinear association between BW and school performance.

The stronger association of BL than BW with school performance can reflect the changing effects of intra-uterine environmental factors during pregnancy. The fetus gains most weight during the third trimester and most length during the second trimester of pregnancy.26 Thus, BL may better reflect environmental conditions during the periods most critical for neurodevelopment. However, this is difficult to determine with any certainty since the human brain also develops during the third trimester.26 It is also possible that BW more reflects the natural constraints of the uterus to adjust to the growing fetus than BL. Our results are consistent with this idea since multiple pregnancies, which are more prone to the physical restrictions of the uterus than singleton pregnancies, were more strongly associated with BW than BL, but twins nevertheless had better school performance than singletons (Supplementary Table 1). This hypothesis is also consistent with the results that in girls, who have smaller birth size than boys and are thus less prone to these constraints, BL is more strongly associated with school performance. Even though we cannot empirically evaluate all the possible mechanisms, our results suggest that BL better reflects environmental variations during the most critical phases of pregnancy for cognitive development than BW.

Our data have strengths but also weaknesses. Our main strength is our very large and representative register-based data that also includes girls, which has not been available in some previous large-scale studies utilizing conscription registers. Parental education and income are register based and thus not prone to reporting bias. Our baseline data cover all children born in Finland. However, during the follow-up, we lost around 7% of these children because of early mortality, emigration, drop-out from primary school or not continuing to secondary education. This has probably made our results more conservative. A limitation is that we do not have direct information on IQ or other direct indicators of cognitive performance. Further studies should evaluate whether the association between birth size and school performance are mediated through IQ or whether there are also other mediating mechanisms. Additionally, our anthropometric measures were limited to BW and BL. More detailed anthropometric measures, such as head and waist circumferences and body composition measures, could provide more information on these associations.

In conclusion, BL shows a linear and BW a curvilinear association with school performance in late adolescence. Family SEP modified these associations, which can indicate that the birth size indicators reflect more the intrauterine environment in the presence of low family social resources. The effect sizes are small and thus birth size cannot be used, e.g., for identifying children with particular schooling needs. However, at a population level, BL can offer useful information on the intrauterine environment affecting later-life cognitive development.