Introduction

Following the lead of Becker and Lewis1, economists have studied the effect of family size on educational attainment, including years engaged in schooling and level if education completed. Based on this research, one well-established opinion is that, because they receive fewer educational investments, children in large families may not attain high level of education2,3,4,5. Becker and Lewis's1 theory predicts a negative family size effect on educational attainment. Many studies follow and extend their theory. Social scientists6,7 have revealed that family size has a negative effect on the educational performance of adults, primarily caused by the dilution of resources that accompanies family growth6. Additionally, the marginal cost of each additional sibling can amount to a loss of one fifth of a year of schooling8. Increasing family size also lowers the share of financial resources for each child, as well as benefits that are transferred across generations, such as their inheritance. Some evidence9,10 supports that this effect on children can be evidenced both immediately and over time. Using a provincial sample, Chen et al.11 indicated that sibship size has significant effects on decreasing high school and college enrollment. However, Åslund and Grönqvist’s12 study showed that sibship size was a minor but significant negative impact on grades in compulsory and secondary school among children who might be vulnerable to further restriction of parental investment. They did not find any evidence of a causal effect on long-term outcomes. Furthermore, family size also has a negative effect on parents’ labor market outcomes. Angrist and Evans13 used twins as an instrumental variable for identifying the causality of family size on parents’ income and labor market participation. They found that the incomes of both parents and their labor market participation decreased when the family size increased, which might have a negative impact on their children’s educational performance. Parents invest resources, including financial resources, time, and effort, in their children. Parents' resources are relevant to children's general skill development. Increasing the number of children in a family may dilute the parents' ability to invest meaningfully in each child's human capital. This may lead to a child's low educational attainment.

While a number of studies find that sibship size may have a negative effect on children’s performance, there are still differing opinions regarding this. Steelman et al.14 suggested that not all parents' resources were associated with improved children's educational outcomes. Frenette15 mentioned that the current literature had not yet reached a consensus on the association between parental investment in child quality and a child's cognitive ability. Furthermore, the family size effect on educational attainment is uncertain16. In fact, an additional child may not cause resource dilution. Diaz and Fiel17 suggested that a sibling had little effect on the educational attainment of older siblings in families having fewer than five children. The extent of the sibship effect seems to weaken over time in Asian countries18,19. Thus, there may be economies of scale relative to raising more children. Theoretically, the scale economy effect should be stronger at smaller quantities20. Effective sibling interactions may help overcome the shortage of resource dilution, and parents may gain experience from raising older children, which can in turn help them rear their younger children more effectively. Parents may also learn to prevent the dilution of resources that are the keys to children's educational success15,21, Lafortune and Lee22.

This study focuses on the only-child effect on student educational attainment. Specifically, we examine the difference in test scores in each subject, such as Chinese, mathematics, and English, between children in one-child families and two-child families. Most previous literature obtained causal estimates by looking at multiple births. They usually estimated the effects of family size from 2 to 3 (such as3,23 or 3 to 411. However, it is less clear what happens when the family size goes from 1 to 2. In other labor market contexts, Vladasel24 found that negative marginal family size effects for incorporated entrepreneurship on all offspring in families with five or more children. His finding implies that the negative marginal effects do not work until the family size is larger than five children. In education contexts, our results may be similar to Vladsel’s24 finding. The potential contribution of this paper is that it may provide a precise estimation of the marginal effect of family size on educational attainment in families of one child or more than one child.

To obtain the causal effect of only children on educational performance, we apply the random gender of the firstborn as an exogenous instrument for whether a student is an only child. Lee25 first introduced the "first girl" as an instrument for family size in South Korea. Since both China and South Korea belong to the Confucian cultural circle, this instrument is also applicable to China. The results show that children in one-child families have significantly lower test scores than children in two-child families, even after controlling for potential confounders reflecting students' characteristics and family background. An inverted U-shaped relationship can be expected, as the scale economies effect dominates when there are a small number of children in a family.

Data

We used a dataset from the China Education Panel Survey (CEPS) for our empirical analysis. The CEPS contains information on students, parents, classrooms, and schools. It is a school-based and nationally representative survey, including 438 classrooms in 112 schools in 28 county-level units in mainland China. We examined the dataset for the 2013–2014 academic year, which includes information for over 20,000 students in seventh and ninth grades.

Since this paper aims to compare the difference in educational performance for children from one-child and two-child families, we used samples from one-child and two-child families in the baseline regression. We also used children from one-child and three-child families as a sample in the heterogeneity analysis, whereas we used compliers as a sample in causality.

In addition to a baseline survey on students' personal, family, classroom, and school characteristics, the dataset also contains their standardized test scores in mathematics, Chinese, and English from the fall term. The mean value of the standardized scores is 70, while the standard deviation is 10. The distributions of variables are summarized in Column (1) of Table 1. Figures 1, 2, 3 show the distributions in the Chinese, mathematics and English scores.

Table 1 Summary statistics.
Figure 1
figure 1

Distribution of Chinese test score.

Figure 2
figure 2

Distribution of mathematics test score.

Figure 3
figure 3

Distribution of English test score.

The regressor of interest (only_child) is a dummy variable. Respondents from one-child families have a value of 1, whereas respondents from two-child families (or three-child families, in heterogeneity analysis) have a value of 0. Table 1 shows that children in one-child families accounted for more than half of the sample (58%).

We use willpower, parental control, and educational expectations from parents as mediators in the mechanism check. Table 1 also shows the description of mediator variables.

First, we include three items about self-control to measure willpower, "I still go to school even if I feel a little uncomfortable, and I have not had any absent excuses", "I will try my best to finish my homework even if I do not like it", and "I will try my best to finish my homework even if I have to spend a great deal of time on it". These statements are designed to gauge the willpower of an individual resisting temptation, demonstrating impulse control and overcoming procrastination. This represents the three dimensions of willpower "I Will", "I Won't" and "I Want". To specify, the numerical value 1–4 represents the lowest willpower to the highest ("strongly disagree", "disagree", "agree", and "strongly agree").

Second, we included three items to measure the extraversion of a student, "I consider myself easy to get along with", "I feel close to people in this school", and "I often take part in school/class activities" The answer choice were "strongly disagree", "disagree", "agree" and "strongly agree". All variables are intended to measure students’ extraversion.

Third, we include three items regarding the strictness if parents to measure parental control. They are "how strict are your parents concerning your homework and exams", "how strict your parents are with your school performance", "how strict are your parents concerning your school activities", "how strict are your parents regarding curfew", "how strict are your parents regarding your time using the Internet", and "how strict are your parents regarding your time spent watching television". The numerical value 1–3 represents the parents' strictness from lowest to highest ("1-not strict", "2-normal", and "3-strict").

Finally, we include two items related to parental support, "How frequently do your parents check your homework and assignments?" (check) and "How frequently do your parents tutor you while working on homework assignments?" (tutor). The numerical value 1–4 represents the lowest parental support to the highest ("never", "1–2 days per week", "3–4 days per week ", and "everyday").

Empirical strategy and results

Baseline regression

Our interest is in determining whether there is an association between a student’s standardized test scores (Score) and being from a one-child family exploring the influence of personal characteristics and family characteristics. We estimate the following model using the ordinary linear squared method (OLS):

$$Score_{igc} = \beta_{0} + \beta_{1} \times Only\_Child_{igc} + X\beta + Grade + Classroom + \varepsilon_{igc}$$
(1)

where \({Score}_{igc}\) represents scores of a grade g student i from classroom c. We compare the scores of children in one-child families to children in two-child families. If the value of \({Only\_Child}_{igc}\) was 1, student i was from a one-child family; otherwise, the student i was from a two-child family. The set of control variables X includes student characteristics (i.e., ethnicity, gender, age, and age squared) and family characteristics (i.e., economic status, hukou type at birth, parental education). The Hukou is Chinese household registration system. The system descends in part from ancient Chinese household registration systems. The hukou system also influenced similar systems in neighboring East Asian countries—such as one within the public administration structures of Japan (koseki) and Korea (hoju), as well as Vietnam (hộ khẩu). The functions of the system regulate population distribution. In mainland China, all PRC nationals’ personal hukou status is classified by residential location (rural/non-rural/urban). The hukou status always connects to individual’s benefits. For example, rural residents have right to receive benefits from Village-owned Enterprise and land rent of their village. \(Grade\) is the grade fixed effect, and \(Classroom\) is the classroom fixed effect. For ease of interpretation, all scores are standardized to have a mean of 70 and a standard deviation of 10.

Table 2 reports the results of estimating Eq. (1) for the standardized scores of Chinese, mathematics, and English. Columns (1), (3), and (5) report results without any controls, whereas Columns (2), (4), and (6) include the full set of control variables. For Chinese students, the predictions show that being an only child has a statistically significant negative effect on exam performance. Specifically, only child Chinese exam scores are 0.54 lower than children from two-child families. For mathematics, only child exam scores are 0.633 lower than the children from two-child families. However, the result of Column (3) is insignificant. The negative effect may be hidden by other confounding factors. For English, the only child effects are insignificant. We also show results of correlation matrix in Tables A1, A2, A3 in Appendix.

Table 2 Associations between only-child status and standardized test scores.

Let us now turn to the heterogeneity analysis. Table 3 shows heterogeneous effects by gender. We consider the interaction terms of the results in Table 3. The interaction terms in Columns (1) and (2) are insignificant, whereas the interaction term is significantly positive in Column (3). This implies that men perform worse than women, and females in only-child families perform worse than females in two-child families.

Table 3 Heterogeneity analysis: only children's gender.

There are still some confounding factors that should be considered. As26,27 explain in substantial detail, this estimation conflates the effects of family size and birth order. Black et al.26 find a negative correlation between family size and children's education, but when birth order is included, family size effects become negligible. The birth order effect is strong in long term27. They reveal that birth order effects tend to be negative, which would make the estimate of the only-child effect an underestimate. However, this effect is reversed in some contexts. De Haan et al.28, using family fixed-effects models, find positive and persistent birth order effects. They suggest that earlier-born children receive less quality time from their mothers and are breastfed for a shorter period. Therefore, we use children from one-child and two-child families as a sample to eliminate the bias caused by birth order effects. We compare the educational outcomes of only children to children from three-child families as well.

Columns (1)–(3) of Table 4 show that children from one-child families perform as well as firstborn children in two-child families in Chinese and English, but children from one-child families perform worse in Mathematics than firstborns in two-child families. This may imply that the significant only-child effect in Table 1 is caused by the outstanding performance of second-born children in two-child families. Columns (4)–(6) of Table 4 show that there is no difference between only children and firstborns in three-child families. The firstborns may be “immune” from the sibship size effect in small families. Our results are consistent with those of Shen et al.20.

Table 4 The comparison of educational outcomes between only children and firstborns in two-child/three-child families.

Table 5 shows heterogeneous effects by gender. Columns (1)–(3) of Table 5 show that children from female one-child families perform worse than female firstborn children in two-child families in Chinese, Mathematics and English, whereas Columns (4) and (5) show that children from female one-child families perform as well as female firstborn children in three-child families in Chinese and Mathematics. However, children from female one-child families perform worse than female firstborn children in three-child families in English. It is worthy to note that males perform worse than females in the sample of only children and firstborn from two-child and three-child families. We also consider the interaction terms of the results in Table 5. The interaction term in Column (3) is significant, whereas the interaction term is other Columns are insignificant. This analysis shows that females’ performance improves when they have a younger sibling. Their performance changes back when they have a second younger sibling.

Table 5 The comparison of educational outcomes between only children and firstborns in two-child/three-child families by genders.

In sum, the academic performance of children from one-child families is not better than that of children from two-child families. An only child who is male performs as well as male children from two-child families in English. Our results follow Lao and Dong’s3 findings and fill the gap. Lao and Dong3 simply compare the average test scores of only children and non-only children, which conflates the effects of family size and birth order. Furthermore, they use the samples whose sibling number is smaller than five, which may enhance the bias from birth order. We compare educational outcomes of only children and children (especially firstborns) in two-child families, which eliminates the bias from birth order.

Causality

Fertility decisions depend on common factors29. Omitted variable bias can make the results of estimation unreliable. To address this problem, we use the gender of the first child to instrument whether a student is in an only child family. Son preference means that parents continue to have children until they have an ideal number of boys. It is a norm deeply rooted in social, cultural, and economic factors in Asian countries such as China, Korea, and India30. If parents have son preferences and their first child is a girl, they may have the propensity to try to have another child. Thus, the first child's gender may be a predictor for the probability of being a one-child family.

We use the IV method and estimate a two-stage least squares (2SLS) model. The key is to identify a variable that predicts Only_Child but is uncorrelated with the error term \({\epsilon }_{igc}\) in Eq. (1). We use an indicator for a firstborn girl (first_girl) as an instrument and estimate the 2SLS model:

$${Only\_Child}_{igc}={\alpha }_{0}+{\alpha }_{1}\times {first\_girl}_{igc}+X\alpha +Grade+Classroom+{\mu }_{igc}$$
(2)
$${Score}_{igc}={\pi }_{0}+{\pi }_{1}\times \widehat{{Only\_Child}_{igc}}+X\pi +Grade+Classroom+{v}_{igc}$$
(3)

where \({first\_girl}_{igc}\) is a dummy variable that equals 1 if the firstborn is a girl; otherwise, it is 0. This approach is similar in spirit to that of Lee25. Lee25 first introduced that "first girl" can be a good instrument for family fertility decisions in which strong preferences for sons and small families are social norms. Equations (2) and (3) are the first-stage regression and the second-stage regression, respectively. Standard errors are clustered at the classroom level.

Let us now turn to discussing the validity of the "first girl" as an instrumental variable. If parents controlled births based on sex, the sex of the firstborn cannot be said to be random. Thus, sex-selective abortions may make the instrument invalid. Chinese authorities have been made aware of the misuse of ultrasound machines for prenatal sex determination, which have led to the subsequent abortion of female fetuses. In the Law on Maternal and Infant Health Care, authorities strictly prohibited the use of medical techniques for the identification of fetal sex that may in turn lead to selective abortion for nonmedical reasons. In addition, many studies31,32 have found that Chinese parents do not use sex-selective abortions for firstborns, but they may for those of higher birth order. Nie32 notes that the policy permits a couple to have a second child if the sex of the firstborn is female. This may serve to dissuade parents from using sex-selective abortion for firstborns. These studies illustrate that the sex of the firstborn is a credible instrumental variable.

Moreover, Kugler and Kumar33 point out that the sex of the firstborn is not correlated with educational attainment because it is determined randomly. In other words, the sex of the firstborn will not satisfy the exclusion restriction if it is not random. As discussed in the last paragraph, Chinese parents do not use sex-selective abortion for firstborns. Therefore, Kugler and Kumar33 suggest that the sex of the firstborn satisfies the exclusion restriction of the instrument and there is little evidence that households with firstborn girls are different in other ways from those with firstborn boys.

Last, the sex of the firstborn child may be correlated with educational attainment due to the parents’ son preference. Based on the concept of son preference, parents may elect to have one more child if the firstborn is a girl. Parents with a son preference may spend more time on sons, which may lead the only-child effect impact being overestimated when it is positive (and underestimated when it is negative). Thus, we control for students’ gender in the IV estimation.

We also investigate whether the instrument is likely to be exogenous. The results in Table 6 show that the explanatory variables, except for economic status (wealthy) and father's years of schooling, are insignificant. As summarized in Table 1, the proportion of students in wealthy families accounts for 0.2% (26 respondents). Therefore, it has little impact on the only-child effect. This means that the gender of the firstborn is likely exogenous because it is not related to observable family characteristics.

Table 6 Regression of firstborn girl on household characteristics.

Table 7 reports the results of estimating Eqs. (2) and (3) for the standardized scores of Chinese, mathematics, and English. The IVs in the first-stage regression have significant explanatory power. The first girl significantly decreases the students' probability of being only children. All the F-statistic values for the instrument are much larger than the rule-of-thumb threshold of 1034. The results, except English, are significantly negative. The IV estimates show that the OLS estimates are underestimated, possibly due to omitted factors.

Table 7 The IV estimation of the effect of only-child status on standardized test scores.

The IV estimation is called the Local Average Treatment Effect (LATE). The change in the instrumental variable causes the endogenous variable to have different probabilities in different values in the first stage, and this difference is then transferred to the dependent variable in the second stage35 (Abadie 200336). Imbens35, and Abadie (2003)36 divide the population into four categories: always-takers, defiers, compliers, and never-takers. In other words, LATE can only be identified by estimating compliers. Thus, we estimate the only-child effect using compliers as a sample. In our case, the gender of the firstborn is an instrumental variable. When the first child of a family is a girl, parents have a higher likelihood of having one more child. Thus, we define families with an only girl as defiers, whereas two-child families with a male firstborn as always-takers and never-takers.In our case, it is difficult to identify always-takers and never-takers. We cannot know the exact fertility motivation of two-child families with male firstborns. We remove defiers, always-takers, and never-takers and define the rest of the sample as compliers.

Table 8 shows an estimation of the only-child effect using compliers as a sample. The results of Table 8 are consistent with the results of Table 2. The only-child effects are significant in the subjects of Chinese and mathematics, but they are insignificant in English. Specifically, children in one-child families on average have lower (0.96 and 1.06) standard scores in Chinese and mathematics, respectively, than children in two-child families.

Table 8 The analysis using compliers as a sample.

Table 9 shows the estimation of the effect of only-child status on standardized test scores by gender. We use the male and female subsamples with IV estimation. The IVs in the first-stage regression have significant explanatory power. The first girl significantly decreases the students' probability of being only children. All the F-statistic values for the instrument are much larger than the rule-of-thumb threshold of 1034. The results of the male subsample are consistent with Table 6. Only-child effects are significant in the subjects of Chinese and mathematics, while these effects do not work in female subsamples.

Table 9 The IV estimation of the effect of only-child status on standardized test scores by genders.

In summary, only-child status has causal effect on scores in Chinese and mathematics. Male children in one-child families perform worse in Chinese and mathematics than males in two-child families. Finally, only-child effects do not work in female subsamples.

Mechanism check and mediation analysis

We find that an only-child status has significantly negative effects on students’ Chinese and mathematics performance. Shen et al.20 mentioned that the sibship size effect depends on the relative magnitude of two competing effects: scale economies and resource dilution. In this section, we explore potential mechanisms and, in particular, focus on economies of scale (including willpower and extraversion) and resource dilution effects (including parental control and parental support), which may change when an additional child is added to a one-child family. Accordingly, we conduct a decomposition analysis, which shows that these channels can explain the only-child effect.

Economies of scale

Here, we examine how economies of scale vary by the only-child status of a child.

The increase in sibship size may benefit children in the family by reducing the average cost of each child, which is referred to as economies of scale. The economy of scale affects children’s quality in two ways. First, parents can practice and improve their skills when the family size increases. They know how the shape their children's personalities efficiently. Second, children can play with their siblings, which reduces the cost to search for peers. Children are forced to develop interpersonal skills in the presence of siblings37,38, and they learn self-control as well39. In fact, Cameron et al.40 claim that “sibling deprivation” results in only children in fewer skills for peer-based interactions, which increase the cost of acquiring social interaction. It seems that the increase in sibship size reduces the costs of children’s development.

However, it is difficult to measure all costs of raising children. Some costs are unobservable. For example, we cannot measure the opportunity cost of interacting with siblings. Therefore, we assume that children with low costs are easy to acquire a better quality of personality. We use willpower and extraversion as proxies for economies of scale.

First, it is possible that children from one-child families may have an average lower level of willpower, which, in turn, affects student outcomes. To assess the relevance of this mechanism, we construct an index using principal component analysis (PCA) and three items from the student survey: "I still go to school even if I feel a little uncomfortable or have any absent excuses", "I will try my best to finish my homework even if I do not like it", and "I will try my best to finish my homework even if I have to spend a great deal of time on it". Students are asked to rate to what extent they agree with the statement on a scale from 1 (strongly disagree) to 4 (strongly agree). The Cronbach’s alpha of the index is 0.728.

Table 10 reports the estimation results, showing that the willpower of children in one-child families is on average lower than that of children in two-child families. Zhang and Qian41 and Yang et al.42 found that children from one-child families have less self-control. The results of this paper are consistent with their findings. Furthermore, Tangney, Baumeister, and Boone43 suggest that a higher self-control level was associated with higher academic performance. They use college students as a sample and found that students with higher scores on self-control had better grade point averages. This paper supports their opinion. We find that junior high school students' willpower is correlated with their educational attainment. Table 9 shows that the coefficients of willpower to Chinese and mathematics are 0.95 and 1.11, respectively. This means that students with higher level of willpower have better academic performance. The only-child status can indirectly affect educational attainment through a mediating mechanism—significantly lowering willpower levels. In particular, 8.18% of the total effect on success in Chinese is mediated by willpower, whereas 8.22% of the total effect on mathematics is mediated by willpower.

Table 10 Mechanism: Willpower.

Some explanations of the Chinese case are as follows. Children in one-child families are more likely to be spoiled. Furnham and Wu44 explain that parents may invest more financial and nonfinancial resources in their only-child, particularly boys. Parents may excessively dote on their only child. They may protect their only child from "normal" challenges, weakening the willpower of the child. Moreover, self-control predicts academic performance at all levels of schooling45, including middle school46,47. It improves homework completion and classroom conduct48. This may be the explanation of willpower as a mediator of the only-child effect on educational attainment.

Second, it is possible that extraversion level of children in one-child families may be lower on average, which affects student outcomes. To assess the relevance of this mechanism, we construct an index using PCA and three items from the student survey: "I consider myself easy to get along with", "I feel close to people in this school", and "I often take part in school/class activities". Students are asked to rate to what extent they agree with the statement on a scale from 1 (strongly disagree) to 4 (strongly agree). The Cronbach’s alpha of the index is 0.711.

Table 11 reports the estimation results, which show that the extraversion of children in one-child families is on average lower than that of children in two-child families. Children in two-child families have a higher rate of interaction with others (such as a sibling), which may lead to the children in two-child families developing a higher level of extraversion. Extraversion has been linked with academic success in school. For example, students with high extraversion are likely to invest their efforts in intellectual activities49,therefore, there is a positive relationship between cognitive ability and extraversion50. Moreover, Steel et al.51 suggest that the sociability component of extraversion help students learn due to frequent interactions with teachers. Teachers have a tendency to perceive talkative children as more intelligent and more academically gifted than shy students52. Our results are consistent with their opinion. Table 10 shows that the coefficients of extraversion to Chinese and mathematics are 1.07 and 1.05, respectively. This means that students with higher-level willpower have better academic performance. A total of 12.48% of the total effect on Chinese is mediated by extraversion, whereas 10.44% of the total effect on mathematics is mediated by extraversion.

Table 11 Mechanism: Extraversion.

In our case, the expansion of family size has a positive effect on children’s development. An additional sibling may help them improve their personalities (in willpower and extraversion). We suggest that is the above improvement may be attributed to scale economy effect.

Resource dilution

A second possible mechanism is the resource dilution effect. Parents may be strict with their only children and likely to increase parental control and parental support because they are investing more in their only child. Parental control is defined as parental intrusiveness, pressure, or domination intended to control children's behavior53, whereas parental support can be defined as the value of parental assistance. Pomerantz et al.54 found that parental control decreased children's efforts in challenging learning situations and lowered their sense of competence and autonomy.

We explored three items about strictness of parents to measure parental control. These included "how strict are your parents concerning your homework and exams", "how strict your parents are with your school performance", "how strict are your parents concerning your school activities", "how strict are your parents regarding curfew", "how strict are your parents regarding your time using the Internet", and "how strict are your parents regarding your time spent watching television". The numerical value 1–3 represents the parents' strictness from low to high. The Cronbach’s alpha of the index is 0.731. Table 12 reports the estimation results. The results show that parental control of children in one-child families is on average lower than that of children in two-child families and is not correlated to children’s outcomes. It is interesting that the average parental control level of only children is lower than that of children in two-child families. This seems to be the net effect of the little emperor. Families do what the child wants, not the other way around. Only children are more likely to be spoiled.

Table 12 Mechanism: Parental control.

Finally, we explored two items about parental support. They are "how frequently do your parents check your homework and assignment" (check) and "how frequently do your parents tutor you while working on homework and assignments" (tutor). The numerical value 1–4 represents "never", "1–2 days per week", "3–4 days per week ", and "everyday". The Cronbach’s alpha of the index is 0.761. Table 13 reports the estimation results, which show that the parental support of children in one-child families di not differ from that of children in two-child families.

Table 13 Mechanism: Parental support.

Tables 12 and 13 show that parental control and parental support are not mediators in the only-child effect. Thus, the resource dilution effect does not work in our study. Shen et al.20 suggested that the scale economies effect was strong, and the resource dilution effect was weak with a low number of siblings. Our results support their opinions.

Finally, we discuss the overlap between economies of scale and resource dilution. Table 11 shows that there are differences in parental control between only children and children in two-child families. Less controlling parents can have more extroverted children. Children may develop their conscientiousness when parental control is absent. Table 14 shows that parental control is correlated positively to willpower and extraversion. It may imply that parental control helps the personality development of children. Since children in two-child families have a higher level of control from parents, they have a high level of willpower and extraversion, which benefit educational performance. In this opinion, willpower and extraversion are not only proxies of economy scale but also proxies of resource dilution.

Table 14 The association between parental control and children’s personalities.

However, the evidence of this paper is insufficient to support the above argument. First, the correlation does not eliminate omitted factors. There are some factors affecting both parental control and children’s personalities. For instance, the community and neighborhood may strengthen both parental control and children’s personalities. Second, there is simultaneity in the correlation between parental control and children’s personalities. Parents change their control over children when children’s personalities change. Thus, we suggest that there may be an overlap between economies of scale and resource dilution but we cannot claim that the overlap does exist. It can be explored in the future studies.

Conclusion and discussion

The Chinese government abolished the only-child policy in 2015, and it has encouraged increased fertility since 2015. The impact on existing children's educational attainment may be one of the most important factors in parents' fertility decisions. This paper provides evidence that the average test scores of children in one-child families are lower than those of children in two-child families. This implies that the marginal effect o sibship size on the educational performance of children in one-child families is nonnegative.

The study of quantity-quality trade-off is well established (e.g.,5. Lao and Dong3 found that only children's educational outcomes are significantly higher than the educational outcomes of children having siblings. Using the same dataset of this paper, they compared children in one-child families to children having more than one sibling but fewer than five, and found that the sibship effect on educational attainment may be overestimated. They neglect the scale economies effect, which is one of the contributions of this paper, as we found that the scale economies effect is dominant to the sibship size effect when the family size is small.

This article uses a sample of students from one-child families and two-child families. It provides evidence that an additional sibling is not harmful to the educational attainment of the original only child. This result is similar to Shen et al.20, Diaz and Fiel's study17 and has practical use for China's policymaking. Although Lao and Dong3 introduce personalities to analyze the only-child effect, the large family size causes some bias. For example, they suggest that the extraversion of children in one-child families is higher than that of children in large families,interestingly, the "middle child syndrome" may lead to a positive effect. Since children in large families have more opportunities to play with others (their sibling), their level of extraversion should be greater than that of an only child. This paper uses a sample of students from one-child families and two-child families to eliminate the "middle child syndrome" bias. Furthermore, Lao and Dong3 neglect the role of willpower in the only-child effect. We fill this gap and find that willpower is an important influencing factor to the negative only-child effect.

Scholars illustrate the quantity-quality tradeoff by the resource dilution hypothesis (e.g.,55. They suggest that the arrival of newborns dilute a family's financial resources. However, Lao and Dong3 pointed out that some resources can be shared (such as books). Although Lao and Dong3 admited that the resource dilution effect was an important channel of the sibship size effect, our results find that parental concerns are the same in both one-child families and two-child families. Therefore, there may be little or no dilution effect of resources for small families. Furthermore, economies of scale regarding raising more children may offset parts of the dilution effect. Both parents' experiences of raising older children and effective sibling interactions may also weaken the dilution effect15,21. Thus, the effect of sibship size on educational performance may be an inverted U shape. Since the marginal sibship size effect for an only child is nonnegative, a "two-child policy" may not be harmful to the educational attainment of children.

There are some limitations of these data. First, this data cannot identify the firstborn's gender in three-child families. Second, the sample size of three-child families is too small (approximately 1531 observations) compared to the sample size of one-child and two-child families. Both limitations above make further studies on three-child families difficult. Finally, the dataset is based on schools and classrooms and not just family. Thus, we cannot control for family fixed effects using this dataset. Although the "three-child policy" has been implemented since May 31st, 2021, it is still worth further studying the sibship effect on educational attainment for one-child families due to the current low fertility desire of China.

Ethical approval

All methods were performed in accordance with relevant guidelines and regulations.