Over the last decades, human reproductive research has increasingly focused on biodemographic and genetic factors.1 As child mortality diminished in contemporary societies, evolutionary researchers used childlessness and number of children as a proxy for reproductive fitness, which is the ability to pass on genes to subsequent generations. Additive genetic variance in fitness implies natural selection in populations, with the underlying assumption that alleles leading to higher reproductive success are passed on with a higher frequency in future generations.2 The erroneous misinterpretation of Fisher’s Fundamental Theorem of Natural Selection that genetic variance in fitness should be close to zero has resulted in less attention to the study of genetics and reproduction.3 Fisher argued that reproductive fitness is moderately heritable in humans, with a growing number of twin and family studies showing reproduction to be 25–50% heritable.1 Previous research has found genetic influences on fecundity and reproductive desires1, 4 with a recent GWAS isolating 12 genetic loci implicated in the timing and number of children.5

Reasons for genetic effects on childlessness could be gene–environment interaction, non-additive genetic effects, or new mutations that restore any genetic variance lost to selection. Another hypothesis is that sexual dimorphism or in other words differences in secondary sex characteristics, operates since genes contributing to male childlessness are inherited via the female lineage and those for female childlessness via the male lineage.6 There are likewise sex differences in biological makeup, processes and diseases implicated in infertility and behavior. For women, ovulatory problems, tubal damage, endometriosis, cervix cancer and polycystic ovary syndrome are prominent causes of infertility, with sperm defects and testicular cancer being central factors for men.7 These diseases are partly heritable.8, 9 There is also a behavioral component to sexual dimorphism, since genes are implicated in different ways in relation to educational level and certain personality traits, including sociability, impulsivity and emotionality.10 These traits, which potentially have different effects on male and female fertility11, 12, 13 have also been shown to have a heritable component in previous research.13, 14 Isolating the extent of sexual dimorphism in childlessness fosters a better understanding of why genetic variation in this trait still exists.

Data from the TwinGene project of the Swedish Twin Registry, which includes genotyped same sex and opposite sex twin pairs, is used to answer this question. This study extends previous research in three central ways. First, research on childlessness has been sparse in behavior genetics. Second, we also focus on men, who have been largely neglected in this area of research.15 Third, heritability (ie, the proportion of variation in a trait within a specific population due to genetic variation), as well as sex differences, are estimated and contrasted using three different methods (see Figure 1 for an overview): the classical twin method, the genomic-relatedness-matrix restricted maximum likelihood (GREML) method on twins and polygenic scores (PGS) from a recently published GWAS on timing and number of children5 to assess the influence of SNPs (single-nucleotide polymorphisms) on childlessness for men and women at the molecular genetic level. SNPs are variations in a single nucleotide that occur at a specific position in the genome, where each variation is present to some degree within a population.16 The twin method enables us to compare results found in other countries and time periods. DZ twins share between 35 and 65% of their segregating genes,17 which is assumed to be 50% in twin studies. The GREML method uses actual measured genetic similarity between twins, resulting in more precise estimates of heritability (see Supplementary Figure 2 for a comparison between methods). We use the polygenic scores to assess to what extent a specific set of previously found SNPs influences childlessness differently in men and women.

Figure 1
figure 1

Methods used to examine heritability and sex difference in heritability for childlessness.

Materials and methods

Data and zygosity

This study uses data from the Swedish Twin Registry (STR) which is compiled from the national birth registry.18 The STR contains multiple sub-studies of which we use the TwinGene project, where a subset of twins from the Screening Across the Lifespan Twin study (SALT) were genotyped. Between 1998 and 2002 data was collected from twins born between 1911 and 1958. Between 2004 and 2008 a subset of SALT participants were genotyped for the TwinGene project.18 We restrict our analyses to TwinGene participants to ensure comparability of results between our different methodological techniques.

We include women aged 45 and older and men 50 and older, since <0.05% of children are born to mothers older than 4519 and no men in our sample had a first child after the age of 50. The TwinGene sample contains 10 909 individuals. After age restrictions, 9942 individuals remained, with 4534 men and 5408 women coming from a total of 6330 different families, 3612 complete twin pairs and 2718 individuals for whom there is only one twin from the pair in the sample.

Monozygotic (MZ) and dizygotic (DZ) twin pairs were distinguished using two different methods.18 If blood samples were available for both twins, genetic-based analyses were used to determine zygosity. This was not possible in two cases. In the TwinGene data, in some cases only one twin of a pair participated and was genotyped. Furthermore, in the case of presumed MZ pairs, also only one of each MZ pair was genotyped. In these two cases, zygosity was determined based on the responses to two questions. ‘During childhood, were you and your twin partner as alike as ‘two peas in a pod’ or not more alike than siblings in general?’ and, ‘How often did strangers have difficulty in distinguishing between you and your twin partner when you were children?’. If individuals responded ‘as alike as two peas in a pod’ and that strangers got confused ‘almost always or always’ or ‘often’, they were classified as MZ. This measure was tested by use of DNA markers and is accurate in 99% of the cases.20


The individuals were genotyped using the Illumina OmniExpress 700 K chip, imputed to the 1000 genome imputation panel. After imputation we selected SNPs from the HapMap3 panel since this SNP set was optimized to capture common genetic variation,21 which is required for the GREML analysis. For quality control, SNPs with a minor allele frequency <1%, a higher missing rate than 0.03 and that failed the Hardy-Weinberg equilibrium for a threshold of 10−6 were removed. The first 20 principal components are included as covariates to adjust for population stratification.22

Measure of childlessness

Women are considered childless if they have no children who are living and no children who are dead (stillborn children are not counted as children who are dead). Men are considered childless if they have no children who are living. This results in a small discrepancy between the measurement of childlessness for men and women and for that reason we do robustness checks using only living children for men and women.

Statistical methods

To test heritability and sex differences of childlessness we used three methods: twin and GREML models and polygenic scores, illustrated in Figure 1.

Twin method

To quantify the genetic contribution to childlessness (a binary trait), we estimated a liability threshold model.23 This model assumes an underlying normal liability distribution that divides individuals into the two groups of childless versus not childless. Thresholds (z-values) for dividing these groups were estimated based on the proportion of childless individuals. The tetrachoric correlation of the liabilities in childlessness among MZ and DZ twin pairs was estimated using trait concordances.23 These correlations were then used to estimate the contribution of genetic and environmental factors in the same way covariances are used for continuous traits. In all twin models we control for birth year.

To test for genetic and environmental influences of childlessness, ACE, ADE, AE and CE models were fitted, which estimate the effect of additive genetic factors (A), non-additive/dominance genetic factors (D), shared (common) environmental factors (C) and individual (unique) environmental factors (E). The latter also contains measurement error. As shown in Supplementary Figure 1, A correlates 0.5 in DZ twin pairs and 1 in MZ twin pairs, D correlates 0.25 in DZ twin pairs and 1 in MZ twin pairs and C correlates 1 in both MZ and DZ twin pairs.24 When MZ correlations are more than twice the DZ correlations, an ADE model is estimated, which tests for dominant genetic effects, since D correlates perfectly for MZ twins but only 0.25 for DZ twins. Since they are confounded, C and D cannot be estimated simultaneously in univariate models. The A, C and D parameters were estimated using a model-fitting approach in which A, C and D factors were dropped in a stepwise fashion from the full model (ACE model or ADE model) and sub models were compared to the full model by hierarchical chi-square tests. The difference in the goodness-of–fit (–2 log likelihood) between the sub- and full model is approximately chi-square distributed, with degrees of freedom equal to the difference in degrees of freedom. The model with the lowest Akaike’s information criterion (AIC=X2 –2df) reflects the optimal balance between goodness-of-fit and parsimony.

To examine quantitative and qualitative differences in the genetic and environmental etiology between males and females, sex limitation models were applied.25 Qualitative sex difference refers to whether different genes influence childlessness for males and females, tested by fitting a model in which the genetic correlation between opposite sex twin pairs was freely estimated and a model in which the genetic correlation is set at 0 (indicating independent genetic effects by sex) and by comparing these models to one in which the genetic correlation was fixed at 0.5 (indicating no sex differences). A better fit of the model with the genetic correlation set to 0 thus indicates that different genes are implicated in male and female childlessness.

Quantitative sex difference refers to different proportions of additive genetic (A), shared environmental (C) and individual specific environmental (E) influence. We first ran a heterogeneity model in which the A, C and E parameters can differ between males and females followed by a homogeneity model where parameters for A, C and E are fixed as the same for the sexes. Differences between the goodness-of-fit of models were tested as described previously. For all twin models we used the OpenMx package in R.26

GREML method

A second method used to estimate heritability is the genomic-relatedness-matrix restricted maximum likelihood (GREML) method on twins, which simultaneously considers the additive effect of all genotyped SNPs. The GREML method contains two steps. First, for each pair of individuals, the genetic similarity is estimated based on similarity in SNPs. Second, this genetic relatedness is used as the input as a random effect in a mixed linear model in which the genetic relatedness explains phenotypic similarity. This is done by a comparison of a matrix of pairwise genomic similarity to a matrix of pairwise phenotypic similarity.27 As childlessness is a binary trait, the liability threshold model applies. The estimate of variance explained by SNPs on the observed scale is transformed to that on the underlying continuous scale.28 We controlled for the first 20 principal components as well as birth year.

In this paper, we used a recently developed method that allows heritability to be estimated using both related and unrelated individuals.29 We estimated narrow sense heritability (h2), commonly estimated in twin or family studies, and heritability based on genotyped SNPs (h2snp). To estimate both h2 and h2snp, two covariance matrices were used: the identity-by-descent (IBD) and identity-by-state (IBS) matrices. The IBD matrix only includes individuals with relatedness above 0.05, for whom similarity of measured SNPs is an indicator of similarity over the whole genome. In the IBD matrix, genetic similarity for unrelated individuals (relatedness <0.05) is set to 0. The IBS matrix includes all individuals, but uses only information on unrelated individuals, because the information on related individuals is already captured by the IBD matrix. The IBS matrix thus captures only the genetic covariance for the SNPs in the genotyping array. We applied the joint model, which includes both the IBD and IBS matrices. The IBS matrix estimated h2snp and the IBD matrix estimates additional effects within families (h2–h2snp), which together provide an estimate for narrow sense heritability (h2).

To examine whether the same or different genetic variants are implicated in male and female childlessness, bivariate GREML analysis were conducted with male childlessness considered as the first trait and female childlessness as the second trait, also used by Lee et al. for sex differences in schizophrenia.30 The GCTA software31 was used for the GREML analysis.

Polygenic scores

The third method we used to assess the influence of genes on childlessness was creating the polygenic scores (PGS) for number of children ever born (NEB) and the age at which people have their first birth (AFB) and examine to what extend these PGS influence childlessness. The PGS is the sum of the risk alleles weighted by their effect size and is thus a summary measure of genetic variants that increase the risk for a trait.32 Different risk scores are created depending on P-value cutoffs, from using only genome wide significant SNPs (P-value of 5 × 10−8) to including all genotyped SNPs (P-value of 1). Polygenic scores are created with the PRSice tool in PLINK.33 An LD threshold of 0.1 and a distance threshold of 250 kb are used, indicating that if two SNPs are included in the PGS that have a correlation of 0.1 or greater, or a distance of 250 kb or smaller, one of the two SNPs is removed. The original sample of the GWAS from which we create the PGS included the STR sample. For that reason we used the GWAS results from the sample excluding STR and based our PGS on these results. We will run logistic regression models on childlessness with the standardized polygenic scores as independent variable controlling for year of birth and years of education. Only one individual from each twin pair is included in these analyses to meet the criteria of independent observations.

To assess sex differences in the effect of the polygenic risk score on childlessness fitted a logistic regression model including an interaction between the polygenic risk score and sex.


Background analysis

Around 12.6% of the women in the sample were childless, representative of childless women in Sweden, which has remained constant over the last decades at 13%.34 Around 14.3% of the men in the sample were childless, which is lower than the overall rate in Sweden, which ranged between 17 and 20% in the period studied.34 The correlation among MZ twins was in all cases higher than the correlation in DZ twins (Table 1). This is an indicator that genetic factors have a role in childlessness. Among opposite sex twins, the tetrachoric correlation is −0.06. This is much lower than for same sex DZ twins, which was 0.17 for men and 0.28 for women (only the difference between opposite sex pairs and female DZ pairs is significant). This is a first indicator that the genetic or common environmental influence on childlessness differs between the sexes.25 We now discuss results from each method, summarized in Figure 2.

Table 1 Concordance and tetrachoric correlations for childlessness in MZ and DZ twin pairs
Figure 2
figure 2

Results for heritability and sex differences of childlessness from the twin, GREML and PGS (polygenic score) models. Twin estimates are from Table 2; model 15 (best fitting model) where the genetic correlation was set to 0 and the heritability estimate was set as equal between men and women. The model in which the genetic correlation was freely estimated was estimated at 0.14. GREML heritability estimates are taken from the model where heritability was estimated separately for men and women. Odds ratios come from Table 4; model 4, in which we use the genome wide genetic risk score (P-value of 1). The estimate for women is the main effect in this model and the estimate for men is the main effect × the interaction for sex (1.262 × 0.753=0.950).

Results from the twin method

To estimate heritability in the twin model, univariate ACE models were estimated separately by sex. For males, ADE models were also estimated, since for male MZ twins, the tetrachoric correlation is more than twice as large as for male DZ twins, which is an indicator of dominant genetic effects. Both goodness-of-fit and parameter estimates for each model are listed in Table 2, with the best fitting models printed in bold. Comparing model 1 and 2, we see that for females, dropping C from the model does not significantly reduce model fit (P=0.796) and when comparing model 1 and 3, dropping A resulted in a borderline significant reduction in model fit (P=0.072). The best fitting model for females is thus model 2—the AE model. The estimated heritability in this model was 0.48 (95% CI 0.33–0.62). For males, when contrasting model 5 with 6 and 9 with 10, we see that dropping C or D did not result in a significantly decreased model fit (P=1 and 0.461 respectively). Comparing model 5 and 7 shows that dropping A resulted in a significant drop in model fit (P=0.017), suggesting that the best fitting model for males is the AE (model 6). A heritability estimate of 0.46 (95% CI 0.30–0.61) indicates that almost half of the variance in childlessness is attributed to genetic factors. For both sexes, there was no significant effect of shared environment, with the individual environment estimated slightly above 50%.

Table 2 Comparison of twin models and parameter estimates

To examine whether there were different genetic influences on childlessness for males and females we fitted sex limitation models. Goodness-of-fit statistics as well as parameter estimates are displayed in model 12 to 16 of Table 2. To examine qualitative sex differences we tested whether the genetic correlation (rg) between men and women was different from the theoretical value of 0.5 or from 0. As we did not find any shared environmental factors, we did not test whether the shared environmental correlation was different from the theoretical value of 1 and thus focused on testing the sex limitations on our AE models. When the genetic correlation could be freely estimated in model 12, the estimate is 0.142. Model 13 in which the genetic correlation was set to 0.5 has a significantly lower model fit (P=0.023). Model 14 in which the genetic correlation was set to 0 (which indicates that different genes have a role for males and females) has the best model fit, indicated by a value of –1.090 for the AIC. For this reason we adopt this model with the genetic correlation set to 0.

To test for quantitative sex differences, we examined whether the values for additive genetic influences (A) and individual environmental influences (E) could be set as equal between the sexes. This would indicate that the influence of the additive genetic and individual environment is equally important for men and women. Model 15, the homogeneous model, had a lower AIC value than the heterogeneous model 14 of –3.939 and for that reason the values for A and E were set as equal between men and women.

To examine whether the estimated heritability from this model was significantly different from 0, in model 16 we tested if the A parameter could be dropped from the model. Dropping A significantly reduced model fit (P-value=0.000), leaving the final best fitting model to be model 15. This was the model in which the genetic correlation between men and women was set to 0 without any effect of common environmental factors and equally high heritability estimates for men and women, with heritability estimated at 0.47 (95% CI 0.37–0.58) and individual environmental influences at 0.53 (95% CI 0.42–0.63) (see Table 2 and Figure 2). This indicates that there were no significant differences in the extent to which genes influence childlessness, but that there were qualitative genetic differences between male and female childlessness and that different genes influence childlessness in men compared to women.

Results from the GREML method

In the next step, we examined heritability in twins using the GREML method (Table 3). The estimated narrow sense heritability for the overall sample was 0.46 (95% CI 0.43–0.57). For females, the estimated narrow sense heritability was 0.59 (95% CI 0.41–0.77) and for males, 0.56 (CI 0.39–0.83) (Figure 2). All estimates were significantly different from 0. The overall estimate is not the average of the male and female estimate, since male-female pairs were included in the overall analyses, which reduced heritability. Although the estimates from the GREML method are slightly higher than the twin model estimates, they do not significantly differ. The twin model estimates for both sexes is 0.474, which lies within the 95 confidence intervals of the GREML estimates for males (CI 0.394–0.732) and females (CI 0.413–0.769).

Table 3 GREML analysis on childlessness in the twin sample

To further examine whether the same genes influence male and female childlessness, bivariate GREML models on childlessness were fitted to estimate the genetic correlation between childlessness by sex. The results are displayed in the bottom panel of Table 3 and in Figure 2. From the GREML analysis including twins, the genetic correlation between childlessness in males and females is –0.22, which is significantly different from 1 and not significantly different from 0. This indicates that, at least within this Swedish sample, a male and a female who have a higher genetic similarity do not have a higher similarity on childlessness. This shows that different genetic variants influence childlessness among males and females.

Results from the PGS models

We then tested the effect of genes on childlessness by fitting logistic regression models on childlessness and testing the effect of the PGS of AFB and NEB. Table 4 and Figure 2 display the results for the AFB score. The models that use the PGS for NEB are not displayed since we did not find any significant results, which is not surprising since only 3 genetic loci were significantly related to NEB. We display 4 models in Table 4. Model 1 includes the PGS including only genome wide significant SNPs, model 2 the PGS using all SNPs significant at the 0.05 level, model 3 all SNPs significant at the 0.5 level and model 4 all genotyped SNPs. For the PGS using only genome wide significant SNPs (P-value of 5 × 10−8) we find no significant effect on childlessness. For all other PGS’s we find a significant effect with odds ratios of around 1.25. This indicates that the odds of remaining childless are about 1.25 times as high for individuals with a 1 standard deviation higher score on the AFB genetic risk score. Individuals with a greater risk of having a higher age at first birth are thus more often childless.

Table 4 Results for the logistic regression models on childlessness using the polygenic risk scores for age at first birth using women and men over the age of 45 and 50, respectively

To test the sex differences in the effect of the polygenic risk score on childlessness, Table 4 and Figure 2 also display the results for the interaction between sex and the polygenic risk score. In all models except for the model that includes only genome wide significant SNPs, the interaction is significant and around 0.75. When looking at model 4, we see that the odds ratio for women is 1.262 and for men 0.950 (1.262 × 0.753). From this we can conclude that genes related to a higher age at first birth influence childlessness in women but not in men.

Robustness checks

The measure of childlessness for men and women are not exactly the same. For women both living children as children who are dead are taken into account, while for men only children who are still living. Furthermore, only men over the age of 50 are included while women over the age of 45 are included. We fitted sex limitation models as well as the logistic regression models using the PGS on the measure on living children for men and women as well as on all men and women over 45 and over 50 to examine if this influences the results. Results are displayed in the Supplementary Material and in the Supplementary Tables, and show that neither the different age selection for men and women, nor the different measures of childlessness for men and women has a major impact on the results from our study. We furthermore show that the proportion of men who are considered childless because all their children died is relatively small.


The goal of this study was to examine sex differences in the genetic influence on childlessness. We provide clear evidence that there are different genetic influences on childlessness for men and women. Although the level of the heritability of childlessness is approximately equal for both sexes, the actual genes that have a role vary. We infer this by applying classical twin modeling, the GREML method and a molecular genetic PGS approach. Future research should investigate which pathways genetic factors influence male and female childlessness. The question remains as to whether they are mainly physiological, behavioral, or whether gene–environment interactions work differently for men and women. For example, since women have a shorter reproductive window, the postponement of childbearing may have a larger impact on genetic factors influencing female childlessness.

We contrasted three different methods and compared their results in relation to male versus female childlessness. In the first classical twin method, we found that almost half of the variation (47%) in childlessness was due to genetic variation and that different genes influence male and female childlessness. We then applied the GREML method on twins. The main difference between the twin and GREML methods is that in the GREML method, genetic similarity between DZ twins is not assumed to be 50%, but measured on actual SNP similarity. Although the differences are not statistically significant, we find slightly higher heritability estimates of 59% with the GREML than the twin method, and also isolate that different genes influence male and female childlessness. Finally, using a PGS for AFB we found that genes previously found to be related to fertility timing are also related to childlessness for women, but not for men.

When comparing this study to previous twin studies on childlessness, we find comparable estimates of heritability in Finland (0.39 for women and 0.50 for men)13 and Denmark (for individuals born between 1880 and 1890 estimated at 0.45 for men and 0.70 for women and for individuals born between 1953 and 1964 estimates are 0.18 for men and 0.42 for women).35 One previous study on the STR found sex limitations in genetic influences on the total number of children,36 which is in line with our findings. We extend this study, however, by looking at the different reproductive trait of childlessness instead of number of children or the age at first birth, use a broader birth cohort of Swedish twins born between 1911 and 1958 (instead of 1915–1930 in Zietsch and colleagues36) and examine sex differences using three different methods.

Our findings somewhat contradict the recently published GWAS on human reproduction (AFB and NEB), where only some sex-specific genetic effects in fertility were reported (section 5, SI).5 In that study, out of the 12 independent loci isolated for human reproduction, two had a sex-specific effect. All signals found for AFB and two of the three signals for NEB had a consistent direction across the sexes. Using both LD score bivariate regression and GREML bivariate analyses that study found a high genetic correlation among men and women for both traits. It is notable, however, that for AFB, the LD score regression results suggested that there were in fact sex-specific variants for AFB (ie, the null hypothesis was rejected) and that genetic risk scores for NEB only significantly predicted childlessness in women and not in men (Barban et al.,5 Supplementary Table 21). Another notable difference is that the GWAS examined continuous variables (ie, AFB and NEB) and in this paper, we look at the binary outcome of childlessness. However, we do note that in our study we find much stronger sex differences, and more studies are needed to confirm our conclusions and to clarify under which circumstances and for which fertility traits genes influence men and women differently.

We argue that different genes influence childlessness in males and females. A counterargument might be that differences in childlessness similarity in opposite sex twin pairs are not due to different genetic influences, but rather to different family socialization processes. However, we find no shared family influences in same sex twin pairs which is in line with previous research that does not find that family characteristics such as sociodemographic background, family religiosity or socialization influence male and female fertility differently.37 This makes it implausible that there are shared environmental family influences that make opposite sex siblings more dissimilar than same sex twin pairs. Furthermore, also our results from the PGS on unrelated individuals confirm our findings.

A shortcoming of this study is that we were unable to distinguish between voluntary (childfree) and involuntary childlessness, which might result in heterogeneity within the group of childless individuals. Genetic factors could influence the desire or predisposition to have children, biological fecundity or other pathways leading to childlessness. However, for the sake of examining whether genetic factors can be passed on to the next generation by sex differences, these findings are relevant regardless of our lack of distinction between (in)voluntary childlessness. A more general concern often raised with regard to twin studies, is the question of whether the trait of interest is the same amongst twins compared to the overall population. In this sample we find that the proportion of women who remain childless is equal to the overall proportion of childless women in Sweden. For men, the percentage in our sample is lower than the national percentage.38 Previous research that examined this found no systematic differences between childlessness among twins and in the general population.39 It is thus very likely that the lower percentage in our male sample is not attributed to the difference between twins and the general population, but rather differences in the measurement or response rates related to male reproduction. Another concern in twin studies is whether DZ twins share their environment to the same extent as MZ twins, referred to as the equal environment assumption (EEA). For several outcomes, this assumption has been tested by comparing the influence of perceived and actual zygosity, which gained plausible support.40 Another study found that even though MZ twins share their environment to a higher extent than DZ twins, controlling for this rarely results in a significant reduction of the heritability estimate.41 Furthermore, previous research on unrelated individuals also found heritability of fertility traits.29, 42 This indicates that estimates from the twin study might be an overestimate of the actual heritability, but that the overestimation is unlikely to be severe.

Finally, there are three concerns for potential inflation of heritability estimates in the GREML models we apply. First, ascertainment bias from the overrepresentation of cases in case–control studies cannot be corrected for if extended genealogical data is used.29 However, given that in contrast to Zaitlen et al29 we only include pairs of twins in our study, this issue should not impact our results. Second, dominant genetic effects might bias narrow sense heritability estimates upwards in the GRM models.29 Our twin models report no evidence for dominant genetic effects for childlessness—which is in line with the findings from recent reproductive1 and molecular genetics research.5 Third—and as discussed previously—shared environmental influences amongst siblings might influence fertility and correlate with genetic relatedness, inflating heritability estimates. Zaitlen et al29 find no evidence for a bias due to shared environmental influences when introducing this approach to the literature for several traits including number of children—in line with findings from twin studies.41 Given the similarity of our findings from both the twin and GREML modeling approach, we are confident that our results are robust, also since twin models account for shared environmental influences and dominant genetic effects on childlessness.

This study has confirmed findings from previous research in showing a genetic influence on childlessness,35 but uniquely identified genetic sexual dimorphism, which may be one of the explanations for previous findings on the moderate heritability of fertility.