Introduction

The ratio of second (index) to fourth (ring) finger length (digit ratio or 2D:4D) has been hypothesised to reflect individual differences in prenatal exposure to sex hormones. More specifically, it has been suggested that a high concentration of testosterone1,2 or high ratio of testosterone-to-estradiol (T:E)3,4,5 during prenatal development results in low 2D:4D. Researchers typically measure digit ratios for the right hand (R2D:4D) and/or left hand (L2D:4D), though sometimes also examine directional asymmetry (D[R−L]). Low R2D:4D relative to L2D:4D has been suggested to reflect high prenatal androgen exposure6,7,8. Small-to-medium sized sex differences (male < female) are reliably observed for R2D:4D (d = 0.457) and L2D:4D (d = 0.376)9, but that for D[R−L] appears to be much smaller (d = 0.065)8. However, it should be noted that this effect size estimate for D[R−L] may be attenuated; this is due to it having been derived from self-measured finger lengths, which are known to be associated with high levels of random measurement error8. Despite enduring popularity, results from studies that have tried to validate digit ratio measures have been equivocal10,11. As there is a vast and rapidly growing literature examining 2D:4D in relation to an extensive range of variables across multiple research fields (e.g., psychiatry12, social science13,14, cancer research15, criminology16, sports science17,18), it is important to consider this lack of consistent evidence rather than rely on the assumption that 2D:4D is a valid and reliable proxy for the prenatal hormonal environment.

Some studies have explored the 2D:4D validity question by examining samples of individuals with medical conditions that affect endocrinological pathways. Complete androgen insensitivity syndrome (CAIS) is an X-linked recessive condition characterised by defective or absent androgen receptors. Despite the presence of normal (or even elevated) androgen levels, this condition results in testosterone being unable to exert physiological effects on the developing tissues. A female-typical phenotype therefore develops in presence of a male-typical (46XY) karyotype and prenatal hormonal environment. Two studies19,20 have reported feminised digit ratios in women with CAIS. However, both relied on small samples, and, notably, the variance in 2D:4D did not differ between 46XX and 46XY women. If differential prenatal exposure to the physiological effects of testosterone influences the development of 2D:4D, lower variance should be expected in the latter group than the former21. Researchers have also tested for associations between 2D:4D and variations in the trinucleotide CAG repeat sequence located on exon 1 of the androgen receptor gene, a genetic polymorphism believed to influence individual differences in androgen sensitivity22. However, although an early small-scale study23 reported that low frequencies of CAG repeats, indicative of high sensitivity to androgens, were associated with low (male-typical) R2D:4D and D[R−L], subsequent meta-analyses have not confirmed a meaningful association24,25,26.

Studies of 2D:4D have also been conducted in relation to congenital adrenal hyperplasia (CAH), a suite of conditions characterised by elevated prenatal androgen exposure. A recent meta-analysis detected significant effects for R2D:4D in males and L2D:4D in females (i.e., lower ratios in people with CAH compared to controls), but not for R2D:4D in females or L2D:4D in males10. Notably, the effect sizes observed were ~ 50% smaller than those reported in a meta-analysis published a decade previously9, implying that early studies may have overestimated the magnitude of these effects. Two studies have also reported feminised 2D:4D in 47XXY men with Klinefelter syndrome27,28. However, these findings are difficult to interpret considering that testosterone concentrations measured from amniotic fluid sampled at 16–20 weeks’ gestation did not differ between males with and without this condition29.

Evidence of the validity (or lack thereof) for 2D:4D derived from studies of complex medical conditions, such as CAIS, CAH, and Klinefleter syndrome, may be questioned because such conditions affect a wide range of developmental processes. This makes isolating effects attributable to prenatal sex hormones challenging, leading some researchers to examine 2D:4D in more generalisable populations. Twin studies indicate moderate-to-high heritability for 2D:4D, with additive genetic factors explaining most of the inter-individual variation30,31,32,33. The Twin Testosterone Transfer (TTT) hypothesis (see Ahrenfeldt et al.34) has also been examined. This predicts that females with male cotwins will exhibit lower (i.e., more male-typical) 2D:4D ratios than females with female cotwins due to elevated testosterone exposure associated with gestating in close proximity to a male. Although two small-scale studies30,35 initially provided some confirmatory evidence, others, including one with a much larger sample size, did not36,37,38. Medland and Loehlin32 further reported that 2D:4D ratios were no more similar within dizygotic (DZ) twin pairs than they were within DZ twin/non-twin sibling dyads. This is also inconsistent with a TTT effect on 2D:4D because DZ twins, who, on average, share the same amount of genes identical by descent as non-twin full siblings, would be expected to show elevated concordance for 2D:4D due to their shared prenatal hormonal environment. However, it should be noted that the pattern of results obtained from TTT studies in humans is generally inconsistent34, suggesting that if there is an effect for 2D:4D it is likely to be small in magnitude.

Another approach taken in exploring the 2D:4D validity question is to examine digit ratios in relation to actual hormone concentrations obtained during prenatal development. Lutchmaya et al.3 reported a negative correlation between the T:E ratio in second trimester amniotic fluid and R2D:4D measured at 2-year follow-up. However, they did not observe a comparable effect for L2D:4D (and did not examine D[R−L]). Ventura et al.39 later reported a significant negative correlation between amniotic testosterone and L2D:4D (but not R2D:4D) in female (but not male) neonates. However, a reanalysis of the data of Ventura et al.39 showed no correlation between amniotic testosterone and D[R−L]40. Importantly, the statistically significant findings of the studies by Lutchmaya et al.3 and Ventura et al.39 could not be replicated in an independent cohort41.

Some researchers have examined sex hormones measured from the maternal circulation during pregnancy. However, maternal serum testosterone levels assayed during the second and third trimesters has been reported not to correlate with testosterone in second trimester amniotic fluid or umbilial cord blood at birth42. Likewise, another study observed no significant correlations between second trimester testosterone concentrations measured from maternal plasma, fetal plasma, and amniotic fluid43. However, positive correlations have been reported for estradiol measured from second and third trimester maternal serum and second trimester amniotic fluid42. Despite the equivocal evidence for meaningful associations between maternal and fetal sex hormone concentrations, those measured from the maternal circulation have been observed to correlate with phenotypic outcomes in the mothers’ offspring. For instance, elevated maternal testosterone was reported to predict male-typical gender role behaviour in daughters44 (though see45) and maternal estradiol was reported to correlate positively with autistic traits in sons46. There is also evidence of a weak negative correlation between a mother’s second trimester plasma testosterone concentrations and the 2D:4D of her child39,47, although this effect has not been observed by all studies48. It should also be noted that, although there may be limited transfer of testosterone from mother to fetus via the placenta, testosterone levels are moderately heritable49, meaning that correlations between maternal testosterone and outcomes observed in their offspring could reflect genetic rather than hormonal effects44,45.

In addition to hormones measured from amniotic fluid and maternal circulation, some researchers have examined concentrations present in umbilical cord blood assayed at birth. Taken together, the findings of these studies indicate no association with 2D:4D48,50,51,52,53,54,55. This is consistent with hormones sampled in this way representing late gestation56, whereas sexual dimorphism in 2D:4D has been detected much earlier57,58. Considering this, a possible reason for the overall inconsistency in findings from studies attempting to validate digit ratios by correlating them with circulating hormone levels is that such studies have typically assayed those hormones during the second or third trimesters, whereas the critical period for 2D:4D development may exist towards the end of the first trimester59.

As it is not possible to measure hormones from the fetal circulation for research purposes, the current study investigates whether maternal sex hormones assayed during late first trimester/early second trimester correlate with the digit ratios of her child. Barrett et al.60 recently reported that neither T nor E measured from first trimester maternal serum correlated with infants’ digit ratios (n = 154 males; n = 167 females). However, they did not examine the T:E ratio. Although not numerically or physiologically independent of the individual hormonal measurements, Lutchmaya et al.3 notably reported that the T:E ratio in amniotic fluid was a significant predictor of R2D:4D even though T and E themselves were not.

We pre-registered our analysis plan on the Open Science Framework (https://osf.io/bmp6h) and predicted that neither maternal T nor E assayed from late first trimester/early second trimester maternal circulation would be a significant predictor of digit ratio when measured in offspring at 18–22-month follow-up. However, we also predicted that there would be negative correlations between the T:E ratio and R2D:4D and L2D:4D (but not D[R−L]). The reason we predicted no effect for D[R−L] is that evidence for the validity of this measure is particularly weak. Although a small study (n = 26 mother–offspring dyads) reported that maternal urinary testosterone-to-estrone conjugate levels in pregnant Titi monkeys correlated negatively with D[R−L] in their offspring, the effect disappeared after covariates were controlled for61. Human studies have reported that D[R−L] does not differ between people with and without CAH10, and that it does not correlate with sex hormones measured from amniotic fluid40,41, umbilical cord blood48,50 and second trimester maternal plasma40.

Results

Maternal hormone data (T and/or E) were present for n = 122 (56.22%), digit ratio (direct and/or photocopy) for n = 95 (43.78%), and both for n = 56 (25.81%) (n = 31 females, n = 25 males). For this subsample, mean maternal age at scan was 33.36 years (SD = 4.05), mean birth weight was 3406.06 g (SD = 503.95), and mean age at follow-up (adjusted for gestational age) was 455.94 days (SD = 97.35). Only n = 2 (3.57%) of these mothers reported polycystic ovary syndrome (PCOS), and most experienced hirsuitism (no affected areas, n = 2 [3.57%]; one affected area, n = 33 [58.93%]; two or more affected areas, n = 21 [37.50%]).

Intraclass correlation coefficients (ICC) (two-way mixed, single measures with absolute agreement definition) were computed to determine inter-rater reliability (please note that in our pre-registration it was stated that only one set of direct measurements was obtained: this was incorrect [two sets of direct measurements were available for a subsample of participants]). These revealed that inter-rater reliability was low for the direct measures: R2D:4D, ICC (n = 53) = 0.444, p < 0.001; L2D:4D, ICC (n = 52) = 0.614, p < 0.001; D[R−L], ICC (n = 52) = 0.462, p < 0.001, but higher for the photocopy measures: R2D:4D, ICC (n = 70) = 0.829, p < 0.001; L2D:4D, ICC (n = 72) = 0.885, p < 0.001; D[R−L], ICC (n = 66) = 0.811, p < 0.001. Contrary to expectation, the direct and photocopy measurements were uncorrelated: R2D:4D, r(65) = 0.065, p = 0.602; L2D:4D, r(66) = − 0.029, p = 0.814; D[R−L], r(61) =  − 0.017, p = 0.893.

Descriptive statistics for digit ratio and maternal hormone variables are presented in Table 1. The hormonal measures did not differ in regard to fetal sex. L2D:4D measured from photocopies was lower in males than females, though all other comparisons were non-significant. Compared with direct measures, photocopies yielded lower R2D:4D (direct: M = 0.97, SD = 0.08; photocopy: M = 0.93, SD = 0.05), t(66) = 3.424, p = 0.001, d = 0.572, and L2D:4D (direct: M = 0.97, SD = 0.07; photocopy: M = 0.94, SD = 0.05), t(67) = 2.642, p = 0.010, d = 0.460. There was no difference for D[R−L] (direct: M = 0.00, SD = 0.08; photocopy: M = – 0.01, SD = 0.06), t(62) = 1.068, p = 0.290, d = 0.192. Due to low reliability of the direct measures, and because they did not correlate with those obtained from photocopies, further analyses utilise photocopy measures only.

Table 1 Descriptive statistics for digit ratio (2D:4D) and hormone variables.

Although not in our pre-registration plan, we report bivariate associations between maternal hormones and children’s digit ratios to facilitate comparison with studies that have not controlled for covariates (Table 2). T:E correlated negatively with R2D:4D and D[R−L] but there was no association with L2D:4D (Fig. 1). None of the other hormone-digit ratio correlations were statistically significant. In our pre-registration, we specified that we would conduct bootstrapped (10,000 resamples) hierarchical multiple regression analyses (Step 1: enter covariates; Step 2: enter T, E, and the T × sex and E × sex interaction terms; Step 3: enter T:E ratio and the T:E × sex interaction term). However, in the interests of parsimony, we decided instead to run separate models for each predictor (along with its respective interaction term with sex). We included the following covariates: infant sex (0 = female, 1 = male), maternal PCOS status (1 = absent, 2 = present), maternal hirsutism (1 = no areas affected, 2 = one area affected, 3 = more than one area affected), infant’s birth weight (grams), infant’s age at follow-up corrected for gestational age (days) (we did not include child’s birth length as specified in our pre-registration because this variable was not made available). Since the covariates were included for the sole purpose of controlling for factors that may be associated with hormonal profiles during pregnancy and infant growth factors that could affect 2D:4D, we report in Table 2 the effect size estimates and bias corrected and accelerated 95% confidence intervals (BCa 95% CIs) for the predictors and not the covariates. Although T:E was no longer significantly associated with R2D:4D and D[R−L], T correlated positively with L2D:4D and negatively with D[R−L].

Table 2 Associations between maternal hormone concentrations and offspring digit ratios.
Figure 1
figure 1

Scatterplots showing the associations between maternal T:E ratio and children’s (a) R2D:4D, (b) L2D:4D, and (c) D[R−L]. Raw data (not controlled for covariates) are shown; T:E ratio is multiplied by 1000.

Discussion

The current study aimed to investigate whether maternal circulatory T:E ratio in early pregnancy predicts digit ratios in infancy. Our main prediction was that a high T:E ratio would correlate with low (male-typical) R2D:4D and L2D:4D, and that there would be no association with D[R−L]. Although we detected the predicted association with R2D:4D, T:E did not correlate with L2D:4D, and there was a negative correlation with D[R−L]. However, once covariates (including sex, and the relevant hormone × sex interaction term) were controlled for, T:E did not correlate significantly with any digit ratio variable, although T correlated positively with L2D:4D and negatively with D[R−L]. It is unclear why the results changed in this manner with the inclusion of covariates. However, we do not attempt to provide specific explanations for this, as they would necessarily be speculative in nature.

A negative correlation between maternal T:E ratio and offspring R2D:4D would be consistent with the theory that differential prenatal exposure to sex hormones affects development of digit ratios1,2,4,6,7, and in line with results from studies of human infants3 and experimental animal research5. However, this effect was not significant after controlling for covariates, and other research has reported no correlation between R2D:4D and the T:E ratio measured from amniotic fluid41 or between R2D:4D and the androgen-to-estrogen ratio measured from perinatal umbilical cord blood50. Additionally, the only other study to investigate offspring digit ratio in relation to maternal sex hormones in early pregnancy60 did not examine the T:E ratio. Further research using larger samples will be required to determine whether these variables are meaningfully related.

Contrary to our pre-registered prediction, a significant negative correlation between maternal T:E ratio and D[R−L] was observed in bivariate association. However, this did not remain significant after controlling for demographic and obstetric covariates. This may therefore be considered consistent with observations that D[R−L] does not correlate with sex hormones measured from amniotic fluid40,41, umbilical cord blood48,50 and second trimester maternal plasma40, and that it does not differ between people with and without CAH10. Although Baxter et al.61 recently reported a significant negative correlation between the urinary testosterone-to-estrone conjugate ratio of pregnant Titi monkeys and the D[R−L] of their offspring, the effect reported there also did not retain statistical significance once covariates had been controlled for. It further remains unclear to what degree, if at all, maternal urinary sex hormone concentrations relate to those of the developing fetus.

The current study adds to a literature replete with inconsistent findings and replication failures11. In particular, there appears to be a concerning pattern in which smaller studies attempting to test the 2D:4D validity question report positive findings and larger ones do not. For example, small studies of CAH1,62, which have been heavily cited in the literature, have reported significant effects, whereas larger ones have not63,64,65. The same pattern is observed for twin research, with some early small studies observing significant effects30,35 but the largest in the area reporting only null findings36. Similarly, the negative correlation between amniotic T:E ratio and R2D:4D reported by Lutchmaya et al.3 was not replicated in a larger cohort41. The current study may also fit this general pattern: although some statistically significant effects were observed, the larger study by Barrett et al.60 found no correlation between early pregnancy maternal sex hormone concentrations and the 2D:4D of their children.

These issues present a serious challenge to the credibility of digit ratio research, particularly when considered in conjunction with the ease with which data can be collected and the considerable researcher degrees of freedom afforded at the analysis stage66,67. For instance, it has been noted that researchers often examine several digit ratio predictor variables in the same study (e.g., R2D:4D, L2D:4D, D[R−L], and the average of R2D:4D and L2D:4D [M2D:4D]) and also stratify their analyses by sex10,14,68. Unless effective controls for alpha inflation are in place, this necessarily increases the chances of observing statistically significant effects, and, hence, making Type 1 errors. This problem is further compounded when researchers measure multiple outcome variables68, and particularly so if not all of those outcomes are reported67,69. A further issue is that one significant effect in the predicted direction may be taken as evidence in favour of rejecting the null hypothesis despite the greater weight of evidence being in favour of its acceptance10. To take a conservative example: if researchers were to examine R2D:4D and L2D:4D, separately in males and females, in relation to a single outcome variable, one statistically significant effect (e.g., for R2D:4D in males) might be emphasised over three concurrent null results for the same hypothesis (i.e., null effects for L2D:4D in males and for R2D:4D and L2D:4D in females). Such practices do not only lead to biased interpretations of individual datasets, but they also make it difficult to detect publication bias10, a problem which may be prevalent within the digit ratio literature10,14,70. This is because if there is a bias for publishing positive findings, it may be irrelevant in this field whether a statistically significant effect is found in relation to the right hand or left hand or in males or in females etc. An obvious way to address this issue going forwards is to pre-register studies with specific a priori hypotheses, predictions, and analysis plans, and to note in publications where analyses have deviated from these14,71,72.

R2D:4D and L2D:4D values in the current study were lower when measured from photocopies than when measured directly from participants’ hands. This corroborates findings from studies of older populations73,74,75, and implies that the process of photocopying may distort the soft tissue in a way that acts differentially across the second and fourth fingers. What was more surprising was that the direct and photocopy measures were uncorrelated in our sample, as previous research has identified both techniques to be reliable76,77. As (after removal of outliers) 2D:4D measurements taken directly (self-measured) from young adults have been reported to correlate moderately (R2D:4D, r = 0.518; L2D:4D, r = 0.409) with those taken from photocopies (researcher-measured)78, the lack of intercorrelation observed within the current study may simply reflect the difficulty associated with obtaining accurate measurements directly from the hands of toddlers. Correlations between direct and photocopy measures of digit ratio in young children may generally be fairly low (e.g., R2D:4D: r = 0.421; L2D:4D: r = 0.373 [Constantinescu, 2009, p. 3265]) but it remains unclear why we observed no association at all. We suggest that, despite the slight distortion caused, photocopies/scans may be the most effective method of obtaining reliable data in this population. Although one might still make the case for using direct measurements, we examined those derived from photocopies because they are more likely to be reliable. This is indicated by the higher ICCs and lower SDs observed in the current study, and by previous research reporting computer-assisted measurements to yield the most reliable results76,77. Additionally, we used these measures because it would be possible to check the data against the original photocopies whereas this is not so for the direct measurements. Analysis of both sets of measurements would have increased the number of null hypothesis significance tests used, and, therefore, the chances of making Type 1 errors.

The current study has strength in that it not only reported on two hormones in isolation and their interactions with fetal sex, but also examined the T:E ratio, a variable posited to play a key role in the determination of digit ratios3,4,5,6. It is additionally important that sex hormones were assayed in close proximity to the time at which they are hypothesised to exert their greatest influence on digit ratio development59. However, some limitations should also be considered. Though comparable with previous research3,41,48,50, our sample is only modest in size; it therefore, lacks the statistical power required to detect small effects. Second, although inter-rater reliability for digit ratios measured from photocopies was high, it is notable that these measures did not correlate with those taken directly from participants’ hands. Third, fetal steroids were not measured directly, and it remains unclear how representative maternal serum samples may be of the fetal circulation. Previous research has shown estradiol measured from second trimester amniotic fluid to be positively correlated with that present in maternal serum sampled in the second and third trimesters42, and human chorionic gonadotropin (hCG) levels measured from amniotic fluid and maternal serum are positively correlated during the second (but not third) trimester79. Conversely, although there are exceptions (e.g.80,81), most studies have found maternal serum testosterone to be unrelated to fetal sex42,44,82,83,84,85, and testosterone concentrations in the maternal and fetal plasma have been reported to be uncorrelated43. Considering that digit ratios have been hypothesised to relate primarily to sex steroid concentrations present within the fetal rather than maternal circulation, the findings of the present study should therefore be interpreted cautiously.

Conclusions

The current study contributes the first pre-registered analysis of 2D:4D in relation to maternal sex hormone concentrations, as well as the first empirical test of whether the right-left difference in digit ratios (D[R−L]) is correlated with hormonal concentrations measured during early pregnancy. We attempted to test the hypothesis that mothers’ T:E ratio is a predictor of their offspring’s digit ratios. Although we observed statistically significant effects linking high T:E with low R2D:4D and D[R−L] at 18–22-month follow-up, neither effect remained statistically significant once covariates had been controlled for. Furthermore, multivariate analyses revealed that T correlated positively with L2D:4D and negatively with D[R−L], the first of these effects being inconsistent with well established theory. Taken together, the results of this study suggest that further research with larger sample sizes will be required to determine whether digit ratios are valid proxy measures of maternal sex hormone exposure.

Methods

Participants

Mothers (n = 217) were recruited early in their pregnancy, during or before their routine 20-week ultrasound scan, as part of the Cambridge Ultrasound Siblings and Parents Study (CUSP) at the Rosie Hospital, Cambridge University Hospitals NHS Foundation Trust. Ethical approval was provided by the East of England Cambridge Central Research Ethics Committee (ref: 16/EE/0004) and the Research and Development department of Cambridge University Hospitals. All mothers gave written informed consent for access to their pregnancy-related clinical records, test results, and biological samples obtained during routine clinical care, and the procedures were conducted in accordance with the Declaration of Helsinki. Inclusion criteria were as follows: (1) little/no consumption of alcohol during pregnancy, (2) no smoking or recreational drug use during pregnancy, (3) a singleton fetus whose measurements did not indicate intrauterine growth restriction or large-for-gestational age, (4) absence of any major fetal anomalies, and (5) birth of a clinically healthy baby. At time of scan mothers were asked to complete a pregnancy history questionnaire to self-report metabolic, reproductive, and diagnosed conditions. Detailed description of this sample has already been reported elsewhere46,86.

Hormone assays

Serum samples were collected by a specialist phlebotomist at the Rosie Hospital and stored at − 80 °C, as part of a national screening programme at the end of first trimester/start of second trimester (M = 12.7 [SD = 0.7] weeks gestation46) for biomarkers of Down’s Syndrome and other conditions. Samples from CUSP participants (n = 122) were thawed and transferred to separate vials (1 ml aliquots per sample), which were anonymised and sent for analysis at the Core Biochemical Assays Laboratory (CBAL) at Addenbrookes Hospital, Cambridge.

Concentrations of testosterone (T), estradiol (E), dehydroepiandrosterone sulphate (DHEAS), progesterone (P), and sex hormone-binding globulin (SHBG) were measured. Samples were analysed on a DiaSorin Liaison® XL automated immunoassay analyser using a one-step competitive chemiluminescence immunoassay for each hormone and two monoclonal antibodies for each peptide. All reagents, standards and consumables are those supplied by DiaSorin (DiaSorin S.p.A, 13040 Saluggia [VC], Italy). Although SHBG was assayed, allowing for estimation of the free testosterone index and the free estradiol index, we instead include the total hormone levels in our statistical models. This is because SHBG does not easily cross the placenta87, and so it is unclear whether SHBG in the maternal serum is reflective of fetal bioactivity.

Digit ratio (2D:4D) measurements

Parents and infants were invited for an in-person follow-up visit to measure 2D:4D and physical growth (infants’ Mage = 19.87, SD = 0.86, range = 18.20–21.95 months). Direct measures of finger length were taken by Research Assistants using a standard tape measure. A Canon LiDE 300 flatbed scanner was used to scan infants’ left and right hands, and colour images were made at a resolution of 2400 × 4800 dpi. 2D:4D ratios were calculated from these by two researchers (GR and EA) using AutoMetric 2.2 for Windows88.

Statistical analysis

We computed the averages for R2D:4D, L2D:4D, and D[R−L] across the two sets of measurements (separately for direct and photocopy measures). We checked for correlation (Pearson’s tests) and differences (paired samples t-tests) between the direct and photocopy measures, and used independent samples t-tests to examine for sex differences. We then used bootstrapped (10,000 resamples) Pearson’s correlations and multiple linear regression analyses to determine whether the maternal hormones (T, E, and T:E ratio) were associated with digit ratio (R2D:4D, L2D:4D, and D[R−L]). We included the following covariates: child’s sex (0 = female, 1 = male), maternal polycystic ovary syndrome (PCOS) status (1 = absent, 2 = present), maternal hirsutism score (1 = no areas affected, 2 = one area affected, 3 = more than one area affected), child’s birth weight (grams), child’s age at follow-up corrected for gestational age (days). We included covariates to control for factors related to the maternal hormonal environment (PCOS and hirsuitism are associated with elevated androgen concentrations89,90) and infant factors related to growth trajectories that could affect 2D:4D (2D:4D may fluctuate considerably during early postnatal life91,92; it may also correlate with birth weight, although empirical findings are mixed93,94,95,96). We also included infant sex as a covariate because 2D:4D exhibits marked sex differences9 and because associations with prenatal hormonal variables could differ between males and females39.

When utilising a bootstrapping approach, a specified number of resamples (in this case, 10,000) the size of the original is drawn with replacement from the available data. The chosen statistic is then computed for each resample. These resamples are considered equivalent to samples derived in the usual way from an infinitely large population with similar characteristics to those of the observed data. The variation among resamples indicates what would be expected from sampling variation under such circumstances (see Loehlin et al., p. 300–30197). We used bootstrapping because it does not assume a normal distribution of the error term98, and may be advantageous when examining variables that exhibit marked deviations from the normal distribution as well as presence of datapoints that would be considered outliers in the context of a normal distribution41,97,99,100.