The impact of reproductive factors on the metabolic profile of females from menarche to menopause

Clayton, Gemma L.; Borges, Maria Carolina; Lawlor, Deborah A.

doi:10.1038/s41467-023-44459-6

Download PDF

Article
Open access
Published: 06 February 2024

The impact of reproductive factors on the metabolic profile of females from menarche to menopause

Nature Communications volume 15, Article number: 1103 (2024) Cite this article

1698 Accesses
5 Altmetric
Metrics details

Subjects

Abstract

We explore the relation between age at menarche, parity and age at natural menopause with 249 metabolic traits in over 65,000 UK Biobank women using multivariable regression, Mendelian randomization and negative control (parity only). Older age of menarche is related to a less atherogenic metabolic profile in multivariable regression and Mendelian randomization, which is largely attenuated when accounting for adult body mass index. In multivariable regression, higher parity relates to more particles and lipids in VLDL, which are not observed in male negative controls. In multivariable regression and Mendelian randomization, older age at natural menopause is related to lower concentrations of inflammation markers, but we observe inconsistent results for LDL-related traits due to chronological age-specific effects. For example, older age at menopause is related to lower LDL-cholesterol in younger women but slightly higher in older women. Our findings support a role of reproductive traits on later life metabolic profile and provide insights into identifying novel markers for the prevention of adverse cardiometabolic outcomes in women.

Associations of dietary patterns with brain health from behavioral, neuroimaging, biochemical and genetic analyses

Article Open access 01 April 2024

Fasting-mimicking diet causes hepatic and blood markers changes indicating reduced biological age and disease risk

Article Open access 20 February 2024

Development and validation of a new algorithm for improved cardiovascular risk prediction

Article Open access 18 April 2024

Introduction

Markers of women’s reproductive health, such as age at menarche, parity and age at menopause, have been associated with several common chronic conditions, including cardiometabolic diseases^1,2,3,4,5,6 and breast, ovarian and endometrial cancer^{7,8,9,10,11,12}. Some attempts have been made to explore the extent to which these associations are causal, as opposed to explained by residual confounding, using approaches such as Mendelian randomisation (MR) and negative control designs, which are less prone to bias by key confounders from conventional observational studies. MR studies suggest a direct positive effect of age at menarche on breast cancer and an indirect inverse effect via body mass index (BMI)¹³, as well as a possible bidirectional relationship between age at menarche and BMI^13,14. MR also supports a protective effect of older age at first birth on type 2 diabetes and cardiovascular diseases¹⁵ and lower mean levels of BMI, fasting insulin and triglycerides in women and men¹⁶, while a partner negative control study provides some evidence of a ‘J-shaped’ effect of parity on coronary heart disease risk⁵. In addition, evidence from MR studies indicate that older age at menopause increases the risk of breast, endometrial and ovarian cancer, reduces the risk of bone fractures and type 2 diabetes, and do not substantially affect BMI or cardiovascular diseases risk¹⁷.

Metabolites could act as mediators of the relationship of reproductive markers, and related hormonal changes, with chronic diseases^18,19,20. Determining the effect of women’s reproductive markers on multiple metabolites would be the first step to exploring this and could provide crucial insights into mechanisms underlying women’s long-term health. We have previously shown marked changes in metabolites, such as lipids, fatty acids, amino acids and inflammatory markers during pregnancy²⁰, through the menopausal transition²¹, and among women on hormonal contraceptives containing oestrogen²². Many of these same metabolic measures are also related to cardiovascular diseases¹⁹ and some cancers^23,24,25,26. The aim of this paper is to explore the extent to which women’s reproductive markers have a causal effect on 249 metabolic measures (covering lipids, fatty acids, amino acids, glycolysis, ketone bodies and inflammatory markers). We focus on three reproductive traits that represent key events in women’s reproductive lives: (i) age at menarche, a marker of puberty timing, (ii) parity, a marker of repeated exposure to the physiological challenges of pregnancy, and (iii) age at menopause, a marker of reproductive aging. We explore the causal relationships between these reproductive markers and metabolic measures by triangulating evidence²⁷ across multivariable regression, a negative control design (for parity only), and MR (Fig. 1). Given each of these approaches has unique strengths and limitations, results that agree across them are less likely to be spurious²⁷.

**Fig. 1: Infographic summarising the different approaches taken to assess the relationship between reproductive traits and metabolites.**

In this paper we show that reproductive factors are likely to impact females’ metabolic profile later in life. Evidence supporting a relation between later pubertal timing and a less atherogenic metabolic profile is largely explained by adult BMI. Findings linking higher parity to a more atherogenic profile were supported by the negative control analyses but imprecisely estimated in Mendelian randomisation. Evidence supporting a relation between slower reproductive aging and a less atherogenic metabolic profile was mostly observed among younger women. These results could contribute to identifying novel markers for the prevention of adverse cardiometabolic outcomes in women and/or methods for accurate risk prediction.

Results

We used data from 65,699 UK Biobank female participants with 249 metabolic measures quantified by nuclear magnetic resonance (NMR). Self-reported age at menarche (in years), parity (in number of live born children) and age at menopause (in years) were reported at baseline when participants mean age was 56 years old (range: 38–73). NMR metabolites were measured on blood samples taken at baseline or first repeat assessment (more details in methods). The characteristics of these participants are shown in Table 1 (and split by each of our reproductive markers (categorised) in Supplementary data 1–3). At recruitment (baseline) women were aged (mean) 56 (SD = 8.0) years, 21% drank three or four times a week and 40% were previous/current smokers. 81% of women had one or more live births whilst the mean age of menarche was 13 years (SD = 1.3). 59% (37,248) women reported they went through a natural menopause with a mean age of menopause of 49.7 years (SD = 5.1). Supplementary data 4 shows the distribution of NMR metabolic measures among UK Biobank females. The proportion of women with missing data across metabolic measures ranged from 0.3% to 6.1%.

Table 1 Distribution of characteristics of UK Biobank participants (females only) with NMR metabolomics data

Full size table

We used three approaches relying on different assumptions to explore the causal role of women’s reproductive markers on later life metabolic profile. For the first approach (‘multivariable regression’), we used linear regression models to estimate the association of reproductive markers with metabolic measures after adjusting for age at baseline, education and body composition at age 10. In sensitivity analyses, for the 55 non-derived metabolites, we categorised age at menarche, parity and age at natural menopause, tested for a linear trend and, where there was evidence of non-linearity, fit restricted cubic splines. For the second approach (‘negative control design’ – only applicable for parity), we used linear regression models to test whether number of live born children in men was associated with their metabolic measures. Men do not experience the repeated physiological stress of pregnancy but are likely to demonstrate the same associations of confounders (eg. socioeconomic position, BMI, smoking) with number of live births. Therefore, similar associations of number of live births with metabolic measures between men and women would indicate bias (e.g. due to confounding) rather than a causal effect of being exposed to the physiological stress of pregnancy on women’s metabolic profile. For the third approach (‘MR’), we selected single nucleotide polymorphisms (SNPs) as genetic instruments for each reproductive marker from previous genome-wide association studies (GWAS) and performed two-sample MR to estimate the effect of reproductive markers on metabolic measures using the standard inverse variance weighted (IVW) method. For both multivariable regression and MR analyses, we adopted P-value < 0.00093, which accounts for the approximate number of independent tests as detailed in ‘Statistical analyses’.

Age at menarche

In the main multivariable regression analyses (adjusting for age at baseline, education and body composition at age 10), older age at menarche was associated with higher concentrations of glutamine, glycine, albumin, apolipoprotein A1, cholines, phosphatidylcholines, and sphingomyelins but lower concentrations of alanine, branched-chain amino acids (isoleucine, leucine and valine), aromatic amino acids (phenylalanine and tyrosine), fatty acids (monounsaturated fatty acids (MUFA), omega-3 polyunsaturated fatty acids (PUFA), and saturated fatty acids (SFA)), glycolysis-related metabolites (glucose, lactate, pyruvate), acetoacetate, and glycoprotein acetyls (GlycA) (P < 0.00093) (Fig. 2 and Supplementary data 5). Older age at menarche was also associated with numerous lipoprotein-related traits at P < 0.00093, particularly with higher numbers of particles, size, and lipid content in high-density lipoproteins (HDL) and lower numbers of particles, size, and lipid content in very low-density lipoproteins (VLDL) (Fig. 2). The associations of age at menarche with HDL-related traits were mostly due to larger HDL subclasses (i.e. medium, large and very large particles), while associations with VLDL-related traits were observed across VLDL subclasses (Supplementary Fig. 1 and Supplementary data 5). In sensitivity analyses with further adjustments for BMI, smoking and alcohol status at baseline, findings for an association of older age at menarche were largely or completely attenuated towards the null for most metabolic measures with few exceptions, such as glutamine, glycine, omega-3 PUFA, pyruvate, lactate, and acetoacetate (Supplementary Fig. 2). There was evidence of non-linearity between categories of age at menarche (<13, 13–14, >14 years) and 17 metabolites (Supplementary data 6 and Supplementary Fig. 3). Restricted cubic spline models (with 3 knots at ages 11, 13, and 15 years) generally showed an increase in albumin, apolipoprotein A1, cholines, docosahexaenoic acid (DHA), linoleic acid (LA), and phosphatidylcholines with older age at menarche until approximately age 13, in line with our linear association, and then began to flatten and/or decrease (Supplementary data 7 and Supplementary Fig. 4). Whilst older age at menarche was related to a decrease in GlycA until ~13 years and then began to flatten.

For the MR analyses, we selected 389 SNPs as instruments for age at menarche, which explained 7.4% of its phenotypic variance with a corresponding mean F statistic of 63 (Supplementary data 8). Overall, MR estimates using IVW were in agreement with multivariable regression estimates in direction and magnitude (Fig. 2 and Supplementary Fig. 1); however, due to the higher degree of uncertainty for IVW estimates, no result passed our threshold for multiple testing correction (P < 0.00093). Following reviewer’s comments, we repeated the IVW analyses for a larger sample of women (N = 216,514–239,803) for the eight biomarkers assayed using clinical chemistry techniques that matched measures in the NMR metabolomics platform — i.e. albumin, apolipoprotein A1, apolipoprotein B, glucose, HDL-cholesterol, low-density lipoprotein (LDL)-cholesterol, total cholesterol, and triglycerides. These results provided further evidence of older age at menarche being related to higher albumin, apolipoprotein A1, HDL-cholesterol, and lower triglycerides (P < 0.00093) (Fig. 5). Given the a priori evidence of bidirectional effects between age at menarche and BMI, we also performed multivariable IVW accounting for adult BMI to estimate the direct effects of age at menarche on metabolic measures, which resulted in estimates partly or completely attenuated to the null for most metabolic measures with few exceptions, such as glutamine and glycine (Supplementary Figs. 5 and 6).

Parity

In the main multivariable regression analyses (adjusting for age at baseline, education and body composition at age 10), higher parity was related to higher concentrations of glycine and leucine, but lower concentrations of histidine, fatty acids (DHA, Omega 3, Omega 6 PUFA), pyruvate, ketone bodies (acetate, acetoacetate, acetone and β-hydroxybutyrate), and apolipoprotein A1 (P < 0.00093) (Fig. 3 and Supplementary data 9). Higher parity was also associated with numerous lipoprotein-related measures at P < 0.00093, particularly with lower and higher number of particles, size, and lipid content for HDL and VLDL, respectively, as well as lower size of LDL particles (Fig. 3). The associations of parity with lipoprotein-related measures were observed across most VLDL and HDL subclasses, whereas associations with LDL-related measures were mostly driven by larger LDL particles (Supplementary Fig. 7 and Supplementary data 9). In sensitivity analyses with further adjustments for BMI, smoking and alcohol status at baseline, higher parity associations were consistent for glycine, histidine, fatty acids, pyruvate, ketone bodies, apolipoprotein A1, and partly attenuated towards the null for VLDL- and HDL-related traits (Supplementary Fig. 8). There was some evidence of non-linearity between parity (0,1,2,3+) and 28 metabolites (Supplementary data 6 and Supplementary Fig. 9). However, restricted cubic spline models (with knots at 1, 2, and 3) generally showed monotonic relationships for those with no to four pregnancies, consistent with the main analysis models (Supplementary data 10 and Supplementary Fig. 10).

We used males as a negative control since men cannot experience the effects of being exposed to the stress test of pregnancy. Therefore, similar results between men and women would be indicative of bias, such as due to confounding by sociodemographic (e.g. education attainment) and biological (e.g. infertility) factors, rather than by an effect of repeated exposure to pregnancy. When using number of children in males as a negative control, we observed that associations for leucine, histidine, pyruvate, and ketone bodies were similar between men and women (i.e. directionally consistent, similar effect estimates and 95% confidence intervals overlapped between male and female estimates). On the other hand, association estimates for fatty acids, apolipoprotein A1, and lipoprotein-related traits were weaker or consistent with the null, and glycine was in opposite direction, in males compared to females (Fig. 4). For the MR analyses, we selected 32 SNPs as instruments for parity, which explained 0.2% of its phenotypic variance with a corresponding mean F statistic of 31 (Supplementary data 8). It is unclear whether estimates from multivariable regression and MR analyses are consistent with each other due to the high level of uncertainty in the latter (Fig. 3 and Supplementary Fig. 7), which persisted even when using the larger sample of women with selected biomarkers assayed by clinical chemistry (Fig. 5).

Fig. 4: Multivariable regression estimates for the associations of parity (females, red) or number of children (males, black) with metabolic measures: negative control analyses Models adjusted for age at baseline, education, and body composition at age 10.

Fig. 5: Mendelian randomisation estimates for the relation between older age at menarche, number of children ever born, older age at menopause - and metabolic measures among females measured using NMR metabolomics (black, squares) or clinical chemistry methods (pink, circles).

Age at natural menopause

In the main multivariable regression analyses (adjusting for age at baseline, education and body composition at age 10), older age at menopause was related to higher glycine, PUFA (e.g. DHA and LA), albumin, apolipoprotein B and sphingomyelins, but lower concentration of MUFA, pyruvate, acetoacetate, creatinine and GlycA (P < 0.00093) (Fig. 6 and Supplementary data 11). Older age at menopause was also associated with numerous lipoprotein-related traits at P < 0.00093, particularly with higher number of particles and lipid content in LDL, larger size of HDL particles, and lower size of VLDL particles (Fig. 6). The associations between age at menopause and LDL-related traits were observed across LDL subclasses (i.e. from small to large), whereas associations with HDL-related traits were mostly driven by larger HDL particles (Supplementary Fig. 11). In sensitivity analyses with further adjustments for BMI, smoking and alcohol status at baseline, associations between older age at natural menopause and metabolites remained similar, except for associations with HDL-related traits which were partly attenuated (Supplementary Fig. 12). There was evidence of non-linearity across 24 metabolites (Supplementary data 6 and Supplementary Fig. 13) in the multivariable regression when menopause was categorised (<49, 49–50, 51–53, >53 years). Restricted cubic spline models (with 4 knots) were generally consistent with the main analysis (assuming a linear association) until age at menopause ~55 years when most metabolites decreased (Supplementary data 12 and Supplementary Fig. 14).

For the MR analyses, we selected 290 SNPs as instruments for age at natural menopause, which explained 8.2% of its phenotypic variance with a corresponding mean F statistic of 141 (Supplementary data 8). Estimates from multivariable regression and MR analyses were inconsistent in direction for many metabolic measures (Fig. 6). In particular, in contrast to results from multivariable regression, MR analyses indicated older age at menopause to be related to lower concentration of fatty acids (e.g. LA), albumin, apolipoprotein B, as well as lower number of particles and lipid content in LDL across subtypes (from small to large) (Fig. 6 and Supplementary Fig. 11). For some metabolites, such as GlycA and HDL-related traits, results were consistent in direction between multivariable regression and MR. For alanine, glutamine and glucose, MR analysis suggested older age of menopause to be related to lower circulating metabolite levels, which had not been observed in multivariable regression analysis (Fig. 6 and Supplementary data 11). As expected, there was more uncertainty in MR estimates and only results for glutamine and some LDL- and VLDL-related measures passed the threshold for multiple test correction (P < 0.00093). Repeating the MR analyses in the larger sample of women (N = 216,514–239,803) with selected biomarkers assayed by clinical chemistry confirmed that older age at natural menopause was related to lower albumin, LDL-cholesterol, and total cholesterol at P < 0.00093 (Fig. 5).

We performed further analyses to investigate reasons underlying discrepant findings between multivariable regression and MR estimates for some metabolic measures (see Methods: Additional analyses for age at natural menopause: exploring the role of medication and chronological age). These analyses were restricted to the eight clinical chemistry biomarkers matching measures in the NMR platform to maximise statistical power since they have been measured in the full UK Biobank sample. First, we hypothesised that discrepant findings were related to differences in the sample used for multivariable regression, which excludes women with missing data on age at menopause because they had yet to go through it or had a surgical menopause (hereafter ‘selected sample’), and two-sample MR, which includes women even if they are missing data on age at natural menopause (hereafter ‘full sample’). To test that, we compared estimates from multivariable regression on the selected sample to MR on both the selected sample and full sample. In agreement with our hypothesis, multivariable regression and MR estimates for LDL-cholesterol and related traits are comparable when restricting to the selected sample. In contrast, for albumin, discrepant results were related to differences between multivariable regression and MR rather than between selected and full sample (Supplementary Fig. 15). Second, given women with missing data on age at menopause are typically pre-menopausal and younger, we explored age-stratified multivariable and MR estimates, which revealed a strong effect modification by chronological age on the association of age at menopause with LDL-cholesterol and related traits – e.g. older age at menopause is related to substantially lower LDL-cholesterol in younger women (≤50 y) (e.g. multivariable regression (MV): −0.018 SD, 95% CI: −0.021, −0.015), but slightly higher LDL-cholesterol in older women (>63 y) (e.g. MV: 0.004 SD, 95% CI: 0.003, 0.006) (Supplementary Fig. 15). Differences related to chronological age at baseline were also observed for other biomarkers, such as albumin. When excluding women taking statins at baseline, we observed that the association between age at menarche and LDL-cholesterol estimated by multivariable regression was partly attenuated (−0.001 SD, 95% CI: −0.002, 0.000). However, excluding women using statins or hormone replacement therapy (HRT) at baseline did not substantially altered the chronological age patterned results (Supplementary Fig. 16).

Exploring the plausibility of MR assumptions

We conducted a series of sensitivity analyses to explore the plausibility of key MR assumptions, required for the method to provide a valid test of the presence of a causal effect.

First, we tested whether MR findings are likely to be biased by population stratification, assortative mating and indirect genetic effects of parents using two approaches: (i) performing two-sample MR analyses using (sex-combined) data from a recent within-siblings GWAS, and (ii) conducting two-sample MR on negative control outcomes (i.e. skin colour and skin tanning ability). Two-sample MR estimates for the effect of genetic susceptibility for older age at menarche, parity, and age at natural menopause on five available biomarkers was broadly consistent when estimated among unrelated individuals or between siblings. Results for age at menarche were slightly overestimated for triglycerides and underestimated for glycated haemoglobin in unrelated individuals, while results for a positive relation between age at natural menopause and HDL-cholesterol was supported by analyses between siblings but not among unrelated individuals (Supplementary Fig. 17). We did not observe an association of genetically-predicted reproductive markers with skin colour or tanning (Supplementary data 13). Taken together, these sensitivity analyses indicate that our main MR estimates are unlikely to be substantially biased by population stratification, assortative mating, or indirect genetic effects of parents.

Second, we explored the presence of bias due to pleiotropic variants by using MR methods other than IVW: the weighted median estimator and MR-Egger. These methods can provide valid tests for the presence of a causal effect under different (and weaker) assumptions about the nature of the underlying horizontal pleiotropy compared to IVW. Estimates from IVW and weighted median were consistent in direction for most relationships between reproductive markers and metabolic measures. In most instances, estimates from MR-Egger method were uninformative given the high degree of uncertainty (Supplementary Figs. 18–20).

Third, we assessed potential bias due to sample overlap from including UK Biobank individuals in genetic association estimates for both exposures and outcomes. This was achieved by using data from previous GWAS that did not include UK Biobank, available for age at menarche and age at natural menopause, to select SNPs (and genetic associations estimates with exposures) for two-sample MR analyses (Supplementary data 14). When using SNPs selected from previous GWASes that did not include UK Biobank participants, results for of age at menarche and age at natural menopause were largely consistent, although less precise, compared to estimates from the main analyses using data with overlapping samples (Supplementary Figs. 21 and 22).

Discussion

Our findings indicate that reproductive markers across women’s lifespan are associated with distinct metabolic signatures in later life. Age at menarche, parity and age at natural menopause were related to numerous metabolic measures, representing multiple dimensions of metabolism, including amino acids, fatty acids, glucose, ketone bodies, and lipoprotein metabolism.