Multivariate genome-wide analysis of aging-related traits identifies novel loci and new drug targets for healthy aging

Rosoff, Daniel B.; Mavromatis, Lucas A.; Bell, Andrew S.; Wagner, Josephin; Jung, Jeesun; Marioni, Riccardo E.; Davey Smith, George; Horvath, Steve; Lohoff, Falk W.

doi:10.1038/s43587-023-00455-5

Download PDF

Article
Open access
Published: 07 August 2023

Multivariate genome-wide analysis of aging-related traits identifies novel loci and new drug targets for healthy aging

Nature Aging volume 3, pages 1020–1035 (2023)Cite this article

15k Accesses
15 Citations
41 Altmetric
Metrics details

Subjects

Abstract

The concept of aging is complex, including many related phenotypes such as healthspan, lifespan, extreme longevity, frailty and epigenetic aging, suggesting shared biological underpinnings; however, aging-related endpoints have been primarily assessed individually. Using data from these traits and multivariate genome-wide association study methods, we modeled their underlying genetic factor (‘mvAge’). mvAge (effective n = ~1.9 million participants of European ancestry) identified 52 independent variants in 38 genomic loci. Twenty variants were novel (not reported in input genome-wide association studies). Transcriptomic imputation identified age-relevant genes, including VEGFA and PHB1. Drug-target Mendelian randomization with metformin target genes showed a beneficial impact on mvAge (P value = 8.41 × 10⁻⁵). Similarly, genetically proxied thiazolidinediones (P value = 3.50 × 10⁻¹⁰), proprotein convertase subtilisin/kexin 9 inhibition (P value = 1.62 × 10⁻⁶), angiopoietin-like protein 4, beta blockers and calcium channel blockers also had beneficial Mendelian randomization estimates. Extending the drug-target Mendelian randomization framework to 3,947 protein-coding genes prioritized 122 targets. Together, these findings will inform future studies aimed at improving healthy aging.

Mendelian randomization of genetically independent aging phenotypes identifies LPA and VCAM1 as biological targets for human aging

Article 20 January 2022

Multi-omic underpinnings of epigenetic aging and human longevity

Article Open access 19 April 2023

A metabolomic profile of biological aging in 250,341 individuals from the UK Biobank

Article Open access 15 September 2024

Main

While human aging is a multifaceted process influenced by many factors^1,2 and characterized by reduced maintenance of homeostatic mechanisms, age-related diseases and death³, there exists substantial variability in how humans age³. Some individuals may be subject to chronic health problems and disease and die early while others may reach old age relatively healthy³. Understanding the factors underlying this variation is important for the development of public health interventions and therapeutics to improve healthy aging⁴.

Genome-wide association studies (GWASs) have begun to identify aging-related loci using single-phenotype approaches^5,6,7, including extreme longevity⁸, healthspan⁹ and parental lifespan¹. However, these single-endpoint approaches fail to account for the shared genetics among these traits or other aging-related traits, such as epigenetic age acceleration (EAA)¹⁰ and frailty¹¹, which would provide further insight into the broad genetic architecture underlying how humans age and inform the shifting focus in geroscience from studying survival toward incorporating complementary measures of age-related outcomes¹² to improve healthy aging—defined as the maintenance of well-being in old age that includes both the absence of disease and the presence of happiness, satisfaction and fulfillment¹³. Further, EAA may be reversible¹⁰, underscoring the potential value of elucidating mechanisms and discovering targets that slow aging¹⁰. Recent advances in multivariate GWAS approaches incorporate univariate GWAS summary statistics to facilitate discovery of the genetic architecture underlying related phenotypes¹⁴. In contrast to single-phenotype GWAS methods, multivariate approaches enhance discovery of novel biological correlates by boosting statistical power through increased effective sample sizes¹⁴ and have been recently applied to identify genomic loci shared across neuropsychiatric disorders¹⁴, alcohol consumption behaviors¹⁵ and externalizing behaviors¹⁶.

We apply genomic structural equation modeling (genomic SEM)¹⁴ to summary-level GWASs on healthspan⁹, parental lifespan¹, extreme longevity⁸, frailty¹¹ and epigenetic aging¹⁰ to construct a multivariate aging-related GWAS (here termed ‘mvAge’) to identify novel genetic variants that broadly impact healthy aging processes. We perform bioannotation—including fine mapping, a transcriptome-wide association study (TWAS) and cell-type enrichment. We also use several applications of Mendelian randomization (MR)¹⁷ aimed at identifying modifiable risk factors and biomarkers to support healthy aging initiatives. Further, given the importance and interest in repurposing and developing therapeutics to improve healthy aging (for example, ongoing clinical trials evaluating the potential of metformin¹⁸), and because genetic evidence supporting candidate compounds entering clinical trials increases the probability of clinical success, we use drug-target MR¹⁹ to investigate potential therapeutic repurposing opportunities among gene targets of metformin, 6 other antidiabetic classes, 15 lipid-lowering therapies and 5 classes of antihypertensive drugs. We also leverage drug-target MR to perform screens of protein-coding genes that will inform future studies investigating potential therapeutic targets to improve healthy aging.

Results

A study overview is presented in Fig. 1.

Structural equation modeling

Linkage disequilibrium (LD) score regression indicated that the five univariate input GWASs, representing the genetic aging-related liabilities of healthspan, frailty, exceptional longevity, parental lifespan and EAA, were positively correlated (frailty and EAA were reverse coded) (Fig. 2 and Supplementary Tables 1 and 2). We performed SEM in preparation for the multivariate GWAS. The common factor model fit of the implied genetic covariance matrix between the five input GWASs with the empirical covariance matrix was good (comparative fit index (CFI) = 0.97, standardized root mean square residual (SRMR) = 0.069) (Fig. 2 and Supplementary Tables 3 and 4), suggesting evidence for a shared genetic factor mvAge.

**Fig. 2: Multivariate aging GWAS modeled with genomic SEM.**

Multivariate GWAS meta-analysis

Expanding the SEM model to incorporate individual variants, we generated a multivariate GWAS estimating 6,793,898 associations at single nucleotide polymorphism (SNP) level (Supplementary Table 5) for our shared aging factor mvAge. Mean χ² and λ_GC (genomic control) were estimated at 1.43 and 1.52, respectively, and the LD score intercept, 0.997 (s.e. = 0.0098), suggesting inflation due to polygenic heritability signals rather than population stratification bias (Supplementary Fig. 1)^14,20. Effective sample size was calculated at 1,958,774 using mvAge summary statistics restricted to minor allele frequency (MAF) limits of 10% and 40% to produce stabler estimates¹⁴. We identified 52 lead SNPs in 38 genomic loci (P value < 5 × 10⁻⁸) (Fig. 2 and Supplementary Table 6). Twenty of the 52 SNPs were novel compared to loci identified in the five input GWASs underlying mvAge (Supplementary Tables 6 and 7 and Supplementary Figs. 2–21), highlighting the increased power of genomic SEM. Novel mvAge lead SNPs were generally enriched for traits, that is, systolic blood pressure (SBP), body mass index (BMI), brain morphology and type 2 diabetes (T2D). For 11 of the 20 novel SNPs, the lead SNP was identified in a previous GWAS from the GWAS Catalog, while the other previous associations were from variants in LD with the lead SNPs; 4 of the 11 novel variants (rs12769128, rs17499404, rs2643826 and rs9277988) were linked with aging-related traits (that is, previous GWASs of specific aging phenotypes like lifespan in the AncestryDNA cohort²¹), while another four (rs114298671, rs1689046, rs78438918 and rs6062322) were not linked with specific aging phenotypes; however, they were associated with important factors that may influence healthy aging (for example, cognition, BMI and hypertension) (Table 1 and Supplementary Tables 8 and 9).

Table 1 Novel lead SNPs identified in mvAge

Full size table

Fine mapping

Fine-mapping analysis identified strong associations with several loci, including on chromosomes 1 (rs1230666, intronic variant in MAGI3); 6 (rs12203592 and rs9277988); 8 (rs268, an exonic variant within LPL); 19 (rs7412 in the APOE locus); and 20 (rs1737896). Regional plots show clear peaks at these loci with other credible set variants showing evidence of association (Supplementary Figs. 22–33 and Supplementary Table 10).

Q _SNP heterogeneity

Evaluating whether the multivariate SNP associations are appropriately modeled through a multivariate framework²², 9 of the 52 lead SNPs generated Q_SNP P values < 9.62 × 10⁻⁴, the Bonferroni-adjusted threshold, suggesting these SNPs impact the input aging-related GWAS endpoint by pathways other than mvAge²². However, none of the 20 novel SNPs had Q_SNP P values < 9.62 × 10⁻⁴ (Supplementary Table 6).

Transcriptomic imputation

Next, we performed a TWAS using FUSION²³ to identify gene-level associations with the mvAge genetic signature. We found 57 genes surpassing correction for multiple comparisons (Extended Data Fig. 1 and Supplementary Table 11). We took these genes forward for further testing, including colocalization²⁴ and FOCUS fine mapping for TWAS²⁵. Of the 57 TWAS-significant genes, 18 represented colocalized and potentially causal signals with mvAge. These ‘high-confidence’ gene-level associations included CDKN2A, PTPN22, PHB1, and VEGFA. TWAS Z scores for CDKN2A and VEGFA were both >0, indicating that predicted gene expression positively associated with mvAge, suggesting upregulation of the genes may be associated with increased mvAge. By contrast, with TWAS Z scores <0, results suggest downregulation of PTPN22 and PHB1 associated with increased mvAge.

Exploratory two-factor analysis

Genomic SEM’s exploratory factor analysis (EFA) was used to guide the specification of a more nuanced two-factor model (Supplementary Table 3). Based on the EFA, we ran a follow-up confirmatory factor analysis. A two-factor solution provided good fit to the data (CFI = 0.996, SRMR = 0.035) (Supplementary Table 4), with factor loadings suggesting one latent factor comprised life-expectancy-related GWASs (parental lifespan, extreme longevity and PhenoAge EAA) and the other comprised the healthy-aging-related GWASs (healthspan and frailty). A strong genetic correlation between the two factors (r_g= 0.76, P value = 2.4 × 10⁻⁴⁸) suggests shared but distinct components of life expectancy/lifespan and healthy aging, both captured by mvAge.

Pathway, cell-type and Mendelian-disease-gene enrichment

Multimarker analysis of genomic annotation (MAGMA)²⁶ gene-based mapping found 164 genes (Supplementary Table 12) that we used to perform our gene-set analysis, which were enriched for gene ontology and REACTOME terms (Supplementary Table 13); many of the gene sets were related to lipids (that is, plasma lipoprotein assembly, triglyceride (TG) and very low-density lipoprotein clearance, TG metabolic processes, protein–lipid complex assembly and components of chylomicrons). Cell-type enrichment showed six cell types surpassing correction for multiple comparisons (Supplementary Table 14). The top two cell types were the lymphoid and granulocyte/monocyte progenitor cells. mvAge was enriched primarily in immune cells/immune cell progenitors, with 8 of 13 cell types with P values < 0.05 related to immune cells. Tests for enrichment of mvAge in Mendelian disease genes and associated pathways showed six Mendelian diseases, including primary ciliary dyskinesia (Supplementary Table 15), and 21 phenotypic abnormalities, including several gene sets related to respiratory system function (Supplementary Table 16).

MR with modifiable risk factors and biomarkers

We used MR to assess possible causal relationships of 73 genetically predicted biomarkers and risk factors on mvAge (Supplementary Table 17). Twenty-five of 73 risk factors and biomarkers generated MR estimates surpassing the Bonferroni correction. MR estimates were consistent across complementary MR methods used as sensitivity analyses for the primary inverse variance weighted (IVW) estimate. In addition, MR Lap estimates were consistent with the IVW estimates, suggesting minimal bias due to sample overlap²⁷. (Full results are described in Supplementary Results.)

Metformin target genes impact mvAge

Results are oriented to mimic pharmacological modulation of the drug target, namely, a lowering of HbA1c (s.d. decrease (mmol mol⁻¹)) (Fig. 3 and Supplementary Table 18). (Supporting its validity as a genetic proxy for metformin, the primary instrument was associated with reduced risk for T2D (odds ratio = 0.40, P value = 2.5 × 10⁻⁴).) Lowering HbA1c via the metformin target genes linked beneficially with mvAge, which beneficial relationship remained after removing SNPs nominally associated with T2D from the primary instrument as well as in analyses using a second instrument from a recent MR study evaluating the impact of metformin on dementia²⁸. Analyses of the univariate aging-related input GWASs suggested beneficial relationships of metformin target genes with healthspan, epigenetic aging and longevity. Finally, MR distinguishing individual metformin gene targets showed relationships of the mitochondrial complex 1 and the GDF15 targets with mvAge.

**Fig. 3: Drug-target MR results assessing proxying metformin and other antidiabetics targets with mvAge.**

Genetic impact of targets for cardiometabolic drug classes

Given the roles of HbA1c, circulating lipid levels, and SBP in our polygenic MR, we performed drug-target MR evaluating the potential of genetically proxied antidiabetics (proxying lower HbA1c levels) and blood-pressure-lowering therapies on mvAge. Results are oriented to mimic the pharmacological modulation of the drug target, antidiabetics/HbA1c lowering, PCSK9 inhibition/low-density lipoprotein cholesterol (LDL-C) lowering (per s.d., mmol l⁻¹), LPL enhancement/TG lowering (per s.d., mmol l⁻¹), increase in high-density lipoprotein cholesterol (HDL-C) (per s.d., mmol l⁻¹), lowering of SBP (per s.d., mmHg) (Fig. 4 and Supplementary Tables 19 and 20). Among the antidiabetic targets, we observed beneficial relationships with mvAge for thiazolidinediones. MR estimates for sulfonylureas were also protective but less precise (P value < 0.05). Among LDL-C-lowering targets, PCSK9 and ABCG5/8 were each related beneficially to mvAge; HMGCR had a similar but less precise estimate. Among TG-lowering targets, ANGPTL4 inhibition and LPL enhancement were related beneficially to mvAge; as was increasing HDL-C via CETP inhibition. The protective estimates of PCSK9 inhibition, ABCG5/8 inhibition, LPL enhancement, CETP inhibition and LPA inhibition on mvAge each replicated in genetic instruments derived from independent GWAS from the Global Lipid Genetics Consortium (GLGC). (While several MR estimates showed evidence of heterogeneity, all were robust to removal of pleiotropic variants.) Similarly, we observed differences in the associations among the different classes of antihypertensive targets; for example, among genes in the renin–angiotensin–aldosterone pathway, genetically proxied angiotensinogen (AGT) inhibition was more strongly related to mvAge than genetically proxied angiotensin-converting-enzyme (ACE) inhibition.

**Fig. 4: Drug-target MR results assessing the impact of lipid-modulating and antihypertensive gene targets on mvAge.**

Cis-instrument MR prioritizes potential mvAge targets

Leveraging biomarker data and cis-instrument MR²⁹, we performed a screen of several thousand protein-coding genes located near the genomic loci of the biomarkers, that is, HbA1c, HDL-C, LDL-C, TG and SBP, corresponding to the physiological responses to glucose-lowering, lipid-modulation and antihypertensive therapies. In the first stage, 523 of 6,718 genes across the 5 biomarkers demonstrated evidence of colocalization (PP.H4 > 0.6) (Supplementary Tables 21–25). We were able to cis-instrument and analyze 354 of the 523 genes on mvAge. Across the biomarkers, 158 genes (121 unique genes) evinced genetic relationships with mvAge, 122 with beneficial MR estimates directionally consistent with the conventional physiological response to pharmacological modulation of the biomarkers, that is, lowered HbA1c, LDL-C, triglycerides and SBP, and increased HDL-C (Supplementary Tables 26–32). Twenty-five genes located near HbA1c evinced beneficial relationships with mvAge, including FADS1, previously linked with glucose intolerance³⁰, and also 23 near LDL-C, including replication of the PCSK9 results, and FGF21, an important regulator of several metabolic pathways³¹ (Fig. 5). Twenty-three genes near HDL-C, 26 near TGs and 25 near SBP similarly showed beneficial relationships with mvAge (Extended Data Fig. 2). Several genes (for example, ATXN2) were related to mvAge in more than one biomarker. Thirty-two are considered ‘druggable,’²⁹ and we observed drug–gene interactions, including an interaction of FADS2 with oleic acid (Supplementary Tables 33 and 34 and Extended Data Fig. 3). Regarding replication, 36 of the 41 genes available for instrumentation in independent biomarker data replicated at P value < 0.05 and had directionally consistent MR estimates with the primary MR analyses (Supplementary Table 32). Cis-instrument/drug-target MR results of the 68 circulating proteins derived from approximately 30,000 participants in the SCALLOP OLINK data³² are presented in Supplementary Results, Supplementary Tables 35 and 36 and Extended Data Fig. 4.

**Fig. 5: *Cis*-instrument MR results assessing the impact of protein-coding genes on mvAge through their associations with HbA1c and LDL-C.**

Discussion

Using new GWAS methods leveraging genetic correlations among correlated univariate aging-related traits, we performed a multivariate GWAS with a resultant effective sample size of 1.9 million participants and identified 20 variants not previously associated with aging, including rs268, an exonic SNP in the LPL locus, and rs2863761, an intronic variant not previously associated with any GWAS in the GWAS catalog. However, because no other age-related variant is in LD with rs2863761, replication will be necessary to test the robustness of this association. Next, we used fine mapping to prioritize several strongly associated variants (such as rs268), and transcriptomic imputation followed by gene-level colocalization and transcriptomic imputation-based fine mapping to identify high-confidence genes associated with mvAge, including several with evidence of involvement in aging processes (VEGFA³³ and PHB1 (ref. ³⁴)).

The genetic signature of mvAge was enriched for gene sets linked with aging-related areas, including neural functioning, growth and development, and lipid metabolism. In line with the lipid-related enrichment, we used the MR framework to both identify adverse causal roles for lipid levels in aging and show that genetically modeled modulation of lipid-lowering gene targets, such as PCSK9, ANGPTL4 and LPL, have beneficial relationships with healthy aging, suggesting potential therapeutic targets for future investigation. The association of rs268 in the LPL locus with mvAge, its inclusion in a 95% credible set, and the MR evidence for a role of circulating triglycerides—using both the polygenic, genome-wide instrument and instrument variants within the LPL locus—indicate a potential pathway through which rs268 may impact healthy aging. The results of our drug-target MR analyses proxying pharmacological lipid-lowering and antihypertensives support the hypothesis that therapeutic management of circulating lipids and blood pressure impact healthy aging in the general population not selected for cardiovascular diseases^35,36. Overall, the gene-set enrichment and MR findings between several mvAge loci and cardiovascular health are in line with cardiovascular disease being the leading global cause of death³⁷. They also likely reflect the composition cohorts underlying the input GWAS studies. For example, healthspan was defined as the incidence of the eight most common diseases in the study sample⁹, and thus is dependent on UK Biobank (UKB) selection protocols. The UKB cohort is comprised of adults between 40 and 69 years, and is enriched in cardiovascular disease and cancer, but has relatively few cases of Alzheimer’s disease³⁸. This composition may explain, in part, the strong associations observed with cardiovascular health, and also suggests that other key aging signals may be missed. Therefore, future multitrait studies with different cohort composition are warranted to further our understanding of aging.

Our additional exploratory and confirmatory factor analyses aiming to further examine the relationship of the genetics of life expectancy (lifespan and longevity) and healthspan suggested that in a two-factor model, lifespan and longevity load on one factor while healthspan, frailty and EAA load on the other factor. The two factors are correlated, suggesting that there may be shared yet distinct components of life expectancy/lifespan and healthspan/frailty/epigenetic aging captured by mvAge. Because one of the major goals currently in geroscience is reducing the life expectancy–healthspan gap³⁹, our findings suggest that analyzing mvAge and related shared factors in future studies, in additional populations, will improve our understanding of the genetics linking life expectancy and healthspan.

We focused downstream analyses on identifying modifiable risk factors that may facilitate public health intervention and prevention strategies, as well as extensive drug-target MR to investigate the impact of existing therapies and also identify targets for future work focused on healthy aging. There remain substantial challenges to running randomized controlled trials testing aging-related therapeutics (that is, long study duration, large sample sizes, patient selection)⁴⁰. Especially relevant to the development of anti-aging therapeutics will be the continued investigation of the relationship between aging and age-related diseases⁴⁰. Many diseases are due, in part, to age-related biological dysregulation⁴¹, and studies incorporating aspects of both aging and age-related diseases may identify drug targets and facilitate development of therapeutics that both improve healthy aging and reduce disease burden.

To the point, we found several of the novel variants not previously linked directly with aging-specific GWASs were indeed associated with important aging factors and processes, for example, rs78438918 associated with cognition, rs114298671 associated with BMI, and rs6062322 associated with blood pressure and antihypertensive medication use (Table 1). Hundreds of variants have been identified in GWASs of BMI⁴², cognition⁴³, blood pressure⁴⁴, etc., and while these are important aging factors and processes, not all of the identified variants may be directly linked with healthy aging. Because the mvAge signature represents the shared genetics of the five specific aging phenotypes, finding that a subset of variants previously implicated in aging factors and processes comprise, in part, the mvAge signature highlights potential pathways linking the aging factors and processes with healthy aging, and suggesting potentially important loci for future characterization, especially in studies linking these factors/processes with healthy aging.

Our metformin targets analysis complements previous studies showing that metformin is beneficial for healthy aging⁴⁵. Given the early stages of the ongoing clinical studies (MILES (Metformin in Longevity Study) and TAME (Targeting Aging with Metformin)^18,46) investigating the aging benefits of metformin, it will be several years before the studies will be concluded. Our results constitute preliminary genetic evidence and provide triangulating evidence strengthening inference for metformin’s role in aging. Preliminary analysis of the MILES data indicates that metformin induces transcriptional changes related to reduced aging⁴⁷, and we showed that metformin had a beneficial impact on slowing epigenetic aging, together suggesting another biological mechanism that corresponds with clinical trial data from the first human study designed to reverse biological hallmarks of aging, including EAA in a population of healthy middle-aged men⁴⁸. We found that the metformin instrument may be driven by its mitochondrial-related targets, MCI, MG3 and GDF15. Mitochondrial function is impaired in disease states and aging⁴⁹ and it has been suggested that metformin may regulate mitochondrial functioning by mitophagy and removing damaged mitochondria⁴⁷, which could improve aging. GDF15 has become an important target in the aging field with previous studies linking it with all-cause mortality⁵⁰; showing that among 1,301 proteins, it was the most strongly associated with age⁵¹; and finding high expression among frail older individuals compared to healthy controls⁵². GDF15 is a key molecule in the human stress response⁵³, and recent work found that patients with primary mitochondrial oxidative phosphorylation defects demonstrate increased resting energy expenditure, elevated stress responses (including elevated GDF15 levels) and accelerated biological aging⁵⁴. Given the impact of glucose homeostasis on energy⁵⁵, our results showing that HbA1c lowering via metformin’s mitochondrial targets further support evidence linking mitochondrial function and energy expenditure with accelerated aging. We note that our metformin targets were derived from data proxying their anti-hyperglycemic effects, and while it has been suggested that the main anti-aging role of metformin is mediated via its action on glucose metabolism⁴⁵, metformin and its targets impact many pathways, including ones that have yet to be elucidated⁴⁵. Therefore, these results should not be interpreted as complete proxies of metformin use or its mechanisms via other pathways.

The PCSK9 and other lipid-lowering findings extend work showing that life expectancy in familial hypercholesterolemic patients is shortened by 20–30 years relative to the general population⁵⁶. While a recent meta-analysis of 38 randomized controlled trials did not find an impact of PCSK9 inhibition on all-cause mortality among study participants selected for cardiovascular diseases⁵⁷, the studies included in the meta-analysis were potentially not long enough in duration to detect a role of PCSK9 inhibition on aging (~36.4 weeks)⁵⁷. Ultimately, causal inference requires triangulating study designs⁵⁸, highlighting the need for additional studies investigating the relationship of PCSK9 expression, PCSK9 inhibition, and aging. Similarly, our blood pressure and antihypertensive target findings align with and extend previous work showing that intensive blood pressure reduction in older adults with hypertension extends life expectancy, highlighting the importance of blood pressure control to prolong patient health and well-being⁵⁹. Our results showing that ADRBI—the beta-blockers target—beneficially impacts aging extend prior genetics-based analyses finding ADRB1 beneficial in human longevity⁶⁰.

In addition to validating important loci (for example, the ATXN2 in mvAge findings aligns with previous genetics work implicating ATXN2 in human longevity⁶ and showing that therapeutic modulation of ATXN2 increases lifespan in mice⁶¹), our analyses extending the drug-target/cis-instrument MR framework using GWAS data for biomarkers to investigate possible protein-coding genes that may influence mvAge via their downstream biomarker (that is, HbA1c, circulating lipids, SBP) also identified several notable targets. For example, FADS1 and FADS2 are important genes in the biosynthesis of unsaturated fatty acids⁶², and protein–chemical interaction analysis showed FADS2 has interactions with oleic acid, the main fatty acid in olive oil, a major Mediterranean diet component⁶³, linked with increased lifespan and reduced age-related diseases⁶³. We also found that reduced LDL-C levels by variants within the FGF21 locus increased mvAge. FGF21 encodes fibroblast growth factor 21, a metabolic hormone important for regulation of systems related to energy homeostasis, including lipids³¹, and increased FGF21 expression extends lifespan in mice⁶⁴. Interestingly, early studies of FGF21 analogs in individuals with T2D found they alleviated dyslipidemia but did not impact glycemic control³¹, and a new long-acting therapeutic compound, LLF580, was found to lower circulating LDL-C and reduce hepatic fat content in a phase 1 clinical trial of 64 obese adults (clinical trial identifier, NCT03466203) (ref. ⁶⁵). LLF580 also showed minimal side-effects for the 12-week study with the exception of increased reporting of mild-to-moderate gastrointestinal distress⁶⁵. Our FGF21 results provide triangulating support for a role of FGF21 in cardiometabolic diseases⁶⁶ and longevity⁶⁴, and also highlight the potential value of the recently developed extension of the drug-target MR paradigm to identifying and prioritizing novel targets for future study⁶⁷.

The exploratory MR based on protein quantitative trait loci (pQTL) should also motivate future research into therapies to improve healthy aging because most approved pharmacotherapies target proteins¹⁹. We identified and replicated in independent pQTL datasets three circulating proteins, CSF-1, MMP-1 and IL-6RA. Increased circulating levels of CSF-1 and MMP-1 adversely impacted mvAge. In the Supplementary Discussion we highlight reported relationships of CSF-1, MMP-1 and IL-6RA, biomarkers and risk factors with a range of physical health diseases that further support their role in healthy aging.

The drug-target MR has limitations, including its inability to mimic certain mechanisms of action for the therapeutics. For example, the genetic estimates for PCSK9 inhibition derived from circulating LDL-C levels may not fully approximate the impact of specific drug classes, such as inclisiran, a novel small interfering RNA inhibitor of hepatic PCSK9 expression with a liver-specific mechanism of action⁶⁸. Similarly, our antidiabetic and antihypertensive instruments may not capture pathways beyond the HbA1c and SBP biomarkers. For example, Liu et al.⁶⁹ also showed that verapamil promoted autophagy and increased levels of calcineurin activity in C. elegans⁶⁹, offering additional life-extending mechanisms through which calcium channel blockers (CCBs) may impact healthy aging; however, our SBP-derived CCB instrument precludes direct investigation of these non-blood-pressure-mediated pathways. Our formulation of the metformin targets instrument is a novel application developed in recent drug-target MR work²⁸ and we underscore that the estimates reflect the metformin target-specific effects. Our metformin targets findings should be viewed in the context of triangulating study designs finding beneficial a relationship between metformin and aging. We also emphasize the results do not suggest these therapeutic targets be viewed as panaceas for improved aging and should not be considered replacements related to healthy lifestyle choices that are important for healthy aging⁷⁰. More broadly, the MR analyses may be subject to collider bias⁷¹, which occurs when a third, collider variable caused by both the exposure and outcome variables in an MR distorts the true underlying association⁷². Here, potential colliders include chronic diseases or their causal risk factors that may impact analyses with outcomes related to exceptional longevity. For example, while we found that metformin target genes reduced epigenetic aging and increased exceptional longevity, it has been hypothesized that targeting frailty or therapeutics aimed at chronic diseases will not increase the limit of human lifespan⁷³, suggesting that these drug target findings may be primarily driven by their impact on their indicated disease(s), which, while not directly extending the limit of human lifespan, may still have important implications for improving healthy aging, including reducing the life expectancy–healthspan gap.

We note additional study limitations. First, genomic SEM provides a composite phenotype representing the joint genetic structure of broad liability across complex traits. Therefore, resulting multivariate GWASs, including mvAge, do not have conventional units¹⁴. This has implications for our MR analyses; however, because MR estimates reflect lifelong estimates, clinical interpretation of numerical estimates is challenging in all MR studies^17,19. (Further discussion regarding the contextualizing of mvAge estimates, especially as it relates to MR, with a focus on the metformin target estimates is presented in the Supplementary Discussion). Genomic SEM, like most GWAS studies, assumes an additive model for genetic variants¹⁴, which may not capture the impact of recessive variants that may have large adverse effects on aging^3,74. Correlation among genes used as input for our gene-set enrichment analysis may increase the potential for type 1 errors⁷⁵; however, at least for the lipid-related pathways, concurrent MR evidence linking both circulating lipids and lipid-lowering therapeutic gene targets strengthens the evidence for involvement of lipids in healthy aging. mvAge and downstream analyses were limited to participants of European ancestry. Therefore, we underscore the limited generalizability of study findings to populations of other ethnicities/ancestries, highlighting the need for follow-up studies validating these findings in other populations when the GWAS data becomes available. Finally, limitations may derive from the univariate GWAS data (input for the multivariate GWAS analysis); for example, the parental lifespan GWAS may reflect the common causes of death in the UK from several decades ago, which have changed over time⁷⁶, and may not fully capture current demographic characteristics.

Conclusions

We leveraged recently developed multivariate GWAS methods to elucidate the genetic underpinnings of the broad predisposition of healthy aging. The identified loci reflecting this underlying aging-related trait align with the current shift in geroscience toward a systems-level focus aimed at improving healthy aging and slowing aging processes⁷⁰. Bioannotation and MR characterized putatively causal genes, cell types, biomarkers and modifiable risk factors. Drug-target MR of approved and proposed antidiabetic, lipid-modulating and antihypertensive targets highlighted important repurposing opportunities, while our extended cis-instrument MR screen of protein-coding genes finding more than 120 candidates will inform the prioritization of therapies for healthy aging.

Methods

Data sources

The univariate input GWAS data from participants of European ancestry comprising our multivariate GWAS on human aging was derived from five GWASs encompassing related aspects of human longevity, including healthspan⁹, parental lifespan¹, exceptional longevity⁸, EAA¹⁰ and frailty¹¹. All input GWASs have existing ethical permissions from their respective institutional review boards and include participant informed consent with rigorous quality control.

The healthspan GWAS endpoint (n = 300,477, 54.2% female, in the UKB) was defined as the incidence of the eight most common diseases in the study sample⁹, and the study employed Cox–Gompertz survival models with clinical events in seven disease categories (that is, cancer, myocardial infarction, chronic obstructive plumonary disease, diabetes, stroke and dementia) to determine length of healthspan. Participants having one or more of these events were considered to have completed healthspans; 84,949 participants experienced an event, completing their healthspans⁹. Our frailty data was derived from a meta-analysis of participants in the UKB (n = 164,610, 51.3% female, between the ages of 60 and 70) and TwinGene (n = 10,616, 52.5% female) cohorts¹¹. The UKB frailty index was based upon an accumulation-of-deficits model⁷⁷ using 49 self-reported UKB variables from a range of physical and mental health endpoints, symptoms, disabilities and diagnosed diseases¹¹. The frailty index in the TwinGene cohort was also constructed using self-reported questionnaire data (44 deficits)¹¹.

We used summary statistics from a recent parental lifespan GWAS representing 512,047 and 500,196 maternal and paternal lifespans⁹. Across cohorts, Cox survival models for mothers and fathers had been fitted and Martingale residuals of corresponding survival models were regressed against subject gene dosages to generate the GWAS. We used summary statistics from Deelen et al. assessing the genetic underpinnings of exceptional old age using 11,262 unrelated participants reaching ≥90th survival percentile and performing a GWAS comparing this extreme longevity group with 25,483 participants whose age at death was ≤60th survival percentile (n = 36,745, 58.0% female)⁸. Survival percentiles were based upon country-specific cohort life tables (for example, the 90th survival percentile for the United States 1920 birth cohort is 89 years of age for men and 95 years of age for women and the 60th percentile is 75 and 83, respectively)⁸.

Our EAA GWAS data came from a meta-analysis in 29 cohorts (n = 36,112, 58.8% female) of four separate epigenetic clocks¹⁰. After using cross-trait LD score regression²⁰ to test the genetic correlation among each epigenetic clock and the other longevity-related univariate GWAS included in the study, we used the second-generation epigenetic clock PhenoAge, which demonstrated genetic correlations with extreme longevity, healthspan, parental lifespan and frailty, and strong genomic characterization (Supplementary Table 1). We reversed the frailty and PhenoAge coding to generate positive correlations with the other aging-related traits.

Sample overlap

At least 571,260 unique genomes are represented in our analysis (Supplementary Table 1), accounting for maximum potential overlap between the study cohorts contributing to the five aging GWAS study cohorts included in the multivariate GWAS analysis, as well as the potential overlap of UKB and other non-UKB UK cohorts, and the overlap of genomes underlying the parental lifespans cohorts.

Genomic SEM

We used genomic SEM implemented in the GenomicSEM R package v.0.0.5 to perform the multivariate GWAS analysis of healthspan, exceptional longevity, parental lifespan, frailty and PhenoAge, investigating a broad genetic liability underlying these aging-related traits. Genomic SEM is a recently developed multivariate method enabling investigation of multiple potential multivariate models of the underlying architecture of the traits of interest¹⁴. (Full technical details of genomic SEM methods are described in Supplementary Methods.) Genomic SEM is not biased by sample overlap, that is, UKB participants in multiple input GWASs, or imbalanced sample size¹⁴. Genomic SEM also facilitates identification of variants only influencing some but not all of the complex traits, and which therefore do not represent a broad cross-trait liability¹⁴.

Genomic SEM proceeds in two stages. Stage 1 estimates the empirical genetic covariance matrix and corresponding sampling covariance matrix. We prepared the aging-related GWAS summary statistics for stage 1 and used the multivariate extension of cross-trait LD score regression^14,20 to generate the empirical genetic covariance matrix between the five traits as input for the SEM common factor model¹⁴ (Supplementary Table 3). Stage 2 specifies an SEM that minimizes the hypothesized covariance matrix and the empirical covariance matrix calculated in stage 1 (ref. ¹⁴). Here, our primary study aim was to identify a genetic signature underlying the five aging-related traits; we therefore tested a one-factor model. Model fit was assessed using the SRMR, model χ², the Akaike information criterion and the CFI (Supplementary Table 4)⁷⁸.

Preparing the multivariate summary statistics for multivariate GWAS, we used the recommended default parameters implementing LD score regression, including removing SNPs with MAF >0.01 (linkage disequilibrium score regression inflates standard errors of estimates with low MAF) and information scores <0.9, and filtering SNPs to HapMap3, using the 1000 Genomes Phase 3 EUR panel (Supplementary Table 5). The summary statistics are restricted to HapMap3 SNPs only for estimating genetic covariance and sampling covariance matrix in LD score regression. We use all autosomal SNPs from the five input aging-related GWASs passing recommended default quality control filters for the multivariate GWAS analysis, filtering to the 1000 Genomes Phase 3 EUR panel, removing SNPs with MAF <0.01 (prone to error due to fewer samples within the genotype cluster and LD score regression standard errors for these SNPs tend to be high), SNPs with effect values estimated to be exactly equal zero (so as to avoid compromising matrix inversion necessary for genomic SEM), SNPs not matched with the reference panel, and SNPs with mismatched alleles. After quality control, 6,793,898 SNPs common to all input GWASs remain in the multivariate summary statistics taken forward to run the multivariate GWAS. Applying the appropriate common factor SEM specification, the individual autosomal SNP associations are incorporated into the genetic and associated sample covariance matrices to generate the multivariate genome-wide analysis (mvAge) of the shared covariance across the five input aging-related GWASs¹⁴.

Q _SNP heterogeneity

To evaluate whether the mvAge SNP associations are appropriately modeled within a multivariate SEM framework, Q_SNP heterogeneity statistics are calculated¹⁴. The null hypothesis of the Q_SNP test is that the SNP associations on the single-phenotype GWASs are statistically mediated by mvAge¹⁴. Therefore, significant Q_SNP tests in mvAge would suggest that the SNP impacts the single-phenotype GWASs by pathways other than the shared genetics of aging modeled by mvAge¹⁴. We used a Bonferroni-corrected P-value threshold of 9.62 × 10⁻⁴, correcting for 52 lead SNPs, to evaluate Q_SNP heterogeneity.

Exploratory two-factor model analysis

Increases in life expectancy have outpaced improvements in healthspan resulting in an approximately nine-year life expectancy–healthspan gap⁷⁹. This relationship between life expectancy and healthspan has important implications for public health intervention/prevention strategies and drug discovery and repurposing endeavors³⁹. Given that our input aging-related GWAS data encompass aspects of life expectancy (parental lifespan and extreme longevity), biological aging (EAA) and healthy aging (healthspan and frailty), as sensitivity analyses, we constructed a two-factor genomic SEM model assessing the relationship between healthspan and longevity, and investigated whether our shared aging factor—mvAge—encompasses aspects of life expectancy and healthy aging.

Defining genomic loci and determining novel variants

We used ‘functional mapping and annotation of genetic associations’ implemented in functional mapping and annotation of genetic associations (FUMA)⁸⁰ v.1.3.5e to identify genomic loci, and lead/sentinel SNPs in LD (R² < 0.1) associated with mvAge at genome-wide significance (P value < 5 × 10⁻⁸). A locus was defined by lead SNPs within a 250 kb range and all SNPs in high LD (R² > 0.6) with at least one independent SNP. First, we extracted the summary statistics for these lead mvAge SNPs from the input univariate GWASs to assess the strength of their associations. We also compared lead SNPs and loci with the original univariate GWAS and defined loci to be novel if they were >1 Mb from loci identified in the univariate GWAS data. To determine whether any of the 52 lead SNPs in mvAge showed evidence of pleiotropic associations, we also performed a look-up of published GWAS-significant associations (P value < 5 × 10⁻⁸) in the GWAS Catalog⁸¹.

Fine mapping and transcriptomic imputation

We used SuSIE^82,83 and FINEMAP⁸⁴ implemented in the R package, echolocatoR⁸⁵ v.2.0.3 to identify the most plausible causal variants associated with mvAge. We used a 250 kb window around the lead SNP in the 38 genomic loci and a probability threshold of 0.95 to define credible sets of potentially causal variants. echolocatoR defines a ‘consensus SNP’ as a variant included in both SuSIE and FINEMAP⁸⁵, calculates an average posterior probability and determines an average credibility set (set to 1 when the mean SNP-wise posterior probability across SuSIE and FINEMAP exceed 0.95 and 0 otherwise⁸⁵). Next, we performed a TWAS to prioritize genes associated with mvAge. We used the TWAS FUSION method⁸⁶ and used TWAS weights for 37,920 precomputed expression quantitative trait loci features (that is, gene/tissue pairs) from GTEx v.8 (ref. ⁸⁶). Our mvAge had sufficient variants to analyze 36,149 of the 37,920 features. We took forward TWAS genes associated with mvAge surpassing Bonferroni correction for multiple comparisons (P value < 1.38 × 10⁻⁶) for additional analysis, including colocalization²⁴ and fine mapping using the FOCUS method designed for TWAS studies²⁵. We prioritized ‘high-confidence’ mvAge genes identified by FUSION based upon additional evidence for colocalization and fine mapping. Following previous work⁸⁷, we considered TWAS-significant genes associated with mvAge that also colocalized (PP.H4 >0.6) and are likely to be causal (FOCUS posterior inclusion probability >0.5). See Supplementary Methods for additional information about fine mapping and transcriptomic imputation.

Gene-set and disease ontology enrichment

We used MAGMA²⁶ with data from GTEx (v.8) to perform gene-based and gene-set analyses and investigated the potential relationships of mvAge with Mendelian disease genes and associated pathways with MendelVar⁸⁸. See Supplementary Methods for further details.

Cell-type enrichment

To identify etiological cell types associated with mvAge, we integrated single-cell RNA sequencing (scRNA-seq) data using cell-type expression-specific integration for complex traits (CELLECT)⁸⁹. We used scRNA-seq data from Tabula Muris⁹⁰, a database containing transcriptomic data from 100,000 cells and 20 organs and tissues of Mus musculus. We prepared the Tabula Muris scRNA-seq data using CELLEX⁸⁹, calculating expression specificity likelihood scores for each gene following normalization and preprocessing⁸⁹. Using CELLECT’s default settings, we performed the cell-type enrichment with MAGMA. In CELLECT, MAGMA measures the extent to which genetic associations with a phenotype increase as a function of gene expression specificity for a given cell type⁸⁹. We categorized our cell types following the nomenclature used in the original Tabula Muris study⁹⁰ and used a false discovery rate (FDR) threshold of 0.05.

Mendelian randomization

All MR analyses have been reported in accordance with the STROBE-MR guidelines⁹¹ (Supplementary Checklist). We implemented MR using MendelianRandomization R package v.0.7.0 and TwoSampleMR package v.0.5.6.

Polygenic MR

Extended Data Fig. 5 presents a graphical overview of the analyses. To investigate whether mvAge was causally influenced by lifestyle factors and circulating biomarkers, we performed MR with 73 risk factors and biomarkers derived from GWAS with participants of European ancestry (Supplementary Table 37). Because the focus of our downstream analyses is the identification of targets for intervention, prevention and therapeutic strategies for improved aging, we selected modifiable risk factors that would inform public health initiatives⁹². Reliable causal biomarkers to identify the consequences of aging also are needed⁹³. Therefore, we curated a range of biomarkers (for example, lipids, blood pressure, markers of inflammation). Given mvAge and many of the exposures included in the MR analyses are derived from the UKB, sample overlap may introduce bias⁹⁴. We thus applied the MR Lap method, which accounts for sample overlap (even when the exact overlap percentage is unknown) and also assesses weak instrument bias and winner’s curse²⁷, as an additional sensitivity test for these MR analyses. We used a Bonferroni-corrected threshold P value = 6.85 × 10⁻⁴ adjusting for 73 comparisons. See Supplementary Methods for more detail regarding motivation, instrumentation and MR assumptions.

Drug-target MR of metformin gene targets

Given the polygenic MR results showing an adverse relationship of HbA1c and mvAge, we investigated whether HbA1c lowering via metformin target genes may improve mvAge. In line with recent work by Zheng et al.²⁸ proxying the impact of metformin target genes on Alzheimer’s disease risk, we first identified five primary metformin targets (AMPK, MCI, MG3, GSD15 and GLP1) from the literature^95,96,97. We used the ChEMBL database⁹⁸ to identify genes related to the mechanism of action for the five metformin target genes (Supplementary Table 38). We extracted variants within 100 kb of the gene boundaries (cis-instrumentation) from the GWAS of circulating HbA1c levels used for the polygenic MR (UKB participants of European ancestry, n = 361,194) (ref. ⁹⁹). We clumped extracted SNPs at the LD R² < 0.2 threshold (250 kb) using the 1000 Genomes Phase 3 EUR reference population¹⁰⁰, and calculated F statistics to evaluate instrument strength (Supplementary Table 39). We then performed MR IVW (random-effects analysis performed when there were more than three variants) and MR Egger analyses accounting for the correlation between our instrument variants (the requisite correlation matrices for the analyses were generated using the 1000 Genomes Project EUR population as reference¹⁰⁰) to increase statistical precision by including additional, partially independent variants in the drug-target instruments^19,67. To facilitate interpretation of HbA1c-lowering mechanisms of metformin, we scaled MR estimates to correspond to a lowering effect in the HbA1c GWAS data. We performed additional sensitivity analyses. First, we tested each metformin target separately. Second, we removed variants associated with T2D at nominal statistical significance (P value < 0.05), evaluating whether T2D variants are driving the observed metformin–mvAge relationship. Third, we performed an additional MR, clumping metformin targets at LD R² of 0.001. Fourth, we used a second metformin instrument using variants comprising the Zheng et al. variants. Finally, given the reported impact of metformin on slowing epigenetic aging⁴⁸, we analyzed the PhenoAge GWAS and other univariate GWAS data to determine whether there was evidence of a univariate GWAS signal.

Drug-target MR of cardiometabolic drug classes

Extended Data Fig. 6 presents a graphical overview of the analyses. Given the MR findings showing that elevated HbA1c, lipid levels and blood pressure demonstrate adverse relationships with mvAge, we performed drug-target MR using downstream biomarkers (that is, HbA1c, circulating lipids and SBP) proxying pharmacological modulation via therapeutics lowering HbA1c, lipids and blood pressure. Briefly, we proxied pharmacological modulation of these drug targets by extracting SNPs cis-acting loci (±100 kb of gene boundaries) associated with their respective biomarker, that is, the primary physiological response to pharmacological modulation of that target: antidiabetics targets were extracted from HbA1c data; PCSK9, HMGCR, ACLY, ABCG5/8 and NPC1LC SNP effect estimates were extracted from LDL-C; ANGPTL3, ANGPTL4, ANGPTL8, APOC3, PPARA and LPL SNP effect estimates were extracted from TG; APOA1 and CETP SNPs were extracted from HDL-C; LPA variants were extracted from Lp(a) GWAS; and variants in the antihypertensive targets were extracted from the SBP (Supplementary Tables 40–42 contain instrument information). Supplementary Methods provides more detail about selection criteria for drug-target genes.

Drug-target MR methods

We clumped drug targets as described for the metformin analyses (LD R² < 0.2 threshold (250 kb) using the 1000 Genomes Phase 3 EUR reference), and performed correlated MR as detailed above to increase statistical precision by including additional, partially independent variants in the drug-target instruments^19,67. To facilitate interpretation of these HbA1c-lowering, lipid-modulating and SBP-lowering therapeutic targets, we scaled MR estimates to correspond to the physiological impact in the respective downstream biomarkers. We used a Bonferroni-corrected threshold P value = 0.002, adjusting for 25 total drug targets tests. See Supplementary Methods for more details regarding drug-target MR.

Cis-instrument MR screen of cardiometabolic genes

Extended Data Fig. 7 presents a graphical overview. Given the strong MR evidence linking mvAge with both the polygenic measures of circulating lipids, HbA1c and SBP, corresponding drug-target MRs with the gene targets for metformin, and approved lipid-lowering and blood-pressure-lowering therapeutics, we performed cis-instrument/drug-target MR screens evaluating the causal impact of the lipid subfractions, HbA1c and SBP via the action of protein-coding genes located near the genomic loci of these biomarkers. Results of these analyses are targets that may impact mvAge through glycated hemoglobin, lipids or blood pressure. This approach has recently been developed, identifying and validating 30 gene targets exhibiting beneficial associations with lipids that may reduce coronary artery disease risk⁶⁷. The aim of these analyses is to provide genetic evidence supporting targets that may be important to inform future mechanistic investigation for therapeutics that may improve healthy aging.

We used the same lipids, HbA1c and SBP GWAS data as used for the drug-target MR analyses described above. First, we extracted lead variants with P values < 5 × 10⁻⁸ (LD R² < 0.1) associated with their respective GWAS biomarker (the same HbA1c, lipids and SBP GWAS data used for the drug-target MR analyses outlined above) and identified protein-coding genes located within 50 kb of the lead variants (Supplementary Tables 43–47). We then performed colocalization analyses to assess the posterior probability of a shared genetic signal between mvAge and biomarkers at the protein-coding gene locus²⁴. We considered genes with posterior probabilities (PP.H4 >0.6) to have evidence of shared causal variants between mvAge and the respective biomarker at the gene. We took these protein-coding genes (523 across 5 biomarkers) forward to cis-instrumentation. We constructed cis-instruments for the protein coding using genetic variants (LD R² < 0.2) associated with their respective biomarkers at conventional genome-wide statistical significance (P value < 5 × 10⁻⁸) within gene boundaries and SNP F statistics >10 (indicating strong instruments and evidence that MR estimates are unlikely to be subject to weak instrument bias); 354 of 523 protein-coding genes identified by colocalization had variants meeting these criteria. We performed MR accounting for correlation between instruments to increase estimate precision^19,67, and used a Bonferroni-corrected threshold P value = 1.41 × 10⁻⁴, adjusting for the 354 genes tested. We were able to perform replication analyses using independent HbA1c (ref. ¹⁰¹) and lipids GWAS summary data¹⁰² for 41 of the 132 protein-coding genes identified by cis-instrument MR (an independent GWAS was not available for SBP, and thus we were not able to replicate these genes). Next, we assessed the potential therapeutic actionability of these protein-coding genes in several ways, including assessing druggability (Supplementary Table 48)²⁹, protein–chemical interactions with the STITCH interaction database¹⁰³ and drug–gene interactions with DGIdb¹⁰⁴. See Supplementary Methods for more detail.

Proxying circulating proteins

To complement the drug-target MR analyses described above leveraging biomarker data, we next explored the causal role of circulating proteins measured in 30,391 participants of European ancestry from the first results provided by the SCALLOP consortium¹⁰⁵, which used the Olink platform to perform a pQTL mapping of plasma proteins to investigate additional possible anti-aging therapeutic opportunities that may be useful for future investigation. Clumping, harmonization and MR analysis proceeded as above, including several proteins instrumented by a single variant. We used the Wald Ratio method¹⁰⁶ to obtain MR estimates (Supplementary Tables 49 and 50). See Supplementary Methods for more detail.

Statistics and reproducibility

Methods describes the statistical methods used in this study to analyze the data. The statistical tests used in this study, except gene-set and cell-type enrichment, were two-sided. R v.4.0.2 was used for data processing and analysis unless otherwise specified. Predetermination of sample size, experiment randomization and blinding of investigators to experiments were not applicable for this type of study.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All analyses were based upon publicly available data. Summary-level statistics for the mvAge GWAS generated in this study are available at https://zenodo.org/record/7926323. Summary-level statistics for longevity are available at https://www.longevitygenomics.org/downloads; parental lifespan, https://datashare.ed.ac.uk/handle/10283/3209; healthspan, https://www.gwasarchive.org/; the frailty index, https://figshare.com/articles/dataset/Genome-Wide_Association_Study_of_the_Frailty_Index_-_Atkins_et_al_2019/9204998; and the epigenetic clocks, https://datashare.ed.ac.uk/handle/10283/3645. GTEx weights for FUSION analyses are available at https://gusevlab.org/projects/fusion/. Single-cell gene expression data from the Tabula Muris study are available at https://tabula-muris.ds.czbiohub.org/. Summary-level statistics used for Mendelian randomization analyses are accessible in the IEU Open GWAS Project at https://gwas.mrcieu.ac.uk/ using the IEU Open GWAS Project IDs (provided in Supplementary Table 37 accompanying the manuscript). Circulating protein levels from the SCALLOP Consortium are available at https://zenodo.org/record/2615265#.ZGEzyezMLN0. All other data supporting the findings of this study are available from the corresponding author upon reasonable request.

Code availability

The software used in this study is publicly available and accessed without restriction. R package GenomicSEM v.0.0.5 is available at https://github.com/GenomicSEM/GenomicSEM. For implementing Mendelian randomization analyses, R package TwoSampleMR v.0.5.6 is available at https://mrcieu.github.io/TwoSampleMR/; R package MendelianRandomization v.0.7.0: https://cran.r-project.org/web/packages/MendelianRandomization/index.html; and R package MRlap v.0.0.3.0: https://github.com/n-mounier/MRlap. STITCH is available at http://stitch.embl.de/. R FUSION Pipeline v.1.4.2 for TWAS analysis is available at http://gusevlab.org/projects/fusion/. Python package FOCUS v.0.6.10 for FOCUS fine mapping for FUSION is available at http://github.com/bogdanlab/focus/. R package coloc v.5.1.0.1 for colocalization is available at https://cran.r-project.org/web/packages/coloc/index.html. Python package CELLECT v.1.3.0 for single-cell enrichment analyses is available at https://github.com/perslab/CELLECT. Python package CELLEX v.1.2.1 for single-cell processing is available at https://github.com/perslab/CELLEX. R package echolocatoR v.2.0.3 for fine mapping using SuSIE and FINEMAP is available at https://github.com/RajLabMSSM/echolocatoR. MendelVar is available at https://mendelvar.mrcieu.ac.uk/. FUMA and MAGMA v.1.4.0 are available at https://fuma.ctglab.nl/. R v.4.2.1 and Python v.3.8 were used to format data for analyses. Supplementary Figs. 1–21 were generated using FUMA v.1.4.0. Supplementary Figs. 22–33 were generated using R package echolocatoR v.2.0.3.

References

Timmers, P. R. H. J. et al. Genomics of 1 million parent lifespans implicates novel pathways and common diseases and distinguishes survival chances. eLife 8, e39856 (2019).
Article PubMed PubMed Central Google Scholar
Sathyan, S. & Verghese, J. Genetics of frailty: a longevity perspective. Transl. Res. 221, 83–96 (2020).
Article CAS PubMed PubMed Central Google Scholar
Timmers, P. R. H. J., Wilson, J. F., Joshi, P. K. & Deelen, J. Multivariate genomic scan implicates novel loci and haem metabolism in human ageing. Nat. Commun. 11, 3570 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mullard, A. Anti-ageing pipeline starts to mature. Nat. Rev. Drug Discovery 17, 609–612 (2018).
Article CAS PubMed Google Scholar
McDaid, A. F. et al. Bayesian association scan reveals loci associated with human lifespan and linked biomarkers. Nat. Commun. 8, 15842 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pilling, L. C. et al. Human longevity: 25 genetic loci associated in 389,166 UK biobank participants. Aging (Albany NY) 9, 2504–2520 (2017).
Article CAS PubMed Google Scholar
Joshi, P. K. et al. Genome-wide meta-analysis associates HLA-DQA1/DRB1 and LPA and lifestyle factors with human longevity. Nat. Commun. 8, 910 (2017).
Article PubMed PubMed Central Google Scholar
Deelen, J. et al. A meta-analysis of genome-wide association studies identifies multiple longevity genes. Nat. Commun. 10, 3669–3669 (2019).
Article PubMed PubMed Central Google Scholar
Zenin, A. et al. Identification of 12 genetic loci associated with human healthspan. Commun. Biol. 2, 41 (2019).
Article PubMed PubMed Central Google Scholar
McCartney, D. L. et al. Genome-wide association studies identify 137 genetic loci for DNA methylation biomarkers of aging. Genome Biol. 22, 194 (2021).
Article CAS PubMed PubMed Central Google Scholar
Atkins, J. L. et al. A genome-wide association study of the frailty index highlights brain pathways in ageing. Aging Cell 20, e13459 (2021).
Article CAS PubMed PubMed Central Google Scholar
Palliyaguru, D. L., Moats, J. M., Di Germanio, C., Bernier, M. & de Cabo, R. Frailty index as a biomarker of lifespan and healthspan: focus on pharmacological interventions. Mech. Ageing Dev. 180, 42–48 (2019).
Article PubMed PubMed Central Google Scholar
Beard, J. R. et al. The World report on ageing and health: a policy framework for healthy ageing. Lancet 387, 2145–2154 (2016).
Article PubMed Google Scholar
Grotzinger, A. D. et al. Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat. Hum. Behav. 3, 513–525 (2019).
Article PubMed PubMed Central Google Scholar
Mallard, T. T. et al. Item-level genome-wide association study of the alcohol use disorders identification test in three population-based cohorts. Am. J. Psychiatry 179, 58–70 (2021).
Article PubMed PubMed Central Google Scholar
Karlsson Linnér, R. et al. Multivariate analysis of 1.5 million people identifies genetic associations with traits related to self-regulation and addiction. Nat. Neurosci. 24, 1367–1376 (2021).
Article PubMed Google Scholar
Sanderson, E. et al. Mendelian randomization. Nat. Rev. Methods Primers 2, 6 (2022).
Article CAS PubMed PubMed Central Google Scholar
Barzilai, N., Crandall, J. P., Kritchevsky, S. B. & Espeland, M. A. Metformin as a tool to target aging. Cell Metab. 23, 1060–1065 (2016).
Article CAS PubMed PubMed Central Google Scholar
Schmidt, A. F. et al. Genetic drug target validation using Mendelian randomisation. Nat. Commun. 11, 3255–3255 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wright, K. M. et al. A prospective analysis of genetic variants associated with human lifespan. G3-Genes Genom. Genet. 9, 2863–2878 (2019).
Article Google Scholar
Horvath, S. & Raj, K. DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nat. Rev. Genet. 19, 371–384 (2018).
Article CAS PubMed Google Scholar
Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 48, 245–252 (2016).
Article CAS PubMed PubMed Central Google Scholar
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
Article PubMed PubMed Central Google Scholar
Mancuso, N. et al. Probabilistic fine-mapping of transcriptome-wide association studies. Nat. Genet. 51, 675–682 (2019).
Article CAS PubMed PubMed Central Google Scholar
de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 11, e1004219 (2015).
Article PubMed PubMed Central Google Scholar
Mounier, N. & Kutalik, Z. Bias correction for inverse variance weighting Mendelian randomization. Genet. Epidemiol. 47, 314–331 (2023).
Article CAS PubMed Google Scholar
Zheng, J. et al. Evaluating the efficacy and mechanism of metformin targets on reducing Alzheimer’s disease risk in the general population: a Mendelian randomisation study. Diabetologia 10, 16641675 (2022).
Google Scholar
Finan, C. et al. The druggable genome and support for target identification and validation in drug development. Sci. Transl. Med. 9, eaag1166 (2017).
Article PubMed PubMed Central Google Scholar
Gromovsky, A. D. et al. Δ-5 fatty acid desaturase FADS1 impacts metabolic disease by balancing proinflammatory and proresolving lipid mediators. Arter. Thromb. Vasc. Biol. 38, 218–231 (2018).
Article CAS Google Scholar
Geng, L., Lam, K. S. L. & Xu, A. The therapeutic potential of FGF21 in metabolic diseases: from bench to clinic. Nat. Rev. Endocrinol. 16, 654–667 (2020).
Article CAS PubMed Google Scholar
Folkersen, L. et al. Genomic and drug target evaluation of 90 cardiovascular proteins in 30,931 individuals. Nat. Metab. 2, 1135–1148 (2020).
Article CAS PubMed PubMed Central Google Scholar
Grunewald, M. et al. Counteracting age-related VEGF signaling insufficiency promotes healthy aging and extends life span. Science 373, eabc8479 (2021).
Article CAS PubMed Google Scholar
Belser, M. & Walker, D. W. Role of prohibitins in aging and therapeutic potential against age-related diseases. Front. Genet. 12, 714228 (2021).
Article CAS PubMed PubMed Central Google Scholar
Swerdlow, D. I. et al. HMG-coenzyme A reductase inhibition, type 2 diabetes, and bodyweight: evidence from genetic analysis and randomised trials. Lancet 385, 351–361 (2015).
Article CAS PubMed PubMed Central Google Scholar
Porcher, R., Tran, V.-T., Blacher, J. & Ravaud, P. Potential of stratified medicine for high blood pressure management. Hypertension 74, 1420–1427 (2019).
Article CAS PubMed Google Scholar
Roth, G. A. et al. Global burden of cardiovascular diseases and risk factors, 1990–2019: update from the GBD 2019 study. J. Am. Coll. Cardiol. 76, 2982–3021 (2020).
Article PubMed PubMed Central Google Scholar
Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK biobank participants with those of the general population. Am. J. Epidemiol. 186, 1026–1034 (2017).
Article PubMed PubMed Central Google Scholar
United Nations DoEaSA, Population Division. World Population Prospects 2022: Summary of Results. UN DESA/POP/2022/TR/NO. 3 (United Nations, 2022).
Kennedy, B. K. & Pennypacker, J. K. Drugs that modulate aging: the promising yet difficult path ahead. Transl. Res. 163, 456–465 (2014).
Article PubMed Google Scholar
Yang, J.-H. et al. Loss of epigenetic information as a cause of mammalian aging. Cell. 186, 305–326.e327 (2023).
Article CAS PubMed Google Scholar
Yengo, L. et al. Meta-analysis of genome-wide association studies for height and body mass index in ~700000 individuals of European ancestry. Hum. Mol. Genet. 27, 3641–3649 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lee, J. J. et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat. Genet. 50, 1112–1121 (2018).
Article CAS PubMed PubMed Central Google Scholar
Evangelou, E. et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat. Genet. 50, 1412–1425 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kulkarni, A. S., Gubbi, S. & Barzilai, N. Benefits of metformin in attenuating the hallmarks of aging. Cell Metab. 32, 15–30 (2020).
Article CAS PubMed PubMed Central Google Scholar
Medicine AECo. Metformin in Longevity Study (MILES). NIH https://clinicaltrials.gov/ct2/show/NCT02432287 (accessed 5 July 2022).
Mohammed, I., Hollenberg, M. D., Ding, H. & Triggle, C. R. A critical review of the evidence that metformin is a putative anti-aging drug that enhances healthspan and extends lifespan. Front. Endocrinol. (Lausanne) 12, 718942 (2021).
Article PubMed Google Scholar
Fahy, G. M. et al. Reversal of epigenetic aging and immunosenescent trends in humans. Aging Cell 18, e13028 (2019).
Article CAS PubMed PubMed Central Google Scholar
Bratic, A. & Larsson, N. G. The role of mitochondria in aging. J. Clin. Invest. 123, 951–957 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wiklund, F. E. et al. Macrophage inhibitory cytokine-1 (MIC-1/GDF15): a new marker of all-cause mortality. Aging Cell 9, 1057–1064 (2010).
Article CAS PubMed Google Scholar
Tanaka, T. et al. Plasma proteomic signature of age in healthy humans. Aging Cell 17, e12799 (2018).
Article PubMed PubMed Central Google Scholar
Tavenier, J. et al. Association of GDF15 with inflammation and physical function during aging and recovery after acute hospitalization: a longitudinal study of older patients and age-matched controls. J. Gerontol. A Biol. Sci. Med. Sci. 76, 964–974 (2021).
Article CAS PubMed Google Scholar
Conte, M. et al. GDF15, an emerging key player in human aging. Ageing Res. Rev. 75, 101569 (2022).
Article CAS PubMed Google Scholar
Sturm, G. et al. OxPhos defects cause hypermetabolism and reduce lifespan in cells and in patients with mitochondrial diseases. Commun. Biol. 6, 22 (2023).
Article CAS PubMed PubMed Central Google Scholar
Rui, L. Energy metabolism in the liver. Compr. Physiol. 4, 177–197 (2014).
Article PubMed PubMed Central Google Scholar
Mortensen, G. L., Madsen, I. B., Kruse, C. & Bundgaard, H. Familial hypercholesterolaemia reduces the quality of life of patients not reaching treatment targets. Dan. Med. J. 63, A5224 (2016).
PubMed Google Scholar
Dicembrini, I., Giannini, S., Ragghianti, B., Mannucci, E. & Monami, M. Effects of PCSK9 inhibitors on LDL cholesterol, cardiovascular morbidity and all-cause mortality: a systematic review and meta-analysis of randomized controlled trials. J. Endocrinol. Invest. 42, 1029–1039 (2019).
Article CAS PubMed Google Scholar
Lawlor, D. A., Tilling, K. & Davey Smith, G. Triangulation in aetiological epidemiology. Int. J. Epidemiol. 45, 1866–1886 (2016).
PubMed Google Scholar
Group, S. R. A randomized trial of intensive versus standard blood-pressure control. New Engl. J. Med. 373, 2103–2116 (2015).
Article Google Scholar
Zhao, L. et al. Common genetic variants of the β2-adrenergic receptor affect its translational efficiency and are associated with human longevity. Aging Cell 11, 1094–1101 (2012).
Article CAS PubMed Google Scholar
Becker, L. A. et al. Therapeutic reduction of ataxin-2 extends lifespan and reduces pathology in TDP-43 mice. Nature 544, 367–371 (2017).
Article CAS PubMed PubMed Central Google Scholar
Park, H. G. et al. The role of fatty acid desaturase (FADS) genes in oleic acid metabolism: FADS1 Δ7 desaturates 11-20:1 to 7,11-20:2. Prostaglandins Leukot. Essent. Fatty Acids 128, 21–25 (2018).
Article CAS PubMed Google Scholar
Fadnes, L. T., Økland, J.-M., Haaland, Ø. A. & Johansson, K. A. Estimating impact of food choices on life expectancy: a modeling study. PLoS Med. 19, e1003889 (2022).
Article PubMed PubMed Central Google Scholar
Zhang, Y. et al. The starvation hormone, fibroblast growth factor-21, extends lifespan in mice. eLife 1, e00065 (2012).
Article PubMed PubMed Central Google Scholar
Rader, D. J. et al. LLF580, an FGF21 analog, reduces triglycerides and hepatic fat in obese adults with modest hypertriglyceridemia. J. Clin. Endocrinol. Metab. 107, e57–e70 (2022).
Article PubMed Google Scholar
Yang, R., Xu, A. & Kharitonenkov, A. Another kid on the block: long-acting FGF21 analogue to treat dyslipidemia and fatty liver. J. Clin. Endocrinol. Metab. 107, e417–e419 (2022).
Article PubMed Google Scholar
Gordillo-Marañón, M. et al. Validation of lipid-related therapeutic targets for coronary heart disease prevention using human genetics. Nat. Commun. 12, 6120 (2021).
Article PubMed PubMed Central Google Scholar
Ray, K. K. et al. Two phase 3 trials of inclisiran in patients with elevated LDL cholesterol. N. Engl. J. Med. 382, 1507–1519 (2020).
Article CAS PubMed Google Scholar
Liu, W. et al. Verapamil extends lifespan in Caenorhabditis elegans by inhibiting calcineurin activity and promoting autophagy. Aging (Albany NY) 12, 5300–5317 (2020).
Article CAS PubMed Google Scholar
Howlett, S. E., Rutenberg, A. D. & Rockwood, K. The degree of frailty as a translational measure of health in aging. Nat. Aging 1, 651–665 (2021).
Article PubMed Google Scholar
Griffith, G. J. et al. Collider bias undermines our understanding of COVID-19 disease risk and severity. Nat. Commun. 11, 5749 (2020).
Article CAS PubMed PubMed Central Google Scholar
Elwert, F. & Winship, C. Endogenous selection bias: the problem of conditioning on a collider variable. Annu. Rev. Sociol. 40, 31–53 (2014).
Article PubMed PubMed Central Google Scholar
Pyrkov, T. V. et al. Longitudinal analysis of blood markers reveals progressive loss of resilience and predicts human lifespan limit. Nat. Commun. 12, 2765 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pilling, L. C. et al. Common conditions associated with hereditary haemochromatosis genetic variants: cohort study in UK Biobank. Brit. Med. J. 364, k5222 (2019).
Article PubMed PubMed Central Google Scholar
Goeman, J. J. & Bühlmann, P. Analyzing gene expression data in terms of gene sets: methodological issues. Bioinformatics 23, 980–987 (2007).
Article CAS PubMed Google Scholar
Public Health England. Chapter 2: trends in mortality. Health Profile for England: 2018 (GOV.UK, 2018); https://www.gov.uk/government/publications/health-profile-for-england-2018/chapter-2-trends-in-mortality
Searle, S. D., Mitnitski, A., Gahbauer, E. A., Gill, T. M. & Rockwood, K. A standard procedure for creating a frailty index. BMC Geriatr. 8, 24 (2008).
Article PubMed PubMed Central Google Scholar
Savalei, V. & Bentler, P. M. A two-stage approach to missing data: theory and application to auxiliary variables. Struct. Equ. Modeling: A Multidisciplinary Journal 16, 477–497 (2009).
Article Google Scholar
Garmany, A., Yamada, S. & Terzic, A. Longevity leap: mind the healthspan gap. NPJ Regen. Med. 6, 57 (2021).
Article CAS PubMed PubMed Central Google Scholar
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
Article PubMed PubMed Central Google Scholar
Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–d1012 (2019).
Article CAS PubMed Google Scholar
Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Series B (Stat. Methodol.). 82, 1273–1300 (2020).
Article PubMed Google Scholar
Zou, Y., Carbonetto, P., Wang, G. & Stephens, M. Fine-mapping from summary data with the ‘Sum of Single Effects’ model. PLoS Genet. 18, e1010299 (2022).
Article CAS PubMed PubMed Central Google Scholar
Benner, C. et al. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics 32, 1493–1501 (2016).
Article CAS PubMed PubMed Central Google Scholar
Schilder, B. M., Humphrey, J. & Raj, T. echolocatoR: an automated end-to-end statistical and functional genomic fine-mapping pipeline. Bioinformatics 38, 536–539 (2022).
Article CAS PubMed Google Scholar
Feng, H. et al. Leveraging expression from multiple tissues using sparse canonical correlation analysis and aggregate tests improves the power of transcriptome-wide association studies. PLoS Genet. 17, e1008973 (2021).
Article CAS PubMed PubMed Central Google Scholar
Dall’Aglio, L., Lewis, C. M. & Pain, O. Delineating the genetic component of gene expression in major depression. Biol. Psychiatry 89, 627–636 (2021).
Article PubMed PubMed Central Google Scholar
Sobczyk, M. K., Gaunt, T. R. & Paternoster, L. MendelVar: gene prioritization at GWAS loci using phenotypic enrichment of Mendelian disease genes. Bioinformatics 37, 1–8 (2021).
Article CAS PubMed PubMed Central Google Scholar
Timshel, P. N., Thompson, J. J. & Pers, T. H. Genetic mapping of etiologic brain cell types for obesity. eLife 9, e55851 (2020).
Article CAS PubMed PubMed Central Google Scholar
Schaum, N. et al. Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris. Nature 562, 367–372 (2018).
Article PubMed Central Google Scholar
Skrivankova, V. W. et al. Strengthening the reporting of observational studies in epidemiology using mendelian randomisation (STROBE-MR): explanation and elaboration. Brit. Med. J. 375, n2233 (2021).
Article PubMed PubMed Central Google Scholar
Bosnes, I. et al. Lifestyle predictors of successful aging: a 20-year prospective HUNT study. PLoS ONE 14, e0219200 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hartmann, A. et al. Ranking biomarkers of aging by citation profiling and effort scoring. Front. Genet. 12, 686320 (2021).
Article PubMed PubMed Central Google Scholar
Burgess, S., Davies, N. M. & Thompson, S. G. Bias due to participant overlap in two-sample Mendelian randomization. Genet Epidemiol. 40, 597–608 (2016).
Article PubMed PubMed Central Google Scholar
Rena, G., Hardie, D. G. & Pearson, E. R. The mechanisms of action of metformin. Diabetologia 60, 1577–1585 (2017).
Article CAS PubMed PubMed Central Google Scholar
Madiraju, A. K. et al. Metformin suppresses gluconeogenesis by inhibiting mitochondrial glycerophosphate dehydrogenase. Nature 510, 542–546 (2014).
Article CAS PubMed PubMed Central Google Scholar
Gerstein, H. C. et al. Growth differentiation factor 15 as a novel biomarker for metformin. Diabetes Care 40, 280–283 (2017).
Article CAS PubMed Google Scholar
Mendez, D. et al. ChEMBL: towards direct deposition of bioassay data. Nucleic Acids Res. 47, D930–D940 (2019).
Article CAS PubMed Google Scholar
Neale-Lab. UK Biobank GWAS. UK Biobank http://www.nealelab.is/uk-biobank/ (2018).
Auton, A. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article PubMed Google Scholar
Chen, J. et al. The trans-ancestral genomic architecture of glycemic traits. Nat. Genet. 53, 840–860 (2021).
Article CAS PubMed PubMed Central Google Scholar
Willer, C. J. et al. Discovery and refinement of loci associated with lipid levels. Nat. Genet. 45, 1274–1283 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kuhn, M., von Mering, C., Campillos, M., Jensen, L. J. & Bork, P. STITCH: interaction networks of chemicals and proteins. Nucleic Acids Res. 36, D684–D688 (2008).
Article CAS PubMed Google Scholar
Griffith, M. et al. DGIdb: mining the druggable genome. Nat. Methods 10, 1209–1210 (2013).
Article CAS PubMed PubMed Central Google Scholar
Folkersen, L. et al. Genomic and drug target evaluation of 90 cardiovascular proteins in 30,931 individuals. Nat. Metab. 10, 1135–1148 (2020).
Article Google Scholar
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We want to acknowledge the participants and investigators of the many studies used in this research without whom this effort would not be possible. We also want to acknowledge the Medical Research Council Integrative Epidemiology Unit (MRC-IEU, University of Bristol, UK), especially the developers of the MRC-IEU UK Biobank GWAS Pipeline. G.D.S. is a member of the UK MRC-IEU, which is funded by the MRC (MC_UU_00011/1) and the University of Bristol. This work was supported by the National Institutes of Health intramural funding (ZIA-AA000242 to F.W.L.) and the Division of Intramural Clinical and Biological Research of the Natiocholism. Figure 1 and Extended Data Figs. 5, 6 and 7 were created using BioRender.com.

Author information

Authors and Affiliations

Section on Clinical Genomics and Experimental Therapeutics, National Institute on Alcohol Abuse and Alcoholism, National Institutes of Health, Bethesda, MD, USA
Daniel B. Rosoff, Lucas A. Mavromatis, Andrew S. Bell, Josephin Wagner, Jeesun Jung & Falk W. Lohoff
NIH-Oxford-Cambridge Scholars Program; Radcliffe Department of Medicine, University of Oxford, Oxford, UK
Daniel B. Rosoff
MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
Daniel B. Rosoff & George Davey Smith
Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
Riccardo E. Marioni
Department of Biostatistics, Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA, USA
Steve Horvath
Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
Steve Horvath
San Diego Institute of Science, Alto Labs, San Diego, CA, USA
Steve Horvath

Authors

Daniel B. Rosoff
View author publications
You can also search for this author in PubMed Google Scholar
Lucas A. Mavromatis
View author publications
You can also search for this author in PubMed Google Scholar
Andrew S. Bell
View author publications
You can also search for this author in PubMed Google Scholar
Josephin Wagner
View author publications
You can also search for this author in PubMed Google Scholar
Jeesun Jung
View author publications
You can also search for this author in PubMed Google Scholar
Riccardo E. Marioni
View author publications
You can also search for this author in PubMed Google Scholar
George Davey Smith
View author publications
You can also search for this author in PubMed Google Scholar
Steve Horvath
View author publications
You can also search for this author in PubMed Google Scholar
Falk W. Lohoff
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.B.R. and F.W.L. designed the study. D.B.R. performed the analyses. D.B.R. and F.W.L. interpreted the data. D.B.R. drafted the manuscript. F.W.L. supervised the study. D.B.R., F.W.L., L.A.M., A.S.B., J.W., J.J., R.E.M., G.D.S. and S.H. critically revised the study for important intellectual content and approved the final manuscript.

Corresponding author

Correspondence to Falk W. Lohoff.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Aging thanks Luke Pilling and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Gene-level Manhattan plot of transcriptome-wide association study.

TWAS Z-scores (effect estimates of the imputed gene expression on mvAge) are plotted (mvAge N = 1,958,774). The expression quantitative trait loci (eQTL) data are derived from the GTEx Version 8 sparse canonical correlation analysis performed by Feng et al. 2021 (see References). The blue lines represent Z = ± 4.69, corresponding to the Bonferroni adjusted P-value threshold = 1.38 × 10⁻⁶ (0.05/36,149 sCCA features available for analysis). Red points and labels indicate genes (Ensembl gene IDs) surpassing the threshold. TWAS, transcriptome-wide association study; sCCA, sparse canonical correlation analysis.

Source data

Extended Data Fig. 2 Cis-instrument Mendelian randomization assessing the impact of protein coding genes on mvAge through their associations with HDL-C, LDL-C, triglycerides, and SBP.

Presented is a volcano plot of the Z scores (versus the negative log₁₀(P-value)) of the MR estimates (beta/se) for the inverse variance weighted (IVW) MR method aligned to proxy the pharmacological effect of increased HDL-C (N = 403,943), lowered LDL-C (N = 440,546), lowered TG (N = 441,016), or lowered SBP (N = 436,419) on mvAge (N = 1,958,774). The dotted line indicates the Bonferroni corrected P-value threshold (1.92 × 10⁻³). Labeled genes are those with beneficial estimates on mvAge that surpass Bonferroni corrected P-value threshold and align with increased HDL-C, lowered LDL-C, lowered TG, or lowered SBP. HDL-C, high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; TG, triglycerides; SBP, systolic blood pressure; MR, Mendelian randomization.

Source data

Extended Data Fig. 3 Protein-protein and protein-chemical interaction for protein coding genes located near the lead SNPs of the HDL-C, LDL-C, triglycerides, and SBP GWASs associated with mvAge in the cis-instrument Mendelian randomization.

Analyses were performed using STITCH (http://stitch.embl.de/). Results plotted are STITCH PPI interaction scores. The combined scores are computed by calculating the probabilities from the STITCH database sources of evidence and correcting by the probabilities of randomly observing interactions between the proteins and chemicals (see ref. ⁹⁹ in the Reference list for additional information). Stronger associations are annotated with thicker lines. Protein-protein interactions are represented by grey lines, protein-chemical interactions are represented by green lines, and chemical-chemical interactions are represented by red lines. Per STITCH guidelines, ‘high confidence’ interactions are considered those with combined scores ≥ 0.90. HDL-C, high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; TG, triglycerides; SBP, systolic blood pressure; MR, Mendelian randomization.

Source data

Extended Data Fig. 4 Cis-instrument Mendelian randomization analysis of 68 circulating proteins.

Data presented are MR effect estimates (betas) for the inverse variance weighted (IVW) MR method and corresponding 95% confidence intervals (CIs). The impact of 68 circulating proteins on mvAge (N = 1,958,774) was analyzed using protein quantitative trait loci (pQTL) data derived from 30,391 participants of European ancestry in the SCALLOP Consortia dataset (http://www.olink-improve.com/). We used pQTLs associated with the respective plasma protein at P-value < 5 × 10⁻⁸ within or near the cis-acting locus of the target gene boundary, that is, 100 kilobases on either side of the respective gene boundary. Extracted SNPs were clumped at the LD R² ≤ 0.2 threshold (250 kb) using the 1000 Genomes Phase 3 European reference population and MR IVW (random-effects analysis when there were more than three variants) and MR Egger implemented accounting for correlation between the instrument variants. MR, Mendelian randomization.

Source data

Extended Data Fig. 5 Mendelian randomization model overview (directed acyclic graph).

B₂ is the genetic association of interest, estimated by B₂ = B₁/ B_3. B₁ and B₃ represent the estimated MR association of the genetic variants on the exposure and the outcomes. MR assumes that the genetic variants comprising the instrument for the exposure only impact the outcome of interest via the exposure and not directly, or via confounders (dotted lines). MR, Mendelian randomization.

Extended Data Fig. 6 Drug-target Mendelian randomization analysis overview of anti-diabetics, lipid-modulating targets, and antihypertensives.

Diagram depicts flow diagram and details of the drug-target MR analyses of the cardiometabolic targets on mvAge (N = 1,958,774). Prior to step 1, colocalization analysis was employed to prioritize protein-coding targets for the screen. See Methods and Supplementary Methods for details on selection and identification of the individual targets in the three broad drug classes (anti-diabetics, lipid-modulating targets, and antihypertensive). In step 2, cis-instrumentation was performed using genome-wide association study (GWAS) of biomarkers that are the primary indications of pharmacological modulation of these targets. For antidiabetics, GWAS data of HbA1c (N = 344,182) was used; for lipid-modulating targets, several lipid subfractions including LDL-C (N = 440,546), triglycerides (N = 441,016), and HDL-C (N = 403,943) were used; and for antihypertensives, GWAS data of SBP (N = 436,419). Independent variants (LD R² < 0.2) at P-values < 5 × 10⁻⁸) were extracted, and cis-instruments constructed for each target, which exposure variants were then extracted from the mvAge GWAS (outcome), harmonized, and then analyzed using multiple MR methods (steps 3 and 4). See Methods and Supplementary Methods for further details. MR, Mendelian randomization; LD, linkage disequilibrium. LDL-C, low-density lipoprotein cholesterol; HDL-C, high-density lipoprotein cholesterol; TG, triglycerides; SBP, systolic blood pressure.

Extended Data Fig. 7 Analysis overview of cis-instrument Mendelian randomization screen of protein coding genes associated with either HbA1c, LDL-C, HDLC, triglycerides, or SBP.

Results show number of protein-coding genes including in the stages of the colocalization and cis-instrument MR screens. Exposure genome-wide association study data for each of the biomarkers came from HbA1c (N = 344,182); several lipid subfractions including LDL-C (N = 440,546), triglycerides (N = 441,016), and HDL-C (N = 403,943); and SBP (N = 436,419). Outcome data was mvAge (N = 1,958,774). HbA1c, glycated hemoglobin; HDL-C, high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; TG, triglycerides; SBP, systolic blood pressure; MR, Mendelian randomization.

Supplementary information

Supplementary Information

Table of Contents, Supplementary Results, Discussion, Methods, STROBE Checklist, References, Figs. 1–33 and table captions.

Reporting Summary

Supplementary Table

Supplementary Tables 1-50

Source data

Source Data Fig. 2

Excel file containing data underlying Fig. 2.

Source Data Fig. 3

Excel file containing data underlying Fig. 3.

Source Data Fig. 4

Excel file containing data underlying Fig. 4.

Source Data Fig. 5

Excel file containing data underlying Fig. 5.

Source Data Extended Data Fig. 1

Excel file containing data underlying Extended Data Fig. 1.

Source Data Extended Data Fig.2

Excel file containing data underlying Extended Data Fig. 2.

Source Data Extended Data Fig.3

Excel file containing data underlying Extended Data Fig. 3.

Source Data Extended Data Fig.4

Excel file containing data underlying Extended Data Fig. 4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rosoff, D.B., Mavromatis, L.A., Bell, A.S. et al. Multivariate genome-wide analysis of aging-related traits identifies novel loci and new drug targets for healthy aging. Nat Aging 3, 1020–1035 (2023). https://doi.org/10.1038/s43587-023-00455-5

Download citation

Received: 01 August 2022
Accepted: 07 June 2023
Published: 07 August 2023
Issue Date: August 2023
DOI: https://doi.org/10.1038/s43587-023-00455-5

Subjects

Abstract

Similar content being viewed by others

Main

Results

Structural equation modeling

Multivariate GWAS meta-analysis

Fine mapping

Q SNP heterogeneity

Transcriptomic imputation

Exploratory two-factor analysis

Pathway, cell-type and Mendelian-disease-gene enrichment

MR with modifiable risk factors and biomarkers

Metformin target genes impact mvAge

Genetic impact of targets for cardiometabolic drug classes

Cis-instrument MR prioritizes potential mvAge targets

Discussion

Conclusions

Methods

Data sources

Sample overlap

Genomic SEM

Q SNP heterogeneity

Exploratory two-factor model analysis

Defining genomic loci and determining novel variants

Fine mapping and transcriptomic imputation

Gene-set and disease ontology enrichment

Cell-type enrichment

Mendelian randomization

Polygenic MR

Drug-target MR of metformin gene targets

Drug-target MR of cardiometabolic drug classes

Drug-target MR methods

Cis-instrument MR screen of cardiometabolic genes

Proxying circulating proteins

Statistics and reproducibility

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links

Q _SNP heterogeneity

Q _SNP heterogeneity