Common genetic associations between age-related diseases

Dönertaş, Handan Melike; Fabian, Daniel K.; Fuentealba, Matías; Partridge, Linda; Thornton, Janet M.

doi:10.1038/s43587-021-00051-5

Analysis
Published: 08 April 2021

Nature Aging volume 1, pages 400–412 (2021)

Abstract

Age is a common risk factor in many diseases, but the molecular basis for this relationship is elusive. In this study we identified four disease clusters from 116 diseases in UK Biobank data, defined by their age-of-onset profiles, and found that diseases with the same onset profile are genetically more similar, suggesting a common etiology. This similarity was not explained by disease categories, co-occurrences or disease cause–effect relationships. Two of the four disease clusters had an increased risk of occurrence from ages 20 and 40 years, respectively. They both showed an association with known aging-related genes, yet differed in functional enrichment and evolutionary profiles. Moreover, they both had age-related expression and methylation changes. We also tested mutation accumulation and antagonistic pleiotropy theories of aging and found support for both.

**Fig. 1: Age-of-onset profiles clustered by the PAM algorithm, using dissimilarities calculated with temporal correlation measure (CORT).**

**Fig. 2: Genetic similarities and mediated pleiotropy across diseases.**

**Fig. 3: Enrichment of disease-associated genes in known longevity modulators and GO categories.**

**Fig. 4: Risk allele frequencies for diseases associated with different age-of-onset clusters.**
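The Fig. 1 caption refers to dissimilarities based on the temporal correlation measure (CORT). As a rough illustration only, the sketch below computes the temporal correlation coefficient between two series and the adaptive dissimilarity described by Chouakria and Nagabhushan. The Euclidean base distance, the tuning value `k = 2` and the example profiles are assumptions made for this sketch, not details confirmed by the text above; the clustering step (PAM) is not shown.

```python
import math


def cort(x, y):
    """Temporal correlation coefficient between two equal-length series.

    Compares the signs and sizes of successive differences; values near +1
    mean the series rise and fall together, values near -1 the opposite.
    """
    dx = [b - a for a, b in zip(x, x[1:])]
    dy = [b - a for a, b in zip(y, y[1:])]
    num = sum(p * q for p, q in zip(dx, dy))
    den = math.sqrt(sum(p * p for p in dx)) * math.sqrt(sum(q * q for q in dy))
    return num / den if den else 0.0


def cort_dissimilarity(x, y, k=2.0):
    """Adaptive dissimilarity: a base distance scaled by phi_k(CORT).

    Assumption: Euclidean distance on raw values as the base distance,
    and phi_k(u) = 2 / (1 + exp(k * u)) as the tuning function.
    """
    euclid = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))
    phi = 2.0 / (1.0 + math.exp(k * cort(x, y)))
    return phi * euclid


# Hypothetical incidence-by-age profiles (illustrative values only).
rising_a = [0.1, 0.2, 0.4, 0.8]
rising_b = [0.2, 0.3, 0.5, 0.9]
falling = [0.9, 0.5, 0.3, 0.2]

d_similar = cort_dissimilarity(rising_a, rising_b)
d_opposite = cort_dissimilarity(rising_a, falling)
```

In this toy example, the two rising profiles receive a smaller dissimilarity than the rising and falling pair, because the tuning function shrinks the base distance when the series change in the same direction. A matrix of such pairwise values could then be passed to any medoid-based clustering routine.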

Data availability

The primary data source used in the study was the UKBB resource²⁰, which requires an application for access (https://www.ukbiobank.ac.uk/). This study was conducted under application number 30688. The UKBB GWAS summary statistics provided by the Neale laboratory were downloaded for Townsend Deprivation Index and diet regimes (http://www.nealelab.is/uk-biobank). GTEx v.8 eQTL and expression data were accessed on 20 October 2020 via the GTEx data portal (https://www.gtexportal.org/home/datasets)³³. The GWAS Catalog v.1.0.2 e96 (ref. ⁷⁰) dataset was accessed on 30 July 2019 via https://www.ebi.ac.uk/gwas/docs/file-downloads. The gene lists available in ‘Human Ageing Genomic Resources’³⁴,³⁵ were downloaded from https://genomics.senescence.info/download.html, and CellAge data were kindly made available by Avelar et al.³⁶ before the data release on 2 October 2019. We accessed ChEMBL (https://www.ebi.ac.uk/chembl/)⁷³ and PubChem (https://pubchem.ncbi.nlm.nih.gov/)⁷¹ using their APIs, and used UniChem (https://www.ebi.ac.uk/unichem/)⁷² mappings to map PubChem CIDs to ChEMBL IDs. DGIdb (https://www.dgidb.org/)⁷⁴ was used to compile drug–target gene interactions. Results of the age-related methylation studies by Adelman et al.³⁷ and Marttila et al.³⁸ were downloaded as article supplementary files. We accessed 1000 Genomes Project⁴¹ allele frequencies using the VCF file provided on the 1000 Genomes Project website (https://www.internationalgenome.org/data).

The full set of GWAS results from this study can be accessed using BioStudies (S-BSST407) and all other results generated in the analysis are provided as Supplementary Datasets and Tables.

Code availability

BOLT-LMM (v.2.3.2) (https://data.broadinstitute.org/alkesgroup/BOLT-LMM/)⁶⁰, PLINK (v.1.90b6.4) (https://www.cog-genomics.org/plink/)⁶¹ and VarMap (https://www.ebi.ac.uk/thornton-srv/databases/VarMap)⁶² software were used. All remaining analyses were performed using R⁸¹ (v.3.5.0) (https://cran.r-project.org/), using RStudio IDE (v.1.1.453) (https://rstudio.com/). The following R packages were used: TSclust⁵⁵ (v.1.2.4), cluster (v.2.0.7.1), HDL³¹ (v.1.3.8) (https://github.com/zhenin/HDL/), VariantAnnotation⁶³ (v.1.28.13), TxDb.Hsapiens.UCSC.hg19.knownGene⁶⁴ (v.3.2.2), GenomicRanges⁶⁵ (v.1.32.3), biomaRt⁶⁶ (v.2.36.1), RCurl⁸² (v.1.98.1.2), jsonlite⁸³ (v.1.7.1), rtracklayer⁶⁷ (v.1.40.3), liftOver⁶⁸ (v.1.12.0), goseq⁷⁵ (v.1.40.0), preprocessCore⁸⁰ (v.1.50.0) and LCV³² method (https://github.com/lukejoconnor/LCV) implemented in R. The following packages were used for data handling: tidyverse⁸⁴ (v.1.3.0) and data.table⁸⁵ (v.1.12.4). The following packages were used for data visualization: igraph⁸⁶ (v.1.2.1), ggnetwork⁸⁷ (v.0.5.8), ggforce⁸⁸ (v.0.2.2.9000), ggpubr⁸⁹ (v.0.4.0), ggrepel⁹⁰ (v.0.8.2), GGally⁹¹ (v.2.0.0), RColorBrewer⁹² (v.1.1.2), scales⁹³ (v.1.1.1), ggthemes⁹⁴ (v.4.2.0) and pheatmap⁹⁵ (v.1.0.12).

All other analyses were performed using custom code written in bash (v.4.2) or R (v.3.5.0), which is available on GitHub at https://github.com/mdonertas/ukbb_ageonset.

References

López-Otín, C., Blasco, M. A., Partridge, L., Serrano, M. & Kroemer, G. The hallmarks of aging. Cell 153, 1194–1217 (2013).
Crimmins, E. M. Lifespan and healthspan: past, present, and promise. Gerontologist 55, 901–911 (2015).
Partridge, L., Deelen, J. & Slagboom, P. E. Facing up to the global challenges of ageing. Nature 561, 45–56 (2018).
Niccoli, T. & Partridge, L. Ageing as a risk factor for disease. Curr. Biol. 22, R741–R752 (2012).
Flatt, T. & Partridge, L. Horizons in the evolution of aging. BMC Biol. 16, 93 (2018).
Medvedev, Z. A. An attempt at a rational classification of theories of ageing. Biol. Rev. Camb. Philos. Soc. 65, 375–398 (1990).
Medawar, P. B. Unsolved problem of biology. Med. J. Aust. 1, 854–855 (1953).
Williams, G. C. Pleiotropy, natural selection, and the evolution of senescence. Evolution 11, 398–411 (1957).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
Pickrell, J. K. et al. Detection and interpretation of shared genetic influences on 42 human traits. Nat. Genet. 48, 709–717 (2016).
Cross-Disorder Group of the Psychiatric Genomics Consortium et al. Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs. Nat. Genet. 45, 984–994 (2013).
Cortes, A., Albers, P. K., Dendrou, C. A., Fugger, L. & McVean, G. Identifying cross-disease components of genetic risk across hospital data in the UK Biobank. Nat. Genet. 52, 126–134 (2020).
Solovieff, N., Cotsapas, C., Lee, P. H., Purcell, S. M. & Smoller, J. W. Pleiotropy in complex traits: challenges and strategies. Nat. Rev. Genet. 14, 483–495 (2013).
Ellinghaus, D. et al. Analysis of five chronic inflammatory diseases identifies 27 new associations and highlights disease-specific patterns at shared loci. Nat. Genet. 48, 510–518 (2016).
Parkes, M., Cortes, A., van Heel, D. A. & Brown, M. A. Genetic insights into common pathways and complex relationships among immune-mediated diseases. Nat. Rev. Genet. 14, 661–673 (2013).
Bien, S. A. & Peters, U. Moving from one to many: insights from the growing list of pleiotropic cancer risk genes. Br. J. Cancer 120, 1087–1089 (2019).
Johnson, S. C., Dong, X., Vijg, J. & Suh, Y. Genetic evidence for common pathways in human age-related diseases. Aging Cell 14, 809–817 (2015).
Fernandes, M. et al. Systematic analysis of the gerontome reveals links between aging and age-related diseases. Hum. Mol. Genet. 25, 4804–4818 (2016).
Wang, J., Zhang, S., Wang, Y., Chen, L. & Zhang, X.-S. Disease-aging network reveals significant roles of aging genes in connecting genetic diseases. PLoS Comput. Biol. 5, e1000521 (2009).
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Data-Field 41262 (UK Biobank, accessed 1 December 2020); https://biobank.ndph.ox.ac.uk/showcase/field.cgi?id=41262
Chouakria, A. D. & Nagabhushan, P. N. Adaptive dissimilarity index for measuring time series proximity. Adv. Data Anal. Classif. 1, 5–21 (2007).
Loh, P.-R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 47, 284–290 (2015).
Anderson, C. A. et al. Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564–1573 (2010).
Zhou, W. et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 50, 1335–1341 (2018).
Pe’er, I., Yelensky, R., Altshuler, D. & Daly, M. Estimation of the multiple testing burden for genomewide association studies of common variants. Nat. Preced. https://doi.org/10.1038/npre.2007.359.1 (2007).
Panagiotou, O. A. & Ioannidis, J. P. A. What should the genome-wide significance threshold be? Empirical replication of borderline genetic associations. Int. J. Epidemiol. 41, 273–286 (2012).
Human genome region MHC (Genome Reference Consortium, accessed June 2019); https://www.ncbi.nlm.nih.gov/grc/human/regions/MHC?asm=GRCh37
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Berisa, T. & Pickrell, J. K. Approximately independent linkage disequilibrium blocks in human populations. Bioinformatics 32, 283–285 (2016).
Ning, Z., Pawitan, Y. & Shen, X. High-definition likelihood inference of genetic correlations across human complex traits. Nat. Genet. https://doi.org/10.1038/s41588-020-0653-y (2020).
O’Connor, L. J. & Price, A. L. Distinguishing genetic correlation from causation across 52 diseases and complex traits. Nat. Genet. 50, 1728–1734 (2018).
Gamazon, E. R. et al. Using an atlas of gene regulation across 44 human tissues to inform complex disease- and trait-associated variation. Nat. Genet. 50, 956–967 (2018).
Tacutu, R. et al. Human ageing genomic resources: new and updated databases. Nucleic Acids Res. 46, D1083–D1090 (2018).
Barardo, D. et al. The DrugAge database of aging-related drugs. Aging Cell 16, 594–597 (2017).
Avelar, R. A. et al. A multidimensional systems biology analysis of cellular senescence in aging and disease. Genome Biol. 21, 91 (2020).
Adelman, E. R. et al. Aging human hematopoietic stem cells manifest profound epigenetic reprogramming of enhancers that may predispose to leukemia. Cancer Discov. 9, 1080–1101 (2019).
Marttila, S. et al. Ageing-associated changes in the human DNA methylome: genomic locations and effects on gene expression. BMC Genomics 16, 179 (2015).
Sun, L.-Q. et al. Growth retardation and premature aging phenotypes in mice with disruption of the SNF2-like gene, PASG. Genes Dev. 18, 1035–1046 (2004).
Rodríguez, J. A. et al. Antagonistic pleiotropy and mutation accumulation influence human senescence and disease. Nat. Ecol. Evol. 1, 55 (2017).
1000 Genomes Project Consortium et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Bitarello, B. D. et al. Signatures of long-term balancing selection in human genomes. Genome Biol. Evol. 10, 939–955 (2018).
Kosiol, C. et al. Patterns of positive selection in six mammalian genomes. PLoS Genet. 4, e1000144 (2008).
Nielsen, R. et al. A scan for positively selected genes in the genomes of humans and chimpanzees. PLoS Biol. 3, e170 (2005).
Shultz, A. J. & Sackton, T. B. Immune genes are hotspots of shared positive selection across birds and mammals. eLife 8, e41815 (2019).
Fisher, R. A. The Genetical Theory of Natural Selection (Clarendon Press, 1930).
Wolfson, M., Budovsky, A., Tacutu, R. & Fraifeld, V. The signaling hubs at the crossroad of longevity and age-related disease networks. Int. J. Biochem. Cell Biol. 41, 516–520 (2009).
Kanchi, K. L. et al. Integrated analysis of germline and somatic variants in ovarian cancer. Nat. Commun. 5, 3156 (2014).
Khurana, E. et al. Role of non-coding sequence variants in cancer. Nat. Rev. Genet. 17, 93–108 (2016).
Poduri, A., Evrony, G. D., Cai, X. & Walsh, C. A. Somatic mutation, genomic variation, and neurological disease. Science 341, 1237758 (2013).
Zhang, L. & Vijg, J. Somatic mutagenesis in mammals and its implications for human disease and aging. Annu. Rev. Genet. 52, 397–419 (2018).
Finkel, T., Serrano, M. & Blasco, M. A. The common biology of cancer and ageing. Nature 448, 767–774 (2007).
Sánchez-Valle, J. et al. Interpreting molecular similarity between patients as a determinant of disease comorbidity relationships. Nat. Commun. 11, 2854 (2020).
Gutiérrez-Sacristán, A. et al. comoRbidity: an R package for the systematic analysis of disease comorbidities. Bioinformatics 34, 3228–3230 (2018).
Montero, P. & Vilar, J. TSclust: an R package for time series clustering. J. Stat. Software, Articles 62, 1–43 (2014).
Berndt, D. J. & Clifford, J. Using dynamic time warping to find patterns in time series. In Proc. 3rd International Conference on Knowledge Discovery and Data Mining 359–370 (AAAI Press, 1994).
Kaufman, L. & Rousseeuw, P. J. in Finding Groups in Data Ch. 2 (John Wiley & Sons, Inc., 1990).
Maechler, M., Rousseeuw, P., Struyf, A., Hubert, M. & Hornik, K. cluster: cluster analysis basics and extensions. Version 2.0.7.1 (2019).
Tibshirani, R., Walther, G. & Hastie, T. Estimating the number of clusters in a data set via the gap statistic. J. R. Stat. Soc. B 63, 411–423 (2001).
Loh, P.-R. BOLT-LMM v2.3.1 User Manual https://data.broadinstitute.org/alkesgroup/BOLT-LMM/ (2017).
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Stephenson, J. D., Laskowski, R. A., Nightingale, A., Hurles, M. E. & Thornton, J. M. VarMap: a web tool for mapping genomic coordinates to protein sequence and structure and retrieving protein structural annotations. Bioinformatics https://doi.org/10.1093/bioinformatics/btz482 (2019).
Obenchain, V. et al. VariantAnnotation: a Bioconductor package for exploration and annotation of genetic variants. Bioinformatics 30, 2076–2078 (2014).
Carlson, M. TxDb.Hsapiens.UCSC.hg19.knownGene: annotation package for TxDb object(s). R package version 3.2.2 (Bioconductor Package Maintainer, 2015).
Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184–1191 (2009).
Lawrence, M., Gentleman, R. & Carey, V. rtracklayer: an R package for interfacing with genome browsers. Bioinformatics 25, 1841–1842 (2009).
liftOver: Changing genomic coordinate systems with rtracklayer::liftOver. Version 1.12.0 (Bioconductor Package Maintainer, 2020).
Lage, K. et al. A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes. Proc. Natl Acad. Sci. USA 105, 20870–20875 (2008).
Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
Kim, S. et al. PubChem 2019 update: improved access to chemical data. Nucleic Acids Res. 47, D1102–D1109 (2019).
Chambers, J. et al. UniChem: a unified chemical structure cross-referencing and identifier tracking system. J. Cheminform. 5, 3 (2013).
Gaulton, A. et al. The ChEMBL database in 2017. Nucleic Acids Res. 45, D945–D954 (2017).
Cotto, K. C. et al. DGIdb 3.0: a redesign and expansion of the drug–gene interaction database. Nucleic Acids Res. 46, D1068–D1073 (2018).
Young, M. D., Wakefield, M. J., Smyth, G. K. & Oshlack, A. Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol. 11, R14 (2010).
Ashburner, M. et al. Gene ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
Gene Ontology Consortium. The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res. 47, D330–D338 (2019).
Benjamini, Y. & Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29, 1165–1188 (2001).
GTEx Consortium. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Bolstad, B. preprocessCore: a collection of pre-processing functions. Version 1.50.0 (2020).
R Core Team. R: A Language and Environment for Statistical Computing. Version 3.5.0 (2018).
Temple Lang, D. RCurl: General Network (HTTP/FTP/…) Client Interface for R. Version 1.98.1.2 (2020).
Ooms, J. The jsonlite package: a practical and consistent mapping between JSON data and R objects. Preprint at arXiv https://arxiv.org/abs/1403.2805 (2014).
Wickham, H. et al. Welcome to the tidyverse. J. Open Source Softw. 4, 1686 (2019).
Dowle, M. & Srinivasan, A. data.table: extension of data.frame. Version 1.12.4 (2020).
Csardi, G. & Nepusz, T. The igraph software package for complex network research. Version 1.2.1 (2006).
Briatte, F. ggnetwork: geometries to plot networks with ‘ggplot2’. Version 0.5.8 (2020).
Pedersen, T. L. ggforce: accelerating ‘ggplot2’. Version 0.2.2.9000 (2020).
Kassambara, A. ggpubr: ‘ggplot2’ based publication ready plots. Version 0.4.0 (2020).
Slowikowski, K. ggrepel: automatically position non-overlapping text labels with ‘ggplot2’. Version 0.8.2 (2020).
Schloerke, B. et al. GGally: extension to ‘ggplot2’. Version 2.0.0 (2020).
Neuwirth, E. RColorBrewer: colorbrewer palettes. Version 1.1.2 (2014).
Wickham, H. & Seidel, D. scales: scale functions for visualization. Version 1.1.1 (2020).
Arnold, J. B. ggthemes: extra themes, scales and geoms for ‘ggplot2’. Version 4.2.0 (2019).
Kolde, R. pheatmap: pretty heatmaps. Version 1.0.12 (2019).

Acknowledgements

This research has been conducted using the UK Biobank Resource (application no. 30688). The authors thank the GWAS Catalog team for providing the list of studies using UK Biobank data; J. Stephenson and R. Laskowski for their help in running the VarMap tool; and M. Somel, S. Ozanne, P. Beltrao and W. Huber for fruitful discussions. This research was funded in whole, or in part, by the Wellcome Trust (098565/Z/12/Z). For the purpose of Open Access, the authors have applied a CC BY public copyright licence to any author accepted manuscript version arising from this submission. H.M.D., D.K.F., L.P. and J.M.T. were funded by this Wellcome Trust grant. The work was also supported by the European Molecular Biology Laboratory (J.M.T.), the EMBL International PhD Programme (H.M.D.) and Comisión Nacional de Investigación Científica y Tecnológica - Government of Chile (CONICYT scholarship; M.F.). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

Author information

Authors and Affiliations

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
Handan Melike Dönertaş, Daniel K. Fabian, Matías Fuentealba & Janet M. Thornton
Institute of Healthy Aging, Department of Genetics, Evolution and Environment, University College London, London, UK
Daniel K. Fabian, Matías Fuentealba & Linda Partridge
Max Planck Institute for Biology of Aging, Cologne, Germany
Linda Partridge

Authors

Handan Melike Dönertaş
Daniel K. Fabian
Matías Fuentealba
Linda Partridge
Janet M. Thornton

Contributions

H.M.D. conceived and designed the study with contributions from L.P. and J.M.T. H.M.D. analyzed the data with the help of D.K.F. and M.F.V. H.M.D. interpreted the results and wrote the manuscript with contributions from all authors. All authors read, revised and approved the final version of this manuscript.

Corresponding authors

Correspondence to Handan Melike Dönertaş or Janet M. Thornton.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Aging thanks Luke O’Connor, Luke Pilling and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Disease categories and co-occurrences.

a, Disease hierarchy for the 116 diseases included in the analysis. The nodes are colored by the disease categories as indicated in the legend. b, Disease co-occurrence matrix summarizing relative risk scores and correlations. Each row and column denotes a disease, ordered by hierarchical clustering of risk scores. The color is defined by the relative risk score, while the size is determined by the ϕ value, indicating the robustness of the association (see Methods). The diagonal tiles are colored by the UK Biobank’s disease hierarchy to visualize whether diseases from the same category cluster together. Associations are plotted for the 62 diseases that have at least one relative risk ratio of four or higher (log₂RR ≥ 2) or of one quarter or lower (log₂RR ≤ −2).
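The caption above relies on a relative risk score and a ϕ value whose exact definitions are given in the Methods, which are not reproduced here. As a hedged sketch only, the code below uses the definitions commonly applied in comorbidity network analyses (relative risk as observed over expected co-occurrence, and ϕ as the Pearson correlation of two binary indicators). Both formulas and all counts are assumptions for illustration and may differ from what the authors computed.

```python
import math


def relative_risk(n_both, n_i, n_j, n_total):
    """Observed co-occurrence divided by the count expected under independence."""
    return (n_both * n_total) / (n_i * n_j)


def phi_correlation(n_both, n_i, n_j, n_total):
    """Pearson correlation between two binary disease indicators."""
    num = n_both * n_total - n_i * n_j
    den = math.sqrt(n_i * n_j * (n_total - n_i) * (n_total - n_j))
    return num / den


# Hypothetical counts: cohort size, cases of disease i, cases of disease j,
# and participants with both diseases.
n_total, n_i, n_j, n_both = 10_000, 200, 100, 16

rr = relative_risk(n_both, n_i, n_j, n_total)
log2_rr = math.log2(rr)
phi = phi_correlation(n_both, n_i, n_j, n_total)

# A pair is extreme enough for the plot if |log2 RR| is at least 2.
plotted = abs(log2_rr) >= 2
```

With these made-up counts the pair co-occurs eight times more often than expected (log₂RR of 3), so it would pass the plotting threshold described in the caption, while ϕ stays small because both diseases are rare.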

Extended Data Fig. 2 Distribution of median age-of-onset across disease categories.

Points show diseases grouped by categories (individual boxplots). Categories are ordered by the median value of the median age-of-onset. The boxplots show the first and third quartiles, the median (dark line), and the whiskers extend from the quartiles to the last point in 1.5xIQR distance to the quartiles.
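The boxplot convention described above (quartiles, median, and whiskers reaching the last point within 1.5 × IQR of the quartiles) can be sketched in a few lines. This is an illustration only: the quantile method (`"inclusive"`) and the example ages are assumptions, not values from the study.

```python
import statistics


def boxplot_stats(values):
    """Quartiles plus whisker ends at the last points within 1.5 x IQR."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    # Whiskers stop at the most extreme observations still inside the fences;
    # anything beyond them would be drawn as an individual outlier point.
    inside = [v for v in values if low_fence <= v <= high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": min(inside),
        "whisker_high": max(inside),
        "outliers": sorted(v for v in values if v < low_fence or v > high_fence),
    }


# Hypothetical median ages of onset for the diseases in one category.
ages = [32, 42, 45, 48, 50, 52, 55, 58, 90]
stats = boxplot_stats(ages)
```

Here the quartiles are 45 and 55, so the fences sit at 30 and 70. The upper whisker therefore ends at 58, and the value 90 falls outside it.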

Extended Data Fig. 3 The number of significant variants across diseases, age-of-onset clusters, and disease categories.

a, Number of diseases for different number of significant variants (p_BOLT-LMM≤5e-8). Diseases with the highest number of associations (N≥10,000) are given as an inset table. b, Comparison of the number of significant associations (y-axis, on a log scale) across age-of-onset clusters (x-axis) (ANOVA after excluding cluster 4, p = 0.06). Since the y-axis is on a log scale, diseases with zero significant associations are not shown on the graph. c, The same as (b) but for disease categories. Categories are ordered by the median number of significant SNPs. The boxplots (b-c) show the first and third quartiles, the median (dark line), and the whiskers extend from the quartiles to the last point in 1.5xIQR distance to the quartiles.

Extended Data Fig. 4 The raw and corrected values of genetic similarities within and across age-of-onset clusters.

a, The difference between genetic similarity within and across age-of-onset clusters, calculated between 47 diseases. The y-axis shows the genetic similarity (see Methods). b, The same as (a), but the y-axis is corrected for disease category and co-occurrence using a linear model. This panel is the same as Figure 2b and is given here only for easier comparison. The boxplots show the first and third quartiles, the median (dark line), and the whiskers extend from the quartiles to the last point in 1.5xIQR distance to the quartiles. P-values are calculated using an F-test on a linear model between genetic similarity scores and different/same age-of-onset clusters for panel a, and including different/same disease category and disease co-occurrence (risk ratio) as covariates for panel b.

Extended Data Fig. 5 Genetic similarities calculated using the high-definition likelihood (HDL) inference method.

a, The correlation between the genetic similarity scores calculated using the SNP overlap-based odds ratio (x-axis) and HDL (y-axis). Blue points show the similarities calculated between diseases in different age of onset clusters and red points show the similarities calculated between diseases in the same age of onset cluster. The correlation coefficient and p-value are calculated using a two-sided Spearman correlation test. The linear regression line (blue) and 95% confidence interval (gray shaded area) are shown. b, The difference between genetic similarity within and across age-of-onset clusters, calculated between 59 diseases. Y-axis shows the genetic similarity calculated using HDL. The difference between different and same age clusters is tested using a two-sided Wilcoxon test. The boxplots show the first and third quartiles, the median (dark line), and the whiskers extend from the quartiles to the last point in 1.5xIQR distance to the quartiles.

Extended Data Fig. 6 The overlap between genes associated with selected aging-related traits and genes associated with diseases in different clusters.

The x-axis shows the log2 enrichment score, and the y-axis shows the age-of-onset clusters. The numbers of genes in each cluster (for both multidisease and multicategory genes) are given. The size of the points shows the statistical significance based on a one-sided permutation test (large points show nominal p-value ≤ 0.05, small ‘x’ indicates non-significant overlaps – none of the associations are significant after multiple testing correction), and the color shows different aging-related GWAS Catalog traits. The colored numbers near the points show the numbers of overlapping genes.
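To make the log2 enrichment score and one-sided permutation test in this caption more concrete, here is a minimal sketch. It assumes a simple null model in which random gene sets of the same size as the cluster set are drawn from a background list; the actual permutation scheme used in the study is described in its Methods and may differ. All gene identifiers and set sizes are hypothetical.

```python
import math
import random


def overlap_permutation_test(cluster_genes, trait_genes, background,
                             n_perm=2000, seed=0):
    """One-sided permutation test for the overlap of two gene sets.

    Returns the observed overlap, the log2 ratio of observed to mean
    permuted overlap, and the fraction of permutations (with a +1
    correction) whose overlap is at least as large as the observed one.
    """
    rng = random.Random(seed)
    cluster_genes = set(cluster_genes)
    trait_genes = set(trait_genes)
    background = sorted(background)  # fixed order keeps the seed reproducible
    observed = len(cluster_genes & trait_genes)

    null_overlaps = []
    for _ in range(n_perm):
        sample = rng.sample(background, len(cluster_genes))
        null_overlaps.append(len(trait_genes.intersection(sample)))

    expected = sum(null_overlaps) / n_perm
    p_value = (1 + sum(o >= observed for o in null_overlaps)) / (n_perm + 1)
    if observed and expected:
        log2_enrichment = math.log2(observed / expected)
    else:
        log2_enrichment = float("nan")
    return observed, log2_enrichment, p_value


# Hypothetical gene universe, trait gene set and cluster gene set.
background = [f"G{i}" for i in range(1000)]
trait_genes = [f"G{i}" for i in range(50)]
cluster_genes = [f"G{i}" for i in range(10)] + [f"G{i}" for i in range(500, 520)]

observed, log2_enrichment, p_value = overlap_permutation_test(
    cluster_genes, trait_genes, background
)
```

When many traits are tested in this way, the nominal p-values would still need a multiple testing correction, which is why the caption distinguishes nominal significance from corrected significance.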

Extended Data Fig. 7 Drug-target gene interaction network for the drugs specifically targeting multicategory genes in age-dependent clusters.

‘Drug-target gene’ interaction network for the drugs that specifically target multicategory cluster 1, cluster 2, or cluster ‘1 & 2’ genes, as determined by Fisher’s exact test. Blue diamonds show the drugs with a significant association or targeting only one gene in these gene groups. Diamonds without written names are drugs that are represented only by ChEMBL IDs in the datasets and have no names. Drug labels written in bold are drugs approved for different conditions. Circles represent the genes targeted by the significant hits, colored by their age-of-onset cluster. Gray circles show genes that are targeted by these drugs but are not in the gene set of interest.
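The caption states that drug specificity was determined by Fisher’s exact test. The sketch below shows one way such a test could be computed for a single drug from a 2 × 2 table of genes (targeted or not, in the gene set or not). The one-sided "greater" alternative, the table layout and the counts are assumptions for illustration; the caption does not specify them.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test for the table [[a, b], [c, d]].

    a: drug targets inside the gene set    b: drug targets outside it
    c: non-targets inside the gene set     d: non-targets outside it
    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1 = a + b          # all genes targeted by the drug
    col1 = a + c          # all genes in the set of interest
    n = a + b + c + d     # all genes considered
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k)
               for k in range(a, upper + 1))
    return tail / comb(n, col1)


# Hypothetical counts: 3 of the drug's 4 targets fall in a 4-gene set,
# out of 8 genes in total.
p_value = fisher_exact_greater(3, 1, 1, 3)
```

For this toy table the tail probability is 17/70, about 0.24, so the overlap would not be called significant. In practice the p-values across all drugs would also need a multiple testing correction.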

About this article

Cite this article

Dönertaş, H.M., Fabian, D.K., Fuentealba, M. et al. Common genetic associations between age-related diseases. Nat Aging 1, 400–412 (2021). https://doi.org/10.1038/s43587-021-00051-5

Received: 17 June 2020
Accepted: 26 February 2021
Published: 08 April 2021
Issue Date: April 2021
DOI: https://doi.org/10.1038/s43587-021-00051-5
