Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

A cross-population atlas of genetic associations for 220 human phenotypes

Abstract

Current genome-wide association studies do not yet capture sufficient diversity in populations and scope of phenotypes. To expand an atlas of genetic associations in non-European populations, we conducted 220 deep-phenotype genome-wide association studies (diseases, biomarkers and medication usage) in BioBank Japan (n = 179,000), by incorporating past medical history and text-mining of electronic medical records. Meta-analyses with the UK Biobank and FinnGen (ntotal = 628,000) identified ~5,000 new loci, which improved the resolution of the genomic map of human traits. This atlas elucidated the landscape of pleiotropy as represented by the major histocompatibility complex locus, where we conducted HLA fine-mapping. Finally, we performed statistical decomposition of matrices of phenome-wide summary statistics, and identified latent genetic components, which pinpointed responsible variants and biological mechanisms underlying current disease classifications across populations. The decomposed components enabled genetically informed subtyping of similar diseases (for example, allergic diseases). Our study suggests a potential avenue for hypothesis-free re-investigation of human diseases through genetics.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Overview of the identified loci in the cross-population meta-analyses of 220 deep-phenotype GWASs.
Fig. 2: Number of significant associations per variant.
Fig. 3: HLA and ABO association PheWAS.
Fig. 4: The deconvolution analysis of a matrix of summary statistics of 159 diseases across populations.
Fig. 5: Examples of disease-component correspondence and biological interpretation of the components by projection and enrichment analysis using GREAT.

Similar content being viewed by others

Data availability

The genotype data of BBJ used in this study are available from the Japanese Genotype-phenotype Archive (JGA) with accession codes JGAS000114/JGAD000123 and JGAS000114/JGAD000220, which can be accessed through application at https://humandbs.biosciencedbc.jp/en/hum0014-latest. The UKB analysis was conducted via application number 47821. The genotype and phenotype data can be accessed through application at https://www.ukbiobank.ac.uk. This study used the FinnGen release 3 data. Summary results can be accessed through application at https://www.finngen.fi/en/access_results. We provide downloadable full GWAS summary statistics with an interactive visualization of Manhattan, LocusZoom and PheWAS plots at our PheWeb.jp website (https://pheweb.jp/). The summary statistics of GWASs in this study (BioBank Japan, European and cross-population meta-analyses) are also deposited at the National Bioscience Database Center (NBDC) Human Database (https://humandbs.biosciencedbc.jp/en/) with the accession code hum0197, and the GWAS Catalog (https://www.ebi.ac.uk/gwas/) with the study accession IDs from GCST90018563 (https://www.ebi.ac.uk/gwas/studies/GCST90018563) to GCST90019002 (https://www.ebi.ac.uk/gwas/studies/GCST90019002) (full IDs are described in the Supplementary Notes). The summary statistics of metabolite GWASs in the Japanese population (Tohoku Medical Megabank Organization) which we used for decomposition–projection analysis are available at https://jmorp.megabank.tohoku.ac.jp/202102/gwas/TGA000005. We used gnomAD database (https://gnomad.broadinstitute.org/) to refer to the allele frequencies.

Code availability

We used publicly available software for the analyses. The software used is listed and described in the Methods section of our manuscript.

References

  1. Berger, D. A brief history of medical diagnosis and the birth of the clinical laboratory. Part 1—ancient times through the 19th century. MLO Med. Lab. Obs. 31, 28–30 (1999).

    CAS  PubMed  Google Scholar 

  2. Organización Mundial de la Salud. International Statistical Classification of Diseases and Related Health Problems, 10th revision (ICD-10) (World Health Organization, 2016).

  3. Denny, J. C. et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat. Biotechnol. 31, 1102–1110 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Welter, D. et al. The NHGRI GWAS Catalog, a curated resource of SNP–trait associations. Nucleic Acids Res. 42, D1001–D1006 (2014).

    Article  CAS  PubMed  Google Scholar 

  5. Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations. Bioinformatics 26, 1205–1210 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Claussnitzer, M. et al. A brief history of human disease genetics. Nature 577, 179–189 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584–591 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Tanigawa, Y. et al. Components of genetic associations across 2,138 phenotypes in the UK Biobank highlight adipocyte biology. Nat. Commun. 10, (2019).

  9. Nagai, A. et al. Overview of the BioBank Japan project: study design and profile. J. Epidemiol. 27, S2–S8 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  10. Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).

    CAS  PubMed  PubMed Central  Google Scholar 

  11. Ritchie, M. D. et al. Robust replication of genotype–phenotype associations across multiple diseases in an electronic medical record. Am. J. Hum. Genet. 86, 560–572 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Ishigaki, K. et al. Large-scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases. Nat. Genet. 52, 669–679 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Kanai, M. et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat. Genet. 50, 390–400 (2018).

    Article  CAS  PubMed  Google Scholar 

  14. Akiyama, M. et al. Characterizing rare and low-frequency height-associated variants in the Japanese population. Nat. Commun. 10, 4393 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Zhou, W. et al. Efficiently controlling for case–control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 50, 1335–1341 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Loh, P. R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 47, 284–290 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Bulik-Sullivan, B. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Hagiya, H. et al. Trends in incidence and mortality of tuberculosis in Japan: a population-based study, 1997-2016. Epidemiol. Infect. 147, e38 (2019).

    Article  Google Scholar 

  20. WHO. Global Tuberculosis Report. https://apps.who.int/iris/bitstream/handle/10665/336069/9789240013131-eng.pdf (2020).

  21. Wu, Y. et al. Genome-wide association study of medication-use and associated disease in the UK Biobank. Nat. Commun. 10, 1891 (2019).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  22. Gagliano Taliun, S. A. et al. Exploring and visualizing large-scale genetic associations by using PheWeb. Nat. Genet. 52, 550–552 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Watanabe, K. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 51, 1339–1348 (2019).

    Article  CAS  PubMed  Google Scholar 

  24. Canela-Xandri, O., Rawlik, K. & Tenesa, A. An atlas of genetic associations in UK Biobank. Nat. Genet. 50, 1593–1599 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Pendergrass, S. A. et al. A phenome-wide association study (PheWAS) in the Population Architecture using Genomics and Epidemiology (PAGE) study reveals potential pleiotropy in African Americans. PLoS ONE 14, e0226771 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Verma, A. et al. PheWAS and beyond: the landscape of associations with medical diagnoses and clinical measures across 38,662 individuals from Geisinger. Am. J. Hum. Genet. 102, 592–608 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Field, Y. et al. Detection of human adaptation during the past 2000 years. Science 354, 760–764 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Okada, Y. et al. Deep whole-genome sequencing reveals recent selection signatures linked to evolution and disease risk of Japanese. Nat. Commun. 9, 1631 (2018).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  29. Qi, H. et al. Discovery of susceptibility loci associated with tuberculosis in Han Chinese. Hum. Mol. Genet. 26, 4752–4763 (2017).

    Article  CAS  PubMed  Google Scholar 

  30. Sveinbjornsson, G. et al. HLA class II sequence variants influence tuberculosis risk in populations of European ancestry. Nat. Genet. 48, 318–322 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Baldini, C., Ferro, F., Mosca, M., Fallahi, P. & Antonelli, A. The association of Sjögren syndrome and autoimmune thyroid disorders. Front. Endocrinol. 9, 121 (2018).

    Article  Google Scholar 

  32. Nakao, M. et al. ABO blood group alleles and the risk of pancreatic cancer in a Japanese population. Cancer Sci. 102, 1076–1080 (2011).

    Article  CAS  PubMed  Google Scholar 

  33. Edgren, G. et al. Risk of gastric cancer and peptic ulcers in relation to ABO blood type: a cohort study. Am. J. Epidemiol. 172, 1280–1285 (2010).

    Article  PubMed  Google Scholar 

  34. Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Anttila, V. et al. Analysis of shared heritability in common disorders of the brain. Science 360, eaap8757 (2018).

    Article  PubMed  CAS  Google Scholar 

  36. Shi, H., Mancuso, N., Spendlove, S. & Pasaniuc, B. Local genetic correlation gives insights into the shared genetic architecture of complex traits. Am. J. Hum. Genet. 101, 737–751 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Burren, O. S. et al. Informed dimension reduction of clinically-related genome-wide association. Preprint at bioRxiv https://www.biorxiv.org/content/10.1101/2020.01.14.905869v3 (2020).

  38. Chasman, D. I., Giulianini, F., Demler, O. V. & Udler, M. S. Pleiotropy-based decomposition of genetic risk scores: association and interaction analysis for type 2 diabetes and CAD. Am. J. Hum. Genet. 106, 646–658 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. McLean, C. Y. et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Meuleman, W. et al. Index and biological spectrum of human DNase I hypersensitive sites. Nature 584, 244–251 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. GTEx Consortium, F. et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).

    Article  PubMed Central  Google Scholar 

  42. Shin, S. Y. et al. An atlas of genetic influences on human blood metabolites. Nat. Genet. 46, 543–550 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Portincasa, P. & Wang, D. Q. H. Intestinal absorption, hepatic synthesis, and biliary secretion of cholesterol: where are we for cholesterol gallstone formation? Hepatology 55, 1313–1316 (2012).

    Article  CAS  PubMed  Google Scholar 

  44. Vessey, D. A. The biochemical basis for the conjugation of bile acids with either glycine or taurine. Biochem. J. 174, 621–626 (1978).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Coombs, R. R. A. & Gell, P. G. (eds) in Clinical Aspects of Immunology 317–337 (Blackwell Science, 1963).

  46. Stone, K. D., Prussin, C. & Metcalfe, D. D. IgE, mast cells, basophils, and eosinophils. J. Allergy Clin. Immunol. 125, S73 (2010).

    Article  PubMed  PubMed Central  Google Scholar 

  47. Kobayashi, K., Kaneda, K. & Kasama, T. Immunopathogenesis of delayed-type hypersensitivity. Microsc. Res. Tech. 53, 241–245 (2001).

    Article  CAS  PubMed  Google Scholar 

  48. Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).

    Article  CAS  PubMed  Google Scholar 

  49. Sakaue, S. & Okada, Y. GREP: Genome for REPositioning drugs. Bioinformatics 35, 3821–3823 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Sakaue, S. et al. Trans-biobank analysis with 676,000 individuals elucidates the association of polygenic risk scores of complex traits with human lifespan. Nat. Med. 26, 542–548 (2020).

    Article  CAS  PubMed  Google Scholar 

  51. McLaren, W. et al. The Ensembl Variant Effect Predictor. Genome Biol. 17, 122 (2016).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  52. Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Brown, B. C., Ye, C. J., Price, A. L. & Zaitlen, N. Transethnic genetic-correlation estimates from summary statistics. Am. J. Hum. Genet. 99, 76–88 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Raychaudhuri, S. et al. Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis. Nat. Genet. 44, 291–296 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Hirata, J. et al. Genetic and phenotypic landscape of the major histocompatibilty complex region in the Japanese population. Nat. Genet. 51, 470–480 (2019).

    Article  CAS  PubMed  Google Scholar 

  56. Jia, X. et al. Imputing amino acid polymorphisms in human leukocyte antigens. PLoS ONE 8, e64683 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Severe Covid-19 GWAS Group et al. Genomewide association study of severe Covid-19 with respiratory failure. N. Engl. J. Med. 383, 1522–1534 (2020).

  58. Zheng, J. et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics 33, 272–279 (2017).

    Article  CAS  PubMed  Google Scholar 

  59. Gazal, S. et al. Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nat. Genet. 49, 1421–1427 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Finucane, H. K. et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 50, 621–629 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Rüeger, S., McDaid, A. & Kutalik, Z. Evaluation and application of summary statistic imputation to discover new height-associated loci. PLoS Genet. 14, e1007371 (2018).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  63. Kuriyama, S. et al. The Tohoku Medical Megabank Project: design and mission. J. Epidemiol. 26, 493–511 (2016).

    Article  PubMed  Google Scholar 

  64. Tadaka, S. et al. JMorp: Japanese Multi Omics Reference Panel. Nucleic Acids Res. 46, D551–D557 (2018).

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We thank all the participants of BioBank Japan, UK Biobank and FinnGen. We thank K. Watanabe for her input in the analysis of phenotypic correlations and pleiotropy. This research was supported by the Tailor-Made Medical Treatment program (the BioBank Japan Project) of the Ministry of Education, Culture, Sports, Science, and Technology (MEXT), the Japan Agency for Medical Research and Development (AMED). The FinnGen project is funded by two grants from Business Finland (grant nos. HUS 4685/31/2016 and UH 4386/31/2016) and nine industry partners (AbbVie, AstraZeneca, Biogen, Celgene, Genentech, GSK, MSD, Pfizer and Sanofi). The following biobanks are acknowledged for collecting the FinnGen project samples: Auria Biobank (https://www.auria.fi/biopankki/), THL Biobank (https://thl.fi/fi/web/thl-biopank), Helsinki Biobank (https://www.terveyskyla.fi/helsinginbiopankki/), Northern Finland Biobank Borealis (https://www.ppshp.fi/Tutkimus-ja-opetus/Biopankki), Finnish Clinical Biobank Tampere (https://www.tays.fi/biopankki), Biobank of Eastern Finland (https://ita-suomenbiopankki.fi), Central Finland Biobank (https://www.ksshp.fi/fi-FI/Potilaalle/Biopankki), Finnish Red Cross Blood Service Biobank (https://www.bloodservice.fi/Research%20Projects/biobanking) and Terveystalo Biobank Finland (https://www.terveystalo.com/fi/Yritystietoa/Terveystalo-Biopankki/Biopankki/). S.S. was in part supported by the Mochida Memorial Foundation for Medical and Pharmaceutical Research, Kanae Foundation for the Promotion of Medical Science, Astellas Foundation for Research on Metabolic Disorders and the JCR Grant for Promoting Basic Rheumatology. M. Kanai was supported by a Nakajima Foundation Fellowship and the Masason Foundation. Y. Tanigawa is in part supported by a Funai Overseas Scholarship from the Funai Foundation for Information Technology and the Stanford University School of Medicine. M.A.R. is in part supported by the National Human Genome Research Institute (NHGRI) of the National Institutes of Health (NIH) under award no. R01HG010140, and an NIH Center for Multi- and Cross-population Mapping of Mendelian and Complex Diseases grant (no. 5U01 HG009080). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. Y.O. was supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI (grant nos. 19H01021, 20K21834) and AMED (grant nos. JP21km0405211, JP21ek0109413, JP21ek0410075, JP21gm4010006 and JP21km0405217), JST Moonshot R&D Grant Number JPMJMS2021, Takeda Science Foundation and the Bioinformatics Initiative of Osaka University Graduate School of Medicine, Osaka University.

Author information

Authors and Affiliations

Authors

Consortia

Contributions

S.S., M. Kanai and Y.O. conceived the study. S.S., M. Kanai, Y. Tanigawa., M.A.R. and Y.O. wrote the manuscript. S.S., M. Kanai, J.K., M. Kurki, T. Konuma, Kenichi Yamamoto, M.A., K. Ishigaki, Kazuhiko Yamamoto, Y. Kamatani, A.P., M.J.D. and Y.O. conducted GWAS data analysis. S.S., Y. Tanigawa and M.A.R. conducted statistical decomposition analysis. S.S., S.K., A.N., G.T. and Y.O. conducted metabolome analysis. A.S., K.S., W.O., K. Yamaji, K.T., S.A., Y. Takahashi, T.S., N.S., H.Y., S. Minami, S. Murayama, K. Yoshimori, S.N., D.O., M.H., A.M., Y. Koretsune, K. Ito, C.T., T.Y., I.K., T. Kadowaki, M.Y., Y.N., M. Kubo, Y.M., Kazuhiko Yamamoto and K.M. collected and managed samples and data. A.P. and M.J.D. coordinated collaboration with FinnGen.

Corresponding authors

Correspondence to Saori Sakaue, Koichi Matsuda or Yukinori Okada.

Ethics declarations

Competing interests

M.A.R. is on the SAB of 54Gene and the Computational Advisory Board for Goldfinch Bio and has advised BioMarin, Third Rock Ventures, MazeTx and Related Sciences. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. The remaining authors declare no competing interests.

Additional information

Peer review information Nature Genetics thanks Caroline Hayward, Marylyn Ritchie, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Overview of this study.

We performed 220 deep-phenotype GWASs in BioBank Japan, including 108 novel GWASs ever conducted in East Asian population. We performed trans-biobank meta-analyses with UK Biobank and FinnGen (ntotal = 628,000), resulting in discovery of 5,343 novel loci. All summary statistics are openly shared through pheweb.jp web portal. As downstream analyses, we performed (i) cross-population comparison of pleiotropy and genetic correlation, (ii) comprehensive HLA fine-mapping, and (iii) statistical decomposition of a matrix of summary statistics to gain insights into biology underlying current disease classifications, by incorporating functional genomics, metabolomics, and biomarker data.

Extended Data Fig. 2 Locus plots for representative loci.

(a) Regional association plots for Pulmonary Tuberculosis (PTB) in BBJ are shown. The lead variant (rs140780894) is colored in pink, and colors of other dots indicate linkage disequilibrium measure r2 with the lead variant. (b) Regional association plots for cholelithiasis in BBJ are shown. The lead variant (rs715) is colored in pink, and colors of other dots indicate linkage disequilibrium measure r2 with the lead variant. (c) Regional association plots for gastric diseases in BBJ at the PSCA locus in gastric ulcer, gastric cancer, and gastric polyp are shown. Rs2976397, which was a lead variant in gastric ulcer, is colored in pink, and colors of other dots indicate linkage disequilibrium measure r2 with the lead variant. (d) Regional association plots at the FUT3 locus in gall bladder polyp and cholelithiasis in BBJ are shown. Rs28362459, which was a lead variant in gall bladder polyp, is colored in pink, and colors of other dots indicate linkage disequilibrium measure r2 with the lead variant. (e) Regional association plots for urticaria in BBJ are shown. The lead variant (rs56043070) is colored in pink, and colors of other dots indicate linkage disequilibrium measure r2 with the lead variant. (f) Regional association plots for salicylic acids prescription in BBJ are shown. The lead variant (rs151193009) is colored in pink, and colors of other dots indicate linkage disequilibrium measure r2 with the lead variant.

Extended Data Fig. 3 The effect size correlation between BBJ GWAS and European GWAS.

The marginal effect sizes of genome-wide significant variants across traits in BBJ are compared with those in European GWAS. Each plot represents a variant, and is colored based on the significance in European GWAS as shown in the left top legend. Pearson’s correlation r and P value (two-sided) between BBJ GWAS and European GWAS are also shown in the legend.

Extended Data Fig. 4 Phenotypic correlation across 220 phenotypes in BBJ.

a. Heatmap of pair-wise phenotypic correlation matrix. The color of the cells indicates the value of correlation r as shown in a color scale at the bottom. The traits (rows and columns) were hierarchically clustered by hclust library in R. b. Silhouette score for clustering of closely related phenotypes with different number of clusters (Supplementary Notes).

Extended Data Fig. 5 The degree of pleiotropy in BBJ after accounting for phenotypic or genetic correlations.

The Manhattan-like plots show the number of significant associations (P < 5.0×10−8) at each tested genetic variant in Japanese. a. For all traits (ntrait = 220; as shown in Fig. 2a). b. After accounting for phenotypic correlations. c. After accounting for genetic correlations.

Extended Data Fig. 6 Genetic correlation matrices across populations.

The matrices describe pairwise genetic correlation rg in Japanese GWAS (a; n = 5,565) and in European GWAS (b; n = 10,878), which was estimated by bivariate LD score regression. A color of the cells indicates the value of rg as shown in a color scale at the bottom. The traits (rows and columns) were hierarchically clustered by hclust library in R, and trait domains are displayed as colored boxes (see Methods).

Extended Data Fig. 7 Network representation of the TSVD analysis.

Two-dimensional illustration of interconnection among 159 diseases and 40 latent components. Plots in blue indicate each trait’s statistics, and plots in pink indicate the latent components derived by TSVD. White lines represent the contribution of each phenotype in each component. The width of the lines indicates the strength of the contribution based on the squared cosine score.

Extended Data Fig. 8 Heatmap representation of squared cosine scores of diseases to components.

The components (rows) are shown from 1 (top) to 40 (bottom), and the diseases (columns) are sorted based on the contribution of each component to the disease based on the squared cosine score (from component 1 to 40). Each cell is colored based on the squared cosine score of a given trait to a given component, as shown in a color scale at the bottom right.

Extended Data Fig. 9 Enrichment analyses of genes explaining each component with tissue specificity.

A heatmap representation of the enrichment analyses of genes explaining each component with tissue-specific genes defined by GTEx expression profile (a) and regulatory vocabulary from ENCODE3 data (b). Each cell is colored based on Penrichment from Fisher’s exact tests to assess the enrichment of the genes comprising each component within each tissue-specific gene set as shown in a color scale at the bottom right.

Extended Data Fig. 10 Genetic variants analyzed in the three cohorts.

The Venn diagram showing the number of genetic variants analyzed in this study in each of the three cohorts (BBJ, UKB, and FinnGen) and overlapping variants across the cohorts.

Supplementary information

Supplementary Information

Supplementary Figs. 1 and 2, Supplementary Notes and Supplementary references

Reporting Summary

Supplementary Table 1

Supplementary Tables 1–12

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sakaue, S., Kanai, M., Tanigawa, Y. et al. A cross-population atlas of genetic associations for 220 human phenotypes. Nat Genet 53, 1415–1424 (2021). https://doi.org/10.1038/s41588-021-00931-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41588-021-00931-x

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing