Estimation of effect size distribution from genome-wide association studies and implications for future discoveries

Park, Ju-Hyun; Wacholder, Sholom; Gail, Mitchell H; Peters, Ulrike; Jacobs, Kevin B; Chanock, Stephen J; Chatterjee, Nilanjan

doi:10.1038/ng.610

Analysis
Published: 20 June 2010

Estimation of effect size distribution from genome-wide association studies and implications for future discoveries

Ju-Hyun Park¹,
Sholom Wacholder¹,
Mitchell H Gail¹,
Ulrike Peters²,
Kevin B Jacobs³,
Stephen J Chanock^1,3 &
…
Nilanjan Chatterjee¹

Nature Genetics volume 42, pages 570–575 (2010)Cite this article

10k Accesses
488 Citations
24 Altmetric
Metrics details

Subjects

Abstract

We report a set of tools to estimate the number of susceptibility loci and the distribution of their effect sizes for a trait on the basis of discoveries from existing genome-wide association studies (GWASs). We propose statistical power calculations for future GWASs using estimated distributions of effect sizes. Using reported GWAS findings for height, Crohn's disease and breast, prostate and colorectal (BPC) cancers, we determine that each of these traits is likely to harbor additional loci within the spectrum of low-penetrance common variants. These loci, which can be identified from sufficiently powerful GWASs, together could explain at least 15–20% of the known heritability of these traits. However, for BPC cancers, which have modest familial aggregation, our analysis suggests that risk models based on common variants alone will have modest discriminatory power (63.5% area under curve), even with new discoveries.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Nonparametric estimates for distributions of effect sizes for susceptibility loci.**

**Figure 2: Receiver operating characteristic curves for genetic risk models.**

Phenome-wide Mendelian randomisation analysis of 378,142 cases reveals risk factors for eight common cancers

Article Open access 25 March 2024

Molly Went, Amit Sud, … Richard Houlston

Boosting the power of genome-wide association studies within and across ancestries by using polygenic scores

Article 18 September 2023

Adrian I. Campos, Shinichi Namba, … Loic Yengo

Assessment of polygenic architecture and risk prediction based on common variants across fourteen cancers

Article Open access 03 July 2020

Yan Dora Zhang, Amber N. Hurson, … Montserrat Garcia-Closas

References

Manolio, T.A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).
Article CAS Google Scholar
Hirschhorn, J.N. Genomewide association studies–illuminating biologic pathways. N. Engl. J. Med. 360, 1699–1701 (2009).
Article CAS Google Scholar
Goldstein, D.B. Common genetic variation and human traits. N. Engl. J. Med. 360, 1696–1698 (2009).
Article CAS Google Scholar
Kraft, P. et al. Beyond odds ratios–communicating disease risk based on genetic profiles. Nat. Rev. Genet. 10, 264–269 (2009).
Article CAS Google Scholar
Pharoah, P.D. et al. Polygenic susceptibility to breast cancer and implications for prevention. Nat. Genet. 31, 33–36 (2002).
Article CAS Google Scholar
Gail, M.H. Value of adding single-nucleotide polymorphism genotypes to a breast cancer risk model. J. Natl. Cancer Inst. 101, 959–963 (2009).
Article CAS Google Scholar
Gail, M.H. Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk. J. Natl. Cancer Inst. 100, 1037–1041 (2008).
Article CAS Google Scholar
Xu, J. et al. Estimation of absolute risk for prostate cancer using genetic markers and family history. Prostate 69, 1565–1572 (2009).
Article CAS Google Scholar
Meigs, J.B. et al. Genotype score in addition to common risk factors for prediction of type 2 diabetes. N. Engl. J. Med. 359, 2208–2219 (2008).
Article CAS Google Scholar
Wacholder, S. et al. Performance of common genetic variants in breast-cancer risk models. N. Engl. J. Med. 362, 986–993 (2010).
Article CAS Google Scholar
Kraft, P. & Hunter, D.J. Genetic risk prediction–are we there yet? N. Engl. J. Med. 360, 1701–1703 (2009).
Article CAS Google Scholar
Visscher, P.M. Sizing up human height variation. Nat. Genet. 40, 489–490 (2008).
Article CAS Google Scholar
Gudbjartsson, D.F. et al. Many sequence variants affecting diversity of adult human height. Nat. Genet. 40, 609–615 (2008).
Article CAS Google Scholar
Lettre, G. et al. Identification of ten loci associated with height highlights new biological pathways in human growth. Nat. Genet. 40, 584–591 (2008).
Article CAS Google Scholar
Weedon, M.N. et al. Genome-wide association analysis identifies 20 loci that influence adult height. Nat. Genet. 40, 575–583 (2008).
Article CAS Google Scholar
Weedon, M.N. & Frayling, T.M. Reaching new heights: insights into the genetics of human stature. Trends Genet. 24, 595–603 (2008).
Article CAS Google Scholar
Barrett, J.C. et al. Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease. Nat. Genet. 40, 955–962 (2008).
Article CAS Google Scholar
Lichtenstein, P. et al. Environmental and heritable factors in the causation of cancer–analyses of cohorts of twins from Sweden, Denmark, and Finland. N. Engl. J. Med. 343, 78–85 (2000).
Article CAS Google Scholar
Easton, D.F. et al. Genome-wide association study identifies novel breast cancer susceptibility loci. Nature 447, 1087–1093 (2007).
Article CAS Google Scholar
Eeles, R.A. et al. Multiple newly identified loci associated with prostate cancer susceptibility. Nat. Genet. 40, 316–321 (2008).
Article CAS Google Scholar
Houlston, R.S. et al. Meta-analysis of genome-wide association data identifies four new susceptibility loci for colorectal cancer. Nat. Genet. 40, 1426–1435 (2008).
Article CAS Google Scholar
Thomas, G. et al. A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1). Nat. Genet. 41, 579–584 (2009).
Article CAS Google Scholar
Thomas, G. et al. Multiple loci identified in a genome-wide association study of prostate cancer. Nat. Genet. 40, 310–315 (2008).
Article CAS Google Scholar
Eeles, R.A. et al. Identification of seven new prostate cancer susceptibility loci through a genome-wide association study. Nat. Genet. 41, 1116–1121 (2009).
Article CAS Google Scholar
Orr, H.A. The population genetics of adaptation: The distribution of factors fixed during adaptive evolution. Evolution 52, 935–949 (1998).
Article Google Scholar
Eberle, M.A. et al. Power to detect risk alleles using genome-wide tag SNP panels. PLoS Genet. 3, 1827–1837 (2007).
Article CAS Google Scholar
Schork, N.J. Power calculations for genetic association studies using estimated probability distributions. Am. J. Hum. Genet. 70, 1480–1489 (2002).
Article CAS Google Scholar
Ambrosius, W.T., Lange, E.M. & Langefeld, C.D. Power for genetic association studies with random allele frequencies and genotype distributions. Am. J. Hum. Genet. 74, 683–693 (2004).
Article CAS Google Scholar
Spencer, C.C., Su, Z., Donnelly, P. & Marchini, J. Designing genome-wide association studies: sample size, power, imputation, and the choice of genotyping chip. PLoS Genet. 5, e1000477 (2009).
Article Google Scholar
Dickson, S.P., Wang, K., Krantz, I., Hakonarson, H. & Goldstein, D.B. Rare variants create synthetic genome-wide associations. PLoS Biol. 8, e1000294 (2010).
Article Google Scholar
Yu, K. et al. Flexible design for following up positive findings. Am. J. Hum. Genet. 81, 540–551 (2007).
Article CAS Google Scholar
Ghosh, A., Zou, F. & Wright, F.A. Estimating odds ratios in genome scans: an approximate conditional likelihood approach. Am. J. Hum. Genet. 82, 1064–1074 (2008).
Article CAS Google Scholar
Li, B. & Leal, S.M. Discovery of rare variants via sequencing: implications for the design of complex trait association studies. PLoS Genet. 5, e1000481 (2009).
Article Google Scholar
Li, B. & Leal, S.M. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am. J. Hum. Genet. 83, 311–321 (2008).
Article CAS Google Scholar
Zhong, H. & Prentice, R.L. Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies. Biostatistics 9, 621–634 (2008).
Article Google Scholar
Zhong, H. & Prentice, R.L. Correcting “winner's curse” in odds ratios from genomewide association findings for major complex human diseases. Genet. Epidemiol. 34, 78–91 (2009).
Google Scholar

Download references

Acknowledgements

This work was supported by the intramural program of the National Cancer Institute, US National Institutes of Health. The research of N.C. and J.-H.P. was also partially funded by the Gene-Environment Initiative of the National Institutes of Health.

Author information

Authors and Affiliations

Division of Cancer Epidemiology and Genetics, US Department of Health and Human Services, National Cancer Institute, National Institutes of Health, Rockville, Maryland, USA
Ju-Hyun Park, Sholom Wacholder, Mitchell H Gail, Stephen J Chanock & Nilanjan Chatterjee
Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
Ulrike Peters
US Department of Health and Human Services, Core Genotyping Facility, National Cancer Institute, National Institutes of Health, Gaithersburg, Maryland, USA
Kevin B Jacobs & Stephen J Chanock

Authors

Ju-Hyun Park
View author publications
You can also search for this author in PubMed Google Scholar
Sholom Wacholder
View author publications
You can also search for this author in PubMed Google Scholar
Mitchell H Gail
View author publications
You can also search for this author in PubMed Google Scholar
Ulrike Peters
View author publications
You can also search for this author in PubMed Google Scholar
Kevin B Jacobs
View author publications
You can also search for this author in PubMed Google Scholar
Stephen J Chanock
View author publications
You can also search for this author in PubMed Google Scholar
Nilanjan Chatterjee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.-H.P. and N.C. developed the statistical methods and designed the analyses. J.-H.P. implemented the methods and carried out all analyses. N.C. and S.J.C. drafted the manuscript. S.W., M.H.G., K.B.J. and U.P. made important suggestions for presentation and interpretation of the results. All the authors participated in critically reviewing the paper and approved the final version of the manuscript.

Corresponding author

Correspondence to Nilanjan Chatterjee.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Tables 1–7 and Supplementary Note. (PDF 544 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Park, JH., Wacholder, S., Gail, M. et al. Estimation of effect size distribution from genome-wide association studies and implications for future discoveries. Nat Genet 42, 570–575 (2010). https://doi.org/10.1038/ng.610

Download citation

Received: 03 February 2010
Accepted: 26 May 2010
Published: 20 June 2010
Issue Date: July 2010
DOI: https://doi.org/10.1038/ng.610

This article is cited by

The APOE locus is linked to decline in general cognitive function: 20-years follow-up in the Doetinchem Cohort Study
- M. Liset Rietman
- N. Charlotte Onland-Moret
- W. M. Monique Verschuren
Translational Psychiatry (2022)
Different responses to risperidone treatment in Schizophrenia: a multicenter genome-wide association and whole exome sequencing joint study
- Mingzhe Zhao
- Jingsong Ma
- Shengying Qin
Translational Psychiatry (2022)
Reconstructing SNP allele and genotype frequencies from GWAS summary statistics
- Zhiyu Yang
- Peristera Paschou
- Petros Drineas
Scientific Reports (2022)
A genome-wide association analysis for body weight at 35 days measured on 137,343 broiler chickens
- Christos Dadousis
- Adriana Somavilla
- John M. Hickey
Genetics Selection Evolution (2021)
Competition for priority harms the reliability of science, but reforms can help
- Leonid Tiokhin
- Minhua Yan
- Thomas J. H. Morgan
Nature Human Behaviour (2021)