The contribution of genetic variants to disease depends on the ruler

Witte, John S.; Visscher, Peter M.; Wray, Naomi R.

doi:10.1038/nrg3786

Analysis
Published: 16 September 2014

The contribution of genetic variants to disease depends on the ruler

John S. Witte^1,2,3,
Peter M. Visscher^4,5 &
Naomi R. Wray⁴

Nature Reviews Genetics volume 15, pages 765–776 (2014)Cite this article

11k Accesses
107 Citations
65 Altmetric
Metrics details

Subjects

Key Points

Although the historically different fields of quantitative genetics and epidemiology are converging to answer fundamental questions about genetic variation in risk underlying human diseases, the plethora of measures to quantify the contribution of variants to disease risk have differing terminology and assumptions, which obfuscate their use and interpretation.
In this Analysis, we consider and contrast the most commonly used measures that assess disease risk contributed to the population by individual variants — the heritability of disease liability explained, approximate heritability explained, the sibling recurrence risk explained, the proportion of genetic variance explained on a logarthimic relative risk scale, the area under the receiver–operating curve (AUC) and the population attributable fraction (PAF) — and give numerical examples in breast cancer, Crohn's disease, rheumatoid arthritis and schizophrenia.
We discuss the properties of these measures, show how they are connected to each other, consider the situations for which they are best suited and provide an online tool for their calculation.
The most appropriate measure to use depends on the importance given to the frequency of a risk variant relative to its effect size on disease and on the baseline to which importance is expressed. These factors should be explicitly considered when assessing the contribution of genetic variants to disease.
We recommend investigators to focus primarily on the heritability of liability or genetic variance on the logarthimic relative risk scale explained, as they give estimates that are less sensitive to rare high-risk variants than the other measures considered here. Moreover, we caution against using the PAF for genetic risk variants because it has various undesirable properties.
The concept of individual loci providing an explanation for disease is less straightforward than it may seem at first sight, and we recommend investigators to undertake sensitivity analyses that explore how measures of the contribution of genetic variants to risk vary across a range of underlying assumptions.

Abstract

Our understanding of the genetic basis of disease has evolved from descriptions of overall heritability or familiality to the identification of large numbers of risk loci. One can quantify the impact of such loci on disease using a plethora of measures, which can guide future research decisions. However, different measures can attribute varying degrees of importance to a variant. In this Analysis, we consider and contrast the most commonly used measures — specifically, the heritability of disease liability, approximate heritability, sibling recurrence risk, overall genetic variance using a logarithmic relative risk scale, the area under the receiver–operating curve for risk prediction and the population attributable fraction — and give guidelines for their use that should be explicitly considered when assessing the contribution of genetic variants to disease.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Different measures of genetic effects on disease.**

**Figure 2: Empirical evaluation of measures of genetic effects.**

**Figure 3: Application of measures to four diseases.**

**Figure 4: Aspects of disease heritability: known, hiding and missing.**

Refining the impact of genetic evidence on clinical success

Article Open access 17 April 2024

Genome-wide association studies

Article 26 August 2021

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

References

Visscher, P. M., Brown, M. A., McCarthy, M. I. & Yang, J. Five years of GWAS discovery. Am. J. Hum. Genet. 90, 7–24 (2012).
Article CAS PubMed PubMed Central Google Scholar
Witte, J. S. Genome-wide association studies and beyond. Annu. Rev. Publ. Health 31, 9–20 (2010).
Article Google Scholar
Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).
Article CAS PubMed PubMed Central Google Scholar
Wray, N. R., Yang, J., Goddard, M. E. & Visscher, P. M. The genetic interpretation of area under the ROC curve in genomic profiling. PLoS Genet. 6, e1000864 (2010).
Article PubMed PubMed Central Google Scholar
Yang, J. et al. Common SNPs explain a large proportion of the heritability for human height. Nature Genet. 42, 565–569 (2010).
Article CAS PubMed Google Scholar
Cole, P. & MacMahon, B. Attributable risk percent in case–control studies. Br. J. Prev. Soc. Med. 25, 242–244 (1971).
CAS PubMed PubMed Central Google Scholar
Lee, S. H. et al. Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs. Nature Genet. 44, 247–250 (2012).
Article CAS PubMed Google Scholar
Barrett, J. C. et al. Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease. Nature Genet. 40, 955–962 (2008).
Article CAS PubMed Google Scholar
Wang, K. et al. Interpretation of association signals and identification of causal variants from genome-wide association studies. Am. J. Hum. Genet. 86, 730–742 (2010).
Article CAS PubMed PubMed Central Google Scholar
Dempster, E. R. & Lerner, I. M. Heritability of threshold characters. Genetics 35, 212–236 (1950). This study explores the relationship between heritability on disease and liability scales.
CAS PubMed PubMed Central Google Scholar
Slatkin, M. Exchangeable models of complex inherited diseases. Genetics 179, 2253–2261 (2008).
Article PubMed PubMed Central Google Scholar
Falconer, D. The inheritance of liability to certain diseases, estimates from the incidence among relatives. Ann. Hum. Genet. 29, 51–76 (1965). This paper presents a formal derivation of the relationship between disease risk in relatives and heritability, and also provides a thoughtful exploration of scenarios and caveats.
Article Google Scholar
Falconer, D. & Mackay, T. F. Introduction to Quantitative Genetics, (Pearson Education, 1996).
Google Scholar
Risch, N. J. Searching for genetic determinants in the new millennium. Nature 405, 847–856 (2000). This paper describes variance explained by a single locus on the disease and liability scale.
Article CAS PubMed Google Scholar
Purcell, S. M. et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460, 748–752 (2009).
Article CAS PubMed Google Scholar
Stahl, E. A. et al. Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis. Nature Genet. 44, 483–489 (2012).
Article CAS PubMed Google Scholar
Pharoah, P. D. et al. Polygenic susceptibility to breast cancer and implications for prevention. Nature Genet. 31, 33–36 (2002). This is a clear presentation of the logRR model.
Article CAS PubMed Google Scholar
Wray, N. R. & Goddard, M. E. Multi-locus models of genetic risk of disease. Genome Med. 2, 10 (2010).
Article PubMed PubMed Central Google Scholar
Pharoah, P. D., Day, N. E., Duffy, S., Easton, D. F. & Ponder, B. A. Family history and the risk of breast cancer: a systematic review and meta-analysis. Int. J. Cancer 71, 800–809 (1997).
Article CAS PubMed Google Scholar
James, J. W. Frequency in relatives for an all-or-none trait. Ann. Hum. Genet. 35, 47–49 (1971).
Article CAS PubMed Google Scholar
Pharoah, P. D., Antoniou, A. C., Easton, D. F. & Ponder, B. A. Polygenes, risk prediction, and targeted prevention of breast cancer. N. Engl. J. Med. 358, 2796–2803 (2008).
Article CAS PubMed Google Scholar
Park, J. H. et al. Estimation of effect size distribution from genome-wide association studies and implications for future discoveries. Nature Genet. 42, 570–575 (2010).
Article CAS PubMed Google Scholar
Jostins, L. et al. Host–microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chen, G.-B. et al. Estimation and partitioning of (co)heritability of inflammatory bowel disease from GWAS and immunochip data. Hum. Mol. Genet. 23, 4710–4720 (2014).
Article CAS PubMed PubMed Central Google Scholar
Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).
Article CAS PubMed Google Scholar
Ripke, S. et al. Genome-wide association analysis identifies 13 new risk loci for schizophrenia. Nature Genet. 45, 1150–1159 (2013).
Article CAS PubMed Google Scholar
Kirov, G. et al. Neurexin 1 (NRXN1) deletions in schizophrenia. Schizophr Bull. 35, 851–854 (2009).
Article PubMed PubMed Central Google Scholar
Kirov, G. et al. Support for the involvement of large copy number variants in the pathogenesis of schizophrenia. Hum. Mol. Genet. 18, 1497–1503 (2009).
Article CAS PubMed PubMed Central Google Scholar
International Schizophrenia Consortium. Rare chromosomal deletions and duplications increase risk of schizophrenia. Nature 455, 237–241 (2008).
Stefansson, H. et al. Large recurrent microdeletions associated with schizophrenia. Nature 455, 232–236 (2008).
Article CAS PubMed PubMed Central Google Scholar
Sullivan, P. F., Kendler, K. S. & Neale, M. C. Schizophrenia as a complex trait — evidence from a meta-analysis of twin studies. Arch. Gen. Psychiatry 60, 1187–1192 (2003).
Article PubMed Google Scholar
Rockhill, B., Weinberg, C. R. & Newman, B. Population attributable fraction estimation for established breast cancer risk factors: considering the issues of high prevalence and unmodifiability. Am. J. Epidemiol. 147, 826–833 (1998). This study considers the limitations of the PAF.
Article CAS PubMed Google Scholar
Saha, S., Chant, D., Welham, J. & McGrath, J. A systematic review of the prevalence of schizophrenia. PLoS Med. 2, e141 (2005).
Article PubMed PubMed Central Google Scholar
Alonso, A., Logroscino, G., Jick, S. S. & Hernan, M. A. Incidence and lifetime risk of motor neuron disease in the United Kingdom: a population-based study. Eur. J. Neurol. 16, 745–751 (2009).
Article CAS PubMed PubMed Central Google Scholar
Wray, N. R. et al. Polygenic methods and their application to psychiatric traits. J. Child Psychol. Psychiatry http://dx.doi.org/10.1111/jcpp.12295 (2014).
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gail, M. H. & Pfeiffer, R. M. On criteria for evaluating models of absolute risk. Biostatistics 6, 227–239 (2005).
Article PubMed Google Scholar
Tenesa, A. & Haley, C. S. The heritability of human disease: estimation, uses and abuses. Nature Rev. Genet. 14, 139–149 (2013).
Article CAS PubMed Google Scholar
So, H. C., Gui, A. H., Cherny, S. S. & Sham, P. C. Evaluating the heritability explained by known susceptibility variants: a survey of ten complex diseases. Genet. Epidemiol. 35, 310–317 (2011).
Article PubMed Google Scholar
So, H. C., Li, M. & Sham, P. C. Uncovering the total heritability explained by all true susceptibility variants in a genome-wide association study. Genet. Epidemiol. 35, 447–456 (2011).
Article PubMed Google Scholar
So, H. C., Kwan, J. S., Cherny, S. S. & Sham, P. C. Risk prediction of complex diseases from family history and known susceptibility loci, with applications for cancer screening. Am. J. Hum. Genet. 88, 548–565 (2011). This study uses variance explained by loci and considers complications of age-related risk.
Article CAS PubMed PubMed Central Google Scholar
Do, C. B., Hinds, D. A., Francke, U. & Eriksson, N. Comparison of family history and SNPs for predicting risk of complex disease. PLoS Genet. 8, e1002973 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zaitlen, N. et al. Informed conditioning on clinical covariates increases power in case–control association studies. PLoS Genet. 8, e1003032 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors thank C. Nolan and B. Beyamin for developing the companion website, M. Robinson for help with the figure in Box 1, T. Hoffmann for help in plotting Figure 3, and J. Liu for linkage disequilibrium filtering of the breast cancer SNPs. This work is supported by the US National Institutes of Health grants R01 CA088164, U01 CA127298, U01 GM061390 and P30 CA82103, and by the Australian National Health and Medical Research Council grants 613602, 613601, 1011506, 1050218 and 1048853.

Author information

Authors and Affiliations

Department of Epidemiology and Biostatistics, and Department of Urology, University of California, San Francisco
John S. Witte
Institute for Human Genetics, University of California, San Francisco
John S. Witte
Helen Diller Comprehensive Cancer Center, University of California, San Francisco, 1450 3rd Street, San Francisco, 94158, California, USA
John S. Witte
Queensland Brain Institute, The University of Queensland, Building 79, Research Road, Brisbane, 4072, Queensland, Australia
Peter M. Visscher & Naomi R. Wray
The University of Queensland Diamantina Institute, The University of Queensland, 37 Kent Street, Brisbane, 4102, Queensland, Australia
Peter M. Visscher

Authors

John S. Witte
View author publications
You can also search for this author in PubMed Google Scholar
Peter M. Visscher
View author publications
You can also search for this author in PubMed Google Scholar
Naomi R. Wray
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to John S. Witte or Naomi R. Wray.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Glossary

Mendelian loci: Genetic loci that have alleles with discrete effects on the phenotype and that follow Mendel's laws of segregation and independent assortment.
Heritability: The proportion of phenotypic variation in a population that is attributable to genetic variation among individuals.
Disease liability: An underlying or latent continuous variable such that those with a liability above a threshold are considered diseased. The quantitative trait of liability reflects both genetic and environmental factors.
Sibling recurrence risk: The ratio of the probability that a sibling of an individual affected by a disease will also be affected compared to the risk of disease in the general population.
Genetic variance: The variance of trait values that can be ascribed to genetic differences among individuals. The total genetic variance of a trait can be dissected into additive, dominance and other components.
Area under the receiver–operating curve: (AUC). The receiver–operating curve for a predictor (for example, a genetic test) plots the proportion of cases correctly identified by the test against the proportion of controls that are incorrectly classified as cases. The AUC indicates the probability that a factor (for example, a genetic risk score) will predict a higher risk of disease in a randomly selected case than in a control.
Population attributable fraction: (PAF; also known as population attributable risk). For a given disease, risk factor and population, the fraction by which the incidence rate of the disease in the population would be reduced if the risk factor was eliminated.
Overall disease risk: The lifetime probability that an individual will be affected by a disease.
Genetic architectures: The number of risk alleles underlying disease, their allele frequency spectrum, effect sizes and mode of interaction.
Linkage disequilibrium: A measure of whether alleles at two loci coexist in a population in a nonrandom manner. Alleles that are in linkage disequilibrium are found together on the same haplotype more often than expected by chance.
Genomic profile risk: A predicted measure of genetic risk for individuals constructed from a set of loci, the risk alleles and corresponding effect sizes of which have been estimated in an independent sample.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Witte, J., Visscher, P. & Wray, N. The contribution of genetic variants to disease depends on the ruler. Nat Rev Genet 15, 765–776 (2014). https://doi.org/10.1038/nrg3786

Download citation

Published: 16 September 2014
Issue Date: November 2014
DOI: https://doi.org/10.1038/nrg3786

This article is cited by

An overview of DNA methylation-derived trait score methods and applications
- Marta F. Nabais
- Danni A. Gadd
- Naomi R. Wray
Genome Biology (2023)
A genome-wide association study identifies distinct variants associated with pulmonary function among European and African ancestries from the UK Biobank
- Musalula Sinkala
- Samar S. M. Elsheikh
- Nicola J. Mulder
Communications Biology (2023)
Development and validation of asthma risk prediction models using co-expression gene modules and machine learning methods
- Eskezeia Y. Dessie
- Yadu Gautam
- Tesfaye B. Mersha
Scientific Reports (2023)
Rank concordance of polygenic indices
- Dilnoza Muslimova
- Rita Dias Pereira
- S. Fleur W. Meddens
Nature Human Behaviour (2023)
Characteristics and long-term mortality of patients with non-MAFLD hepatic steatosis
- Hong Fan
- Zhenqiu Liu
- Tiejun Zhang
Hepatology International (2023)

Subjects

Key Points

Abstract

Access options

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Related links

Further information

PowerPoint slides

Supplementary information

Glossary

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links