Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Analysis
  • Published:

Including known covariates can reduce power to detect genetic effects in case-control studies

Abstract

Genome-wide association studies (GWAS) search for associations between genetic variants and disease status, typically via logistic regression. Often there are covariates, such as sex or well-established major genetic factors, that are known to affect disease susceptibility and are independent of tested genotypes at the population level. We show theoretically and with data from recent GWAS on multiple sclerosis, psoriasis and ankylosing spondylitis that inclusion of known covariates can substantially reduce power for the identification of associated variants when the disease prevalence is lower than a few percent. Whether the inclusion of such covariates reduces or increases power to detect genetic effects depends on various factors, including the prevalence of the disease studied. When the disease is common (prevalence of >20%), the inclusion of covariates typically increases power, whereas, for rarer diseases, it can often decrease power to detect new genetic associations.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Purchase on Springer Link

Instant access to full article PDF

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Results with and without the covariate.

Similar content being viewed by others

References

  1. Spencer, C.C., Su, Z., Donnelly, P. & Marchini, J. Designing genome-wide association studies: sample size, power, imputation, and the choice of genotyping chip. PLoS Genet. 5, e1000477 (2009).

    Article  Google Scholar 

  2. Robinson, L.D. & Jewell, N.P. Some surprising results about covariate adjustment in logistic-regression models. Int. Stat. Rev. 59, 227–240 (1991).

    Article  Google Scholar 

  3. Prentice, R.L. & Pyke, R. Logistic disease incidence models and case-control studies. Biometrika 66, 403–411 (1979).

    Article  Google Scholar 

  4. Neuhaus, J.M. & Jewell, N.P. A geometric approach to assess bias due to omitted covariates in generalized linear-models. Biometrika 80, 807–815 (1993).

    Article  Google Scholar 

  5. Stringer, S., Wray, N.R., Kahn, R.S. & Derks, E.M. Underestimated effect sizes in GWAS: fundamental limitations of single SNP analysis for dichotomous phenotypes. PLoS ONE 6, e27964 (2011).

    Article  CAS  Google Scholar 

  6. Neuhaus, J.M. Estimation efficiency with omitted covariates in generalized linear models. J. Am. Stat. Assoc. 93, 1124–1129 (1998).

    Google Scholar 

  7. Xing, G. & Xing, C. Adjusting for covariates in logistic regression models. Genet. Epidemiol. 34, 769–771 (2010).

    Article  Google Scholar 

  8. Lin, D.Y. & Zeng, D. On the relative efficiency of using summary statistics versus individual-level data in meta-analysis. Biometrika 97, 321–332 (2010).

    Article  CAS  Google Scholar 

  9. International Multiple Sclerosis Genetics Consortium & Wellcome Trust Case Control Consortium 2. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature 476, 214–219 (2011).

  10. Genetic Analysis of Psoriasis Consortium & Wellcome Trust Case Control Consortium 2. A genome-wide association study identifies new psoriasis susceptibility loci and an interaction between HLA-C and ERAP1. Nat. Genet. 42, 985–990 (2010).

  11. Australo-Anglo-American Spondyloarthritis Consortium & Wellcome Trust Case Control Consortium 2. Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat. Genet. 43, 761–767 (2011).

  12. Chasman, D.I. et al. Genome-wide association study reveals three susceptibility loci for common migraine in the general population. Nat. Genet. 43, 695–698 (2011).

    Article  CAS  Google Scholar 

  13. Lee, L.F. Specification error in multinomial logit-models—analysis of the omitted variable bias. J. Econom. 20, 197–209 (1982).

    Article  Google Scholar 

  14. Vukcevic, D., Hechter, E., Spencer, C. & Donnelly, P. Disease model distortion in association studies. Genet. Epidemiol. 35, 278–290 (2011).

    PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank G. Nicholson for helpful comments. This work was funded by the Wellcome Trust, as part of the Wellcome Trust Case Control Consortium 2 project (085475/B/08/Z and 085475/Z/08/Z) and through the Wellcome Trust core grant for the Wellcome Trust Centre for Human Genetics (090532/Z/09/Z). P.D. was supported in part by a Wolfson Royal Society Merit Award and a Wellcome Trust Senior Investigator Award (095552/Z/11/Z). C.C.A.S. was supported in part by a Wellcome Trust Career Development Fellowship (097364/Z/11/Z).

Author information

Authors and Affiliations

Authors

Contributions

M.P., P.D. and C.C.A.S. jointly designed the study and wrote the paper. M.P. derived the mathematical results and carried out the example analyses.

Corresponding authors

Correspondence to Matti Pirinen or Chris C A Spencer.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–7 and Supplementary Note (PDF 259 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pirinen, M., Donnelly, P. & Spencer, C. Including known covariates can reduce power to detect genetic effects in case-control studies. Nat Genet 44, 848–851 (2012). https://doi.org/10.1038/ng.2346

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/ng.2346

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing