Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Letter
  • Published:

A new statistic and its power to infer membership in a genome-wide association study using genotype frequencies

Abstract

Aggregate results from genome-wide association studies (GWAS)1,2,3, such as genotype frequencies for cases and controls, were until recently often made available on public websites4,5 because they were thought to disclose negligible information concerning an individual's participation in a study. Homer et al.6 recently suggested that a method for forensic detection of an individual's contribution to an admixed DNA sample could be applied to aggregate GWAS data. Using a likelihood-based statistical framework, we developed an improved statistic that uses genotype frequencies and individual genotypes to infer whether a specific individual or any close relatives participated in the GWAS and, if so, what the participant's phenotype status is. Our statistic compares the logarithm of genotype frequencies, in contrast to that of Homer et al.6, which is based on differences in either SNP probe intensity or allele frequencies. We derive the theoretical power of our test statistics and explore the empirical performance in scenarios with varying numbers of randomly chosen or top-associated SNPs.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Histogram of Tgeno for a GWAS with 1,000 cases and controls.
Figure 2: Histograms of calibrated Tgeno and Homer's Tallele with 1,000 cases and controls and varying numbers of SNPs.
Figure 3: Sensitivity and specificity of Tgeno applied to GWAS data.

Similar content being viewed by others

References

  1. Manolio, T.A., Brooks, L.D. & Collins, F.S.A. HapMap harvest of insights into the genetics of common disease. J. Clin. Invest. 118, 1590–1605 (2008).

    Article  CAS  Google Scholar 

  2. McCarthy, M.I. et al. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat. Rev. Genet. 9, 356–369 (2008).

    Article  CAS  Google Scholar 

  3. Wang, W.Y., Barratt, B.J., Clayton, D.G. & Todd, J.A. Genome-wide association studies: theoretical and practical concerns. Nat. Rev. Genet. 6, 109–118 (2005).

    Article  CAS  Google Scholar 

  4. Mailman, M.D. et al. The NCBI dbGaP database of genotypes and phenotypes. Nat. Genet. 39, 1181–1186 (2007).

    Article  CAS  Google Scholar 

  5. Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678 (2007).

  6. Homer, N. et al. Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays. PLoS Genet. 4, e1000167 (2008).

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Contributions

K.B.J. and N.C. devised the statistical methods; K.B.J. implemented these methods in software and applied them to simulated and empirical data; K.B.J., M.Y. and S.J.C. drafted the article; S.W. and P.K. made important suggestions to the analytic plan and aided in the interpretation of the results; D.C. and J.P. aided in the study design and verification of methodology; D.J.H., T.A.M., M.T., R.N.H. and G.D.T. participated in revising the manuscript and made important intellectual contributions. All the authors and reviewed and approved the final manuscript.

Corresponding author

Correspondence to Kevin B Jacobs.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–3 (PDF 545 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jacobs, K., Yeager, M., Wacholder, S. et al. A new statistic and its power to infer membership in a genome-wide association study using genotype frequencies. Nat Genet 41, 1253–1257 (2009). https://doi.org/10.1038/ng.455

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/ng.455

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing