Nature Genetics 37, 1217 - 1223 (2005)
Published online: 23 October 2005; | doi:10.1038/ng1669
Efficiency and power in genetic association studiesPaul I W de Bakker1, 2, 3, 4, 8, Roman Yelensky1, 2, 5, 8, Itsik Pe'er1, 4, Stacey B Gabriel4, Mark J Daly1, 4, 6
& David Altshuler1, 2, 3, 4, 6, 71
Center for Human Genetic Research, Massachusetts General Hospital, 185 Cambridge Street, CPZN-6818, Boston, Massachusetts 02114-2790, USA. 2
Department of Molecular Biology, Massachusetts General Hospital, 185 Cambridge Street, CPZN-6818, Boston, Massachusetts 02114-2790, USA. 3
Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA. 4
Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. 5
Harvard-MIT Division of Health Sciences and Technology, Cambridge, Massachusetts, USA. 6
Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA. 7
Diabetes Unit, Massachusetts General Hospital, 185 Cambridge Street, CPZN-6818, Boston, Massachusetts 02114-2790, USA. 8
These authors contributed equally to this work.
Correspondence should be addressed to Mark J Daly mjdaly@chgr.mgh.harvard.edu or David Altshuler altshuler@molbio.mgh.harvard.edu We investigated selection and analysis of tag SNPs for genome-wide association studies by specifically examining the relationship between investment in genotyping and statistical power. Do pairwise or multimarker methods maximize efficiency and power? To what extent is power compromised when tags are selected from an incomplete resource such as HapMap? We addressed these questions using genotype data from the HapMap ENCODE project, association studies simulated under a realistic disease model, and empirical correction for multiple hypothesis testing. We demonstrate a haplotype-based tagging method that uniformly outperforms single-marker tests and methods for prioritization that markedly increase tagging efficiency. Examining all observed haplotypes for association, rather than just those that are proxies for known SNPs, increases power to detect rare causal alleles, at the cost of reduced power to detect common causal alleles. Power is robust to the completeness of the reference panel from which tags are selected. These findings have implications for prioritizing tag SNPs and interpreting association studies.
MORE ARTICLES LIKE THIS These links to content published by NPG are automatically generated.
|