Genome-wide efficient mixed-model analysis for association studies

Zhou, Xiang; Stephens, Matthew

doi:10.1038/ng.2310

Technical Report
Published: 17 June 2012

Genome-wide efficient mixed-model analysis for association studies

Xiang Zhou¹ &
Matthew Stephens^1,2

Nature Genetics volume 44, pages 821–824 (2012)Cite this article

26k Accesses
1649 Citations
29 Altmetric
Metrics details

Subjects

Abstract

Linear mixed models have attracted considerable attention recently as a powerful and effective tool for accounting for population stratification and relatedness in genetic association tests. However, existing methods for exact computation of standard test statistics are computationally impractical for even moderate-sized genome-wide association studies. To address this issue, several approximate methods have been proposed. Here, we present an efficient exact method, which we refer to as genome-wide efficient mixed-model association (GEMMA), that makes approximations unnecessary in many contexts. This method is approximately n times faster than the widely used exact method known as efficient mixed-model association (EMMA), where n is the sample size, making exact genome-wide association analysis computationally practical for large numbers of individuals.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Comparison of GEMMA with EMMA, EMMAX and GRAMMAR on HMDP HDL-C data and WTCCC Crohn's disease data.**

Genome-wide association studies

Article 26 August 2021

Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries

Article Open access 30 April 2024

Genome-wide analysis in over 1 million individuals of European ancestry yields improved polygenic risk scores for blood pressure traits

Article Open access 30 April 2024

References

Kang, H.M. et al. Variance component model to account for sample structure in genome-wide association studies. Nat. Genet. 42, 348–354 (2010).
Article CAS Google Scholar
Kang, H.M., Ye, C. & Eskin, E. Accurate discovery of expression quantitative trait loci under confounding from spurious and genuine regulatory hotspots. Genetics 180, 1909–1925 (2008).
Article CAS Google Scholar
Kang, H.M. et al. Efficient control of population structure in model organism association mapping. Genetics 178, 1709–1723 (2008).
Article Google Scholar
Listgarten, J., Kadie, C., Schadt, E.E. & Heckerman, D. Correction for hidden confounders in the genetic analysis of gene expression. Proc. Natl. Acad. Sci. USA 107, 16465–16470 (2010).
Article CAS Google Scholar
Price, A.L., Zaitlen, N.A., Reich, D. & Patterson, N. New approaches to population stratification in genome-wide association studies. Nat. Rev. Genet. 11, 459–463 (2010).
Article CAS Google Scholar
Yu, J. et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38, 203–208 (2006).
Article CAS Google Scholar
Zhang, Z. et al. Mixed linear model approach adapted for genome-wide association studies. Nat. Genet. 42, 355–360 (2010).
Article CAS Google Scholar
Lippert, C. et al. FaST linear mixed models for genome-wide association studies. Nat. Methods 8, 833–835 (2011).
Article CAS Google Scholar
Aulchenko, Y.S., Ripke, S., Isaacs, A. & van Duijn, C.M. GenABEL: an R library for genome-wide association analysis. Bioinformatics 23, 1294–1296 (2007).
Article CAS Google Scholar
Aulchenko, Y.S., de Koning, D.J. & Haley, C. Genomewide rapid association using mixed model and regression: a fast and simple method for genomewide pedigree-based quantitative trait loci association analysis. Genetics 177, 577–585 (2007).
Article CAS Google Scholar
Abney, M., Ober, C. & McPeek, M.S. Quantitative-trait homozygosity and association mapping and empirical genomewide significance in large, complex pedigrees: fasting serum-insulin level in the Hutterites. Am. J. Hum. Genet. 70, 920–934 (2002).
Article CAS Google Scholar
Guan, Y. & Stephens, M. Practical issues in imputation-based association mapping. PLoS Genet. 4, e1000279 (2008).
Article Google Scholar
Howie, B.N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009).
Article Google Scholar
Knuth, D.E. Big Omicron and big Omega and big Theta. ACM SIGACT News. 8, 18–24 (1976).
Article Google Scholar
Bennett, B.J. et al. A high-resolution association mapping panel for the dissection of complex traits in mice. Genome Res. 20, 281–290 (2010).
Article CAS Google Scholar
The Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678 (2007).
Lee, S.H., van der Werf, J.H., Hayes, B.J., Goddard, M.E. & Visscher, P.M. Predicting unobserved phenotypes for complex traits from whole-genome SNP data. PLoS Genet. 4, e1000231 (2008).
Article Google Scholar
Meyer, K. Estimating variances and covariances for multivariate animal models by restricted maximum likelihood. Genet. Sel. Evol. 23, 67–83 (1991).
Article Google Scholar
Searle, S.R., Casella, G. & McCulloch, C.E. Variance Components. (Wiley, New York, 2006).
Henderson, C.R. Applications of Linear Models in Animal Breeding (University of Guelph, Guelph, Canada, 1984).

Download references

Acknowledgements

This research is supported in part by grants from the US National Institutes of Health (NIH) (HL092206 to Y. Gilad and HG02585 to M.S.). We thank A.J. Lusis for making the mouse genotype and phenotype data available. This study also makes use of data generated by the WTCCC¹⁵. A full list of the investigators who contributed to the generation of the data is available from the WTCCC website. Funding for the WTCCC project was provided by the Wellcome Trust (award 085475).

Author information

Authors and Affiliations

Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
Xiang Zhou & Matthew Stephens
Department of Statistics, University of Chicago, Chicago, Illinois, USA
Matthew Stephens

Authors

Xiang Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Stephens
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.Z. and M.S. designed the study, developed methods and wrote the manuscript. X.Z. implemented software and analyzed data.

Corresponding authors

Correspondence to Xiang Zhou or Matthew Stephens.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1 and 2, Supplementary Table 1 and Supplementary Note (PDF 365 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhou, X., Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet 44, 821–824 (2012). https://doi.org/10.1038/ng.2310

Download citation

Received: 11 July 2011
Accepted: 04 May 2012
Published: 17 June 2012
Issue Date: July 2012
DOI: https://doi.org/10.1038/ng.2310

This article is cited by

Genetic dissection of resistance to gray leaf spot by genome-wide association study in a multi-parent maize population
- Can Hu
- Tianhui Kuang
- Xingming Fan
BMC Plant Biology (2024)
Human genetic associations of the airway microbiome in chronic obstructive pulmonary disease
- Jingyuan Gao
- Yuqiong Yang
- Zhang Wang
Respiratory Research (2024)
Genomic evidence for human-mediated introgressive hybridization and selection in the developed breed
- Heng Du
- Zhen Liu
- Jian-Feng Liu
BMC Genomics (2024)
A cautionary tale of low-pass sequencing and imputation with respect to haplotype accuracy
- David Wragg
- Wengang Zhang
- Dylan N. Clements
Genetics Selection Evolution (2024)
Large-scale gene expression alterations introduced by structural variation drive morphotype diversification in Brassica oleracea
- Xing Li
- Yong Wang
- Feng Cheng
Nature Genetics (2024)