Abstract
We describe factored spectrally transformed linear mixed models (FaST-LMM), an algorithm for genome-wide association studies (GWAS) that scales linearly with cohort size in both run time and memory use. On Wellcome Trust data for 15,000 individuals, FaST-LMM ran an order of magnitude faster than current efficient algorithms. Our algorithm can analyze data for 120,000 individuals in just a few hours, whereas current algorithms fail on data for even 20,000 individuals (http://mscompbio.codeplex.com/).
This is a preview of subscription content, access via your institution
Relevant articles
Open Access articles citing this article.
-
Genetic-based dissection of resistance to bacterial leaf streak in rice by GWAS
BMC Plant Biology Open Access 18 August 2023
-
Single nucleotide polymorphisms associated with wine fermentation and adaptation to nitrogen limitation in wild and domesticated yeast strains
Biological Research Open Access 29 July 2023
-
Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs
Genetics Selection Evolution Open Access 26 July 2023
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout


References
Balding, D.J. Nat. Rev. Genet. 7, 781–791 (2006).
Yu, J. et al. Nat. Genet. 38, 203–208 (2006).
Kang, H.M. et al. Genetics 107, 1709–1723 (2008).
Zhang, Z. et al. Nat. Genet. 42, 355–360 (2010).
Kang, H.M. et al. Nat. Genet. 42, 348–354 (2010).
Zhao, K. et al. PLoS Genet. 3, e4 (2007).
Price, A.L., Zaitlen, N.A., Reich, D. & Patterson, N. Nat. Rev. Genet. 11, 459–463 (2010).
Henderson, C.R. Applications of Linear Models in Animal Breeding (University of Guelph, Guelph, Ontario, Canada, 1984).
Goddard, M.E., Wray, N., Verbyla, K. & Visscher, P.M. Stat. Sci. 24, 517–529 (2009).
Hayes, B.J., Visscher, P.M. & Goddard, M.E. Genet. Res. 91, 47–60 (2009).
Fisher, R. Trans. R. Soc. Edinb. 52, 399–433 (1918).
Yang, J. et al. Nat. Genet. 42, 565–569 (2010).
Welham, S. & Thompson, R. J. R. Stat. Soc. B 59, 701–714 (1997).
Demidenko, E. Mixed Models Theory and Applications (Wiley, Hoboken, New Jersey, USA, 2004).
Listgarten, J., Kadie, C., Schadt, E.E. & Heckerman, D. Proc. Natl. Acad. Sci. USA 107, 16465–16470 (2010).
Devlin, B. & Roeder, K. Biometrics 55, 997–1004 (1999).
Edenberg, H.J. et al. BMC Genet. 6 (suppl. 1), S2 (2005).
Wellcome Trust Case Control Consortium. Nature 447, 661–678 (2007).
Acknowledgements
We thank E. Renshaw for help with implementation of Brent's method and the χ2 distribution function, J. Carlson for help with tools used to manage the data and deploy runs on our computer cluster, and N. Pfeifer for an implementation of the ATT. A full list of the investigators who contributed to the generation of the Wellcome Trust Case-Control Consortium data we used in this study is available from http://www.wtccc.org.uk/. Funding for the project was provided by the Wellcome Trust (076113 and 085475). The GAW14 data were provided by the members of the Collaborative Study on the Genetics of Alcoholism (US National Institutes of Health grant U10 AA008401).
Author information
Authors and Affiliations
Contributions
C.L., J.L. and D.H. designed and performed research, contributed analytic tools, analyzed data and wrote the paper. Y.L. designed and performed research. C.M.K. and R.I.D contributed analytic tools.
Corresponding authors
Ethics declarations
Competing interests
C.L., J.L., C.M.K., R.I.D. and D.H. are employees of Microsoft. Y.L. was employed by Microsoft while performing this research.
Supplementary information
Supplementary Text and Figures
Supplementary Figure 1, Supplementary Notes 1–2 (PDF 394 kb)
Supplementary Software 1
FaST-LMM software and associated files. (ZIP 104264 kb)
Rights and permissions
About this article
Cite this article
Lippert, C., Listgarten, J., Liu, Y. et al. FaST linear mixed models for genome-wide association studies. Nat Methods 8, 833–835 (2011). https://doi.org/10.1038/nmeth.1681
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nmeth.1681
This article is cited by
-
Genetic-based dissection of resistance to bacterial leaf streak in rice by GWAS
BMC Plant Biology (2023)
-
Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs
Genetics Selection Evolution (2023)
-
Multi-omics analysis identifies drivers of protein phosphorylation
Genome Biology (2023)
-
Single nucleotide polymorphisms associated with wine fermentation and adaptation to nitrogen limitation in wild and domesticated yeast strains
Biological Research (2023)
-
E-GWAS: an ensemble-like GWAS strategy that provides effective control over false positive rates without decreasing true positives
Genetics Selection Evolution (2023)