Population structure, differential bias and genomic control in a large-scale, case-control association study

Clayton, David G; Walker, Neil M; Smyth, Deborah J; Pask, Rebecca; Cooper, Jason D; Maier, Lisa M; Smink, Luc J; Lam, Alex C; Ovington, Nigel R; Stevens, Helen E; Nutland, Sarah; Howson, Joanna M M; Faham, Malek; Moorhead, Martin; Jones, Hywel B; Falkowski, Matthew; Hardenbol, Paul; Willis, Thomas D; Todd, John A

doi:10.1038/ng1653

Letter
Published: 09 October 2005

Population structure, differential bias and genomic control in a large-scale, case-control association study

David G Clayton¹,
Neil M Walker¹,
Deborah J Smyth¹,
Rebecca Pask¹,
Jason D Cooper¹,
Lisa M Maier¹,
Luc J Smink¹,
Alex C Lam¹,
Nigel R Ovington¹,
Helen E Stevens¹,
Sarah Nutland¹,
Joanna M M Howson¹,
Malek Faham²,
Martin Moorhead²,
Hywel B Jones²,
Matthew Falkowski²,
Paul Hardenbol²,
Thomas D Willis² &
…
John A Todd¹

Nature Genetics volume 37, pages 1243–1246 (2005)Cite this article

3443 Accesses
422 Citations
18 Altmetric
Metrics details

Abstract

The main problems in drawing causal inferences from epidemiological case-control studies are confounding by unmeasured extraneous factors, selection bias and differential misclassification of exposure¹. In genetics the first of these, in the form of population structure, has dominated recent debate^2,3,4. Population structure explained part of the significant +11.2% inflation of test statistics we observed in an analysis of 6,322 nonsynonymous SNPs in 816 cases of type 1 diabetes and 877 population-based controls from Great Britain. The remainder of the inflation resulted from differential bias in genotype scoring between case and control DNA samples, which originated from two laboratories, causing false-positive associations. To avoid excluding SNPs and losing valuable information, we extended the genomic control method^2,3,4,5 by applying a variable downweighting to each SNP.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 2: Quantile-quantile plots of Cochran-Armitage test statistics.**

Reconstructing SNP allele and genotype frequencies from GWAS summary statistics

Article Open access 17 May 2022

Assessing the contribution of rare variants to complex trait heritability from whole-genome sequence data

Article 07 March 2022

Controlling for human population stratification in rare variant association studies

Article Open access 24 September 2021

References

Breslow, N.E. & Day, N.E. Statistical Methods in Cancer Research Vol. I. The Analysis of Case-Control Studies (International Agency for Research on Cancer, Lyon, 1980).
Google Scholar
Devlin, B., Bacanu, S.A. & Roeder, K. Genomic control to the extreme. Nat. Genet. 36, 1129–1130; author reply 1131 (2004).
Article CAS Google Scholar
Freedman, M.L. et al. Assessing the impact of population stratification on genetic association studies. Nat. Genet. 36, 388–393 (2004).
Article CAS Google Scholar
Marchini, J., Cardon, L.R., Phillips, M.S. & Donnelly, P. The effects of human population structure on large genetic association studies. Nat. Genet. 36, 512–517 (2004).
Article CAS Google Scholar
Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004 (1999).
Article CAS Google Scholar
Vella, A. et al. Localization of a type 1 diabetes locus in the IL2RA/CD25 region by use of tag single-nucleotide polymorphisms. Am. J. Hum. Genet. 76, 773–779 (2005).
Article CAS Google Scholar
Lowe, C.E. et al. Cost-effective analysis of candidate genes using htSNPs: a staged approach. Genes Immun. 5, 301–305 (2004).
Article CAS Google Scholar
Wang, W.Y., Barratt, B.J., Clayton, D.G. & Todd, J.A. Genome-wide association studies: theoretical and practical concerns. Nat. Rev. Genet. 6, 109–118 (2005).
Article CAS Google Scholar
Hardenbol, P. et al. Multiplexed genotyping with sequence-tagged molecular inversion probes. Nat. Biotechnol. 21, 673–678 (2003).
Article CAS Google Scholar
Hardenbol, P. et al. Highly multiplexed molecular inversion probe genotyping: over 10,000 targeted SNPs genotyped in a single tube assay. Genome Res. 15, 269–275 (2005).
Article CAS Google Scholar
Ueda, H. et al. Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease. Nature 423, 506–511 (2003).
Article CAS Google Scholar
The International HapMap Consortium. The International HapMap Project. Nature 426, 789–796 (2003).
Armitage, P. Test for linear trend in proportions and frequencies. Biometrics II, 375–386 (1955).
Article Google Scholar
Mantel, N. Chi-square tests with one degree of freedom: extensions of the Mantel-Haenszel procedure. J. Am. Stat. Assoc. 58, 690–700 (1963).
Google Scholar
Nelder, J. & Wedderburn, R. Generalised linear models. J. R. Statist. Soc. A 135, 370–384 (1972).
Article Google Scholar
Moorhead, M. et al. Optimal genotype determination in highly multiplexed SNP data. Eur. J. Hum. Genet. (in the press).

Download references

Acknowledgements

We thank the individuals with T1D and control individuals for their participation; G. Coleman, S. Field, T. Mistry, K. Bourget, S. Clayton, M. Hardy, P. Lauder, M. Maisuria, W. Meadows and S. Wood for preparing DNA samples; D. Strachan, R. Jones, S. Ring and W. McArdle for providing DNA from the 1958 British Birth Cohort collection; and A. Long, N. Naclerio, T. Cormier, K. Tran, C. Bruckner and S. Picton for genotyping and technical assistance. We acknowledge use of DNA from the 1958 British Birth Cohort collection, funded by the Medical Research Council and the Wellcome Trust. We thank the Juvenile Diabetes Research Foundation, the Wellcome Trust, Diabetes UK and the Medical Research Council for financial support. D.G.C. is a Juvenile Diabetes Research Foundation and Wellcome Trust Principal Research Fellow.

Author information

Authors and Affiliations

Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, University of Cambridge, Cambridge Institute for Medical Research, Wellcome Trust/MRC Building, Cambridge, CB2 2XY, UK
David G Clayton, Neil M Walker, Deborah J Smyth, Rebecca Pask, Jason D Cooper, Lisa M Maier, Luc J Smink, Alex C Lam, Nigel R Ovington, Helen E Stevens, Sarah Nutland, Joanna M M Howson & John A Todd
ParAllele BioScience, 7300 Shoreline Court, South San Francisco, California, 94080, USA
Malek Faham, Martin Moorhead, Hywel B Jones, Matthew Falkowski, Paul Hardenbol & Thomas D Willis

Authors

David G Clayton
View author publications
You can also search for this author in PubMed Google Scholar
Neil M Walker
View author publications
You can also search for this author in PubMed Google Scholar
Deborah J Smyth
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca Pask
View author publications
You can also search for this author in PubMed Google Scholar
Jason D Cooper
View author publications
You can also search for this author in PubMed Google Scholar
Lisa M Maier
View author publications
You can also search for this author in PubMed Google Scholar
Luc J Smink
View author publications
You can also search for this author in PubMed Google Scholar
Alex C Lam
View author publications
You can also search for this author in PubMed Google Scholar
Nigel R Ovington
View author publications
You can also search for this author in PubMed Google Scholar
Helen E Stevens
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Nutland
View author publications
You can also search for this author in PubMed Google Scholar
Joanna M M Howson
View author publications
You can also search for this author in PubMed Google Scholar
Malek Faham
View author publications
You can also search for this author in PubMed Google Scholar
Martin Moorhead
View author publications
You can also search for this author in PubMed Google Scholar
Hywel B Jones
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Falkowski
View author publications
You can also search for this author in PubMed Google Scholar
Paul Hardenbol
View author publications
You can also search for this author in PubMed Google Scholar
Thomas D Willis
View author publications
You can also search for this author in PubMed Google Scholar
John A Todd
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to David G Clayton or John A Todd.

Ethics declarations

Competing interests

M. Faham, M.M., H.B.J., M. Falkowski, P.H. and T.D.W. are currently employed by ParAllele Bioscience.

Supplementary information

Supplementary Note (PDF 106 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Clayton, D., Walker, N., Smyth, D. et al. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat Genet 37, 1243–1246 (2005). https://doi.org/10.1038/ng1653

Download citation

Received: 10 June 2005
Accepted: 15 August 2005
Published: 09 October 2005
Issue Date: 01 November 2005
DOI: https://doi.org/10.1038/ng1653

This article is cited by

Transfer learning for genotype–phenotype prediction using deep learning models
- Muhammad Muneeb
- Samuel Feng
- Andreas Henschel
BMC Bioinformatics (2022)
Towards fine-scale population stratification modeling based on kernel principal component analysis and random forest
- Weiwen Zhang
- Lianglun Cheng
- Guoheng Huang
Genes & Genomics (2021)
Genotype calling of triploid offspring from diploid parents
- Kim Erik Grashei
- Jørgen Ødegård
- Theo H. E. Meuwissen
Genetics Selection Evolution (2020)
Neo-functionalization of a Teosinte branched 1 homologue mediates adaptations of upland rice
- Jun Lyu
- Liyu Huang
- Fengyi Hu
Nature Communications (2020)
Robust genome-wide ancestry inference for heterogeneous datasets: illustrated using the 1,000 genome project with 3D facial images
- Jiarui Li
- Tomás González Zarzar
- Peter Claes
Scientific Reports (2020)

Population structure, differential bias and genomic control in a large-scale, case-control association study

Abstract

Access options

Similar content being viewed by others

Reconstructing SNP allele and genotype frequencies from GWAS summary statistics

Assessing the contribution of rare variants to complex trait heritability from whole-genome sequence data

Controlling for human population stratification in rare variant association studies

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Supplementary Note (PDF 106 kb)

Rights and permissions

About this article

Cite this article

This article is cited by

Transfer learning for genotype–phenotype prediction using deep learning models

Towards fine-scale population stratification modeling based on kernel principal component analysis and random forest

Genotype calling of triploid offspring from diploid parents

Neo-functionalization of a Teosinte branched 1 homologue mediates adaptations of upland rice

Robust genome-wide ancestry inference for heterogeneous datasets: illustrated using the 1,000 genome project with 3D facial images

Search

Quick links

Abstract

Access options

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links