Article
European Journal of Human Genetics (2008) 16, 506–515; doi:10.1038/sj.ejhg.5201987; published online 16 January 2008
Singleton SNPs in the human genome and implications for genome-wide association studies
Xiayi Ke1,2,3, Martin S Taylor1,4 and Lon R Cardon1
- 1Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
- 2Centre for Integrated Genomic Medical Research (CIGMR), School of Medicine, University of Manchester, Manchester, UK
- 3Arthritis Research Campaign (arc) Epidemiology Unit, School of Medicine, University of Manchester, Manchester, UK
Correspondence: Dr X Ke, Centre for Integrated Genomic Medical Research (CIGMR), School of Medicine, University of Manchester, Oxford Road, Manchester, M13 9PT, UK. Tel: +44 161 2751674; Fax: +44 161 2755043; E-mail: Xiayi.Ke@manchester.ac.uk
4Current address: European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
Received 9 February 2007; Revised 16 November 2007; Accepted 22 November 2007; Published online 16 January 2008.
Abstract
The human genome is estimated to contain one single nucleotide polymorphism (SNP) every 300 base pairs. The presence of LD between SNP markers can be used to save genotyping cost via appropriate SNP tagging strategies, whereas absence or low level of LD between markers generally increase genotyping cost. It is quite common that a large proportion of tagging SNPs in a tagging scheme often turn out to be singleton SNPs, that is, SNPs that only tag themselves rather than contribute power to the rest of a region. If genotyping cost is a major concern, which often is the case at the present time for genome-wide association studies, these singleton tagging SNPs would be the primary targets to be removed from genotyping. It is important, however, to understand the characteristics of such SNPs and estimate the impact of removing them in a study. Using the HapMap genotype data and genome wide expression data, we assessed the distribution and functional implications of singleton SNPs in the human genome. Our results demonstrated that SNPs of potentially higher functional importance (eg, nonsynonymous SNPs, SNPs in splicing sites and SNPs in 5' and 3' UTR) are associated with a higher tendency to be singleton SNPs than SNPs in intronic and intergenic regions. We further assessed whether singleton SNPs can be tagged using haplotypes of tagSNPs in the three genome wide chips, that is, GeneChip 500k of Affymetrix, HumanHap300 and HumanHap550 of Illumina, and discussed the general implications on genetic association studies.
Keywords:
singleton SNPs, functionality, linkage disequilibrium, genetic association
MORE ARTICLES LIKE THIS
These links to content published by NPG are automatically generated
REVIEWS
Human genetic variation and its contribution to complex traits
Nature Reviews Genetics Review (01 Apr 2009)
NEWS AND VIEWS
Agnosticism and equity in genome-wide association studies
Nature Genetics News and Views (01 Jun 2006)
Evaluating signatures of sex-specific processes in the human genome
Nature Genetics News and Views (01 Jan 2009)
RESEARCH
Evaluating coverage of genome-wide association studies
Nature Genetics Letter (01 Jun 2006)
A second generation human haplotype map of over 3.1 million SNPs
Nature Article (18 Oct 2007)

