Characterization of single-nucleotide polymorphisms in coding regions of human genes

Cargill, Michele; Altshuler, David; Ireland, James; Sklar, Pamela; Ardlie, Kristin; Patil, Nila; Lane, Charles R.; Lim, Esther P.; Kalyanaraman, Nilesh; Nemesh, James; Ziaugra, Liuda; Friedland, Lisa; Rolfe, Alex; Warrington, Janet; Lipshutz, Robert; Daley, George Q.; Lander, Eric S.

doi:10.1038/10290

Article
Published: July 1999

Characterization of single-nucleotide polymorphisms in coding regions of human genes

Michele Cargill¹^na1,
David Altshuler^1,2^na1,
James Ireland¹,
Pamela Sklar^1,3,
Kristin Ardlie¹,
Nila Patil⁵,
Charles R. Lane¹,
Esther P. Lim¹,
Nilesh Kalyanaraman¹,
James Nemesh¹,
Liuda Ziaugra¹,
Lisa Friedland¹,
Alex Rolfe¹,
Janet Warrington⁵,
Robert Lipshutz⁵,
George Q. Daley^1,4 &
…
Eric S. Lander^1,6

Nature Genetics volume 22, pages 231–238 (1999)Cite this article

6148 Accesses
1442 Citations
26 Altmetric
Metrics details

A Correction to this article was published on 01 November 1999

Abstract

A major goal in human genetics is to understand the role of common genetic variants in susceptibility to common diseases. This will require characterizing the nature of gene variation in human populations, assembling an extensive catalogue of single-nucleotide polymorphisms (SNPs) in candidate genes and performing association studies for particular diseases. At present, our knowledge of human gene variation remains rudimentary. Here we describe a systematic survey of SNPs in the coding regions of human genes. We identified SNPs in 106 genes relevant to cardiovascular disease, endocrinology and neuropsychiatry by screening an average of 114 independent alleles using 2 independent screening methods. To ensure high accuracy, all reported SNPs were confirmed by DNA sequencing. We identified 560 SNPs, including 392 coding-region SNPs (cSNPs) divided roughly equally between those causing synonymous and non-synonymous changes. We observed different rates of polymorphism among classes of sites within genes (non-coding, degenerate and non-degenerate) as well as between genes. The cSNPs most likely to influence disease, those that alter the amino acid sequence of the encoded protein, are found at a lower rate and with lower allele frequencies than silent substitutions. This likely reflects selection acting against deleterious alleles during human evolution. The lower allele frequency of missense cSNPs has implications for the compilation of a comprehensive catalogue, as well as for the subsequent application to disease association.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Minor allele frequency by polymorphism type.**

**Figure 2: Distribution of nucleotide diversity.**

References

Ayala, F.J., Escalante, A., O'Huigin, C. & Klein, J. Molecular genetics of speciation and human origins. Proc. Natl Acad. Sci. USA 91, 6787–6794 (1994).
Article CAS PubMed PubMed Central Google Scholar
Risch, N. & Merikangas, K. The future of genetic studies of complex human diseases. Science 273, 1516–1517 (1996).
Article CAS PubMed Google Scholar
Collins, F.S., Guyer, M.S. & Chakravarti, A. Variations on a theme: cataloging human DNA sequence variation. Science 278, 1580– 1581 (1997).
Article CAS PubMed Google Scholar
Lander, E.S. The new genomics: global views of biology. Science 274, 536–539 (1996).
Article CAS PubMed Google Scholar
Saunders, A.M. et al. Association of apolipoprotein E allele ε 4 with late-onset familial and sporadic Alzheimer's disease. Neurology 43, 1467–1472 (1993).
Article CAS PubMed Google Scholar
Bertina, R.M. et al. Mutation in blood coagulation factor V associated with resistance to activated protein C. Nature 369, 64– 67 (1994).
Article CAS PubMed Google Scholar
Dean, M. et al. Genetic restriction of HIV-1 infection and progression to AIDS by a deletion allele of the CKR5 structural gene. Hemophilia Growth and Development Study, Multicenter AIDS Cohort Study, Multicenter Hemophilia Cohort Study, San Francisco City Cohort, ALIVE Study. Science 273 , 1856–1862 (1996).
Article CAS PubMed Google Scholar
Corder, E.H. et al. Protective effect of apolipoprotein E type 2 allele for late onset Alzheimer disease. Nature Genet. 7, 180–184 (1994).
Article CAS PubMed Google Scholar
Moriyama, E.N. & PowelI, J.R. Intraspecific nuclear DNA variation in Drosophila. Mol. Biol. Evol. 13, 261– 277 (1996).
Article CAS PubMed Google Scholar
Harris, H. The Principles of Biochemical Genetics (North-Holland/Elsevier, Amsterdam, 1975).
Harding, R.M. et al. Archaic African and Asian lineages in the genetic ancestry of modern humans. Am. J. Hum. Genet. 60, 772–789 (1997).
CAS PubMed PubMed Central Google Scholar
Nickerson, D.A. et al. DNA sequence diversity in a 9.7-kb region of the human lipoprotein lipase gene. Nature Genet. 19, 233– 240 (1998).
Article CAS PubMed Google Scholar
Li, W.-H. & Sadler, L.A. Low nucleotide diversity in man. Genetics 129, 513–523 (1991).
CAS PubMed PubMed Central Google Scholar
Chee, M. et al. Accessing genetic information with high-density DNA arrays. Science 274, 610–614 (1996).
Article CAS PubMed Google Scholar
Wang, D.G. et al. Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science 280, 1077–1082 (1998).
Article CAS PubMed Google Scholar
Underhill, P.A. et al. A pre-Columbian Y chromosome-specific transition and its implications for human evolutionary history. Proc. Natl Acad. Sci. USA 93, 196–200 (1996).
Article CAS PubMed PubMed Central Google Scholar
Li, W.-H. Molecular Evolution (Sinauer Associates, Canada, 1997 ).
Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123, 585–595 (1989).
CAS PubMed PubMed Central Google Scholar
Begun, D.J. & Aquadro, C.F. Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster . Nature 356, 519–520 (1993).
Article Google Scholar
Nachman, M.W., Bauer, V.L. Crowell, S.L. & Aquadro, C.F. DNA variability and recombination rates at X-linked loci in humans. Genetics 150, 1133–1141 (1998).
CAS PubMed PubMed Central Google Scholar
Wayne, M.L. & Simonson, K.L. Statistical tests of neutrality in the age of weak selection. Trends Ecol. Evol. 13 , 236 (1998).
Article CAS PubMed Google Scholar
Lander, E.S. & Schork, N.J. Genetic dissection of complex traits. Science 265, 2037–2048 (1994).
Article CAS PubMed Google Scholar
Watterson, G.A. & Guess, H.A. Is the most frequent allele the oldest? Theor. Popul. Biol. 11, 141–160 (1977).
Article CAS PubMed Google Scholar
Zietkiewicz, E. et al. Nuclear DNA diversity in worldwide distributed human populations. Gene 205, 161–171 (1997).
Article CAS PubMed Google Scholar
Halushka, M.K. et al. Patterns of single-nucleotide polymorphisms in candidate genes regulating blood-pressure homeostasis. Nature Genet. 22, 239–247 (1999).
Article CAS PubMed Google Scholar
Eyre-Walker. A. & Keightley, P. High genomic deleterious mutations rates in hominids. Nature 397 , 344–347 (1999).
Article CAS PubMed Google Scholar
Weber, J.L. & Myers, E.W. Human whole-genome shotgun sequencing. Genome Res. 7, 401–409 (1997).
Article CAS PubMed Google Scholar
Venter, J.C. et al. Shotgun sequencing of the human genome. Science 280, 1540–1542 ( 1998).
Article CAS PubMed Google Scholar
Clark, A.G. et al. Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase. Am. J. Hum. Genet. 63, 595–612 (1998).
Article CAS PubMed PubMed Central Google Scholar
Day, D.J., Speiser, P.W., White, P.C. & Barany, F. Detection of steroid-21 hydroxylase alleles using gene specific PCR and a multiplex ligation detection reaction. Genomics 29 152–162 (1995).
Nickerson D.A., Tobe, V.O. & Taylor, S.L. PolyPhred: automating the detection and genotyping of single nucleotide substitution using fluorescence-based resequencing. Nucleic Acids Res. 25, 2745–2751 (1997).
Article CAS PubMed PubMed Central Google Scholar
Henikoff, S. & Henikoff, J.G. Amino acid substitution matrices from protein blocks. Proc. Natl Acad. Sci. USA 89, 10915–10919 (1992).
Article CAS PubMed PubMed Central Google Scholar

Download references

Author information

Michele Cargill and David Altshuler: These authors contributed equally to this work.

Authors and Affiliations

Whitehead Institute/MIT Center for Genome Research , One Kendall Square, Building 300, Cambridge , 02139, Massachusetts, USA
Michele Cargill, David Altshuler, James Ireland, Pamela Sklar, Kristin Ardlie, Charles R. Lane, Esther P. Lim, Nilesh Kalyanaraman, James Nemesh, Liuda Ziaugra, Lisa Friedland, Alex Rolfe, George Q. Daley & Eric S. Lander
Departments of Endocrinology, Boston, 02114, Massachusetts, USA
David Altshuler
Psychiatry, Boston, 02114, Massachusetts , USA
Pamela Sklar
Hematology, Massachusetts General Hospital, Boston, 02114, Massachusetts, USA
George Q. Daley
Affymetrix, Inc., Santa Clara, 95051, California, USA
Nila Patil, Janet Warrington & Robert Lipshutz
Department of Biology, Massachusetts Institute of Technology, Cambridge, 02139, Massachusetts, USA
Eric S. Lander

Authors

Michele Cargill
View author publications
You can also search for this author in PubMed Google Scholar
David Altshuler
View author publications
You can also search for this author in PubMed Google Scholar
James Ireland
View author publications
You can also search for this author in PubMed Google Scholar
Pamela Sklar
View author publications
You can also search for this author in PubMed Google Scholar
Kristin Ardlie
View author publications
You can also search for this author in PubMed Google Scholar
Nila Patil
View author publications
You can also search for this author in PubMed Google Scholar
Charles R. Lane
View author publications
You can also search for this author in PubMed Google Scholar
Esther P. Lim
View author publications
You can also search for this author in PubMed Google Scholar
Nilesh Kalyanaraman
View author publications
You can also search for this author in PubMed Google Scholar
James Nemesh
View author publications
You can also search for this author in PubMed Google Scholar
Liuda Ziaugra
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Friedland
View author publications
You can also search for this author in PubMed Google Scholar
Alex Rolfe
View author publications
You can also search for this author in PubMed Google Scholar
Janet Warrington
View author publications
You can also search for this author in PubMed Google Scholar
Robert Lipshutz
View author publications
You can also search for this author in PubMed Google Scholar
George Q. Daley
View author publications
You can also search for this author in PubMed Google Scholar
Eric S. Lander
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Eric S. Lander.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cargill, M., Altshuler, D., Ireland, J. et al. Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat Genet 22, 231–238 (1999). https://doi.org/10.1038/10290

Download citation

Received: 28 April 1999
Accepted: 03 June 1999
Issue Date: July 1999
DOI: https://doi.org/10.1038/10290

This article is cited by

The Emerging Role of Toll-Like Receptor-Mediated Neuroinflammatory Signals in Psychiatric Disorders and Acquired Epilepsy
- Anubha Chaudhary
- Parul Mehra
- Amit Prasad
Molecular Neurobiology (2024)
Soybean (Glycine max) isoflavone conjugate hydrolysing β-glucosidase (GmICHG): a promising candidate for soy isoflavone bioavailability enhancement
- Sandeep Kumar
- Monika Awana
- Anil Dahuja
3 Biotech (2023)
PLEACH: a new heuristic algorithm for pure parsimony haplotyping problem
- Reza Feizabadi
- Mehri Bagherian
- Maziar Salahi
The Journal of Supercomputing (2023)
Detection of Transversions and Transitions in HBG2 Cis-Elements Associated with Sickle Cell Allele in Ghanaians
- G. K. Ababio
- I. Ekem
- I. K. Quaye
Biochemical Genetics (2023)
In-silico analysis unravels the structural and functional consequences of non-synonymous SNPs in the human IL-10 gene
- Shuvo Chandra Das
- Md. Anisur Rahman
- Shipan Das Gupta
Egyptian Journal of Medical Human Genetics (2022)

Characterization of single-nucleotide polymorphisms in coding regions of human genes

Abstract

Access options

Similar content being viewed by others

A structural variation reference for medical and population genetics

Rare copy number variants in over 100,000 European ancestry subjects reveal multiple disease associations

Effective variant filtering and expected candidate variant yield in studies of rare human disease

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

This article is cited by

The Emerging Role of Toll-Like Receptor-Mediated Neuroinflammatory Signals in Psychiatric Disorders and Acquired Epilepsy

Soybean (Glycine max) isoflavone conjugate hydrolysing β-glucosidase (GmICHG): a promising candidate for soy isoflavone bioavailability enhancement

PLEACH: a new heuristic algorithm for pure parsimony haplotyping problem

Detection of Transversions and Transitions in HBG2 Cis-Elements Associated with Sickle Cell Allele in Ghanaians

In-silico analysis unravels the structural and functional consequences of non-synonymous SNPs in the human IL-10 gene

Search

Quick links

Abstract

Access options

Similar content being viewed by others

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links