 |
| Original Article |
| Efficient discovery of single-nucleotide polymorphisms in coding regions of human genes |
 |
| G Hu1,3,4, B Modreck1,3,4, H M F Riise Stensland2, J Saarela2, P Pajukanta2, V Kustanovich2, L Peltonen2, S F Nelson2 and C Lee1,3,4 |
 |
1Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA
2Department of Human Genetics, University of California, Los Angeles, CA, USA
3UCLA-DOE Laboratory for Structural Biology and Molecular Medicine, University of California, Los Angeles, CA, USA
4Molecular Biology Institute, University of California, CA, USA
|
Correspondence to: C Lee, UCLA-DOE Laboratory for Structural Biology and Molecular Medicine, University of California, Los Angeles, CA 90095-1570, USA. Tel: +1 310 825 7374 Fax: +1 310 267 0248 E-mail: leec@mbi.ucla.edu |
 |
| Abstract |
 | Single nucleotide polymorphisms in protein coding regions (cSNPs) are of great interest for their effects on phenotype and potential for mapping disease genes. We have identified 5400 novel exonic SNPs from alignments of public EST data to the draft human genome sequence, and approximately 12 000 more novel exonic SNPs from EST cluster alignments. We found 82% of the genomic-aligned SNPs and 63% of the EST-only SNPs to be detectably polymorphic in 20 Finnish DNA samples. 37% of the SNPs mapped to known protein coding regions, yielding 6500 distinct, novel cSNPs from the two datasets. These data reveal selection against mutations that alter protein structure, and distinct classes of genes under strongly positive vs. negative pressure from natural selection for amino acid replacement (detected by KA/KSratio). We have searched these cSNPs for compatibility with the amino acid profile at each site and structural impact on protein core stability. The Pharmacogenomics Journal (2002) 2, 236-242. doi:10.1038/sj.tpj.6500109 |
 |
| Keywords |
 | data mining; bioinformatics; functional genomics; cSNP; polymorphisms; EST |
|
 |
 |
 |
| Received 17 November 2001; revised 30 January 2002; accepted 7 March 2002 |
 |
| 2002, Volume 2, Number 4, Pages 236-242 |
| Table of contents Previous Abstract Next Full text PDF |
|