Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Research Article
  • Published:

Selective amplification of protein-coding regions of large sets of genes using statistically designed primer sets

Abstract

We describe a novel approach to design a set of primers selective for large groups of genes. This method is based on the distribution frequency of all nucleotide combinations (octa- to decanucleotides), and the combined ability of primer pairs, based on these oligonucleotides, to detect genes. By analyzing 1000 human mRNAs, we found that a surprisingly small subset of octanucleotides is shared by a high proportion of human protein-coding region sense strands. By computer simulation of polymerase chain reactions, a set based on only 30 primers was able to detect approximately 75% of known (and presumably unknown) human protein-coding regions. To validate the method and provide experimental support for the feasibility of the more ambitious goal of targeting human protein-coding regions, we sought to apply the technique to a large protein family: G-protein coupled receptors (GPCRs). Our results indicate that there is sufficient low level homology among human coding regions to allow design of a limited set of primer pairs that can selectively target coding regions in general, as well as genomic subsets (e.g., GPCRs). The approach should be generally applicable to human coding regions, and thus provide an efficient method for analyzing much of the transcriptionally active human genome.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Similar content being viewed by others

References

  1. Lisitsyn, N., Lisitsyn, N., and Wigler, M. 1993. Cloning the differences between two complex genomes. Science 259: 946–951.

    Article  CAS  Google Scholar 

  2. Lander, E.S. 1993. Finding similarities and differences between genomes. Nature Genetics 4: 5–6.

    Article  CAS  Google Scholar 

  3. Lisitsyn, N.A., Segre, J.A., Kusumi, K., Lisitsyn, N.M., Nadeau, J.H., Frankel, W.N., Wigler, M.H., and Lander, E.S. 1994. Direct isolation of polymorphic markers linked to a trait by genetically directed representational difference analysis. Nature Genetics 6: 57–63.

    Article  CAS  Google Scholar 

  4. Zhang, M.Q. and Marr, T.G. 1993. Genome mapping by nonrandom anchoring: a discrete theoretical analysis. Proc. Natl. Acad. Sci. USA 90: 600–604.

    Article  CAS  Google Scholar 

  5. Smith, M.W., Holmsen, A.L., Wei, Y.H., Peterson, M., and Evans, G.A. 1994. Genomic sequence sampling: a strategy for high resolution sequence-based physical mapping of complex genomes. Nature Genetics 7: 40–47.

    Article  CAS  Google Scholar 

  6. Lee, S.W., Tomasetto, C., and Sager, R. 1991. Positive selection of candidate tumor-suppressor genes by subtractive hybridization. Proc. Natl. Acad. Sci. USA 88: 2825.

    Article  CAS  Google Scholar 

  7. Liang, P. and Pardee, A.B. 1992. Differential display of eukaryotic messenger RNA by means of the polymerase chain reaction. Science 257: 967–971.

    Article  CAS  Google Scholar 

  8. Liang, P., Averboukh, L., and Pardee, A.B. 1993. Distribution and cloning of eukaryotic mRNAs by means of differential display: refinements and optimizations. Nucleic Acids Res. 21: 3169–3175.

    Article  Google Scholar 

  9. Welsh, J., Chada, K., Dalal, S., Cheng, R., Ralph, D., and McClelland, M. 1992. Arbitrarily primed PCR fingerprinting of RNA. Nucleic Acids Res. 20: 4965–4970.

    Article  CAS  Google Scholar 

  10. Libert, F., Parmentier, M., Lefort, A., Dinsart, C., Van Sande, J., Maenhaut, C., Simons, M.-J., Dumont, J.E., and Vassart, G. 1989. Selective amplification and cloning of four new members of the G-protein-coupled receptor family. Science 244: 569–572.

    Article  CAS  Google Scholar 

  11. Stone, B. and Wharton, W. 1994. Targeted RNA fingerprinting: the cloning of differentially-expressed cDNA fragments enriched for members of the zinc finger gene family. Nucleic Acids Res. 22: 2612–2618.

    Article  CAS  Google Scholar 

  12. Caetano-Anolles, G. 1993. Amplifying DNA with arbitrary oligonucleotide primers. PCR Methods and Applications 3: 85–94.

    Article  CAS  Google Scholar 

  13. Caetano-Anolles, G., Bassam, B.J., and Gresshoff, P.M. 1992. Primer-template interactions during DNA amplification fingerprinting with single arbitrary oligonucleotides. Mol. and Gen. Genet. 325: 157–165.

    Article  Google Scholar 

  14. Caetano-Anolles, G., Bassam, B.J., and Gresshoff, P.M. 1991. DNA amplification fingerprinting using very short arbitrary oligonucleotide primers. Bio/Technology 9: 553–557.

    CAS  PubMed  Google Scholar 

  15. Sommer, R. and Tautz, D. 1989. Minimal homology requirements for PCR primers. Nucleic Acids Res. 17: 6749.

    Article  CAS  Google Scholar 

  16. Kwok, S., Kellogg, D.E., McKinney, E., Spasic, D., Goda, L., Levenson, C., and Sninsky, J.J. 1990. Effects of primer-template mismatches on the polymerase chain reaction: human immunodeficiency virus type 1 model studies. Nucleic Acids Res. 18: 999–1005.

    Article  CAS  Google Scholar 

  17. Gish, W. and States, D.J. 1993. Identification of protein coding regions by database similarity search. Nature Genetics 3: 266–272.

    Article  CAS  Google Scholar 

  18. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 03–10.

    Article  Google Scholar 

  19. National Center for Biotechnology Information. 1993. Entrez: Sequences, Releases 6.0 and 7.0. National Institutes of Health, Bethesda, MD.

  20. Dumas, J.P. and Ninio, J. 1982. Efficient algorithms for folding and comparing nucleic acid sequences. Nucleic Acids Res. 10: 197–206.

    Article  CAS  Google Scholar 

  21. Claverie, J.M., Sauvaget, I., and Bougueleret, L. 1990. K-tuple frequency analysis: from intron/exon discrimination to T-cell epitope mapping. Methods in Enzymology 183: 237–254.

    Article  CAS  Google Scholar 

  22. Griffais, R., Andre, P.M., and Thibon, M. 1991. K-tuple frequency in the human genome and polymerase chain reaction. Nucleic Acids Res. 19: 3887–3891.

    Article  CAS  Google Scholar 

  23. Chirgwin, J., Przybyla, A., MacDonald, R., and Rutter, W.J. 1979. Isolation of biologically active ribonucleic acid from sources enriched in ribonuclease. Biochemistry 18: 5294–5299.

    Article  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

López-Nieto, C., Nigam, S. Selective amplification of protein-coding regions of large sets of genes using statistically designed primer sets. Nat Biotechnol 14, 857–861 (1996). https://doi.org/10.1038/nbt0796-857

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nbt0796-857

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing