Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Protocol
  • Published:

Single nucleotide polymorphism (SNP) allele frequency estimation in DNA pools using Pyrosequencing™

Abstract

Identifying the genetic variation underlying complex disease requires analysis of many single nucleotide polymorphisms (SNPs) in a large number of samples. Several high-throughput SNP genotyping techniques are available; however, their cost promotes the use of association screening with pooled DNA. This protocol describes the estimation of SNP allele frequencies in pools of DNA using the quantitative sequencing method Pyrosequencing™ (PSQ™). PSQ is a relatively recently described high-throughput method for genotyping, allele frequency estimation and DNA methylation analysis based on the detection of real-time pyrophosphate release during synthesis of the complementary strand to a PCR product. The protocol involves the following steps: (i) quantity and quality assessment of individual DNA samples; (ii) DNA pooling, which may be undertaken at the pre- or post-PCR stage; (iii) PCR amplification of PSQ template containing the variable sequence region of interest; and (iv) PSQ to determine the frequency of alleles at a particular SNP site. Once the quantity and quality of individual DNA samples has been assessed, the protocol usually requires a few days for setting up pre-PCR pools, depending on sample number. After PCR amplification, preparation and analysis of PCR amplicon by PSQ takes 1 h per plate.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Flowchart of protocol for allele frequency estimation in DNA pools.
Figure 2: The PSQ method.
Figure 3: Standard curve to convert PSQ™-generated SNP allele peak heights into allele frequencies.
Figure 4: Frequency distribution of precision, defined as the within-pool standard deviation (SD) of the difference between estimated and true allele frequency (Δ).
Figure 5: Frequency distribution of the absolute difference between estimated and true allele frequency (Δ) in 186 pools of 10–479 DNA samples, quantified using Hoechst 33258, for three SNPs with at least one flanking base being homologous to the SNP allele.

Similar content being viewed by others

References

  1. The International HapMap Consortium. A haplotype map of the human genome. Nature 437, 1299–1320 (2005).

  2. Conrad, D.F. et al. A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nature Genet. 38, 1251–1260 (2006).

    Article  CAS  Google Scholar 

  3. Balding, D.J. A tutorial on statistical methods for population association studies. Nature Rev. Genet. 7, 781–791 (2006).

    Article  CAS  Google Scholar 

  4. Risch, N. & Merikangas, K. The future of genetic studies of complex human diseases. Science 273, 1516–1517 (1996).

    Article  CAS  Google Scholar 

  5. Schaid, D.J. Power and sample size for testing associations of haplotypes with complex traits. Ann. Hum. Genet. 70, 116–130 (2006).

    Article  Google Scholar 

  6. Syvänen, A.C. Toward genome-wide SNP genotyping. Nature Genet. 37 (Suppl): S5–S10 (2005).

    Article  Google Scholar 

  7. Fan, J.-B., Chee, M.S. & Gunderson, K.L. Highly parallel genomic assays. Nature Rev. Genet. 7, 632–644 (2006).

    Article  CAS  Google Scholar 

  8. Sham, P., Bader, J.S., Craig, I., O'Donovan, M. & Owen, M. DNA pooling: a tool for large-scale association studies. Nature Rev. Genet. 3, 862–871 (2002).

    Article  CAS  Google Scholar 

  9. Norton, N. et al. Universal, robust, highly quantitative SNP allele frequency measurement in DNA pools. Hum. Genet. 110, 471–478 (2002).

    Article  CAS  Google Scholar 

  10. Downes, K. et al. SNP allele frequency estimation in DNA pools and variance components analysis. Biotechniques 36, 840–845 (2004).

    Article  CAS  Google Scholar 

  11. Butcher, L.M. et al. Genotyping pooled DNA on microarrays: a systematic genome screen of thousands of SNPs in large samples to detect QTLs for complex traits. Behav. Genet. 34, 549–555 (2004).

    Article  Google Scholar 

  12. Craig, D.W. et al. Identification of disease causing loci using an array-based genotyping approach on pooled DNA. BMC Genomics 6, 138 (2005).

    Article  Google Scholar 

  13. Gruber, J.D., Colligan, P.B. & Wolford, J.K. Estimation of single nucleotide polymorphism allele frequency in DNA pools by using Pyrosequencing. Hum. Genet. 110, 395–401 (2002).

    Article  CAS  Google Scholar 

  14. Neve, B. et al. Rapid SNP allele frequency determination in genomic DNA pools by Pyrosequencing™. BioTechniques 32, 1138–1142 (2002).

    Article  CAS  Google Scholar 

  15. Wasson, J., Skolnick, G., Love-Gregory, L. & Permutt, M.A. Assessing allele frequencies of single nucleotide polymorphisms in DNA pools by Pyrosequencing™ technology. BioTechniques 32, 1144–1152 (2002).

    Article  CAS  Google Scholar 

  16. Lavebratt, C., Sengul, S., Jansson, M. & Schalling, M. Pyrosequencing-based SNP allele frequency estimation in DNA pools. Hum. Mutat. 23, 92–97 (2004).

    Article  CAS  Google Scholar 

  17. Permutt, M.A. et al. Searching for type 2 diabetes genes on chromosome 20. Diabetes 51, 308–315 (2002).

    Article  Google Scholar 

  18. Love-Gregory, L.D. et al. A common polymorphism in the upstream promoter region of the hepatocyte nuclear factor-4 gene on chromosome 20q is associated with type 2 diabetes and appears to contribute to the evidence for linkage in an ashkenazi jewish population. Diabetes 53, 1134–1140 (2004).

    Article  CAS  Google Scholar 

  19. Wang, H. et al. Adiponectin receptor 1 gene (ADIPOR1) as a candidate for type 2 diabetes and insulin resistance. Diabetes 53, 2132–2136 (2004).

    Article  CAS  Google Scholar 

  20. Wang, H. et al. Molecular screening and association analyses of the interleukin 6 receptor gene variants with type 2 diabetes, diabetic nephropathy, and insulin sensitivity. J. Clin. Endocrinol. Metab. 90, 1123–1129 (2005).

    Article  CAS  Google Scholar 

  21. Ronaghi, M., Karamohamed, S., Pettersson, B., Uhlén, M. & Nyrén, P. Real-time DNA sequencing using detection of pyrophosphate release. Anal. Biochem. 242, 84–89 (1996).

    Article  CAS  Google Scholar 

  22. Ronaghi, M., Uhlén, M. & Nyrén, P. A sequencing method based on real-time pyrophosphate detection. Science 281, 363–365 (1998).

    Article  CAS  Google Scholar 

  23. Alderborn, A., Kristofferson, A. & Hammerling, U. Determination of single-nucleotide polymorphisms by real-time pyrophosphate DNA sequencing. Genome Res. 10, 1249–1258 (2000).

    Article  CAS  Google Scholar 

  24. Fakhrai-Rad, H., Pourmand, N. & Ronaghi, M. Pyrosequencing™: An accurate detection platform for single nucleotide polymorphisms. Hum. Mutat. 19, 479–485 (2002).

    Article  CAS  Google Scholar 

  25. Nordfors, L. et al. Large-scale genotyping of single nucleotide polymorphisms by Pyrosequencing™ and validation against the 5' nuclease (TaqMan®) assay. Hum. Mutat. 19, 395–401 (2002).

    Article  CAS  Google Scholar 

  26. Karimi, M. et al. LUMA (LUminometric Methylation Assay) — a high throughput method to the analysis of genomic DNA methylation. Exp. Cell Res. 312, 1989–1995 (2006).

    Article  CAS  Google Scholar 

  27. Le Hellard, S. et al. SNP genotyping on pooled DNAs: comparison of genotyping technologies and a semi automated method for data storage and analysis. Nucleic Acids Res. 30, e74 (2002).

    Article  Google Scholar 

  28. Glantz, S.A. Primer of Biostatistics 4th edn. (McGraw Hill, New York, 1997).

    Google Scholar 

  29. Barratt, B.J. et al. Identification of the sources of error in allele frequency estimations from pooled DNA indicates an optimal experimental design. Ann. Hum. Genet. 66, 393–405 (2002).

    Article  CAS  Google Scholar 

Download references

Acknowledgements

This study was supported by grants from the Swedish Medical Research Council, the Swedish Heart and Lung Foundation, and the Swedish Society of Medicine.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Catharina Lavebratt.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lavebratt, C., Sengul, S. Single nucleotide polymorphism (SNP) allele frequency estimation in DNA pools using Pyrosequencing™. Nat Protoc 1, 2573–2582 (2006). https://doi.org/10.1038/nprot.2006.442

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nprot.2006.442

This article is cited by

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing