Metagenomics uncovers gaps in amplicon-based detection of microbial diversity

  • Nature Microbiology 1, Article number: 15032 (2016)
  • doi:10.1038/nmicrobiol.2015.32
  • Download Citation
Published online:


Our view of microbial diversity has expanded greatly over the past 40 years, primarily through the wide application of PCR-based surveys of the small-subunit ribosomal RNA (SSU rRNA) gene. Yet significant gaps in knowledge remain due to well-recognized limitations of this method. Here, we systematically survey primer fidelity in SSU rRNA gene sequences recovered from over 6,000 assembled metagenomes sampled globally. Our findings show that approximately 10% of environmental microbial sequences might be missed from classical PCR-based SSU rRNA gene surveys, mostly members of the Candidate Phyla Radiation (CPR) and as yet uncharacterized Archaea. These results underscore the extent of uncharacterized microbial diversity and provide fruitful avenues for describing additional phylogenetic lineages.

  • Subscribe to Nature Microbiology for full access:



Additional access options:

Already a subscriber?  Log in  now or  Register  for online access.


  1. 1.

    Mapping the tree of life: Progress and prospects. Microbiol. Mol. Biol. Rev. 73, 565–576 (2009).

  2. 2.

    et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 6, 1621–1624 (2012).

  3. 3.

    et al. Uniting the classification of cultured and uncultured bacteria and archaea using 16S rRNA gene sequences. Nature Rev. Microbiol. 12, 635–645 (2014).

  4. 4.

    et al. Unusual biology across a group comprising more than 15% of domain Bacteria. Nature 523, 208–211 (2015).

  5. 5.

    et al. Genomic expansion of domain archaea highlights roles for organisms from new phyla in anaerobic carbon cycling. Curr. Biol. 25, 690–701 (2015).

  6. 6.

    et al. Insights into the phylogeny and coding potential of microbial dark matter. Nature 499, 431–437 (2013).

  7. 7.

    & Ecology and exploration of the rare biosphere. Nature Rev. Microbiol. 13, 217–229 (2015).

  8. 8.

    et al. Experimental factors affecting PCR-based estimates of microbial species richness and evenness. ISME J. 4, 642–647 (2010).

  9. 9.

    & Searching for new branches on the tree of life. Science 346, 698–699 (2014).

  10. 10.

    et al. IMG/M 4 version of the integrated metagenome comparative analysis system. Nucleic Acids Res. 42, D568–D573 (2014).

  11. 11.

    et al. The Earth Microbiome Project: Meeting report of the ‘1st EMP meeting on sample selection and acquisition’ at Argonne National Laboratory October 6th 2010. Stand. Genomic Sci. 3, 249–253 (2010).

  12. 12.

    , & Every base matters: assessing small subunit rRNA primers for marine microbiomes with mock communities, time-series and global field samples. Environ. Microbiol. (2015).

  13. 13.

    et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 41, e1 (2013).

  14. 14.

    The Human Microbiome Consortium. Structure, function and diversity of the healthy human microbiome. Nature 486, 207–214 (2012).

  15. 15.

    et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521, 173–179 (2015).

  16. 16.

    et al. Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes. Nature Biotechnol. 31, 533–538 (2013).

  17. 17.

    et al. PrimerProspector: de novo design and taxonomic analysis of barcoded polymerase chain reaction primers. Bioinformatics 27, 1159–1161 (2011).

  18. 18.

    BLAT—The BLAST-Like Alignment Tool. Genome Res. 12, 656–664 (2002).

  19. 19.

    & Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935 (2013).

  20. 20.

    RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690 (2006).

  21. 21.

    et al. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).

Download references


This work was conducted by the US Department of Energy Joint Genome Institute, a DOE Office of Science User Facility (contract no. DE-AC02-05CH11231), and used resources of the National Energy Research Scientific Computing Center, which is supported by the Office of Science of the US Department of Energy (contract no. DE-AC02-05CH11231).

Author information


  1. Joint Genome Institute, Walnut Creek, California 94598, USA

    • Emiley A. Eloe-Fadrosh
    • , Natalia N. Ivanova
    • , Tanja Woyke
    •  & Nikos C. Kyrpides


  1. Search for Emiley A. Eloe-Fadrosh in:

  2. Search for Natalia N. Ivanova in:

  3. Search for Tanja Woyke in:

  4. Search for Nikos C. Kyrpides in:


E.A.E.-F., N.N.I., T.W. and N.C.K. designed the project, analysed the data and wrote the manuscript. All authors discussed the results and commented on the manuscript.

Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to Nikos C. Kyrpides.

Supplementary information

PDF files

  1. 1.

    Supplementary Information

    Supplementary Figures 1–3, Supplementary Tables 1,2.