Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Brief Communication
  • Published:

Quality-filtering vastly improves diversity estimates from Illumina amplicon sequencing

Abstract

High-throughput sequencing has revolutionized microbial ecology, but read quality remains a considerable barrier to accurate taxonomy assignment and α-diversity assessment for microbial communities. We demonstrate that high-quality read length and abundance are the primary factors differentiating correct from erroneous reads produced by Illumina GAIIx, HiSeq and MiSeq instruments. We present guidelines for user-defined quality-filtering strategies, enabling efficient extraction of high-quality data and facilitating interpretation of Illumina sequencing results.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1
Figure 2: The α and β diversity comparisons of mock community reads filtered using select phred_quality_score (q) settings (data set 1).

Similar content being viewed by others

References

  1. Yatsunenko, T. et al. Nature 486, 222–227 (2012).

    Article  CAS  Google Scholar 

  2. Gilbert, J.A. & Meyer, F. ASM Microbe 7, 64–69 (2012).

    Google Scholar 

  3. Reeder, J. & Knight, R. Nat. Methods 7, 668–669 (2010).

    Article  CAS  Google Scholar 

  4. Quince, C. et al. Nat. Methods 6, 639–641 (2009).

    Article  CAS  Google Scholar 

  5. Caporaso, J.G. et al. Proc. Natl. Acad. Sci. USA 108, 4516–4522 (2011).

    Article  CAS  Google Scholar 

  6. Minoche, A.E. et al. Genome Biol. 12, R112 (2011).

    Article  CAS  Google Scholar 

  7. Caporaso, J.G. et al. Nat. Methods 7, 335–336 (2010).

    Article  CAS  Google Scholar 

  8. Caporaso, J.G. et al. ISME J. 6, 1621–1624 (2012).

    Article  CAS  Google Scholar 

  9. Bokulich, N.A. et al. PLoS ONE 7, e36357 (2012).

    Article  CAS  Google Scholar 

  10. Bokulich, N.A., Bamforth, C.W. & Mills, D.A. PLoS ONE 7, e35507 (2012).

    Article  CAS  Google Scholar 

  11. Lozupone, C. & Knight, R. Appl. Environ. Microbiol. 71, 8228–8235 (2005).

    Article  CAS  Google Scholar 

  12. Edgar, R.C. Bioinformatics 26, 2460–2461 (2010).

    Article  CAS  Google Scholar 

  13. Wang, Q., Garrity, G.M., Tiedje, J.M. & Cole, J.R. Appl. Environ. Microbiol. 73, 5261–5267 (2007).

    Article  CAS  Google Scholar 

  14. DeSantis, T.Z. et al. Appl. Environ. Microbiol. 72, 5069–5072 (2006).

    Article  CAS  Google Scholar 

  15. Caporaso, J.G. et al. Bioinformatics 26, 266–267 (2010).

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We thank G. Giannoukos (Broad Institute of MIT and Harvard), I. Rasolonjatovo (Illumina), M. Gebert (University of Colorado, Boulder) and L. Wegener Parfrey (University of Colorado, Boulder) for contributing mock community sequencing data used in this study, and S. Huse and A. Gonzalez for useful feedback and discussions of this manuscript. This work was supported in part by grants from the US National Institutes of Health (NIH DK78669 to J.I.G., NIH R01HD059127 to D.A.M. and NIH U54HG004969 to D.G.), the Juvenile Diabetes Research Fund (D.G.), the Crohn's and Colitis Foundation of America (J.I.G. and D.G.), and the Howard Hughes Medical Institute. N.A.B. was supported by the 2012–2013 Dannon Probiotics Fellow Program (The Dannon Company) and a Wine Spectator scholarship.

Author information

Authors and Affiliations

Authors

Contributions

N.A.B., J.G.C., D.A.M. and R.K. conceived and designed the experiments; N.A.B. performed the experiments and data analysis. All authors contributed sequencing data sets and wrote the manuscript.

Corresponding author

Correspondence to J Gregory Caporaso.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–16, Supplementary Tables 1–9, Supplementary Note (PDF 21952 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bokulich, N., Subramanian, S., Faith, J. et al. Quality-filtering vastly improves diversity estimates from Illumina amplicon sequencing. Nat Methods 10, 57–59 (2013). https://doi.org/10.1038/nmeth.2276

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nmeth.2276

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing