Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Quality score compression improves genotyping accuracy

Your institute does not have access to this article

Relevant articles

Open Access articles citing this article.

Access options

Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.

Figure 1: Compressive quality scores using the Quartz algorithm.
Figure 2: Scaled ROC curves of genotyping accuracy.


  1. Berger, B., Peng, J. & Singh, M. Nat. Rev. Genet. 14, 333–346 (2013).

    CAS  Article  Google Scholar 

  2. Kahn, S.D. Science 331, 728–729 (2011).

    CAS  Article  Google Scholar 

  3. The 1000 Genomes Project Consortium. Nature 491, 56–65 (2012).

  4. Veeramah, K.R. & Hammer, M.F. Nat. Rev. Genet. 15, 149–162 (2014).

    CAS  Article  Google Scholar 

  5. Shapiro, E., Biezuner, T. & Linnarsson, L. Nat. Rev. Genet. 14, 618–630 (2013).

    CAS  Article  Google Scholar 

  6. Bonfield, J.K. & Mahoney, M.V. PLoS ONE 8, e59190 (2013).

    CAS  Article  Google Scholar 

  7. Apostolico, A. & Lonardi, S. in Proceedings of the IEEE Data Compression Conference 2000 (DCC'00) 143–152 (IEEE Computer Society, 2000).

    Book  Google Scholar 

  8. Kozanitis, C., Saunders, C., Kruglyak, S., Bafna, V. & Varghese, G. J. Comput. Biol. 18, 401–413 (2011).

    CAS  Article  Google Scholar 

  9. Jones, D.C., Ruzzo, W.L., Peng, X. & Katze, M.G. Nucleic Acids Res. 40, e171 (2012).

    CAS  Article  Google Scholar 

  10. Fritz, M.H.Y., Leinonen, R., Cochrane, G. & Birney, E. Genome Res. 21, 734–740 (2011).

    CAS  Article  Google Scholar 

  11. Deorowicz, S. & Grabowski, S. Bioinformatics 27, 860–862 (2011).

    CAS  Article  Google Scholar 

  12. Loh, P.R., Baym, M. & Berger, B. Nat. Biotechnol. 30, 627–630 (2012).

    CAS  Article  Google Scholar 

  13. Ochoa, I. et al. BMC Bioinformatics 14, 187 (2013).

    Article  Google Scholar 

  14. Hach, F., Numanagic, I., Alkan, C. & Sahinalp, S.C. Bioinformatics 28, 3051–3057 (2012).

    CAS  Article  Google Scholar 

  15. Christley, S., Lu, Y., Li, C. & Xie, X. Bioinformatics 25, 274–275 (2009).

    CAS  Article  Google Scholar 

  16. Janin, L., Rosone, G. & Cox, A.J. Bioinformatics 30, 24–30 (2014).

    CAS  Article  Google Scholar 

  17. DePristo, M.A. et al. Nat. Genet. 43, 491–498 (2011).

    CAS  Article  Google Scholar 

  18. Yu, Y.W., Yorukoglu, D. & Berger, B. in Research in Computational Molecular Biology: 18th Annual International Conference, RECOMB 2014—Proceedings (ed. Sharan, R.) 385–399 (Springer, 2014).

    Book  Google Scholar 

  19. Kelley, D.R., Schatz, M.C. & Salzberg, S.L. Genome Biol. 11, R116 (2010).

    CAS  Article  Google Scholar 

  20. Grabherr, M.G. et al. Nat. Biotechnol. 29, 644–652 (2011).

    CAS  Article  Google Scholar 

  21. Cánovas, R., Moffat, A. & Turpin, A. Bioinformatics 30, 2130–2136 (2014).

    Article  Google Scholar 

  22. Li, H. et al. Bioinformatics 25, 2078–2079 (2009).

    Article  Google Scholar 

  23. Li, H. & Durbin, R. Bioinformatics 26, 589–595 (2010).

    Article  Google Scholar 

  24. Langmead, B. & Salzberg, S.L. Nat. Methods 9, 357–359 (2012).

    CAS  Article  Google Scholar 

Download references


We thank L. Cowen and N. Daniels for helpful discussions and comments. Y.W.Y. gratefully acknowledges support from the Fannie and John Hertz Foundation. D.Y., J.P. and B.B. are partially supported by US National Institutes of Health grant GM108348.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Bonnie Berger.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Figures and Text

Supplementary Figures 1–12, Supplementary Tables 1–6 and Supplementary Methods (PDF 889 kb)

Supplementary Code

Source code for the Quartz software described and used in the manuscript (ZIP 23 kb)

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Yu, Y., Yorukoglu, D., Peng, J. et al. Quality score compression improves genotyping accuracy. Nat Biotechnol 33, 240–243 (2015).

Download citation

  • Published:

  • Issue Date:

  • DOI:

Further reading


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing