Abstract
We introduce Sailfish, a computational method for quantifying the abundance of previously annotated RNA isoforms from RNA-seq data. Because Sailfish entirely avoids mapping reads, a time-consuming step in all current methods, it provides quantification estimates much faster than do existing approaches (typically 20 times faster) without loss of accuracy. By facilitating frequent reanalysis of data and reducing the need to optimize parameters, Sailfish exemplifies the potential of lightweight algorithms for efficiently processing sequencing reads.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout
Accession codes
References
Soneson, C. & Delorenzi, M. BMC Bioinformatics 14, 91 (2013).
Roychowdhury, S. et al. Sci. Trans. Med. 111ra121 (2011).
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S.L. Genome Biol. 10, R25 (2009).
Mortazavi, A., Williams, B.A., McCue, K., Schaeffer, L. & Wold, B. Nat. Methods 5, 621–628 (2008).
Trapnell, C. et al. Nat. Biotechnol. 28, 511–515 (2010).
Li, B. & Dewey, C. BMC Bioinformatics 12, 323 (2011).
Roberts, A. & Pachter, L. Nat. Methods 10, 71–73 (2012).
Philippe, N., Salson, M., Commes, T. & Rivals, E. Genome Biol. 14, R30 (2013).
Botelho, F.C., Pagh, R. & Ziviani, N. Proceedings of the 10th International Workshop on Algorithms and Data Structures Halifax, NS, Canada, August 15–17, 2007 (eds. Dehne, F., Sack, J.-R. & Zeh, N.)139–150 (Springer, 2007).
Marçais, G. & Kingsford, C. Bioinformatics 27, 764–770 (2011).
Varadhan, R. & Roland, C. Scand. J. Stat. 35, 335–353 (2008).
Nicolae, M., Mangul, S., Mandoiu, I. & Zelikovsky, A. Algorithms Mol. Biol. 6, 9 (2011).
Salzman, J., Jiang, H. & Wong, W.H. Stat. Sci. 26, 62–83 (2011).
Zheng, W., Chung, L.M. & Zhao, H. BMC Bioinformatics 12, 290 (2011).
Shi, L. et al. Nat. Biotechnol. 24, 1151–1161 (2006).
Bullard, J.H., Purdom, E., Hansen, K.D. & Dudoit, S. BMC Bioinformatics 11, 94 (2010).
Griebel, T. et al. Nucleic Acids Res. 40, 10073–10083 (2012).
Grabherr, M.G. et al. Nat. Biotechnol. 29, 644–652 (2011).
Sacomoto, G.A. et al. BMC Bioinformatics 13 (suppl. 6), S5 (2012).
Pruitt, K.D., Tatusova, T., Brown, G.R. & Maglott, D.R. Nucleic Acids Res. 40, D1, D130–D135 (2012).
Flicek, P. et al. Nucleic Acids Res. 41, D1, D48–D55 (2013).
Trapnell, C., Pachter, L. & Salzberg, S. Bioinformatics 25, 1105–1111 (2009).
Pheatt, C. J. Comput. Sci. Coll. 23, 298–298 (2008).
Acknowledgements
This work has been partially funded by the US National Science Foundation (CCF-1256087, CCF-1053918, and EF-0849899) and US National Institutes of Health (R21AI085376, R21HG006913 and R01HG007104). C.K. received support as an Alfred P. Sloan Research Fellow. We would like to thank A. Roberts for helping to diagnose and resolve an artifact in an earlier version of this manuscript pertaining to the synthetic data generated by the Flux Simulator.
Author information
Authors and Affiliations
Contributions
R.P., S.M.M. and C.K. designed the method and algorithms, devised the experiments, and wrote the manuscript. R.P. implemented the Sailfish software.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–7, Supplementary Table 1 and Supplementary Notes 1–3 (PDF 1337 kb)
Supplementary Data
Version 0.6.3 of the Sailfish source code (ZIP 666 kb)
Source data
Rights and permissions
About this article
Cite this article
Patro, R., Mount, S. & Kingsford, C. Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms. Nat Biotechnol 32, 462–464 (2014). https://doi.org/10.1038/nbt.2862
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nbt.2862
This article is cited by
-
Identification of acidic stress-responsive genes and acid tolerance engineering in Synechococcus elongatus PCC 7942
Applied Microbiology and Biotechnology (2024)
-
DIVE: a reference-free statistical approach to diversity-generating and mobile genetic element discovery
Genome Biology (2023)
-
Sorting and packaging of RNA into extracellular vesicles shape intracellular transcript levels
BMC Biology (2022)
-
Bulk RNA sequencing analysis of developing human induced pluripotent cell-derived retinal organoids
Scientific Data (2022)
-
Foliar application of clay-delivered RNA interference for whitefly control
Nature Plants (2022)