Pebblescout provides indexing and search capabilities for navigating the vast, rapidly growing nucleotide content of sequence resources. We used Pebblescout to index a metagenomic subset of the Sequence Read Archive and seven other resources into databases that span over 3.7 petabases and can be searched interactively at a pilot website using queries as short as 42 bases.
Additional information
This is a summary of: Shiryev, S. A. & Agarwala, R. et al. Indexing and searching petabase-scale nucleotide resources. Nat. Methods https://doi.org/10.1038/s41592-024-02280-z (2024).
About this article
Cite this article
Pebblescout is an easy-to-use tool for fast sequence search in petabase-scale nucleotide resources. Nat Methods (2024). https://doi.org/10.1038/s41592-024-02281-y
DOI: https://doi.org/10.1038/s41592-024-02281-y