Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Research Briefing
  • Published:

Pebblescout is an easy-to-use tool for fast sequence search in petabase-scale nucleotide resources

Pebblescout navigates vast, rapidly growing nucleotide content in resources by providing indexing and search capabilities. We used Pebblescout to index a metagenomic subset of Sequence Read Archive and seven other resources into databases spanning over 3.7 petabases and searchable interactively at a pilot website using queries as short as 42 bases.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Pebblescout indexing.

References

  1. Zheludev, I. N. et. al. Viroid-like colonists of human microbiomes. Preprint at bioRxiv https://doi.org/10.1101/2024.01.20.576352 (2024). A preprint that presents a previously unrecognized class of viroid-like elements.

  2. Mario-Vasquez, J. E. et al. Finding Candida auris in public metagenomic repositories. PLoS One https://doi.org/10.1371/journal.pone.0291406 (2024). This paper reports on monitoring systems for Candida auris using publicly available metagenomic data in the SRA.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Joakim Larsson, D. G. & Flach, C.-F. Antibiotic resistance in the environment. Nat. Rev. Microbiol. 20, 257–269 (2022). A review article that presents the global health challenge of antibiotic resistance; it includes a discussion on methods for surveillance and an assessment of potential drivers.

    Article  Google Scholar 

  4. Karasikov, M. et al. Lossless indexing with counting de Bruijn graphs. Genome Res. 32, 1754–1764 (2022). An article that presents counting de Bruijn graphs with positional annotations and the Virus PacBio HiFi dataset.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Pierce, N. T. et al. Large-scale sequence comparisons with Sourmash. F1000Research 8, 1006 (2019). An article that presents the k-mer analysis tool Sourmash.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This is a summary of: Shiryev, S. A. & Agarwala, R. et al. Indexing and searching petabase-scale nucleotide resources. Nat. Methods https://doi.org/10.1038/s41592-024-02280-z (2024).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pebblescout is an easy-to-use tool for fast sequence search in petabase-scale nucleotide resources. Nat Methods (2024). https://doi.org/10.1038/s41592-024-02281-y

Download citation

  • Published:

  • DOI: https://doi.org/10.1038/s41592-024-02281-y

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing