Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Matters Arising
  • Published:

Microbial dark matter could add uncertainties to metagenomic trait estimations

The Original Article was published on 05 October 2023

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Evidence of varying proportions of non-bacterial DNA in soil metagenomes from different ecosystems.
Fig. 2: Evidence that varying proportions of non-bacterial DNA influences estimates of bacterial community life history and relationships with environmental variables.

References

  1. Chuckran, P. F. et al. Edaphic controls on genome size and GC content of bacteria in soil microbial communities. Soil Biol. Biochem. 178, 108935 (2023).

    Article  CAS  Google Scholar 

  2. Chen, Y., Neilson, J. W., Kushwaha, P., Maier, R. M. & Barberán, A. Life-history strategies of soil microbial communities in an arid ecosystem. ISME J 15, 649–657 (2021).

    Article  CAS  PubMed  Google Scholar 

  3. Fierer, N., Barberán, A. & Laughlin, D. C. Seeing the forest for the genes: using metagenomics to infer the aggregated traits of microbial communities. Front. Microbiol. 5, 614 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  4. Wang, C. et al. Bacterial genome size and gene functional diversity negatively correlate with taxonomic diversity along a pH gradient. Nat. Commun. 14, 7437 (2023).

    Article  PubMed  PubMed Central  Google Scholar 

  5. Piton, G. et al. Life history strategies of soil bacterial communities across global terrestrial biomes. Nat. Microbiol., https://doi.org/10.1038/s41564-023-01465-0 (2023).

    Article  PubMed  Google Scholar 

  6. Wood, D. E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol. 20, 257 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Rinke, C. et al. Insights into the phylogeny and coding potential of microbial dark matter. Nature 499, 431–437 (2013).

    Article  CAS  PubMed  Google Scholar 

  8. Strickland, M. S. & Rousk, J. Considering fungal:bacterial dominance in soils – methods, controls, and ecosystem implications. Soil Biol. Biochem. 42, 1385–1395 (2010).

    Article  CAS  Google Scholar 

  9. Levy Karin, E., Mirdita, M. & Söding, J. MetaEuk—sensitive, high-throughput gene discovery, and annotation for large-scale eukaryotic metagenomics. Microbiome 8, 48 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Pronk, L. J. U. & Medema, M. H. Whokaryote: distinguishing eukaryotic and prokaryotic contigs in metagenomes based on gene structure. Microb. Genom. 8, mgen000823 (2022).

    PubMed  PubMed Central  Google Scholar 

  11. Lind, A. L. & Pollard, K. S. Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing. Microbiome 9, 58 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Rodríguez-Gijón, A. et al. A genomic perspective across earth’s microbiomes reveals that genome size in archaea and bacteria is linked to ecosystem type and trophic strategy. Front. Microbiol. 12, 761869 (2022).

    Article  PubMed  PubMed Central  Google Scholar 

  13. Nayfach, S. & Pollard, K. S. Average genome size estimation improves comparative metagenomics and sheds light on the functional ecology of the human microbiome. Genome Biol. 16, 51 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  14. Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Contributions

E.D.O and S.G.M. conceived the study. E.D.O conducted the bioinformatic and statistical analyses. M.S.S. supervised the project. All authors contributed to the writing and editing of the manuscript.

Corresponding author

Correspondence to Ernest D. Osburn.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Microbiology thanks Kate Buckeridge and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1

Percent of metagenomic reads not classified to any taxon (a), percent of reads classified as bacteria between forested and non-forested ecosystems (b), and percentage of genome equivalents recovered in the ‘bacterial’ metagenomes that were found in the full metagenomes (c). Classification was done using kraken2 with RefSeq genomes for bacteria, archaea, viruses, fungi, and protists as reference databases. Genomes within the metagenomes were quantified by determining mean coverage of 30 single-copy genes using MicrobeCensus. The R2 value and line of best fit in (a) and (c) are from linear regression. Asterisks in (b) indicate significantly higher percentage of reads classified as bacterial in non-forested environments (p < 0.001).

Extended Data Fig. 2

Average genome size in the full metagenomes as a function of the percentage of reads classified as bacteria (a) and the ratio of eukaryotic to bacterial reads (b). R2 values and lines of best fit are from linear regression (both p < 0.001). Average genome sizes were determined using MicrobeCensus.

Extended Data Fig. 3

Metagenome GC content in the full (a) and bacterial (b) metagenomes and average 16S rRNA gene copy number in the full (c) and bacterial (d) metagenomes as a function of soil pH. R2 values and best-fit lines on (a) and (b) are from linear regression (both p < 0.001). Regression models for 16S gene copy number in (c) and (d) were not significant (both p > 0.05).

Supplementary information

Supplementary Material

Supplementary methods, bioinformatic analyses and statistical analyses.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Osburn, E.D., McBride, S.G. & Strickland, M.S. Microbial dark matter could add uncertainties to metagenomic trait estimations. Nat Microbiol (2024). https://doi.org/10.1038/s41564-024-01687-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41564-024-01687-w

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing