Abstract
Humans and other primates harbour complex gut bacterial communities that influence health and disease, but the evolutionary histories of these symbioses remain unclear. This is partly due to limited information about the microbiota of ancestral primates. Here, using phylogenetic analyses of metagenome-assembled genomes (MAGs), we show that hundreds of gut bacterial clades diversified in parallel (that is, co-diversified) with primate species over millions of years, but that humans have experienced widespread losses of these ancestral symbionts. Analyses of 9,460 human and non-human primate MAGs, including newly generated MAGs from chimpanzees and bonobos, revealed significant co-diversification within ten gut bacterial phyla, including Firmicutes, Actinobacteriota and Bacteroidota. Strikingly, ~44% of the co-diversifying clades detected in African apes were absent from available metagenomic data from humans and ~54% were absent from industrialized human populations. In contrast, only ~3% of non-co-diversifying clades detected in African apes were absent from humans. Co-diversifying clades present in both humans and chimpanzees displayed consistent genomic signatures of natural selection between the two host species but differed in functional content from co-diversifying clades lost from humans, consistent with selection against certain functions. This study discovers host-species-specific bacterial symbionts that predate hominid diversification, many of which have undergone accelerated extinctions from human populations.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Rent or buy this article
Get just this article for as long as you need it
$39.95
Prices may be subject to local taxes which are calculated during checkout



Data availability
All sequence data generated in this study have been deposited to the National Center for Biotechnology Information Sequence Read Archive under accessions PRJNA842693 (Nanopore data) and PRJNA842587 (Illumina data). All bacterial genome assemblies generated in this study are available at Dryad under accession at https://doi.org/10.5061/dryad.00000006x. Previously published data from humans and non-human primates analysed in this study are available from http://opendata.lifebit.ai/table/?project=SGB and the European Nucleotide Archive (accession PRJEB35610).
Code availability
Code used for co-diversification and selection analyses is available at https://github.com/CUMoellerLab.
References
Bello, M. G., Knight, R., Gilbert, J. A. & Blaser, M. J. Preserving microbial diversity. Science 362, 33–34 (2018).
Sonnenburg, E. D. & Sonnenburg, J. L. The ancestral and industrialised gut microbiota and implications for human health. Nat. Rev. Microbiol. 17, 383–390 (2019).
Groussin, M., Mazel, F. & Alm, E. J. Co-evolution and co-speciation of host-gut bacteria systems. Cell Host Microbe 28, 12–22 (2020).
Davenport, E. R. et al. The human microbiome in evolution. BMC Biol. 15, 127 (2017).
Moeller, A. H. et al. Cospeciation of gut microbiota with hominids. Science 353, 380–382 (2016).
Nishida, A. H. & Ochman, H. Captivity and the co-diversification of great ape microbiomes. Nat. Commun. 12, 5632 (2021).
Garud, N. R., Good, B. H., Hallatschek, O. & Pollard, K. S. Evolutionary dynamics of bacteria in the gut microbiome within and across hosts. PLoS Biol. 17, e3000102 (2019).
Grieneisen, L. et al. Gut microbiome heritability is nearly universal but environmentally contingent. Science 373, 181–186 (2021).
Goodrich, J. K. et al. Human genetics shape the gut microbiome. Cell 4, 789–799 (2014).
Hooper, L. V., Littman, D. R. & Macpherson, A. J. Interactions between the microbiota and the immune system. Science 336, 1268–1273 (2012).
Sonnenburg, J. L. & Bäckhed, F. Diet-microbiota interactions as moderators of human metabolism. Nature 535, 56–64 (2016).
Diaz Heijtz, R. et al. Normal gut microbiota modulates brain development and behavior. Proc. Natl Acad. Sci. USA 108, 3047–3052 (2011).
Youngblut, N. D. et al. Host diet and evolutionary history explain different aspects of gut microbiome diversity among vertebrate clades. Nat. Commun. 10, 2200 (2019).
Kang, D. D. et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7, e7359 (2019).
Wu, Y.-W., Simmons, B. A. & Singer, S. W. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32, 605–607 (2016).
Alneberg, J. et al. Binning metagenomic contigs by coverage and composition. Nat. Methods 11, 1144–1146 (2014).
Sieber, C. M. K. et al. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nat. Microbiol. 3, 836–843 (2018).
Manara, S. et al. Microbial genomes from non-human primate gut metagenomes expand the primate-associated bacterial tree of life with over 1000 novel species. Genome Biol. 20, 299 (2019).
Pasolli, E. et al. Extensive unexplored human microbiome diversity revealed by over 150,000 genomes from metagenomes spanning age, geography, and lifestyle. Cell 176, 649–662.e20 (2019).
Wibowo, M. C. et al. Reconstruction of ancient microbial genomes from the human gut. Nature 594, 234–239 (2021).
Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36, 1925–1927 (2019).
Minh, B. Q. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).
Hommola, K., Smith, J. E., Qiu, Y. & Gilks, W. R. A permutation test of host-parasite cospeciation. Mol. Biol. Evol. 26, 1457–1468 (2009).
Kumar, S., Stecher, G., Suleski, M. & Hedges, S. B. TimeTree: a resource for timelines, timetrees, and divergence times. Mol. Biol. Evol. 34, 1812–1819 (2017).
de Vienne, D. M. et al. Cospeciation vs host-shift speciation: methods for testing, evidence from natural associations and relation to coevolution. New Phytol. 198, 347–385 (2013).
Duchêne, S. et al. Genome-scale rates of evolutionary change in bacteria. Microb. Genom. 2, e000094 (2016).
Menardo, F., Duchêne, S., Brites, D. & Gagneux, S. The molecular clock of Mycobacterium tuberculosis. PLoS Pathog. 15, e1008067 (2019).
Rascovan, N. et al. Emergence and spread of basal lineages of Yersinia pestis during the neolithic decline. Cell 176, 295–305.e10 (2019).
Ochman, H., Elwyn, S. & Moran, N. A. Calibrating bacterial evolution. Proc. Natl Acad. Sci. USA 96, 12638–12643 (1999).
Suzuki, T. A. et al. Codiversification of gut microbiota with humans. Science 377, 1328–1332 (2022).
Moeller, A. H. et al. Rapid changes in the gut microbiome during human evolution. Proc. Natl Acad. Sci. USA 111, 16431–16435 (2014).
Moeller, A. H. The shrinking human gut microbiome. Curr. Opin. Microbiol. 38, 30–35 (2017).
Yatsunenko, T. et al. Human gut microbiome viewed across age and geography. Nature 486, 222–227 (2012).
Blaser, M. J. The theory of disappearing microbiota and the epidemics of chronic diseases. Nat. Rev. Immunol. 17, 461–463 (2017).
Sonnenburg, J. L. & Sonnenburg, E. D. Vulnerability of the industrialised microbiota. Science 366, eaaw9255 (2019).
Pamer, E. G. Resurrecting the intestinal microbiota to combat antibiotic-resistant pathogens. Science 352, 535–538 (2016).
Almeida, A. et al. A unified catalog of 204,938 reference genomes from the human gut microbiome. Nat. Biotechnol. 39, 105–114 (2021).
Olson, N. D. et al. Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes. Brief. Bioinform. 20, 1140–1150 (2019).
Navarre, W. W. et al. PoxA, yjeK, and elongation factor P coordinately modulate virulence and drug resistance in Salmonella enterica. Mol. Cell 39, 209–221 (2010).
Wrangham, R. W. et al. The raw and the stolen: cooking and the ecology of human origins. Curr. Anthropol. 40, 567–594 (1999).
Keele, B. F. et al. Chimpanzee reservoirs of pandemic and nonpandemic HIV-1. Science 313, 523–526 (2006).
Keele, B. F. et al. Increased mortality and AIDS-like immunopathology in wild chimpanzees infected with SIVcpz. Nature 460, 515–519 (2009).
Rudicell, R. S. et al. Impact of simian immunodeficiency virus infection on chimpanzee population dynamics. PLoS Pathog. 6, e1001116 (2010).
Li, Y. et al. Eastern chimpanzees, but not bonobos, represent a simian immunodeficiency virus reservoir. J. Virol. 86, 10776–10791 (2012).
Liu, W. et al. Wild bonobos host geographically restricted malaria parasites including a putative new Laverania species. Nat. Commun. 8, 1635 (2017).
Bibollet-Ruche, F. et al. CD4 receptor diversity in chimpanzees protects against SIV infection. Proc. Natl Acad. Sci. USA 116, 3229–3238 (2019).
Rohland, N. & Reich, D. Cost-effective, high-throughput DNA sequencing libraries for multiplexed target capture. Genome Res. 22, 939–946 (2012).
Quick, J. The ‘Three Peaks’ faecal DNA extraction method for long-read sequencing v2 (protocols.io.7rshm6e). protocols.io, https://doi.org/10.17504/protocols.io.7rshm6e (2019).
Mölder, F. et al. Sustainable data analysis with Snakemake. F1000Research 10, 33 (2021).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10 (2011).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. A. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 27, 824–834 (2017).
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
Brown, C. T. & Irber, L. sourmash: a library for MinHash sketching of DNA. J. Open Source Softw. 1, 27 (2016).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Kolmogorov, M. et al. metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat. Methods 17, 1103–1110 (2020).
Vaser, R., Sović, I., Nagarajan, N. & Šikić, M. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res. 27, 737–746 (2017).
Clayton, J. B. et al. Captivity humanizes the primate microbiome. Proc. Natl Acad. Sci. USA 113, 10376–10381 (2016).
Houtz, J. L., Sanders, J. G., Denice, A. & Moeller, A. H. Predictable and host-species specific humanization of the gut microbiota in captive primates. Mol. Ecol. 30, 3677–3687 (2021).
Eren, A. M. et al. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ 3, e1319 (2015).
Revell, L. J. phytools: an R package for phylogenetic comparative biology (and other things). Methods Ecol. Evol. 3, 217–223 (2012).
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Ranwez, V., Douzery, E. J. P., Cambon, C., Chantret, N. & Delsuc, F. MACSE v2: toolkit for the alignment of coding sequences accounting for frameshifts and stop codons. Mol. Biol. Evol. 35, 2582–2584 (2018).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Rice, P., Longden, I. & Bleasby, A. EMBOSS: the European molecular biology open software suite. Trends Genet. 16, 276–277 (2000).
Harris, C. D., Torrance, E. L., Raymann, K. & Bobay, L.-M. CoreCruncher: fast and robust construction of core genomes in large prokaryotic data sets. Mol. Biol. Evol. 38, 727–734 (2021).
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Macías, L. G., Barrio, E. & Toft, C. GWideCodeML: a Python package for testing evolutionary hypotheses at the genome-wide level. G3 10, 4369–4372 (2020).
Acknowledgements
We thank W. Yan for assistance with DNA extractions from chimpanzee faecal samples and H. Ochman for comments on the manuscript. Primate cartoons were created with BioRender.com. Funding was provided by National Institutes of Health grant R35 GM138284 (A.H.M.) and grant R01 AI050529 (B.H.H.).
Author information
Authors and Affiliations
Contributions
A.H.M. and J.G.S. designed the study, performed analyses and wrote the manuscript. D.D.S. performed analyses and edited the manuscript. B.H.H., M.P., Y.L., D.B.M., C.M.S., J.A.H., A.V.G., J.-B.N.N., E.V.L. and D.M. provided samples and edited the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Microbiology thanks Ruth Ley, Jonathan Clayton and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Map of sampling locations.
a, Map shows sampling locations for human, great ape, new world monkey, old world monkey, and lemur fecal samples. Circles correspond to individual populations sampled as indicated by the key. b, Map of equatorial Africa shows sampling locations for Pan fecal samples sequenced for this study. Two-letter codes correspond to those associated with host IDs in Supplementary Table 1. DP = Doumo Pierre; IK = Ikela; GT = Goualougo Triangle; TL2 = Tshuapa-Lomami-Lualaba; GM = Gombe; LK = Lui-kotal; KR = Kokolopori.
Extended Data Fig. 2 Histogram of number of significant nodes detected after permuting host labels.
X axis indicates number of significant nodes (Mantel test p < 0.01, r > 0.75) recovered in the co-diversification scan after permuting labels of host tree but retaining symbiont tree labels and all other structure in the dataset. Results of 100 random permutations are shown. Value for unpermuted dataset is shown as a vertical red line.
Extended Data Fig. 3 Number of significant nodes detected after removing MAGs from individual host species.
X axis indicates the host species whose MAGs were removed from that dataset before performing sensitivity analyses in which scans for co-diversification were performed after removing individual host species. The number of co-diversifying clades (Mantel test p < 0.01, r > 0.75) detected in each scan are shown. Value for the full dataset is shown as a horizontal dashed line.
Extended Data Fig. 4 Depths of co-diversifying bacterial clades corroborate known ages of host clades.
a, Scatter plot and regression line show the positive association between the depths of co-diversifying bacterial clades based on protein divergence of bac120 single-copy core genes and the known ages of their corresponding host clades based on timetree.org (df = 204; t = 6.03; unadjusted p-value = 7.36e-09). Each point corresponds to a co-diversifying bacterial clade. b–d, Scatter plots show relationships for Firmicutes (df = 72; t = 8.58; unadjusted p-value = 9.16e-11) (b), Actinobacteriota (df = 11; t = 2.25; unadjusted p-value = 0.046) (c), and Bacteroidota (df = 91; t = 4.38, unadjusted p-value = 3.17e-05) (d). Colours denote bacterial phyla as in Fig. 1b. In a–d, bands represent 99% confidence intervals, centre lines indicate best-fit regression, and p-values represent results of two-sided Student’s t-tests.
Extended Data Fig. 5 Depths of non-co-diversifying bacterial clades and ages of host clades.
Scatter plot and regression line show the association between the depths of strongly non-co-diversifying bacterial clades (r < 0) based on protein divergence of bac120 single-copy core genes and the known ages of their corresponding host clades based on timetree.org (df = 53; t = 0.86; unadjusted p-value = 0.393). Each point corresponds to a bacterial clade. The non-codiversifying clades were derived from host species spanning the same epochs as in Extended Data Fig. 3. Bands represent 99% confidence intervals, centre line represents best-fit regression, and p-value represents result of two-sided Student’s t-tests. In contrast to results displayed in Extended Data Fig. 3 based on co-diversifying clade depths, non-co-diversifying clade depths were not significantly positively associated with known ages of the corresponding host clades.
Extended Data Fig. 6 Rates of genomic evolution vary among bacterial phyla.
Scatter plot and regression lines show the positive relationships within co-diversifying bacterial clades between DNA substitutions per site of bacterial lineages and divergence time of host species from which the lineages were recovered. Each point represents a comparison between two co-diversifying bacterial lineages. Points and lines are coloured based on bacterial phyla as indicated by the key and corresponding to Fig. 1. Bands represent 95% confidence intervals and centre lines represent best-fit regression.
Extended Data Fig. 7 COG pathways enriched in Pan MAGs from co-diversifying clades missing from humans.
Bar plots show the enrichment scores of COG pathways identified as significantly overrepresented in Pan MAGs from co-diversifying clades missing from humans relative to Pan MAGs from co-diversifying clades present in humans. Enrichment scores were calculated as the Rao test statistic for equality of proportions as implemented in Anvi’o anvi-compute-functional-enrichment. Only the top 20 COG pathways are shown in the figure. For a full list see Supplementary Table 5.
Extended Data Fig. 8 Genomic signatures of selection in human and chimpanzee gut bacteria.
Scatter plot shows the relationship between per-CORF dN/dS values in humans and Pan. Points correspond to individual CORFs from co-diversifying bacterial lineages detected in human and Pan. Dashed vertical and horizontal lines correspond to the dN/dS expectation under neutral evolution, and dashed diagonal line corresponds to a 1-to-1 relationship between dN/dS values in humans and Pan. Points are coloured based on bacterial phyla as in Fig. 1 and as indicated in the key.
Supplementary information
Supplementary Information
Supplementary Discussion, References and Tables 1–7 captions.
Supplementary Tables
Supplementary Tables 1–7.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sanders, J.G., Sprockett, D.D., Li, Y. et al. Widespread extinctions of co-diversified primate gut bacterial symbionts from humans. Nat Microbiol (2023). https://doi.org/10.1038/s41564-023-01388-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41564-023-01388-w