Abstract
Many gut microorganisms critical to human health rely on nutrients produced by each other for survival; however, these cross-feeding interactions are still challenging to quantify and remain poorly characterized. Here, we introduce a Metabolite Exchange Score (MES) to quantify those interactions. Using metabolic models of prokaryotic metagenome-assembled genomes from over 1600 individuals, MES allows us to identify and rank metabolic interactions that are significantly affected by a loss of cross-feeding partners in 10 out of 11 diseases. When applied to a Crohn’s disease case-control study, our approach identifies a lack of species with the ability to consume hydrogen sulfide as the main distinguishing microbiome feature of disease. We propose that our conceptual framework will help prioritize in-depth analyses, experiments and clinical targets, and that targeting the restoration of microbial cross-feeding interactions is a promising mechanism-informed strategy to reconstruct a healthy gut ecosystem.
Similar content being viewed by others
Introduction
The human gut contains hundreds of microbial species forming a complex and interdependent metabolic network. Over half of the metabolites consumed by gut microbes are by-products of microbial metabolism1 with the waste of one species serving as nutrients for others2,3,4. Species interdependence can render microorganisms vulnerable to local extinction if a partner is lost5 unless alternative species are available to fill that niche. In this context, having functionally redundant species with the ability to produce or consume the same nutrients is beneficial for the host. While it is generally accepted that high functional redundancy is a characteristic of resilient human gut microbiomes6,7,8, the human health impacts of redundancy in metabolic interactions remain largely uncharacterized. Restoring the diversity of cross-feeding microbial partners represents a logical but still largely unexplored rubric to fight a wide range of diseases linked with an unbalanced gut microbiome.
Mechanistic models that simulate microbial metabolism in silico hold the promise to fill our knowledge gap on microbial metabolic interactions4,9. Genome-scale metabolic models (GEMs) are based on increasingly comprehensive databases linking genes to biochemical and physiological processes10,11. These models have been used to estimate metabolic exchanges between pairs of bacterial species for over a decade12,13. Developments in automating the reconstruction of GEMs14 and the availability of manually-curated GEMs for thousands of gut microorganisms15,16 have paved the way to build metabolic models for complex microbial communities. Methodological advances now allow modelling interactions between multiple species17,18, and a recently developed workflow by Zorrilla and colleagues19 now allows reconstructing metabolic models directly from large-scale metagenome datasets. Studies using community-wide metabolic models have found dozens to hundreds of significantly different metabolic exchanges in the gut microbiome associated with type 2 diabetes19 and in inflammatory bowel disease20 when compared to healthy controls. A method to rank these metabolic interactions according to an ecology-based framework provides the opportunity to generate targeted hypotheses underlying mechanistic links between the gut microbiome and diseases.
Here, we introduce a metabolite exchange scoring system derived from metagenome-scale metabolic models, designed to identify the potential microbial cross-feeding interactions most affected in disease. We apply our conceptual framework to an integrated dataset of 1661 publicly available stool metagenomes, encompassing 15 countries and 11 disease phenotypes. Our framework identified both known and novel microbiome-disease associations, including a link between colorectal cancer and the microbial metabolism of ethanol, a connection between rheumatoid arthritis with microbially-derived ribosyl nicotinamide, and links between Crohn’s disease and specific bacteria that metabolise hydrogen sulfide. The scoring system can help quantify and identify context-dependent disruptions of microbial interactions, which may be targets for microbiome-based medicines.
Results
Potential cross-feeding interactions quantification
To understand the link between cross-feeding interactions and disease, we designed the Metabolite Exchange Score (MES). MES is the product of the diversity of taxa predicted to consume and taxa predicted to produce a given metabolite, normalized by the total number of involved taxa (Fig. 1a and methods). The potential production, consumption and exchange of metabolites by each microbiome member for which MAGs can be reconstructed is estimated through metabolic modelling. As with a centrality measure of a network that defines their most connected nodes, metabolites with high MESs are likely to be key components in the microbial food chain. At the other extreme, metabolites where MES is zero are not produced or not consumed by any member of the community. By comparing MESs for each metabolite across healthy and diseased microbiomes, one can rank and identify the metabolites most affected by the loss of cross-feeding partners (Fig. 1b). Once metabolites have been prioritized with MESs, it is then possible to integrate taxa abundances and their estimated metabolic fluxes to retrieve a consortium of species that act as the main producers or consumers of the targeted metabolites. We propose this approach as a hypothesis generation strategy to guide new discoveries, targeted experiments and clinical trials.
Meta-analysis of 1661 microbiomes reveals key metabolic interactions among gut microorganisms in health and disease
To obtain an overview of the association between cross-feeding interactions and different diseases, we performed a large-scale analysis of 1661 high-quality and deeply sequenced gut metagenome samples, including 871 healthy and 790 diseased individuals from 33 published studies, 15 countries and 11 disease phenotypes (Supplementary Data 1). Integrating studies and countries enabled the assembly of Metagenome-Assembled Genomes (MAGs) for a diverse range of gut microbes and allowed characterization of the baseline MESs in the healthy population. Our healthy cohort was composed of both males and females with a Body Mass Index (BMI) between 18.5 and 24.9 and no reported disease. Samples for which this information was unclear (e.g., disease controls where health status or BMI was not reported) are not included in our dataset (see Methods for details). Within-sample sequence assembly21, metagenome co-binning22 and quality control23 resulted in 55,345 bins, including 24,369 high-quality MAGs with >90% completeness and <0.05% contamination. We selected one representative MAG per species, defined at 95% Average Nucleotide Identity (ANI), resulting in 949 bacterial and 6 archaeal species, encompassing all dominant microbial phyla found in the gut (Fig. 2a, Supplementary Data 2). The presence and abundance of these species were determined by mapping sequence reads against the 955 MAGs. Forty bacterial and one archaeal species were exclusively found in diseased individuals (Supplementary Data 3a), while healthy individuals harboured 59 bacterial and one archaeal species that were not observed in any diseased individual (Supplementary Data 3b). Identifying species in metagenome samples remains a challenge, and it is likely that our MAG-based approach misses rare components of the gut microbiome despite the large dataset used here for co-binning. To infer metabolic exchanges between microbes, we reconstructed Genome-Scale Models (GEMs)14 for the 955 MAGs, built community-scale metabolic models for each individual based on the species-level abundances using MICOM18, and calculated MES using custom scripts24. Our modelled communities contained an average of 138 species (min = 34, max = 236 species).
We first sought to identify the metabolic exchanges with the highest diversity of cross-feeding partners in healthy microbiomes by analysing the MESs of each metabolite of the entire healthy group. Metabolites showed a wide variation of MESs between individuals (Fig. 2b, Fig. S1). Metabolites with the highest mean MES included nucleobases such as uracil (MES mean and sd = 60.5 ± 17.6) and thymine (41.8 ± 21.8), essential nutrients such as phosphate (59.9 ± 17.0) and iron (40.3 ± 36.9), and sugars such as glucose (52.6 ± 22.1) and galactose (52.3 ± 21.3).
To identify the metabolites most affected by the loss of cross-feeding partners during disease, we compared MESs between the healthy group and the eleven disease phenotypes. This analysis identified significant loss of cross-feeding partners for specific metabolites in all disease groups except for schizophrenia (Fig. 2c, Fig. S2). Metabolites with high MESs in healthy individuals and known to be important for human health, such as vitamin B1 (thiamin)25 and precursors of short-chain fatty acids (e.g., malate, glucose, galactose)26, were significantly affected in multiple disease phenotypes (Kruskal–Wallis’ p < 0.05/number of tests to correct for multiple comparisons). Thiamin was the metabolite with the highest difference in MESs between healthy and diseased microbiomes in cirrhosis and ankylosing spondylitis, ranking second in Inflammatory Bowel Disease (IBD) (Fig. 2c). Associations between deficiency of thiamine with cirrhosis and IBD have been previously reported27,28,29, but to our knowledge, this is the first indication of a possible microbial-mediation of this phenotype. Likewise, this is the first indication of a link between microbially-derived ribosyl nicotinamide and rheumatoid arthritis (Fig. 2c). The results also confirmed previously reported microbially-mediated disease-metabolite associations, such as ethanol in colorectal cancer30 and hydrogen sulfide in IBD31,32, reinforcing the potential of our novel approach to identify reasonable relationships.
We next compared our results with the study of Zorrilla and colleagues19, who used SMETANA17 to quantify microbial metabolic exchanges in the gut and link those with glucose intolerance and type 2 diabetes (T2D). Their study identified significantly different exchanges for 22 metabolites, including for hydrogen sulfide (H2S) and D-galactose, which were also identified in our analyses as having significantly higher MESs in T2D-associated microbiomes when compared to healthy microbiomes (Supplementary Data 4). There was also some concordance between our results regarding the metabolites identified as being most frequently exchanged between gut bacteria, with three out of the six metabolites highlighted in Zorrilla et al. (Fig. 3a in ref. 19), being among the top 15 metabolites with the highest MESs in healthy microbiomes (L-malate, H2S and acetaldehyde).
Species diversity has distinct relationships with producers and consumers of exchanged metabolites
Diversity of microbial species within the gut community is commonly considered a marker of health status. Microbiomes associated with five diseases showed significant and consistent reduction in alpha diversity across indices (Shannon index and species richness), while microbiomes from individuals with type 2 diabetes had a significantly higher alpha diversity when compared with the healthy group (Fig. S3). Diseases associated with low species diversity (e.g., Inflammatory Bowel Disease) showed the highest magnitude in MES differences (Fig. 2c), which is expected given that the number of microbial species exchanging metabolites naturally correlates with the number of species in the community.
To further understand the relationship between diversity and metabolite exchange, we tested the null hypothesis that producers and consumers are equally affected by species diversity. Specifically, we correlated the number of producer or consumer species of each metabolite with species richness to determine statistical differences between the slopes of these correlations for metabolite production and consumption. The null hypothesis (no statistical difference between slopes) implies that the number of producer species and consumer species increases at the same rate as species richness increases. Such results would imply that cross-feeding interactions dependent only on the number of species present in the community. This null hypothesis was rejected for 79% of metabolites exchanged by the gut microbiome (Fig. 3a, Supplementary Data 5), with the slope of the correlation being significantly steeper either for consumers (55% of metabolites) or producers (24% of metabolites). From the metabolites with the highest MESs, only producers and consumers of glycerol showed no significant difference in response to species richness (Fig. 3b–p).
Microbial food web restoration as a potential therapeutic strategy for Crohn’s disease
To investigate how the application of MES and our modelling framework may guide the identification of promising therapeutic targets, we focused on Crohn’s disease (CD), a form of IBD. We selected a single case-control study33 with the largest number of samples from healthy and diseased individuals within our quality-controlled dataset to minimize batch effects. In accordance with the global analyses, we found that H2S – a gas previously implicated in CD and IBD symptoms31,32,34—was the metabolite most affected by the loss of cross-feeding microbial partners (twofold reduction, Supplementary Data 6). While H2S production by the gut microbiome has been the subject of several studies (e.g., refs. 35,36), the consumption of this gas is less characterized, and our modelling results indicate that H2S consumed by bacteria can be incorporated into sulfur-containing amino acids such as cysteine (Fig. S4).
Focusing on H2S, we found that the microbiome of healthy individuals contained more species with the potential to produce H2S, as well as more species with the potential to consume H2S, than the microbiomes associated with CD (Fig. 4a). Interestingly, the diversity of potential H2S consumers was more affected in CD patients (56% less diverse on average, Supplementary Data 7) than the diversity of H2S producers (32% less diverse), resulting in a significantly higher H2S producer to consumer ratio in individuals affected by CD (Fig. 4c). We observed similar results when investigating the flux of H2S among microorganisms. The total estimated ability of the microbiome to consume H2S in the disease state was reduced by 74%, while the total production was not significantly affected, resulting in a higher H2S production to consumption ratio in CD (Fig. 4b, d, Supplementary Data 7). The excess of H2S (i.e., H2S predicted to be exported to medium) was not significantly different between healthy and diseased subjects (Kruskal–Wallis χ2(1) = 0.0356, p = 0.8503). The indication that H2S consumers are more affected than H2S producers in CD stands after correcting for the confounding effects of species diversity, although no significant difference was observed for the flux of H2S exchanged among microorganisms (Supplementary Data 8).
To better understand the genetic basis of the metabolic modelling results, we investigated the distribution of 46 genes known to be involved in H2S cycling36 in the MAGs present in the CD case-control study. We found between one and 23 genes in each MAG (Supplementary Data 9). Five genes involved in H2S cycling were significantly more prevalent in microbiomes associated with healthy individuals (Supplementary Data 10): cysK, dcm, Fuso_cyst, metH and metK (linear model, using species diversity as confounder variable and a two-way t-test to assess significance, p < 0.0012 accounting for multiple comparisons). Another five genes were more prevalent in CD-associated microbiomes: asrA, asrB, asrC, dmsA and dsrC (p < 0.0012), the first four genes also being significantly enriched when accounting for species abundance (Supplementary Data 10).
To identify the key species associated with H2S imbalance in CD, we compared the contribution of each species to the total H2S production or consumption in the healthy and CD cohorts. For each species, H2S flux (weighted by relative abundances) was estimated and the difference of total H2S weighted flux in healthy and CD individuals calculated. The species showing the highest increase towards H2S production in CD patients included members of the classes Clostridia, Bacteroidia and Bacilli (Fig. 4e, Supplementary Data 11). Enterocloster clostridioformis (Clostridia) and Enterococcus_B faecium (Bacilli) were only observed in the CD cohort. Many species (45% of the MAGs from the case-control study) showed an ability to both produce and consume H2S according to the models, and their role was dependent on their community context. Phocaeicola dorei (Bacteroidia) was the species showing the highest difference in predicted H2S production between healthy and CD individuals despite being common in both cohorts. We found multiple genes related to H2S metabolism in this species (cysK, bsh, dcm, Fuso cyst, luxS, metK, sufS, and two copies of the malY and metH genes). Members of the Clostridia class were the H2S consumers showing the highest reduction in H2S consumption in CD microbiomes, including Roseburia intestinalis, Blautia_A obeum, and two Faecalibacterium species (F. prausnitzii_J and F. sp900758465) (Fig. 4e, Supplementary Data 11). The top 5 consumer species had between two and four copies of the cysteine desulfurase (iscS) gene, in addition to a range of other genes involved in H2S metabolism (Supplementary Data 9 and 11).
We next compared the results obtained from our metabolic modelling approach with traditional compositional microbiome analyses. Community beta-diversity was visualized using principal component analysis, showing that microbiomes associated with CD formed a distinct cluster (Fig. S5a). To identify the species that contributed most to these differences we used a random forest (RF) classifier (70% of data used for training, 30% for testing). The out-of-bag error rate of the training dataset was 9.52%, and the accuracy on the test dataset was 100%. The species contributing most to the differences between healthy and CD-associated microbiomes were identified through their importance scores (Fig. S5b). Some of the species identified with the RF analysis were also identified with our metabolic modelling approach, including the H2S consumers Roseburia intestinalis, Escherichia coli and Anaerostipes hadrus, and the H2S producer Clostridium_Q symbiosum. Sixteen out of the 20 species identified by our modelling approach as contributing most to the H2S production to consumption ratio unbalance in CD (Fig. 4e) were not among the top 30 species selected with this compositional-based analysis.
Discussion
In this work, we introduce a new MES-based conceptual framework and apply it to an integrated dataset of metabolic models for 955 gut species from 1661 publicly available stool metagenomes, encompassing 15 countries and 11 disease phenotypes. This approach revealed a significant depletion of potential cross-feeding interactions in the microbiomes associated with 10 diseases and identified promising therapeutic targets in a case-control Crohn’s disease study.
We show that our analytical framework identifies both known and novel microbiome-disease associations, providing a cost-efficient and mechanistically grounded strategy to prioritize experiments and guide clinical trials. One example is the link between rheumatoid arthritis and ribosyl nicotinamide (also known as nicotinamide riboside or NR). This metabolite is one of the main precursors of nicotinamide adenine dinucleotide (NAD+), which has been reported to be significantly reduced in individuals with rheumatoid arthritis37. Administration of NR and other NAD+ precursors leads to improved clinical outcomes for rheumatoid arthritis patients37 and for a range of other inflammatory, neurodegenerative and cardiovascular diseases38. To our knowledge, this is the first reported evidence for a role of microbial NR metabolism in rheumatoid arthritis. We also identified ethanol as the metabolite most affected by loss of cross-feeding in individuals with Colorectal Cancer (CRC). Moderate to heavy alcohol consumption is associated with a 1.17 – 1.44 higher risk of developing CRC39 via a process that is at least partially mediated by the microbiome, as gut bacteria metabolise ethanol to produce the carcinogenic acetaldehyde40. The capacity to identify these and other coherent metabolite-disease links using exclusively metagenome data is further evidence for the validity and utility of our approach. Some associations observed in our study such as links between Roseburia intestinalis and CD could be retrieved using analyses based solely on the composition of the microbiome, but most associations could not (e.g., Phocaeicola dorei), with the modelling framework yielding additional insights on the metabolic and ecological processes underlying these associations. We also observed a complementarity between our MES approach and previously proposed methods based on SMETANA scores. Metabolites identified as markers of T2D progression19 were among the metabolites with highest MESs in the healthy population, supporting the idea that the exchange of these metabolites is an important feature of healthy microbiomes.
The reliance of microbes on cross-feeding is expected to be influenced by the availability of metabolites in the gut environment. Several metabolites with significant MES difference in health and disease are found in food (e.g., vitamins and sugars), highlighting the importance of diet in understanding cross-feeding in the gut microbiome. Interestingly, for many metabolites (e.g., phosphate, glucose, galactose and choline), we observe a high proportion of producers when species diversity is low, but the proportion of consumers overtakes producers as species richness increases (Fig. 3). We speculate that low species richness is associated with a lack of metabolites available for consumption, favouring species that are self-sufficient in producing these metabolites. High species diversity, on the other hand, is likely linked to higher net metabolite production by the community, providing more opportunities for consumer species to thrive. This hypothesis is consistent with two recent studies indicating that microbiomes associated with IBD (which typically have low species diversity) are enriched in bacteria with genomes that encode complete pathways for the synthesis and metabolism of essential amino acids and vitamins (including thiamine), while microbiomes of healthy individuals are enriched with bacteria that are expected to rely on cross-feeding for essential metabolites41,42. These studies, together with our results, suggest an extensive reliance on cross-feeding in healthy and diverse microbiomes.
Using CD as a case study, we demonstrated how the modelling framework can help define mechanistically informed hypotheses for targeted experimental and clinical validation. Our results suggest that CD patients lack microbial community members to support a healthy H2S balance. This gas is expected to have a protective effect in the gut when present in small amounts, but it disrupts the mucus layer and may cause inflammation when present in larger quantities43,44,45,46. Our results corroborate recent findings suggesting that the microbiome of IBD patients is particularly deficient in secreting metabolites containing sulfur20, and additionally indicate that H2S consumer species are disproportionately lost in CD. Microbial exchanges of H2S may affect the host directly through mechanisms such as modulating luminal pH32, or indirectly through cascade effects on microbiome composition.
The accuracy of the modelling framework applied here is limited by the use of automated genome-scale metabolic reconstructions, which represent phenotypes close to manually-curated models14 but are naturally unable to predict all organism-specific traits or secondary metabolism, especially if those rely on genes and pathways that are yet to be characterized. Automated genome-scale models provide an opportunity for a top-down approach, where large scale analyses like the one performed here can guide a range of more refined hypothesis-driven studies, ideally coupled with experimental validation. Additional refinement can be obtained in future studies handling smaller datasets by manual model curation, integration of additional ‘omics data, e.g., ref. 47 and other lines of evidence (e.g., machine learning methods trained on compositional data), and by integrating personalized data on host diet and metabolism48. It is also important to note that only the prokaryotic fraction of the microbiomes for which high-quality MAGs were reconstructed could be included in the models and that our analyses were performed at the species level (95% ANI), which may miss strain-level differences in metabolism. Future research applying the MES approach in combination with strain-level compositional information will be highly informative to identify biomarkers of health status and to better understand the ecology of these complex gut communities.
We expect that metagenome-informed metabolic models, coupled with an assessment of microbial cross-feeding interactions, will help alleviate one of the main barriers in the development of microbiome therapies – prioritizing which species or metabolites to target. By focusing on restoring key aspects of the gut ecology, we may be able to introduce more effective and long-lasting changes in the human gut microbiome.
Methods
Global survey of gut metagenomes and quality control
We performed a literature search for peer-reviewed studies with publicly available human stool metagenomes and associated metadata. These included large-scale meta-analyses of gut metagenomes and metadata compilations49,50. Studies focusing on dietary interventions, medications, exercise and children (<10 years old) were excluded. For longitudinal studies, only one sample per individual was included in the analyses. To minimize the impact of sequencing technologies, only studies reporting paired-end sequencing using Illumina’s HiSeq or NovaSeq platforms were included.
The healthy cohort included individuals reported as not having any evident disease or adverse symptoms50. Samples classified as disease controls and where the health status could not be determined were excluded. To avoid ambiguous health/disease status, samples from individuals with colorectal adenoma (non-cancerous tumour) and impaired glucose tolerance (pre-diabetes) were excluded, and only individuals with a Body Mass Index (BMI) between 18.5 and 24.9 were included in the healthy cohort. Samples with less than 15 M PE reads after quality control were excluded to minimize the impact of sequencing depth. A maximum of 100 samples per disease category from each study were used to minimize batch effects and reduce the dataset to a computationally feasible size.
Raw sequence reads were downloaded from NCBI and subject to quality control with TrimGalore v.0.6.6 (Krueger F. http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) using a minimum length threshold of 80 bp and a minimum Phred score of 25. Potential contamination with human sequence reads was removed by mapping the metagenome sequences to the human genome with Bowtie v.2.3.551. To minimize the impact of sequence depth, samples were rarefied to 15 M fragments (30 M PE reads) with seqtk v.1.3 (https://github.com/lh3/seqtk). The quality-controlled dataset contained 1697 samples, which are provided along with their metadata in Supplementary Data 1.
Metagenome assembly and binning
Assembly was performed for individual metagenomes with Megahit v.1.2.921. It has been shown that co-binning multiple samples yields a higher number of high-quality MAGs, but using co-abundance information requires significant computational resources52. We, therefore, divided the 1697 samples into two batches (indicated in Supplementary Data 1) and, for each of these batches, followed the steps recommended in the VAMB v.3.0.222 workflow. In short, we mapped quality-filtered sequenced reads against all contigs assembled within that batch with minimap253, and used VAMB to identify metagenome bins. The snakemake workflow for these steps (adapted from the VAMB github) is available in our Zenodo repository24. Completeness and contamination levels of metagenome bins were assessed with CheckM23. We retrieved 24,369 bins with >90% completeness and <0.05% contamination. These bins were dereplicated at 95%ANI using drep v.3.0.054, which selects the ‘best’ representative genome based on multiple quality metrics (completeness, contamination, strain heterogeneity, N50, centrality). De-replication resulted in 955 high-quality, species-level (95% ANI) metagenome-assembled genomes. These MAGs were taxonomically classified with GTDBtk v.1.5.155 and their species abundances across samples were calculated by mapping sequence reads to MAGs with KMA v.1.3.1356. The prevalence of MAGs across all samples was visualized along a tree built with GTDBtk55 and visualized with iTOL57.
Genome and metagenome-scale metabolic modelling
Genome-scale metabolic models (GEMs) were reconstructed for each species-level MAG with CarveMe v1.514. GEMs were produced using domain-specific templates for archaea and bacteria, an average European diet58 as medium for gap filling, and the IBM Cplex solver.
Metabolic exchanges between community members of a microbiome were calculated with MICOM v.0.2618. MICOM simulates growth and metabolic exchanges among members of the microbiome while accounting for their differential abundances, and it has been shown to estimate realistic growth rates. Furthermore, MICOM is computationally tractable when it comes to simulating diverse microbial communities (i.e., dozens-to-hundreds of species). Metabolic exchanges were estimated with MICOM’s growth workflow, using a 0.5 trade-off parameter, an average European diet as medium, and parsimonious Flux Balance Analysis (pFBA) to identify optimal growth rates and metabolic fluxes. The underlying CarveMe models contain relatively few carbon sources, leading to low growth rates and consequent numerical instability. Therefore, the fluxes of medium items were multiplied by 600 to feasibly calculate metabolic exchanges, and then corrected in the final results. We verified the bacterial growth rates estimated with MICOM for all samples, which were within the expected range (Fig. S6), suggesting that this multiplication step did not induce unrealistic growth. An optimal solution was not found for 36 samples, which were removed from the analysis (identified in Supplementary Data 1), resulting in a final dataset of 1661 samples. A snakemake workflow is provided in the Zenodo repository for reproducibility24.
Metabolite exchange scores
The underlying rationale to define the Metabolite Exchange Score (MES) is that an individual where metabolites are produced and consumed by multiple members of the microbiome will have a higher functional redundancy than an individual where these metabolites are produced and consumed by fewer species, which is a characteristic of most healthy ecosystems. For homogenized stool-derived metagenomes, which do not capture the patchiness in microbial aggregates found in the gut, high functional redundancy increases the likelihood that most micro-niches are populated by at least one species. The MES weighs the number of microbial species consuming and producing a given metabolite, in a given microbiome sample. MES was defined for each metabolite as the harmonic mean between potential consumers and producers (Eq. 1):
Where P is the number of potential producers and C is the number of potential consumers of a given metabolite. Note that MES will be zero if a metabolite is only produced or only consumed but not exchanged among microorganisms.
The specific metabolites for which cross-feeding partners were significantly lost were identified with a Kruskal–Wallis test comparing diseased phenotypes against the healthy population. The Bonferroni method was used to account for multiple tests (0.05 as target alpha, divided by the number of tests), and only metabolites present in at least 50 individuals, including at least 15 diseased subjects, were included in the analyses. Water and oxygen were excluded from the analyses. For a simplified graphical representation (Fig. 2c), metabolites were selected for display if they showed a significant reduction in the number of cross-feeding partners, and if they were in the top 5 metabolites with the highest difference in MES in any disease. Barplots were generated and coloured according to the metabolite Sub Class defined in the Human Metabolome Database59 using the ggplot2 R package60. An additional word cloud including up to 100 metabolites with significant MES differences between healthy and diseased was generated with the wordcloud R package61.
Species diversity effects
To estimate taxonomic diversity, the metagenome reads were mapped to the 955 species-level MAGs with KMA v.1.3.1356. Shannon index and species richness (total number of species in each sample, according to the reads mapping result) were used to quantify alpha-diversity, and compared between healthy and diseased microbiomes using the Wilcoxon test (holm method to account for multiple comparisons). Species richness were then used as a measure of species diversity for downstream analyses.
Differences in the slopes between species diversity and consumer or producer correlations were assessed on the entire dataset (including healthy and diseased microbiomes) by fitting a linear model (lm) in R, considering the interaction between number of producers and consumers with their category (producer or consumer). The statistical significance for the difference between slopes was corrected for multiple comparisons using the Bonferroni method.
Nutritional interactions in the microbiome associated with Crohn’s disease
We selected a case-control study for an in-depth analysis that demonstrates how our framework can be applied to identify promising therapeutic targets. Given that the completeness of metagenome-assembled genomes is optimized by co-binning large datasets22, we opted to select a case-control study from our quality-controlled dataset to take advantage of the large number of high-quality MAGs used to model community-wide metabolism. A total of 84 samples from the study of He and colleagues33—the largest CD study within our dataset—passed our quality control and were included in our analyses, including 46 patients with Crohn’s disease and 38 healthy controls. The specific metabolites for which cross-feeding partners were lost were identified with a Kruskal–Wallis test, using only metabolites observed in over half of the samples and adjusting for multiple tests with a Bonferroni correction.
The flux of H2S, estimated in millimoles per hour per gram of dry weight, was multiplied by species abundances to obtain the total H2S production and consumption exchanged among microorganisms. Fluxes were log2-transformed for the statistical tests and graphical representation. Differences between the diversity of H2S producers and consumers, ratios of producers to consumers, and their fluxes was evaluated with Kruskal–Wallis tests. The H2S predicted to be exported to medium was used to estimate the excess H2S production by the microbiome.
We used a nested linear model to account for the confounding effects of species diversity on the associations between number or flux of producers/consumers and disease status. Samples containing less than 99 species (the minimum number of species in the healthy cohort) were excluded from this analysis (n = 58 samples remaining), ensuring a linear relationship between species diversity and number of H2S consumers or producers.
To better understand the genetic basis of H2S production and consumption in MAGs observed within the CD case-control study, we performed a Hidden Markov Model (HMM) survey of 74 genes involved in H2S cycling36 with HMMer v.3.3.262, using trusted cutoff scores to ensure homology. We used a linear model to test if these genes were differentially distributed between healthy and CD individuals, using only samples with at least 100 species and genes observed in at least 10 samples. Analyses were performed considering both MAGs abundance (by multiplying gene counts by spp. abundance) and prevalence (using species presence/absence, which would be more informative when relatively rare taxa are responsible for a large proportion of the production and consumption of H2S). Data was offset by 0.1 to avoid infinity upon log-transformation, species diversity was used as a confounding variable and the Bonferroni correction was used to account for multiple comparisons.
In order to identify species that may be promising targets of microbiome therapy in CD, we weighted in their flux of H2S and relative abundances within CD and healthy cohorts. Specifically, weighted H2S fluxes of each microbial species was estimated by multiplying their H2S fluxes by their relative abundances. The weighted sum of H2S fluxes was calculated as the sum of all weighted fluxes within healthy or diseased cohorts. Differences in the weighted sum of H2S between healthy and CD cohorts pointed to the key H2S producers and consumers associated with Crohn’s disease. The Crohn’s disease cohort contained more individuals than the healthy one, therefore, eight random samples were excluded to ensure the same number of individuals (38) in healthy and diseased categories. The metabolic model of Roseburia intestinalis, one key H2S consumer, was visualized with Fluxer63 using best k-shortest paths to visualize pathways between H2S intake and cell growth.
To better understand how the modelling framework compare to more traditional composition-based analyses, we visualized the community beta diversity using a PCA plot of CLR-normalized species abundances with mixOmics64, using the balanced dataset from He and colleagues33 described above. We then performed a random-forest analysis65 where 70% of the samples were randomly selected for training the model and the remaining 30% were used to test the classifier. Feature importance (mean decrease in Gini) was used to rank the species that most explained the variation between healthy and CD-associated microbiomes.
Statistics and reproducibility
The statistical tests applied here are described within their relevant section above using R. For reproducibility, we provide the R scripts in our Zenodo repository24. Data exclusion was performed based on quality/sequencing depth of metagenomes and completeness of the metadata (see ‘Global survey of gut metagenomes and quality control section’). No statistical method was used to predetermine sample size.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The data used in this study is publicly available in the European Nucleotide Archive (ENA). All assemblies and MAGs reconstructed in this study have been deposited in ENA under project PRJEB63093. BioSample IDs for the raw sequence data and assembly IDs for the assemblies performed in this study are provided in Supplementary Data 1. ENA sample accessions for all metagenome bins reconstructed in this study are provided in Supplementary Data 12, and the ENA analysis ID for the 955 species-level MAGs are provided in Supplementary Data 2. All high-quality MAGs are also available in Zenodo24 [https://zenodo.org/record/8223163]. Metabolite classes were inferred from the Human Metabolome Database HMDB 4.0 [https://hmdb.ca].
Code availability
The code developed to run the metabolic modelling analysis, perform statistical tests and to produce the graphs presented here, along with a step-by-step description of the analysis workflow, are available in Zenodo24: https://zenodo.org/record/8223163 (repository v.1.2.2), and in GitHub: https://github.com/vrmarcelino/MetaModels.
References
Wang, T., Goyal, A., Dubinkina, V. & Maslov, S. Evidence for a multi-level trophic organization of the human gut microbiome. PLOS Comput. Biol. 15, e1007524 (2019).
Fischbach, M. A. & Sonnenburg, J. L. Eating for two: how metabolism establishes interspecies interactions in the gut. Cell Host Microbe 10, 336–347 (2011).
Gralka, M., Szabo, R., Stocker, R. & Cordero, O. X. Trophic interactions and the drivers of microbial community assembly. Curr. Biol. 30, R1176–R1188 (2020).
Goyal, A., Wang, T., Dubinkina, V. & Maslov, S. Ecology-guided prediction of cross-feeding interactions in the human gut microbiome. Nat. Commun. 12, 1335 (2021).
Coyte, K. Z., Schluter, J. & Foster, K. R. The ecology of the microbiome: networks, competition, and stability. Science 350, 663–666 (2015).
Moya, A. & Ferrer, M. Functional redundancy-induced stability of gut microbiota subjected to disturbance. Trends Microbiol. 24, 402–413 (2016).
Tian, L. et al. Deciphering functional redundancy in the human microbiome. Nat. Commun. 11, 6217 (2020).
Fassarella, M. et al. Gut microbiome stability and resilience: elucidating the response to perturbations in order to modulate gut health. Gut 70, 595–605 (2021).
Sung, J. et al. Global metabolic interaction network of the human gut microbiota for context-specific community-scale analysis. Nat. Commun. 8, 15393 (2017).
Fang, X., Lloyd, C. J. & Palsson, B. Ø. Reconstructing organisms in silico: genome-scale models and their emerging applications. Nat. Rev. Microbiol. 18, 731–743 (2020).
Heinken, A., Basile, A., Hertel, J., Thinnes, C. & Thiele, I. Genome-scale metabolic modeling of the human microbiome in the era of personalized medicine. Annu. Rev. Microbiol. 75, 199–222 (2021).
Freilich, S. et al. Competitive and cooperative metabolic interactions in bacterial communities. Nat. Commun. 2, 589 (2011).
Levy, R. & Borenstein, E. Metabolic modeling of species interaction in the human microbiome elucidates community-level assembly rules. Proc. Natl Acad. Sci. 110, 12804–12809 (2013).
Machado, D., Andrejev, S., Tramontano, M. & Patil, K. R. Fast automated reconstruction of genome-scale metabolic models for microbial species and communities. Nucleic Acids Res. 46, 7542–7553 (2018).
Magnúsdóttir, S. et al. Generation of genome-scale metabolic reconstructions for 773 members of the human gut microbiota. Nat. Biotechnol. 35, 81–89 (2017).
Heinken, A. et al. Genome-scale metabolic reconstruction of 7302 human microorganisms for personalized medicine. Nat. Biotechnol. 41, 1320–1331 (2023).
Zelezniak, A. et al. Metabolic dependencies drive species co-occurrence in diverse microbial communities. Proc. Natl Acad. Sci. 112, 6449–6454 (2015).
Diener, C., Gibbons, S. M. & Resendis-Antonio, O. MICOM: metagenome-scale modeling to infer metabolic interactions in the gut microbiota. mSystems 5, e00606–e00619 (2020).
Zorrilla, F., Buric, F., Patil, K. R. & Zelezniak, A. metaGEM: reconstruction of genome scale metabolic models directly from metagenomes. Nucleic Acids Res. 49, e126–e126 (2021).
Heinken, A., Hertel, J. & Thiele, I. Metabolic modelling reveals broad changes in gut microbial metabolism in inflammatory bowel disease patients with dysbiosis. Npj Syst. Biol. Appl. 7, 19 (2021).
Li, D., Liu, C.-M., Luo, R., Sadakane, K. & Lam, T.-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31, 1674–1676 (2015).
Nissen, J. N. et al. Improved metagenome binning and assembly using deep variational autoencoders. Nat. Biotechnol. 39, 555–560 (2021).
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Marcelino, V. R. et al. Code for community-wide metabolic modelling, calculation of metabolite exchange scores (MES) and statistical tests. version 1.2.2. https://doi.org/10.5281/zenodo.8223163 (2023).
Uebanso, T., Shimohata, T., Mawatari, K. & Takahashi, A. Functional roles of B‐vitamins in the gut and gut microbiome. Mol. Nutr. Food Res. 64, 2000426 (2020).
Mortensen, P. B., Holtug, K. & Rasmussen, H. S. Short-chain fatty acid production from mono- and disaccharides in a fecal incubation system: implications for colonic fermentation of dietary fiber in humans. J. Nutr. 118, 321–325 (1988).
Baker, H. et al. Inability of chronic alcoholics with liver disease to use food as a source of folates, thiamin and vitamin B6. Am. J. Clin. Nutr. 28, 1377–1380 (1975).
Tallaksen, C. M. E., Bell, H. & Bøhmer, T. The concentration of thiamin and thiamin phosphate esters in patients with alcoholic liver cirrhosis. Alcohol. Alcohol. 27, 523–530 (1992).
Costantini, A. & Pala, M. I. Thiamine and fatigue in inflammatory bowel diseases: an open-label pilot study. J. Altern. Complement. Med. 19, 704–708 (2013).
Tsuruya, A. et al. Ecophysiological consequences of alcoholism on human gut microbiota: implications for ethanol-related pathogenesis of colon cancer. Sci. Rep. 6, 27923 (2016).
Mottawea, W. et al. Altered intestinal microbiota–host mitochondria crosstalk in new onset Crohn’s disease. Nat. Commun. 7, 13419 (2016).
Dordević, D., Jančíková, S., Vítězová, M. & Kushkevych, I. Hydrogen sulfide toxicity in the gut environment: meta-analysis of sulfate-reducing and lactic acid bacteria in inflammatory processes. J. Adv. Res. 27, 55–69 (2021).
He, Q. et al. Two distinct metacommunities characterize the gut microbiota in Crohn’s disease patients. GigaScience 6, 1–11 (2017).
Roediger, E. W. & Millard, S. Reducing sulfur compounds of the colon impair coionocyte nutrition: implications for ulcerative colitis. Gastroenterology 104, 802–809 (1993).
Braccia, D. J., Jiang, X., Pop, M. & Hall, A. B. The capacity to produce hydrogen sulfide (H2S) via cysteine degradation is ubiquitous in the human gut microbiome. Front. Microbiol. 12, 705583 (2021).
Wolf, P. G. et al. Diversity and distribution of sulfur metabolic genes in the human gut microbiome and their association with colorectal cancer. Microbiome 10, 64 (2022).
Perez-Sanchez, C. et al. POS0394 NAD+ boosters reestablish the altered NAD+ metabolism of leukocytes from rheumatoid arthritis patients improving their oxidative, apoptotic and inflammatory status. Ann. Rheum. Dis. 80, 426.2–426 (2021).
Mehmel, M., Jovanović, N. & Spitz, U. Nicotinamide riboside—the current state of research and therapeutic uses. Nutrients 12, 1616 (2020).
LoConte, N. K., Brewster, A. M., Kaur, J. S., Merrill, J. K. & Alberg, A. J. Alcohol and cancer: a statement of the American Society of Clinical Oncology. J. Clin. Oncol. 36, 83–93 (2018).
Louis, P., Hold, G. L. & Flint, H. J. The gut microbiota, bacterial metabolites and colorectal cancer. Nat. Rev. Microbiol. 12, 661–672 (2014).
Watson, A. R. et al. Metabolic independence drives gut microbial colonization and resilience in health and disease. Genome Biol. 24, 78 (2023).
Veseli, I. et al. Microbes with higher metabolic independence are enriched in human gut microbiomes under stress. eLife. 12, RP89862 (2023).
Blachier, F. et al. Luminal sulfide and large intestine mucosa: friend or foe? Amino Acids 39, 335–347 (2010).
Gemici, B. & Wallace, J. L. Anti-inflammatory and cytoprotective properties of hydrogen sulfide. in Methods in Enzymology Vol. 555, 169–193 (Elsevier, 2015).
Wallace, J. L., Motta, J.-P. & Buret, A. G. Hydrogen sulfide: an agent of stability at the microbiome-mucosa interface. Am. J. Physiol. Gastrointest. Liver Physiol. 314, G143–G149 (2018).
Blachier, F., Beaumont, M. & Kim, E. Cysteine-derived hydrogen sulfide and gut health: a matter of endogenous or bacterial origin. Curr. Opin. Clin. Nutr. Metab. Care 22, 68–75 (2019).
Zampieri, G., Campanaro, S., Angione, C. & Treu, L. Metatranscriptomics-guided genome-scale metabolic modeling of microbial communities. Cell Rep. Methods 3, 100383 (2023).
Thiele, I. et al. Personalized whole‐body models integrate metabolism, physiology, and the gut microbiome. Mol. Syst. Biol. 16, e8982 (2020).
Pasolli, E. et al. Accessible, curated metagenomic data through ExperimentHub. Nat. Methods 14, 1023–1024 (2017).
Gupta, V. K. et al. A predictive index for health status using species-level gut microbiome profiling. Nat. Commun. 11, 4635 (2020).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Salazar, V. W. et al. Metaphor—a workflow for streamlined assembly and binning of metagenomes. GigaScience 12, giad055 (2022).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 11, 2864–2868 (2017).
Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36, 1925–1927 (2019).
Clausen, P. T. L. C., Aarestrup, F. M. & Lund, O. Rapid and precise alignment of raw reads against redundant databases with KMA. BMC Bioinformatics 19, 307 (2018).
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
Noronha, A. et al. The Virtual Metabolic Human database: integrating human and gut microbiome metabolism with nutrition and disease. Nucleic Acids Res. 47, D614–D624 (2019).
Wishart, D. S. et al. HMDB 4.0: the human metabolome database for 2018. Nucleic Acids Res. 46, D608–D617 (2018).
Wickham, H. ggplot2: Elegant graphics for data analysis. Springer-Verlag New York (2016).
Fellows, I. wordcloud : Word Clouds. R package version 2, 331 (2018).
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Hari, A. & Lobo, D. Fluxer: a web application to compute, analyze and visualize genome-scale metabolic flux networks. Nucleic Acids Res. 48, W427–W435 (2020).
Rohart, F., Gautier, B., Singh, A. & Lê Cao, K.-A. mixOmics: an R package for ‘omics feature selection and multiple data integration. PLOS Comput. Biol. 13, e1005752 (2017).
Liaw, A. & Wiener, M. Classification and regression by randomForest. R News 2, 18–22 (2002).
Acknowledgements
This work was supported by the Australian Research Council (DP190101504) and the Australian National Health and Medical Research Council (APP1181105 and APP1186371). V.R.M. is supported by an Australian Research Council DECRA Fellowship (DE220100965), C.G. is supported by an National Health & Medical Research Council EL2 Fellowship (APP1178715), and S.C.F. is supported by a CSL Centenary Fellowship. S.M.G. and C.D. were supported by the National Institute of Diabetes and Digestive and Kidney Diseases of the National Institutes of Health (R01DK133468). The authors acknowledge the Monash eResearch Centre for access to computational resources and expertise and the support of the Victorian Government’s Operational Infrastructure Support Program. We thank Dr Paul Harrison and Dr Jamie Gearing for statistical and bioinformatics advice, and Dr Lucas Schiffer for help with curatedMetagenomicData. We also thank the stool donors and researchers who made their metadata publicly available and the reviewers of this manuscript for their constructive feedback. Open access charges funded by the Hudson Institute of Medical Research.
Author information
Authors and Affiliations
Contributions
V.R.M. and S.C.F. designed the study. V.R.M. and R.B.Y. identified samples and curated the metadata. V.R.M. conducted the metabolic modelling analyses. C.D. and S.M.G. assisted with data analysis and interpretation. C.W. and C.G. performed the survey of H2S genes. E.L.G., E.L.R., and R.B.Y. contributed with bacterial microbiology expertise, and E.M.G contributed with clinical expertise in IBD. All authors contributed to the results interpretation and manuscript writing.
Corresponding authors
Ethics declarations
Competing interests
S.C.F. is an inventor on patents and has acted as an advisor to BiomeBank and Microbiotica. R.B.Y. has acted as an advisor to BiomeBank. All other authors have no competing interests to declare.
Peer review
Peer review information
Nature Communications thanks Francisco Zorrilla and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Marcelino, V.R., Welsh, C., Diener, C. et al. Disease-specific loss of microbial cross-feeding interactions in the human gut. Nat Commun 14, 6546 (2023). https://doi.org/10.1038/s41467-023-42112-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-023-42112-w
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.