The hallmarks of dietary intervention-resilient gut microbiome

Klimenko, Natalia S.; Odintsova, Vera E.; Revel-Muroz, Anastasia; Tyakht, Alexander V.

doi:10.1038/s41522-022-00342-8

Download PDF

Article
Open access
Published: 08 October 2022

The hallmarks of dietary intervention-resilient gut microbiome

Natalia S. Klimenko ORCID: orcid.org/0000-0001-9640-0102¹,
Vera E. Odintsova²,
Anastasia Revel-Muroz¹ &
…
Alexander V. Tyakht^1,2

npj Biofilms and Microbiomes volume 8, Article number: 77 (2022) Cite this article

8190 Accesses
15 Citations
20 Altmetric
Metrics details

Subjects

Abstract

Maintaining equilibrium of the gut microbiome is crucial for human health. Diet represents an important and generally accessible natural channel of controlling the nutrients supply to the intestinal microorganisms. Although many studies showed that dietary interventions can specifically modulate gut microbiome composition, further progress of the approach is complicated by interindividual variability of the microbial community response. The reported causes of this variability include the baseline microbiome composition features, but it is unclear whether any of them are intervention-specific. Here, we applied a unified computational framework to investigate the variability of microbiome response measured as beta diversity in eight various dietary interventions using previously published 16S rRNA sequencing datasets. We revealed a number of baseline microbiome features which determine the microbiome response in an intervention-independent manner. One of the most stable associations, reproducible for different interventions and enterotypes, was a negative dependence of the response on the average number of genes per microorganism in the community—an indicator of the community functional redundancy. Meanwhile, many revealed microbiome response determinants were enterotype-specific. In Bact1 and Rum enterotypes, the response was negatively correlated with the baseline abundance of their main drivers. Additionally, we proposed a method for preliminary assessment of the microbiome response. Our study delineats the universal features determining microbiome response to diverse interventions. The proposed approach is promising for understanding the mechanisms of gut microbiome stability and improving the efficacy of personalised microbiome-tailored interventions.

The Spanish gut microbiome reveals links between microorganisms and Mediterranean diet

Article Open access 10 November 2021

Adriel Latorre-Pérez, Marta Hernández, … Luis Collado

Impact of long-term dietary habits on the human gut resistome in the Dutch population

Article Open access 03 February 2022

Paul B. Stege, Joost Hordijk, … Fernanda L. Paganelli

Intermittent fasting modulates the intestinal microbiota and improves obesity and host energy metabolism

Article Open access 07 April 2023

Xiangwei Hu, Kai Xia, … Shenglin Duan

Introduction

Currently, there is lots of evidence for multifaceted interactions between human health and gut microbial community structure. Changes of microbiome composition were observed for diverse diseases including inflammatory bowel diseases^1,2,3,4, colorectal cancer^5,6, type 2 diabetes mellitus^7,8 and obesity^9,10. In addition to the pathologies directly related to the gastrointestinal tract functioning and digestion, the associations with microbiome were also observed for cardiovascular diseases^11,12 and allergies¹³, as well as disorders affecting the nervous system - such as Parkinson's disease¹⁴, autism¹⁵ and Viliuisk encephalomyelitis¹⁶. The number of associations between disease states and gut microbiome is steadily increasing. Moreover, multiple studies on model animals colonised with defined microbial consortia demonstrated the causality of microbiome changes in regards to the host health disruption¹⁷.

Two important ecological properties of gut microbiome are its abilities: (1) to maintain its taxonomic composition stability over time and, on the other hand, (2) to alter its composition rapidly under the influence of certain factors, in particular, diet changes. Several studies with a large number of sampled time points per individual showed that, in general, the interindividual variability of the microbiome composition exceeds the intraindividual variability over time^18,19,20. On the contrary, short-term changes in diet, administration of medicines or probiotics, as well as onset of an intestinal infection, can cause significant changes in the composition of the colon microbiota in a fairly short time (measured in days)¹⁸. For at least some of these changes, their directions were consistent across subjects and reproducible in different cohorts. However, there are differences between the subjects in the magnitude of these changes - in other words, the interindividual variability of the microbiome response to intervention^21,22,23. The variability was observed for different types of interventions, such as probiotic and prebiotic intake, specific foods, overall dietary pattern changes^24,25,26 and faecal microbiota transplantation²⁷.

One of the most intriguing challenges in such studies is to identify the possible causes of the observed variability. This issue is interesting from both fundamental and practical perspectives. It can elucidate the ecological mechanisms underlying diversity of gut communities²⁸; on the other hand, it can help tailor the optimal intervention type for a particular person targeting health improvements^23,29,30. In several studies addressing this question, the researchers have suggested that one of the main factors influencing the rate of microbiome response to dietary intervention is the individual prior long-term dietary pattern^26,31 (Supplementary Table 1). However, in some other studies, no associations with long-term diet were observed³². Other factors suggested to affect the magnitude of microbiome response included host gene expression pattern²⁵, various metabolic parameters³³ as well as baseline (initial) microbiome composition (Supplementary Table 1)^{24,25,26,32,33,34,35,36,37,38,39,40,41,42,43,44,45}. Baseline features of gut microbiome composition are outstanding among such factors, as due to the potential of modulating gut community, there is an opportunity to control the individual response in the perspective. The microbiome features shown to influence the degree of response include alpha diversity, relative abundance of specific taxa and metabolic pathways. Some studies showed good performance of the subjects classification into responders and non-responders based on baseline features including microbiome content^32,44. Moreover, some of the predictions were confirmed mechanistically in interventional experiments using animal models: transplantation of the responders’ microbiome to the animal caused a more pronounced response to the subsequent intervention⁴⁶.

The majority of studies to investigate dependence of gut community response to dietary interventions on its initial state have small sample size (<50 subjects per intervention (Supplementary Table 1)). Small sample size along with the differences in study design and analytical approaches hamper generalisation of the conclusions across these studies. Particularly, it remains unclear whether among the baseline features associated with the microbiome response there are universal for various interventions. In the present study, we have analysed the microbiome response variation and its dependence on the baseline microbiome composition under eight distinct dietary interventions using previously published data on adult subjects. These interventions utilised different ways of microbiome modulation including change of dietary pattern, prebiotics and probiotics intake, as well as intake of products that are believed not to affect microbiome (placebo groups). By applying unified data analysis methods to these studies, we explored the questions of how the microbiome response can be measured, if there are microbiome features associated with the response universal among interventions and whether it is possible to partly predict the response prior to the intervention.

Results

The metrics of microbiome response to dietary interventions

One of the widely applied ways to evaluate microbiome response is to calculate beta diversity between paired samples collected before and after the intervention. As mentioned above, the aim of this work is to find the dependence of microbiome response on its initial composition. In this context, one of the problems of using beta diversity for assessing sustainability is its dependence on the alpha diversity of the compared communities, known from previous research in ecology⁴⁷. The problem is that this dependence can have both a biological component (that is of high interest to investigate) and a purely computational one (that it is desirable to exclude). Computational component arises from the mutual dependence of alpha and beta diversities from the specific sample characteristics such as sparsity, and depends on the chosen diversity metrics. We explored the computational component on 500 sample pairs from the FGFP (Flemish Gut Flora Project) cohort⁴⁸. By using two abundance randomisation techniques (see Materials and Methods, Fig. 1a), we showed that the computational dependence exists for two diversity metrics widely used in microbiome research - Aitchison⁴⁹ and Bray–Curtis dissimilarities (Fig. 1b, c, randomised data).

**Fig. 1: Investigation of computational components in the relationship between alpha and beta diversity.**

One way to evaluate differences between communities taking account of the computational dependence on alpha diversity is the one based on the so-called null-models. The approach originally developed by Raup and Crick⁵⁰ and adopted for ecology and microbiome analysis by Chase et al.⁴⁷ and Stegen et al.⁵¹ allows one to estimate how much the observed beta diversity differs from that expected for a given alpha diversity, average taxa abundance and prevalence in communities. We have chosen the variant of the Raup–Crick metric based on Bray–Curtis dissimilarities proposed by Stegen et al. and denoted as RC_bray⁵¹. It varies from −1 to 1, where −1 corresponds to completely identical communities, and 1 - to the most distinct ones. It can be seen that, in contrast to the beta diversity, such an estimate does not carry a computational dependence on the alpha diversity (Fig. 1d, randomised data). This is unsuprising, since RC_bray was developed specifically to correct for the computational dependence and the procedure for its calculation even includes a randomisation similar to the “randomisation 2” in our analysis. This fact makes RC_bray the metric of choice for our purposes. For validation purposes, the analyses involving adjustment for alpha diversity were additionally conducted using other beta diversity metrics.

Dependence of response on the baseline microbiome composition

We explored the microbiome response to dietary interventions in five independent studies resulting in eight different interventions lasting from 2 to 10 weeks (see Materials and Methods). Briefly, the interventions were: high fibre diet (HFD)⁴⁴, fermented dairy product fortified with probiotic (FDP)⁴⁵, resistant starch from potatoes (F.POT)³⁷, resistant starch from maize (F.HIM)³⁷, inulin from chicory root (F.INU)³⁷, accessible corn starch (F.ACC)³⁷, maltodextrin (MAL)³⁵ and non-meat diet (NM)⁴³. The overall dataset included 1242 microbiomes from 641 individuals. The countries where the studies took place were USA, Russia and New Zealand (Supplementary Fig. 1); the metadata collected in the studies are summarized in the Supplementary Table 2.

As expected, baseline taxa abundances strongly differed across the studies (Fig. 2a). However, enterotyping in the context of the large FGFP cohort (see Materials and Methods) showed that each original FGFP enterotype (named Bact1, Bact2, Rum and Prev as in ref. ⁵²) included samples from more than one study (Fig. 2b). Therefore, all further tests including the combined set of the interventions were conducted with the correction for the intervention (in most cases - by including it as a random effect in mixed effect models); this affects the variants of tests both with and without stratification by enterotype.

**Fig. 2: Variation of microbiome composition in the analysed data.**

The microbiome response was measured as RC_bray within the pairs of samples collected before and after the intervention. In all investigated studies, the microbiome response to dietary intervention varied notably and exceeded the variance of the mean response in different studies (intrastudy variance 0.3722 ± 0.0612, interstudy variance of the mean response 0.0080, Fig. 2c, d). The variance was not correlated neither with the mean response nor with the intervention time period (linear regression, p > 0.1). The response was not correlated with any of the baseline sample factors collected in the studies either (Supplementary Table 2).

The overall variance of the response did not correlate with the microbial composition at the first time point (dbRDA, p = 0.3017, R² = 2%, Fig. 3a). However, the effect was observed when stratification by enterotype was performed: significant associations were revealed for the Rum (dbRDA, p = 0.0003, R² = 1.2%, N = 212, p) and Bact2 (dbRDA, p = 0.0403, R² = 8.0%, N = 195) enterotypes, while for the Prev (dbRDA, p = 1, R² = 1.5%, N = 67) and Bact1 (dbRDA, p = 0.4031, R² = 1.4%, N = 167) enterotypes the results remained non-significant. When the studies were analysed separately, association was detected in 5 of 8 interventions: HFD, FDP, F.POT, F.INU and MAL (dbRDA, p < 0.1, R² - from 1% to 5%).

**Fig. 3: Associations between microbiome response to interventions and baseline microbiome composition.**

As for distinct microbiome features, dependence of response from three variables - baseline alpha diversity, Bacteroidetes:Firmicutes ratio (B:F) and mean number of genes per organism in the community (AGN, average gene number) - were analysed using partial correlation network due to potential correlations between the predictors (correlation between two components are computed with the correction for all the others, with intervention used as a random effect, see Methods). Baseline AGN was negatively associated with the response, while alpha diversity and B:F - positively (partial correlations, FDR < 0.05). The same effects were observed for most of the interventions analysed separately (partial correlations, FDR < 0.05, Fig. 3b). Enterotype stratification showed that associations of AGN and alpha diversity with the response were observed in each enterotype except for the Prev (partial correlations, FDR < 0.05, Fig. 3b, c); association with the B:F ratio was detected only in the Bact2 enterotype (partial correlations, FDR < 0.05, Fig. 3b, c). Since partial correlation networks include a correction for alpha diversity, we were able to validate the results for B:F and AGN using seven other beta diversity metrics for response calculation (Bray–Curtis, Aitchison, generalised UniFrac, weighted UniFrac, unweighted UniFrac, Jaccard and inverse Pearson correlation) (Fig. 3b). According to the analysis type including all samples the most reproducible was negative association with AGN (observed for all five weighted metrics), while positive association with B:F ratio was reproduced using three metrics.

Since the estimate of the AGN from 16S rRNA data might be quite imprecise, we performed additional validation of partial correlation results using “shotgun” metagenomic dataset related to a dietary intervention⁵³. The intervention included 126 subjects who consumed blended cooking oils or refined olive oil for 8 weeks (three time points were collected: baseline, 2 weeks and 8 weeks). For both time intervals (2 weeks and 8 weeks), we confirmed negative association of the response with AGN (partial correlations, p = 0.0024 and 0.0025, respectively) and positive association with alpha diversity (partial correlations, p = 0.0084 and 0.0096) (Supplementary Fig. 2). For B:F ratio, no significant associations were observed. The validity of our AGN measure was supported by its high correlation with the average genome size estimate calculated using MicrobeCensus⁵⁴ (linear regression, p < 2·10⁻¹⁶, Supplementary Fig. 3).

As for the associations of the response with the baseline abundance of individual microbial species, there were 52 significant taxa for combined interventions, 55 - for at least one enterotype under enterotype stratification and 37 - for at least one intervention when the stratification was conducted by intervention (Supplementary Table 3, linear mixed effect regression and simple linear regression, FDR < 0.05). Almost all significant results with enterotype stratification were identified for the Rum and Bact2 enterotypes, several associations - for Bact1 and zero - for Prev (the latter fact was possibly due to its lower number of samples). The majority of results were enterotype-specific: 28 associations were unique to Bact2 and 14 - to Rum, while 13 associations were shared among more than one enterotype. The most reproducible associations were negative dependence of response with the abundance of unclassified species from Blautia genus (detected in three enterotypes and four interventions) and [Ruminococcus] gnavus (detected in two enterotypes and six interventions). Among the enterotype-specific associations, the strongest were negative dependencies of response from unclassified Bacteroides abundance in Bact2 enterotype and with unclassified (Clostridiaceae/Ruminococcaceae) - in Rum enterotype (“the strongest” here means the one characterised by the maximum number of separate interventions where it was observed, Supplementary Table 3). We also repeated this analysis with the adjustment for alpha diversity, keeping in mind that we will lose taxa associated with alpha diversity (Supplementary Table 4). However, in this case, we will be able to assess reproducibility of the results using various beta diversity metrics to response calculations. The majority of the significantly associated taxa were metric-specific, with the largest number of associations being detected for RC_bray. The only association that was replicated for all weighted metrics was a positive response to [Eubacterium]-biforme abundance in the Bact2 enterotype and overall mixed-effect model.

Microbiome response potential

According to our observations, some microbiome features associated with the response were shared between distinct interventions. In this connection, we proposed the following assumptions and definitions:

Assumption 1: There exist microbiome markers that partly determine the response of the microbiome a priori, regardless of the intervention type.

Definition 1: Response potential - the component of the response that is determined solely by the individuals’ initial internal microbiome characteristics.

Thus, the observed response to an intervention is represented by the superposition of the response potential and the intervention-specific component.

Definition 2: Landscape of response potential - the distribution of response potential in a multidimensional space of taxa abundances.

Assumption 2: Areas of the landscape with low response potential would be characterised by increased sample density in cohorts with sufficiently large sample sizes and one sample per subject.

The latter assumption is based on the ergodicity property, which is used for macroecological communities⁵⁵. We propose to further extend this property for microbiomes in the sense that during the adult period of life of a generally healthy individual, its microbiome can with a certain probability take on a configuration that is close enough to almost any of the compositions observed in other individuals (of the same age range). Then, such property of ergodic systems as “time average is equal to the average over space” can be interpreted as the ability to assess the stability of a single community a priori based on its position relative to the communities of other subjects in a sufficiently large cohort.

Proceeding from Assumption 2, the response potential can be estimated from its position relative to samples of other individuals in a multidimensional space of taxa abundances. One way to do this is to calculate the average beta diversity between a given sample and the samples of a relatively large number of other individuals from the same enterotype. We performed such calculations for each baseline sample in each intervention using the FGFP cohort samples and all other baseline samples as a reference (see Materials and Methods).

We introduced response potential to the partial correlation network (along with AGN, alpha diversity and B:F ratio) to explore its relation to response and baseline microbiome characteristics. A significant positive correlation between response and response potential was observed both in combined data and in all enterotypes except for the Bact1 (partial correlations, FDR < 0.05, Fig. 4a). These observations support our assumptions stated above. Among the associations with baseline microbiome composition, the response potential replaced the response in association with AGN, but not with the alpha diversity and B:F (Fig. 4a). The correlation of response potential with the response and AGN were validated using other beta diversity metrics (Supplementary Fig. 4).

**Fig. 4: Associations of microbiome response potential with the baseline microbiome composition features and response to the interventions.**

We also tested the selected method of response potential evaluation using a machine learning approach. In particular, we estimated the extent to which the response component predictable by the initial microbiome composition is correlated with our estimate of the response potential. We trained a gradient tree boosting model (XGBoost) to predict the response based on baseline microbiome content (Materials and Methods, Fig. 4b). The model showed better cross-validation performance characteristics compared to a random model (Fig. 4b, Table 1). Then we tested the model initially trained to predict response using the values of response potential instead of response in the testing set. This substitution improved the performance characteristics for the majority of studies (Fig. 4b, Table 1), showing that the proposed approach to calculate response potential is reasonable. In other words, the result of the machine learning algorithm, which is a complex function of baseline taxa abundance, converged with the assessment of the response potential proposed by us via the position of the analysed microbiome relative to others. The robustness of our response potential concept was further supported by the fact that a response potential calculated for the second time point samples was highly correlated with the above-described calculations from the baseline values suggesting that this characteristic is only slightly affected by the analysed interventions (Supplementary Fig. 5).

Table 1 Average model quality characteristics obtained in 50 iterations of cross-validation trained and tested on initial data, on the data with randomly permuted sample names, and after substitution of response values with the response potential values in the testing sets.

Full size table

The response potential, according to our definition, does not depend on the type or presence of intervention, and can be calculated from a single time point. To validate the associations of the response potential with microbiome features, we tested them on the FGFP cohort dataset⁴⁸. Association with AGN was confirmed for all the enterotypes as well as for the entire cohort (partial correlations, FDR < 0.05, Fig. 4c). Associations with alpha diversity and B:F were detected for the entire cohort as well as for several enterotypes (partial correlations, FDR < 0.05, Fig. 4c).

Discussion

Pronounced interindividual variability of response to dietary interventions has been observed in numerous studies examining diverse intervention types - from modifying a dietary pattern to introducing a certain nutrient or probiotic microorganism to diet. The measures of response were also quite different across the studies (Supplementary Table 1). Beta diversity is often used as a measure of microbiome response. However, its computational dependence on alpha diversity is rarely taken into account. In our work, we emphasised that this is an important point, especially when searching for the baseline microbiome markers of the microbiome response. In this regard, we proposed to use the R_bray⁵¹ metric as a response measure, for which the above-mentioned computational dependence is much less pronounced, according to our results. Still, this metric is not flawless. One disadvantage is its dependence on the total pool and abundance of bacterial taxa in all analysed samples. On the one hand, this leads to the changes in pairwise dissimilarity between samples when they are analysed in different contexts⁴⁷. On the other hand, average taxa abundance in the context is implicitly taken into account during metric calculation. This may be one of the reasons why we observed more associations with the baseline microbiome features for RC_bray metric compared to other metrics in the analyses where adjustment for alpha diversity was performed. This observation, coupled with the notable differences between the results obtained using different response metrics (Supplementary Table 4) emphasise the need for a future detailed study on appropriate response measures.

Although the investigated interventions were quite different in both duration and the expected effects on microbiome, the intrastudy variance of mean response still notably exceeds the interstudy response variance. The dependence of response on initial microbiome state was significant for the majority of interventions. However, when the interventions were combined, this dependence remained significant only when stratification into enterotypes was applied for the two enterotypes with the highest number of samples. This fact may indicate that the intervention-independent microbiome signatures of response are enterotype-specific. This is also supported by the results observed for individual taxa. Most associations were detected for a specific enterotype, despite a lower statistical power compared to the mixed-effect models over all samples. Interestingly, the main enterotype-specific signatures were negatively associated with the response and included taxa phylogenetically related to the enterotype drivers: unclassified Bacteroides in Bact2 and (Clostridiaceae/Ruminococcaceae) in Rum. This may indicate that in each enterotype, the stability of a sample increases, the closer it is to the enterotype centre - suggesting the latter is an attractor in a certain sense. However, to confirm this hypothesis, a deeper study of this problem is required. Several associations revealed at the level of individual taxa reproduced previously found microbiome stability markers: positively associated with the response Prevotella copri and negatively - unclassified Bacteroides (in the previous study where “shotgun” sequencing was used it was Bacteroides uniformis, in our study this taxon was included in the group for unclassified Bacteroides species)¹⁹.

The relationship with the average number of genes per microorganism in the community (AGN) was observed for the entire pool of samples, as well as for all enterotypes except the Prev. This may be due to a smaller number of samples in the Prev enterotype, as well as a lower compositional variation within it. Average number of genes per organism in the community can be considered as an indicator of the prevailing ecological roles in the community. It is known that a predominance of generalist taxa possessing a higher number of genes and, accordingly, a wider metabolic potential - compared to the specialist taxa - contributes to the microbiome stability⁵⁶. The AGN, together with alpha diversity, determines the functional redundancy of the community: AGN - at the level of an individual organism and alpha diversity - at the community level. Indeed, it is functional diversity and redundancy that are considered in theoretical ecology as stability markers⁵⁷.

Interestingly, however, the association of response with alpha diversity in our study was not negative, as might be expected from the above statements and as it had been observed in several previous studies^32,34,41,43. This discrepancy can have several causes. Firstly, we have used RC_bray response measure, which considers for the computational component in the alpha and beta diversity relationship, and had not previously been applied in the studies investigating microbial communities stability markers. Noteworthy, for other response metrics that did not include a correction for the computational component, the association had an opposite sign (Fig. 3b). Secondly, a negative correlation is observed between AGN and alpha diversity. This means that more diverse communities are formed by microorganisms with fewer genes (Figs. 3, 5), which also corresponds to the literature data⁵⁸. Thus, functional redundancy is achieved through a trade-off between the number of genes per organism and the number of organisms (Fig. 5). However, we would stay cautious in drawing unambiguous conclusions about the relationship between response and alpha diversity. As we mentioned above, a further detailed study of response measures is needed to confirm and understand the nature of this relationship. As for the negative association with AGN, it was observed with multiple response metrics in different analysis types, which makes this association more reliable.

**Fig. 5: Relation between the response, baseline alpha diversity and baseline AGN.**

The identification of intervention-independent response markers is an important step for an assessment of an individual's microbiome stability. However, we have also looked at the problem of assessing stability from the other side. We have estimated stability based on the sample relative position to other individuals’ samples in the space of taxa abundances. We showed a high correlation of this estimate, called the response potential, with the real response in the interventional studies. In addition, using machine learning, we have isolated the part of the response that can be explained by the baseline microbiome, and have shown that for most interventions this variable correlates better with the response potential than with the response itself. However, in the partial correlation networks analysis, some of the intervention-independent associations of the response with the baseline microbiome composition persisted when the response potential was introduced into the networks. This may indicate that the method of the response potential estimation can be improved. One way of such improvement may be stratification of the acceptable microbiome configurations landscape by certain phenotypes, such as sex and age. This is particularly true when the age range is wider than in the data we used (17–65 years). For example, infants’ microbiomes will form a completely different part of the landscape. The same problem will appear when analysing subjects with a certain disease if the disease affects the microbiome composition. It is also noteworthy that the space of analysed interventional studies was generally well covered by the FGFP cohort (Supplementary Fig. 6). Therefore, we performed the calculation of the response potential in the context of the FGFP data. However, it is unlikely this will hold true for any other given study. In such cases, the reference context for calculating the response potential will have to be modified. Finally, we confirmed the associations between the microbiome response markers and the response potential on an independent cohort.

Compared to the 16S rRNA sequencing data, “shotgun” metagenomics/WGS provide a more detailed microbiome portrait, especially its functional potential, subspecies-level diversity and gene content. It might be interesting to confirm our findings on WGS data - when a sufficient number of relevant publicly available datasets of this format is accumulated. As a pilot step, we have validated them - as well as our approach for AGN calculation - using one large “shotgun” dataset (Supplementary Figs. 2, 3).

To conclude, meta-analysis of diverse dietary interventions studies suggests existence of universal baseline microbiome features defining microbiome response. One of the most common features of intervention-resistant communities is high average number of genes per microorganism in the community, likely reflecting enrichment of generalist microorganisms compared to the specialists. Reproducible specificity of response markers across enterotypes, predictability of response from the baseline location in microbiome landscape and problem of dissecting biological and computational components in alpha diversity and response relationship highlight key points to be considered during future gut microbial ecology studies.

Methods

Datasets description

To investigate the dependence of microbiome composition change from its initial state, we used the data from five previously published studies^{35,37,43,44,45}. All the studies investigated microbiome response to the diet intervention using 16S rRNA gene sequencing of stool samples. For the studies where subjects underwent consequential interventions (e.g., a course of one fibre type intake followed by a course of another fibre), we picked a subset of time points corresponding to a single intervention per subject. Overall, we prepared data on eight distinct interventions where each individual was characterised by two time points in the resulting dataset - before and after the intervention. The selected interventions were:

change of dietary pattern towards higher fibre intake for 2 weeks (HFD, N=215)⁴⁴;
intake of a fermented dairy product with Bifidobacterium animalis strain BB-12 for 2 months (FDP, N = 150)⁴⁵;
intake of fermentable fibres/placebo for 2 weeks: resistant starch from potatoes (F.POT, N = 43), resistant starch from maize (F.HIM, N = 43), inulin from chicory root (F.INU, N = 50) and accessible corn starch (F.ACC, N = 39)³⁷;
intake of maltodextrin for 3 weeks (MAL, N = 33)³⁵;
2 weeks washout period followed by 4 weeks of non meat protein source intake (NM, N = 109)⁴³.

Additionally, the FGFP dataset (N = 1106)⁴⁸ including one time point per individual was processed for the purposes of:

enterotyping;
investigating a computational relationship between alpha and beta diversity;
validating the dependencies discovered for the response potential.

One “shotgun” metagenomic dataset⁵³ described microbiome changes during the intervention including intake of blended cooking oils or refined olive oil for 8 weeks (N = 126 subjects, 3 time points); it was used for:

validation of response associations with alpha diversity, AGN and B:F;
validation of AGN calculation.

Data processing

For all 16S studies in the analysis, the raw data generated using Illumina MiSeq or HiSeq sequencing were downloaded from the NCBI Sequence Read Archive (ERP018192⁴⁴, ERP109009⁴⁵, SRP120250³⁵, SRP128128³⁷ and SRP166672⁴³). The FGFP raw reads had been provided by the request via the European Genome Archive (EGAD00001001936). In all studies but the MAL study, the V4 16S rRNA gene region was sequenced. In MAL, V3-V4 regions were analysed, therefore, in order to unify the studies we trimmed the raw reads from this dataset using V4 primer sequences (GTGBCAGCMGCCGCGGTAA, GACTACNVGGGTMTCTAATCC) with cutadapt software⁵⁹. Taxa abundance profiles were calculated on Knomics-Biota platform⁶⁰ using “16S dada2 GreenGenes V4” analysis protocol, which included the following main steps:

denoising by DADA2 algorithm⁶¹ implemented in QIIME2⁶²;
taxonomic classification of obtained amplicon sequence variants (ASV) with QIIME2⁶³ classifier trained on preprocessed GreenGenes database (database preprocessing steps included trimming of 16S rRNA sequences using the primers given above with TaxMan⁶⁴ software and further clustering 97% identical sequences with CD-HIT software⁶⁵);
microbial abundance tables at the levels of species and genus were obtained by summing the relative abundance levels of ASV included in the corresponding clade.

Only samples with >3000 reads post-denoising were included in the analysis. For HFD and FDP studies, we also excluded samples with outlying abnormal composition (e.g., those with dominating Enterobacteriaceae). In F.POT, F.HIM, F.INU and F.ACC interventions having 1–5 samples per time point and subject, we calculated the resulting microbiome profiles by averaging the abundances across the replicates. After preprocessing, the number of subjects per intervention was the following: HFD - 206, FDP - 130, F.POT - 43, F.HIM - 42, F.INU - 50, F.ACC - 39, MAL - 33, NM - 98, FGFP - 1,061. All samples included in the analysis are listed in Supplementary Table 5.

Filtering of relative abundance tables was done by excluding the taxa with abundance >20 reads in <20 samples. All analyses but the enterotyping (see below) were conducted at the level of species. Alpha diversity was calculated using Shannon index after rarefying all abundance tables to the minimum read count in each dataset. We calculated beta diversity using Bray–Curtis, three UniFracs, Jaccard and inverse Pearson dissimilarities after rarefaction and Aitchison dissimilarity - after substituting all zero values in the dataset by pseudocounts (0.5). The Raup–Crick metric based on Bray–Curtis dissimilarities (RC_bray)⁵¹ was calculated after rarefaction (iCAMP package). As RC_bray depends on the overall species pool in the abundance matrix, the values of the metric for the pair of samples - for example, in the context of specific intervention and in the context of all samples from the same enterotype - can slightly differ. Therefore, we provide a description of the abundance tables used to calculate RC_bray metric for different analyses in Supplementary Methods. Visualisation of the relative position of samples in the taxa space was carried out by applying the UMAP algorithm with the default parameters to the matrices of the relative species abundance values⁶⁶.

The “shotgun” dataset used for validation of partial correlation analysis results⁵³ was downloaded from the NCBI Sequence Read Archive (PRJNA728374) (N = 378 samples). The reads were processed using KneadData (https://huttenhower.sph.harvard.edu/kneaddata/) and MetaPhlAn3 software⁶⁷ with “-t rel_ab_w_read_stats” option to obtain the read counts. Only the taxa detected at the level of species were included in further analysis. For computational purposes, the abundance tables were rarefied to 10,000 reads per sample prior to the RC_bray calculation.

Evaluation of computational component in beta and alpha diversity relation

To explore the computational component in alpha and beta diversities relationship, we used two randomisation options on 500 sample pairs from the FGFP cohort (Fig. 1a). In both cases, randomisation allowed us to obtain random taxa abundances while maintaining the alpha diversity value of the sample. Thus, the biological relationship between alpha diversity and microbiome composition was eliminated. After randomisation, we recalculated beta and alpha diversities on randomised abundance tables and used a linear model to explore the relationship between them. In this analysis, Shannon index was used as the metric for alpha diversity, while the Bray–Curtis and Aitchison⁴⁹ dissimilarities, as well as RC_bray⁵¹ - as metrics for beta diversity.

In the first randomisation variant, the names of taxa in each sample were randomly permuted, independently of other samples (Fig. 1a, “randomisation 1”). In the second variant, the procedure similar to the one used in the RC_bray calculation algorithm⁵¹ was applied (Fig. 1a “randomisation 2”). It included the following steps:

average taxa abundances and average taxa prevalence were calculated using the whole abundance table;
for each sample, the list of taxa names equal to the number of unique taxa in the sample were randomly sampled without replacement from the whole taxa pool with the taxa probability proportional to average taxa prevalence.
for each sample, taxa counts equal in sum to the number of reads in this sample were randomly sampled with replacement using only taxa from the list obtained on the previous step with the taxa probability proportional to average taxa abundance.

The first randomisation gave beta diversity values between pairs of samples higher in average than the initial ones, while the second - lower ones. The first randomisation preserves the Shannon index of the sample, while after the second changes it slightly.

Enterotyping

We performed enterotyping in the context of the FGFP cohort on the level of genera⁴⁸. Firstly, we calculated enterotypes on genus-level abundance tables for the FGFP cohort using Dirihlet multinomial mixture models⁶⁸ (R package DirichletMultinomial⁶⁹). Enterotypes’ names for the FGFP cohort were matched with those described by Vandeputte et al.⁵² using the criteria listed by Valles-Colomer et al.⁷⁰. Then, we performed classification of the samples from the interventional studies into obtained enterotypes by calculating Bray–Curtis dissimilarity between the classified sample and each enterotype medoid. The sample was assigned an enterotype according to the medoid providing the lowest dissimilarity value.

Estimation of the mean number of genes per microorganism in the community

We estimated the average number of genes per microorganism in the community (AGN) using information from the NCBI database about prokaryotic genome assemblies (https://www.ncbi.nlm.nih.gov/genome/browse/#!/prokaryotes/). The evaluation was performed on the level of species. For all ASVs whose taxonomic assignment was resolved on the level of species (no “unclassified” term in the taxonomy), we calculated the average number of genes in the NCBI assemblies corresponding to this species (Supplementary Fig. 7). For ASVs taxonomically resolved only at the genus level (for example, “Bacteroides unclassified”), we performed averaging over all assemblies belonging to a specific genus. We excluded from the analysis the ASVs resolved only at the family level or higher (for example, “Clostridiaceae unclassified”), as well as the ASVs with taxonomies having no match in the NCBI database. Next, we calculated the weighted average number of genes per organism in the sample by multiplying the number of genes determined for each ASV by its relative abundance (read count) and dividing by the total number of reads belonging to all ASVs that were taken into account in the analysis for this sample. The obtained AGN was compared to estimation of average genome size with MicrobeCensus⁵⁴ using “shotgun” data described above.

Statistical analysis

Most statistical tests aimed to investigate microbiome response dependence on initial composition were performed in three variants:

using the samples from all eight interventions in one model with the correction for the intervention;
using samples from each enterotype separately with the correction for the intervention;
using samples from each intervention separately.

Firstly, we analysed overall variance of response associations with baseline microbiome content using distance-based redundancy analysis dbRDA with 7000 permutations⁷¹ (R vegan package⁷²). For dbRDA analysis, we scaled RC_bray metric to the [0;1] interval. Correction for the intervention was implemented through stratification of the data during permutations.

Then we analysed associations of the response with alpha diversity, B:F ratio and AGN using a partial correlation network. This analysis method was chosen because all four components have significant correlations between each other, and we are interested in the strongest ones. During a partial correlation network analysis, an association between two components i and j is calculated as a Pearson coefficient between two linear models residuals. Each linear model includes one of the analysed components (i or j) as a predicted variable and all components of the network except i and j - as predictors. Correction for the intervention was conducted by using a mixed effect linear model with intervention as a random effect instead of a simple linear model. Correction for multiple comparisons was performed using the Benjamini–Hochberg method. The partial correlation network from five components (including response potential) was constructed in the same way. For this analysis, we performed a validation using a “shotgun” dataset described above⁵³.

We analysed association of individual taxa abundance with the response using a linear model after clr transformation of the abundance tables⁴⁹. The analysis was performed at the level of species. Zero abundance values were replaced with pseudocounts (0.5). Correction on intervention was conducted by using a mixed effect linear model with intervention as a random effect instead of a simple linear model. Correction for multiple comparisons was performed using the Benjamini–Hochberg method.

Correlation of response with baseline metadata collected in studies was evaluated using a mixed effect model with intervention as a random effect or simple linear model (the former was used when the factor was available for several interventions, while the latter - when the factor was collected in only one intervention.)

Microbiome response potential

To investigate the universality of discovered associations between microbiome composition and response, we introduced the definition of the response potential as a response component determined solely by the initial microbiome state (see Results). We proposed an approach to estimate response potential as the mean RC_bray value between the sample and the samples from the same enterotype belonging to a relatively large number of other individuals (we used more than 300 individuals per enterotype) (the justification can be found in the Results section). This approach allows one to estimate response potential from the data including 1 time point per individual.

Machine learning approach to test the response potential estimate

A machine learning approach was used to determine the extent to which the response component predictable from the baseline microbiome composition is correlated with the proposed estimate of the response potential. For this purpose, we constructed a model aimed to predict the response based on baseline taxa abundances. It was constructed separately for each intervention using the gradient boosted trees (XGBoost)⁷³. The arcsine square root transformed baseline relative abundances on the species level were used as predictors. The following parameters were used in the model: tree depth - 15, number of trees - 1000, learning rate (step size) - 0.3. Model quality was evaluated via a cross-validation procedure by randomly dividing the data into training and testing sets 50 times in 2:1 ratio. We performed predictors filtration and selection on each iteration based on the information in the training set. During filtration, taxa with zero abundance in >30% of the samples were removed. Predictors selection included training of 10 additional XGBoost models on the training set of the iteration, followed by evaluation of the predictors’ importance. These additional models had the same parameters and structure as the main one, except for the number of trees (300). We estimated predictors importance as its relative contribution to the improvement of the prediction in each tree in the model (xgboost R package⁷⁴). The 30 predictors with the highest importance were chosen to construct the main model on specific iteration. To assess the quality of the resulting regression, we calculated R² values by the method proposed in the caret package⁷⁵ (representing the square of the Pearson coefficient between the true and predicted response) at each iteration. The average R² values across all iterations were used as a model quality measure.

We compared the regression quality with the quality of a random model. The latter was obtained by repeating cross-validation for the abundance matrices with randomly shuffled sample names. After that, we analysed how the quality of the regression has changed when the response values were replaced by the values of response potential in testing set during cross-validation. If we observed similarity of R² values between this and initial models in comparison to the random model, that would indicate that the proposed method of response potential calculation is a good estimate for the component of response determined by baseline microbiome state.

Data availability

In this study, we re-analysed raw data from previously published datasets. Accession numbers are listed in the Materials and Methods section.

Code availability

All preprocessed data (ASV tables) (except FGFP dataset) as well as the code to reproduce all of the analyses are available at https://bitbucket.org/natasha_klmnk/art_variation_public.

References

Zhou, Y. et al. Gut Microbiota Offers Universal Biomarkers across Ethnicity in Inflammatory Bowel Disease Diagnosis and Infliximab Response Prediction. mSystems 3, e00188–17 (2018).
Article CAS PubMed PubMed Central Google Scholar
Halfvarson, J. et al. Dynamics of the human gut microbiome in inflammatory bowel disease. Nat. Microbiol 2, 17004 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gevers, D. et al. The treatment-naive microbiome in new-onset Crohn’s disease. Cell Host Microbe 15, 382–392 (2014).
Article CAS PubMed PubMed Central Google Scholar
Pascal, V. et al. A microbial signature for Crohn’s disease. Gut 66, 813–822 (2017).
Article CAS PubMed Google Scholar
Louis, P., Hold, G. L. & Flint, H. J. The gut microbiota, bacterial metabolites and colorectal cancer. Nat. Rev. Microbiol. 12, 661–672 (2014).
Article CAS PubMed Google Scholar
Zeller, G. et al. Potential of fecal microbiota for early‐stage detection of colorectal cancer. Mol. Syst. Biol. 10, 766 (2014).
Article PubMed PubMed Central Google Scholar
Wang, Y. et al. Gut microbiome analysis of type 2 diabetic patients from the Chinese minority ethnic groups the Uygurs and Kazaks. PLoS One 12, e0172774 (2017).
Article PubMed PubMed Central Google Scholar
Forslund, K. et al. Disentangling type 2 diabetes and metformin treatment signatures in the human gut microbiota. Nature 528, 262–266 (2015).
Article CAS PubMed PubMed Central Google Scholar
de la Cuesta-Zuluaga, J., Corrales-Agudelo, V., Carmona, J. A., Abad, J. M. & Escobar, J. S. Body size phenotypes comprehensively assess cardiometabolic risk and refine the association between obesity and gut microbiota. Int. J. Obes. 42, 424–432 (2018).
Article Google Scholar
Yun, Y. et al. Comparative analysis of gut microbiota associated with body mass index in a large Korean cohort. BMC Microbiol. 17, 151 (2017).
Article PubMed PubMed Central Google Scholar
Jie, Z. et al. The gut microbiome in atherosclerotic cardiovascular disease. Nat. Commun. 8, 845 (2017).
Article PubMed PubMed Central Google Scholar
Cui, L., Zhao, T., Hu, H., Zhang, W. & Hua, X. Association study of gut flora in coronary heart disease through high-throughput sequencing. Biomed. Res. Int. 2017, 3796359 (2017).
Article PubMed PubMed Central Google Scholar
Kirjavainen, P. V., Arvola, T., Salminen, S. J. & Isolauri, E. Aberrant composition of gut microbiota of allergic infants: a target of bifidobacterial therapy at weaning? Gut 51, 51–55 (2002).
Article CAS PubMed PubMed Central Google Scholar
Petrov, V. A. et al. Analysis of gut microbiota in patients with Parkinson’s disease. Bull. Exp. Biol. Med. 162, 734–737 (2017).
Article CAS PubMed Google Scholar
De Angelis, M. et al. Fecal microbiota and metabolome of children with autism and pervasive developmental disorder not otherwise specified. PLoS One 8, e76993 (2013).
Article PubMed PubMed Central Google Scholar
Kuznetsova, V. et al. Gut microbiome signature of Viliuisk encephalomyelitis in Yakuts includes an increase in microbes linked to lean body mass and eating behaviour. Orphanet J. Rare Dis. 15, 327 (2020).
Article PubMed PubMed Central Google Scholar
Salonen, A. & de Vos, W. M. Impact of diet on human intestinal microbiota and health. Annu. Rev. Food Sci. Technol. 5, 239–262 (2014).
Article CAS PubMed Google Scholar
David, L. A. et al. Host lifestyle affects human microbiota on daily timescales. Genome Biol. 15, R89 (2014).
Article PubMed PubMed Central Google Scholar
Johnson, A. J. et al. Daily sampling reveals personalized diet-microbiome associations in humans. Cell Host Microbe 25, 789–802.e5 (2019).
Article CAS PubMed Google Scholar
Vandeputte, D. et al. Temporal variability in quantitative human gut microbiome profiles and implications for clinical research. Nat. Commun. 12, 6740 (2021).
Article CAS PubMed PubMed Central Google Scholar
Biesiekierski, J. R., Jalanka, J. & Staudacher, H. M. Can gut microbiota composition predict response to dietary treatments? Nutrients 11, 1134 (2019).
Article CAS PubMed Central Google Scholar
Kolodziejczyk, A. A., Zheng, D. & Elinav, E. Diet–microbiota interactions and personalized nutrition. Nat. Rev. Microbiol. 17, 742–753 (2019).
Article CAS PubMed Google Scholar
Mills, S. et al. Precision nutrition and the microbiome part II: potential opportunities and pathways to commercialisation. Nutrients 11, 1468 (2019).
Article CAS PubMed Central Google Scholar
Korpela, K. et al. Gut microbiota signatures predict host and microbiota responses to dietary interventions in obese individuals. PLoS One 9, e90702 (2014).
Article PubMed PubMed Central Google Scholar
Zmora, N. et al. Personalized gut mucosal colonization resistance to empiric probiotics is associated with unique host and microbiome features. Cell 174, 1388–1405.e21 (2018).
Article CAS PubMed Google Scholar
Griffin, N. W. et al. Prior dietary practices and connections to a human gut microbial metacommunity alter responses to diet interventions. Cell Host Microbe 21, 84–96 (2017).
Article CAS PubMed Google Scholar
Smillie, C. S. et al. Strain tracking reveals the determinants of bacterial engraftment in the human gut following fecal microbiota transplantation. Cell Host Microbe 23, 229–240.e5 (2018).
Article CAS PubMed PubMed Central Google Scholar
Walter, J. & Ley, R. The human gut microbiome: ecology and recent evolutionary changes. Annu. Rev. Microbiol. 65, 411–429 (2011).
Article CAS PubMed Google Scholar
Liu, Z., de Vries, B., Gerritsen, J., Smidt, H. & Zoetendal, E. G. Microbiome-based stratification to guide dietary interventions to improve human health. Nutr. Res. 82, 1–10 (2020).
Article CAS PubMed Google Scholar
Lampe, J. W., Navarro, S. L., Hullar, M. A. J. & Shojaie, A. Inter-individual differences in response to dietary intervention: integrating omics platforms towards personalised dietary recommendations. Proc. Nutr. Soc. 72, 207–218 (2013).
Article PubMed PubMed Central Google Scholar
Healey, G. et al. Habitual dietary fibre intake influences gut microbiota response to an inulin-type fructan prebiotic: a randomised, double-blind, placebo-controlled, cross-over, human intervention study. Br. J. Nutr. 119, 176–189 (2018).
Article CAS PubMed Google Scholar
Salonen, A. et al. Impact of diet and individual variation on intestinal microbiota composition and fermentation products in obese men. ISME J. 8, 2218–2230 (2014).
Article CAS PubMed PubMed Central Google Scholar
Cotillard, A. et al. Dietary intervention impact on gut microbial gene richness. Nature 500, 585–588 (2013).
Article CAS PubMed Google Scholar
Tap, J. et al. Gut microbiota richness promotes its stability upon increased dietary fibre intake in healthy adults. Environ. Microbiol. 17, 4954–4964 (2015).
Article CAS PubMed Google Scholar
Healey, G. et al. Habitual dietary fibre intake influences gut microbiota response to an inulin-type fructan prebiotic: a randomised, double-blind, placebo-controlled, cross-over, human intervention study. Br. J. Nutr. 119, 176–189 (2018).
Article CAS PubMed Google Scholar
Davis, L. M. G., Martínez, I., Walter, J. & Hutkins, R. A dose dependent impact of prebiotic galactooligosaccharides on the intestinal microbiota of healthy adults. Int. J. Food Microbiol. 144, 285–292 (2010).
Article CAS PubMed Google Scholar
Baxter, N. T. et al. Dynamics of human gut microbiota and short-chain fatty acids in response to dietary interventions with three fermentable fibers. MBio 10, e02566–18 (2019).
Article CAS PubMed PubMed Central Google Scholar
Venkataraman, A. et al. Variable responses of human microbiomes to dietary supplementation with resistant starch. Microbiome 4, 33 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kolida, S., Meyer, D. & Gibson, G. R. A double-blind placebo-controlled study to establish the bifidogenic dose of inulin in healthy humans. Eur. J. Clin. Nutr. 61, 1189–1195 (2007).
Article CAS PubMed Google Scholar
Bouhnik, Y. et al. The capacity of nondigestible carbohydrates to stimulate fecal bifidobacteria in healthy humans: a double-blind, randomized, placebo-controlled, parallel-group, dose-response relation study. Am. J. Clin. Nutr. 80, 1658–1664 (2004).
Article CAS PubMed Google Scholar
Ma, C. et al. Differential pattern of indigenous microbiome responses to probiotic Bifidobacterium lactis V9 consumption across subjects. Food Res. Int. 136, 109496 (2020).
Article CAS PubMed Google Scholar
Maldonado-Gómez, M. X. et al. Stable engraftment of bifidobacterium longum AH1206 in the human gut depends on individualized features of the resident microbiome. Cell Host Microbe 20, 515–526 (2016).
Article PubMed Google Scholar
Lang, J. M. et al. Impact of individual traits, saturated fat, and protein source on the gut microbiome. MBio 9, e01604–18 (2018).
Article PubMed PubMed Central Google Scholar
Klimenko, N. S. et al. Microbiome responses to an uncontrolled short-term diet intervention in the frame of the citizen science project. Nutrients 10, 576 (2018).
Article PubMed Central Google Scholar
Volokh, O. et al. Human gut microbiome response induced by fermented dairy product intake in healthy volunteers. Nutrients 11, 547 (2019).
Article CAS PubMed Central Google Scholar
Zmora, N. et al. Personalized gut mucosal colonization resistance to empiric probiotics is associated with unique host and microbiome features. Cell 174, 1388–1405.e21 (2018).
Article CAS PubMed Google Scholar
Chase, J. M., Kraft, N. J. B., Smith, K. G., Vellend, M. & Inouye, B. D. Using null models to disentangle variation in community dissimilarity from variation in α-diversity. Ecosphere 2, 1–11 (2011).
Article Google Scholar
Falony, G. et al. Population-level analysis of gut microbiome variation. Science 352, 560–564 (2016).
Article CAS PubMed Google Scholar
Aitchison, J. The statistical analysis of compositional data (Chapman and Hall, London, 1986).
Raup, D. M. & Crick, R. E. Measurement of faunal similarity in paleontology. J. Paleontol. 53, 1213–1227 (1979).
Google Scholar
Stegen, J. C. et al. Quantifying community assembly processes and identifying features that impose them. ISME J. 7, 2069–2079 (2013).
Article PubMed PubMed Central Google Scholar
Vandeputte, D. et al. Quantitative microbiome profiling links gut community variation to microbial load. Nature 551, 507–511 (2017).
Article CAS PubMed Google Scholar
Lim, R. R. X. et al. Gut microbiome responses to dietary intervention with hypocholesterolemic vegetable oils. NPJ Biofilms Microbiomes 8, 24 (2022).
Article CAS PubMed PubMed Central Google Scholar
Nayfach, S. & Pollard, K. S. Average genome size estimation improves comparative metagenomics and sheds light on the functional ecology of the human microbiome. Genome Biol. 16, 51 (2015).
Article PubMed PubMed Central Google Scholar
Olesen, J. M. et al. Chapter 1 - From Broadstone to Zackenberg: Space, Time and Hierarchies in Ecological Networks. Adv. Ecol. Res. 42, 1–69 (2010).
Article Google Scholar
Matias, M. G., Combe, M., Barbera, C. & Mouquet, N. Ecological strategies shape the insurance potential of biodiversity. Front. Microbiol. 3, 432 (2012).
PubMed Google Scholar
McCann, K. S. The diversity-stability debate. Nature 405, 228–233 (2000).
Article CAS PubMed Google Scholar
Bello, M. D., Lee, H., Goyal, A. & Gore, J. Resource–diversity relationships in bacterial communities reflect the network structure of microbial metabolism. Nat. Ecol. Evolution 5, 1424–1434 (2021).
Article Google Scholar
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 17, 10–12 (2011).
Article Google Scholar
Efimova, D. et al. Knomics-Biota - a system for exploratory analysis of human gut microbiota data. BioData Min. 11, 25 (2018).
Article PubMed PubMed Central Google Scholar
Callahan, B. J. et al. DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods 13, 581–583 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bolyen, E. et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 37, 852–857 (2019).
Article CAS PubMed PubMed Central Google Scholar
Bokulich, N. A. et al. Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin. Microbiome 6, 90 (2018).
Article PubMed PubMed Central Google Scholar
Brandt, B. W., Bonder, M. J., Huse, S. M. & Zaura, E. TaxMan: a server to trim rRNA reference databases and inspect taxonomic coverage. Nucleic Acids Res 40, W82–W87 (2012).
Article CAS PubMed PubMed Central Google Scholar
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Article CAS PubMed PubMed Central Google Scholar
McInnes, L., Healy J., Saul, N. & Großberger, L. UMAP: uniform manifold approximation and projection. J. Open Source Softw. 3, 861 (2018).
Article Google Scholar
Beghini, F. et al. Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3. Elife 10, e65088 (2021).
Article CAS PubMed PubMed Central Google Scholar
Holmes, I., Harris, K. & Quince, C. Dirichlet multinomial mixtures: generative models for microbial metagenomics. PLoS One 7, e30126 (2012).
Article CAS PubMed PubMed Central Google Scholar
Morgan, M. DirichletMultinomial for Clustering and Classification of Microbiome Data. Vienna: R package. R Foundation for Statistical Computing (2014).
Valles-Colomer, M. et al. The neuroactive potential of the human gut microbiota in quality of life and depression. Nat. Microbiol 4, 623–632 (2019).
Article CAS PubMed Google Scholar
Legendre, P. & Anderson, M. J. Distance-based redundancy analysis: testing multispecies responses in multifactorial ecological experiments. Ecol. Monogr. 69, 1–24 (1999).
Article Google Scholar
Dixon, P. VEGAN, a package of R functions for community ecology. J. Veg. Sci. 14, 927–930 (2003).
Article Google Scholar
Chen, T. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794 (2016).
Chen, T. et al. Xgboost: extreme gradient boosting. R package version 0. 4-2 1, 1–4 (2015).
Kuhn, M. Building predictive models in R using the caret package. J. Stat. Softw. 28, 1–26 (2008).
Article Google Scholar

Download references

Acknowledgements

This work was supported by the Russian Science Foundation (project #22-24-00683). V.O. and A.T. acknowledge their finances from Atlas Biomed Group - Knomx LLC. The authors would like to thank Mikhail Gelfand and his prokaryotic research group for the discussion and proposing the continuous response analysis. We acknowledge the help of the FGFP team for kindly providing access to the FGFP cohort microbiome datasets.

Author information

Authors and Affiliations

Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
Natalia S. Klimenko, Anastasia Revel-Muroz & Alexander V. Tyakht
Atlas Biomed Group - Knomx LLC, Tintagel House, 92 Albert Embankment, Lambeth, London, SE1 7TY, UK
Vera E. Odintsova & Alexander V. Tyakht

Authors

Natalia S. Klimenko
View author publications
You can also search for this author in PubMed Google Scholar
Vera E. Odintsova
View author publications
You can also search for this author in PubMed Google Scholar
Anastasia Revel-Muroz
View author publications
You can also search for this author in PubMed Google Scholar
Alexander V. Tyakht
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.K. and A.T. designed the study. N.K., V.O., A.R. and A.T. performed data analysis. N.K., V.O., A.R. and A.T. wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Natalia S. Klimenko.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Tables

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Klimenko, N.S., Odintsova, V.E., Revel-Muroz, A. et al. The hallmarks of dietary intervention-resilient gut microbiome. npj Biofilms Microbiomes 8, 77 (2022). https://doi.org/10.1038/s41522-022-00342-8

Download citation

Received: 05 May 2022
Accepted: 28 September 2022
Published: 08 October 2022
DOI: https://doi.org/10.1038/s41522-022-00342-8

This article is cited by

Cluster-specific associations between the gut microbiota and behavioral outcomes in preschool-aged children
- Marcel van de Wouw
- Yanan Wang
- Gerald F. Giesbrecht
Microbiome (2024)
Multi-omics Analysis Reveals the Crucial Mediators of DJB in the Treatment of Type 2 Diabetes
- Hailing Guo
- Qingzhu Ding
- Shangeng Weng
Obesity Surgery (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

The Spanish gut microbiome reveals links between microorganisms and Mediterranean diet

Impact of long-term dietary habits on the human gut resistome in the Dutch population

Intermittent fasting modulates the intestinal microbiota and improves obesity and host energy metabolism

Introduction

Results

The metrics of microbiome response to dietary interventions

Dependence of response on the baseline microbiome composition

Microbiome response potential

Discussion

Methods

Datasets description

Data processing

Evaluation of computational component in beta and alpha diversity relation

Enterotyping

Estimation of the mean number of genes per microorganism in the community

Statistical analysis

Microbiome response potential

Machine learning approach to test the response potential estimate

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Supplementary Tables

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Cluster-specific associations between the gut microbiota and behavioral outcomes in preschool-aged children

Multi-omics Analysis Reveals the Crucial Mediators of DJB in the Treatment of Type 2 Diabetes

Comments

Search

Quick links