Introduction

Sodium benzoate being a safe antimicrobial agent is commonly used as a food preservative. The human intestine receives an average of 0ā€“5Ā mgĀ kgāˆ’1 of benzoate per day either as a food additive1 or as a metabolic end-product of polyphenol diets2. Metabolism of the sodium benzoate is well described in humans1. Almost all the absorbed sodium benzoate is excreted out of the human body through the urogenital system as amino acids adduct i.e. hippuric acid3,4. The human metabolic framework is considered as a sole element in sodium benzoate detoxification4. From the last few decades, microbes have been identified as a key human companion and considered as a virtual organ of the human system5. Human microbial companion known as the human microbiome resides in and on the human body6 and plays a vital role in the various physiological processes5,6 that cumulatively allows the host to sustain healthy physiology6. Even microbiome dysbiosis either due to any genetic7 or epigenetic changes8 results in the onset of various physiological5,9 and metabolic disorders10.

Benzoate is a well-proven antimicrobial agent11, however, its limited availability in the human gut might have failed to induce any antimicrobial effect on gut microbes. Despite limited concentration, antimicrobial food additives were found to induce gut metagenome enrichment for their catabolism12. Likewise, continuous dietary benzoate exposure might have allowed the human gut microbes to evolve benzoate catabolic machinery. However, to date, no efforts have been made to explore the benzoate catabolic machinery of the human gut microbes. Simultaneously, sodium benzoate metabolic studies in various primates also strongly indicate the possibility of an alternate mechanism in addition to the glycine conjugation4,5. Additionally, free-living microbes like Pseudomonas putida13, Rhodopseudomonas palustris14, Thauera aromatica15, Azoarcus sp.16, Rhodococcus sp.17, etc.,18,19,20,21 were characterized for benzoate catabolic potential. These microbes were identified to catabolize benzoate either with aerobic or anaerobic mechanisms13,14,15,16,17,18,19,20,21. Gene products for these pathways were identified from gram-positive, as well as gram-negative microorganisms, indicating their invariable presence within the microbial clade13,14,15,16,17,18,19,20,21. All this information generates a strong possibility for the presence of efficient catabolic machinery among human gut microbes to counter the antimicrobial effect of the sodium benzoate. As there is a paucity of information, the current study has been designed with a culture-independent approach to explore the human gut microbiome for its structure, genetic makeover, as well as its metabolic machinery involved in benzoate catabolism. The outcome of this study would enhance our understanding of human microbiome functions, defining essential features for microbiome establishment, sustenance, and colonization. This information could be harnessed to take forward the concept of microbiome therapeutics for human healthcare.

Results

Metagenomic analysis of human gut microbiome

A total of 40,273,834 sequences totaling 12,068,464,866 basepairs with an average length of 300bps were obtained after paired-end sequencing chemistry with MiSeq System (Illumina, USA) (Supplementary Table S1).

Human gut microbiome composition based on metagenomic RNA and Protein features

Quality filtered metagenomic sequences have a representation of ribosomal RNA features within 245,344 sequences, while 6,868,080 sequences (56.86%) contain predicted proteins with known functions, and 5,208,955 sequences (43.13%) contain predicted proteins with unknown functions (Supplementary Table S1). The majority of the ribosomal features (98.41ā€‰Ā±ā€‰1.18%) shared their taxonomic affiliation within the eubacterial clade, while a very small fraction (0.68ā€‰Ā±ā€‰0.7%) were identified to share their homology within Eukaryota domain in addition to 0.9ā€‰Ā±ā€‰0.48% unclassified RNA features (Fig.Ā 1a). Likewise, Phylogenetic analysis based on RNA features, the majority of identified protein features (99.22ā€‰Ā±ā€‰0.51%) shared homology with the protein affiliated within the eubacterial domain followed by Archaea (0.48ā€‰Ā±ā€‰0.35%), Eukaryota (0.23ā€‰Ā±ā€‰0.13%), Viruses (0.06ā€‰Ā±ā€‰0.03%), and others (0.0015ā€‰Ā±ā€‰0.001%) (Fig.Ā 1b).

Figure 1
figure 1

Taxonomic and functional characterization of the human gut metagenome. Microbiome composition based on rRNA features (a) and protein features (b). Subsystem database clustering of the protein features based on their probable functional potential subsystem database (c). Taxonomic distribution of eubacterial specific rRNA features and protein features at phylum level is shown in the inset (a,b).

Metabolic potential of the human gut microbes

A plethora of microbial companions in the human gut play a variety of functions for their survival and maintenance within the human gut5,6. Protein features of the human gut metagenome were used to create the annotation profile to define their probable functional categories in the subsystem database. Subsystem-based categorization indicated more than half (52.91%) of the total identified protein features associated with general cellular metabolism (carbohydrate-, protein-, amino acids-, nucleic acid- metabolism). Most surprisingly, a limited number of protein features (1.37ā€‰Ā±ā€‰0.11%) are involved in microbial stress response physiology (dormancy & sporulation, stress response), while an elevated number of protein features (5.65ā€‰Ā±ā€‰0.75%) are associated with virulence and genetic transformations (Fig.Ā 1c). The human gut is comparatively more stable than its environment counterpart which could be limiting the requirements of stress response features.

Benzoate metabolic features in the human gut metagenome

Metagenomic exploration of the human gut microbiome has identified genetic features associated with anaerobic (Benzoyl-CoA degradation pathway) and aerobic (oxygenase-coupled central aromatic intermediate metabolic pathway) (Table 1). These genetic features were predominantly dispersed among various microbial groups like Proteobacteria, Firmicutes, Bacteroidetes, and Actinobacteria (Supplementary Fig. S1, S2, and S3).

Table 1 Human gut microbiome protein features mapped for sodium benzoate catabolism.

Metabolic features for anaerobic benzoate metabolism

The anaerobic benzoyl-CoA degradation pathway is a predominantly multistep process. Initially, benzoate is converted into benzoyl CoA in the activation process followed by its conversion into Aliphatic C7 dicarboxyl CoA by the dearomatization and Ī²-oxidation in the upper pathway18,19,20,21. Aliphatic C7 dicarboxyl CoA is further converted into acetyl CoA and CO2 in lower pathway18,19,20,21. Denitrifying bacteria like Thauera aromatica, Azoarcus sp. and strict anaerobic Fe(III)-reducing Geobacter metallireducens, Syntrophus aciditrophicus employ this pathway to utilize benzoate to meet energy demands18. Exploration of the human gut metagenome identifies genes encoding benzoyl-CoA reductase (BadD, BadE, BadF, BadG, and BadH) with the potential to catalyze benzoyl CoA dearomatization. Glutaryl-CoA dehydrogenase, 3-hydroxybutyryl-CoA dehydrogenase, and acetyl-CoA acetyltransferase associated with a lower pathway of anaerobic benzoate metabolism (Table 1, Fig.Ā 2). Surprisingly, there was not a single hit for benzoate CoA ligase in the studied metagenome. The absence of this enzyme generates a probability that either this function is being carried out by a novel enzyme or these pathways are inactive. Phylogenetic affiliation analysis indicates that these protein features were predominantly affiliated with Firmicutes (87.5%) followed by Proteobacteria (8.86%), Actinobacteria (0.73%), and unclassified bacteria (2.91%) (Supplementary Fig. S1). Within Firmicutes, the majority of protein features were affiliated to the Ruminococcaceae (29.7%), Lachnospiraceae (14.5%), and Veillonellaceae (9.14%) (Supplementary Fig. S1).

Figure 2
figure 2

The delineated anaerobic pathways of benzoate metabolism in the human gut microbiome. The relative abundance of the benzoate catabolic protein features in the current metagenome datasets is shown in percentage.

Metabolic features for aerobic benzoate metabolism

The aerobic benzoate metabolic pathway involves an oxygenase-dependent upper pathway converting benzoate into central aromatic intermediates like catechol or protocatechuate19,22. These central aromatic intermediates were catabolized into TCA cycle intermediates in a lower metabolic step which involves dearomatization ring cleavage of central intermediates19,20,21,22. Metagenomic exploration identified protein features for cellular uptake of benzoate (BenK) or benzoate derivatives (PcaK) and catalyzing upper and lower pathway of aerobic benzoate metabolism (Table 1). Benzoate-1,2-dioxygenase (BenA), 4 hydroxybenzoate-3-monooxygenase features of the current metagenomic dataset could catalyze the upper pathway reaction by converting benzoate into catechol and protocatechuate respectively. Catechol and protocatechuate are central aromatic intermediates that were catabolized by the enzymes associated with the convergent lower metabolic pathway12,22. Human gut metagenome indicates the presence of cat operon genes (catA, catB, catC) encoding proteins homologous to the catechol-1,2-dioxygenase (CatA), muconate cycloisomerase (CatB), and muconolactone D isomerase (CatC), respectively. These proteins were characterized in Pseudomonas putida13,22 for the ortho cleavage of catechol into 2- (2-oxo-3H-furan-5-yl) acetic acid (Fig.Ā 3). The 2- (2-oxo-3H-furan-5-yl) acetic acid is metabolized into TCA cycle intermediates by the encoded proteins of pcaD, pcaI/J, and pcaF genes of the pca operon as a convergent metabolic pathway (Fig.Ā 3). Proteins associated with benzoate catabolism via ortho-cleavage of catechol were phylogenetically affiliated with the proteins of Bacteroidetes/Chlorobi (62.78%), Proteobacteria (20.278%), Firmicutes (12.5%), Chloroflexi (0.278%), and unclassified bacteria (4.167%) (Supplementary Fig. S2).

Figure 3
figure 3

Mapping of the metabolic features associated with catechol mediated ortho-cleavage pathway of benzoate metabolism. The relative abundance of the benzoate catabolic protein features in the current metagenome datasets is shown in percentage.

Manual curation of the current metagenomics dataset indicates several genes sharing homology with pcaB, pcaC, pcaD, pcaG/H, pcaI, pcaJ, pcaK, pcaF, and pcaR encoding proteins associated with catabolism of the protocatechuate into the TCA cycle intermediates19,20,21,22 (Table 1, Fig.Ā 4). These genes encode p-hydroxybenzoate transporter (PcaK) for cellular transport of benzoate. The p-hydroxybenzoate can be sequentially metabolized by the PcaG/H, PcaB, PcaC, PcaD, PcaI/J, PcaF into succinyl CoA (Fig.Ā 4). In addition to these protein features, the current dataset harbors PcaR to regulate the expression of the pca operon23. Proteins associated with the protocatechuate branch of the beta-ketoadipate pathway were phylogenetically affiliated with Firmicutes (44.75%), Bacteroidetes/Chlorobi (24.52%), Proteobacteria (18.46%), Verrucomicrobia (1.182%), Actinobacteria (0.88%), and unclassified bacteria (10.19%) (Supplementary Fig. S3).

Figure 4
figure 4

Mapping of the metabolic features associated with oxygenase coupled protocatechuate mediated aerobic pathway for benzoate metabolism. The relative abundance of the benzoate catabolic protein features in the current metagenome datasets is shown in percentage.

Human gut metagenome datasets of the diverse populations

Human gut metagenome datasets (nā€‰=ā€‰201) of diverse populations were downloaded (Supplementary Table S2) and analyzed for genetic and functional diversity to assess the similarity and uniqueness of the current metagenome datasets. Ribosomal features of the current metagenomic dataset showed a good correlation of (rā€‰>ā€‰0.99) at the domain level with ribosomal features identified in other metagenome datasets. The PERMANOVA test was used to assess how much the overall variation could be explained in the gut metagenomic datasets of the studied populations (Supplementary Table S3). Though no notable separation (Pā€‰=ā€‰1.0) was observed among gut metagenome datasets of the individuals from India, Japan, Venezuela, and USA, however these metagenomes were found significantly distinct (Pā€‰ā‰¤ā€‰0.05) from metagenome datasets of Malaysian, Malawi, and European populations. Similarly, metagenome datasets of Malaysian and Malawi population did not show any significant difference (Pā€‰=ā€‰0.59). The metagenome dataset of the European population was remarkably distinct from other studied populations (Pā€‰ā‰¤ā€‰0.05). Principal coordinate analysis (PCoA) analysis also made a similar observation (Fig.Ā 5a). Heatmap analysis indicated the distribution pattern of the subsystem categorized protein features among gut metagenomes (Fig.Ā 5b). Exploration of the gut metagenome datasets indicated the presence of the protein features associated with benzoate catabolism in each gut metagenome dataset (Supplementary Table S4). Principal coordinate analysis (PCoA) with benzoate catabolic protein features showed clustering of various populations except the Malaysian population (Fig.Ā 6a). Despite the presence of benzoate catabolic protein features in every metagenome dataset, heatmap indicated their differential distribution among the gut metagenome datasets (Fig.Ā 6b). The omnipresence of the benzoate catabolic protein features in studied gut metagenomes (Supplementary Table S4) clearly indicates the metabolic potential of the gut microbiome to catabolize benzoate, however, their functioning needs to be assessed through functional assays.

Figure 5
figure 5

A comparison of the metabolic potential of the gut metagenomes across diversified populations. Principle coordinate analysis (PCoA) (a) and heatmap (b) showing the functional relationship and distribution of subsystem protein features among gut metagenomes of various populations (Malawi (MA), Malaysia (ML), Japan (JP), Europe (EU), USA (US), Venezuela (VZ), and India (IN)).

Figure 6
figure 6

Benzoate catabolic protein features across the various gut metagenomes. Principal component analysis (a) and heatmap (b) showing the functional relationship and distribution of benzoate catabolic protein features among the gut metagenomes of various populations (Malawi (MA), Malaysia (ML) Japan (JP), Europe (EU), USA (US), Venezuela (VZ), and India (IN)).

Functional assay for sodium benzoate catabolism

Functional assessment of sodium benzoate catabolism was performed to identify functional benzoate metabolic machinery of the human gut microbiome. LCā€“MS profile assessment indicated that the human gut microbes have completely catabolized the available sodium benzoate within 30Ā min of incubation (Supplementary Fig. S4). Additionally, many metabolic features of the mass spectra were positively matched with several metabolites associated with benzoate catabolic pathways (Table 2). Identified metabolites were only found associated with aerobic protocatechuate intermediated benzoate metabolic pathway.

Table 2 List of statistically significant (pā€‰<ā€‰0.05) metabolites associated with benzoate catabolic pathways.

Discussion

Sodium benzoate possesses antibacterial properties, hereby, it is commonly used as an inexpensive preservative in the packaged food items11. Sodium benzoate enters the human system through enterocytes absorption and hepatic portal circulation. Humans are known to have biochemical machinery to metabolize and excrete out these xenobiotics as hippuric acid adducts3,24. Recently, various studies confirmed the role of the human gut microbiome in various metabolic processes like drug metabolism25, food digestion5, polyphenol metabolism12,26, antibiotic chemical modifications27, etc. Surprisingly, there is no information about the influence of the human gut microbiome on dietary sodium benzoate and vice-versa. Recently, a gut microbiome dynamics study was performed to explore the influence of sodium benzoate as an antimicrobial agent on the human gut microbiome composition28. This study confirmed the non-significant impact of sodium benzoate on human gut microbiome composition28. Additionally, there is a disparity between the amount of sodium benzoate intake and its excretion as hippuric acid4. It generates a strong possibility that there is an alternative unexplored metabolic pathway that consumes a fraction of sodium benzoate and does not allow its excretion as hippuric acid24. These cumulatively indicate that the human gut microbiome possibly plays a vital role in sodium benzoate catabolism either to protect themselves from its antimicrobial effects or to use it as an energy source. Hereby, the current study has been carried out finding functional genetic machinery involved in the bio-catabolism of the frequently used preservative sodium benzoate.

Shotgun sequenced metagenomic dataset captures complete diversity irrespective of taxonomic boundaries29, hence, the metagenomic dataset is helpful to draw a holistic and accurate picture of the studied microbiome. The human gut metagenome dataset was explored to define human gut microbiome composition based on rRNA gene features, as well as protein features. Microbiome composition deciphered with RNA & protein features of metagenomic dataset analysis reveals a similar outcome for eubacteria taxonomic affiliation. Bacteroidetes were uniformly dominant in both datasets followed by Firmicutes and Proteobacteria with slight variations in their percentage share. These results are in line with the various global microbiome studies7,8, and studies carried on Indian subjects6. The abundance of Bacteroidetes in the human gut microbiome could be due to their advantage in terms of possessing efficient catabolic machinery to harvest energy from complex carbohydrates6, which is a prime dietary component in north India. Metagenomic datasets also have a minor fraction of genetic and proteomic features affiliated with archaea, viruses, fungi, as well as with higher eukaryotes. These results are substantiating various studies that the human gut microbiome is a complex architecture having various live forms dominated by the eubacterial clade6,7,8,9.

Functional annotation and clustering of metagenomic protein features identified a good percentage of protein features associated with carbohydrate metabolism defining the enrichment of proteins involved in diversified sugar-utilization monosaccharides (Sorbitol, Gluconate, Glucouronides, etc.), oligosaccharides (Trehalose, Fructo-oligosaccharides, etc.), complex carbohydrates (cellulosome, xyloglucans, etc.,), sugar alcohols, and organic acids. The presence of a large array of proteins involved in diversified sugarsā€™ metabolism to meet their energy requirements paves the way towards the successful commensalism of the human gut microbiome30. In addition to it, a plethora of proteins was identified to play a vital role in the cellular physiology and metabolism required for their growth and sustenance. More surprisingly, the human gut microbiome has a lower percentage of protein features associated with dormancy & sporulation, photosynthesis, cell signaling, in comparison to various environmental microbiomes31. In contrast, a slightly higher percentage of protein features associated with virulence, diseases & defense; Phages, Prophages, Transposable elements, Plasmids than environmental counterparts31. This variability could be seen as evolutionary adaptations to meet up environmental requirements for successful colonization32.

As an answer to the key question, many gene clusters/protein features potentially associated with benzoate metabolism were identified from the human gut metagenome dataset. Homologs of these putative protein features in various environmental microorganisms were well characterized for their functional role in benzoate catabolism13,14,15,16,17,18,19,20,21,22,23. Metagenomic exploration of the human gut microbiome has identified genetic features associated with anaerobic (Benzoyl-CoA degradation pathway)18,20 and aerobic (oxygenase coupled central aromatic intermediate metabolic pathway)19,20,21,22,23. In the environment, anaerobic microorganisms metabolize benzoate predominantly into TCA cycle intermediates through the anaerobic Benzoyl-CoA degradation pathway18,19,20. While aerobic micro-organisms predominantly utilize central aromatic intermediate metabolic pathway (either protocatechuate mediated beta-ketoadipate pathway or catechol mediated ortho/meta-cleavage pathway of benzoate oxidation19,20,21,22,23 and hybrid box pathway18,19,20 to catabolize benzoate into TCA cycle intermediates.

Identified protein features indicate the prevalence of major metabolic pathways of benzoate catabolism i.e. aerobic and anaerobic pathways18,19,20,21,22,23. The presence of these pathways indicates potentially-independent evolution of benzoate catabolic machinery among the human gut microbiome. This view is further supported by the outcome based on the phylogenetic affiliation of protein features associated with benzoate metabolism. Phylogenetic affiliation analysis of potential hits indicates the enrichment of these pathways in diversified microbial groups. Protein features in anaerobic benzoate metabolism pathways within Firmicutes followed by Proteobacteria and Actinobacteria. While protein features associated with the catechol branch of the central aromatic intermediate metabolic pathway got enriched within Bacteroidetes, followed by Proteobacteria and Firmicutes. Protein features of the Protocatechuate branch of the beta-ketoadipate pathway were equally distributed among Firmicutes, Bacteroidetes, and Proteobacteria. Except for selective enrichment, benzoate catabolic protein features showed phylogenetic affiliation with the proteins of the major gut microbial groups. It also indicates why there was a non-significant effect of sodium benzoate diet on human gut microbiome composition28. Comparative metagenomic analysis showed that despite the varied genetic and metabolic similarities, benzoate catabolic features were omnipresent among studied metagenomic datasets. It also indicates that despite the diverse ethnicity and food habits of the host, their inhabiting microbes have enriched themselves with benzoate catabolic potential. These results also strengthen the diet-driven gut microbiome evolutionary hypothesis33.

It was commonly observed that the conclusions and outcomes of various microbiome studies were drawn after analyzing the metagenomic dataset, which was based on one hypothesis i.e. if a gene for a defined function exists within the microbiome, then it would be extending its function in that microbiome34. As the functioning of all genes in a defined environment could not be ascertained34, in such case how many such conclusions stand correct? Keeping these aspects into consideration, the functionality of these metabolic pathways and protein features were assessed by metabolic fingerprinting or metabolite profiling35. Metabolite profiling during sodium benzoate functional assay identifies several intermediary metabolites as an outcome of sodium benzoate catabolism. Functional assay for benzoate catabolism indicates that human gut microbes quickly metabolize the available benzoate to generate metabolic intermediates in energy generation pathways. However, these metabolites were mapped only onto oxygenase-coupled protocatechuate mediated benzoate metabolism. It indicates that all metagenomic genetic machinery was not constitutively expressed. The non-functioning of the remaining pathway still needs to be explored. However, it could be either due to the limited prevalence of these gene sets or due to the non-availability of strict anaerobic conditions.

Conclusion

The current study explores the human gut microbiome to highlight the influence of the human gut microbiome on dietary sodium benzoate and vice-versa. The current study establishes the role of the human gut microbiome in sodium benzoate catabolism, as well as indicates the various gene clusters/protein features involved in these metabolic pathways. Besides, it also enriches our understanding of the evolutionary maturation of the human gut microbiome towards changes in dietary habits and the environmental requirement for successful sustenance in the variable gut environment.

Methodology

Sample collection and metagenomic DNA isolation

Fecal samples (nā€‰=ā€‰8) were collected in a sterile container from healthy individuals (Age 29ā€“36Ā years, Male) without any history of prolonged illness and taking proactive foods. The study was conducted after seeking ethical clearance from the human ethical committee of Maharshi Dayanand University, Rohtak, Haryana, India. Informed consent was taken before engaging the individuals in the current study. Human ethical guidelines were strictly followed before engaging individuals, as well as a written consent was taken before collecting samples. Metagenomic DNA was isolated using the alkali-lysis method36. The qualitative and quantitative analysis of the isolated metagenomic DNA was performed with agarose gel electrophoresis and Qubit dsDNA HS Assay Kit (Invitrogen, USA) respectively.

Metagenome sequencing and sequence analysis

The metagenomic DNAs of the samples were sequenced with MiSeq Reagent Kits v3 using MiSeq next-generation sequencing platform (Illumina, USA) at CSIR- Institute of Genomics and Integrative Biology (IGIB), New Delhi following standard sequencing instructions (Illumina, USA). FASTQ reads were uploaded to the Metagenome Rapid Annotation using Subsystem Technology (MG-RAST) server37 (http://metagenomics.anl.gov/) for taxonomic and functional annotation of the sequences. Metagenome data was processed at MG-RAST37 for adapter trimming, denoising, and normalization, removal of sequencing artifacts, and host DNA contamination removal. Quality filtered sequences were searched for the presence of rRNA features (cutoff similarity %ā€‰>ā€‰70%). Potential ribosomal RNA genes were clustered and checked for their homologs in the Ribosomal Database Project (e valueā€‰<ā€‰10ā€“5, sequence similarityā€‰<ā€‰60% & word sizeā€‰>ā€‰15Ā bp). FragGeneScan38 was employed for the identification of all putative protein-coding genes. Identified gene and their encoded proteins were described as genetic features and proteins features, respectively. Predicted protein-coding sequences were clustered (~ā€‰90% identity) and processed for protein similarity search using the BLAT algorithm against the M5NR protein database, RefSeq database, and Subsystems database (e valueā€‰<ā€‰10ā€“5, minimum identityā€‰<ā€‰60%). Similarity outcome was mapped to annotate protein features of each sequence in given clusters to describe each feature abundance, Lowest common ancestor (LCA) abundance profile, Data source abundance profile to predict protein feature-based taxonomic and functional affiliation.

Mining and characterization of benzoate metabolic gene clusters

Genetic and protein features associated with benzoate catabolism were extracted from the output of the subsystem database search analysis (e valueā€‰<ā€‰10ā€“5, minimum identityā€‰<ā€‰60%). Identified benzoate catabolic genetic/protein features were mapped manually onto aerobic and anaerobic benzoate metabolic pathways. Taxonomic affiliation of benzoate catabolic protein features was assessed with KAIJU webserver39 after searching their homologs in NCBI nr databases (e valueā€‰<ā€‰0.0, minimum identityā€‰<ā€‰75%, word sizeā€‰>ā€‰15Ā bp).

Analysis of the diverse human gut metagenomes

The human gut metagenomes of diverse populations were downloaded and analyzed to understand abundance to benzoate catabolic genetic and protein features in USA, Sweden, Venezuela, Japan, and Malaysia gut microbiomes (Supplementary Table S2). RNA and protein features were extracted and processed for their homology search in Ribosomal Database Project and Subsystem database. Benzoate catabolic (genetic and protein) features were extracted from the subsystem database search analysis. Significance testing between the metagenomic datasets was assessed using the PERMANOVA (Permutational Multivariate Analysis of Variance)40. Database search output was used for Principle coordinate analysis (PCoA)40 and heatmap41 analysis.

Functional affirmation of sodium benzoate catabolism

The 200Ā mg of fresh human feces (nā€‰=ā€‰8) was suspended into 1Ā ml Phosphate buffer saline (pH 7.4). The samples were processed to collect the microbial pellet36. The microbial pellet was suspended in sodium benzoate solution (5Ā mM) followed by incubation at 37Ā Ā°C. In parallel, a control sample containing microbial pellet in PBS was also incubated. Fractions were collected at different time intervals (0, 0.5, and 2Ā h) and centrifuged for 1Ā min at 13,000 rev mināˆ’1. Samples were quenched with 60% aqueous methanol solution (āˆ’ā€‰48Ā Ā°C)42,43. The quenched sample was centrifuged immediately at 16060Ā gĀ for 10Ā min atā€‰āˆ’ā€‰10Ā Ā°C. The whole mixture was snap-frozen in liquid nitrogen and processed for metabolites extraction with methanol by freezeā€“thaw cycling42,43. The supernatant was collected and stored atā€‰āˆ’ā€‰86Ā Ā°C after the centrifugation of the methanol suspension at 16ā€‰060Ā g, atā€‰āˆ’ā€‰9Ā Ā°C, for 5Ā min. An additional amount of 500 Ī¼L of 100% methanol (āˆ’ā€‰48Ā Ā°C) was added to the pellet followed by freezeā€“thaw cycling42,43. The second aliquot of the supernatant was collected again after the centrifugation of the methanol suspension at 16ā€‰060Ā g, atā€‰āˆ’ā€‰9Ā Ā°C, for 5Ā min. Both supernatants were pooled and stored atā€‰āˆ’ā€‰86Ā Ā°C for further analysis42,43. The deproteinization of samples was achieved by the addition of 400Ā Āµl acetonitrile in each vial followed by centrifugation at 5000 rev mināˆ’1 for 10Ā min. The supernatant was collected and put into an autosampler vial for injection. Unbiased metabolic profiling was performed with Exion LC system integrated AB SCIEXX500B QTOF mass spectrometer (SCIEX, USA) (Supplementary methods SM1 and SM2). LC/MS-based metabolic profiles were firstly converted into mzML data format by ProteoWizard software44 and sequentially handled by XCMS software running under R environment Version 3.7.1 for metabolic features detection and chromatographic matching45. Significant metabolic features were imported into MetaboAnalystR 2.046 for the untargeted metabolic assignment of MS peaks to the metabolic pathways (Supplementary methods SM3).