Gut microbiome of the largest living rodent harbors unprecedented enzymatic systems to degrade plant polysaccharides

Cabral, Lucelia; Persinoti, Gabriela F.; Paixão, Douglas A. A.; Martins, Marcele P.; Morais, Mariana A. B.; Chinaglia, Mariana; Domingues, Mariane N.; Sforca, Mauricio L.; Pirolla, Renan A. S.; Generoso, Wesley C.; Santos, Clelton A.; Maciel, Lucas F.; Terrapon, Nicolas; Lombard, Vincent; Henrissat, Bernard; Murakami, Mario T.

doi:10.1038/s41467-022-28310-y

Download PDF

Article
Open access
Published: 02 February 2022

Gut microbiome of the largest living rodent harbors unprecedented enzymatic systems to degrade plant polysaccharides

Nature Communications volume 13, Article number: 629 (2022) Cite this article

16k Accesses
34 Citations
152 Altmetric
Metrics details

Subjects

Abstract

The largest living rodent, capybara, can efficiently depolymerize and utilize lignocellulosic biomass through microbial symbiotic mechanisms yet elusive. Herein, we elucidate the microbial community composition, enzymatic systems and metabolic pathways involved in the conversion of dietary fibers into short-chain fatty acids, a main energy source for the host. In this microbiota, the unconventional enzymatic machinery from Fibrobacteres seems to drive cellulose degradation, whereas a diverse set of carbohydrate-active enzymes from Bacteroidetes, organized in polysaccharide utilization loci, are accounted to tackle complex hemicelluloses typically found in gramineous and aquatic plants. Exploring the genetic potential of this community, we discover a glycoside hydrolase family of β-galactosidases (named as GH173), and a carbohydrate-binding module family (named as CBM89) involved in xylan binding that establishes an unprecedented three-dimensional fold among associated modules to carbohydrate-active enzymes. Together, these results demonstrate how the capybara gut microbiota orchestrates the depolymerization and utilization of plant fibers, representing an untapped reservoir of enzymatic mechanisms to overcome the lignocellulose recalcitrance, a central challenge toward a sustainable and bio-based economy.

Integrative omics analysis of the termite gut system adaptation to Miscanthus diet identifies lignocellulose degradation enzymes

Article Open access 01 June 2020

Neotropical termite microbiomes as sources of novel plant cell wall degrading enzymes

Article Open access 02 March 2020

Proteome specialization of anaerobic fungi during ruminal degradation of recalcitrant plant fiber

Article Open access 14 September 2020

Introduction

The symbiotic microbiota within the digestive tract of herbivores has been an overwhelming source of diverse enzymatic mechanisms for lignocellulose depolymerization^1,2,3,4. For decades, the microbiota of foregut (rumen) fermenters has been employed as a model system^5,6, which resulted in the discovery of sophisticated systems to degrade recalcitrant plant fibers, such as the multi-enzyme complexes (cellulosomes) from Ruminococcus flavefaciens⁷ and the efficient cellulose degradation system from Fibrobacter succinogenes⁸.

A less explored and equally effective class of herbivores is the hindgut fermenters⁹. Similar to foregut fermenters, the digestion is accomplished by a symbiotic microbial community, but in a single fermentation chamber¹⁰. These monogastric herbivores comprise a vast range of animals, from massive mammals, such as elephants, rhinos, and horses, to small animals, as rabbits and semi-aquatic rodents¹¹. In addition, they are spread over a myriad of ecological niches, suggesting that highly specialized specialized microbial strategies may have emerged to overcome the complexity and diversity of plant glycans in these environments.

The capybara (Hydrochoerus hydrochaeris) is the largest living rodent, typically found in the Pantanal wetlands and Amazon basin, and it is also known as “the master of the grasses” due to its diet based on gramineous and aquatic plants. In this animal, the fermentation takes place in the cecum that corresponds to almost three-quarters of the gastrointestinal tract, reaching a digestive efficiency comparable to that of ruminants¹². Moreover, as a strategy to maximize absorption of nutrients derived from bacterial fermentation, capybara can eat their cecotropes, a specific type of soft excreta¹³. This habit of cecotrophy is more frequent in wild animals during the dry season, when food is scarce¹³. Despite being a formidable plant biomass fermenter, the enzymatic strategies and metabolic pathways employed by its microbial symbiotic community for the breakdown and utilization of recalcitrant dietary fibers remain mostly elusive. In addition, wild capybara animals dwelling the Southeast region of Brazil have incorporated sugarcane in their diet for decades¹⁴, which makes their cecal microbiome particularly attractive for the discovery of enzymatic mechanisms for the depolymerization of this industrially relevant feedstock and related grasses.

To elucidate the enzymatic strategies employed by the Brazilian capybara microbiota for plant cell wall depolymerization, we comprehensively investigated this gut microbial community combining 16S rRNA gene targeting sequencing (16S), metagenomics (MG), metatranscriptomics (MT), and nuclear magnetic resonance (NMR) based metabolomics, with carbohydrate enzymology and X-ray crystallography, which ultimately led to the discovery of two families, according to the carbohydrate-active enzymes database (CAZy) (Fig. 1). These findings highlight the potential of the capybara gut microbiome as a reservoir of uncharted enzymatic systems for carbohydrate processing, and thereby expanding our current understanding of gut microbial strategies from hindgut fermenters to overcome the plant cell wall recalcitrance, which might be instrumental to foster the development of value-added products from lignocellulosic agro-industrial materials.

**Fig. 1: Experimental design employed to explore capybara gut microbiome.**

Results

Capybara gut microbiota encompasses taxonomic novelties involved in plant fiber breakdown

The taxonomic structure of the capybara gut microbiota from fresh cecal and rectal samples is mainly constituted by Bacteria, with only a small fraction of reads corresponding to Archaea (16S: 1.07%, MG: 0.40%, and MT: 1.55%) and Fungi (MG 0.12% and MT: 0.32%). 16S rRNA gene-based taxonomic analysis, corroborated by 16S rRNA reads recovered from metagenome (16S_MG), MG, and MT data sets, indicates that the most abundant bacteria found in this microbiota are members from the phyla Firmicutes (mean ± SD: 35.8 ± 12.4%) and Bacteroidetes (31.5 ± 9.8%), followed by Fusobacteria (15.3 ± 5.4%) and Proteobacteria (8.4 ± 5.4%) (Fig. 2a, b and Supplementary Fig. 1). Initial characterization by hybridization approaches of the capybara cecal microbiota from wild Venezuelan animals (not exposed to sugarcane biomass) also identified Firmicutes, Proteobacteria, and Bacteroidetes as major constituents of this microbiota¹⁵. However, distinct abundances of these phyla were observed with a higher number of OTUs belonging to Firmicutes (322) and Proteobacteria (301) compared to only 76 OTUs from Bacteroidetes¹⁵. This variability would be an indicative of a potential feed specialization such as from wild animals dwelling in the Brazilian sugarcane belt, and/or microbial dynamics and adaptation to nutrient availability. A prevalence of Firmicutes, Bacteroidetes, and Proteobacteria was also observed in the gut microbiota of beaver, horse, rabbit, and koala^{16,17,18,19,20}, indicating to be a common feature among hindgut herbivores. Comparative investigation of the gut microbiota of a large number of carnivores, omnivores, and herbivores, observed a lower microbiota biodiversity in cecum fermenters (such as capybara, beaver, and rabbit) compared to colon fermenters (such as horse, elephant, and tapir), despite their high plant fiber-based diets²¹. The combination of biomass adaptation/specialization and low biodiversity of cecum microbiota would suggest the presence of highly efficient enzymatic systems for the depolymerization and utilization of recalcitrant plant fibers. In this regard, the reconstruction of metagenome-assembled genomes (MAGs) (Supplementary Data 1) revealed several taxonomic novelties, representing either unknown species or genera from bacterial families that are recognized as plant fiber degraders such as Fibrobacteraceae and Bacteroidaceae (Fig. 2c). Among these taxonomic novelties, MAGs 41 to 44, assigned to the uncultured UBA932 family, are phylogenetically grouped and may represent an unprecedented Bacteroidetes genus with a genetic potential for lignocellulose degradation. Interestingly, the high-quality MAG42 harbors a predicted polysaccharide utilization locus (PUL), which culminated in the discovery of the GH173 family (see the “Biochemical” section below). MAGs 56 to 59 may also represent an expansion of genera in the Bacteroidaceae family and are closely related to other uncultured Bacteroidaceae MAGs recovered from sheep, elephant and mice gut microbiomes²² (Supplementary Fig. 2). In particular, the high-quality MAG57 notably contains one of the largest inventories of genes encoding carbohydrate-active enzymes (CAZymes) among the 79 recovered MAGs from capybara gut microbiota (Supplementary Data 2), which are likely involved in the depolymerization of distinct hemicelluloses, including heteroxylans, an abundant polysaccharide in sugarcane and other grasses (see the “Biochemical and structural” section below). These analyses indicate that the capybara gut microbiota may harbor unexplored and high-performance molecular systems for plant fiber breakdown and utilization.

**Fig. 2: Microbial taxonomic composition of capybara gut microbiome.**

Fibrobacteres and Bacteroidetes are main degraders of dietary fibers in capybara gut

In order to understand the ability of capybara gut microbiota to convert plant polysaccharides into edible sugars, MG and MT data from cecal and rectal samples were investigated to determine the genomic potential associated with plant fiber depolymerization. A total of 7377 putative CAZymes genes encoding for 106 glycoside hydrolases (GH), 11 carbohydrate esterases (CE), and 10 polysaccharide lyases (PL) families were identified, of which 517 genes presented a modular architecture (Supplementary Data 3). The most abundant CAZymes identified are members of the families GH3, GH2, and GH1 (by decreasing abundance) (Fig. 3), which is in agreement with that reported for other gut microbiomes such as human, swine, and cattle rumen²³. These enzymes encompass diversified activities such as β-glucosidase, β-xylosidase, β-galactosidase, β-mannosidase, and α-arabinofuranosidase, and are often associated with the final steps in the depolymerization cascade of several plant polysaccharides such as cellulose, heteroxylans, mixed-linkage β-glucans, and β-mannans.

**Fig. 3: Functional annotation of capybara metagenome co-assembly predicted genes according to carbohydrate-active enzymes database (CAZy).**

In the CAZyme repertoire of this microbiota neither cellulases from families GH6, GH7, and GH48, nor cellulosomes, assessed by the presence of cohesin and dockerin domains associated with cellulases, could be identified. In ruminal Ruminococcus²⁴ and anaerobic fungi, these families are found in high abundance, possibly targeting recalcitrant cellulose structures²⁵. However, in the capybara gut microbiota assayed here, fungi were detected only at very low abundance (Fig. 2a). It suggests that cellulose degradation in the capybara gut might be mainly accomplished by endo-β-1,4-glucanases (Enzyme Commission (EC) number 3.2.1.4) from families GH5 (subfamilies GH5_2 and GH5_4), GH8, GH9, and GH45, which were detected either as single domains or in multi-modular protein architectures (Supplementary Data 3). Interestingly, the most expressed genes encoding endo-β-1,4-glucanases belong to families GH5_2, GH8, GH9, and GH45, and were identified in Fibrobacter MAGs (Supplementary Fig. 3 and Supplementary Data 4) that also present a high MT/MG ratio (Fig. 2b), indicating that these bacteria might play a key role in cellulose degradation in the capybara gut. Fibrobacter succinogenes is known as a highly efficient cellulolytic bacterium in the cow rumen²⁶ and employs a multi-protein complex to attach to cellulose fibers and cellulases secreted by the T9SS-dependent secretion system for cellulose breakdown²⁷. The three Fibrobacter MAGs recovered from capybara gut microbiome encode cellulases with a T9SS signal sequence as well as proteins for cellulose adhesion including tetratricopeptide, fibro-slime, OmpA, and pilin proteins, as reported for F. succinogenes²⁷. Furthermore, from the set of 347 proteins observed in the outer membrane vesicles (OMVs) from F. succinogenes²⁸, we have identified 262 with sequence identity ranging from 30 to 99%. These observations suggest that Fibrobacter mechanisms, fundamentally relying on cell surface adhesion and OMVs, might be central for cellulose depolymerization in the capybara gut.

In the multiple recovered Bacteroidetes MAGs, a large number of PULs and clusters of CAZymes (CCs) were identified (Fig. 4 and Supplementary Data 5), which provide highly diversified capabilities to this microbiota to cope with the chemical and structural complexity of abundant hemicelluloses and pectins in gramineous or aquatic plants such as heteroxylans, mixed-linkage β-glucans, β-1,3-glucans, xyloglucans, mannans, and homogalacturonans. Notably, the identified PUL targeting mixed-linkage β-glucans (Supplementary Fig. 4) is conserved in capybara and human microbiomes, presenting identical gene architecture encompassing one GH16 and two GH3 enzymes²⁹. PULs targeting heteroxylans and homogalacturonans (Supplementary Fig. 4), common components of gramineous plants such as sugarcane³⁰, also resemble PULs identified in human gut microbiomes³¹, highlighting a remarkable level of conservation of Bacteroidetes enzymatic systems in omnivores and hindgut herbivores.

Fig. 4: Heatmap of main carbohydrate-active enzymes (CAZymes) families, polysaccharide utilization loci (PULs) and clusters of CAZymes (CCs) identified in the recovered metagenome-assembled genomes (MAGs).

Despite the presence of multiple CEs (Fig. 4), the lack of auxiliary-active enzymes (AAs) indicates a low capacity of the capybara microbiota to perform plant biomass delignification, as also observed for other monogastric herbivores^32,33. As a mechanism to cope with lignin-rich diets, these animals may employ cecotrophy to enhance digestibility and nutrient uptake. In addition, it is noteworthy that many identified PULs only showed similarity with non-experimentally validated PULs and without a defined substrate target (Supplementary Data 5), which in part could be due to intrinsic limitations of genome reconstruction from metagenomes, but could also reflect the variability, heterogeneity and limited knowledge of the structure and composition of the glycans present in the diet of wild capybaras.

In summary, the CAZyome (CAZyme inventory) analysis of the capybara gut microbiota indicates Fibrobacteres as the main drivers for cellulose breakdown, whereas the numerous Bacteroidetes PULs and clusters of CAZymes confer to this community a myriad of enzymatic strategies to tackle with complex and diverse hemicelluloses and pectins commonly present in gramineous and aquatic plants, major components of capybara diet.

Metabolite profiling shows high performance on the conversion of dietary fibers into short-chain fatty acids

Once addressed important players for depolymerization of dietary fibers in the capybara gut, we further investigated the role of these microorganisms in the conversion of free sugars into energy for the host by integrating metabolomics (obtained from the polar fraction after a Folch extraction) and metabolic reconstruction analysis.

The major fermentation products measured in the capybara gut were short-chain fatty acids (SCFAs), among more than 40 metabolites detected by NMR spectroscopy-based metabolomics (Supplementary Data 6). The most abundant metabolites observed in cecal and rectal samples were acetate (mean ± SD: 74.83 ± 22.17 and 30.40 ± 22.76 μM, respectively), propionate (31.0 ± 6.67 and 15.98 ± 12.8 μM), and butyrate (23.30 ± 5.63 and 8.35 ± 12.83 μM). These SCFA ratios indicate a forage-based diet and are similar to that seen for ruminants^34,35, supporting a high efficiency of this microbiota in the use of dietary fibers as an energy source.

Genes related to pyruvate fermentation into acetate were highly abundant in both MG and MT data for cecal and rectal samples, and they are derived from Firmicutes, Bacteroidetes, and Fusobacteria (Fig. 5 and Supplementary Fig. 5). Metabolic pathway reconstruction analysis shows that acetate can be produced by any of the bacterial MAGs recovered from capybara gut microbiome (Fig. 5), which is in agreement with the high abundance of this metabolite in both cecal and rectal samples (Supplementary Data 6). On the other hand, the expression analysis of key genes involved in the butyrate pathway (atoA/D genes) indicates that Firmicutes Ileibacterium sp. MAG6 and Megasphaera sp. MAG33 are likely the major butyrate-producing bacteria in the capybara gut (Supplementary Fig. 6 and Supplementary Data 7). The Bacteroidetes MAG47 and Fusobacteria MAG38 and MAG39 also have co-localized genes atoA/atoD and ptb/butK, suggesting that they might contribute to butyrate production, in some extent (Supplementary Fig. 6 and Supplementary Data 7). The typical genes from acrylate and propanediol pathways involved in propionate production were not identified in the recovered MAGs from capybara gut (Fig. 5), but the mmdA gene encoding a methylmalonyl-CoA decarboxylase from the succinate pathway, is widespread mainly among Bacteroidetes and was also observed in some Firmicutes and Fusobacteria MAGs (Supplementary Fig. 6 and Supplementary Data 7). Furthermore, the ratio of propionate detected in the capybara gut correlates (R = 0.77 and p = 0.07) with the relative abundance of Bacteroidetes, supporting the succinate pathway from this phylum as the major source of propionate production in this environment.

**Fig. 5: Metabolic reconstruction of 79 unique metagenome-assembled genomes (MAGs) recovered from the capybara gut microbiome.**

The detection of high SCFAs levels in the capybara digestive tract, which is a common marker of digestion performance of dietary fibers³⁶ along with the identification of the several metabolic pathways involved in SCFAs production corroborates the potential of this microbiota for the breakdown of recalcitrant plant polysaccharides with concomitant production of energetic metabolites for the host.

A β-galactosidase GH family discovered from capybara gut microbiome

Drawing on the results showed herein, the capybara gut microbiome can be an important source of microbes harboring uncharted enzymes involved in plant polysaccharides depolymerization. Moreover, the joint MG and MT analysis of the capybara gut microbiome revealed several expressed genes annotated as hypothetical proteins. Some of these genes display remote similarity to CAZy members, with sequence identity ranging from 10 to 21%, suggesting a potential function in the processing of plant polysaccharides, but requiring further functional investigation (Supplementary Table 1).

One of these hypothetical proteins (SEQ ID PBMDCECB_44807, named here as CapGH173), was recovered from Bacteroidales bacterium MAG42, a discovered genome that expands the uncultured UBA932 family (Supplementary Data 1). Biochemical characterization of CapGH173 showed it is active on p-nitrophenyl-β-d-galactopyranoside (pNP-β-D-Gal) and kinetic parameters were determined from substrate saturation curves (Table 1 and Supplementary Fig. 7). CapGH173 orthologues are found in Actinobacteria, Firmicutes, Verrucomicrobia and Bacteroidetes MAGs recovered from diverse sources such as rumen, feces, gut, and oral microbiotas (Supplementary Table 2), being the closest sequence from a rumen-derived MAG (UBA2817) from the uncultured RC9 group²². Phylogenetic analysis showed that CapGH173 is remotely related to GH-A CAZy families, with GH30 and GH5 being the closest ones (Fig. 6a). Protein modeling and threading performed using RoseTTAFold³⁷ and PDBsum³⁸, respectively, revealed that CapGH173 consists of a (α/β)₈-barrel structure (Supplementary Fig. 8), which is an archetypal scaffold of the clan GH-A. According to structural predictions, CapGH173 exhibits a two-domain architecture including an appended β-sandwich domain (Supplementary Fig. 9), which is a similar structural organization found in the GH30 family. With the exception of the residues defining the clan GH-A, sequence alignment with GH5 and GH30 members revealed a very low sequence conservation below the criterium for significant similarity detection (using an e-value < 0.05), demonstrating that although the domains in the tertiary structure can be similar, the sequences between these families are remarkably diverse (Supplementary Figs. 9–11 and Supplementary Table 3). To further explore the discovered GH173 family, the enzyme BXY_26070 (SEQ_ID CBK67650.1) from B. xylanisolvens, which shares 46% sequence identity with CapGH173, was also heterologously expressed, purified, and biochemically characterized (Table 1). The two members characterized from the GH173 family present β-galactosidase activity, which is not described in either GH30 or GH5 families, strengthening at the biochemical level the establishment of this GH family.

Table 1 Kinetic parameters of recombinant CAZymes.

Full size table

**Fig. 6: Phylogeny of the GH173 family and its genomic context.**

In the Bacteroidales bacterium MAG42, CapGH173 is found in a predicted PUL that includes enzymes from families GH2 and GH78 (Fig. 6b). A similar PUL organization was also predicted in Bacteroidetes sp. 1_1_30 recovered from the human gut, which yet harbors enzymes from GH36, CE7, and PL8_2 families (Fig. 6b). It is worth to mention that CapGH173 is often found fused to a GH36 module or in PULs having GH36 members, as in B. xylanisolvens and Prevotella dentalis, recovered from the stool and oral cavity, respectively (Fig. 6b), indicating a synergistic relationship between these families. Moreover, these families are also commonly found along with GH78 α-l-rhamnosidases in the PUL context. In Bacteroidales bacterium UBA2817, a GH173 member is appended to a GH78 module carrying a CBM67, both targeting rhamnogalacturonans (Fig. 6b). These observations suggest that GH173 could act on β-linked galactosyl residues in pectic polysaccharides.

Capybara Bacteroidaceae bacterium MAG57 harbors an unprecedented family of carbohydrate-binding module

As presented in the taxonomic analysis, Bacteroidaceae bacterium MAG57 encompasses a remarkable number of CAZyme-encoding genes including a gene cluster targeting arabinoxylan (CC102), an abundant hemicellulose in secondary cell walls of sugarcane and other grasses. This cluster encodes two exo-enzymes from families GH43 and GH97, and an unconventional GH10 member with an unknown 45 kDa N-terminal domain (Fig. 7a). Sequence analysis showed that this unusual N-terminal domain is also present in Bacteroidetes MAGs derived from the gut of human, mouse, and elephant (Supplementary Table 4); however, it displays no similarity with any known ancillary domain associated with CAZymes. Therefore, to evaluate the function of this unconventional GH10 member, the full-length protein, its domains apart, and the other GH members comprising the CC102 cluster were recombinantly expressed, purified, and characterized. The GH97 member (CapGH97) is a calcium-activated α-galactosidase, whereas the GH43 member (CapGH43_12) is a highly active α-L-arabinofuranosidase (Supplementary Figs. 12, 13 and Table 1)—two key activities to remove glycosidic substitutions of heteroxylans.

**Fig. 7: Enzymatic system for heteroxylan degradation from *Bacteroidaceae bacterium* MAG57.**

The GH10 domain of CapGH10 exhibits endo-β-1,4-xylanase activity, being active on both xylan and distinct arabinoxylans (Table 1). Kinetic analysis indicates that substitutions present in rye arabinoxylan (arabinose/xylose ratio = 40/60) are not detrimental to the catalytic performance, showing similar K_m (Michaelis constant) and k_cat (turnover number) constants compared to xylan (Table 1 and Supplementary Fig. 14). The endo-β-1,4-xylanase enzyme from Hungateiclostridium themocellum ATCC 27405 (Xyn10Z) is the closest characterized GH10 member, sharing 35% of sequence identity with CapGH10³⁹. The N-terminal region of Xyn10Z encompasses a feruloyl esterase followed by a CBM6 domain, which is not conserved in CapGH10⁴⁰. The CapGH10 N-terminus showed only sequence similarity with uncharacterized proteins and the closest homologs, featuring similar domain architecture, were identified in genomes from ruminal Prevotella sp. such as Prevotella sp. BP1-148, Prevotella sp. BP1-145, Prevotellaceae bacterium HUN156 and Prevotellaceae bacterium MN60, also likely targeting xylan-related polysaccharides (Supplementary Table 4).

The potential enzymatic activity of the isolated N-terminal domain of CapGH10 was assessed against 30 different substrates including synthetic substrates, oligosaccharides, and polysaccharides (Supplementary Table 5), but no (hydrolase, lyase, or esterase) activity was observed. Typical activities involved in heteroxylans breakdown including endo-β-1,4-xylanase, β-xylosidase, α-l-arabinofuranosidase, α-d-galactosidase, α-d-glucuronidase, 4-O-methyl-glucuronoyl methylesterase, feruloyl esterase, and acetyl xylan esterase were assayed by distinct methods without the detection of product formation or substrate consumption. Under this perspective, we then interrogated the capacity of this N-terminal domain to bind potential substrates of its GH10 partner such as beechwood xylan and arabinoxylans using affinity gel electrophoresis (AGE). As shown in Fig. 7b, this domain can indeed interact with the substrates of the GH10 domain, suggesting that this N-terminal domain may target the CapGH10 catalytic domain to xylan polysaccharides.

To get further insights into the potential role of this unconventional N-terminal domain, its crystallographic structure was solved by SeMet phasing at 1.8 Å resolution (Supplementary Table 6). The domain behaves as a monomer in solution (Supplementary Fig. 15) and exhibits a parallel right-handed β-helix fold, consisting of 14 complete helical turns with two main short helices protruding from the β-helix backbone (Fig. 7c). The 14 helical turns are twisted and curved with a calcium ion between the 11th and 12th turns in an octahedral coordination sphere (Fig. 7c). This β-helix fold is observed in the clan GH-N of the GH superfamily, in the carbohydrate esterase CE8, and in several polysaccharide lyase (PL) families; however, structural comparisons with these CAZy families (GH28, GH91, PL6, and CE8) led to high rmsd values (>2.9 Å), indicating poor three-dimensional conservation (Supplementary Table 7). Neither the catalytically relevant residues nor the active site topology of these families is conserved in the CapGH10 β-helix domain (Supplementary Fig. 16), supporting that this domain is not catalytically active.

In order to elucidate the molecular determinants for xylan binding observed in AGE experiments, two surface regions populated with aromatic and acidic residues, typical platforms for carbohydrate interaction, were identified and mutated. The region I between the turns 1–4 and the region II near to turns 6–10 (Fig. 7c, d and Supplementary Fig. 17). Mutations E247A and E282A, in the region II, severely impaired protein stability and led to the expression only as inclusion bodies. Mutation D344L (at the calcium-binding site) also affected protein stability, but the arabinoxylan/xylan binding capacity was preserved (Supplementary Fig. 18). This result indicates that calcium ion has a structural relevance rather than a functional role in carbohydrate recognition. Only the mutants Y62A and E82A, affected the migration pattern in AGE assays with beechwood xylan and rye arabinoxylan (Fig. 7b). Both residues are located at the region I, indicating that this patch plays a role in carbohydrate binding. It is worth to mention that two aromatic residues considered critical for the activity of a GH28 member are present in the corresponding region of the CapGH10 β-helix domain, Y193 and Y279; however, their alanine mutation did not alter the carbohydrate binding, in agreement with the lack of catalytic function of the CapGH10 β-helix domain (Supplementary Figs. 16 and 18). Combining the biochemical, structural, and mutagenesis analyses, we would define CapGH10 β-helix domain as a CBM, therefore, establishing a distinguishing structural scaffold in this superfamily and founding the family CBM89.

The combination of this unconventional modular endo-β-1,4-xylanase with complementary GH43 and GH97 enzymes constituting the cluster CC107 likely confer the ability to Bacteroidaceae bacterium MAG57 to act on complex heteroxylans (Fig. 7e), a key function in the gut microbiome of capybara that have grasses as a major component in its diet.

Discussion

In this study, we investigated how the gut microbiota of the largest living rodent, capybara (Hydrochoerus hydrochaeris) known as “master of the grasses”, can efficiently depolymerize and utilize recalcitrant plant polysaccharides. These semi-aquatic animals are hindgut fermenters throughout found in Pantanal wetlands and Amazon basin and, in particular, such animals dwelling the Piracicaba basin region in Brazil have incorporated sugarcane in their diet for decades, a relevant timescale for microbial adaptation and specialization⁴¹, reasoning that this microbiota has been shaped and optimized for energy extraction²¹ from this industrially relevant lignocellulosic biomass. Sugarcane is an important feedstock for Brazilian economy and other countries such as India and Thailand. Two-thirds of this crop are made of lignocellulose, which currently is left in the field (straw) and burnt (bagasse) for energy purposes (electricity and vapor). The understanding of the enzymatic and metabolic mechanisms employed by these microbial communities to obtain energy from plant fibers may unveil alternative biological systems for the conversion of these lignocellulosic agro-industrial residues into value-added products and create further opportunities for carbohydrate-based biotechnological applications.

An integrated meta-omics approach focused on the enzymatic and metabolic capabilities for plant fiber breakdown unveiled that cellulose degradation in this community is not accomplished by classical mechanisms involving cellobiohydrolases or cellulosomes. Instead, cellulose is likely processed by the sophisticated mechanisms from Fibrobacteres, including single and multi-modular CAZymes secreted by the T9SS system, CAZyme-rich outer membrane vesicles, and lignocellulose adhesion proteins (Fig. 8). The complex and diverse composition of hemicellulosic and pectic polysaccharides present in gramineous and aquatic plants are tackled by a broad number of CAZymes organized in PULs found in the multiple recovered Bacteroidetes MAGs, which in part resembles to that from human gut Bacteroidetes species such as the PULs for mixed-linkage β-glucans²⁹ and xyloglucans^42,43. From one of these Bacteroidetes MAGs, which represents a taxonomic novelty (MAG57), an elaborate CAZyme cluster targeting heteroxylans was biochemically elucidated, which may contribute to circumvent the recalcitrance of these polysaccharides.

Fig. 8: Schematic representation of the capybara gut microbial community and enzymatic strategies involved in the depolymerization and conversion of dietary polysaccharides into short-chain fatty acids (SCFAs).

The CAZyme repertoire of the capybara gut microbiome comprehensively covers the most abundant and recalcitrant polysaccharides present in gramineous and aquatic plants, which also requires efficient metabolic capabilities to further convert these depolymerized polysaccharides into SCFAs, the main energy source of the host. This hypothesis was validated by metabolite profiling and metabolic reconstructions with the identification of acetate, butyrate, and propionate as the main metabolites, produced by classical sugar-to-SCFAs metabolic pathways including pyruvate-Acetyl-CoA for acetate, succinate for propionate, and both butyryl-CoA:acetate-CoA-transferase and phosphotransbutyrylase/butyrate kinase for butyrate (Fig. 8). Similar microbial strategies for SFCAs production were also observed in human gut bacteria, highlighting additional commonalities between the gut microbiota from omnivores and hindgut fermenters⁴⁴.

It was prominent the identification of genes and PULs with remote or no similarity to known CAZy families and systems, which led to the discovery of two CAZy families including a high-molecular-weight CBM family involved in (hetero-)xylan recognition (CBM89) and a GH-A clan family of β-galactosidases (GH173). CBM89 is an unconventional CBM family featuring a β-helix fold, demonstrating that such domains, through the evolution, were also sculpted from tandem repeat structures, which could be exploited as a versatile platform for the rational design and engineering of CBMs. The GH173 family expands the panel of industrially relevant enzymes since β-galactosidases are broadly employed in the food and beverage industries, especially in the processing of dairy-based products.

In conclusion, this work sheds light on the enzymatic apparatus and metabolic pathways employed by the gut microbiota from the Amazon monogastric semi-aquatic herbivore, capybara, for the breakdown and utilization of recalcitrant dietary polysaccharides. The discovery of several taxonomic novelties associated with plant fiber degradation along with the founding of two CAZy families highlight this microbiota as an untapped source of CAZymes and enzymatic systems of biotechnological interest. Furthermore, this comprehensively and multidisciplinary investigation of the capybara microbiome advances our understanding regarding the molecular strategies exploited by the microbiota of hindgut herbivores to utilize complex plant glycans as an energy source, indicating that the exploration of such ecological niches might create opportunities to leverage sustainable carbohydrate-based technologies.

Methods

Procedures for sample collection

This study was carried out in strict accordance with the Animal Management Rule of the Brazilian Ministry of Environment (Sisbio 59826-1). The samples were collected from three euthanized young female animals in Tatuí/São Paulo State, Brazil (September 2017), which were not infected with Rickettsia rickettsii assessed by immunofluorescence assays⁴⁵, as a control procedure of Rocky Mountain Spotted Fever (RMSF) hosts. After euthanasia, the animals were submitted to abdominal surgery to collect fresh samples (20 g) from the cecum and recto of each animal. All samples were placed in sterile containers and immediately frozen in liquid nitrogen. Samples were kept at −80 °C until processing.

Microbial DNA and RNA extraction

Samples of cecal and rectal contents were frozen in liquid nitrogen and pulverized with an oscillating ball mill (TE-350, Tecnal Inc.). The homogenized samples were used for microbial DNA extraction according to the protocol described by Yu and Morrison⁴⁶ with modifications. Briefly, 0.25 g of sample was transferred to a Lysing Matrix E Tube from the FastDNA Spin Kit (MP Biomedical, Inc.). For cell lysis, 1 mL RBB+C buffer was added in each sample, followed by homogenization in a FastPrep® FP120 instrument (MP Biomedical, Inc.). The precipitation of proteins was carried out with the addition of 0.26 mL of 10 M ammonium acetate followed by incubation on ice for 5 min and centrifugation at 4 °C for 10 min at 16,000 × g. The precipitation of nucleic acids was performed with isopropanol (1 mL). The samples were incubated on ice for 30 min and then centrifuged at 4 °C for 15 min at 16,000 × g. The pellet was recovered and washed with 70% (v/v) ethanol, followed by drying at room temperature. The nucleic acid pellet was dissolved in 75 μL of autoclaved ultrapure water. RNA was removed with the addition of DNase-free RNase (2 μL from a 10 mg mL⁻¹ stock solution). DNA purification was performed using PowerClean® DNA Clean-Up Kit (Mo Bio Laboratories). Finally, electrophoresis using 0.8% (w/v) agarose gel was used to separate the DNA fragments and to evaluate DNA quality. The DNA solution was stored at –20 °C.

The homogenized samples with the oscillating ball mill were also used for RNA extraction. In this experiment, 500 mg of each sample was used for total RNA extraction with Trizol and FastRNA® Pro Green Kit (MP Biomedicals). The quality of RNA was verified using an Agilent Bioanalyzer 2100 with the RNA 6000 Nano Reagents Kit (Agilent) and RNA samples with RNA integrity number (RIN) > 8.0 were treated with a blend of the Ribo-Zero rRNA removal Kit for bacteria and the Ribo-Zero Magnetic Gold Kit (Epicentre Biotechnologies) to remove both prokaryotic and eukaryotic rRNAs, respectively. Subsequently, the supernatant was purified using an 80% (v/v) ethanol solution, and the resultant RNA were used for library RNA preparation.

Microbial community structure and diversity analysis

Capybara gut microbial community structure and diversity was investigated via high-throughput sequencing of 16S rRNA gene. The amplification of the 16S rRNA gene V4 region was performed in technical and biological triplicates using the 515F (5′-GTGCCAGCMGCCGCGGTAA) and 806R (GGACTACHVGGGTWTCTAAT) primers⁴⁷. Sequencing was performed on a MiSeq Sequencing System (Illumina Inc.) with the V3 kit, 600 Cycles, in paired-end sequencing mode 2 × 300 bp. The ZymoBIOMICS™ Microbial Community DNA Standard (D6305, Zymo Research) with eight phylogenetically distant bacterial strains (3 gram-negative and 5 gram-positive) and 2 yeasts, was included as a positive control to evaluate possible bias in libraries construction, sequencing, and bioinformatics analysis. For taxonomic analysis, paired-end reads were quality checked using FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and filtered using Trimmomatic v.0.36⁴⁸ to remove adapters and low-quality reads, using the following parameters: ILLUMINACLIP:NexteraPE-PE.fa:2:30:10:8:true, LEADING:4, TRAILING:4, MINLEN:60 and SLIDINGWINDOW:4:16. Filtered paired-end reads were merged using fastq_mergepairs function from Usearch v.10 package⁴⁹ (parameters: fastq_maxee 0.5, fastq_minovlen 50, and fastq_minmergelen 250). Detailed number of reads per sample are summarized in Supplementary Table 8. Furthermore, primer sequences were removed, and singletons were discarded. Filtered amplicon reads were denoised (error-corrected) using the UPARSE unoise3 function (parameters: -minsize 8 and alpha 0.2), to likely recover true biological sequences (zOTUs – zero radius OTUs). Prokaryotic taxonomic assignment was performed using sintax function, as implemented in Usearsh v.10, using a the sintax_cutoff parameter of 0.8 as threshold and the RDP database v16⁵⁰. Further analyses were performed using phyloseq v1.20 package on the R Studio. Pearson and Kendall correlations were calculated between taxonomic assignments performed with 16S, MG, MT, and 16S from MG. A significant Pearson correlation (P < 0.05) was observed for all omics (correlation coefficient r = 0.96 for MG:MT, r = 0.74 for 16S_MG:MG and r = 0.71 for 16S_MG:MT, P < 0.05), except when comparing with 16S data.

Metagenome and metatranscriptome sequencing

Metagenomic libraries were prepared using the Nextera Library Preparation kit (Illumina Inc.), while metatranscriptomic libraries were prepared using the TruSeq Stranded total RNA Library Prep Kit (Illumina Inc.). Libraries concentrations were measured through quantitative qPCR using the KAPA Library Quantification Kit (Roche Inc.) and assayed for quality using an Agilent Bioanalyzer 2100 system (Agilent Technologies). MG and MT libraries were paired-end sequenced in two runs (2 × 100 bp) on the Illumina HiSeq 2500 platform at the NGS sequencing facility at LNBR/CNPEM (Campinas, Brazil). Furthermore, the cecal and rectal gDNA were homogenized into a single sample and sequenced on a MinION sequencing device (Oxford Nanopore Technologies Inc.) to obtain long reads. About 1 µg of ultra-long high-molecular weight gDNA from the homogenized samples was used for library preparation using the SQK-LSK109 Kit (Oxford Nanopore Technologies Inc.). The MinION run was performed on a Flow cell R9 version, generating around 3 Gb of long reads.

Metagenome and metatranscriptome analysis

MG and MT raw sequences were quality-checked and trimmed as described above. MT reads were also analyzed using SortmeRNA v. 2.0 to remove rRNA reads, and then both MG and MT reads were taxonomically classified using Kaiju v. 1.7.4 with a maximum number of mismatches allowed = 5 and with the greedy mode⁵¹. Reago was used to recover 16S ribosomal RNA from the MG data⁵². For functional analysis, the MG trimmed reads were de novo co-assembled using IDBA_UD (version 1.1.1) with the pre-correction parameter and k-mer size from 20 to 60⁵³. Assembly statistics are described in Supplementary Table 9. The assembled MT was binned using CONCOCT v.0.4.0 (parameters: -c 400, -k 4 -l 1000, -r 200 and --no_cov_normalization)⁵⁴ and MaxBin 2.0 (parameters: min_contig_length 1000, max_iteration 50, prob_threshold 0.9 and markerset 107)⁵⁵ to recover putative genomes from the MT data. The binned genomes were dereplicated to remove redundancies using dRep v. 2.0.5 (parameters: -comp 80 -con 10 -str 100 and -p 10) and analyzed using CheckM v1.0.6 with the lineage_wf workflow⁵⁶ to determine the completeness and contamination ratios of these genomes. Long-reads sequencing (ONT) were used for MAGs scaffolding using SSPACE-long-reads v1.1 (parameters: -k 5, -a 0.7, -x 1, -m 50, -o 20 and -n 1000), resulting in 24 MAGs of high quality (completeness > 90% and contamination < 5%) and 50 MAGs of medium quality (completeness > 50% and contamination <10%), according to parameters proposed by Bowers et al. 2017⁵⁷ (Supplementary Data 1). Genomes with completeness lower than 55% and more than 15% contamination rate were discarded. To assign taxonomy to the recovered genomes, GTDB-tk tool v.1.4 was used with the release 202 of the GTDB database⁵⁸. Gene prediction and annotation of both the recovered genomes and the co-assembly were performed using Prokka v.1.11 with the meta parameter⁵⁹, and annotation statistics are described in Supplementary Table 8. KEGG Orthologous (KOs) and pathways annotation were performed using KOFAM (e-value < 1e−5)⁶⁰ and functional ontology assignments for metagenomes (FOAM) database (e-value < 1e−5)⁶¹. CAZymes and PULs annotations were performed according to pipelines established and developed by the CAZy database team, based on HMM search profiles and sequence similarity to known CAZymes^62,63. Furthermore, EC number and KO annotations were also inspected and provided (Supplementary Data 4) to assure proper annotation of CAZymes⁶². Furthermore, MG and MT reads were mapped to the whole set of genes recovered from the co-assembled MT and the set of genes recovered from the MAGs using Kallisto v. 0.46.1 with quant function⁶⁴ to estimate the coverage/abundance of protein-coding genes in cecal and rectal samples. Normalized abundance was estimated based on the count/number of reads per kilobase per million mapped reads and expressed as TPM (Transcripts Per Million).

Phylogenetic analysis and metabolic reconstruction

Phylogenetic analysis of the MAG57, reference Bacteroidetes type strains, and Prevotellaceae uncultured genomes recovered from UBA project²² was performed using concatenated 92 single-copy core genes according to UBCG method⁶⁵. CAZymes phylogenetic analysis was carried out using the catalytic domain of each family aligned with MAFFT⁶⁶, and using maximum likelihood methods implemented in the RAxML software⁶⁷, with 1000 rapid bootstrap inferences and LG as the substitution model.

To perform the reconstruction of metabolic pathways of each recovered MAG, their annotation obtained from the KOFAM database were filtered to keep only the top 5 hits of each protein with e-value below the 1e−5 threshold. These filtered annotations were then supplied to the Annotation of Metabolite Origins (AMON) tool⁶⁸, which based on the KOs annotated in each MAG predicts the putative metabolites that it can generate.

NMR-based metabolomics

Approximately 30 mg of dried cecal and rectal contents, and 300 μL of solution 2:1 (methanol: chloroform) were mixed and sonicated for 1 min (4 cycles of 15 s with intervals of 10 s) and placed at 4 °C for 15 min. Next, 300 μL of solution 1:1 (methanol: ultrapure water) was added, followed by centrifugation at 16,000 × g and 4 °C for 20 min. The supernatant was transferred to another tube and were dried in a CentriVap Solvent System (Labconco Corporation). Samples were diluted to 630 μL by addition of D₂O, 70 μL of sodium phosphate buffer (final concentration 0.1 M) containing dimethyl-silapentane-sulfonate (final concentration 0.5 mM) for NMR chemical shift reference and concentration calibration. The samples were filtrated in a syringe filter with a 0.22 µm pore size hydrophilic polyethersulfone (PES) membrane. The final volume of filtrate ranged from 500 to 650 μL. 1H NMR spectra of samples were acquired using a Varian Inova NMR spectrometer (Agilent Technologies Inc.) equipped with a 5 mm triple resonance cold probe and operating at a 1H resonance frequency of 599.84 MHz and constant temperature of 298 K (25 °C). A total of 1024 free induction decays were collected with 32-k data points over a spectral width of 16 ppm. A 1.5-s relaxation delay was incorporated between scans, during which a continual water presaturation radio frequency (RF) field was applied.

The metabolites were processed and quantified using the NMR Suite software version 7.5 (Chenomx Inc™, Edmonton, AB, Canada). Processor module of this software was used to adjust the spectral phase and baseline corrections. A 0.5 Hz line-broadening function was used to reduce signal noise and facilitate the fitting of the metabolite signals in spectral peaks. The water signal was suppressed, and the spectra were calibrated using the reference signal of the TMSP-d₄ as 0.5 mM. The spectra were individually transferred to the Profiling module of this software to determine the metabolomic profile of each group. Metabolites were identified and their concentrations were measured. Metabolite concentration data were normalized using the initial amount of extraction.

Protein expression and purification

Protein expression and purification were conducted as reported in Santos et al.⁶⁹. Briefly, E. coli BL21 strain was transformed with target genes subcloned into the pET28a vector in frame to a 6×His-Tag at the N-terminus. Transformed strains were grown in selective LB medium (0.5% (w/v) yeast extract, 1% (w/v) tryptone, 1% (v/v) sodium chloride) at 37 °C until the O.D._600nm around 0.8. Then, the temperature was decreased to 18 °C and protein expression was induced with the addition of 0.2 mM isopropyl β-d-1-thiogalactopyranoside (IPTG) (Sigma Aldrich) and incubated for 16 h. Cells were then harvested by centrifugation at 5000×g for 15 min and further the pelleted cells were resuspended in saline-phosphate buffer (20 mM sodium phosphate, 500 mM NaCl, pH 7.5) containing 5 mM imidazole, 1 mM phenylmethylsulfonyl fluoride (PMSF), 5 mM benzamidine and 0.1 mg mL⁻¹ lysozyme. Cells were then disrupted by sonication (20-30 cycles of 10 s with intervals of 20 s), followed by centrifugation at 30,000 × g and 4 °C for 60 min. The soluble protein lysates were applied to a 5-ml HiTrap Chelating HP column (GE Healthcare) and 6×his-tagged target proteins were eluted an imidazole gradient up to 0.5 M. 6×His-Tag was cleaved using 1% (w/w) trypsin (catalog no. T1426, Sigma Aldrich). Target proteins were further purified by size-exclusion chromatography with a HiLoad 16/600 Superdex 75 pg column (GE Healthcare) equilibrated with 20 mM sodium phosphate, 150 mM NaCl, pH 7.5. Purified proteins were evaluated by denaturing polyacrylamide gel electrophoresis and dynamic light scattering (DLS) and samples with high purity (>95%) and low polydispersity (<20%) were employed in biochemical and biophysical experiments.

Enzyme assays

Purified enzymes were screened against polysaccharides, oligosaccharides, and synthetic substrates as described in Supplementary Table 2. The saturation curves of the enzymes CapGH43_12 (α-l-arabinofuranosidase, 35 °C and pH 6.5), CapGH97 (α-galactosidase, 35 °C and pH 7.0) and CapGH173 (β-galactosidase, 45 °C and pH 7.5) were obtained using the synthetic substrates p-nitrophenyl-α-l-arabinofuranoside (pNP-α-l-AraF), p-nitrophenyl-α-d-galactopyranoside (pNP-α-d-Gal) or p-nitrophenyl-β-d-galactopyranoside (pNP-β-d-Gal) (Sigma Aldrich), respectively, at the optimal pH and temperature of each enzyme in McIlvaine buffer (70 mM). The saturation curves of the full-length CapGH10 enzyme (50 °C and pH 5.0 or 5.5) and of the isolated GH10 catalytic domain (55 °C and pH 5.5) were determined for both beechwood xylan and rye arabinoxylan substrates using the 3,5-dinitrosalicylic acid method⁷⁰ in McIlvaine buffer (70 mM). Kinetic data were calculated from initial velocities and expressed as mean ± SD from three independent experiments (n = 3) using OriginPro v 8. Binding capacity of the CBM89 domain from the CapGH10 enzyme and its mutants were evaluated by affinity gel electrophoresis (AGE) according to Mandelli et al.⁷¹. Briefly, continuous native polyacrylamide gels consisted of 7.5% (w/v) acrylamide in 25 mm Tris, 250 mM glycine buffer (pH 8.3) with 0.5% (w/w) of each polysaccharide. 10 μg of the WT enzyme/mutants and BSA (negative control) were loaded on the gels and subjected to electrophoresis at room temperature for 2 h and 60 mA. Proteins were visualized by Coomassie Blue stain.

Small-angle X-ray scattering (SAXS)

SAXS data of the CBM89 domain from the CapGH10 enzyme were collected at the SAXS1 beamline (Brazilian Synchrotron Light Laboratory, Campinas, Brazil) at a protein concentration of 8.4 mg mL⁻¹ in 20 mM Hepes buffer pH 7.5. Buffer scattering were recorded and subtracted from the protein scattering. SAXS patterns were integrated using Fit2D⁷² and GNOM⁷³ was used to evaluate the pair-distance distribution functions p(r). Ab initio molecular envelope was calculated from SAXS data with DAMMIF⁷⁴ and the crystallographic coordinates were fitted into the SAXS low-resolution model using SUPCOMB⁷⁵.

Crystallization, X-ray diffraction, structure determination, and molecular modeling

Crystallization experiments of the CBM89 domain from the CapGH10 enzyme were carried out by the sitting-drop vapor-diffusion method at 18 °C. Native crystals were grown in 20% (w/v) PEG6000, 0.1 M sodium acetate (pH 5.0) and 0.2 M sodium chloride. SeMet crystals were obtained under the same condition with the addition of 20 mM betaine hydrochloride. For cryoprotection, crystals were soaked in the reservoir solution added with glycerol or PEG400 (20% (w/v)) prior to flash cooling. Diffraction datasets of native and SeMet crystals were collected at the PROXIMA-2A and MX2 beamlines from SOLEIL (Gif-sur-Yvette Cedex, France) and LNLS (Brazilian Synchrotron Light Laboratory, Campinas, Brazil), respectively. Datasets were indexed, integrated, merged, and scaled using the XDS package⁷⁶. The structure was solved by single anomalous dispersion (SAD) using the programs SHELXC, SHELXD, and SHELXE⁷⁷ for data preparation, anomalous scatters location and phase calculation, respectively. An initial model was built with the AutoBuild Wizard⁷⁸ from the Phenix package⁷⁹. The structure was refined with the programs PHENIX.REFINE⁸⁰ and REFMAC5⁸¹, and the models were inspected and manually adjusted according to the computed σ_A-weighted (2F₀ − F_c) and (F₀ − F_c) electron density maps using COOT⁸². TLS groups were calculated by TLSMD⁸³ and applied during the crystallographic refinement. The refined structure was evaluated with the servers MolProbity⁸⁴ and the PDBRedo⁸⁵. Structure factors and atomic coordinates were deposited at the Protein Data Bank (PDB) under the accession codes 7JVI. Data collection and refinement statistics are summarized in Supplementary Table 6. Structural models of the CapGH173, GH10 domain from the CapGH10, CapGH97, and CapGH43_12 were obtained using RoseTTAFold³⁷ available in the Robetta structure prediction server. Protein topology of CapGH173 was obtained using PDBsum³⁸.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All sequencing data generated in this study can be found under the BioProject ID PRJNA563062. The 16S, metagenomic and metatranscriptomic reads for cecal and rectal samples have been deposited in the Sequence Read Archive (SRA) under the accession numbers SRR11852069-SRR11852086, SRR11852046-SRR11852057, and SRR11852097-SRR11852108, respectively (Supplementary Table 10). The recovered MAGs have been deposited in the GenBank under the accession numbers JABUSA000000000-JABUVA000000000 (Supplementary Table 11). The NMR metabolomics data have been deposited in the Metabolomics Workbench database under accession number ST001945. Atomic coordinates and structure factors have been deposited in the Protein Data Bank (PDB) under accession code 7JVI (CapCBM89). Other data generated or analyzed during this study are included in this published article and its Supplementary Information files. Source data are provided with this paper.

Code availability

Data and code used for microbiome analyses are publicly available at https://github.com/gpersinoti/capybara_microbiome⁸⁶.

References

Flint, H. J., Bayer, E. A., Rincon, M. T., Lamed, R. & White, B. A. Polysaccharide utilization by gut bacteria: potential for new insights from genomic analysis. Nat. Rev. Microbiol. 6, 121–131 (2008).
Article CAS PubMed Google Scholar
Morrison, M., Pope, P. B., Denman, S. E. & McSweeney, C. S. Plant biomass degradation by gut microbiomes: more of the same or something new? Curr. Opin. Biotechnol. 20, 358–363 (2009).
Article CAS PubMed Google Scholar
White, B. A., Lamed, R., Bayer, E. A. & Flint, H. J. Biomass utilization by gut microbiomes. Annu. Rev. Microbiol. 68, 279–296 (2014).
Article CAS PubMed Google Scholar
Kartzinel, T. R., Hsing, J. C., Musili, P. M., Brown, B. R. P. & Pringle, R. M. Covariation of diet and gut microbiome in African megafauna. Proc. Natl Acad. Sci. USA 116, 23588–23593 (2019).
Article CAS PubMed PubMed Central Google Scholar
Krause, D. O. et al. Opportunities to improve fiber degradation in the rumen: microbiology, ecology, and genomics. FEMS Microbiol. Rev. 27, 663–693 (2003).
Article CAS PubMed Google Scholar
Hess, M. et al. Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science 331, 463–467 (2011).
Article ADS CAS PubMed Google Scholar
Rincon, M. T. et al. Novel organization and divergent dockerin specificities in the cellulosome system of Ruminococcus flavefaciens. J. Bacteriol. 185, 703–713 (2003).
Article CAS PubMed PubMed Central Google Scholar
Burnet, M. C. et al. Evaluating models of cellulose degradation by Fibrobacter succinogenes S85. PLoS ONE 10, e0143809 (2015).
Article PubMed PubMed Central CAS Google Scholar
Demment, M. W. & Van Soest, P. J. A nutritional explanation for body-size patterns of ruminant and nonruminant herbivores. Am. Nat. 125, 641–672 (1985).
Article Google Scholar
Stevens, C. E. & Hume, I. D. Contributions of microbes in vertebrate gastrointestinal tract to production and conservation of nutrients. Physiol. Rev. 78, 393–427 (1998).
Article CAS PubMed Google Scholar
Sakaguchi, E. Digestive strategies of small hindgut fermenters. Anim. Sci. J. 74, 327–337 (2003).
Article Google Scholar
Kiani, A. et al. Digestive physiology of captive capybara (Hydrochoerus hydrochaeris). Zoo. Biol. 38, 167–179 (2019).
Article CAS PubMed Google Scholar
Herrera, E. A. In Capybara: Biology, Use and Conservation of an Exceptional Neotropical Species 97–106 (Springer New York, 2013).
Polo, G., Mera Acosta, C., Labruna, M. B., Ferreira, F. & Brockmann, D. Hosts mobility and spatial spread of Rickettsia rickettsii. PLoS Comput. Biol. 14, e1006636 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
García-Amado, M. A. et al. Bacterial diversity in the cecum of the world’s largest living rodent (Hydrochoerus hydrochaeris). Microb. Ecol. 63, 719–725 (2012).
Article PubMed Google Scholar
Pratama, R., Schneider, D., Böer, T. & Daniel, R. First insights into bacterial gastrointestinal tract communities of the Eurasian beaver (Castor fiber). Front. Microbiol. 10, 1646 (2019).
Article PubMed PubMed Central Google Scholar
Armstrong, Z. et al. Metagenomics reveals functional synergy and novel polysaccharide utilization loci in the Castor canadensis fecal microbiome. ISME J. 12, 2757–2769 (2018).
Article CAS PubMed PubMed Central Google Scholar
Morrison, P. K. et al. The equine gastrointestinal microbiome: Impacts of age and obesity. Front. Microbiol. 9, 3017 (2018).
Article PubMed PubMed Central Google Scholar
Velasco-Galilea, M. et al. Rabbit microbiota changes throughout the intestinal tract. Front. Microbiol. 9, 2144 (2018).
Article PubMed PubMed Central Google Scholar
Barker, C. J., Gillett, A., Polkinghorne, A. & Timms, P. Investigation of the koala (Phascolarctos cinereus) hindgut microbiome via 16S pyrosequencing. Vet. Microbiol. 167, 554–564 (2013).
Article CAS PubMed Google Scholar
Milani, C. et al. Multi-omics approaches to decipher the impact of diet and host physiology on the mammalian gut microbiome. Appl. Environ. Microbiol. 86, e01864-20 (2020).
Parks, D. H. et al. Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life. Nat. Microbiol. 2, 1533–1542 (2017).
Article CAS PubMed Google Scholar
Li, J. et al. A catalog of microbial genes from the bovine rumen unveils a specialized and diverse biomass-degrading environment. GigaScience 9, 1–15 (2020).
Google Scholar
Dai, X. et al. Metatranscriptomic analyses of plant cell wall polysaccharide degradation by microorganisms in the cow rumen. Appl. Environ. Microbiol. 81, 1375 (2015).
Article ADS PubMed PubMed Central CAS Google Scholar
Hagen, L. H. et al. Proteome specialization of anaerobic fungi during ruminal degradation of recalcitrant plant fiber. ISME J. 15, 421–434 (2021).
Article CAS PubMed Google Scholar
Ransom-Jones, E., Jones, D. L., McCarthy, A. J. & McDonald, J. E. The Fibrobacteres: an important phylum of cellulose-degrading bacteria. Microb. Ecol. 63, 267–281 (2012).
Article CAS PubMed Google Scholar
Raut, M. P., Couto, N., Karunakaran, E., Biggs, C. A. & Wright, P. C. Deciphering the unique cellulose degradation mechanism of the ruminal bacterium Fibrobacter succinogenes S85. Sci. Rep. 9, 1–15 (2019).
Google Scholar
Arntzen, M., Várnai, A., Mackie, R. I., Eijsink, V. G. H. & Pope, P. B. Outer membrane vesicles from Fibrobacter succinogenes S85 contain an array of carbohydrate-active enzymes with versatile polysaccharide-degrading capacity. Environ. Microbiol. 19, 2701–2714 (2017).
Article CAS PubMed Google Scholar
Tamura, K. et al. Molecular mechanism by which prominent human gut Bacteroidetes utilize mixed-linkage beta-glucans, major health-promoting cereal polysaccharides. Cell Rep. 21, 417–430 (2017).
Article CAS PubMed PubMed Central Google Scholar
de Souza, A. P., Leite, D. C. C., Pattathil, S., Hahn, M. G. & Buckeridge, M. S. Composition and structure of sugarcane cell wall polysaccharides: implications for second-generation bioethanol production. Bioenergy Res. 6, 564–579 (2013).
Article CAS Google Scholar
Martens, E. C. et al. Recognition and degradation of plant cell wall polysaccharides by two human gut symbionts. PLoS Biol. 9, e1001221 (2011).
Article CAS PubMed PubMed Central Google Scholar
de Oliveira, K. et al. Indigestible cellulose and lignin in determining feces production and apparent digestibility in horses. Acta Sci. Anim. Sci. 34, 267–272 (2012).
Google Scholar
Gomez, A., Sharma, A. K., Grev, A., Sheaffer, C. & Martinson, K. The horse gut microbiome responds in a highly individualized manner to forage lignification. J. Equine Vet. Sci. 96, 103306 (2021).
Article PubMed Google Scholar
Hungate, R. E. in The Rumen and its Microbes 206–244 (Elsevier Science, 1966).
Rémond, D., Ortigues, I. & Jouany, J.-P. Energy substrates for the rumen epithelium. Proc. Nutr. Soc. 54, 95–105 (1995).
Article PubMed Google Scholar
Makki, K., Deehan, E. C., Walter, J. & Bäckhed, F. The impact of dietary fiber on gut microbiota in host health and disease. Cell Host Microbe 23, 705–715 (2018).
Article CAS PubMed Google Scholar
Baek, M. et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Laskowski, R. A., Jabłońska, J., Pravda, L., Vařeková, R. S. & Thornton, J. M. PDBsum: structural summaries of PDB entries. Protein Sci. 27, 129–134 (2018).
Article CAS PubMed Google Scholar
Grépinet, O., Chebrou, M. C. & Béguin, P. Purification of Clostridium thermocellum xylanase Z expressed in Escherichia coli and identification of the corresponding product in the culture medium of C. thermocellum. J. Bacteriol. 170, 4576–4581 (1988).
Article PubMed PubMed Central Google Scholar
Schubot, F. D. et al. Structural basis for the substrate specificity of the feruloyl esterase domain of the cellulosomal xylanase Z from Clostridium thermocellum. Biochemistry 40, 12524–12532 (2001).
Article CAS PubMed Google Scholar
Henry, L. P., Bruijning, M., Forsberg, S. K. G. & Ayroles, J. F. The microbiome extends host evolutionary potential. Nat. Commun. 12, 5141 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Larsbrink, J. et al. A complex gene locus enables xyloglucan utilization in the model saprophyte Cellvibrio japonicus. Mol. Microbiol. 94, 418–433 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hemsworth, G. R. et al. Structural dissection of a complex Bacteroides ovatus gene locus conferring xyloglucan metabolism in the human gut. Open Biol. 6, 160142 (2016).
Koh, A., de Vadder, F., Kovatcheva-Datchary, P. & Bäckhed, F. From dietary fiber to host physiology: short-chain fatty acids as key bacterial metabolites. Cell 165, 1332–1345 (2016).
Article CAS PubMed Google Scholar
Nunes, F. B. P., Nunes, A. Z., Nunes, M. P., Labruna, M. B. & Pizzutto, C. S. Reproductive control of capybaras through sterilization in areas at risk of transmission of Brazilian spotted fever. Cienc. Rural 50, 1–9 (2020).
Google Scholar
Yu, Z. & Morrison, M. Improved extraction of PCR-quality community DNA from digesta and fecal samples. BioTechniques 36, 808–812 (2004).
Article CAS PubMed Google Scholar
Caporaso, J. G. et al. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc. Natl Acad. Sci. USA 108, 4516–4522 (2011).
Article ADS CAS PubMed Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Edgar, R. C. UPARSE: highly accurate OTU sequences from microbial amplicon reads. Nat. Methods 10, 996–998 (2013).
CAS PubMed Google Scholar
Cole, J. R. et al. Ribosomal database project: data and tools for high throughput rRNA analysis. Nucleic Acids Res. 42, D633–D642 (2014).
Menzel, P., Ng, K. L. & Krogh, A. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat. Commun. 7, 11257 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Yuan, C., Lei, J., Cole, J. & Sun, Y. Reconstructing 16S rRNA genes in metagenomic data. Bioinformatics 31, i35–i43 (2015).
Article CAS PubMed PubMed Central Google Scholar
Peng, Y., Leung, H. C. M., Yiu, S. M. & Chin, F. Y. L. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28, 1420–1428 (2012).
Article CAS PubMed Google Scholar
Alneberg, J. et al. Binning metagenomic contigs by coverage and composition. Nat. Methods 11, 1144–1146 (2014).
Article CAS PubMed Google Scholar
Wu, Y. W., Simmons, B. A. & Singer, S. W. MaxBin 2.0: An automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32, 605–607 (2016).
Article CAS PubMed Google Scholar
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bowers, R. M. et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat. Biotechnol. 35, 725–731 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chaumeil, P. A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: A toolkit to classify genomes with the genome taxonomy database. Bioinformatics 36, 1925–1927 (2020).
CAS Google Scholar
Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).
CAS PubMed Google Scholar
Aramaki, T. et al. KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 36, 2251–2252 (2020).
Article CAS PubMed Google Scholar
Prestat, E. et al. FOAM (functional ontology assignments for metagenomes): a Hidden Markov Model (HMM) database with environmental focus. Nucleic Acids Res. 42, e145 (2014).
Lombard, V., Golaconda Ramulu, H., Drula, E., Coutinho, P. M. & Henrissat, B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 42, D490–D495 (2014).
Article CAS PubMed Google Scholar
Terrapon, N. et al. PULDB: the expanded database of polysaccharide utilization loci. Nucleic Acids Res. 46, 677–683 (2017).
Article CAS Google Scholar
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
Article CAS PubMed Google Scholar
Na, S.-I. I. et al. UBCG: Up-to-date bacterial core gene set and pipeline for phylogenomic tree reconstruction. J. Microbiol. 56, 280–285 (2018).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Shaffer, M. et al. AMON: Annotation of metabolite origins via networks to integrate microbiome and metabolome data. BMC Bioinformatics 20, 614 (2019).
Article CAS PubMed PubMed Central Google Scholar
Santos, C. R. et al. Structural insights into β-1,3-glucan cleavage by a glycoside hydrolase family. Nat. Chem. Biol. 16, 920–929 (2020).
Article CAS PubMed Google Scholar
Miller, G. L. Use of dinitrosalicylic acid reagent for determination of reducing sugar. Anal. Chem. 31, 426–428 (1959).
Article CAS Google Scholar
Mandelli, F. et al. Spatially remote motifs cooperatively affect substrate preference of a ruminal GH26-type endo-β-1,4-mannanase. J. Biol. Chem. 295, 5012–5021 (2020).
Article CAS PubMed PubMed Central Google Scholar
Hammersley, A. P., Svensson, S. O., Hanfland, M., Fitch, A. N. & Häusermann, D. Two-dimensional detector software: from real detector to idealised image or two-theta scan. High. Press. Res. 14, 235–248 (1996).
Article ADS Google Scholar
Svergun, D. I. Determination of the regularization parameter in indirect-transform methods using perceptual criteria. J. Appl. Crystallogr. 25, 495–503 (1992).
Article CAS Google Scholar
Franke, D. & Svergun, D. I. DAMMIF, a program for rapid ab-initio shape determination in small-angle scattering. J. Appl. Crystallogr. 42, 342–346 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kozin, M. B. & Svergun, D. I. Automated matching of high- and low-resolution structural models. J. Appl. Crystallogr. 34, 33–41 (2001).
Article CAS Google Scholar
Kabsch, W. et al. XDS. Acta Crystallogr. Sect. D Biol Crystallogr. 66, 125–132 (2010).
Sheldrick, G. M. Experimental phasing with SHELXC/D/E: combining chain tracing with density modification. Acta Crystallogr. Sect. D Biol. Crystallogr. 66, 479–485 (2010).
Article CAS Google Scholar
Terwilliger, T. C. et al. Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard. Acta Crystallogr. Sect. D Biol. Crystallogr. 64, 61–69 (2008).
Article CAS Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive python-based system for macromolecular structure solution. Acta Crystallogr. Sect. D Biol. Crystallogr. 66, 213–221 (2010).
Article CAS Google Scholar
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. Sect. D Biol. Crystallogr. 68, 352–367 (2012).
Article CAS Google Scholar
Murshudov, G. N., Vagin, A. A. & Dodson, E. J. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr. Sect. D Biol. Crystallogr. 53, 240–255 (1997).
Article CAS Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. Sect. D Biol. Crystallogr. 60, 2126–2132 (2004).
Article CAS Google Scholar
Painter, J. & Merritt, E. A. TLSMD web server for the generation of multi-group TLS models. J. Appl. Crystallogr. 39, 109–111 (2006).
Article CAS Google Scholar
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. Sect. D Biol. Crystallogr. 66, 12–21 (2010).
Article CAS Google Scholar
Joosten, R. P., Long, F., Murshudov, G. N. & Perrakis, A. The PDB-REDO server for macromolecular structure model optimization. IUCrJ 1, 213–220 (2014).
Article CAS PubMed PubMed Central Google Scholar
Persinoti, G. F. Gut microbiome of the largest living rodent harbors unprecedented enzymatic systems to degrade plant polysaccharides. Github https://zenodo.org/badge/latestdoi/358011770 (2022).
Biely, P., Singh, S. & Puchart, V. Towards enzymatic breakdown of complex plant xylan structures: state of the art. Biotechnol. Adv. 34, 1260–1274 (2016).
Article CAS PubMed Google Scholar
Peter, Z. Order in cellulosics: historical review of crystal structure research on cellulose. Carbohydr. Polym. 254, 117417 (2021).
Article CAS Google Scholar
Pauly, M. & Keegstra, K. Biosynthesis of the plant cell wall matrix polysaccharide xyloglucan. Annu. Rev. Plant Biol. 67, 235–259 (2016).
Article CAS PubMed Google Scholar
Harholt, J., Suttangkakul, A. & Scheller, H. V. Biosynthesis of pectin. Plant Physiol. 153, 384–395 (2010).
Article CAS PubMed PubMed Central Google Scholar
Izydorczyk, M. S. & Dexter, J. E. Barley β-glucans and arabinoxylans: molecular structure, physicochemical properties, and uses in food products—a review. Food Res. Int. 41, 850–868 (2008).
Article CAS Google Scholar

Download references

Acknowledgements

We acknowledge the Brazilian Synchrotron Light Laboratory (LNLS) for the provision of time on the MX2 and SAXS1 beamlines, the Brazilian Biosciences National Laboratory (LNBio) for the use of the crystallization (Robolab), NMR and spectroscopy facilities, and the Brazilian Biorenewables National Laboratory (LNBR) for the use of the characterization of macromolecules and next-generation sequencing facilities. LNLS, LNBio, and LNBR are operated by the Brazilian Center for Research in Energy and Materials for the Brazilian Ministry for Science, Technology, and Innovations. We acknowledge SOLEIL for the provision of synchrotron radiation facilities at PROXIMA-2A (proposal 20181915) and we would like to thank William Shepard and Martin Savko for assistance in using the beamline. We acknowledge the support from Alana H. S. Alvarenga in protein purification and solubilization assays. We are also thankful to Prof. Anne S. Meyer from the Technical University of Denmark and Prof. Lucimara M. C. Cordeiro from the Federal University of Parana for providing purified polysaccharides for enzyme assays. This research was supported by grants from Fundação de Amparo à Pesquisa do Estado de São Paulo (grant no. 2015/26982-0 to M.T.M. and postdoctoral fellowship 2016/19995-0 to M.A.B.M) and Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) (grant no. 305013/2020-3 to M.T.M., 408600/2018-7 to G.F.P, 439195/2016-0 to L.C, 150552/2017-3, and 142332/2017-8 to M.P.M.). L.C, “Coordenação de Aperfeiçoamento de Pessoal de Nível Superior” (CAPES; Finance code 001).

Author information

These authors contributed equally: Lucelia Cabral, Gabriela F. Persinoti, Douglas A. A. Paixão.

Authors and Affiliations

Brazilian Biorenewables National Laboratory, Brazilian Center for Research in Energy and Materials, Campinas, SP, Brazil
Lucelia Cabral, Gabriela F. Persinoti, Douglas A. A. Paixão, Marcele P. Martins, Mariana A. B. Morais, Mariana Chinaglia, Mariane N. Domingues, Renan A. S. Pirolla, Wesley C. Generoso, Clelton A. Santos, Lucas F. Maciel & Mario T. Murakami
Graduate Program in Functional and Molecular Biology, Institute of Biology, University of Campinas, Campinas, SP, Brazil
Marcele P. Martins & Mariana Chinaglia
Brazilian Biosciences National Laboratory, Brazilian Center for Research in Energy and Materials, Campinas, SP, Brazil
Mauricio L. Sforca
The Institut National de la Recherche Agronomique, USC 1408 AFMB, 13288, Marseille, France
Nicolas Terrapon & Vincent Lombard
Architecture et Fonction des Macromolécules Biologiques, CNRS, Aix-Marseille Université, Marseille, France
Nicolas Terrapon & Vincent Lombard
Department of Biotechnology and Biomedicine (DTU Bioengineering), Technical University of Denmark, 2800 Kgs, Lyngby, Denmark
Bernard Henrissat
Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
Bernard Henrissat

Authors

Lucelia Cabral
View author publications
You can also search for this author in PubMed Google Scholar
Gabriela F. Persinoti
View author publications
You can also search for this author in PubMed Google Scholar
Douglas A. A. Paixão
View author publications
You can also search for this author in PubMed Google Scholar
Marcele P. Martins
View author publications
You can also search for this author in PubMed Google Scholar
Mariana A. B. Morais
View author publications
You can also search for this author in PubMed Google Scholar
Mariana Chinaglia
View author publications
You can also search for this author in PubMed Google Scholar
Mariane N. Domingues
View author publications
You can also search for this author in PubMed Google Scholar
Mauricio L. Sforca
View author publications
You can also search for this author in PubMed Google Scholar
Renan A. S. Pirolla
View author publications
You can also search for this author in PubMed Google Scholar
Wesley C. Generoso
View author publications
You can also search for this author in PubMed Google Scholar
Clelton A. Santos
View author publications
You can also search for this author in PubMed Google Scholar
Lucas F. Maciel
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Terrapon
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Lombard
View author publications
You can also search for this author in PubMed Google Scholar
Bernard Henrissat
View author publications
You can also search for this author in PubMed Google Scholar
Mario T. Murakami
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.C., G.F.P., M.A.B.M. and M.T.M. designed the study and wrote the paper. G.F.P., L.F.M., N.T., V.L. and B.H. performed the multi-omics data analyses. L.C., D.A.P., M.P.M. and M.C. performed the 16S, metagenomics, and metatranscriptomics experiments. L.C., M.P.M., M.C., M.N.D., R.A.S.P., W.C.G. and C.A.S. expressed and purified the enzymes and performed the functional characterization. L.C., M.P.M., M.A.B.M., W.C.G. and M.T.M. performed the structural analysis. L.C. and M.L.S. performed the metabolomics analysis.

Corresponding authors

Correspondence to Gabriela F. Persinoti or Mario T. Murakami.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Eric J. Sundberg, Alma Villaseñor and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Description of Additional Supplementary Files

Supplementary Dataset 1

Supplementary Dataset 2

Supplementary Dataset 3

Supplementary Dataset 4

Supplementary Dataset 5

Supplementary Dataset 6

Supplementary Dataset 7

Reporting Summary

Source_data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cabral, L., Persinoti, G.F., Paixão, D.A.A. et al. Gut microbiome of the largest living rodent harbors unprecedented enzymatic systems to degrade plant polysaccharides. Nat Commun 13, 629 (2022). https://doi.org/10.1038/s41467-022-28310-y

Download citation

Received: 13 September 2021
Accepted: 14 January 2022
Published: 02 February 2022
DOI: https://doi.org/10.1038/s41467-022-28310-y

This article is cited by

Comparative genomic analysis of Planctomycetota potential for polysaccharide degradation identifies biotechnologically relevant microbes
- Dominika Klimek
- Malte Herold
- Magdalena Calusinska
BMC Genomics (2024)
Genetic and functional diversity of β-N-acetylgalactosamine-targeting glycosidases expanded by deep-sea metagenome analysis
- Tomomi Sumida
- Satoshi Hiraoka
- Takuro Nunoura
Nature Communications (2024)
Prebiotic inulin ameliorates SARS-CoV-2 infection in hamsters by modulating the gut microbiome
- Isaiah Song
- Jiayue Yang
- Shinji Fukuda
npj Science of Food (2024)
Successional action of Bacteroidota and Firmicutes in decomposing straw polymers in a paddy soil
- Junjie Huang
- Kailin Gao
- Yahai Lu
Environmental Microbiome (2023)
Trait biases in microbial reference genomes
- Sage Albright
- Stilianos Louca
Scientific Data (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.