Abstract
In this study, a high-throughput sequencing approach was applied to discover novel biocatalysts for lignocellulose hydrolysis from three dedicated energy crops, Arundo donax, Eucalyptus camaldulensis and Populus nigra, after natural biodegradation. The microbiomes of the three lignocellulosic biomasses were dominated by bacterial species (approximately 90%) with the highest representation by the Streptomyces genus both in the total microbial community composition and in the microbial diversity related to GH families of predicted ORFs. Moreover, the functional clustering of the predicted ORFs showed a prevalence of poorly characterized genes, suggesting these lignocellulosic biomasses are potential sources of as yet unknown genes. 1.2%, 0.6% and 3.4% of the total ORFs detected in A. donax, E. camaldulensis and P. nigra, respectively, were putative Carbohydrate-Active Enzymes (CAZymes). Interestingly, the glycoside hydrolases abundance in P. nigra (1.8%) was higher than that detected in the other biomasses investigated in this study. Moreover, a high percentage of (hemi)cellulases with different activities and accessory enzymes (mannanases, polygalacturonases and feruloyl esterases) was detected, confirming that the three analyzed samples were a reservoir of diversified biocatalysts required for an effective lignocellulose saccharification.
Similar content being viewed by others
Introduction
The main renewable resources available to counteract the high greenhouse gas emissions and dependence on feedstock imports associated with fossil sources utilization1 are waste materials such as crop and forestry residues, agro-industrial wastes and municipal solid waste2 and dedicated energy crops, such as miscanthus, switchgrass, reed canary, giant reed, poplar, willow and eucalyptus3,4. However, the main drawback of their use is related to the complexity of macromolecular composition that requires an effective disarraying of recalcitrant lignin and a suitable tailor-made enzyme mixture based on (hemi)cellulases and auxiliary enzymes needed to obtain an effective saccharification5,6. Enzymes involved into the degradation, modification, or creation of glycosidic bonds are referred to as carbohydrate-active enzymes (CAZymes) that are categorized in different classes and families including glycoside hydrolases (GHs), key enzymes for lignocellulosic biomass degradation, glycosyltransferases (GTs), polysaccharide lyases (PLs), carbohydrate esterases (CEs) and carbohydrate-binding modules (CBMs)7. The cellulases hydrolyze the β (1 → 4) glycosidic bonds and are grouped into three main groups, according to their reaction mechanism: the endoglucanases (EC 3.2.1.4) cut randomly the internal glycosidic bonds in the amorphous cellulose; the exocellulases act from the reducing ends (EC 3.2.1.176) or non-reducing ends (EC 3.2.1.91) of cellulose; the β-glucosidases (EC 3.2.1.21) are involved in the hydrolysis of cellobiose. The hemicellulases include several enzymes–such as endo/exo-xylanases (E.C. 3.2.1.8/37), endo/exo-β-glucanases (EC 3.2.1.6/58), β-mannanases (EC 3.2.1.78), polygalacturonases (EC 3.2.1.15, 67, 82), pectin lyases, pectate lyases (EC 4.2.2.2, 6, 9, 10), pectin methyl esterases (EC 3.1.1.11), arabinofuranosidases (EC 3.2.1.55), feruloyl esterases (EC 3.1.1.73)–acting on specific glyco-units and glycosidic bonds towards different hemicelluloses. Furthermore, auxiliary enzymes acting towards recalcitrant highly crystalline cellulose by a non-hydrolytic mechanism, such as lytic polysaccharides monooxygenases, are needed to enhance the fermentable sugars yield8.
Although different combinations of processes for conversion of dedicated energy crops and waste materials into fermentable sugars have been widely studied9,10,11,12,13,14,15, the saccharification step is still the main bottleneck in the biorefinery16 due to the high costs of the enzyme production and the need for biocatalysts that are efficient and stable at the operative conditions17. Therefore, the discovery of novel biocatalysts that could satisfy these criteria is one of the main challenges to overcome this bottleneck. At present, the most advanced researches exploit metagenomes, namely genomic DNAs extracted directly from different environments18, bypassing the need for culture under laboratory conditions and avoiding the restrictions related to in vitro techniques. Two different methods can be used to screen the metagenomes. The function-driven strategy is performed by a biological activity- screening of expression libraries18. The sequence-driven approach is based on the direct sequencing of all genetic material from a target environment and on the homology analysis in comparison with sequences already present in the databases18. The increasing number of works focusing on the study of microbiota from guts of wood-eating insects19, cow20, green-waste compost21 shows the relevance of the research for new lignocellulolytic microorganisms and enzymes. At the present, among natural environments, decaying lignocellulosic materials could represent an important reservoir of novel genes encoding enzymes involved in (hemi)cellulose degradation, necessary for the development of eco-compatible and economically favorable industrial processes. In a previous study22, new multifunctional degrading bacteria that were potential producers of multiple enzymes that have synergistic actions on cellulose and hemicellulose were isolated and selected from lignocellulosic biomasses using a cultural-dependent approach.
Therefore, in the present work, a sequence-driven metagenomic approach was applied to the three dedicated lignocellulosic energy crops Arundo donax, Eucalyptus camaldulensis and Populus nigra after natural biodegradation to identify candidate genes coding for enzymes that may be of use in lignocellulose hydrolysis. Moreover, metagenomic DNA sequences were also analysed to assess the complex microbial community structure and taxonomic diversity of the analyzed biomasses and to evaluate the microbial diversity related to GH families of predicted ORFs.
This study provides high-quality results for the identification of sequences coding for enzymes involved in breakdown, biosynthesis or modification of complex carbohydrates such as lignocellulosic biomass.
The data obtained in this work indicate that the investigated feedstock represent a source of biocatalysts potentially suitable for industrial applications to enhance the conversion of lignocellulosic crops into fermentable sugars.
Results
Data Statistics
The microbiota of three different lignocellulosic biomasses were analysed by Illumina sequencing of the metagenomic DNAs. A total of 11,208,388,400, 11,274,127,600 and 2,392,000 raw reads for A. donax, E. camaldulensis and P. nigra, respectively, were obtained. Sequence reads accounting for around 10.0 Gb, for A. donax and E. camaldulensis samples, and 2 Gb, for P. nigra, were selected (Table 1).
The reads were assembled into 95,292, 159,184 and 33,805 contigs (cut-off value 500 bp) for A. donax, E. camaldulensis and P. nigra biomasses, respectively (Table 1). The N50 and N90 contig lengths ranged from 914 to 1,452 and from 546 to 583 bases, respectively. The longest contig was 49,245, 650,642 and 85,030 bases in A. donax, E. camaldulensis and P. nigra, respectively (Table 1).
Microbial community composition of lignocellulosic biomasses
The reads were compared against sequences in the NCBI NR database and the results processed by MEGAN version 4.70.4 to determine the composition of the microbial communities. The three lignocellulosic biomass samples were shown dominated by Proteobacteria and Actinobacteria. These phyla together accounted for approximately 87.5%, 87.2% and 89.4% of the total biodiversity in A. donax, E. camaldulensis and P. nigra, respectively (Table 2). In P. nigra biomass, Firmicutes, and in particular Bacilli, were detected at a high incidence (approximately 10%).
A low percentage of reads matched fungal species in A. donax and E. camaldulensis (2.7% and 5.3%, respectively) (Table 2).
The relative abundances of microbial taxa were examined at the level of genera to determine the dominant taxa within the bacterial communities degrading biomass from the different investigated plant species. The composition of prokaryotic and eukaryotic subpopulations within the biomass were also separately assessed and presented below.
In total, sixteen different bacterial genera with an incidence ≥1% were detected in the biomass materials, but only Streptomyces, Pseudomonas, Agrobacterium, Xanthomonas and Stenotrophomonas were detected in all samples (Fig. 1). In particular, the composition of microbial community in the P. nigra biomass was strongly dominated by Streptomyces (50.1%), followed by Bacillus (7.8%), Stenotrophomonas (7%), Pseudomonas (5.1%), Xanthomonas (4.2%), Rahnella (3.3%), Agrobacterium (1.7%) and Pseudoxanthomonas (1.1%).
As in the P. nigra biomass, Streptomyces was the taxa that heavily dominated the microbial community in A. donax and E. camaldulensis (35.0% and 47.7%, respectively), followed by Pseudomonas, Agrobacterium, Xanthomonas, Pantoea and Stenotrophomonas. The relative abundance of these taxa was very variable showing a percentage ranging approximately from 1% to 6.5%, depending on lignocellulosic plant species (Fig. 1).
In the E. camaldulensis biomass, Erwinia occurred at a high incidence (6.3%); while, Corallococcus (1.8%), Ketogulonicigenium (1.8%), Methylobacterium (1.2%) and Enterobacter (1.1%) genera were recovered only in the A. donax biomass (Fig. 1).
The relative abundances of fungal taxa accounted for 2.2%, 4.9% and 0.2% of the total biodiversity in A. donax, E. camaldulensis and P. nigra, respectively (Fig. 1). In detail, the incidence of all fungal genera identified in A. donax and P. nigra biomass was <1%; while in E. camaldulensis biomass, Penicillium strongly dominated the eukaryotic biodiversity showing a relative abundance of 3.2% (Fig. 1).
eggNOG and KEGG functional profiling of lignocellulosic biomass
With the aim to investigate the functional diversity in the three samples, the predicted amino acid sequences were also aligned to the databases Evolutionary genealogy of Genes non-supervised orthologous groups–eggNOG–and Kyoto Encyclopedia of Genes and Genomes–KEGG–by using BLAST.
As shown in Fig. 2, the data revealed a prevalence of poorly characterized genes belonging to S (function unknown) or R (general function prediction only) eggNOG category. Moreover, for all three samples, a high percentage (~38%) of genes matching to non-supervised orthologous groups were classified involving in metabolism (categories C, E, F, G, H, I, P, Q) with ~8% of genes related to the carbohydrate transport and metabolism.
As shown in Fig. 3, although the majority of predicted ORFs were related to the membrane transport, this analysis confirmed that many genes matching to KEGG database (~12%) originated from pathways involved in the carbohydrate metabolism.
Inventory of the detected Carbohydrate-Active Enzymes families and putative plant-polysaccharides-targeting Glycoside Hydrolases
In order to identify putative genes and enzymes involved in breakdown, biosynthesis or modification of carbohydrates, the total predicted ORFs in the three investigated biomass samples were compared to the entries of the Carbohydrate-Active Enzymes (CAZymes) database. A total of 1792, 1279 and 2113 putative CAZymes were identified in the samples T3ADSB (from A. donax after 135 days of natural biodegradation in underwood), T3ESB (from E. camaldulensis after 135 days of natural biodegradation in underwood) and T3PSB (from P. nigra after 135 days of natural biodegradation in underwood) respectively, corresponding to 1.2%, 0.6% and 3.4% of the total ORFs (Table 3). A high relative abundance (25–26%) of predicted CAZymes was reported belonging to glycosyltransferases (GTs) families and involved in forming glycosidic bonds for the biosynthesis of di-, oligo- and polysaccharides. A less amount of Carbohydrate Esterases–CEs–(~5–7%), Polysaccharide Lyases–PLs–(~1–3%) and Auxiliary Activities–AAs- (~2–4%) enzymes were detected in the three samples. Moreover, ORFs coding for putative Carbohydrate-binding modules (CBMs) having binding activity to carbohydrates were 5.2%, 11.6% and 13.5% on total CAZymes for T3ADSB, T3ESB and T3PSB, respectively. Around half of the detected CBMs (2.5%, 6.3% and 7% on total CAZYmes for T3ADSB, T3ESB and T3PSB, respectively) was in conjunction with other non-catalytic CBMs and/or with catalytically-active GHs modules exhibiting a modular structure. In particular, in the metagenome from A. donax, most of the multimodular CAZymes contained two modules. Only 4 ORFs encoding putative multimodular proteins contained three modules. In the metagenome from E. camaldulensis, multimodular CAZYmes containing CBM32 module were mainly detected. The members belonging to CBM family 32, commonly found in bacterial CAZymes that modify plant cell wall polysaccharides and eukaryotic glycans, were reported to have different substrate specificity23. Modular proteins containing CBM32 module were mainly detected even in metagenome from P. nigra in multiple copies within the same enzyme or in conjunction with other CBM and/or GH motifs. In this sample, the largest amount of multimodular CAZymes was recognized. In particular, one ORF consisted of seven modules (GH16-CBM4-CBM4-CBM4-CBM4-CBM32-CBM32), one of six modules (CBM54-GH16-CBM4-CBM4-CBM4-CBM4) and one of five modules (CBM35- CBM35- CBM35- CBM35-GH43).
However, most of the detected CAZymes in the three samples were involved in hydrolysis and/or rearrangement of glycosidic bonds. In particular, a number of 1059 in A. donax (corresponding to 59.1% on total CAZymes and to 0.7% on total ORFs detected), 750 in Eucalyptus camaldulensis (corresponding to 58.6% on total CAZymes and to 0.3% on total ORFs detected) and 1136 in Populus nigra (corresponding to 53.8% on total CAZymes and to 1.9% on total ORFs detected) predicted proteins were classified as GHs. The Fig. 4 shows the most frequently occurring putative GHs detected in T3ADSB, T3ESB and T3PSB samples. For each sample, the GHs with abundance ≥1% of the total detected GHs are reported. Table 4 shows the comparison of GH family percentage (abundance >3%) of predicted ORFs from the samples. An abundance of putative GH92–exo-acting α-mannosidases–(5.2%, 6.4% and 4.6% for T3ADSB, T3ESB and T3PSB, respectively, GH3 (6.6%, 5.4% and 6.2% T3ADSB, T3ESB and T3PSB, respectively) and GH43 (5.3% 3.9% and 4.3% T3ADSB, T3ESB and T3PSB, respectively) was noted in all samples. Moreover, in the sample from Arundo donax and Eucalyptus camaldulensis, a large amount of GH18 (3.9% and 4. % respectively) was detected. This family is reported to include both chitinases and endo-β-N-acetylglucosaminidases but also sub-families of non-hydrolytic proteins. In the metagenome from Eucalyptus camaldulensis, CAZymes belonging to family GH13 were relatively abundant. The GH13 enzymes act on a wide range of different substrates and have been subdivided into almost 40 subfamilies, most of which are monofunctional24. In particular, in all three samples, only GH13 belonging to subfamily 11 (reported having debranching activity on glycogen, amylopectin and their β-limit dextrins) and subfamily 30 (involved in the hydrolysis of terminal α-D-glucose residues with release of monomers) were detected. Moreover, the sample from Populus nigra showed a high abundance of GHs belonging to GH23 family (4.1%). All the enzymes belonging to GH23 family were reported to have activity on peptidoglycan and, in particular, the lysozymes to have activity even on chitin and chitooligosaccharides.
The microbial diversity of the ORFs predicted to encode GHs from the three lignocellulosic biomasses was also investigated to identify the bacterial and fungal genera encoding enzymes involved in the carbohydrate metabolism. The microbial biodiversity related to GHs was very high and twenty-six bacterial and forty-two fungal genera were recovered with an incidence ≥1% in at least one sample (Fig. 5). Streptomyces was a dominant genus in all samples accounting for 18.1%, 28.3% and 30.0% in A. donax, E. camaldulensis and P. nigra, respectively, of the microbial genera related to GH (Fig. 5A).
Unlike the other biomasses in which Streptomyces was the dominant taxon, in P. nigra the most abundant GHs were related to Paenibacillus (30.38%) (Fig. 5A). Pseudomonas and Rhizobium were the other genera recovered in all lignocellulosic biomasses in relationship to the GHs (Fig. 5A) showing an abundance ranging from 1.1% to 6.3% depending on plant species.
The abundance of the other taxa related to GHs is strictly correlated to substrate source. A high percentage of bacteria belonging to Stenotrophomonas genus encoded GHs in A. donax (8.8%) and P. nigra (10.3%) biomass (Fig. 5A). Also most GHs in A. donax biomass was encoded by genera belonging to the class of Actinobacteria and in particular, Curtobacterium (6.0%), Microbacterium (8.7%), Nocardiopsis (6.0%) and Promicromonospora (1.2%) (Fig. 5A). In contrast, members belonging to α-Proteobcteria (Novosphingobium and Isoptericola) and γ-Proteobacteria (Pseudoxanthomonas, Xanthomonas, Dyella and Rhodanobacter) classes characterized E. camaldulensis biomass; while γ-Proteobacteria (Stenotrophomonas and Xanthomonas) together to Bacillus (5.8%) were the other taxa recovered in the P. nigra biomass (Fig. 5A).
In this study, the GHs also originated from a wide range of fungal taxa. Among the forty-two genera occurring with an abundance ≥1% in at least one sample, only Pestalotiopsis was recovered in all lignocellulosic biomasses (with an incidence of 2.7%, 2.0% and 12.50 in A. donax, E. camaldulensis and P. nigra, respectively) (Fig. 5B).
Overall, the highest fungal biodiversity related to GHs was found in E. camaldulensis (35 genera) followed by A. donax (13 genera) and P. nigra (5 genera). Although the highest biodiversity was found in E. camaldulensis, all fungal genera occurred at low percentage with the exception of Nectria (10.3%) and Sporothrix (20.6%) (Fig. 5B). By contrast, in A. donax biomass, most of the GHs were related to Fusarium (18.6%), Nectria (14.2%) and Trichoderma (12.4%), while the abundance of the other taxa range approximately from 8% to 1% (Fig. 5B).
Finally, the lowest fungal diversity was found in P. nigra biomass. The most abundant taxa recovered in this plant sample was Togninia (37.5%) followed by Batrachochytrium (25.5%) and Pestalotiopsis, Meyerozyma and Ustilago (12.5%) (Fig. 5B). However, although this result seemed suggest that the fungal taxa were abundant, overall very few GHs were related to them because only the 0.2% of total biodiversity was determined by fungi in this sample (Fig. 1).
KEGG pathway classification related to Glycoside Hydrolases
An in-depth KEGG pathway mapping was carried out for the putative genes coding for plant polysaccharides-degrading enzymes in order to obtain a specific, unique activity for each detected GH. As shown in Fig. 6, a high percentage of different cellulases were detected. In particular, β-glucosidases (EC 3.2.1.21, hydrolyzing cellobiose and other cellodextrins) and endo-1,4-β-glucanases (EC 3.2.1.4, performing the random internal hydrolysis of amorphous cellulose) were the most abundant putative enzymes involved in the hydrolysis of glycosidic bonds. In the samples from Arundo donax and Populus nigra, an abundance of chitinases (EC 3.2.1.14) was also detected (7.41% and 6.29% respectively). It is noteworthy that in all three samples putative genes coding for hemicellulases and accessory enzymes with a broad spectrum of activities were recognized. In particular, a high percentage of proteins involved in the degradation of (glucurono)(arabino)xylan–such as endoxylanases (E.C. 3.2.1.8) and β-xylosidases–and in the removal of arabinose–α-L-arabinofuranosidases (E.C. 3.2.1.55)–or galactose–α-galactosidases (E.C. 3.2.1.22)–substituents in hemicelluloses were detected. Moreover, several additional putative enzymes related to the hemicelluloses degradation–such as mannanases (EC 3.2.1.78), polygalacturonases (EC 3.2.1. 67) and feruloyl esterases (EC 3.1.1.73) were recognized in a lower percentage.
Discussion
In the last decades, the increasing interest in the use of renewable sources for green energy and chemicals has strongly stimulated search for new biocatalysts from different ecosystems for lignocellulose conversion. Therefore, in this work, microbial and enzymatic diversities potentially relevant to the degradation of plant biomass into fermentable sugars were explored through metagenomic approach in three dedicated lignocellulosic energy crops, Arundo donax, Eucalyptus camaldulensis and Populus nigra, after natural biodegradation22. Metagenomic DNA sequences were analysed to assess the total biodiversity, identify candidate genes coding for enzymes putatively involved in carbohydrates metabolism and that may be of use in lignocellulosic degradation, and evaluate microbial diversity related to GH families of predicted ORFs.
The microbial diversity results from this study were performed on the same samples previously characterised using 16S phylotyping in our earlier study22 with samples T3ADSB, T3ESB and T3PSB corresponding to samples At3UW, Et3UW and Pt3UW in that publication. Some taxa differed sharply in composition, e.g. Actinobacterial content of 40.1% vs 8.6% when T3ADSB and At3UW were compared. The substantial differences could be due to the different molecular methods adopted by Ventorino et al.22 for sequencing in comparison to those ones used in this study (amplicon sequencing of the 16S rRNA gene vs shotgun metagenomic sequencing) as well as to the different methods used for microbial DNA extraction. In fact, in the present work eDNA was extracted directly from lignocellulosic biomass samples, whereas Ventorino et al.22 extracted DNA from pellets obtained from microbial cells desorbed from lignocellulosic materials. This approach could determine an underrepresentation of filamentous bacteria, and in general of relative abundance of Actinobacteria, in amplicon data reported in the previous work. Discrepancies between different approaches to quantifying the taxonomic composition of microbiomes are a known phenomenon. According to Morgan et al.25 the relative abundances of microbial taxa inferred from metagenomic sequences significantly varied depending on the DNA extraction and sequencing protocols utilized. Recently, Duncan et al.26 revealed that shotgun metagenomics detected a much higher abundance of Actinobacteria than amplicon sequencing.
Nevertheless, Actinobacteria were significant components of biomass in both studies. The prevalence of the actinobacterial genus Streptomyces could be due to the ability to synthetize enzymes, such as cellulases27,28, which efficiently degrade lignocellulosic materials under a wide range of environmental conditions29. Actinobacteria, and in particular, Streptomyces spp. were found to be major plant biomass degrading microbes in peat swamp forests and also ubiquituously present during the composting of chestnut green waste30,31.
Bacterial species belonging to Proteobacteria phylum, such as Pseudomonas spp. and Stenotrophomonas spp., were also retrieved in all lignocellulosic samples. Bacteria belonging to these genera are known to be able to produce a wide range of enzymes for efficient degradation of carboxymethylcellulose, (hemi)cellulose and lignin32,33. These results are in according with previous study in which culture-independent approach based on 16S rRNA gene sequence demonstrated that Proteobacteria was the taxa that heavily dominated the microbial community in different lignocellulosic biomass piles, remaining high during all degradation processes in natural conditions22. Moreover, Actinobacteria and Proteobacteria have been identified as the predominant bacterial phyla during composting of lignocellulosic waste exhibiting the enzymatic activities required for the degradation of this recalcitrant polymeric material34.
The occurrence of other bacterial taxa with a different abundance depending on plant species was also demonstrated in the investigated lignocellulosic biomasses. Interestingly, Bacillus genus covered approximately 8.0% of the total microbial biodiversity in P. nigra. Members belonging to Bacillus spp. isolated from different environments exhibit cellulolytic and/or hemicellulolytic activities to potentially breakdown the components of lignocellulosic material35,36,37,38. Moreover, different microbial strains belonging to Enterobacteriaceae family such as Pantoea, Rahnella and Erwinia, are frequently recovered in the gut of insects producing digestive enzymes implicated in the hydrolysis of cellulose39,40.
Moreover, a low abundance of eukaryotic populations was observed in all the lignocellulosic biomass samples. This result could be due to the fact that fungi have tough chitin walls that are difficult to breach. In fact, since fungal community patterns could be strongly dependent on the extraction method used41, their representation in this work could be depressed. However, among the fungal taxa retrieved, only Penicillium showed an incidence >1% in E. camaldulensis. Cellulolytic activity of this genus is well documented and there are several reports on β-glucosidase, cellulases and xylanases production from different Penicillium species42,43,44. Moreover, Ryckeboer et al.45 reported also the ability of Penicillium spp. to degrade lignin and starch making it a good candidate in the producing of industrial cellulases46.
Analysing the biodiversity related to GH families of predicted ORFs, a highly complex microbial community was found. With regard to bacterial biodiversity, Streptomyces, Pseudomonas and Rhizobium were found in all lignocellulosic biomass samples. In agreement with the results obtained analysing the total biodiversity, Streptomyces was the dominant taxon, confirming the ability of the members belonging to this genus to encode enzymes involved in cellulose and hemicellulose degradation. In fact, Streptomyces spp. is reported to produce different GHs that are well characterized47,48. In addition, the production of cellulolytic enzymes in Rhizobium spp. is related to their ability to nodulate leguminous plants. In fact, Rhizobium is a plant growth promoting rhizobacterium living as free-living saprophytes in the soil but also able to fix nitrogen establishing a symbiotic associations with a host plant49. The production of enzymes, such as cellulases, is fundamental to degrade plant cell wall polymers and penetrate in the host root50. García-Fraile et al.51 reported the ability to actively hydrolyse CM-cellulose of two bacterial strains isolated from decaying wood of Populus alba and classified as Rhizobium cellulosilyticum.
The prokaryotic biodiversity related to GHs was also dominated by Paenibacillus genus in the P. nigra biomass. Eida et al.52 reported the ability of different Paenibacillus isolates to efficiently contribute to cellulolytic and hemicellulolytic processes during composting of sawdust. Other taxa recovered in the P. nigra biomass that are known as plant biomass-degrading microbes were Stenotrophomonas and Xanthomonas (Proteobacteria) and Bacillus (Firmicutes). De Angelis et al.17 reported that the members of Proteobacteria as well as Firmicutes strongly dominated switchgrass-adapted communities comprising approximately 80% of the microbial richness.
Differently, in A. donax biomass the most of GHs was encoded by genera belonging to the class of Actinobacteria. These taxa are related to well characterized potent plant polysaccharide-degrading bacteria and play an important role in degradation of numerous polymers such as chitin, cellulose, lignin and polyphenol53.
With regard to fungal biodiversity related to GHs, diverse genera were found, and among these only Pestalotiopsis was recovered in all lignocellulosic biomasses. This result is in agreement with Cahyani et al.54 that reported the ubiquitous presence of Pestalotiopsis spp. during the composting process of rice straw. In fact, this endophytic fungus is able to secrete xylanases and cellulases also in salt stress conditions55 as well as produce a considerable amount of ligninolytic enzymes such as laccase56.
However, Sporothrix, Fusarium, Nectria and Trichoderma dominated the eukaryotic biodiversity related to GHs in A. donax and E. camaldulensis biomasses. These Ascomycota are known for their ability to produce cellulolytic enzymes57,58 and comprise many species involved in the degradation of recalcitrant substances such as cellulose, hemicellulose, pectin, and lignin59. Jurado et al.60 reported that fungi belonged to Ascomycota group were ubiquitous throughout the whole lignocellulose-based composting process.
The functional clustering of the predicted ORFs to eggNOG and KEGG databases showed high similarity among the three analyzed samples.
The prevalence of poorly characterized genes obtained by matching to eggNOG categories suggested the three detected biomasses as potential sources of not yet known genes. Moreover, the analysis of functional classification distribution among these three metagenomes, based on both the eggNOG and KEGG database, suggests that a large number of predicted genes were putatively associated with formation, breakdown and interconversion of polysaccharides. In particular, the relative abundance of genes linked to carbohydrates metabolism pathway was higher than or similar to that detected in metagenomes from samples with well-known lignocellulose-degrading ability, such as invasive snail crop microbiome61 and lower termite Coptotermes gestroi gut62. This result confirmed the high potentiality of the three analyzed metagenomes to express genes involved in lignocellulosic biomasses biotransformation.
Moreover, the inventory of the Carbohydrate-Active Enzymes families detected in the three samples interestingly revealed ORFs codifying for putative lytic polysaccharide monooxygenases (LPMOs). Nowadays, the interest is moving towards the LPMOs belonging to AA9 (formerly reported as GH61), AA11 or AA10 (formerly reported as CMB33) families, due to their ability to depolymerize the recalcitrant insoluble polysaccharides from highly crystalline cellulose, increasing the efficiency of lignocellulose saccharification8. Only a few of LPMOs have been discovered by metagenomic approach18. In metagenomes analyzed in this study, 3, 5 and 9 ORFs (for T3ADSB, T3ESB and T3PSB respectively) were assigned to family AA10, whereas only in the sequenced eDNA from A. donax 11 and 2 ORFs encoding putative enzymes belonging respectively to families AA9 and to AA11 were detected.
However, most of CAZymes detected in the three samples were related to putative plant-polysaccharides-targeting GHs. Based on the results obtained by Li et al.63 analyzing 46 finished metagenomic studies collected in Genomes OnLine Database (GOLD) by comparison against the CAZy sequences for homologues of glycosyl hydrolases using an e-value <10−40 as a cut-off threshold, the percentages of detected GHs in our study were higher than those present in metagenomic samples from soil, sludge and marine or lake environments. Furthermore, the diversity of GH family enzymes detected in the three samples was greater than that observed in insect or mammalian fecal and gut samples with high lignocellulose-degrading potentiality64, in line with the detected high phylogenetic diversity.
The putative genes encoding proteins involved in the degradation of plant polysaccharides were detected in the three samples. Moreover, accepted that the obtained data are sensitive to the bioinformatics workflow used in the different studies, a comparison between the GHs detected in our samples and in metagenomes well known as reservoirs of genes involved in lignocellulose-degradation was attempted (Table 5), based on the classification provided by Allgaier et al.65. The detection and assignment of glycoside hydrolases in our metagenomes and bovine rumen metagenome66 were performed by BLAST-based procedures against the CAZy database, whilst the searches for glycoside hydrolases in metagenomes from six years old elephant feces64, yak67 and cow rumen66, snail crop61, macropod gut68 and termite hindgut19 were performed by using HMMER hmmsearch with Pfam. The putative ORFs encoding enzymes related to the oligosaccharides degradation represented the majority of the total plant-polysaccharides-targeting GHs and their abundance (~26% for T3ADSB, ~22% for T3ESB and ~24% for T3PSB) was comparable to that detected in samples from cow rumen20 and termite hindgut19. Most belonged to GH1, GH2 and GH3 families including β-glucosidases, β-galactosidases, β-mannosidase, β-glucuronidase, β-xylosidase and other enzymes involved in the breakdown of a large variety of β-linked disaccharides. Due to the high diversity of protein structural arrangements, a robust phylogenetic classification of these families is currently not available. In addition, enzymes belonging to GH43 family were highly represented (mainly in T3ADSB and T3PSB). This family includes β-xylosidases and α-L-arabinofuranosidases and several bifunctional enzymes; moreover, due to a remarkable expansion in GH43 family resulting from novel studies about plant cell wall degrading organisms, members of this family may have a more extensive range of specificities69.
In the sample T3ADSB, the abundance of endocellulases was double than T3ESB and T3PSB and comparable to that detected in the six-years-old elephant feces by Ilmberger et al.64 and in yak rumen by Dai et al.67. The GH5 and GH6 were the most represented families. While only endoglucanase and cellobiohydrolase activities have been reported for the members of GH6 family, the enzymes belonging to Glycoside Hydrolases family 5 have a variety of specificities: this is one of the largest of all CAZy glycoside hydrolase families comprising not only cellulases, such as endo- and exo-glucanases and β-glucosidases, but even hemicellulases, such as endo- and exo-mannanases and β-mannosidase. Interestingly, in T3ADSB an amount of enzymes belonging to GH7 family (that includes mainly enzymes from fungi) was detected, although in this sample only a small amount of fungi was identified. The cellobiohydrolases belonging to GH7 family are the most active exoglucanases known70.
The abundance of hemicellulases detected in the three investigated samples was comparable with the percentage occurred in bovine rumen66 and macropod gut68. In T3ADSB, more that 1% of CAZymes belonged to GH10 family. These enzymes have received much attention for their use in degradation of lignocellulosic biomass for biochemicals production, due to their involvement in breaking down of xylan, the major component of the hemicellulose. Moreover, in the three samples a percentage of 1–2% of enzymes belonging to Glycoside Hydrolases family 28 was identified. These CAZymes are involved in the degradation of pectin, a structural constituent of the plant cell wall.
About 1% of the debranching enzymes detected in the three samples belonged to family GH51: this percentage was higher than that detected in yak and cow rumen20,67 and snail crop61. Moreover, the samples T3ESB and T3PSB revealed an abundance of family GH67 members. The enzymes belonging to these two families (α-L-arabinofuranosidases and α-glucuronidases respectively) are required for the optimal breakdown of glucoronoarabinoxylans (GAXs), one of the major component of hemicellulose, composed by β(1–4)-D-xylose linked polymers branched with arabinose and glucuronic acid. Interestingly, in the samples from Eucalyptus camaldulensis and Populus nigra 2.5% and 0.6% respectively of total GHs belonged to GH78 α-L-rhamnosidases. These enzymes catalyze the hydrolysis of α-L-rhamnosyl-linkages in L-rhamnosides present in polysaccharides such as rhamnogalacturonan.
Furthermore, the in-depth KEGG pathway mapping of the genes encoding enzymes involved in the polysaccharides hydrolysis confirmed that all three analyzed samples were a valuable source of a full set of diversified (hemi)cellulases and accessory enzymes required for an effective pretreated lignocellulosic biomass hydrolysis71,72.
Methods
Lignocellulosic biomasses and DNA extraction
Chipped wood from A. donax, E. camaldulensis and P. nigra was used to form piles of approximately 30 kg that were submitted to biodegradation under natural conditions as previously reported22. Briefly, the biomass piles were placed without any coverage under oak trees in the woodland at the Department of Agriculture (Naples, Italy). After 135 days of natural biodagradation, samples of 0.5 kg were collected from the external part (right and left side of the pile) and the internal central part of the biomass, milled and stored at −20 °C until use.
3 g of each milled biomass were used to isolate the total environmental DNA (eDNA), including genetic material from microorganisms adherent to the plant biomass. The eDNA extraction was performed by using the PowerSoil® DNA Isolation Kit (MO BIO Laboratories, INC. CARLSBAD, CA) according to the manufacturer’s instructions. NanoDrop and Qubit Fluorometer tests were performed to verify the level of purity of recovered eDNA. About 25 μg of each eDNA samples were sent to BGI Tech Solutions Co., Ltd. (Hongkong, China) for further analyses.
Metagenome shotgun sequencing and assembly
Three qualified 270 bp short-insert libraries were constructed from the eDNA samples. The genetic material was firstly sheared into smaller fragments by nebulization. Then the overhangs resulting from fragmentation were converted into blunt ends by using T4 DNA polymerase, Klenow Fragment and T4 Polynucleotide Kinase. An “A” base was added to the 3′ phosphorylated blunt ends of the DNA fragments and the adapters were ligated. Undersized fragments were removed with Agencourt AMPure XP Beads (Beckman Coulter Inc, Brea, CA, USA). The libraries were then subjected to 151 paired-end sequencing on Illumina HiSeq2000 platform by using TruSeq SBS Kit v3-HS (Illumina, San Diego, CA, USA) following standard pipelines. The generated raw data were trimmed: leading or trailing low quality (below quality 3) or 3 N bases were cut off and reads contaminated by adapter (15 bases overlapped by reads and adapter) or with low quality (20) bases (40% as default, parameter setting at 36 bp) were removed. The data were filtered by using readfq.v5 (unpublished software, BGI).
The obtained Clean Data were used to perform the metagenome sequences. Before assembly, k-mer analysis (K-mer length 15) was done to evaluate the sequencing depth for each sample. SOAPdenovo (Version 1.06)73 was used to assemble filtered data in contigs and scaffolds and assembly results were optimized by in-house scripts (key parameters: -r 2; -l 35; -M 4; -p 1) using the SOAP-aligner tool.
Metagenome analyses
To evaluate the microbial composition, the assembled contigs were matched against the bacteria, fungi and archaea sequences extracted from NCBI NR database (release-20130408) by BLASTx with 1 × 10−8 and ≥90% identity cut-off. Each contig was subsequently taxonomically assigned by MEGAN version 4.70.474, based on lowest common ancestor (LCA). The taxonomic abundance was determined by read count of each taxon, after mapping to the assembled contigs using SOAPaligner version 2.2175 with default parameters. Assembled contigs are used to predict genes by using MetaGeneMark Software76 (version 2.10, default parameters) based on assembly results.
Functional annotations of predicted amino acid sequences were performed by BGI Tech Solutions Co., Ltd. (Hongkong, China) by using BLASTP (version 2.2.23). In particular, the metabolism pathway assignment of the predicted protein was performed using the Enzyme Commission (EC) number in the Kyoto Encyclopedia of Genes and Genomes (KEGG)–version 59–databases77 and the annotation of each contig with functional categories was carried out by matching against Evolutionary genealogy of genes: Non-supervised Orthologous Groups (eggNOG)–version 3.078. Both comparison were performed by using BLAST79 with e-value threshold of 1e-5 and a 40% minimum percentage of identity to assign the subject sequence to a specific function family. Moreover, in order to explore in depth the ability of the microbial biodiversity detected in the samples to degrade lignocellulose, the putative encoded protein sequences were first compared to the full length sequences of the CAZy database using BLAST75 and query sequences that produced a e-value >10−6 were discarded. Query sequences that produced an e-value <10−6 and aligned over their entire length with a protein in the database with >50% identity were automatically assigned to the same family as the subject sequence. The remaining query sequences were subjected to manual curation which involved BLAST searches against a library built with partial sequences corresponding to individual GH, PL, CE and CBM modules and examination of the conservation of specific family patterns and features such as catalytic residues (where known).
Additional Information
Accession codes: The data is available in the Sequence Read Archive database of the National Center of Biotechnology Information (SRP090993).
How to cite this article: Montella, S. et al. Discovery of genes coding for carbohydrate-active enzyme by metagenomic analysis of lignocellulosic biomasses. Sci. Rep. 7, 42623; doi: 10.1038/srep42623 (2017).
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
Haberl, H., Beringer, T., Bhattacharya, S. C., Erb, K.-H. & Hoogwijk, M. The global technical potential of bio-energy in 2050 considering sustainability constraints. Curr. Opin. Environ. Sustain. 2, 394–403 (2010).
Liguori, R., Amore, A. & Faraco, V. Waste valorization by biotechnological conversion into added value products. Appl. Microbiol. Biotechnol. 97, 6129–6147 (2013).
Fiorentino, N. et al. Assisted phytoextraction of heavy metals: compost and Trichoderma effects on giant reed (Arundo donax L) uptake and soil N-cycle microflora. Ital. J. Agron. 8, 244–254 (2013).
Mariani, C. et al. Origin, diffusion and reproduction of the giant reed (Arundo donax L.): a promising weedy energy crop. Ann. Appl. Biol. 157, 191–202 (2010).
Gao, D., Chundawat, S. P. S., Krishnan, C., Balan, V. & Dale, B. E. Mixture optimization of six core glycosyl hydrolases for maximizing saccharification of ammonia fiber expansion (AFEX) pretreated corn stover. Bioresour. Technol. 101, 2770–2781 (2010).
Gao, D. et al. Hemicellulases and auxiliary enzymes for improved conversion of lignocellulosic biomass to monosaccharides. Biotechnol. Biofuels. 4, 5 (2011).
Cantarel, B. L. et al. The Carbohydrate-Active EnZymes database (CAZy): An expert resource for glycogenomics. Nucleic Acids Res. 37, D233–D238 (2009).
Beeson, W. T., Vu, V. V., Span, E. A., Phillips, C. M. & Marletta, M. A. Cellulose degradation by Polysaccharide Monooxygenases. Ann. Rev. Biochem. 84, 923–46 (2015).
Brethauer, S., Studer, M. H. & Wyman, C. E. Application of a slurry feeder to 1 and 3 stage continuous simultaneous saccharification and fermentation of dilute acid pretreated corn stover. Bioresour. Technol. 170, 470–476 (2014).
Du, J. et al. Enzymatic liquefaction and saccharification of pretreated corn stover at high-solids concentrations in a horizontal rotating bioreactor. Bioprocess Biosyst. Eng. 37, 173–181 (2014).
Gupta, R., Kumar, S., Gomes., J. & Kuhad, R. C. Kinetic study of batch and fed-batch enzymatic saccharification of pretreated substrate and subsequent fermentation to ethanol. Biotechnol. Biofuels 20, 5–16 (2012).
Jin, M., Gunawan, C., Balan, V., Yu, X. & Dale, B. E. Continuous SSCF of AFEXTM pretreated corn stover for enhanced ethanol productivity using commercial enzymes and Saccharomyces cerevisiae 424A (LNH-ST). Biotechnol. Bioeng. 110, 1302–1311 (2013).
Kadić, A., Palmqvist, B. & Lidén, G . Effects of agitation on particle-size distribution and enzymatic hydrolysis of pretreated spruce and giant reed.Biotechnol. Biofuels 7, 77–86 (2014).
Liguori, R., Ventorino, V., Pepe, O. & Faraco, V. Bioreactors for lignocellulose conversion into fermentable sugars for production of high added value products. Appl. Microbiol. Biotechnol. 100, 597–611 (2016).
Pihlajaniemi, V., Sipponen, S., Sipponen, M. H., Pastinen, O. & Laakso, S. Enzymatic saccharification of pretreated wheat straw: comparison of solids-recycling, sequential hydrolysis and batch hydrolysis. Bioresour. Technol. 153, 15–22 (2014).
Berrin, J. G. et al. Exploring the natural fungal biodiversity of tropical and temperate forests toward improvement of biomass conversion. Appl. Environ. Microbiol. 78, 6483–6490 (2012).
DeAngelis, K. M. et al. Strategies for enhancing the effectiveness of metagenomic-based enzyme discovery in lignocellulolytic microbial communities. Bioenerg. Res. 3, 146–158 (2010).
Montella, S., Amore, A. & Faraco, V. Metagenomics for the development of new biocatalysts to advance lignocellulose saccharification for bioeconomic development. Crit. Rev. Biotechnol. 18, 1–12 (2015).
Warnecke, F. et al. Metagenomic and functional analysis of hindgut microbiota of a wood-feeding higher termite. Nature 450, 560–565 (2007).
Hess, M. et al. Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science 331, 463–467 (2011).
Dougherty, M. J. et al. Glycoside hydrolases from a targeted compost metagenome, activity-screening and functional characterization. BMC Biotechnol. 12, 38 (2012).
Ventorino, V. et al. Exploring the microbiota dynamics related to vegetable biomasses degradation and study of lignocellulose-degrading bacteria for industrial biotechnological application. Sci. Rep. 5, 8161 (2015).
Abbott, D. W., Eirín-López, J. M. & Boraston, A. B. Insight into ligand diversity and novel biological roles for family 32 carbohydrate-binding modules. Mol. Biol. Evol. 25, 155–167 (2008).
Stam, M. R., Danchin, E. G., Rancurel, C., Coutinho, P. M. & Henrissat, B. Dividing the large glycoside hydrolase family 13 into subfamilies: towards improved functional annotations of alpha-amylase-related proteins. Protein Eng Des Sel. 19, 555–62 (2006).
Morgan, J. L., Darling, A. E. & Eisen, J. A. Metagenomic sequencing of an in vitro-simulated microbial community. PLoS ONE 5(4), e10209 (2010).
Duncan, D. S., Jewell, K. A., Suen, G. & Jackson, R. D. Detection of short-term cropping system-induced changes to soil bacterial communities differs among four molecular characterization methods. Soil Biol. Biochem. 96, 160–168 (2016).
Amore, A. et al. Cloning and recombinant expression of a cellulase from the cellulolytic strain Streptomyces sp. G12 isolated from compost. Microb. Cell Fact. 11, 164 (2012).
Woo, H. L., Terry, Hazena, C., Simmonsa, B. A. & DeAngelis, K. M. Enzyme activities of aerobic lignocellulolytic bacteria isolated from wet tropical forest soils. Syst. Appl. Microbiol. 37, 60–67 (2014).
Wang, C. et al. New insights into the structure and dynamics of actinomycetal community during manure composting. Appl. Microbiol. Biotechnol. 98, 3327–3337 (2014).
Kanokratana, P. et al. Insights into the phylogeny and metabolic potential of a primary tropical peat swamp forest microbial community by metagenomic analysis. Microb. Ecol. 61, 518–528. (2011).
Ventorino, V. et al. Chestnut green waste composting for sustainable forest management: Microbiota dynamics and impact on plant disease control. J. Environ. Manage. 166, 168–177 (2016).
Talia, P. et al. Biodiversity characterization of cellulolytic bacteria present on native chaco soil by comparison of ribosomal RNA genes. Res. Microbiol. 163, 221–232 (2012).
Wang, Y. et al. A novel lignin degradation bacterial consortium for efficient pulping. Bioresour. Technol. 139, 113–119 (2013).
Lopez-Gonzalez, J. A. et al. Dynamics of bacterial microbiota during lignocellulosic waste composting: studies upon its structure, functionality and biodiversity. Bioresour. Technol. 175, 406–416 (2015).
Amore, A., Pepe, O., Ventorino, V., Aliberti, A. & Faraco, V. Cellulolytic Bacillus strains from natural habitats–A review. Chim. Oggi/Chem. Today 31, 49–52 (2013a).
Amore, A. et al. Industrial waste based compost as a source of novel cellulolytic strains and enzymes. FEMS Microbiol. Lett. 339, 93–101 (2013b).
Di Pasqua, R. et al. Influence of different lignocellulose sources on endo-1,4-ß-glucanase gene expression and enzymatic activity of Bacillus amyloliquefaciens B31C. Bioresources 9, 1303–1310 (2014).
Jones, S. M., Van Dyk, J. S. & Pletschke, B. I. Bacillus subtilis SJ01 produces hemicellulose degrading multi-enzyme complexes. Bioresources 7, 1294–1309 (2012).
Anand, A. A. et al. Isolation and characterization of bacteria from the gut of Bombyx mori that degrade cellulose, xylan, pectin and starch and their impact on digestion. J. Insect Sci. 10, 107 (2010).
Morales-Jiménez, J., Zúñiga, G., Ramírez-Saad, H. C. & Hernández-Rodríguez, C. Gut-associated bacteria throughout the life cycle of the bark beetle Dendroctonus rhizophagus Thomas and Bright (Curculionidae: Scolytinae) and their cellulolytic activities. Microb. Ecol. 64, 268–278 (2012).
Plassart, P. et al. Evaluation of the ISO standard 11063 DNA extraction procedure for assessing soil microbial abundance and community structure. PLoS ONE 7(9), e44279 (2012).
Camassola, M. & Dillon, A. J. P. Biological pretreatment of sugar cane bagasse for the production of cellulases and xylanases by Penicillium echinulatum . Indus. Crops Prod. 29, 642–647 (2008).
Jorgensen, H., Eriksson, T., Borjesson, J., Tjerneld, F. & Olsson, L. Purification and characterization of five cellulases and one xylanase from Penicillium brasilianum IBT 20888. Enzyme Microb. Technol. 32, 851–61 (2003).
Krogh, K. B. R. M. et al. Characterization and kinetic analysis of a thermostable GH3 b-glucosidase from Penicillium brasilianum . Appl. Microbiol. Biotechnol. 86, 143–154 (2010).
Ryckeboer, J., Mergaert, J., Coosemans, J., Deprins, K. & Swings, J. Microbiological aspects of biowaste during composting in a monitored compost bin. J. Appl. Microbiol. 94, 127–137 (2003).
Adsul, M. G., Singhvi, M. S., Gaikaiwari, S. A. & Gokhale D. V. Development of biocatalysts for production of commodity chemicals from lignocellulosic biomass. Bioresour. Technol. 102, 4340–4312 (2011).
Ducros, V. et al. Substrate Specificity in Glycoside Hydrolase Family 10. J Biol Chem 28, 23020–23026 (2000).
Goedegebuur, F. et al. Cloning and relational analysis of 15 novel fungal endoglucanases from family 12 glycosyl hydrolase. Curr. Genet. 41, 89–98 (2002).
Ventorino, V. et al. Response to salinity stress of Rhizobium leguminosarum bv. viciae strains in the presence of different legume host plants. Ann. Microbiol. 62, 811–823 (2012).
Robledo, M. et al. Rhizobium cellulase CelC2 is essential for primary symbiotic infection of legume host roots. Proc. Natl. Acad. Sci. USA 105, 7064–7069 (2008).
García-Fraile, P. et al. Rhizobium cellulosilyticum sp. nov., isolated from sawdust of Populus alba . Int. J. Syst. Evol. Microbiol. 57, 844–848 (2007).
Eida, M. F., Nagaoka, T., Wasaki, J. & Kouno, K. Isolation and characterization of cellulose-decomposing bacteria inhabiting sawdust and coffee residue composts. Microbes Environ. 27, 226–233 (2012).
Castillo, J. M., Romero, E. & Nogales, R. Dynamics of microbial communities related to biochemical parameters during vermicomposting and maturation of agroindustrial lignocellulose wastes. Bioresour. Technol. 146, 345–354 (2013).
Cahyani, V. R., Matsuya, K., Asakawa, S. & Kimura, M. Succession and phylogenetic profile of eukaryotic communities in the composting process of rice straw estimated by PCR-DGGE analysis. Biol. Fertil. Soils 40, 334–344 (2004).
Arfi, Y. et al. Characterization of salt-adapted secreted lignocellulolytic enzymes from the mangrove fungus Pestalotiopsis sp. Nat. Commun. 4, 1810 (2013).
Chen, H. Y., Xue, D. S., Feng, X. Y. & Yao, S. J. Screening and production of ligninolytic enzyme by a marine-derived fungal Pestalotiopsis sp. J63. Appl Biochem Biotechnol 165, 1754–1769 (2011).
Gherbawy, Y. A. M. H. & Abdelzaher, H. M. A. Isolation of fungi from tomato rhizosphere and evaluation of the effect of some fungicides and biological agents on the production of cellulase enzymes by Nectria haematococca and Pythium ultimum var. ultimum . Czech Micol (1999).
Wenzel, M., Schoènig, I., Berchtold, M., KaÈmpfer, P. & KoÈnig, H. Aerobic and facultatively anaerobic cellulolytic bacteria from the gut of the termite Zootermopsis angusticollis. J. Appl. Micobiol. 92, 32–40 (2002).
Li, H. et al. Molecular analyses of the functional microbial community in composting by PCR-DGGE targeting the genes of the b-glucosidase. Bioresour. Technol. 134, 51–58 (2013).
Jurado, M., López, M. J., Suárez-Estrella, F., Vargas-García, M. C., López-González, J. A. & Moreno, J. Exploiting composting biodiversity: Study of the persistent and biotechnologically relevant microorganisms from lignocellulose-based composting. Bioresour. Technol. 162, 283–293 (2014).
Cardoso, A. M. et al. Metagenomic analysis of the microbiota from the crop of an invasive snail reveals a rich reservoir of novel genes. PLoS ONE 7(11), e48505 (2012).
Do, T. H. et al. Mining biomass-degrading genes through Illumina-based de novo sequencing and metagenomic analysis of free-living bacteria in the gut of the lower termite Coptotermes gestroi harvested in Vietnam. J. Biosci. Bioeng. 118(6), 665–671 (2014).
Li, L. L., McCorkle, S. R., Monchy, S., Taghavi, S. & van der Lelie, D. Bioporspecting metagenomes: glycosyl hydrolases for converting biomass. Biotechnol. Biofuels 18(2) (2009).
Ilmberger, N. et al. A comparative metagenome survey of the fecal microbiota of a breast- and a plant-fed Asian elephant reveals an unexpectedly high diversity of glycoside hydrolase family enzymes. PLoS ONE 9(9), e106707 (2014).
Allgaier et al. Targeted Discovery of Glycoside Hydrolases from a Switchgrass-Adapted Compost Community. PLoS ONE 5(1), e8812 (2010).
Brulc, J. M. et al. Gene-centric metagenomics of the fiber-adherent bovine rumen microbiome reveals forage specific glycoside hydrolases. Proc. Natl. Acad. Sci. USA 106(6), 1948–1953 (2009).
Dai, X. et al. Metagenomic insights into the fibrolytic microbiome in yak rumen. PLoS ONE 7(7), e40430 (2012).
Pope, P. B. et al. Adaptation to herbivory by the Tammar wallaby includes bacterial and glycoside hydrolase profiles different from other herbivores. PNAS 107, 14793–14798 (2010).
Mewis, K., Lenfant, N., Lombard, V. & Henrissat, B. Dividing the large glycoside hydrolase family 43 into subfamilies: a motivation for detailed enzyme characterization. Appl. Environ. Microbiol. 82, 1686–1692 (2016).
Xu, Q., Adney, W. S., Ding, S. Y. & Michael, H. E. Cellulases for biomass conversion in Industrial Enzymes . Structure, Function and Applications (eds Polaina, J. & MacCabe, A. P. ) 35–50 (Springer, 2007).
Hu, J., Arantes, V. & Saddler, J. N. The enhancement of enzymatic hydrolysis of lignocellulosic substrates by the addition of accessory enzymes such as xylanase: is it an additive or synergistic effect? Biotechnol. Biofuels 4(36) (2011).
Hu, J. et al. The addition of accessory enzymes enhances the hydrolytic performance of cellulase enzymes at high solid loadings. Bioresour. Technol. 186, 149–153 (2015).
Li, R. et al. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20, 265–272 (2010).
Huson, D. H., Mitra, S., Ruscheweyh, H. J., Weber, N. & Schuster, S. C. Integrative analysis of environmental sequences using MEGAN4. Genome Res. 21, 1552–1560 (2011).
Li, R. et al. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 25, 1966–1967 (2009).
Zhu, W., Lomsadze, A. & Borodovsky, M. Ab initio gene identification in metagenomics sequences. Nucleic Acids Res. 38, e132 (2010).
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, D109–D114 (2012).
Powell, S. et al. eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges. Nucleic Acids Res. 40, D284–D289 (2012).
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Acknowledgements
This work was supported by grant from the Ministero dell’Università e della Ricerca Scientifica Industrial Research Project “Development of green technologies for production of BIOchemicals and their use in preparation and industrial application of POLImeric materials from agricultural biomasses cultivated in a sustainable way in Campania region–BioPoliS” PON03PE_00107_1/1, funded in the frame of Operative National Programme Research and Competitiveness 2007–2013 D. D. Prot. n. 713/Ric. 29.10.2010.
Author information
Authors and Affiliations
Contributions
S.M. carried out the experiments, analysed the results for the part of enzymes’ sequences analysis and drafted the manuscript for this part. V.V. carried out the experiments, analysed the results for the part of microorganisms’ data analysis and drafted the manuscript for this part. B.H. and V.L. analyzed CAZymes data using the tools of the carbohydrate-active enzymes database. O.P. contributed to conceiving the study and helped to revise the manuscript. V.F. conceived the study, participated in its design and coordination and revised and corrected the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Montella, S., Ventorino, V., Lombard, V. et al. Discovery of genes coding for carbohydrate-active enzyme by metagenomic analysis of lignocellulosic biomasses. Sci Rep 7, 42623 (2017). https://doi.org/10.1038/srep42623
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep42623
This article is cited by
-
Investigation of cold-active and mesophilic cellulases: opportunities awaited
Biomass Conversion and Biorefinery (2023)
-
Functional and sequence-based metagenomics to uncover carbohydrate-degrading enzymes from composting samples
Applied Microbiology and Biotechnology (2023)
-
Role of metagenomics in prospecting novel endoglucanases, accentuating functional metagenomics approach in second-generation biofuel production: a review
Biomass Conversion and Biorefinery (2023)
-
Comparative and pangenomic analysis of the genus Streptomyces
Scientific Reports (2022)
-
Influence of lignin level of raw material on anaerobic digestion process in reorganization and performance of microbial community
International Journal of Environmental Science and Technology (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.