Devosia are well known for their dominance in soil habitats contaminated with various toxins and are best characterized for their bioremediation potential. In this study, we compared the genomes of 27 strains of Devosia with aim to understand their metabolic abilities. The analysis revealed their adaptive gene repertoire which was bared from 52% unique pan-gene content. A striking feature of all genomes was the abundance of oligo- and di-peptide permeases (oppABCDF and dppABCDF) with each genome harboring an average of 60.7 ± 19.1 and 36.5 ± 10.6 operon associated genes respectively. Apart from their primary role in nutrition, these permeases may help Devosia to sense environmental signals and in chemotaxis at stressed habitats. Through sequence similarity network analyses, we identified 29 Opp and 19 Dpp sequences that shared very little homology with any other sequence suggesting an expansive short peptidic transport system within Devosia. The substrate determining components of these permeases viz. OppA and DppA further displayed a large diversity that separated into 12 and 9 homologous clusters respectively in addition to large number of isolated nodes. We also dissected the genome scale positive evolution and found genes associated with growth (exopolyphosphatase, HesB_IscA_SufA family protein), detoxification (moeB, nifU-like domain protein, alpha/beta hydrolase), chemotaxis (cheB, luxR) and stress response (phoQ, uspA, luxR, sufE) were positively selected. The study highlights the genomic plasticity of the Devosia spp. for conferring adaptation, bioremediation and the potential to utilize a wide range of substrates. The widespread toxin-antitoxin loci and ‘open’ state of the pangenome provided evidence of plastic genomes and a much larger genetic repertoire of the genus which is yet uncovered.
Devosia comprises a group of motile, gram-negative bacteria within the class Alphaproteobacteria and family Hyphomicrobiaceae1. The first recognized species of the genus was Pseudomonas riboflavina IFO13584 described by Foster in 19442 from riboflavin-rich soil which was reclassified into Devosia riboflavina in 19961. Since then, many members of this genus have been reported from diverse ecological niches. Although their distribution is ubiquitous including their presence in human cerebrospinal fluid3, nodules of legume plants4,5 and beach sediment6, members of this genus have been mainly reported from soils contaminated with hexachlorocyclohexane (HCH)7,8, mycotoxins (deoxynivalenol)9,10 and other hydrocarbon pesticides11.
In an effort to characterize the culturable diversity of soil contaminated with HCH12,13,14,15,16,17,18, we isolated and characterized four novel members of the genus Devosia viz. D. chinhatensis IPL187, D. crocina IPL208, D. albogilva IPL158 and D. lucknowensis L1519. Although isolated from HCH contaminated soil, these isolates were not able to degrade HCH isomers8. However, members of the genus are best studied for their potential to degrade several toxic compounds, establishing their promising candidature for bioremediation2,9. Previous studies have aimed to characterize their metabolic routes of detoxification20. In spite of their abundance in culture collections and public repositories, the genetic repertoire that enables them to survive in harsh environments have not been elucidated. Here, we report the first comparative genomic study of 27 members of genus Devosia, which provides valuable insights into their adaptations, the role of environment in shaping their genomes and the degree of genomic evolution in response to different environmental pressures.
Our study suggested the influx of new metabolic capabilities into the “open” pangenome of Devosia. Besides, the phylogenetic relationships of the group were fairly consistent. The study revealed that the genomes harbor a large diversity of transporters involved in uptake of di- and oligo-peptides from the environment. These peptide transport systems enable bacteria to take up short peptides of different amino-acid composition for satisfying nutritional demands and have been extensively studied in species of Lactococcus and Staphylococcus21,22. Besides increasing nutritional fitness, these permeases are also shown to be involved in signaling and virulence in Staphylococcus aureus, Borrelia burgdorferi and Bacillus thuringiensis23,24,25,26. Here, our analysis revealed the high diversity of these permeases encoded within genus Devosia for enabling efficient nutrient utilization and cell signaling required at such environments. Additionally, the large diversity of their substrate binding components reflected their wide range of substrates utilization. Positive evolution and selection of genes associated with growth and utilization of toxins highlights future applications in bioremediation. Further, the genomic repertoire adapted for utilization of organic sulfur, phosphorus and aromatic compounds are presumed to enable the members of the genus Devosia to survive in harsh sites. The presence of toxin-antitoxin (TA) loci within their genomes provided evidence of enhanced genome plasticity for maintaining a wide range of biological functions including stress response.
Results and Discussion
Genome analysis of twenty seven strains of the genus Devosia showed >96% completeness establishing the reliability of the datasets for comparative analyses. The overall genomic features of the strains are listed in Table 1. The genome size ranged from 3.5 to 5.8 Mbp with an average genome size of 4.3 ± 0.6 Mbp. Notably, the strains isolated from HCH contaminated sites namely, IPL-18, L15 and IPL-20 represented the three smallest genomes. It is difficult to explain the minimum genomic size of the organisms at such contaminated and nutrient depleted sites. However, in a previous study, where we isolated and described a Pseudomonas species that has the smallest genome with respect to its neighbours, this was attributed to the HCH isomer pressure shaping the genomic repertoire27. IPL18 and L15 also lacked genetic potential for utilization of organic phosphorus, rather found in other genomes. It is likely that the organisms lost several accessory gene clusters as part of adaptations to survive at HCH rich habitat. The two largest genomes of Root105 and Root413D1 harboured several hypothetical proteins in singletons along with the genes involved in drug resistance (daunorubicin and doxorubicin), serralysin and leukotoxin, type I secretion system, adhesion protein BmaC and polyamine synthesis proteins. These proteins are associated with protection, adhesion and biofilm formation and may facilitate the colonization of these strains in plant roots28,29. The %GC contents varied between 60.5–65.9% with an average of 63 ± 1.7%. Each genome, on an average consisted of 4,330 ± 620.4 protein coding genes. The number of predicted coding sequences correlated positively with the genome size (PMCC, r = 0.99). The large difference with respect to genome size and the coding potential among the species reflected towards the cadences in the genomic repertoire of the Devosia ecotypes in response to the different niches.
We deciphered the phylogenetic relationships of Devosia strains using marker genes, core genome and whole genome based average nucleotide identities. Maximum likelihood phylogeny based on the conserved set of 400 bacterial marker genes (Fig. 1)30 was reasonably consistent with that obtained from the concatenated alignments of 1,165 orthologous single copy core genes identified using OMCL algorithm (Fig. 2A). The phylogeny reconstructed from the whole genome wide ANIb also revealed identical topology (Fig. 2B). All the methods clearly resolved the genus into three different groups denoted as Group I, II and III with subclades (Figs. 1 and 2). Intriguingly, isolates from unrelated environments, for instance, CGMCC 1.10210 isolated from glacier cryoconite and YR412 isolated from rhizosphere clustered together while those from same habitats, such as isolates from Arabidopsis root appeared distantly in the phylogeny. This suggests that the role of environment in shaping bacterial genomes is still undefined.
We noticed high ANI values shared between the type strains D. soli GH2-10 and D. subaequoris HST3-14 (99.99%) with high percentage of conserved proteins (98.15%). However, as the percentage similarity shared between their submitted 16S rRNA gene sequences is less than 98.65%, it highlighted the need to redefine the boundaries for species demarcation due to low phylogenetic resolution of 16S rRNA marker gene31. Both the genomes were predicted to be 98.1% complete supporting the ANI based prediction. Similarly, we detected other pairs that might represent single species based on ANI values with the cutoff score of >95% defined for species demarcation that included the two DON degrading strains DDB001 and 17-2-E-8 (99%), Arabidopsis leaf isolates, Leaf64 and Leaf420 (96%), Arabidopsis root isolates, Root105 and Root413-D1 (98%). Moreover, the pairs also clustered together based on the comparative functional analysis while harboring the similar genetic repertoire. Further, the analysis showed that D. soli GH2-10 and D. subaequoris HST3-14 are likely the same species with a high ANIb value of 99.9% (Fig. 2B).
The pangenome of Devosia was analysed with aim to determine its genetic potential. The pangenome is defined as entire set of gene clusters present in a group and is constituted by the core and accessory genomes32. The core genome is formed of the conserved set of genomic functions found in all strains of the group. While accessory genome consists of the dispensable component which is present in a subset of genomes and the strain-specific content (singletons) that is unique to only one strain out of all the analysed genomes. The pangenome of Devosia was shown to be formed by 23,421 gene clusters (Distance: Euclidean; Linkage: Ward)33 that included 1,257 core, 10,383 dispensable and 11,781 strain-specific gene clusters. The small sized core (5.4%) and unique content (50.3%) forming more than half of the pangenome indicated that the species are highly diverged (Fig. 3). A robust analysis of the changes in pan- and core genome sizes upon sequential addition of genomes and their regression trends plotted as Tettelin best fit curve revealed indefinite increase in pangenome size up to the addition of the last genome. Therefore, the pangenome of Devosia may be classified as ‘open’ for expansion (Supplementary Fig. S1).
The distribution of strains based on the pangenomic clusters was deviated from the phylogenomic clustering which was partly reflected in the differences in the accessory genomic contents of Group III strains (Fig. 3). The core and unique gene clusters were further annotated into COG classes. The core genome was mostly conserved in the following: amino acid transport and metabolism (11.12%), translation and ribosomal biogenesis (11.27%), post translational modifications (6.42%), energy production and conversion (6.36%) and signal transduction (5.65%). As largely the species are soil microbes that inhabit intoxicated environments, their genomes are thus enriched in genes for efficient uptake of the restrained nutrients, sense chemotactic stimuli and transduce the signals for colonization. The classes for carbohydrate metabolism (5.44%), transcription (5.25%), replication (5.32%) and cell envelope synthesis (5.18%) were moderately abundant while intracellular trafficking, secondary metabolite synthesis, defense mechanisms and extracellular structures were limited in the core (0.5–2%). About 70% of the average genome size of the genus was not conserved indicating a high degree of genomic diversity. Isolate ATCC23634 was found to harbor highest numbers of singletons which is expected as it is the only isolate from freshwater (Table 1). Besides, it also harbored a large CRISPR locus with 16 spacer sequences unveiling its adaptive immunity resulting from previous viral encounters.
Comparative functional profiles
To gain more insights into specific functions, the top metabolic pathways of the genus were minimally reconstructed within individual genomes. Interestingly, the phylogenetically consistent groups of strains displayed different functional profiles and revealed an altogether different clustering pattern (Fig. 4). It suggests that their functional profiles might have been selected by the environment owing to evolutionary processes such as gene gain or loss and lateral gene transfer. The top metabolic pathways that were reconstructed within the genomes involved metabolism of sugars, fatty acids and amino acids, biosynthesis of antibiotics such as tetracycline, ansamycins and vancomycins, flagellar assembly and chemotaxis, ABC class transporters and degradation of chlorinated hydrocarbon compounds. These abundant functions are anticipated to provide survival benefits to the strains at the diverse niches that they inhabit. Strain DSM17137 was uncovered to be the most diverged strain within the genus with respect to its overall functional profile as the top metabolic pathways could not be reconstructed within its genome (Fig. 4). A major difference in the clades thus obtained was observed in the genes for synthesis of polyketide sugars that are important antimicrobial agents34 indicating that defence is not a primary function and hence not a part of the core genome. Concurrently, we noted that the clustering based on functional profiles was not strictly habitat-dependent. For instance, the strains isolated from plant leaves, Leaf64 and Leaf420 showed key differences in selenoamino acids utilization and polyketide sugar unit biosynthesis. This may be explained based on the fact that the process of gene gain or loss does not necessarily occur at the same rate in the isolates from similar habitats and hence the differences were observed. Similarly, the isolates from HCH contaminated soils showed different profiles for degradation of 1,2-dichloroethane and 3-chloroacrylic acid and for synthesis and degradation of ketone bodies. This suggests their dynamic genome repertoire and that the strains might be in the process of acquiring the genes for degradation of chlorinated hydrocarbons at this site.
Abundance of Oligo- and Di- peptide ABC transporters
As amino acid transport and metabolism emerged as one of the most abundant functions of the genus, we studied the genes of this class for determining the important survival strategies of Devosia. More precisely, we found these genomes to be enriched in the oligo-peptide permeases (Opp) and di-peptide permeases (Dpp). Opp and Dpp permeases are present in the bacterial membranes as multi-subunit protein complexes and function primarily in the uptake of peptides from the environment to serve as a source of carbon and nitrogen. These transport systems have been widely studied in species of Lactococcus, Staphylococcus, Borrelia and Bacillus where they have been shown to be involved in growth, signalling and virulence21,22,23,24,25,26. The permease complex has a typical structure of an ABC class transporter: a substrate binding protein OppA/DppA, two transmembrane proteins OppB/DppB and OppC/DppC and two membrane bound cytoplasmic ATP-binding proteins OppD/DppD and OppF/DppF35. The copy number of each of these transporters within the analysed strains is given in Supplementary Fig. S2. Their large diversity and abundance in Devosia was further checked by comparing these permeases with those in representative genomes (n = 27) of other genera of family Hyphomicrobiaceae (Supplementary Table S1). A large diversity in the organization of genes within operons was observed and many individual genes were found segregated throughout the genomes. As the presence of each of the gene in the cluster is not a prerequisite for the operon to be functional, their abundance might be an adaptation for uptake of large variety of peptides for optimal nutrition36. The gene copy number varied from 21 to as high as 93 Opp operon associated genes with an average of 60.7 ± 19.1 copies within each genome. Also, the genomes were abundant in Dpp permeases with 17 to 54 copies of associated genes within a genome and each genome carried an average number of 36.5 ± 10.6 genes. Their genetic diversity across the genus was determined by eliminating ~6.8% of the redundant sequences in each case (sequence identity = 100%) from a total of predicted 1,640 Opp and 986 Dpp sequences indicating high diversity of these transporters. An empirical measure of the diversity among the permeases and comparison of pairwise relationships was determined through sequence similarity network (SSN) analysis. In SSN, each protein sequence is represented by a node and any two nodes are connected by edges if they share more than the defined threshold similarity. The similarity networks for all non-redundant Opp and Dpp sequences were visualized, using the threshold pairwise Blastp e-value of 1e-30 and 1e-25 respectively. Each node in the resulting networks could not be connected with all other nodes through a finite path (Fig. 5A,B). OppABCDF and DppABCDF partitioned into 65 and 55 connected components respectively that included both homologous and heterologous clusters and isolated nodes. Through network analysis, we identified 29 Opp and 19 Dpp sequences that did not share homology with any other sequence suggesting an expansive short peptidic transport system within Devosia. Average neighborhood connectivity within the networks was interpreted as an increasing function in k both in case of Opp (correlation = 0.72, r2 = 0.77) and Dpp (correlation = 0.75, r2 = 0.71) suggesting scarce edges between low connected and highly connected nodes and highlighting the diversity among the sequences (Fig. 5C). Furthermore, closeness centrality that measures the closeness of a node with all other nodes was negatively correlated with the number of neighbors in both Opp (-0.038, r2 = 0.020) and Dpp (−0.121, r2 = 0) (Fig. 5C). More specifically, we analysed the diversity of substrate binding components (SBCs): OppA and DppA within these complexes. OppA partitioned into 12/29 isolated nodes while DppA constituted 9/19 isolated nodes. The network parameters are noted in Table 2. Notably, all the isolated nodes of SBCs belonged to the species of the Group III strains that were most diverged in phylogeny (Fig. 1). Both the networks were very sparse and analysis of the networks revealed that a random Opp sequence was similar to only 20.5% of all the sequences which was even less 6.2% in case of OppA (n = 343) (Table 2). At the same time, any random Dpp sequence was similar to only 5.5% of the sequences while the similarity between any two DppA sequences (n = 192) was estimated to be 8.3%. Phylogenetic diversity of these SBCs was further compared with those predicted in the representative genomes (n = 27) from other genera of family Hyphomicrobiaceae by constructing a neighbour joining tree (Supplementary Fig. S3).
A relatively high diversity of the substrate binding proteins in Devosia unveiled the high nutritional demands and efficiency of the genus towards uptake of a wide range of structurally and chemically diverse amino acid side chains from environment. Apart from nutritional significance, the permeases are also gates to acquire natural and non-natural cargo molecules attached with amino acid side chains of peptides thereby acting as environmental sensors37,38. These signals drive the bacterial chemotaxis and form the basis of bacterial tolerance and bioremediation of environmental pollutants by bacteria39. Thus, the genus might as well have adopted this strategy for chemosensing and mediating signals to help them regulate their cellular processes for tolerating environmental stress.
Genome scale positive selection
For determining the genes under positive selection pressure, the orthologous gene clusters identified in all the genomes were filtered for eliminating clusters with low quality sequences. A total of 2000 valid clusters thus obtained were tested for presence of recombination and filtered based on FDR < 10% and dN/dS values were calculated. The dN/dS values compare the rate of substitutions at non-synonymous sites (dN) with the rate of substitutions at synonymous sites (dS) in protein orthologs. Values greater than 1 indicates positive selection while values less than one indicate that the protein is under purifying selection. The genes which were present in at least 25 genomes (1263 gene clusters) were considered to denote the positively selected genes of the genus. 24 genes were found to be under positive selection pressure with dN/dS values (ω) greater than 1 (Table 3).
The genes related to growth, osmotic stress response, inorganic polyphosphate utilization and amino acid and divalent cation starvation were under strong positive selection pressure. Apart from these, the gene responsible for cofactor molybdopterin synthesis was found to be under strong positive selection pressure. Molybdopterin acts as a cofactor for many enzymes responsible for detoxification such as sulphite oxidase, xanthine oxidase, aldehyde oxidase and formate dehydrogenase40. These molybdopterin dependent enzymes which were present in the genomes enable the optimal growth of strains by utilization of nitrate, inorganic sulfur and purines and pyrimidines as carbon and nitrogen sources. The genes involved in assembly of iron-sulfur (Fe-S) clusters were under positive selection pressure. Fe-S clusters are cofactors of proteins that perform a number of biological roles including electron transfer, redox and non-redox catalysis, and sensing for iron41. Besides, the universal stress protein (UspA) that gets activated in response to various stressors such as high temperature and salinity, antibiotics, nutrient starvation42 and LuxR family transcriptional regulator that plays a key role in quorum sensing, motility, and antibiotic synthesis43 were also positively selected. These positively selected genes signify the evolving environmental tolerance mechanisms among Devosia species.
Determination of positively evolving genes at HCH contaminated sites and differential osmotic stress response
As the three strains IPL18, IPL20 and L15 isolated from HCH contaminated sites tolerate high levels of the chlorinated pollutant (450 mg/g of soil)44, we looked specifically at their genomic repertoire to uncover what enables them withstand high HCH stress. Through delineation of their orthologous proteins, we identified that their tolerance may be attributed to the abundance of two-component systems such as chemosensory phoB/phoR, cheA/cheW, cheB/cheR, cheD, cheY and methyl accepting chemotaxis protein I, might as well have been adopted to tolerate HCH stress as has been reported previously in a Pseudomonas genotypes27,45.
In order to determine the proteins encoded within their genomes that are under positive selection pressure to tolerate HCH stress, the orthologous proteins in independent pairs of three strains were subjected to positive selection detection. The majority of the proteins of all pairs were identified to be evolving under purifying selection with dN/dS values < 1 suggesting a conserved repertoire of genes is required for their survival (Fig. 6A). In IPL18 and IPL20, tRNA pseudouridine synthase subunit B was found to be under positive selection pressure (dN/dS = 1.7). Formation of pseudouridine is one of the important post-transcriptional modifications of the tRNAs. Most often these residues are confined to the functionally important part of tRNAs such that the genetic mutants lacking pseudouridine residues exhibit slow growth rates due to difficulties in translation and are not able to compete with wild type cells46. Therefore, the enzyme might confer selective advantage during competition at such a challenging niche47. In IPL18 and L15, putrescine transporter PotH was positively evolving (dN/dS = 1.25), which transports putrescine and is again involved in growth, as well as incorporation into the cell wall and biosynthesis of siderophore48. In IPL20 and L15, nucleoside diphosphate kinase showed dN/dS = 2.3. The enzyme facilitates bacterial cell growth and proliferation and mediates signal transduction49. Along with these, many hypothetical proteins were found to be under positive selection pressure (Fig. 6A). The hypothetical protein with the highest dN/dS of 3.58 belonged to GPCR family2-like protein with a query coverage of 76% using SmartBLAST (http://blast.ncbi.nlm.nih.gov/blast/smartblast/). In concordance with the previous results, all the positively selected proteins were related to growth or signalling mechanisms indicating the need to improve genetic fitness to cope high microbial competition at this nutrient depleted site.
As the soils near the dumpsites are also reported to have high salinity levels44, we compared the profiles of osmotic stress response of these strains to determine any active gene transfers at this dumpsite and to gain insights on the plasticity of the genus Devosia. One of the strategies to cope osmotic stress is the uptake and synthesis of osmolytes such as glycine betaine, ectoine and hydroxyectoine50. Glycine betaine is synthesized from choline by betICBA operon where BetI is a sensory repressor and BetC converts choline-O-sulfate into choline. Choline uptake is mediated by BetT or ProU which is converted to glycine betaine by dehydrogenases BetA and BetB51.The tendency to synthesize the glycine betaine was restricted to I507 and CGMCC1.10210. However, the isolates from HCH dumpsite encoded complete clusters for synthesis of other two osmolytes ectoine and hydroxyectoine (Fig. 6B). Ectoine is synthesized from phosphorylation of aspartate to β-aspartyl phosphate by aspartokinase (Ask) which is then converted to a semialdehyde derivative. The derivative is successively converted to ectoine by ectABC gene cluster regulated by ectR52. Hydroxyectoine is produced from ectoine by a hydroxylase (EctD)53. The complete pathway for their synthesis was also determined in DSM17137, H5989 and I507 but was altogether absent in all other strains (Fig. 6B). The isolates from HCH and strain I507 appear to have acquired the potential for synthesis of ectoine and hydroxyectoine to overcome the osmotic stress posed by the high salinity in their respective niches.
Degradation of organic compounds
Utilization of phosphonates and sulphonates
The sulphonates and phosphonates are added to environment through pesticides and are major source of sulfur and phosphorus in the soils54,55. Bacterial degradation of organic P and S play large role in global P and S cycling. As the Devosia are optimized for efficient utilization of nutrients, it evoked our interest in genus wide profiles for degradation of organic P and S.
Bacterial degradation of complex C-P bond in alkylphosphonates is catalyzed by C-P lyase encoded by a 14 gene cluster phnCDEFGHIJKLMNOP in which phnGHIJKLM code for the “core” components of the enzyme, PhnJ catalyzes the central reaction while phnNOP gene products play accessory roles56,57. phnCDE encode an ABC transporter and phnF a repressor protein. rcsF encodes a phosphoesterase analogous to phnP58. The degradation of aliphatic sulfonates is mediated by ssuEADCB gene cluster where SsuABC proteins constitute an ABC transport system while SsuD catalyzes the desulfonation of substrates and SsuE is an FMN reductase59. Our analysis revealed that the degradation of alkylphosphonates was widespread across Devosia while differential profiles for the degradation of alkylphosphonates were observed among the strains (Fig. 7A). Strains ATCC23634, IPL18 and L15 completely lacked potential to degrade alkylphosphonates (Fig. 7A). We argue that the strains IPL18 and L15 might have lost the catabolic ability in the process to tolerate the dominant pollutant i.e., HCH in their habitats. These functions are presumed to have been of environmental origin based on the clustering of genomes independent of their phylogeny. Overall, the analysis highlighted plasticity of Devosia genomes with potential for continued influx of novel functions and their evolution in response to environment.
Degradation of aromatic and xenobiotic compounds
The degradation of aromatic compounds by bacteria has immense environmental significance as they are the most prevalent class of natural carbon compounds which are also persistent pollutants60. So far, genera such as Pseudomonas, Acinetobacter, Geobacter, Dechloromonas and Novosphingobium have been extensively studied for their abilities of aromatic compounds degradation61,62,63,64,65,66. In this study, we examined the enzyme arsenal for remediation of aromatic compounds encoded wide the genus Devosia. The genomes were rich in the genes involved in both the branches of ß-ketoadipate utilization, one that converts catechol derived from various aromatic hydrocarbons, amino aromatics, and lignin monomers to beta-ketoadipate and another that converts protocatechuate, derived from phenolic compounds also to beta-ketoadipate for reduction through tricarboxylic acid cycle67. Among the peripheral catabolic pathways, the degradation of chloroaromatic compounds was most abundant among the strains (Fig. 7B). Again, the strains did not cluster in concordance with their phylogenetic distances. To note, strain DSM 17137 which showed maximum divergence with respect to overall functional profiles displayed maximum potential for homogentisate degradation pathway which were lacked by all other strains further confirming its functional divergence. In line with the previous observations, strain ATCC23634, the freshwater isolate was again the next most diverged among all analyzed genomes which displayed maximum potential for degradation of heterocyclic aromatic compounds (Fig. 7B). Overall, the profiles led us to consider that Devosia have acquired the potential of bioremediation during the course of evolution to adapt optimally to the environmental insults imposed on them. The conclusion was supported by the fact that the strains did not cluster based on their phylogeny but rather based on their abilities to degrade wide array of aromatic and xenobiotic compounds such as benzoate, p-hydroxybenzoate, biphenyl, catechol and chlorinated aromatic compounds.
Metabolic versatility for decomposition of urea
Urea occurs as a source of organic nitrogen and its decomposition by bacteria is of immense significance for bacterial growth and nutrient cycling. Urea may be decomposed by either of the two different enzymatic pathways catalyzed by urease and urea amidolyase as illustrated in Fig. 8A 68. The second pathway catalyzed by urea amidolyase comprises activities of urea carboxylase and allophanate hydrolase69. This alternative pathway was only detected in few genomes (data not shown) and therefore, was not further inspected. Urease pathway was found to be the core pathway for urea decomposition as all the essential genes ureA, ureB, ureC encoding a functional urease and several accessory protein encoding genes ureDEFG, ureI or ureJ70 were present in all genomes (Fig. 8B). However, the genes for uptake of urea, urtABCDE were absent in DDB001, 17-2-E-8 and E84 that might have lost them or that might also harbor unique genes that still need to be characterized. Notably, the ureC gene coding for the α-subunit of urease was found to be evolving in strain DS-56 under strong positive selection pressure (dN/dS = 3.19). The ureC is the largest of the genes encoding urease functional subunits and is essential for a functional urease70,71. The strain DS-56 was isolated from the island soil near sea where urea acts as the dominant N source and thus the organism might be dependent upon its decomposition for building amino acids and hence proteins. We further tried to reconstruct the phylogeny in order to check the conservedness of the genes belonging to this pathway. The maximum-likelihood phylogeny was similar to phylogeny based upon conserved genes and marker proteins. This suggests that urea decomposition by urease is a conserved function of the genus. The conserved organization of the genes within operons also provided evidence of phylogenetic origin of this pathway.
Determination of toxin-antitoxin (TA) systems
Bacterial toxin-antitoxin (TA) systems are key regulators of cellular processes that can respond to external stimuli and promote survival during periods of stress72. A TA locus is composed of two genes coding for a toxin and its cognate antitoxin73. Under favourable conditions, antitoxins typically inhibit their cognate toxins. While they are readily proteolysed upon stress encounters thereby unleashing the inhibitory effect of the toxin72. Widespread TA loci could be dissected within Devosia that all belonged to type II class in which both the toxin and anti-toxin are proteins73. Among the major TA systems within the genus were higB/higA and vapC/vapB but others such as parE/parD, yoeB/yefM, yafQ/dinJ and relB/relE were also present (Table 4). These small genetic modules are thought to epigenetically regulate bacterial survival controlling a wide range of biological functions including growth, persistence, programmed cell death, phage inhibition, biofilm formation and response to stress72,74. Besides, these loci are also known to stabilize the mobile genetic elements (MGEs) and enhance the genomic plasticity72. Therefore, the study could present a scenario that the environmental stress could have favored the accumulation of TA systems that confer selective advantage and competitiveness to the genus.
In the present study, the genomes of 27 strains of the genus Devosia were analyzed which allowed the description of the open pangenome of the genus with half of the pangenome (50.32%) represented by the unique genes suggesting the role of their respective environments in shaping the genomic repertoire of the members. This was also indicated from the dissimilar phylogenetic pattern obtained based on conserved core genes and those obtained from the reconstruction of overall metabolic profiles. The phylogenetic relationships of the strains could be clearly resolved by the study. The clustering of the strains based on specific bioremediation linked functions and niche specific adaptations for example, the synthesis of osmolytes, utilization of sulphonates and phosphonates and degradation of aromatic and xenobiotic compounds revealed their plastic genomic repertoire subject to locally relevant environmental stressors. The uptake and utilization of nutrients for growth and survival was found to be the dominant function of the genus along with detoxification and degradation of organic pollutants. On this account, the genes associated with growth, motility, detoxification and nutrient starvation were found to be positively evolving. In concordance, the abundance of ABC class transporters for uptake of di- and oligo-peptides and potential of urea decomposition further revealed that the members have well adapted themselves for survival at hydrocarbons and organic compounds rich habitats by optimizing their genetic repertoire for optimal nutrient uptake and metabolism.
Materials and Methods
Genomic DNA extraction and sequencing
D. crocina IPL20 and D. lucknowensis L15 were isolated from soils contaminated with hexachlorocyclohexane (HCH) from dumpsites located at Chinhat and Ummari villages in Lucknow, India8,19. The strains were grown on Luria-Bertani (LB) agar incubated at 28 °C and genomic DNA was isolated by lysis with lysozyme and proteinase K followed by CTAB extraction using method described elsewhere75. Sequencing was performed on an Illumina HiSeq. 2500-1TB platform with Illumina regular fragment library of insert size 300 bp. A paired end library of read length 151 bp was generated for each genome. The sequencing and assembly was performed under the project ‘Genomic Encyclopedia of Type Strains, Phase III’ by the Joint Genome Institute (JGI) [Project ID: 1102317 (D. crocina IPL20) and 1102429 (D. lucknowensis L15)]. Whole genome sequences are available on NCBI under the accession numbers NZ_FPCK00000000.1 and NZ_FXWK00000000.1 respectively.
Selection and annotation of genomes
The whole genome sequences of all publicly available draft and complete genomes were retrieved from NCBI and JGI databases in March 2018 (n = 33). For all genomes, open reading frames (ORFs) were predicted using Prodigal76 and percentage completeness were estimated using 107 essential genes77 based on hidden Markov models (HMMs). Using the completeness criterion, we selected 27 strains (>96% complete) for comparative analyses (Table 1). Further, the putative protein-encoding genes were also predicted using GLIMMER-378 on RAST server v2.079. The rRNAs and tRNAs were predicted using RNAmmer v1.280 and ARAGORN81, respectively. The clustered regularly interspaced short palindromic repeat (CRISPR) elements were identified using CRISPR Finder82. Phage and prophage regions were determined using PHASTER83.
The maximum likelihood phylogeny based on 400 ubiquitous and conserved marker proteins, was constructed using PhyloPhlan30 with 1000 bootstrap replications. iTOL v3 was used to visualize the tree84. In addition, phylogenetic analysis was also performed on the core genes identified in single copy within each genome. For this, amino acid alignments for each gene cluster were generated using KAlign v2.04 that employs Wu-Manber string-matching algorithm, to improve the accuracy of multiple sequence alignment85. The concatenated alignments were used to construct a maximum likelihood tree based on LG + F + R6 identified as the best fit model in IQ tree v1.686. The model generates a general amino acid replacement matrix87 using empirical amino acid frequencies and FreeRate model for calculating heterogeneity across sites. For genome-wide reconstruction of phylogeny, blast based pairwise Average Nucleotide Identity (ANIb) values computed using JSpecies web server88 were used to construct a Pearson correlation matrix and plotted in R (R Development Core Team, 2015).
Pan-gene clusters and identification of homologues
The pan-gene clusters were identified using microbial pangenomics workflow in anvi’o33 and the genomes were organized based on the distribution of gene clusters using MCL algorithm into core, dispensable and strain-specific contents (Distance: Euclidean; Linkage: Ward). The genes were annotated by BLASTp against the NCBI COG database. Heatmap based on the annotated COG functions of the core and singleton gene clusters were then plotted in R (R Development Core Team, 2015). The Tettelin best-fit curves32 of core and pangenomes were constructed using OMCL v1.4 implemented in GET_HOMOLOGUES pipeline89.
Comparative functional analyses
Functional annotation of genes was done on RAST v2.069 using the SEED subsystems approach. The ORFs were annotated by KAAS (KEGG Automatic Annotation Server)90 using Bi-directional Best Hit (BBH) algorithm. The top 50 metabolic pathways reconstructed within each genome using MinPath91 were plotted as heatmap using pheatmap package92 in R (R Development Core Team, 2015).
Sequence similarity network analysis
The di- and oligo-peptide permeases were identified within the genomes using Protein BLAST on NCBI database. The sequences were analysed by constructing similarity networks in which the relationships were read as independent pairwise alignments. The approach offers serious advantages over the phylogenetic trees in inferring relationships between large sequence data sets at defined cut-offs with ease. The sequences were filtered for the removal of 100% identical sequences using CD-HIT93. A pairwise BLAST of all non-redundant proteins was performed and sequence similarity networks (SSN) were constructed with a threshold alignment score of 50%. The threshold cutoff values of 1e-30 and 1e-25 were used for construction of opp and dpp sequence networks respectively upon analysing the trends of varying alignment length at different e-values. The networks were visualized in Cytoscape v3.6.1. The average numbers of neighbors or degree for a node or sequence was calculated as:
where K denotes the total number of edges and N denotes the total nodes. To estimate the diversity/similarity among sequences, the density of networks i.e. the fraction of all edges in the similarity networks was also calculated as:
Genome scale and pairwise positive selection detection
The orthologous gene clusters were determined using OrthoMCL v1.4. Orthologous groups with single copy genes were then filtered for determining orthologs under positive selection using POTION v1.1.394. Groups with evidence of recombination were removed from analysis using PhiPack95 that integrates three recombination tests: Phi, NSS and Max Chi2. For each group, multiple protein sequence alignments were generated using MUSCLE v 3.8.31 and trimmed using TrimAl v1.296. DNAML from phylip was used for phylogenetic tree reconstruction with 100 bootstraps. Later, groups were tested for positive selection using site-model analysis in codeml and a likelihood ratio test was conducted. The p-values were calculated as 2Δℓ (twice the difference in likelihood of the two nested models evaluated) based on the χ2 distribution with 2 ° of freedom followed by multiple hypothesis correction. Errors were minimised through False Discovery Rate (FDR) adjusted q-values (significance threshold cutoff of 10%).
To determine the evolutionary pressures at the HCH dumpsites, dN/dS values were calculated independently for the three HCH tolerating strains in a pairwise manner. The orthologous proteins were aligned using KAlign v2.04 and further converted to corresponding codon alignments using PAL2NAL script97. yn00 module in the PAML package was used to calculate dN/dS value for each orthologous pair.
Nakagawa, Y., Sakane, T. & Yokota, A. Transfer of “Pseudomonas riboflavina” (Foster 1944), a gram-negative, motile rod with long-chain 3-hydroxy fatty acids, to Devosia riboflavina gen. nov., sp. nov., nom. rev. Int. J. Syst. Bacteriol. 46, 16–22, https://doi.org/10.1099/00207713-46-1-16 (1996).
Foster, J. W. Microbiological aspects of riboflavin. J. Bacteriol. 47, 27–41 (1944).
Nicholson, A. C. et al. Complete genome sequence of strain H5989 of a novel Devosia species. Genome Announc. 3, e00934–15, https://doi.org/10.1128/genomeA.00934-15 (2015).
Rivas, R. et al. Description of Devosia neptuniae sp. nov. that nodulates and fixes nitrogen in symbiosis with Neptunia natans, an aquatic legume from India. Syst. Appl. Microbiol. 26, 47–53, https://doi.org/10.1078/072320203322337308 (2003).
Bautista, V. V., Monsalud, R. G. & Yokota, A. Devosia yakushimensis sp. nov., isolated from root nodules of Pueraria lobata (Willd.) Ohwi. Int. J. Syst. Evol. Microbiol. 60, 627–32, https://doi.org/10.1099/ijs.0.011254-0 (2010).
Lee, S. D. Devosia subaequoris sp. nov., isolated from beach sediment. Int. J. Syst. Evol. Microbiol. 57, 2212–5, https://doi.org/10.1099/ijs.0.65185-0 (2007).
Kumar, M., Verma, M. & Lal, R. Devosia chinhatensis sp. nov., isolated from a hexachlorocyclohexane (HCH) dump site in India. Int. J. Syst. Evol. Microbiol. 58, 861–5, https://doi.org/10.1099/ijs.0.65574-0 (2008).
Verma, M., Kumar, M., Dadhwal, M., Kaur, J. & Lal, R. Devosia albogilva sp. nov. and Devosia crocina sp. nov., isolated from a hexachlorocyclohexane dump site. Int. J. Syst. Evol. Microbiol. 59, 795–9, https://doi.org/10.1099/ijs.0.005447-0 (2009).
Onyango, M. et al. First genome sequence of potential mycotoxin-degrading bacterium Devosia nanyangense DDB001. Genome Announc. 2, e00922–14, https://doi.org/10.1128/genomeA.00922-14 (2014).
Yin, X. et al. Complete genome sequence of deoxynivalenol-degrading bacterium Devosia sp. strain A16. J. Biotechnol. 218, 21–22, https://doi.org/10.1016/j.jbiotec.2015.11.016 (2016).
Ryu, S. H. et al. Devosia geojensis sp. nov., isolated from diesel-contaminated soil in Korea. Int. J. Syst. Evol. Microbiol. 58, 633–636, https://doi.org/10.1099/ijs.0.65481-0 (2008).
Lal, R. et al. Biochemistry of microbial degradation of hexachlorocyclohexane and prospects for bioremediation. Microbiol. Mol. Biol. Rev. 74, 58–80, https://doi.org/10.1128/MMBR.00029-09 (2010).
Kumar, R. et al. Parapedobacter indicus sp. nov., isolated from hexachlorocyclohexane-contaminated soil. Int. J. Syst. Evol. Microbiol. 65, 129–34, https://doi.org/10.1099/ijs.0.069104-0 (2015).
Mahato, N. K., Tripathi, C., Nayyar, N., Singh, A. K. & Lal, R. Pontibacter ummariensis sp. nov., isolated from a hexachlorocyclohexane contaminated soil. Int. J. Syst. Evol. Microbiol. 66, 1080–1087, https://doi.org/10.1099/ijsem.0.000840 (2016).
Rani, P., Mukherjee, U., Verma, H., Kamra, K. & Lal, R. Luteimonas tolerans sp. nov., isolated from hexachlorocyclohexane-contaminated soil. Int. J. Syst. Evol. Microbiol. 66, 1851–6 (2016).
Dwivedi, V., Niharika, N. & Lal, R. Pontibacter lucknowensis sp. nov., isolated from a hexachlorocyclohexane dump site. Int. J. Syst. Evol. Microbiol. 63, 309–13, https://doi.org/10.1099/ijsem.0.000956 (2013).
Kaur, J. et al. Sphingobium baderi sp. nov., isolated from a hexachlorocyclohexane dump site. Int. J. Syst. Evol. Microbiol. 63, 673–8, https://doi.org/10.1099/ijs.0.039834-0 (2013).
Dadhwal, M., Jit, S., Kumari, H. & Lal, R. Sphingobium chinhatense sp. nov., a hexachlorocyclohexane (HCH)-degrading bacterium isolated from an HCH dumpsite. Int. J. Syst. Evol. Microbiol. 59, 3140–3144, https://doi.org/10.1099/ijs.0.005553-0 (2009).
Dua, A., Malhotra, J., Saxena, A., Khan, F. & Lal, R. Devosia lucknowensis sp. nov., a bacterium isolated from hexachlorocyclohexane (HCH) contaminated pond soil. J. Microbiol. 51, 689–94, https://doi.org/10.1007/s12275-013-2705-9 (2013).
He, J. W. et al. Bacterial epimerization as a route for deoxynivalenol detoxification: the influence of growth and environmental conditions. Front. Microbiol. 7, 572, https://doi.org/10.3389/fmicb.2016.00572 (2016).
Lamarque, M. et al. A multifunction ABC transporter (Opt) contributes to diversity of peptide uptake specificity within the genus. Lactococcus. J. Bacteriol. 186, 6492–500, https://doi.org/10.1128/JB.186.19.6492-6500.2004 (2004).
Yu, D. et al. Diversity and evolution of oligopeptide permease systems in staphylococcal species. Genomics 104, 8–13, https://doi.org/10.1016/j.ygeno.2014.04.003 (2014).
Hiron, A., Borezée-Durant, E., Piard, J. C. & Juillard, V. Only one of four oligopeptide transport systems mediates nitrogen nutrition in. Staphylococcus aureus. J. Bacteriol. 189, 5119–5129, https://doi.org/10.1128/JB.00274-07 (2007).
Medrano, M. S. et al. Regulators of expression of the oligopeptide permease A proteins of Borrelia burgdorferi. J. Bacteriol. 189, 2653–9, https://doi.org/10.1128/JB.01760-06 (2007).
Wang, X. G., Lin, B., Kidder, J. M., Telford, S. & Hu, L. T. Effects of environmental changes on expression of the oligopeptide permease (opp) genes of Borrelia burgdorferi. J. Bacteriol. 184, 6198–6206, https://doi.org/10.1128/jb.184.22.6198-6206.2002 (2002).
Gominet, M., Slamti, L., Gilois, N., Rose, M. & Lereclus, D. Oligopeptide permease is required for expression of the Bacillus thuringiensis plcR regulon and for virulence. Mol. Microbiol. 40, 963–75, https://doi.org/10.1046/j.1365-2958.2001.02440.x (2001).
Sharma, A. et al. Pan-genome dynamics of Pseudomonas gene complements enriched across hexachlorocyclohexane dumpsite. BMC Genomics 16, 313, https://doi.org/10.1186/s12864-015-1488-2 (2015).
Michael, A. J. Polyamine function in archaea and bacteria. J. Biol. Chem. 293, 18693–701, https://doi.org/10.1074/jbc.TM118.005670 (2018).
Bogino, P. C., Oliva, M., Sorroche, F. G. & Giordano, W. The role of bacterial biofilms and surface components in plant-bacterial associations. Int. J. Mol. Sci. 14, 15838–59, https://doi.org/10.3390/ijms140815838 (2013).
Segata, N., Börnigen, D., Morgan, X. C. & Huttenhower, C. PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes. Nat. Commun. 4, 2304, https://doi.org/10.1038/ncomms3304 (2013).
Mahato, N. K. et al. Microbial taxonomy in the era of OMICS: application of DNA sequences, computational tools and techniques. Antonie Van Leeuwenhoek 110, 1357–1371, https://doi.org/10.1007/s10482-017-0928-1 (2017).
Tettelin, H. et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial “pan-genome. Proc. Natl. Acad. Sci. USA 102, 13950–55, https://doi.org/10.1073/pnas.0506758102 (2005).
Eren, A. M. et al. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ. 3, e1319, https://doi.org/10.7717/peerj.1319 (2015).
Gomes, E. S., Schuch, V. & de Macedo Lemos, E. G. Biotechnology of polyketides: new breath of life for the novel antibiotic genetic pathways discovery through metagenomics. Braz. J. Microbiol. 44, 1007–34, https://doi.org/10.1590/s1517-83822013000400002 (2014).
Higgins, C. F. ABC transporters: from microorganisms to man. Annu. Rev. Cell Biol. 8, 67–113, https://doi.org/10.1146/annurev.cb.08.110192.000435 (1992).
Green, R. M., Seth, A. & Connell, N. D. A peptide permease mutant of Mycobacterium bovis BCG resistant to the toxic peptides glutathione and S-nitrosoglutathione. Infect. Immun. 68, 429–436, https://doi.org/10.1128/iai.68.2.429-436.2000 (2000).
Kuenzl, T. et al. Mutant variants of the substrate-binding protein DppA from Escherichia coli enhance growth on nonstandard γ-glutamyl amide-containing peptides. Appl. Environ. Microbiol. 84, e00340–18, https://doi.org/10.1128/AEM.00340-18 (2018).
Lamarque, M. et al. The peptide transport system Opt is involved in both nutrition and environmental sensing during growth of Lactococcus lactis in milk. Microbiology 157, 1612–9, https://doi.org/10.1099/mic.0.048173-0 (2011).
Pandey, G. & Jain, R. K. Bacterial chemotaxis toward environmental pollutants: role in bioremediation. Appl. Environ. Microbiol. 68, 5789–95, https://doi.org/10.1128/aem.68.12.5789-5795.2002 (2002).
Schwarz, G. & Mendel, R. R. Molybdenum cofactor biosynthesis and molybdenum enzymes. Annu. Rev. Plant Biol. 57, 623–47, https://doi.org/10.1146/annurev.arplant.57.032905.105437 (2006).
Beinert, H., Holm, R. H. & Münck, E. Iron-sulfur clusters: nature’s modular, multipurpose structures. Science 277, 653–9, https://doi.org/10.1126/science.277.5326.653 (1997).
Siegele, D. A. Universal stress proteins in Escherichia coli. J. Bacteriol. 187, 6253–54, https://doi.org/10.1128/JB.187.18.6253-6254.2005 (2005).
Chen, J. & Xie, J. Role and regulation of bacterial LuxR-like regulators. J. Cell Biochem. 112, 2694–702, https://doi.org/10.1002/jcb.23219 (2011).
Sangwan, N. et al. Comparative metagenomic analysis of soil microbial communities across three hexachlorocyclohexane contamination levels. PLoS ONE 7, e46219, https://doi.org/10.1371/journal.pone.0046219 (2012).
Zschiedrich, C. P., Keidel, V. & Szurmant, H. Molecular mechanisms of two-component signal transduction. J. Mol. Biol. 428, 3752–3775, https://doi.org/10.1016/j.jmb.2016.08.003 (2016).
Pan, H., Agarwalla, S., Moustakas, D. T., Finer-Moore, J. & Stroud, R. M. Structure of tRNA pseudouridine synthase TruB and its RNA complex: RNA recognition through a combination of rigid docking and induced fit. Proc. Natl. Acad. Sci. USA 100, 12648–53, https://doi.org/10.1073/pnas.2135585100 (2003).
Gutgsell, N. et al. Deletion of the Escherichia coli pseudouridine synthase gene truB blocks formation of pseudouridine 55 in tRNA in vivo, does not affect exponential growth, but confers a strong selective disadvantage in competition with wild-type cells. RNA 6, 1870–81, https://doi.org/10.1017/s1355838200001588 (2000).
Wortham, B. W., Patel, C. N. & Oliveira, M. A. Polyamines in bacteria: pleiotropic effects yet specific mechanisms. Adv. Exp. Med. Biol. 603, 106–15, https://doi.org/10.1007/978-0-387-72124-8_9 (2007).
Chakrabarty, A. M. Nucleoside diphosphate kinase: role in bacterial growth, virulence, cell signalling and polysaccharide synthesis. Mol. Microbiol. 28, 875–82, https://doi.org/10.1046/j.1365-2958.1998.00846.x (1998).
Galinski, E. A., Pfeiffer, H. P. & Trüper, H. G. 1,4,5,6,-Tetrahydro-2-methyl-4-pyrimidinecarboxylic acid, a novel cyclic acid from halophilic phototrophic bacteria of genus. Ectothiorodospira. Eur. J. Biochem. 149, 135–139, https://doi.org/10.1111/j.1432-1033.1985.tb08903.x (1985).
Osterås, M., Boncompagni, E., Vincent, N., Poggi, M. C. & Le Rudulier, D. Presence of a gene encoding choline sulfatase in Sinorhizobium meliloti bet operon: choline-O-sulfate is metabolized into glycine betaine. Proc. Natl. Acad. Sci. USA 95, 11394–9, https://doi.org/10.1073/pnas.95.19.11394 (1998).
Peters, P., Galinski, E. A. & Trüper, H. G. The biosynthesis of ectoine. FEMS Microbiol. Lett. 71, 157–162, https://doi.org/10.1016/0378-1097(90)90049-V (1990).
Ingbar, L. & Labidot, A. The structure and biosynthesis of new tetrahydropyrimidine derivatives in actinomycin D producer Streptomyces parvulus. Use of 13C- and 15N-labeled L-glutamate and 13C and 15N NMR spectroscopy. J. Biol. Chem. 263, 16014–22 (1988).
Autry, A. R. & Fitzgerald, J. W. Sulfonate S: A major form of forest soil organic sulfur. Biol. Fertil. Soils 10, 50–56 (1990).
McGrath, J. W., Chin, J. P. & Quinn, J. P. Organophosphonates revealed: new insights into the microbial metabolism of ancient molecules. Nat. Rev. Microbiol. 11, 412–9, https://doi.org/10.1038/nrmicro3011 (2013).
Metcalf, W. W. & Wanner, B. L. Evidence for a fourteen-gene, phnC to phnP locus for phosphonate metabolism in Escherichia coli. Gene 129, 27–32, https://doi.org/10.1016/0378-1119(93)90692-v (1993).
Hove-Jensen, B., Rosenkrantz, T. J., Zechel, D. L. & Willemoës, M. Accumulation of intermediates of the carbon-phosphorus lyase pathway for phosphonate degradation in phn mutants of Escherichia coli. J. Bacteriol. 192, 370–4, https://doi.org/10.1128/JB.01131-09 (2010).
Martínez, A. & Ventouras, L. A, Wilson, S. T., Karl, D. M. & DeLong, E. F. Metatranscriptomic and functional metagenomic analysis of methylphosphonate utilization by marine bacteria. Front. Microbiol. 4, 340, https://doi.org/10.3389/fmicb.2013.00340 (2013).
van Der Ploeg, J. R., Iwanicka-Nowicka, R., Bykowski, T., Hryniewicz, M. M. & Leisinger, T. The Escherichia coli ssuEADCB gene cluster is required for the utilization of sulfur from aliphatic sulfonates and is regulated by the transcriptional activator Cbl. J. Biol. Chem. 274, 29358–65, https://doi.org/10.1074/jbc.274.41.29358 (1999).
Seo, J. S., Keum, Y. S. & Li, Q. X. Bacterial degradation of aromatic compounds. Int. J. Environ. Res. Public Health 6, 278–309, https://doi.org/10.3390/ijerph6010278 (2009).
Li, D. et al. Genome-wide investigation and functional characterization of the β-ketoadipate pathway in the nitrogen-fixing and root-associated bacterium Pseudomonas stutzeri A1501. BMC Microbiology 10, 36, https://doi.org/10.1186/1471-2180-10-36 (2010).
Barbe, V. et al. Unique features revealed by the genome sequence of Acinetobacter sp. ADP1, a versatile and naturally transformation competent bacterium. Nucleic Acids Res. 32, 5766–79, https://doi.org/10.1093/nar/gkh910 (2004).
Butler, J. E. et al. Genomic and microarray analysis of aromatics degradation in Geobacter metallireducens and comparison to a Geobacter isolate from a contaminated field site. BMC Genomics 8, 180, https://doi.org/10.1186/1471-2164-8-180 (2007).
Salinero, K. K. et al. Metabolic analysis of the soil microbe Dechloromonas aromatica str. RCB: indications of a surprisingly complex life-style and cryptic anaerobic pathways for aromatic degradation. BMC Genomics 10, 351–10, https://doi.org/10.1186/1471-2164-10-351 (2009).
Wang, J. et al. Comparative genomics of degradative Novosphingobium strains with special reference to microcystin-degrading Novosphingobium sp. THN1. Front. Microbiol. 9, 2238, https://doi.org/10.3389/fmicb.2018.02238 (2018).
Kumar, R. et al. Comparative genomic analysis reveals habitat-specific genes and regulatory hubs within the genus Novosphingobium. mSystems 2, e00020–17, https://doi.org/10.1128/mSystems.00020-17 (2017).
Harwood, C. S. & Parales, R. E. The beta-ketoadipate pathway and the biology of self-identity. Annu. Rev. Microbiol. 50, 553–90, https://doi.org/10.1146/annurev.micro.50.1.553 (1996).
Hausinger, R. P. Metabolic versatility of prokaryotes for urea decomposition. J. Bacteriol. 186, 2520–2, https://doi.org/10.1128/jb.186.9.2520-2522.2004 (2004).
Kanamori, T., Kanou, N., Atomi, H. & Imanaka, T. Enzymatic characterization of a prokaryotic urea carboxylase. J. Bacteriol. 186, 2532–9, https://doi.org/10.1128/jb.186.9.2532-2539.2004 (2004).
Mobley, H. L. T. & Hausinger, R. P. Microbial urease: significance, regulation, and molecular characterization. Microbiol. Rev. 53, 85e108 (1989).
Nolden, L. et al. Urease of Corynebacterium glutamicum: organization of corresponding genes and investigation of activity. FEMS Microbiol. Lett. 189, 305–310, https://doi.org/10.1111/j.1574-6968.2000.tb09248.x (2000).
Schuster, C. F. & Bertram, R. Toxin-antitoxin systems are ubiquitous and versatile modulators of prokaryotic cell fate. FEMS Microbiol. Lett. 340, 73–85, https://doi.org/10.1111/1574-6968.12074 (2013).
Unterholzner, S. J., Poppenberger, B. & Rozhon, W. Toxin–antitoxin systems: Biology, identification, and application. Mob. Genet. Elements 3, e26219, https://doi.org/10.4161/mge.26219 (2013).
Wen, Y., Behiels, E. & Devreese, B. Toxin–Antitoxin systems: their role in persistence, biofilm formation, and pathogenicity. Pathog. Dis. 70, 240–249, https://doi.org/10.1111/2049-632X.12145 (2014).
Wilson, K. Preparation of genomic DNA from bacteria. Curr. Protoc. Mol. Biol. 2(2), 4, https://doi.org/10.1002/0471142727.mb0204s56 (2001).
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119, https://doi.org/10.1186/1471-2105-11-119 (2010).
Dupont, C. L. et al. Genomic insights to SAR86, an abundant and uncultivated marine bacterial lineage. ISME J. 6, 1186–1199, https://doi.org/10.1038/ismej.2011.189 (2012).
Delcher, A. L., Bratke, K. A., Powers, E. C. & Salzberg, S. L. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23, 673–9, https://doi.org/10.1093/bioinformatics/btm009 (2007).
Aziz, R. K. et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics 9, 75, https://doi.org/10.1186/1471-2164-9-75 (2008).
Lagesen, K. et al. RNammer: consistent and rapid annotation of ribosomal rRNA genes. Nucleic Acids Res. 35, 3100–8, https://doi.org/10.1093/nar/gkm160 (2007).
Laslett, D. & Canback, B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 32, 11–6, https://doi.org/10.1093/nar/gkh152 (2004).
Grissa, I., Vergnaud, G. & Pourcel, C. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 35, W52–W57, https://doi.org/10.1093/nar/gkm360 (2007).
Arndt, D. et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 44, W16–21, https://doi.org/10.1093/nar/gkw387 (2016).
Letunic, I. & Bork, P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 44, W242–W245, https://doi.org/10.1093/nar/gkw290 (2016).
Lassmann, T. & Sonnhammer, E. L. L. Kalign–an accurate and fast multiple sequence alignment algorithm. BMC Bioinformatics 6, 298, https://doi.org/10.1186/1471-2105-6-298 (2005).
Nguyen, L. T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274, https://doi.org/10.1093/molbev/msu300 (2015).
Le, S. Q. & Gascuel, O. An improved general amino acid replacement matrix. Mol. Biol. Evol. 25, 1307–1320, https://doi.org/10.1093/molbev/msn067 (2008).
Richter, M., Rosselló-Móra, R., Oliver Glöckner, F. & Peplies, J. JSpeciesWS: a web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 32, 929–31, https://doi.org/10.1093/bioinformatics/btv681 (2016).
Contreras-Moreira, B. & Vinuesa, P. GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis. Appl. Environ. Microbiol. 79, 7696–7701, https://doi.org/10.1128/AEM.02411-13 (2013).
Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A. C. & Kanehisa, M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 35, 182–185, https://doi.org/10.1093/nar/gkm321 (2007).
Ye, Y. & Doak, T. G. A parsimony approach to biological pathway reconstruction/inference for genomes and metagenomes. PLoS Comput. Biol. 5, e1000465, https://doi.org/10.1371/journal.pcbi.1000465 (2009).
Kolde, R. & Kolde, M.R. Package ‘pheatmap’. https://cran.r project.org/web/packages/pheatmap/pheatmap.pdf (2015).
Huang, Y., Niu, B., Gao, Y., Fu, L. & Li, W. CD-HIT Suite: a web server for clustering and comparing biological sequences. Bioinformatics 26, 680–682, https://doi.org/10.1093/bioinformatics/btq003 (2010).
Hongo, J. A., de Castro, G. M., Cintra, L. C., Zerlotini, A. & Lobo, F. P. POTION: an end-to-end pipeline for positive Darwinian selection detection in genome-scale data through phylogenetic comparison of protein-coding genes. BMC Genomics 16, 567, https://doi.org/10.1186/s12864-015-1765-0 (2015).
Bruen, T. C., Philippe, H. & Bryant, D. A simple and robust statistical test for detecting the presence of recombination. Genetics 172, 2665–81, https://doi.org/10.1534/genetics.105.048975 (2006).
Capella-Gutierrez, S., Silla-Martinez, J. M. & Gabaldon, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–3, https://doi.org/10.1093/bioinformatics/btp348 (2009).
Suyama, M., Torrents, M. & Bork, P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 34, W609–W612, https://doi.org/10.1093/nar/gkl315 (2006).
Hassan, Y. I., Lepp, D. & Zhou, T. Genome assemblies of three soil-associated Devosia species: D. insulae, D. limi and D. soli. Genome Announc. 3, e00514–15, https://doi.org/10.1128/genomeA.00514-15 (2015).
Hassan, Y. I., Lepp, D. & Zhou, T. Draft genome sequences of Devosia sp. strain 17-2-E-8 and Devosia riboflavina strain IFO13584. Genome Announc. 2, e00994–14, https://doi.org/10.1128/genomeA.00994-14 (2014).
Hassan, Y. I., Lepp, D., Li, X. Z. & Zhou, T. Insights into the hydrocarbon tolerance of two Devosia isolates, D. chinhatensis strain IPL18T and D. geojensis strain BD-c194T, via whole-genome sequence analysis. Genome Announc. 3, e00890–15, https://doi.org/10.1128/genomeA.00890-15 (2015).
Gan, H. Y. et al. Whole-genome sequences of five oligotrophic bacteria isolated from deep within Lechuguilla cave, New Mexico. Genome Announc. 2, e01133–14, https://doi.org/10.1128/genomeA.01133-14 (2014).
Bai, Y. et al. Functional overlap of the Arabidopsis leaf and root microbiota. Nature 528, 364–369, https://doi.org/10.1038/nature16192 (2015).
The sequence data were produced by the US Department of Energy Joint Genome Institute https://www.jgi.doe.gov/ in collaboration with the user community. This work was supported by funds from the Department of Biotechnology (DBT), National Bureau of Agriculturally Important Microorganisms (NBAIM) and DU-DST-PURSE grant, Government of India. C.T. and S.N. thank Council of Scientific and Industrial Research (CSIR) for providing doctoral fellowships.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Talwar, C., Nagar, S., Kumar, R. et al. Defining the Environmental Adaptations of Genus Devosia: Insights into its Expansive Short Peptide Transport System and Positively Selected Genes. Sci Rep 10, 1151 (2020). https://doi.org/10.1038/s41598-020-58163-8
This article is cited by
Single-cell Microbiomics Unveils Distribution and Patterns of Microbial Symbioses in the Natural Environment
Microbial Ecology (2023)
Environmental Microbiome (2022)
Scientific Reports (2022)
Devosia rhizoryzae sp. nov., and Devosia oryziradicis sp. nov., novel plant growth promoting members of the genus Devosia, isolated from the rhizosphere of rice plants
Journal of Microbiology (2022)
Microbial community dynamics in the mesophilic and thermophilic phases of textile waste composting identified through next-generation sequencing
Scientific Reports (2021)