HydDB: A web tool for hydrogenase classification and analysis

Søndergaard, Dan; Pedersen, Christian N. S.; Greening, Chris

doi:10.1038/srep34212

Download PDF

Article
Open access
Published: 27 September 2016

HydDB: A web tool for hydrogenase classification and analysis

Dan Søndergaard¹,
Christian N. S. Pedersen¹ &
Chris Greening^2,3

Scientific Reports volume 6, Article number: 34212 (2016) Cite this article

9912 Accesses
292 Citations
22 Altmetric
Metrics details

Subjects

Abstract

H₂ metabolism is proposed to be the most ancient and diverse mechanism of energy-conservation. The metalloenzymes mediating this metabolism, hydrogenases, are encoded by over 60 microbial phyla and are present in all major ecosystems. We developed a classification system and web tool, HydDB, for the structural and functional analysis of these enzymes. We show that hydrogenase function can be predicted by primary sequence alone using an expanded classification scheme (comprising 29 [NiFe], 8 [FeFe] and 1 [Fe] hydrogenase classes) that defines 11 new classes with distinct biological functions. Using this scheme, we built a web tool that rapidly and reliably classifies hydrogenase primary sequences using a combination of k-nearest neighbors’ algorithms and CDD referencing. Demonstrating its capacity, the tool reliably predicted hydrogenase content and function in 12 newly-sequenced bacteria, archaea and eukaryotes. HydDB provides the capacity to browse the amino acid sequences of 3248 annotated hydrogenase catalytic subunits and also contains a detailed repository of physiological, biochemical and structural information about the 38 hydrogenase classes defined here. The database and classifier are freely and publicly available at http://services.birc.au.dk/hyddb/

Unraveling the functional dark matter through global metagenomics

Article Open access 11 October 2023

A standardized archaeal taxonomy for the Genome Taxonomy Database

Article 21 June 2021

BuscoPhylo: a webserver for Busco-based phylogenomic analysis for non-specialists

Article Open access 17 October 2022

Introduction

Microorganisms conserve energy by metabolizing H₂. Oxidation of this high-energy fuel yields electrons that can be used for respiration and carbon-fixation. This diffusible gas is also produced in diverse fermentation and anaerobic respiratory processes¹. H₂ metabolism contributes to the growth and survival of microorganisms across the three domains of life, including chemotrophs and phototrophs, lithotrophs and heterotrophs, aerobes and anaerobes, mesophiles and extremophiles alike^1,2. On the ecosystem scale, H₂ supports microbial communities in most terrestrial, aquatic and host-associated ecosystems^1,3. It is also proposed that H₂ was the primordial electron donor^4,5. In biological systems, metalloenzymes known as hydrogenases are responsible for oxidizing and evolving H₂^1,6. Our recent survey showed there is a far greater number and diversity of hydrogenases than previously thought². It is predicted that over 55 microbial phyla and over a third of all microorganisms harbor hydrogenases^2,7. Better understanding H₂ metabolism and the enzymes that mediate it also has wider implications, particularly in relation to human health and disease^3,8, biogeochemical cycling⁹ and renewable energy^10,11.

There are three types of hydrogenase, the [NiFe], [FeFe] and [Fe] hydrogenases, that are distinguished by their metal composition. Whereas the [Fe]-hydrogenases are a small methanogenic-specific family¹², the [NiFe] and [FeFe] classes are widely distributed and functionally diverse. They can be classified through a hierarchical system into different groups and subgroups/subtypes with distinct biochemical features (e.g. directionality, affinity, redox partners and localization) and physiological roles (i.e. respiration, fermentation, bifurcation, sensing)^1,6. It is necessary to define the subgroup or subtype of the hydrogenase to predict hydrogenase function. For example, while Group 2a and 2b [NiFe]-hydrogenases share >35% sequence identity, they have distinct roles as respiratory uptake hydrogenases and H₂ sensors respectively^13,14. Likewise, discrimination between Group A1 and Group A3 [FeFe]-hydrogenases is necessary to distinguish fermentative and bifurcating enzymes^2,15. Building on previous work^16,17, we recently created a comprehensive hydrogenase classification scheme predictive of biological function². This scheme was primarily based on the topology of phylogenetic trees built from the amino acid sequences of hydrogenase catalytic subunits/domains. It also factored in genetic organization, metal-binding motifs and functional information. This analysis identified 22 subgroups (within four groups) of [NiFe]-hydrogenases and six subtypes (within three groups) of [FeFe]-hydrogenases, each proposed to have unique physiological roles and contexts².

In this work, we build on these findings to develop the first web database for the classification and analysis of hydrogenases. We developed an expanded classification scheme that captures the full sequence diversity of hydrogenase enzymes and predicts their biological function. Using this information, we developed a classification tool based on the k-nearest neighbors’ (k-NN) method. HydDB is a user-friendly, high-throughput and functionally-predictive tool for hydrogenase classification that operates with precision exceeding 99.8%.

Results and Discussion

A sequence-based classification scheme for hydrogenases

We initially developed a classification scheme to enable prediction of hydrogenase function by primary sequence alone. To do this, we visualized the relationships between all hydrogenases in sequence similarity networks (SSN)¹⁸, in which nodes represent individual proteins and the distances between them reflect BLAST E-values. As reflected by our analysis of other protein superfamilies^19,20, SSNs allow robust inference of sequence-structure-function relationships for large datasets without the problems associated with phylogenetic trees (e.g. long-branch attraction). Consistent with previous phylogenetic analyses^2,16,17, this analysis showed the hydrogenase sequences clustered into eight major groups (Groups 1 to 4 [NiFe]-hydrogenases, Groups A to C [FeFe]-hydrogenases, [Fe]-hydrogenases), six of which separate into multiple functionally-distinct subgroups or subtypes at narrower logE filters (Fig. 1; Figure S1). The SSNs demonstrated that all [NiFe]-hydrogenase subgroups defined through phylogenetic trees in our previous work² separated into distinct clusters, which is consistent with our evolutionary model that such hydrogenases diverged from a common ancestor to adopt multiple distinct functions². The only exception were the Group A [FeFe]-hydrogenases, which, as previously-reported^2,17, cannot be classified by sequence alone as they have principally diversified through changes in domain architecture and quaternary structure. It remains necessary to analyze the organization of the genes encoding these enzymes to determine their specific function, e.g. whether they serve fermentative or electron-bifurcating roles.

The SSN analysis revealed that several branches that clustered together on the phylogenetic tree analysis² in fact separate into several well-resolved subclades (Fig. 1). We determined whether this was significant by analyzing the taxonomic distribution, genetic organization, metal-binding sites and reported biochemical or functional characteristics of the differentiated subclades. On this basis, we concluded that 11 of the new subclades identified are likely to have unique physiological roles. We therefore refine and expand the hydrogenase classification to reflect the hydrogenases are more diverse in both primary sequence and predicted function than accounted for by even the latest classification scheme². The new scheme comprises 38 hydrogenase classes, namely 29 [NiFe]-hydrogenase subclasses, 8 [FeFe]-hydrogenase subtypes and the monophyletic [Fe]-hydrogenases (Table 1).

Table 1 Expanded classification scheme for hydrogenase enzymes.

Full size table

Three lineages originally classified as Group 1a [NiFe]-hydrogenases were reclassified as new subgroups, namely those affiliated with Coriobacteria (Group 1i), Archaeoglobi (Group 1j) and Methanosarcinales (Group 1i). Cellular and molecular studies show these enzymes all support anaerobic respiration of H₂, but differ in the membrane carriers (methanophenazine, menaquinone) and terminal electron acceptors (heterodisulfide, sulfate, nitrate) that they couple to^21,22. The previously-proposed 4b and 4d subgroups² were dissolved, as the SSN analysis confirmed they were polyphyletic. These sequences are reclassified here into five new subgroups: the formate- and carbon monoxide-respiring Mrp-linked complexes (Group 4b)²³, the ferredoxin-coupled Mrp-linked complexes (Group 4d)²⁴, the well-described methanogenic Eha (Group 4h) and Ehb (Group 4i) supercomplexes²⁵ and a more loosely clustered class of unknown function (Group 4g). Enzymes within these subgroups, with the exception of the uncharacterized 4g enzymes, sustain well-described specialist functions in the energetics of various archaea^23,24,25. Three crenarchaeotal hydrogenases were also classified as their own family (Group 2e); these enzymes enable certain crenarchaeotes to grow aerobically on O₂^26,27 and hence may represent a unique lineage of aerobic uptake hydrogenases currently underrepresented in genome databases. The Group C [FeFe]-hydrogenases were also separated into three main subtypes given they separate into distinct clusters even at relatively broad logE values (Fig. 1); these subtypes are each transcribed with different regulatory elements and are likely to have distinct regulatory roles^2,17,28 (Table 1).

HydDB reliably predicts hydrogenase class using the k-NN method and CDD referencing

Using this information, we built a web tool to classify hydrogenases. Hydrogenase classification is determined through a three-step process following input of the catalytic subunit sequence. Two checks are initially performed to confirm if the inputted sequence is likely to encode a hydrogenase catalytic subunit/domain. The Conserved Domain Database (CDD)²⁹ is referenced to confirm that the inputted sequence has a hydrogenase catalytic domain, i.e. “Complex1_49kDa superfamily” (cl21493) (for NiFe-hydrogenases), “Fe_hyd_lg_C superfamily” (cl14953) (for FeFe-hydrogenases) and “HMD” (pfam03201) (for Fe-hydrogenases). A homology check is also performed that computes the BLAST E-value between the inputted sequence and its closest homolog in HydDB. HydDB classifies any inputted sequence that lacks hydrogenase conserved domains or has low homology scores (E-value > 10⁻⁵) as a non-hydrogenase (Table S1).

In the final step, the sequence is classified through the k-NN method that determines the most similar sequences listed in the HydDB reference database. To determine the optimal k for the dataset, we performed a 5-fold cross-validation for k = 1…10 and computed the precision for each k. The results are shown in Fig. 2. The classifier predicted the classes of the 3248 hydrogenase sequences with 99.8% precision and high robustness when performing a 5-fold cross-validation (as described in the Methods section) for k = 4. The six sequences where there were discrepancies between the SSN and k-NN predictions are shown in Table S2. The classifier has also been trained to detect and exclude protein families that are homologous to hydrogenases but do not metabolize H₂ (Nuo, Ehr, NARF, HmdII^1,2) using reference sequences of these proteins (Table S1).

Sequences of the [FeFe] Group A can be classified into functionally-distinct subtypes (A1, A2, A3, A4) based on genetic organization². The classifier can classify such hydrogenases if the protein sequence immediately downstream from the catalytic subunit sequence is provided. The classifier references the CDD to search for conserved domains in the downstream protein sequence. A sequence is classified as [FeFe] Group A2 if one of the domains “GltA”, “GltD”, “glutamate synthase small subunit” or “putative oxidoreductase”, but not “NuoF”, is found in the sequence. Sequences are classified as [FeFe] Group A3 if the domain “NuoF” is found and [FeFe] Group A4 if the domain “HycB” is present. If none of the domains are found, the sequence is classified as A1. These classification rules were determined by collecting 69 downstream protein sequences. The sequences were then submitted to the CDD and the domains which most often occurred in each subtype were extracted.

In addition to its precision, the classifier is superior to other approaches due to its usability. It is accessible as a free web service at http://services.birc.au.dk/hyddb/ HydDB allows the users to paste or upload sequences of hydrogenase catalytic subunit sequences in FASTA format and run the classification (Figure S2). When analysis has completed, results are presented in a table that can be downloaded as a CSV file (Figure S3). This provides an efficient and user-friendly way to classify hydrogenases, in contrast to the previous standard which requires visualization of phylogenetic trees derived from multiple sequence alignments³⁰.

HydDB infers the physiological roles of H₂ metabolism

As summarized in Table 1, hydrogenase class is strongly correlated with physiological role. As a result, the classifier is capable of predicting both the class and function of a sequenced hydrogenase. To demonstrate this capacity, we used HydDB to analyze the hydrogenases present in 12 newly-sequenced bacteria, archaea and eukaryotes of major ecological significance. The classifier correctly classified all 24 hydrogenases identified in the sequenced genomes, as validated with SSNs (Table 2). On the basis of these classifications, the physiological roles of H₂ metabolism were predicted (Table 2). For five of the organisms, these predictions are confirmed or supported by previously published data^{27,31,32,33,34}. Other predictions are in line with metabolic models derived from metagenome surveying^35,36,37. In some cases, the capacity for organisms to metabolize H₂ was not tested or inferred in previous studies despite the presence of hydrogenases in the sequenced genomes^32,38,39,40.

Table 2 Predictive capacity of the HydDB.

Full size table

While HydDB serves as a reliable initial predictor of hydrogenase class and function, further analysis is recommended to verify predictions. Hydrogenase sequences only provide organisms with the genetic capacity to metabolise H₂; their function is ultimately modulated by their expression and integration within the cell^1,41. In addition, some classifications are likely to be overgeneralized due to lack of functional and biochemical characterization of certain lineages and sublineages. For example, it is not clear if two distant members of the Group 1h [NiFe]-hydrogenases (Robiginitalea biformata, Sulfolobus islandicus) perform the same H₂-scavenging functions as the core group⁹. Likewise, it seems probable that the Group 3a [NiFe]-hydrogenases of Thermococci and Aquificae use a distinct electron donor to the main class⁴². Prominent cautions are included in the enzyme pages in cases such as these. HydDB will be updated when literature is published that influences functional assignments.

HydDB contains interfaces for hydrogenase browsing and analyzing

In addition to its classification function, HydDB is designed to be a definitive repository for hydrogenase retrieval and analysis. The database presently contains entries for 3248 hydrogenases, including their NCBI accession numbers, amino acid sequences, hydrogenase classes, taxonomic affiliations and predicted behavior (Figure S4). To enable easy exploration of the data set, the database also provides access to an interface for searching, filtering and sorting the data, as well as the capacity to download the results in CSV or FASTA format. There are individual pages for the 38 hydrogenase classes defined here (Table 1), including descriptions of their physiological role, genetic organization, taxonomic distribution and biochemical features. This is supplemented with a compendium of structural information about the hydrogenases, which is integrated with the Protein Databank (PDB), as well as a library of over 500 literature references (Figure S5).

Conclusions

To summarize, HydDB is a definitive resource for hydrogenase classification and analysis. The classifier described here provides a reliable, efficient and convenient tool for hydrogenase classification and functional prediction. HydDB also provides browsing tools for the rapid analysis and retrieval of hydrogenase sequences. Finally, the manually-curated repository of class descriptions, hydrogenase structures and literature references provides a deep but accessible resource for understanding hydrogenases.

Methods

Sequence datasets

The database was constructed using the amino acid sequences of all curated non-redundant 3248 hydrogenase catalytic subunits represented in the NCBI RefSeq database in August 2014² (Dataset S1). In order to test the classification tool, additional sequences from newly-sequenced archaeal and bacterial phyla were retrieved from the Joint Genome Institute’s Integrated Microbial Genomes database⁴³.

Sequence similarity networks

Sequence similarity networks (SSNs)¹⁸ constructed using Cytoscape 4.1⁴⁴ were used to visualize the distribution and diversity of the retrieved hydrogenase sequences. In this analysis, each node represents one of the 3248 hydrogenase sequences in the reference database (Dataset S1). Each edge represents the sequence similarity between them as determined by E-values from all-vs-all BLAST analysis, with all self and duplicate edges removed. Three networks were constructed, namely for the [NiFe]-hydrogenase large subunit sequences (Dataset S2), [FeFe]-hydrogenase catalytic domain sequences (Dataset S3) and [Fe]-hydrogenase sequences (Dataset S4). To control the degree of separation between nodes, logE cutoffs that were incrementally decreased from −5 to −200 until no major changes in clustering was observed. The logE cutoffs used for the final classifications are shown in Fig. 1 and Figure S1.

Classification method

The -NN method is a well-known machine learning method for classification⁴⁵. Given a set of data points x₁, x₂, … x_N (e.g. sequences) with known labels y₁, y₂, …, y_N (e.g. type annotations), the label of a point, , is predicted by computing the distance from to x₁, x₂, … x_N and extracting the labeled points closest to , i.e. the neighbors. The predicted label is then determined by majority vote of the labels of the neighbors. The distance measure applied here is that of a BLAST search. Thus, the classifier corresponds to a homology search where the types of the top results are considered. However, formulating the classification method as a machine learning problem allows the use of common evaluation methods to estimate the precision of the method and perform model selection. The classifier was evaluated using -fold cross-validation. The dataset is first split in to parts of equal size. parts (the training set) are then used for training the classifier and the labels of the data points in the remaining part (the test set) are then predicted. This process, called a fold, is repeated times. The predicted labels of each fold are then compared to the known labels and a precision can be computed.

Additional Information

How to cite this article: Søndergaard, D. et al. HydDB: A web tool for hydrogenase classification and analysis. Sci. Rep. 6, 34212; doi: 10.1038/srep34212 (2016).

References

Schwartz, E., Fritsch, J. & Friedrich, B. H2-metabolizing prokaryotes (Springer Berlin Heidelberg, 2013).
Greening, C. et al. Genome and metagenome surveys of hydrogenase diversity indicate H2 is a widely-utilised energy source for microbial growth and survival. Isme J. 10, 761–777 (2016).
CAS PubMed Google Scholar
Cook, G. M., Greening, C., Hards, K. & Berney, M. In Advances in Bacterial Pathogen Biology (ed. Poole, R. K. ) 65, 1–62 (Academic Press, 2014).
CAS Google Scholar
Lane, N., Allen, J. F. & Martin, W. How did LUCA make a living? Chemiosmosis in the origin of life. BioEssays 32, 271–280 (2010).
CAS PubMed Google Scholar
Weiss, M. C. et al. The physiology and habitat of the last universal common ancestor. Nat. Microbiol. 1, 16116 (2016).
CAS PubMed Google Scholar
Lubitz, W., Ogata, H., Rüdiger, O. & Reijerse, E. Hydrogenases. Chem. Rev. 114, 4081–4148 (2014).
CAS PubMed Google Scholar
Peters, J. W. et al. [FeFe]- and [NiFe]-hydrogenase diversity, mechanism and maturation. Biochim. Biophys. Acta - Mol. Cell Res. 1853, 1350–1369 (2014).
Google Scholar
Carbonero, F., Benefiel, A. C. & Gaskins, H. R. Contributions of the microbial hydrogen economy to colonic homeostasis. Nat Rev Gastroenterol Hepatol 9, 504–518 (2012).
CAS PubMed Google Scholar
Greening, C. et al. Atmospheric hydrogen scavenging: from enzymes to ecosystems. Appl. Environ. Microbiol. 81, 1190–1199 (2015).
PubMed PubMed Central Google Scholar
Levin, D. B., Pitt, L. & Love, M. Biohydrogen production: prospects and limitations to practical application. Int. J. Hydrogen Energy 29, 173–185 (2004).
CAS Google Scholar
Cracknell, J. A., Vincent, K. A. & Armstrong, F. A. Enzymes as working or inspirational catalysts for fuel cells and electrolysis. Chem. Rev. 108, 2439–2461 (2008).
CAS PubMed Google Scholar
Shima, S. et al. The crystal structure of [Fe]-Hydrogenase reveals the geometry of the active site. Science 321, 572–575 (2008).
ADS CAS PubMed Google Scholar
Lenz, O. & Friedrich, B. A novel multicomponent regulatory system mediates H2 sensing in Alcaligenes eutrophus. Proc. Natl. Acad. Sci. USA 95, 12474–12479 (1998).
ADS CAS PubMed PubMed Central Google Scholar
Greening, C., Berney, M., Hards, K., Cook, G. M. & Conrad, R. A soil actinobacterium scavenges atmospheric H2 using two membrane-associated, oxygen-dependent [NiFe] hydrogenases. Proc. Natl. Acad. Sci. USA 111, 4257–4261 (2014).
ADS CAS PubMed PubMed Central Google Scholar
Schuchmann, K. & Müller, V. A bacterial electron-bifurcating hydrogenase. J. Biol. Chem. 287, 31165–31171 (2012).
CAS PubMed PubMed Central Google Scholar
Vignais, P. M., Billoud, B. & Meyer, J. Classification and phylogeny of hydrogenases. Fems Microbiol. Rev. 25, 455–501 (2001).
CAS PubMed Google Scholar
Calusinska, M., Happe, T., Joris, B. & Wilmotte, A. The surprising diversity of clostridial hydrogenases: a comparative genomic perspective. Microbiology 156, 1575–1588 (2010).
CAS PubMed Google Scholar
Atkinson, H. J., Morris, J. H., Ferrin, T. E. & Babbitt, P. C. Using sequence similarity networks for visualization of relationships across diverse protein superfamilies. Plos One 4, e4345 (2009).
ADS PubMed PubMed Central Google Scholar
Ahmed, F. H. et al. Sequence-structure-function classification of a catalytically diverse oxidoreductase superfamily in mycobacteria. J. Mol. Biol. 427, 3554–3571 (2015).
CAS PubMed Google Scholar
Ney, B. et al. The methanogenic redox cofactor F420 is widely synthesized by aerobic soil bacteria. Isme J., 10.1038/ismej.2016.100 (2016).
Stetter, K. O. Archaeoglobus fulgidus gen. nov., sp. nov.: a new taxon of extremely thermophilic archaebacteria. Syst. Appl. Microbiol. 10, 172–173 (1988).
Google Scholar
Deppenmeier, U. & Blaut, M. Analysis of the vhoGAC and vhtGAC operons from Methanosarcina mazei strain Gö1, both encoding a membrane-bound hydrogenase and a cytochrome b. Eur. J. Biochem. 269, 261–269 (1995).
Google Scholar
Kim, Y. J. et al. Formate-driven growth coupled with H2 production. Nature 467, 352–355 (2010).
ADS CAS PubMed Google Scholar
McTernan, P. M. et al. Intact functional fourteen-subunit respiratory membrane-bound [NiFe]-hydrogenase complex of the hyperthermophilic archaeon Pyrococcus furiosus. J. Biol. Chem. 289, 19364–19372 (2014).
CAS PubMed PubMed Central Google Scholar
Lie, T. J. et al. Essential anaplerotic role for the energy-converting hydrogenase Eha in hydrogenotrophic methanogenesis. Proc. Natl. Acad. Sci. USA 109, 15473–15478 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Auernik, K. S. & Kelly, R. M. Physiological versatility of the extremely thermoacidophilic archaeon Metallosphaera sedula supported by transcriptomic analysis of heterotrophic, autotrophic and mixotrophic growth. Appl. Environ. Microbiol. 76, 931–935 (2010).
CAS PubMed Google Scholar
Giaveno, M. A., Urbieta, M. S., Ulloa, J. R., González Toril, E. & Donati, E. R. Physiologic versatility and growth flexibility as the Main characteristics of a novel thermoacidophilic Acidianus strain isolated from Copahue geothermal area in Argentina. Microb. Ecol. 65, 336–346 (2012).
PubMed Google Scholar
Poudel, S. et al. Unification of [FeFe]-hydrogenases into three structural and functional groups. Biochim. Biophys. Acta (BBA)-General Subj., 10.1016/j.bbagen.2016.05.034 (2016).
Marchler-Bauer, A. & Bryant, S. H. CD-Search: protein domain annotations on the fly. Nucleic Acids Res. 32, W327–W331 (2004).
CAS PubMed PubMed Central Google Scholar
Berney, M., Greening, C., Hards, K., Collins, D. & Cook, G. M. Three different [NiFe] hydrogenases confer metabolic flexibility in the obligate aerobe Mycobacterium smegmatis. Environ. Microbiol. 16, 318–330 (2014).
CAS PubMed Google Scholar
Greening, C. et al. Persistence of the dominant soil phylum Acidobacteria by trace gas scavenging. Proc. Natl. Acad. Sci. 112, 10497–10502 (2015).
ADS CAS PubMed PubMed Central Google Scholar
Chen, Z. et al. Phaeodactylibacter xiamenensis gen. nov., sp. nov., a member of the family Saprospiraceae isolated from the marine alga Phaeodactylum tricornutum. Int. J. Syst. Evol. Microbiol. 64, 3496–3502 (2014).
PubMed Google Scholar
Koch, H. et al. Growth of nitrite-oxidizing bacteria by aerobic hydrogen oxidation. Science 345, 1052–1054 (2014).
ADS CAS PubMed Google Scholar
Carere, C. R. et al. Growth and persistence of methanotrophic bacteria by aerobic hydrogen respiration. Proc. Natl. Acad. Sci. USA (2016).
Haroon, M. F. et al. Anaerobic oxidation of methane coupled to nitrate reduction in a novel archaeal lineage. Nature 500, 567–570 (2013).
ADS CAS PubMed Google Scholar
Evans, P. N. et al. Methane metabolism in the archaeal phylum Bathyarchaeota revealed by genome-centric metagenomics. Science 350, 434–438 (2015).
ADS CAS PubMed Google Scholar
Brown, C. T. et al. Unusual biology across a group comprising more than 15% of domain Bacteria. Nature 523, 208–211 (2015).
ADS CAS PubMed Google Scholar
Spang, A. et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521, 173–179 (2015).
ADS CAS PubMed PubMed Central Google Scholar
Eloe-Fadrosh, E. A. et al. Global metagenomic survey reveals a new bacterial candidate phylum in geothermal springs. Nat Commun 7 (2016).
Wilson, M. C. et al. An environmental bacterial taxon with a large and distinct metabolic repertoire. Nature 506, 58–62 (2014).
ADS CAS PubMed Google Scholar
Greening, C. & Cook, G. M. Integration of hydrogenase expression and hydrogen sensing in bacterial cell physiology. Curr. Opin. Microbiol. 18, 30–38 (2014).
CAS PubMed Google Scholar
Greening, C. et al. Physiology, biochemistry and applications of F420- and Fo-dependent redox reactions. Microbiol. Mol. Biol. Rev. 80, 451–493 (2016).
CAS PubMed PubMed Central Google Scholar
Markowitz, V. M. et al. IMG: the integrated microbial genomes database and comparative analysis system. Nucleic Acids Research 40, D115–D122 (2012).
CAS PubMed Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
CAS PubMed PubMed Central Google Scholar
Cover, T. & Hart, P. Nearest neighbor pattern classification. Ieee Trans. Inf. Theory 13 (1967).
Constant, P., Chowdhury, S. P., Pratscher, J. & Conrad, R. Streptomycetes contributing to atmospheric molecular hydrogen soil uptake are widespread and encode a putative high-affinity [NiFe]-hydrogenase. Environ. Microbiol. 12, 821–829 (2010).
CAS PubMed Google Scholar
Hamann, E. et al. Environmental Breviatea harbour mutualistic Arcobacter epibionts. Nature 534, 254–258 (2016).
ADS CAS PubMed PubMed Central Google Scholar
Sousa, F. L., Neukirchen, S., Allen, J. F., Lane, N. & Martin, W. F. Lokiarchaeon is hydrogen dependent. Nat. Microbiol. 1, 16034 (2016).
CAS PubMed Google Scholar

Download references

Acknowledgements

We thank A/Prof Colin J. Jackson, Dr. Hafna Ahmed, Dr. Andrew Warden, Dr. Stephen Pearce and the two anonymous reviewers for their helpful advice and comments regarding this manuscript. This work was supported by a PUMPkin Centre of Excellence PhD Scholarship awarded to DS and a CSIRO Office of the Chief Executive Postdoctoral Fellowship awarded to CG.

Author information

Authors and Affiliations

Aarhus University, Bioinformatics Research Centre, C.F. Møllers Allé 8, Aarhus, DK-8000, Denmark
Dan Søndergaard & Christian N. S. Pedersen
The Commonwealth Scientific and Industrial Research Organisation, Land and Water Flagship, Clunies Ross Street, Acton, 2601, ACT, Australia
Chris Greening
Monash University, School of Biological Sciences, Clayton, 3800, VIC, Australia
Chris Greening

Authors

Dan Søndergaard
View author publications
You can also search for this author in PubMed Google Scholar
Christian N. S. Pedersen
View author publications
You can also search for this author in PubMed Google Scholar
Chris Greening
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.G. and D.S. designed experiments. D.S. and C.G. performed experiments. C.G., D.S. and C.N.S.P. analyzed data. C.N.S.P. supervised students. C.G. and D.S. wrote the paper.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Dataset S1

Supplementary Dataset S2

Supplementary Dataset S3

Supplementary Dataset S4

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Søndergaard, D., Pedersen, C. & Greening, C. HydDB: A web tool for hydrogenase classification and analysis. Sci Rep 6, 34212 (2016). https://doi.org/10.1038/srep34212

Download citation

Received: 24 June 2016
Accepted: 09 September 2016
Published: 27 September 2016
DOI: https://doi.org/10.1038/srep34212

This article is cited by

Functional similarity, despite taxonomical divergence in the millipede gut microbiota, points to a common trophic strategy
- Julius Eyiuche Nweze
- Vladimír Šustr
- Roey Angel
Microbiome (2024)
A genus in the bacterial phylum Aquificota appears to be endemic to Aotearoa-New Zealand
- Jean F. Power
- Carlo R. Carere
- Matthew B. Stott
Nature Communications (2024)
Methanogenic partner influences cell aggregation and signalling of Syntrophobacterium fumaroxidans
- Anna Doloman
- Maaike S Besteman
- Diana Z Sousa
Applied Microbiology and Biotechnology (2024)
Sulfur-cycling chemolithoautotrophic microbial community dominates a cold, anoxic, hypersaline Arctic spring
- Elisse Magnuson
- Ianina Altshuler
- Lyle G. Whyte
Microbiome (2023)
Long-read assembled metagenomic approaches improve our understanding on metabolic potentials of microbial community in mangrove sediments
- Zhi-Feng Zhang
- Li-Rui Liu
- Meng Li
Microbiome (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.