Asgard archaea capable of anaerobic hydrocarbon cycling

Seitz, Kiley W.; Dombrowski, Nina; Eme, Laura; Spang, Anja; Lombard, Jonathan; Sieber, Jessica R.; Teske, Andreas P.; Ettema, Thijs J. G.; Baker, Brett J.

doi:10.1038/s41467-019-09364-x


Article
Open access
Published: 23 April 2019


Nature Communications volume 10, Article number: 1822 (2019)



Abstract

Large reservoirs of natural gas in the oceanic subsurface sustain complex communities of anaerobic microbes, including archaeal lineages with potential to mediate oxidation of hydrocarbons such as methane and butane. Here we describe a previously unknown archaeal phylum, Helarchaeota, belonging to the Asgard superphylum and with the potential for hydrocarbon oxidation. We reconstruct Helarchaeota genomes from metagenomic data derived from hydrothermal deep-sea sediments in the hydrocarbon-rich Guaymas Basin. The genomes encode methyl-CoM reductase-like enzymes that are similar to those found in butane-oxidizing archaea, as well as several enzymes potentially involved in alkyl-CoA oxidation and the Wood-Ljungdahl pathway. We suggest that members of the Helarchaeota have the potential to activate and subsequently anaerobically oxidize hydrothermally generated short-chain hydrocarbons.


Introduction

Short-chain alkanes, such as methane and butane, are abundant in marine sediments and play an important role in carbon cycling with methane concentrations of ~1 Gt being processed globally through anoxic microbial communities^1,2,3. Until recently, archaeal methane cycling was thought to be limited to Euryarchaeota⁴. However, additional archaeal phyla, including Bathyarchaeota⁵ and Verstraetarchaeota⁶, have been shown to contain proteins with homology to the activating enzyme methyl-coenzyme M reductase (Mcr) and corresponding pathways for methane utilization. Furthermore, lineages within the Euryarchaeota belonging to Candidatus Syntrophoarchaeum spp., have been shown to use methyl-CoM reductase-like enzymes for anaerobic butane oxidation⁷. Similar to methane oxidation in many ANME-1 archaea, butane oxidation in Syntrophoarchaeum is proposed to be enabled through a syntrophic interaction with sulfur-reducing bacteria⁷. Metagenomic reconstructions of genomes recovered from deep-sea sediments from near 2000 m depth in Guaymas Basin (GB) in the Gulf of California have revealed the presence of additional uncharacterized alkyl methyl-CoM reductase-like enzymes in metagenome-assembled genomes within the Methanosarcinales (Gom-Arc1)⁸. GB is characterized by hydrothermal alterations that transform large amounts of organic carbon into methane, polycyclic aromatic hydrocarbons, low-molecular weight alkanes and organic acids allowing for diverse microbial communities to thrive (Supplementary Table 1)^8,9,10,11.

Recently, genomes of a clade of uncultured archaea, referred to as the Asgard superphylum that includes the closest archaeal relatives of eukaryotes, have been recovered from anoxic environments around the world^12,13,14. Diversity surveys in anoxic marine sediments show that Asgard archaea appear to be globally distributed^12,14,15,16. Based on phylogenomic analyses, Asgard archaea have been divided into four distinct phyla: Lokiarchaeota, Thorarchaeota, Odinarchaeota, and Heimdallarchaeota, with the latter possibly representing the closest relatives of eukaryotes¹². Supporting their close relationship to eukaryotes, Asgard archaea possess a wide repertoire of proteins previously thought to be unique to eukaryotes known as eukaryotic signature proteins (ESPs)¹⁷. These ESPs include homologs of eukaryotic proteins, which in eukaryotes are involved in ubiquitin-mediated protein recycling, vesicle formation and trafficking, endosomal sorting complexes required for transport-mediated multivesicular body formation, as well as cytokinetic abscission and cytoskeleton formation¹⁸. Asgard archaea have been suggested to possess heterotrophic lifestyles and are proposed to play a role in carbon degradation in sediments; however, several members of the Asgard archaea also have genes that code for a complete Wood–Ljungdahl pathway and are therefore interesting with regard to carbon cycling in sediments^14,19.

Here, we present metagenome-assembled genomes (MAGs), recovered from GB deep-sea hydrothermal sediments, which represent an undescribed Asgard phylum with the metabolic potential to perform anaerobic hydrocarbon degradation using a methyl-CoM reductase-like homolog.

Results

Identification of Helarchaeota genomes from GB sediments

We recently obtained more than ~280 gigabases of sequencing data from 11 samples taken from various sites and depths at GB hydrothermal vent sediments²⁰. De novo assembly and binning of metagenomic contigs resulted in the reconstruction of over 550 genomes (>50% complete)²⁰. Among these genomes we detected a surprising diversity of archaea, including >20 phyla, which appear to represent up to 50% of the total microbial community in some of these samples²⁰. A preliminary phylogeny of the dataset using 37 concatenated ribosomal proteins revealed two draft genomic bins representing a previously unknown lineage of the Asgard archaea. These draft genomes, referred to as Hel_GB_A and Hel_GB_B, were re-assembled and re-binned, resulting in final bins that were 82% and 87% complete and had a bin size of 3.54 and 3.84 Mbp, respectively (Table 1). An in-depth phylogenetic analysis of 56 concatenated ribosomal proteins confirmed that these final bins form a distant sister group with the Lokiarchaeota (Fig. 1a). Hel_GB_A percent abundance ranged from 3.41 × 10⁻³% to 8.59 × 10⁻⁵%, and relative abundance from 8.43 to 0.212. Hel_GB_B percent abundance ranged from 1.20 × 10⁻³% to 7.99 × 10⁻⁵%, and relative abundance from 3.41 to 0.22 compared to the total raw reads. For both Hel_GB_A and Hel_GB_B the highest abundance was seen at the site from which the bins were recovered. These numbers are comparable to those of other Asgard archaea whose genomes have been isolated from these sites²⁰. Hel_GB_A and Hel_GB_B had a mean GC content of 35.4% and 28%, respectively, and were recovered from two distinct environmental samples, which share similar methane-supersaturated and strongly reducing geochemical conditions (concentrations of methane ranging from 2.3 to 3 mM, dissolved inorganic carbon ranging from 10.2 to 16.6 mM, sulfate near 21 mM and sulfide near 2 mM; Supplementary Table 1) but differed in temperature (28 and 10 °C, respectively, Supplementary Table 1)²¹.

Table 1 Bin statistics for Helarchaeota bins


Phylogenetic analyses of a 16S rRNA gene sequence (1058 bp in length) belonging to Hel_GB_A confirmed that it is related to Lokiarchaeota and Thorarchaeota, but is phylogenetically distinct from either of these lineages (Fig. 1b). A comparison to published Asgard archaeal 16S rRNA gene sequences indicates a phylum-level division between the Hel_GB_A sequence and other Asgard archaea, with a percent identity of 82.67% when compared to Lokiarchaeum GC14_75²² (Supplementary Table 2). A search for ESPs in both bins revealed that they contained a similar suite of homologs compared to those previously identified in Lokiarchaeota, which is consistent with their phylogenomic relationship (Fig. 2). Yet these lineages are relatively distantly related, as evidenced by their difference in GC content and the relatively low pairwise sequence identity of their proteins. An analysis of the average amino acid identity (AAI) showed that Hel_GB_A and Hel_GB_B shared 1477 genes and an AAI of 51.96%. When compared to Lokiarchaeota_CR4, Hel_GB_A shares 634 orthologous genes out of 3595 and Hel_GB_B shares 624 out of 3157. Helarchaeota bins showed the highest AAI similarity to Odinarchaeota LCB_4 (45.9%); however, they shared fewer orthologous genes with it (574 out of 3595 and 555 out of 3157 for Hel_GB_A and Hel_GB_B, respectively). Additionally, the Hel_GB bins differed from Lokiarchaeota in their total gene number; for example, Hel_GB_A possessed 3595 genes and CR_4 possessed 4218. This difference is consistent with the larger estimated genome size of Lokiarchaeum CR_4 compared to Hel_GB_A (~5.2 vs. ~4.6 Mbp) (Supplementary Table 3). These results add support to the phylum-level distinction observed for Hel_GB_A and Hel_GB_B in both the ribosomal protein and 16S rRNA phylogenetic trees. We propose the name Helarchaeota for this lineage, after Hel, the Norse goddess of the underworld and Loki’s daughter.

Metabolic analysis of Helarchaeota

To reconstruct the metabolic potential of these archaea, the Helarchaeota proteomes were compared to several functional protein databases²⁰ (Fig. 3a). Like many archaea in marine sediments²³, Helarchaeota may be able to utilize organic carbon as they possess a variety of extracellular peptidases and carbohydrate degradation enzymes that include β-glucosidase, α-l-arabinofuranosidase and a putative rhamnosidase, among others (Supplementary Table 4 and Supplementary Data 1). Degraded organic substrates can then be metabolized via glycolysis and an incomplete TCA cycle from citrate to malate and a partial gamma-aminobutyric acid shunt (Fig. 3a, Supplementary Data 1). Both Helarchaeota bins are missing fructose-1,6-bisphosphatase and have few genes coding for the pentose phosphate pathway. Genes encoding the bifunctional enzyme 3-hexulose-6-phosphate synthase/6-phospho-3-hexuloisomerase (hps-phi) were identified in Hel_GB_B, suggesting they may be using the ribulose monophosphate pathway for formaldehyde anabolism. Genes coding for acetate-CoA ligase (both AMP- and ADP-forming) and an alcohol dehydrogenase (adhE) were identified in both genomes, suggesting that the organisms may be capable of both fermentation and production of acetyl-CoA using acetate and alcohols (Supplementary Data 1). As in Thorarchaeota and Lokiarchaeota, these genomes possess the large subunit of type IV ribulose bisphosphate carboxylase^19,24. In addition, the Helarchaeota genomes encode the catalytic subunit of the methanogenic type III ribulose bisphosphate carboxylase used for C-fixation²⁴. Helarchaeota are metabolically distinct from Lokiarchaeota, as both Hel_GB draft genomes appear to lack a complete TCA cycle: genes coding for citrate synthase and malate/lactate dehydrogenase are absent. Both genomes also likely produce acetyl-CoA using glyceraldehyde 3-phosphate dehydrogenase, which is absent in Lokiarchaeota¹⁹ (Supplementary Data 1).
Helarchaeota genomes lack genes that code for enzymes involved in dissimilatory nitrogen and sulfur metabolism. Assimilatory genes, including sat, cysN, and cysC, were found in Hel_GB_B; however, these genes were not identified in Hel_GB_A. This absence may be indicative of species-specific characteristics or could be a result of genome incompleteness. Additional genomes of members of the Helarchaeota will help to fully understand the diversity of these pathways across the whole phylum.

Interestingly, both Helarchaeota genomes have mcrABG-containing gene clusters encoding putative methyl-CoM reductase-like enzymes (Fig. 3b, Supplementary Figure 1)^4,5,7. Phylogenetic analyses of both the A subunit of methyl-CoM reductase-like enzymes (Supplementary Figure 2) as well as the concatenated A and B subunits (Fig. 3b) revealed that the Helarchaeota sequences are distinct from those involved in methanogenesis and methane oxidation but cluster with homologs from butane-oxidizing Syntrophoarchaea⁷ and Bathyarchaeota with high statistical support (rapid bootstrap support/single-branch test bootstrap support/posterior probability of 99.8/100/1; Fig. 3b), excluding the distant homolog of Ca. Syntrophoarchaeum caldarius (OFV68676). Analysis of the Helarchaeota mcrA alignment confirmed that amino acids present at their active sites are similar to those identified in Bathyarchaeota and Syntrophoarchaeum methyl-CoM reductase-like enzymes (Supplementary Figure 3). In Syntrophoarchaeum, the methyl-CoM reductase-like enzymes have been suggested to activate butane to butyl-CoM⁷. It is proposed that this process is then followed by the conversion of butyl-CoM to butyryl-CoA; however, the mechanism of this reaction is still unknown. Butyryl-CoA can then be oxidized to acetyl-CoA, which can be further fed into the Wood–Ljungdahl pathway to produce CO₂⁷. While some n-butane is detected in GB sediments (usually below 10 µM), methane is the most abundant hydrocarbon (Supplementary Table 1), followed by ethane and propane (often reaching the 100 µM range); thus, a spectrum of short-chain alkanes could potentially be metabolized by Helarchaeota²⁵.

Proposed hydrocarbon degradation pathway for Helarchaeota

Next, we searched for genes encoding enzymes potentially involved in hydrocarbon utilization pathways, including propane and butane oxidation. Along with the methyl-CoM reductase-like enzyme that could convert alkane to alkyl-CoM, Helarchaeota possess heterodisulfide reductase subunits ABC (hdrABC), which are needed to recycle the CoM and CoB heterodisulfides after this reaction occurs (Figs. 3 and 4)^7,8. The conversion of alkyl-CoM to acyl-CoA is currently not understood in archaea capable of butane oxidation. Specialized alkyl-binding versions of methyltransferases would be required to convert alkyl-CoM to butyl-CoA or other acyl-CoAs, as discussed for Ca. S. butanivorans⁷. Genes coding for methyltransferases were identified in both Helarchaeota genomes, including a likely tetrahydromethanopterin S-methyltransferase subunit H (MtrH) homolog (Fig. 4; Supplementary Data 1). Short-chain acyl-CoA could be oxidized to acetyl-CoA using the beta-oxidation pathway via a short-chain acyl-CoA dehydrogenase, enoyl-CoA hydratase, 3-hydroxyacyl-CoA dehydrogenase, and acetyl-CoA acetyltransferase; candidate enzymes for all of these steps are present in the Helarchaeota genomes and are also found in genomes of other Asgard archaea (Fig. 4)¹⁹. Along with these enzymes, genes coding for the associated electron transfer systems, including an Fe–S oxidoreductase and all subunits of the electron transfer flavoprotein complex, were identified in Helarchaeota (Fig. 4). Acetyl-CoA produced by beta-oxidation might be further oxidized to CO₂ via the Wood–Ljungdahl pathway, using among others the classical 5,10-methylene-tetrahydromethanopterin reductase (Figs. 3a and 4).

Possible energy-transferring mechanisms for Helarchaeota

To make anaerobic alkane oxidation energetically favorable, it must be coupled to the reduction of an internal electron acceptor or transferred to a syntrophic partner that can perform this reaction^7,26,27. We could not identify an internal electron sink or any canonical terminal reductases used by ANME archaea (such as iron, sulfur, or nitrogen reductases), leading to the conclusion that a syntrophic partner organism would be necessary to enable growth on short-chain hydrocarbons. However, we could not identify any obvious syntrophic partner organisms based on co-occurrence analyses of abundance profiles in metagenomic datasets generated in this study²⁰.

An evaluation of traditional energy transferring mechanisms showed that the Helarchaeota bins lack genes coding for NADH:ubiquinone oxidoreductase, F₄₂₀-dependent oxidoreductase, F₄₂₀H₂:quinone oxidoreductase and NADH:quinone oxidoreductase that were identified in Ca. S. butanivorans (Fig. 4)⁷. These protein complexes are important for energy transfer across the cell membrane and are common among syntrophic organisms^2,28,29. Helarchaeota also lack genes coding for pili or cytochromes that are often involved in direct electron transfer to a bacterial partner, as demonstrated for different ANME archaea^26,30. Therefore, Helarchaeota may use a thus far unknown mechanism for energy conservation. Below we analyzed potential energy-transferring mechanisms that might be involved in syntrophic interactions between Helarchaeota and potential partner organisms.

A possible candidate for energy transfer to a partner may be formate dehydrogenase, because substrate exchange in the form of formate has previously been described to occur between methanogens and sulfur-reducing bacteria²⁷. Helarchaeota genomes code for the alpha and beta subunits of a membrane-bound formate dehydrogenase (EC 1.2.1.2) that could facilitate this transfer (Fig. 2, Supplementary Data 1). However, to our knowledge formate transfer has not been shown to mediate methane oxidation. Alternatively, Helarchaeota may possess a previously undiscovered redox-active complex. In both Helarchaeota bins, a gene cluster was found encoding three proteins that were identified as members of the HydB/Nqo4-like superfamily, the Oxidored_q6 superfamily, and an Fe–S disulfide reductase with a FlpD domain (mvhD) (Fig. 5a). An analysis of these three proteins showed that each possessed transmembrane motifs (Fig. 5b, and Supplementary Methods). While the membrane association of the disulfide reductase/FlpD needs to be confirmed, interactions with the other two membrane-associated subunits may allow for the bifurcated electrons to be transferred across the membrane.

Finally, hydrogen production and release were also considered as a possible electron sink for Helarchaeota. We identified several hydrogenase subunits and putative Fe–S disulfide reductase-encoding genes in the Helarchaeota genomes. Subsequent phylogenetic analyses revealed that the majority of these hydrogenases represent small and large subunits of group IIIC hydrogenases (methanogenic F₄₂₀-non-reducing hydrogenase (mvh)) that are usually involved in bifurcating electrons from hydrogen (Supplementary Figure 4, Supplementary Data 1). In contrast, while homologs belonging to the above-mentioned Oxidored_q6 superfamily are often found to be associated with group IV hydrogenases, canonical membrane-bound group IV hydrogenases could not be identified in the genomes of the Helarchaeota. Altogether, this indicates that hydrogen could play a central role in the energy metabolism of Helarchaeota, but the absence of a classical membrane-bound hydrogenase makes it unlikely that hydrogen is the major syntrophic electron carrier.

Discussion

Historically methanogenesis and anaerobic methane oxidation were regarded as the only examples of anaerobic archaeal short-chain alkane metabolism. The enzymes acting in these pathways were considered to be biochemically and phylogenetically unique and limited to lineages within the Euryarchaeota⁴. This study represents the discovery of the previously unknown phylum referred to as Helarchaeota, whose members encode a mcr-like gene cluster. This opens the possibility that some representatives of the Asgard archaea may have the potential for anaerobic short-chain alkane oxidation. Since the presence of these mcr genes is restricted to Helarchaeota among the known Asgard archaea¹⁹, these genes were likely transferred to Helarchaeota and do not constitute an ancestral trait within the Asgard superphylum. Based on current phylogenetic analysis, the Helarchaeota mcr gene cluster may have been horizontally acquired from either Bathyarchaeota or Ca. Syntrophoarchaeum (Fig. 1b, Supplementary Figure 3). Due to this close relationship, we based our analysis of Helarchaeota’s ability to perform anaerobic short-chain hydrocarbon oxidation on the pathway proposed for Ca. Syntrophoarchaeum. Helarchaeota probably utilize a similar short-chain alkane as a substrate in lieu of methane, but given the low-butane concentrations at our site it may not be the only substrate.

Our comparison to Ca. S. butanivorans shows a consistent presence of genes necessary for this metabolism, including a complete Wood–Ljungdahl pathway, acyl oxidation pathway, and internal electron-transferring systems. Some of these electron-transferring systems are essential housekeeping components that may act as electron carriers for oxidation reactions. Interestingly, in the Wood–Ljungdahl pathway identified in Ca. S. butanivorans, the bacterial enzyme 5,10-methylene-tetrahydrofolate reductase (met) is thought to be substituting for the missing 5,10-methylene-tetrahydromethanopterin reductase (mer)⁷. In contrast, Helarchaeota encode the canonical archaeal-type mer. To render anaerobic butane oxidation energetically favorable, it must be coupled to the reduction of an electron acceptor such as nitrate, sulfate or iron^7,26,27. In ANME archaea that lack genes for internal electron acceptors, methane oxidation is enabled through the transfer of electrons to a syntrophic partner organism. In Syntrophoarchaeum, syntrophic butane oxidation is thought to occur through the exchange of electrons via pili and/or cytochromes with sulfate-reducing bacteria⁷. Helarchaeota do not appear to encode any of the systems traditionally associated with syntrophy and no partner was identified in this study. Thus, further research is needed to identify possible bacterial partners.

Furthermore, the hypothesis that Helarchaeota have the ability to utilize short-chain alkanes remains to be confirmed, as the genomes of members of this group do not encode canonical routes for electron transfer to a partner bacterium. However, we identified potential enzymes that may be involved in the transfer of electrons. Some methanogenic archaea use formate for energy transfer to a syntrophic partner; therefore, the reverse reaction has been speculated to be energetically feasible for methane oxidation²⁷. If this is true, the presence of a membrane-bound formate dehydrogenase in the Helarchaeota genomes may support this electron-transferring mechanism; however, to our knowledge this has not been shown for any ANME archaeon so far. Alternatively, the type 3 NiFe-hydrogenases encoded by Helarchaeota may be involved in the transfer of hydrogen to a partner organism. For example, we identified a protein complex distantly related to the mvh–hdr of methanogens for electron transfer (Supplementary Methods). Mvh–hdr structures have been proposed to be potentially used by facultative hydrogenotrophic methanogens for energy transfer, but the directionality of hydrogen exchange could easily be reversed². These methanogens form syntrophic associations with fermenting, H₂-producing bacteria, lack dedicated cytochromes or pili and use the mvh–hdr for electron bifurcation². The detection of a hydrophobic region in the mvh–hdr complex led to the suggestion that this complex could be membrane bound and act as a mechanism for electron transfer across the membrane; however, a transmembrane association has never been successfully shown². While the membrane association of the disulfide reductase/FlpD needs to be confirmed, we were able to detect several other transmembrane motifs in the associated proteins that could potentially allow electron transfer in the form of hydrogen to an external partner.
Thus, while we propose that the most likely explanation for anaerobic short-chain alkane oxidation in Helarchaeota is via a syntrophic interaction with a partner, additional experiments are needed to confirm this working hypothesis.

The discovery of alkane-oxidizing pathways and possible syntrophic interactions in a phylum of Asgard archaea indicates a much wider phylogenetic range for hydrocarbon utilization. Based on phylogenetic analyses it seems most likely that the Helarchaeota mcr operon may have been horizontally transferred from either Bathyarchaeota or Syntrophoarchaea. However, the preservation of a horizontally transferred pathway is indicative of a competitive advantage; it follows that gene transfers among different archaeal phyla reflect alkane oxidation as a desirable metabolic trait. The discovery of the alkyl-CoM reductases and alkane-oxidizing pathways among the Asgard archaea indicates ecological roles for these still cryptic organisms, and opens up a wider perspective on the evolution and expansion of hydrocarbon-oxidizing pathways throughout the archaeal domain.

Methods

Sample collection and processing

Samples analyzed here are part of a study that aimed to characterize the geochemical conditions and microbial community of GB hydrothermal vent sediments (Gulf of California, Mexico)^31,32. The two genomic bins discussed in this paper, Hel_GB_A and Hel_GB_B, were obtained from sediment core samples collected in December 2009 on Alvin dives 4569_2 and 4571_4, respectively²¹. Immediately after the dive, freshly recovered sediment cores were separated into shallow (0–3 cm), intermediate (12–15 cm), and deep (21–24 cm) sections for further molecular and geochemical analysis, and frozen at −80 °C on the ship until shore-based DNA extraction. Hel_GB_A was recovered from the intermediate sediment (~28 °C) and Hel_GB_B was recovered from shallow sediment (~10 °C) from a nearby core (Supplementary Table 1); the sampling context and geochemical gradients of these hydrothermally influenced sediments are published and described in detail^21,31.

DNA was extracted from sediment samples using the MO BIO—PowerMax Soil DNA Isolation kit and sent to the Joint Genome Institute (JGI) for sequencing.

JGI generation of reads and processing of data

Half a lane of Illumina reads (HiSeq-2500 1TB, read length of 2 × 151 bp) was generated at the Joint Genome Institute for each sample, producing a total of 226,647,966 and 241,605,888 reads from dives 4569-2 and 4571-4, respectively. The average percent of reads with a phred-score (Q) ≥ 30 was 86.2% and 90.39% and the average base quality score was 34.35 ± 7.73 and 35.38 ± 6.52 for samples from dive 4569-2 and 4571-4, respectively. The JGI performed read quality checks and generated a first assembly using the following methods: BBDuk adapter trimming removed known Illumina adapters. The reads were further processed using BBDuk quality filtering and trimming to remove reads with a quality score less than 12, containing more than three “Ns”, or with quality scores (before trimming) averaging less than 3 over the read length, or length under 51 bp after trimming. In addition, reads matching Illumina artifacts or phiX were discarded. The remaining reads were mapped to a masked version of the human HG19 with BBMap and all hits over 93% sequence identity to the human genome were discarded. Trimmed, screened, paired-end Illumina reads were assembled using the megahit assembler using a range of kmers. Assemblies were performed with default parameters in megahit with the following options: “--k-list 23,43,63,83,103,123”. High-quality reads were mapped to the final assembly to calculate coverage information using bbmap by excluding all parameters except ambiguous=random as described by JGI.
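The read-level filtering criteria described above can be illustrated with a minimal Python sketch. This is not BBDuk and does not reproduce its algorithm: the function names `trim_low_quality_ends` and `keep_read` are hypothetical, trimming is simplified to removing bases below the quality threshold from both read ends, and counting “Ns” on the trimmed read is an assumption. Only the thresholds (12, three Ns, average quality 3, 51 bp) come from the text.

```python
def trim_low_quality_ends(seq, quals, min_q=12):
    """Remove bases with quality below min_q from both ends of a read.

    Simplified stand-in for quality trimming; assumed behavior, not BBDuk's.
    """
    start, end = 0, len(seq)
    while start < end and quals[start] < min_q:
        start += 1
    while end > start and quals[end - 1] < min_q:
        end -= 1
    return seq[start:end], quals[start:end]


def keep_read(seq, quals, min_q=12, max_n=3, min_avg_q=3, min_len=51):
    """Return True if a read passes the filtering thresholds from the text."""
    if not seq:
        return False
    # Average quality is evaluated before trimming.
    if sum(quals) / len(quals) < min_avg_q:
        return False
    trimmed_seq, _ = trim_low_quality_ends(seq, quals, min_q)
    # Assumption: ambiguous bases are counted on the trimmed read.
    if trimmed_seq.upper().count("N") > max_n:
        return False
    # Length is evaluated after trimming.
    return len(trimmed_seq) >= min_len
```

For example, a 60 bp read whose last 15 bases have quality 2 is trimmed to 45 bp and is therefore discarded under the 51 bp minimum.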

Genome reconstruction

The contigs from the JGI assembled data were binned using ESOM³³, MetaBAT³⁴, and CONCOCT³⁵ and resulting bins were combined using DAS Tool (version 1.0)³⁶. For ESOM, binning was performed on contigs with a minimum length of 2000 bp using the K-batch algorithm for training after running the perl script esomWrapper.pl³³. Emerging self-organizing maps (ESOM) were manually sorted and curated. The bins were extracted using getClassFasta.pl (using -loyal 51). Reference genomes were included to add genetic signatures for the assembled contigs and improve binning. For CONCOCT, Anvi’o (v2.2.2) was used as the metagenomic workflow pipeline³⁷. Coverage information was obtained by mapping all high-quality reads of each sample against the assembly of another sample using the BWA-MEM algorithm in paired-end mode (bwa-0.7.12-r1034; using default settings)³⁸. The resulting sam file was sorted and converted to bam using samtools (version 0.1.19)³⁹. The bam file was prepared for Anvi’o using the script anvi-init-bam and a contigs database generated using anvi-gen-contigs-database. These files were the input for anvi-profile. Generated profiles for the assemblies were combined using anvi-merge and the resulting bins summarized using anvi-summarize (-C CONCOT)³⁷. If not mentioned otherwise, the scripts were used with default settings. Metabat was also used as a binning approach (v1)³⁴. As described for Anvi’o, the input consisted of the scaffold files (≥2000 bp) and the mapping files. First, each of the mapping files was summarized using jgi_summarize_bam_contig_depths and then metabat was run using the following settings: --minProb 75 --minContig 2000 --minContigByCorr 2000. Results from the three different binning tools were combined using DAS Tool (version 1.0)³⁶.
For each of the binning tools a scaffold-to-bin list was prepared and DAS Tool run on each of the eleven scaffold files as follows: DAS_Tool.sh -i Anvio_contig_list.tsv,Metabat_contig_list.tsv,ESOM_contig_list.tsv -l Anvio,Metabat,ESOM -c scaffolds.fasta --write_bins 1. CheckM lineage_wf (v1.0.5) was run on bins generated from DAS_Tool and 577 bins showed a completeness >50% and were characterized further⁴⁰. 37 Phylosift⁴¹ identified marker genes were used for preliminary phylogenetic identification of individual bins (Supplementary Table 5). Thereby, we identified two genomes, belonging to a previously uncharacterized phylum within the Asgard archaea, which we named Helarchaeota. To improve the quality of the two Helarchaeota genomes IDBA-UD was run on raw data using the command: “idba_ud -r Guay9_METAGENOME.fasta -o G9 --pre_correction --mink 75 --maxk 105 --seed_kmer 55 --num_threads 30”. Metaspades was run on raw data and Metabat assembled bins as follows: “metaspades.py --12 Guay16.11400.5.204846.CTCTCTA-CGTCTAA.filter-METAGENOME.fastq -o Metaspades --only-assembler --meta”. Binning procedures (using scaffolds longer than 2000 bp) as described above for the original bins were repeated with these redone assemblies. All bins were compared to the original Helarchaeota bins using blastn⁴² for identification. Mmgenome⁴³ and CheckM⁴⁰ were used to calculate genome statistics (i.e., contig length, genome size, contamination, and completeness). The highest quality Helarchaeota bin from each sample was chosen for further analyses. For the 4572-4 dataset, the best bin was generated using the Metaspades reassembly on the trimmed data and for the 4569-2 dataset the best bin was recovered using the Metaspades reassembly on the original Hel bin contigs. The final genomes were further cleaned by GC content, paired-end connections, sequence depth and coverage using Mmgenome⁴³.
CheckM was rerun on cleaned bins to estimate Hel_GB_A to be 82% and Hel_GB_B to be 87% complete, and both bins were characterized by a low degree of contamination (between 1.4 and 2.8% with no redundancy) (Table 1)⁴⁰. Genome size was estimated to be 4.6 Mbp for Hel_GB_A and 4.1 Mbp for Hel_GB_B and was calculated using percent completeness and bin size to extrapolate the likely size of the complete genome. CompareM was used to analyze differences between Helarchaeota bins and published Asgard bins using the command python comparem aai_wf --tmp_dir tmp/ --file_ext fa -c 8 aai_compair_loki aai_compair_loki_output (https://github.com/dparks1134/CompareM). Read abundance summarized by jgi_summarize_bam_contig_depths was used to calculate relative read abundance and total percent of metagenomic reads. Relative read abundance was calculated as total read abundance normalized to genome size and divided by total reads. Relative read abundance was then multiplied by the constant 1 × 10¹² for clarity. Total percent of metagenomic reads was calculated as total read abundance divided by total reads times 100. Relative read abundance was compared to other genomic bins recovered from these sites to look for co-occurrence²⁰.
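The three calculations described in this section (genome size extrapolation, relative read abundance, and percent of metagenomic reads) can be written out as a short Python sketch. The function and parameter names are hypothetical and the input values in the usage note are illustrative, not data from this study. Two points are assumptions: that the extrapolation is a simple division of bin size by fractional completeness, and that "genome size" in the normalization is expressed in base pairs.

```python
def estimated_genome_size(bin_size_mbp, completeness_pct):
    """Extrapolate complete genome size (Mbp) from bin size and completeness.

    Assumes a simple proportional extrapolation: bin size / fraction complete.
    """
    return bin_size_mbp / (completeness_pct / 100.0)


def percent_of_reads(read_abundance, total_reads):
    """Total percent of metagenomic reads: read abundance / total reads * 100."""
    return read_abundance / total_reads * 100.0


def relative_read_abundance(read_abundance, genome_size_bp, total_reads,
                            scale=1e12):
    """Read abundance normalized to genome size, divided by total reads.

    The result is multiplied by the constant 1e12, as stated in the text.
    Assumption: genome size is given in base pairs.
    """
    return read_abundance / genome_size_bp / total_reads * scale
```

As an illustration with made-up numbers, a 4.0 Mbp bin that is 80% complete extrapolates to 5.0 Mbp, and a bin recruiting 50 of 1000 reads accounts for 5% of the reads.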

16S rRNA gene analysis

Neither bin possessed a 16S rRNA gene sequence⁴¹. To uncover potentially unbinned 16S rRNA gene sequences from Helarchaeota, all 16S rRNA gene sequences obtained from samples 4569_2 and 4571_4 were identified using JGI-IMG annotations, regardless of whether or not the contig was successfully binned. These 16S rRNA gene sequences were compared using blastn⁴² (blastn -outfmt 6 -query Hel_possible_16s.fasta -db Hel_16s -out Hel_possible_16s_blast.txt -evalue 1E-20) to recently acquired 16S rRNA gene sequences from MAGs recovered from preliminary data from additional GB sites. A tree based on 37 Phylosift⁴¹ marker genes was used to assign taxonomy to these MAGs. We were able to identify five MAGs that possessed 16S rRNA gene sequences and that formed a monophyletic group with our Hel_GB bins (Supplementary Table 2; Megxx in Fig. 2). Of the unbinned 16S rRNA gene sequences, one was identified as a likely Helarchaeota sequence. The contig was retrieved from the 4572_4 assembly (designated Ga0180301_10078946), was 2090 bp long, and encoded a 16S rRNA gene sequence that was 1058 bp long. Given the small size of this contig relative to the length of the 16S rRNA gene, none of the other genes on the contig could be annotated. Blastn⁴² comparison to published Asgard 16S rRNA gene sequences was performed using the following command: blastn -outfmt 6 -query Hel_possible_16s.fasta -db Asgrad_16s -out Hel_possible_16s_blast.txt -evalue 1E-20 (Supplementary Table 2). The GC content of each 16S rRNA gene sequence was calculated using the Geo-omics script length+GC.pl (https://github.com/Geo-omics/scripts/blob/master/AssemblyTools/length%2BGC.pl). For further phylogenetic placement, the 16S rRNA gene sequences were aligned to the SILVA database (SINA v1.2.11) using the SILVA online server⁴⁴, and Geneious (v10.1.3)⁴⁵ was used to manually trim sequences. The alignment also contained 16S rRNA gene sequences from the preliminary Helarchaeota bins.
The cleaned alignment was used to generate a maximum-likelihood tree with RAxML as follows: “/raxmlHPC-PTHREADS-AVX -T 20 -f a -m GTRGAMMA -N autoMRE -p 12345 -x 12345 -s Nucleotide_alignment.phy -n output” (Fig. 1b).
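
The per-sequence length and GC content calculation mentioned above can be sketched in Python as follows. This is not the Geo-omics Perl script itself but a minimal stand-in; the function names are hypothetical, and the sketch assumes that GC content is the fraction of G and C over the full sequence length (how the original script treats ambiguous bases is not stated in the text).

```python
from typing import Dict, Iterable, List, Tuple


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Read FASTA-formatted lines into a {sequence_id: sequence} dictionary."""
    chunks: Dict[str, List[str]] = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            name = line[1:].split()[0]
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {seq_id: "".join(parts) for seq_id, parts in chunks.items()}


def length_and_gc(sequence: str) -> Tuple[int, float]:
    """Return sequence length and GC content in percent."""
    sequence = sequence.upper()
    length = len(sequence)
    gc_count = sequence.count("G") + sequence.count("C")
    gc_pct = 100.0 * gc_count / length if length else 0.0
    return length, gc_pct


if __name__ == "__main__":
    # Hypothetical toy record for illustration only.
    records = parse_fasta([">toy_16S example", "GGCCAATT"])
    for seq_id, seq in records.items():
        print(seq_id, *length_and_gc(seq))
```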

Phylogenetic analysis of ribosomal proteins

For a more detailed phylogenetic placement, we used BLASTp⁴⁶ to identify orthologs of 56 ribosomal proteins in the two Helarchaeota bins, as well as from a selection of 130 representative taxa of archaeal diversity and 14 eukaryotes. The full list of marker genes selected for phylogenomic analyses is shown in Supplementary Table 6. Individual protein datasets were aligned using mafft-linsi⁴⁷ and ambiguously aligned positions were trimmed using BMGE (-m BLOSUM30)⁴⁸. Maximum likelihood (ML) individual phylogenies were reconstructed using IQtree v. 1.5.5⁴⁹ under the LG+C20+G substitution model with 1000 ultrafast bootstraps and were manually inspected. Trimmed alignments were concatenated into a supermatrix, and two additional datasets were generated by removing eukaryotic and/or DPANN homologs to test the impact of taxon sampling on phylogenetic reconstruction. For each of these concatenated datasets, phylogenies were inferred using ML and Bayesian approaches. ML phylogenies were reconstructed using IQtree under the LG+C60+F+G+PMSF model⁵⁰. Statistical support for branches was calculated using 100 bootstrap replicates under the same model. To test the robustness of the phylogenies, the dataset was subjected to several treatments. For the “full dataset” (i.e., with all 146 taxa), we tested the impact of removing the 25% fastest-evolving sites, as within a deep phylogenetic analysis these sites are often saturated with multiple substitutions and, as a result of model misspecification, can manifest as an artifactual signal^51,52,53. The corresponding ML tree was inferred as described above. Bayesian phylogenies were reconstructed with Phylobayes for the dataset “without DPANN” under the LG+GTR model. Four independent Markov chain Monte Carlo chains were run for ~38,000 generations. After a burn-in of 20%, convergence was achieved for three of the chains (maxdiff < 0.29).
The initial supermatrix was also recoded into four categories in order to ameliorate the effects of model misspecification and saturation⁵⁴, and the corresponding phylogeny was reconstructed with Phylobayes under the CAT+GTR model. Four independent Markov chain Monte Carlo chains were run for ~49,000 generations. After a burn-in of 20%, convergence was achieved for all four chains (maxdiff < 0.19). All phylogenetic analyses performed are summarized in Supplementary Table 7, including maxdiff values and statistical support for the placement of Helarchaeota and of eukaryotes.
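
Recoding an amino acid alignment into four categories can be sketched in Python as shown below. The text does not name the recoding scheme, so the sketch assumes the four-state SR4 grouping of Susko and Roger; the function names and the choice to map unrecognized characters to "?" are likewise assumptions made for illustration.

```python
# Assumed SR4 grouping; the scheme actually used is not named in the text.
SR4_GROUPS = {
    "A": "AGNPST",
    "C": "CHWY",
    "G": "DEKQR",
    "T": "FILMV",
}

# Invert the grouping into a per-residue lookup table.
SR4_MAP = {aa: state for state, members in SR4_GROUPS.items() for aa in members}


def recode_sr4(sequence: str) -> str:
    """Recode one aligned protein sequence into four states."""
    recoded = []
    for residue in sequence.upper():
        if residue in SR4_MAP:
            recoded.append(SR4_MAP[residue])
        elif residue == "-":
            recoded.append("-")  # keep alignment gaps
        else:
            recoded.append("?")  # ambiguous or unknown residues
    return "".join(recoded)


def recode_alignment(alignment: dict) -> dict:
    """Apply the recoding to every sequence of a {taxon: sequence} alignment."""
    return {taxon: recode_sr4(seq) for taxon, seq in alignment.items()}


if __name__ == "__main__":
    # Hypothetical toy alignment for illustration only.
    print(recode_alignment({"taxon_1": "MKV-X", "taxon_2": "MRVAG"}))
```

Reducing the alphabet in this way discards the substitutions within each category, which are the ones most likely to be saturated, at the cost of some phylogenetic signal.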

Phylogenetic analysis of McrA and concatenated McrAB

McrA homologs were aligned using mafft-linsi⁴⁷ and trimmed with trimAL⁵⁵, and the final alignment consisting of 528 sites was subjected to phylogenetic analyses using IQtree v. 1.5.5⁴⁹ with the LG+C60+R+F model. Support values were estimated using 1000 ultrafast bootstraps⁵⁶ and the SH-like approximate likelihood ratio test⁵⁷. Sequences for McrA and McrB were aligned separately with mafft-linsi⁴⁷ and trimmed using trimAL⁵⁵. Subsequently, McrA and McrB sequences encoded in the same gene cluster were concatenated, yielding a total alignment of 972 sites. ML and Bayesian phylogenies were inferred using IQtree v. 1.5.5⁴⁹ with the mixture model LG+C60+R+F and PhyloBayes v. 3.2⁵⁸ with the CAT-GTR model, respectively. For ML inference, support values were estimated using 1000 ultrafast bootstraps⁵⁶ and the SH-like approximate likelihood ratio test⁵⁷. For Bayesian analyses, four chains were run in parallel, sampling every 50 points until convergence was reached (maximum difference < 0.07; mean difference < 0.002). The first 25% of the respective generations were selected as burn-in. Phylobayes posterior predictive values were mapped onto the IQtree tree using sumlabels from the DendroPy package⁵⁹. The final trees were rooted artificially between the canonical Mcr and divergent Mcr-like proteins. Original alignment and treefiles are available upon request.
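
The concatenation of separately aligned and trimmed McrA and McrB sequences can be sketched in Python as follows. The function names and the cluster identifiers are hypothetical, and the sketch assumes that each alignment is held as a dictionary keyed by gene cluster and that only clusters present in both alignments are retained.

```python
from typing import Dict


def alignment_length(alignment: Dict[str, str]) -> int:
    """Return the number of sites, checking that all sequences are equally long."""
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("sequences in an alignment must have equal length")
    return lengths.pop()


def concatenate_alignments(aln_a: Dict[str, str],
                           aln_b: Dict[str, str]) -> Dict[str, str]:
    """Join two alignments end to end for every gene cluster found in both."""
    alignment_length(aln_a)
    alignment_length(aln_b)
    shared = sorted(set(aln_a) & set(aln_b))
    return {cluster: aln_a[cluster] + aln_b[cluster] for cluster in shared}


if __name__ == "__main__":
    # Hypothetical toy alignments for illustration only.
    mcr_a = {"cluster_1": "MK-L", "cluster_2": "MRAL"}
    mcr_b = {"cluster_1": "GG", "cluster_3": "AA"}
    combined = concatenate_alignments(mcr_a, mcr_b)
    print(combined, alignment_length(combined))
```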

Metabolic analyses

Gene prediction for the two Helarchaeota bins was performed using prodigal⁶⁰ (V2.6.2) with default settings and Prokka⁶¹ (v1.12) with the option “--kingdom archaea”. Results for both methods were comparable and yielded a total of 3574–3769 and 3164–3287 genes for Hel_GB_A and Hel_GB_B, respectively, with Prokka consistently identifying fewer genes. Genes were annotated by uploading the protein fasta files from both methods to KAAS (KEGG Automatic Annotation Server) for complete or draft genomes to assign orthologs⁶². Files were run using the following settings: prokaryotic option, GhostX, and bi-directional best hit (BBH)⁶². Additionally, genes were annotated by JGI-IMG⁶³ to confirm hits using two independent databases. Hits of interest were confirmed using blastp on the NCBI webserver⁴⁶. The dbCAN⁶⁴ and MEROPS⁶⁵ webservers were run using default conditions for identification of carbohydrate-degrading enzymes and peptidases, respectively. Hits with e-values above e⁻²⁰ were discarded. In addition to these methods, an extended search was used to categorize genes involved in butane metabolism, syntrophy and energy transfer.

Identified genes predicted to code for putative alkane oxidation proteins were similar to those described from Candidatus Syntrophoarchaeum spp. Therefore, a blastp⁴⁶ database consisting of proteins predicted to be involved in the alkane oxidation pathway of Ca. Syntrophoarchaeum was created in order to identify additional proteins in Helarchaeota that may function in alkane oxidation. Positive hits were confirmed with blastp⁴⁶ on the NCBI webserver and compared to the annotations from JGI-IMG⁶³, Interpro⁶⁶, Prokka⁶¹, and KAAS⁶². Genes for mcrABG were further confirmed by a HMMER⁶⁷ search against a published database using the designated threshold values⁶⁸ and by multiple MCR trees (see Methods). To confirm that the contigs with the mcrA gene cluster were not misbinned, all other genes on these contigs were analyzed for their phylogenetic placement and gene content. The prodigal protein predictions for genes on the contigs with mcrA operons were used to determine directionality and length of the potential operon.

To identify genes that are involved in electron and hydrogen transfer across the membrane, a database was created of known genes relevant in syntrophy that were downloaded from NCBI. The protein sequences of the two Helarchaeota genomes were blasted against the database to detect relevant hits (E-value ≤ e⁻¹⁰). All hits were confirmed using the NCBI webserver, Interpro, JGI-IMG, and KEGG. Hydrogenases were identified by a HMMER search against a published database using the designated threshold values. Hits were confirmed with comparisons against JGI annotations and NCBI blasts, the HydDB database⁶⁹ and a manual database made from published sequences^70,71. All detected hydrogenases were used to generate two phylogenetic trees, one for proteins identified as small subunits and one for large subunits, in order to properly identify the different hydrogenase subgroups. Hydrogenases that are part of the proposed complex were then further analyzed to evaluate whether this was a possible operon by looking for possible transcription factors and binding motifs (Supplementary Methods).
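
Filtering BLAST hits by E-value, as used for the syntrophy gene search, can be sketched in Python as follows. The sketch assumes that the search results are in the standard 12-column tabular BLAST output (-outfmt 6, with the E-value in the eleventh column) and that the threshold is an upper bound on the E-value; the output format of this particular search is not stated in the text, and the function name and example records are hypothetical.

```python
from typing import Iterable, List, Tuple

# Zero-based column indices in default tabular BLAST output (-outfmt 6).
QUERY_COL, SUBJECT_COL, EVALUE_COL = 0, 1, 10


def filter_hits(lines: Iterable[str],
                max_evalue: float = 1e-10) -> List[Tuple[str, str, float]]:
    """Keep (query, subject, evalue) for hits at or below the E-value cutoff."""
    kept = []
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.rstrip("\n").split("\t")
        evalue = float(fields[EVALUE_COL])
        if evalue <= max_evalue:
            kept.append((fields[QUERY_COL], fields[SUBJECT_COL], evalue))
    return kept


if __name__ == "__main__":
    # Hypothetical toy records for illustration only.
    example = [
        "prot_1\tref_A\t90.0\t100\t5\t0\t1\t100\t1\t100\t1e-30\t200",
        "prot_2\tref_B\t40.0\t50\t30\t2\t1\t50\t1\t50\t0.001\t30",
    ]
    for hit in filter_hits(example):
        print(hit)
```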

ESP identification

Gene prediction for the two Helarchaeota bins was performed using prodigal⁶⁰ (V2.6.2) with default settings. All the hypothetical proteins inferred in both Helarchaeota bins were used as seeds against InterPro⁶⁶, arCOG⁷², and nr using BLAST⁴⁶. The annotation table from Zaremba-Niedzwiedzka et al.¹² was used as a basis for the comparison. The IPRs (or in some cases, the arCOGs) listed in Zaremba-Niedzwiedzka et al.¹² were searched for in the Helarchaeota genomes, and the resulting information was used to complete the presence/absence table. When something that had previously been detected in an Asgard bin was not found in a Helarchaeota bin using the InterPro/arCOG annotations, BLAST searches were carried out using the closest Asgard seeds to verify the absence. In some cases, specific analyses were used to verify the homology or relevance of particular sequences. The details for each individual ESP are described in Supplementary Methods.
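
The construction of the presence/absence table, and the selection of entries that require a follow-up BLAST search, can be sketched in Python as follows. The function names, genome labels, and identifiers are placeholders introduced for illustration; they do not correspond to real InterPro or arCOG accessions or to the actual annotation results.

```python
from typing import Dict, Iterable, List, Set, Tuple


def presence_absence(reference_ids: Iterable[str],
                     genome_annotations: Dict[str, Set[str]]
                     ) -> Dict[str, Dict[str, bool]]:
    """For each reference identifier, record whether each genome carries it."""
    return {
        ref_id: {genome: ref_id in ids for genome, ids in genome_annotations.items()}
        for ref_id in reference_ids
    }


def needs_blast_check(table: Dict[str, Dict[str, bool]]) -> List[Tuple[str, str]]:
    """List (identifier, genome) pairs that are absent and should be re-checked."""
    return [
        (ref_id, genome)
        for ref_id, row in table.items()
        for genome, present in row.items()
        if not present
    ]


if __name__ == "__main__":
    # Placeholder identifiers and annotations for illustration only.
    reference = ["ID_PLACEHOLDER_1", "ID_PLACEHOLDER_2"]
    annotations = {
        "genome_A": {"ID_PLACEHOLDER_1", "ID_PLACEHOLDER_2"},
        "genome_B": {"ID_PLACEHOLDER_1"},
    }
    table = presence_absence(reference, annotations)
    print(table)
    print(needs_blast_check(table))
```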

Reporting Summary

Further information on experimental design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The raw reads from the metagenomes described in this study are available at JGI under the IMG genome IDs 3300014911 and 3300013103 for samples 4569-2 and 4571-4, respectively. Genome sequences are available at NCBI under the accession numbers SAMN09406154 and SAMN09406174 for Hel_GB_A and Hel_GB_B, respectively. Both are associated with BioProject PRJNA362212.

References

Claypool, G. E. & Kvenvolden, K. A. Methane and other hydrocarbon gases in marine sediment. Annu. Rev. Earth Planet. Sci. 11, 299–327 (1983).
Thauer, R. K., Kaster, A.-K., Seedorf, H., Buckel, W. & Hedderich, R. Methanogenic archaea: ecologically relevant differences in energy conservation. Nat. Rev. Microbiol. 6, 579–591 (2008).
Reeburgh, W. S. Oceanic methane biogeochemistry. Chem. Rev. 107, 486–513 (2007).
Spang, A., Caceres, E. F. & Ettema, T. J. G. Genomic exploration of the diversity, ecology, and evolution of the archaeal domain of life. Science 357, eaaf3883 (2017).
Evans, P. N. et al. Methane metabolism in the archaeal phylum Bathyarchaeota revealed by genome-centric metagenomics. Science 350, 434–438 (2015).
Vanwonterghem, I. et al. Methylotrophic methanogenesis discovered in the archaeal phylum Verstraetearchaeota. Nat. Microbiol. 1, 16170 (2016).
Laso-Pérez, R. et al. Thermophilic archaea activate butane via alkyl-coenzyme M formation. Nature 539, 396–401 (2016).
Dombrowski, N., Seitz, K. W., Teske, A. P. & Baker, B. J. Genomic insights into potential interdependencies in microbial hydrocarbon and nutrient cycling in hydrothermal sediments. Microbiome 5, 106 (2017).
Bazylinski, D. A., Farrington, J. W. & Jannasch, H. W. Hydrocarbons in surface sediments from a Guaymas Basin hydrothermal vent site. Org. Geochem. 12, 547–558 (1988).
Teske, A., Callaghan, A. V. & LaRowe, D. E. Biosphere frontiers of subsurface life in the sedimented hydrothermal system of Guaymas Basin. Front. Microbiol. 5, 1–11 (2014).
Von Damm, K. L., Edmond, J. M., Measures, C. I. & Grant, B. Chemistry of submarine hydrothermal solutions at Guaymas Basin, Gulf of California. Geochim. Cosmochim. Acta 49, 2221–2237 (1985).
Zaremba-Niedzwiedzka, K. et al. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541, 353–358 (2017).
Spang, A. et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521, 173–179 (2015).
Seitz, K. W., Lazar, C. S., Hinrichs, K.-U., Teske, A. P. & Baker, B. J. Genomic reconstruction of a novel, deeply branched sediment archaeal phylum with pathways for acetogenesis and sulfur reduction. ISME J. 10, 1696–1705 (2016).
Jørgensen, S. L., Thorseth, I. H., Pedersen, R. B., Baumberger, T. & Schleper, C. Quantitative and phylogenetic study of the Deep Sea Archaeal Group in sediments of the Arctic mid-ocean spreading ridge. Front. Microbiol. 4, 1–11 (2013).
Jorgensen, S. L. et al. Correlating microbial community profiles with geochemical data in highly stratified sediments from the Arctic Mid-Ocean Ridge. Proc. Natl Acad. Sci. USA 109, E2846–E2855 (2012).
Hartman, H. & Fedorov, A. The origin of the eukaryotic cell: a genomic investigation. Proc. Natl Acad. Sci. USA 99, 1420–1425 (2002).
Eme, L., Spang, A., Lombard, J., Stairs, C. & Ettema, T. J. G. Archaea and the origin of eukaryotes. Nat. Rev. Microbiol. 15, 711–723 (2017).
Spang, A. et al. A renewed syntrophy hypothesis for the origin of the eukaryotic cell based on comparative analysis of Asgard archaeal metabolism. Nat. Microbiol. 2, 1–9 (2019). https://doi.org/10.1038/s41564-019-0406-9
Dombrowski, N., Teske, A. P. & Baker, B. J. Extensive metabolic versatility and redundancy in microbially diverse, dynamic Guaymas Basin hydrothermal sediments. Nat. Commun. 9, 4999 (2018).
McKay, L. et al. Thermal and geochemical influences on microbial biogeography in the hydrothermal sediments of Guaymas Basin, Gulf of California. Environ. Microbiol. Rep. 8, 150–161 (2016).
Yarza, P. et al. Uniting the classification of cultured and uncultured bacteria and archaea using 16S rRNA gene sequences. Nat. Rev. Microbiol. 12, 635–645 (2014).
Lazar, C. S. et al. Environmental controls on intragroup diversity of the uncultured benthic archaea of the miscellaneous Crenarchaeotal group lineage naturally enriched in anoxic sediments of the White Oak River estuary (North Carolina, USA). Environ. Microbiol. 17, 2228–2238 (2015).
Tabita, F. R., Satagopan, S., Hanson, T. E., Kreel, N. E. & Scott, S. S. Distinct form I, II, III, and IV Rubisco proteins from the three kingdoms of life provide clues about Rubisco evolution and structure/function relationships. J. Exp. Bot. 59, 1515–1524 (2007).
Dowell, F. et al. Microbial communities in methane- and short chain alkane-rich hydrothermal sediments of Guaymas Basin. Front. Microbiol. 7, 17 (2016).
Krukenberg, V. et al. Candidatus Desulfofervidus auxilii, a hydrogenotrophic sulfate-reducing bacterium involved in the thermophilic anaerobic oxidation of methane. Environ. Microbiol. 18, 3073–3091 (2016).
Stams, A. J. M. & Plugge, C. M. Electron transfer in syntrophic communities of anaerobic bacteria and archaea. Nat. Rev. Microbiol. 7, 568–577 (2009).
Meuer, J., Kuettner, H. C., Zhang, J. K., Hedderich, R. & Metcalf, W. W. Genetic analysis of the archaeon Methanosarcina barkeri Fusaro reveals a central role for Ech hydrogenase and ferredoxin in methanogenesis and carbon fixation. Proc. Natl Acad. Sci. USA 99, 5632–5637 (2002).
Kunow, J., Linder, D., Stetter, K. O. & Thauer, R. K. F420H2: quinone oxidoreductase from Archaeoglobus fulgidus. Eur. J. Biochem. 223, 503–511 (1994).
Wegener, G., Krukenberg, V., Riedel, D., Tegetmeyer, H. E. & Boetius, A. Intercellular wiring enables electron transfer between methanotrophic archaea and bacteria. Nature 526, 587–590 (2015).
McKay, L. J. et al. Spatial heterogeneity and underlying geochemistry of phylogenetically diverse orange and white Beggiatoa mats in Guaymas Basin hydrothermal sediments. Deep Sea Res. Part I 67, 21–31 (2012).
Meyer, S. et al. Microbial habitat connectivity across spatial scales and hydrothermal temperature gradients at Guaymas Basin. Front. Microbiol. 4, 207 (2013).
Dick, G. J. et al. Community-wide analysis of microbial genome sequence signatures. Genome Biol. 10, R85 (2009).
Kang, D. D., Froula, J., Egan, R. & Wang, Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ 3, e1165 (2015).
Alneberg, J. et al. Binning metagenomic contigs by coverage and composition. Nat. Methods 11, nmeth.3103 (2014).
Sieber, C. M. K. et al. Recovery of genomes from metagenomes via a dereplication, aggregation, and scoring strategy. bioRxiv 107789. https://doi.org/10.1101/107789 (2017)
Eren, A. M. et al. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ 3, e1319 (2015).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997 [q-bio] (2013).
Li, H. et al. The sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. gr.186072.114. https://doi.org/10.1101/gr.186072.114 (2015)
Darling, A. E. et al. PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ 2, e243 (2014).
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Karst, S. M., Kirkegaard, R. H. & Albertsen, M. mmgenome: a toolbox for reproducible genome extraction from metagenomes. bioRxiv 059121. https://doi.org/10.1101/059121 (2016)
Pruesse, E., Peplies, J. & Glöckner, F. O. SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes. Bioinformatics 28, 1823–1829 (2012).
Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinforma. Oxf. Engl. 28, 1647–1649 (2012).
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Criscuolo, A. & Gribaldo, S. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol. Biol. 10, 210 (2010).
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. Iq-tree: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Wang, H.-C., Minh, B. Q., Susko, E. & Roger, A. J. Modeling site heterogeneity with posterior mean site frequency profiles accelerates accurate phylogenomic estimation. Syst. Biol. 67, 216–235 (2017).
Jeffroy, O., Brinkmann, H., Delsuc, F. & Philippe, H. Phylogenomics: the beginning of incongruence? Trends Genet. 22, 225–231 (2006).
Lartillot, N. & Philippe, H. Improvement of molecular phylogenetic inference and the phylogeny of Bilateria. Philos. Trans. R. Soc. Lond. B Biol. Sci. 363, 1463–1472 (2008).
Brown, M. W. M. et al. Phylogenomics demonstrates that breviate flagellates are related to opisthokonts and apusomonads. Proc. R. Soc. B Biol. Sci. 280, 20131755 (2013).
Susko, E. & Roger, A. J. On reduced amino acid alphabets for phylogenetic inference. Mol. Biol. Evol. 24, 2139–2150 (2007).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinforma. Oxf. Engl. 25, 1972–1973 (2009).
Minh, B. Q., Nguyen, M. A. T. & von Haeseler, A. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30, 1188–1195 (2013).
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Lartillot, N. & Philippe, H. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol. Biol. Evol. 21, 1095–1109 (2004).
Sukumaran, J. & Holder, M. T. DendroPy: a Python library for phylogenetic computing. Bioinforma. Oxf. Engl. 26, 1569–1571 (2010).
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinforma. 11, 119 (2010).
Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinforma. Oxf. Engl. 30, 2068–2069 (2014).
Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A. C. & Kanehisa, M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 35, W182–W185 (2007).
Markowitz, V. M. et al. IMG: the integrated microbial genomes database and comparative analysis system. Nucleic Acids Res. 40, D115–D122 (2012).
Yin, Y. et al. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 40, W445–W451 (2012).
Rawlings, N. D., Barrett, A. J. & Bateman, A. MEROPS: the peptidase database. Nucleic Acids Res. 38, D227–D233 (2010).
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinforma. Oxf. Engl. 30, 1236–1240 (2014).
Johnson, L. S., Eddy, S. R. & Portugaly, E. Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinforma. 11, 431 (2010).
Anantharaman, K. et al. Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system. Nat. Commun. 7, 13219 (2016).
Søndergaard, D., Pedersen, C. N. S. & Greening, C. HydDB: A web tool for hydrogenase classification and analysis. Sci. Rep. 6, 1–11 (2016).
Vignais, P. M. & Billoud, B. Occurrence, classification, and biological function of hydrogenases: an overview. Chem. Rev. 107, 4206–4272 (2007).
Vignais, P. M., Billoud, B. & Meyer, J. Classification and phylogeny of hydrogenases. FEMS Microbiol. Rev. 25, 455–501 (2001).
Makarova, K. S., Wolf, Y. I. & Koonin, E. V. Archaeal Clusters of Orthologous Genes (arCOGs): an update and application for analysis of shared features between Thermococcales, Methanococcales, and Methanobacteriales. Life 5, 818–840 (2015).

Acknowledgements

This study was supported in part by an Alfred P. Sloan Foundation fellowship (FG-2016-6301) and National Science Foundation Directorate of Biological Sciences (Systematics and Biodiversity Sciences) (Award 1737298) to B.J.B. Sampling in Guaymas Basin and post-cruise work were supported by NSF Awards OCE-0647633 and OCE-1357238 to A.P.T., respectively. The work conducted by the U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231 provided to N.D. A.S. was supported by a Marie Curie IEF European grant (625521), a VR starting grant (2016-03559) and a WISE fellowship by the NWO-I Foundation of the Netherlands Organisation for Scientific Research. L.E. was funded by the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 704263. This work was supported by grants of the European Research Council (ERC Starting grant 310039-PUZZLE_CELL), the Swedish Foundation for Strategic Research (SSF-FFL5) and the Swedish Research Council (VR grant 2015-04959) to T.J.G.E.

Author information

Authors and Affiliations

Department of Marine Science, University of Texas Austin, Port Aransas, TX, 78373, USA
Kiley W. Seitz, Nina Dombrowski & Brett J. Baker
NIOZ, Royal Netherlands Institute for Sea Research, and Utrecht University, Den Burg, 1797 SZ, AB, The Netherlands
Nina Dombrowski & Anja Spang
Department of Cell and Molecular Biology, Science for Life Laboratory, Uppsala University, Uppsala, SE-75123, Sweden
Laura Eme, Anja Spang, Jonathan Lombard & Thijs J. G. Ettema
Unité d’Ecologie, Systématique et Evolution, CNRS, Université Paris-Sud, Orsay, 91400, France
Laura Eme
University of Minnesota Duluth, Duluth, 55812, MN, USA
Jessica R. Sieber
Department of Marine Sciences, University of North Carolina, Chapel Hill, 27599, NC, USA
Andreas P. Teske
Laboratory of Microbiology, Department of Agrotechnology and Food Sciences, Wageningen University, Wageningen, NL-6708WE, The Netherlands
Thijs J. G. Ettema

Contributions

K.W.S., T.J.G.E., N.D. and B.J.B. conceived the study. K.W.S., N.D. and B.J.B. analyzed the genomic data. A.P.T. collected and processed the samples. K.W.S., A.S. and L.E. performed the phylogenetic analyses. J.L. analyzed the ESPs. K.W.S., J.R.S., A.P.T. and B.J.B. handled the metabolic inferences. B.J.B. and K.W.S. wrote the paper with inputs from all authors.

Corresponding author

Correspondence to Brett J. Baker.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Journal peer review information: Nature Communications thanks Laura Hug and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Seitz, K.W., Dombrowski, N., Eme, L. et al. Asgard archaea capable of anaerobic hydrocarbon cycling. Nat Commun 10, 1822 (2019). https://doi.org/10.1038/s41467-019-09364-x

Received: 12 December 2018
Accepted: 06 March 2019
Published: 23 April 2019
DOI: https://doi.org/10.1038/s41467-019-09364-x
