Abstract
The roles of Asgard archaea in eukaryogenesis and marine biogeochemical cycles are well studied, yet their contributions in soil ecosystems remain unknown. Of particular interest are Asgard archaeal contributions to methane cycling in wetland soils. To investigate this, we reconstructed two complete genomes for soil-associated Atabeyarchaeia, a new Asgard lineage, and a complete genome of Freyarchaeia, and predicted their metabolism in situ. Metatranscriptomics reveals expression of genes for [NiFe]-hydrogenases, pyruvate oxidation and carbon fixation via the Wood-Ljungdahl pathway. Also expressed are genes encoding enzymes for amino acid metabolism, anaerobic aldehyde oxidation, hydrogen peroxide detoxification and carbohydrate breakdown to acetate and formate. Overall, soil-associated Asgard archaea are predicted to include non-methanogenic acetogens, highlighting their potential role in carbon cycling in terrestrial environments.
Similar content being viewed by others
Introduction
Wetland soils are hotspots for methane production by methanogenic archaea. The extent of methane production depends in part on the availability of substrates for methanogenesis (e.g., hydrogen, carbon dioxide, formate, formaldehyde, methanol, acetate), compounds that are both produced and consumed by co-existing microbial community members. Among the groups of organisms that coexist with methanogens are Asgard archaea, of recent interest from the perspective of eukaryogenesis1,2,3,4. To date, numerous lineages of Asgard archaea have been reported from anaerobic, sedimentary freshwater, marine, and hydrothermal environments1,2,3,4,5,6,7,8,9,10,11,12,13,14,15. Predictions primarily from draft metagenome-assembled genomes (MAGs) indicate metabolic diversity and flexibility that may enable them to occupy these diverse ecological niches. It appears that Asgard archaea are not capable of methane production since they lack the key canonical methyl-coenzyme M reductase (MCR). Although a few complete genomes for Asgard from hydrothermal and geothermal environments have been reported9,15,16,17, most metabolic analyses of Asgard archaea are limited by reliance on partial genomes. To date, no Asgard genomes from non-estuarine wetland soils have been reported. Thus, nothing is known about the ways in which Asgard archaea directly (via methane production) or indirectly (via metabolic interactions) impact methane cycling in wetlands.
To investigate the roles of Asgard archaea in carbon cycling in wetland soil, we reconstructed two complete genomes for a newly defined group, here named Atabeyarchaeia, and one complete genome for a group named Freyarchaeia. Freyarchaeia MAGs were originally reconstructed from Guaymas Basin, located in the Gulf of California, México14, and from Jinze Hot Spring (Yunnan, China)4. Subsequently, another group used the original data to recover similar genomes and referred to them as Jordarchaeia18. Here, we retain the original nomenclature. The genomes for soil Asgard archaea were initially reconstructed by manual curation of Illumina short-read assemblies and then validated using both Nanopore and PacBio long reads. These fully curated genomes enabled us to comprehensively sample the genetic potential. This enabled us to constrain metabolism without the risks associated with reliance on partial and/or contaminated draft genomes. The genomes also provided context for metatranscriptomic measurements of in situ activity. Our integrated analysis of gene expression and metabolic predictions revealed roles for Atabeyarchaeia and Freyarchaeia in the production and consumption of carbon compounds that can serve as substrates for methanogenesis by coexisting methanogenic archaea.
Results
Complete genomes and phylogenetic placement of Asgard archaea from wetland soil
We analyzed Illumina metagenomic data from samples collected from 20 cm to 175 cm depth in the soil of a wetland located in Lake County, California, USA. We previously reported megaphages19 and Methanoperedens archaea and their 1 Mb-scale “Borg” extrachromosomal elements from this site20 (Fig. 1a). From the metagenomic analyses conducted at this site, we determined that archaea account for > 45% of the total community below a depth of 60 cm. Archaeal groups detected include members of the Asgardarchaeota, Thermoproteota, Thermoplasmatota, Nanoarchaeota, Micrarchaeota, Halobacteriota, Hadarchaeota, Aenigmatarchaeota and Methanobacteriota (Fig. 1b). Notably, methanogens assigned to Methanomicrobia and Methanomethylicia were actively transcribing the MCR complex, consistent with methanogenesis and anaerobic oxidation of methane in situ (Supplementary Fig. 1). In addition, we searched publicly available metagenomes to determine the biogeographical distribution of Atabeyarchaeia and Freyarchaeia using singleM21. We detected Atabeyarchaeia predominantly in freshwater sediment, peat, and wetland environments somewhat analogous to the site studied here. In contrast, Freyarchaeia were found across a diverse array of habitats, including extreme environments such as hydrothermal vents and hot springs, as well as in animal-associated and wetland ecosystems (Supplementary Fig. 2a, b).
From 60 cm, 80 cm, and 100 cm deep wetland soil, we recovered four draft Asgard genomes, three of which were manually curated to completion using methods described previously22. Taxonomic classification using the Silva DB placed the 3,576,204 bp genome as Freyarchaeia. 16S rRNA gene sequence analysis showed the two other complete genomes were distinct from Freyarchaeia (16S rRNA genes are < 75% identical), thus representing organisms from a separate, new lineage. These genomes are 2,808,651 and 2,756,679 bp in length (Supplementary Data 1) with an average amino acid identity (AAI) of ~ 70% (Supplementary Data 2).
Phylogenetic analyses using several sets of marker genes (see “materials and methods”) placed our two novel complete genomes in a monophyletic group within the Asgard clade as a sister group to Freyarchaeia (Fig. 1c). We performed phylogenetic analyses using concatenated marker sets of 47 archaeal clusters of orthologous genes (arCOGs) and 15 ribosomal protein (RP15) gene cluster (Supplementary Fig. 3), as well as the 16S rRNA gene (Supplementary Fig. 4). The new genomes share only 40–45% AAI when compared to other Asgard genomes, consistent with their assignment to a new phylum. Although our analyses provide evidence for distinction at the phylum level, we chose to adhere to the Genome Taxonomy Database (GTDB) for standardized microbial genome nomenclature (Supplementary Data 3). Here, we propose the name Candidatus “Atabeyarchaeia” for this new group, where ‘Atabey’ is a goddess in of Taíno Puerto Rican mythology. Atabeyarchaeia is represented by the complete Atabeyarchaeia group 1 (Atabeyarchaeum deiterrae 1 sp. nov.) and group 2 (Atabeyarchaeum deiterrae 2 sp. nov) genomes. Included in this group are two highly fragmented draft genomes from the Lake Cootharaba Group (ALCG)12. The cumulative GC skew of the Freyarchaeia and Atabeyarchaeia genomes is consistent with bidirectional replication. This style of replication is typical of bacterial genomes but has not been widely reported in Archaea and has never been described in the Asgard group (Fig. 1d and Supplementary Fig. 5). Intriguingly, complete genome analyses of representatives from Heimdallarchaeia, Lokiarchaeia, and Odinarchaeia suggest the absence of bidirectional replication in these lineages (Supplementary Fig 5).
Unexpectedly, we found that 92% to 95% of tRNA genes from all three genomes contain at least one intron. This contrasts with the general estimate that 15% of archaeal tRNA harbor introns23, and with Thermoproteales (another order of archaea), where 70% of the tRNAs contain introns24. In total, there are 228 tRNA introns across the three new Asgard genomes (Supplementary Data 4). Unlike most archaeal tRNA introns that occur in the anticodon loop at position 37 / 3825,26, Atabeyarchaeia and Freyarchaeia introns often occur at non-canonical positions, and over half of their tRNA genes have multiple introns (Supplementary Data 4).
Subsequently, we acquired and independently assembled Oxford Nanopore and PacBio long-reads from a subset of the samples to generate three circularized genomes that validate the overall topology of all three curated Illumina read-based genomes (Supplementary Fig. 6 and Supplementary Data 1). These complete genomes allowed us to genomically describe two Atabeyarchaeia-2 strain variants from 100 cm and 175 cm depth soil. In addition, we used Illumina reads to curate a draft Nanopore genome for another Atabeyarchaeia species, Atabeyarchaeia-3, from 75 cm and 175 cm depth soil (Supplementary Fig. 6). The Atabeyarchaeia-3 genome is most closely related to the Asgard Lake Cootharaba Group (ALCG) fragments18. To further solidify the phylogenetic position of Atabeyarchaeia, we included the Atabeyarchaeia-3 genome and another draft genome (Atabeyarchaeia-4) from Illumina reads in the phylogenetic analysis.
Using the Asgard clusters of orthologous genes (AsCOGS) database and functional classification, we identified eukaryotic signature proteins (ESPs) in the complete and public genomes of Atabeyarchaeia and Freyarchaeia2,3. Atabeyarchaeia and Freyarchaeia genomes had the highest percentage of hits for ‘Intracellular trafficking, secretion, and vesicular transport’ (U) among the AsCOG functional classes, accounting for 84.3% of the hits to the database. Within this class, we identified key protein domains such as Adaptin, ESCRT-I-III complexes, Gelsolin family protein, Longin domain, Rab-like GTPase, Ras family GTPase, and Roadblock/LC7 domain (Supplementary Data 5 and Supplementary Fig. 8). The ‘Post Translational modification, protein turnover, and chaperones’ category (O) followed with a count of 101 (15.8%), highlighting domains like Ubiquitin, Jab1/MPN domain-containing protein, and the RING finger domain. The presence of ESPs in the newly described Atabeyarchaeia lineage and their presence in Freyarchaeia aligns with previous findings for Asgardarchaeota1,3,4.
Expression of energy conservation pathways constrain key metabolisms in situ
We analyzed the metabolic potential of the three complete genomes and investigated their gene expression in situ through metatranscriptomics of soil samples (see “materials and methods”, Fig. 2, Supplementary Data 6 and Supplementary Data 7). The metatranscriptomic data indicate the expression of genes involved in key energy conservation pathways (Fig. 3a). The most transcribed metabolic genes encode soluble heterodisulfide reductase (HdrABC), a cytosolic protein complex involved in electron bifurcation, [NiFe] hydrogenases (groups 3 and 4), ATP synthase, numerous aldehyde ferredoxin oxidoreductases, enzymes for phosphoenolpyruvate (PEP) and pyruvate metabolism, and carbon monoxide dehydrogenase/acetyl CoA synthetase (CODH/ACS). Notably, the Hdr, the group 3 and group 4 hydrogenase (including up to eight Nuo-like), as well as the ATP synthase, are co-encoded in a syntenic block in all the genomes (Fig. 4a).
The group 4 [NiFe]-hydrogenases of these archaea appear to conserve energy through a common mechanism with complex I (Nuo). Phylogenetic analysis of the large subunit of these hydrogenases suggests they are closely related to those of Odinarchaeia, Heimdallarchaeia, and Hermodarchaeia (Fig. 4b and Supplementary Data 8). However, the exact function of this unclassified Asgard group has not been validated biochemically27. One clue relies on the identification of eight genes homologous to the transmembrane proton-translocating subunits of complex I, NuoL, M, and N (E. coli nomenclature), and Mrp-type Na+/H+ antiporters. Thus, these Asgard archaea may couple hydrogenase activity to Na+/H+ translocation, thereby generating a sodium/proton-motive force that drives the ATP synthase28,29,30. We employed AlphaFold2 to model the hydrogenase and associated complex I-like modules from Freyarchaeia and Atabeyarchaeia (Fig. 4c and Supplementary Fig. 9a–f). Overall, the predicted structure has a cytosolic and membrane-associated portion (Fig. 4c). The cytosolic portion aligned with the respiratory membrane-bound hydrogenase (MBH) from Pyrococcus furiosus28 with high confidence (Fig. 4d). When superimposed, the predicted structures of the membrane-associated hydrophobic L, M, K, and S chains are equivalent to those of bacterial complex I. In the canonical complex I31,32, Chains L, M, N, and K translocate protons32,33, a process that is facilitated by a lateral helix, helix HL, of chain L. A full-length helix HL is also present in the L-like subunit of the Asgard complexes (Fig. 4e), but not in the other structurally characterized respiratory membrane-bound hydrogenases (Fig. 4e and Supplementary Fig. 9a–f).
The Group 3c cofactor-coupled bidirectional [NiFe] hydrogenase (Supplementary Fig. 10a), in combination with HdrABC, suggests the capability to bifurcate electrons from H2 to ferredoxin and an unidentified heterodisulfide compound. This capacity has been observed in methanogenic archaea via the MvhADG–HdrABC system34,35. Atabeyarchaeia genomes also encode two independent gene clusters for the Group 3b NADP-coupled [NiFe] hydrogenases (Supplementary Fig. 10b). Their presence suggests the capacity to maintain redox equilibrium and potentially grow lithoautotrophically by using H2 as an electron donor, as suggested for other Asgardarchaeota members10,14,27.
Atabeyarchaeia and Freyarchaeia encode both the tetrahydromethanopterin (H4MPT) methyl branch and the carbonyl branch of the Wood–Ljungdahl pathway (WLP) (Supplementary Fig. 11). This reversible pathway can be used to reduce CO2 to acetyl coenzyme A (acetyl CoA), which can be further converted to acetate. This last conversion can lead to energy conservation in both Asgard lineages via substrate-level phosphorylation when mediated by acetate-CoA ligase (see below). We confirmed the expression of almost all of the genes of the methyl and carbonyl branches, including the acetate-CoA ligase, in all complete genomes. Gene expression data indicate that when H2 is available in the ecosystem, these archaea have the genetic potential to utilize the Wood-Ljungdahl pathway (WLP) for the reduction of CO2 or formate. In addition, the potential exists for the reverse operation of the WLP to oxidize acetate. The observed expression of energy-converting hydrogenases and ATP synthases suggests that these archaea are equipped to maintain cofactor ratios and conserve energy, which could support these metabolic processes if they occur. The metabolic inferences, along with the transcriptional data, including the expression of pyruvate-ferredoxin oxidoreductase (por) genes in all three Asgard genomes indicate a reliance on an archaeal version of the WLP to perform acetogenesis35,36. This acetogenic lifestyle appears to involve energy conservation through a hydrogenase-dependent chemiosmotic mechanism similar to that observed in some acetogenic bacteria37.
Potential for non-methanogenic methylotrophic lifestyle
Despite the absence of the MCR complex, Freyarchaeia genomes have all the necessary genes to synthesize coenzyme-M from sulfopyruvate via the ComABC pathway similar to methanogens38,39. Most methanogens conserve energy via the Na+-translocating MtrA-H complex, which is encoded by an eight-gene cluster40. Although Atabeyarchaeia and Freyarchaeia do not have the genes for the full complex, Atabeyarchaeia-1 has two copies of a subunit similar to the CH3-H4MPT-dependent methyltransferase subunit A (MtrA-like) and both Freyarchaeia and Atabeyarchaeia also encode the CH3-H4MPT-dependent methyltransferase subunit H (MtrH), along with a phylogenetically distinct fused polypeptide of MtrA-like and MtrH (Fig. 5a). Under the conditions prevalent at the time of sampling, the mtr genes were only weakly expressed (< 10 transcripts) (Supplementary Data 6 and Supplementary Data 8). While the biochemical activity of these divergent non-methanogen-associated MtrA-like and MtrH-like enzymes remains unclear, our phylogenetic analyses suggest they are phylogenetically related to methanogenic MtrA, MtrH, and MtrAH sequences. This suggests their potential role in converting CH3-H4MPT to H4MPT, transferring a methyl group to an acceptor –possibly coenzyme-M, which is predicted to be synthesized by Freyarchaeia. As they lack the MCR complex, the subsequent fate of the methyl group remains uncertain.
Although Atabeyarchaeia and Freyarchaeia genomes do not encode MtrE, we identified genes associated with methyltransferase systems encoded in close proximity to the MtrH gene. Specifically, the genomes are predicted to encode trimethylamine methyltransferase (MttB-like, COG5598 superfamily), undefined corrinoid protein (MtbC-like), and putative glycine cleavage system H (gcvH) (Supplementary Data 9). Both Atabeyarchaeia and Freyarchaeia genomes encode trimethylamine methyltransferase MttB (COG5598). Phylogenetic analysis suggests that MttB (Supplementary Fig. 12) and MtbC (Supplementary Fig. 13) belong to a previously uncharacterized group of methyltransferases, similar to those found in Njordarchaeales, Helarchaeales, Odinarchaeia and TACK members, including Brockarchaeia and Thermoproteota. In methanogens that encode mttB, this gene has an amber codon encoding the amino acid pyrrolysine in the active site41,42. The archaea from this study do not encode pyrrolysine, suggesting Freyarchaeia and Atabeyarchaeia encode a non-pyrrolysine MttB homolog, likely a quaternary amine (QA) dependent methyltransferase43. Only a fraction of QA methyltransferase substrates have been identified, and these include glycine betaine, proline betaine, carnitine, and butyrobetaine43,44,45,46. The methyl group from the QA may be transferred to THF or H4MPT branches of the WLP, akin to the mechanisms described in archaea with the capacity for non-methanogenic anaerobic methylotrophy, including Freyarchaeia, Sifarchaeia, Brockarchaeia, and Culexarchaeia11,12,47,48. Consumption of QA compounds may reduce the pool of potential substrates for methanogenic methane production.
We also identified genes in the Freyarchaeia genome encoding a putative molybdenum-containing hydroxylase distantly related to the aerobic carbon monoxide dehydrogenase complex (CoxLMS). Phylogenetic analysis places the large subunit protein (related to CoxL) in a monophyletic group with other archaea including Thermoplasmatales, Marsarchaeota, and Culexarchaeia (Supplementary Fig. 14a). Analysis of the protein sequence reveals that the putative large-subunit aerobic CO dehydrogenases (CoxL) are missing the characteristic VAYRCSFR motif, which is critical for CO binding in the form I Cox proteins49,50. The genetic organization suggests the Freyarchaeia enzyme is structurally similar to carbon monoxide dehydrogenase, but likely oxidizes distinct substrate (Supplementary Fig. 14b). Consistently, other members of the molybdenum-containing hydroxylase superfamily oxidize other substrates, such as aldehydes or purines48,50,51.
Carbon compound metabolic pathways
There are indications that Freyarchaeia and Atabeyarchaeia display distinct metabolic preferences for various soil carbon compounds (Fig. 2 and Supplementary Fig. 15). Freyarchaeia exhibits a genetic repertoire to break down various extracellular lignin-derived compounds, including 5-carboxyvanillate. Other substrates that we predict can be metabolized by Freyarchaeia carbohydrate-active enzymes include hemicellulose (C5), cellobiose, maltose, and cellulose (C12). We predict that cellodextrin (C18) compounds can be converted to glucose via beta-glucosidase (BglBX). The findings implicate Freyarchaeia in the metabolism of plant-derived soil carbon compounds. Glucose, resulting from the degradation of complex carbohydrates, as well as ribulose and other carbon substrates, likely enters the modified Embden-Meyerhof-Parnas (EMP) pathway, yet the genes of this EMP pathway genes are only weakly expressed (Supplementary Fig. 15, Supplementary Data 6 and Supplementary Data 8). In addition, Freyarchaeia encode and express an array of genes for the uptake of carbohydrates, including sugar transporters from both the major facilitator and ABC superfamilies, suggesting an active role in efficiently assimilating diverse carbon substrates from soil environments Atabeyarchaeia also harbor genes of the EMP glycolytic pathway, producing ATP through the conversion of acetyl-CoA to acetate (Supplementary Fig. 15). Unlike Freyarchaeia, which likely feed glucose into the EMP pathway, the entry point for Atabeyarchaeia to the EMP pathway appears to be fructose 6-phosphate (F6P). This is relatively uncommon for Asgard archaea but is reminiscent of the pathway in Helarchaeales7, an order of Lokiarchaeia. We identified Atabeyarchaeia transcripts for all but one of the genes for the steps from G6P to acetate (Supplementary Data 6).
Atabeyarchaeia and Freyarchaeia use different enzymes to produce pyruvate. Atabeyarchaeia encodes the oxygen-sensitive reversible enzyme, pyruvate:phosphate dikinase (PpdK), whereas Freyarchaeia encodes unidirectional pyruvate water dikinase/phosphoenolpyruvate synthase (PpS) and pyruvate kinase (Pk), producing phosphoenolpyruvate and pyruvate52, respectively. Pyruvate generated via the EMP pathway can then be converted to acetyl-CoA by pyruvate:ferredoxin oxidoreductase (PorABCDG) complex using a low-potential electron acceptor such as a ferredoxin. Alternatively, acetyl-CoA can also be generated via pyruvate formate-lyase (pflD) generating formate as a byproduct. The final step involves the conversion of acetyl-CoA to acetate via acetate-CoA ligase (ADP-forming), producing ATP via substrate-level phosphorylation, in a crucial energy-conserving step during the fermentation of carbon compounds in both lineages.
Lacking the ability to phosphorylate C6 carbon sources, Atabeyarchaeia is predicted to convert ribulose-5-phosphate (C5) and fix formaldehyde (C1) into hexulose-6-phosphate (H6P) via the ribulose monophosphate (RuMP) and non-oxidative pentose phosphate (NO-PPP) pathways (Fig. 2 and Supplementary Fig. 16). The Atabeyarchaeia RuMP pathway bifunctional enzymes (HPS-PH and Fae-HPS) are common in archaea and similar to methylotrophic bacterial homologs53. The RuMP pathway in these Asgard archaea can modulate the formaldehyde availability, a byproduct of methanol oxidation, microbial organic matter decomposition, and combustion. High expression of aldehyde-ferredoxin oxidoreductases (AOR) genes suggests another mechanism for the interconversion of organic acids to aldehydes. For example, aldehyde detoxification (e.g., formaldehyde to formate) and source of acetate from acetaldehyde. Atabeyarchaeia-1, Atabeyarchaeia-2, and Freyarchaeia encode multiple AOR gene copies 5, 6, and 8 respectively. Phylogenetic analyses (Supplementary Fig. 17) suggest that both Asgard lineages encode AOR genes related to the FOR family that oxidize C1-C3 aldehydes or aliphatic and aromatic aldehydes (e.g., formaldehyde or glyceraldehyde)54,55,56. Furthermore, Freyarchaeia also encodes a tungsten-based AOR-type enzyme (XOR family) found in cellulolytic anaerobes with undefined substrate specificity57 (Supplementary Fig. 17). Of the classified AORs, only one gene is expressed in Atabeyarchaeia-2 (Fig. 3). Yet, some of the unclassified AOR genes are among the most expressed genes (> 10 transcripts) in the Atabeyarchaeia genomes. Despite the lack of biochemical characterization for most AOR families, these observations suggest a key role of multiple aldehydes in the generation of reducing power in the form of reduced ferredoxin.
Similar to other Asgard archaea7,10,27, Atabeyarchaeia and Freyarchaeia encode genes for the large subunit of type IV and methanogenic type III Ribulose 1,5-bisphosphate carboxylase (RbcL) (Supplementary Fig. 18) a key enzyme in the partial nucleotide salvage pathway. This pathway facilitates the conversion of adenosine monophosphate (AMP) to 3-phosphoglycerate (3-PG), potentially leading to further metabolism into acetyl-CoA58.
Anaerobic glycerol (C3) metabolism by Atabeyarchaeia and Freyarchaeia is predicted based on the presence of glycerol kinase (GlpK), which forms glycerol-3-phosphate (3PG) from glycerol. 3PG (along with F6P) can be broken down via the EMP pathway, or 3PG can be converted to dihydroxyacetone phosphate (DHAP) via GlpABC. DHAP can also serve as a precursor for sn-glycerol-1-phosphate (G1P), the backbone of archaeal phospholipids. Freyarchaeia have an extra GlpABC operon, the GlpA subunit of which clusters phylogenetically with GlpA of Halobacteriales, the only known archaeal group capable of glycerol assimilation (Supplementary Fig. 19).
All three genomes have a partial TCA cycle similar to other anaerobic archaeal groups such as methanogens59. They encode succinate dehydrogenase, succinyl-CoA synthetase, and 2-oxoglutarate ferredoxin reductase, which are important intermediates for amino acid degradation (e.g., glutamate). Only Atabeyarchaeia can convert fumarate to malate via fumarate hydratase. The only portion of the TCA cycle transcribed in any genome is 2-oxoglutarate/2-oxoacid ferredoxin oxidoreductase, which can produce reducing power in the form of NADH.
A clue suggesting that amino acids are an important resource for Atabeyarchaeia and Freyarchaeia is the high expression of genes for protein and peptide breakdown (Fig. 2). All three organisms are predicted to have the capacity to break down fatty acids via beta-oxidation including crotonate (short-chain fatty acid) via the poorly described crotonate pathway. Furthermore, they encode some enzymes involved in fermenting amino acids to H +, ammonium, acetate, and NAD(P)H via the hydroxyglutarate pathway (Supplementary Data 7). The genomes also encode amino acid transporters and these are also transcribed in both archaeal groups. The ability to anaerobically degrade amino acids is consistent with predictions of the metabolism of the last Asgard common ancestor4,9.
In addition, Freyarchaeia and Atabeyarchaeia can reverse the step in the formyl branch of the WLP that transforms glycine into methylenetetrahydrofolate (methylene-THF). Methylene-THF may then be converted to methyl-THF and then to formyl-THF, producing reducing power (Fig. 2). Ultimately, the methyl group may be used to form acetate via the WLP. Interestingly, Atabeyarchaeia-2 and Freyarchaeia expressed methylenetetrahydrofolate reductase (MTHFR), which is homologous to the enzyme used in the bacterial WLP and also plays a role in folate biosynthesis.
Environmental protection and adaptations
We predict that both Atabeyarchaeia and Freyarchaeia are anaerobes that express genes encoding oxygen-sensitive enzymes and proteins that have evolved to protect against oxidative and other environmental stressors. Notably, all three organisms in our study encode an ancestral version of clade I catalases (KatE), (Supplementary Fig. 20). Additionally, their genomes encode Fe-Mn superoxide dismutase (SOD2), which plays a critical role in the dismutation of superoxide radicals into less harmful molecular oxygen and hydrogen peroxide.
Freyarchaeia encodes a catalase-peroxidase (WXG42230.1) that shares 88% protein identity with a catalase from Methanosarcina (WP_048128959.1). This protein clusters within the Methanomicrobia (Supplementary Fig. 21), suggesting acquisition via lateral gene transfer. Previous analyses have described these enzymes in acetogenic and sulfate-reducing bacteria and methanogenic archaea, but to our knowledge, not in Asgard archaea. As this enzyme does not occur in Freyarchaeia from other environments, it may be an adaptation to soil60.
We infer that Atabeyarchaeia and Freyarchaeia use selenocysteine (Sec), the 21st amino acid, due to the presence of the Sec-specific elongation factor and Sec tRNA in their genomes. Additional Sec components, including phosphoseryl-tRNA kinase (Pstk), Sec synthase (SecS), selenophosphate synthetase (SPS) genes, and multiple Eukaryotic-like Sec Insertion Sequences are also present (Supplementary Data 10). Phylogenetic analysis shows that the Sec elongation factor sequences from Atabeyarchaeia and Freyarchaeia are closely related to other Asgard members and Eukaryotes (Supplementary Fig. 22). We identified multiple selenoproteins encoded within each genome, including CoB-CoM heterodisulfide reductase iron-sulfur subunit (HdrA), peroxiredoxin family protein (Prx-like), selenophosphate synthetase (SPS), and the small subunit (~ 50 aa) of NiFeSec (VhuU). In VhuU, Sec plays a crucial role in mitigating oxidative stress61. Sec can also enhance the catalytic efficiency of redox proteins62, and the identified selenoproteins have the characteristic CXXU or UXXC sequence (Supplementary Data 11) observed in redox-active motifs63.
Discussion
Here, we reconstructed and validated three complete Asgard archaeal genomes from wetland soils in which these archaea comprise less than 1% of complex microbial communities. We used these genomes to define their chromosome lengths, structure, and replication modes. It is relatively common for authors to report circularized genomes as complete, but this may be erroneous due to the prominence of local assembly errors, chimeras, scaffolding gaps, and other issues in de novo metagenome assemblies22,64. Our genomes were thoroughly inspected, corrected, and vetted after circularization, steps previously described to complete genomes from metagenomes65. These complete genomes are one of the first manual curations of short-read metagenomic data verified entirely with long-read analysis (Oxford Nanopore and/or PacBio) and the first complete short-read environmental Asgard genomes. Two of these genomes are from Atabeyarchaeia, a previously undescribed Asgard group and the first complete genome for Freyarchaeia. The predicted bidirectional replication in Freyarchaeia and Atabeyarchaeia suggests the likelihood of this DNA replication method in the last Asgard-Eukaryotic common ancestor. Additional high-quality Asgard genomes are needed to determine the precise evolutionary role of replication in the emergence of the complex cellular organization characteristic of eukaryotes.
Overall, prior studies primarily predict that Asgard archaea degrade proteins, carbohydrates, fatty acids, amino acids, and hydrocarbons5,6,10,66. Lokiarchaeales, Thorarchaeia, Odinarchaeia, and Heimdallarchaeia are primarily organoheterotrophs with varying capacities to consume and produce hydrogen27. Helarchaeales are proposed to anaerobically oxidize hydrocarbons7,10,67, whereas Freyarchaeia and Sifarchaeia are predicted to be heterorganotrophic acetogens capable of utilizing methylated amines11,12. Hermodarchaeia is proposed to degrade alkanes and aromatic compounds via the alkyl/benzyl-succinate synthase and benzoyl-CoA pathway10. Gerdarchaeales may be facultative anaerobes and utilize both organic and inorganic carbon8. Wukongarcheaia is predicted to be chemolithotrophic acetogens, encoding the WLP and ADP-dependent acetyl-CoA synthetase in the absence of the TCA cycle. Atabeyarchaeia and Freyarchaeia share several metabolic pathways with new lineages from the Asgard sister-clade TACK (e.g., Brock- and Culexarchaeia) and other deeply branching Asgard lineages. Based on the genomic and metatranscriptomic analyses, we predict that the soil-associated Atabeyarchaeia and Freyarchaeia are chemoheterotrophs that likely degrade amino acids and other carbon compounds. Both encode the EMP Pathway for cellular respiration and the WLP for CO2 fixation.
Although Atabeyarchaeia and Freyarchaeia share key central metabolic pathways, they differ in that Freyarchaeia can metabolize compounds such as formaldehyde (C1), glycerol (C3), ribulose (C5), and glucose (C6), whereas Atabeyarchaeia can only metabolize C1, C3 and C5 compounds (Fig. 2). The ability to metabolize C3 and C5 compounds is rare in Asgard archaea. Both lineages possess the genetic repertoire necessary for converting carbohydrates into acetate and may be capable of growth as anaerobic acetogens via the WLP.
Our results suggest acetogenesis is a common metabolic feature shared by marine sediment and wetland soil-associated Asgardarchaeota. The soil metagenomes described here showed potential for acetogenesis and homo-acetogenesis through WLP. Supporting previous analysis indicates acetogenesis is an ancestral trait within this phylum4. Homoacetogenic pathways have also been described in several marine Asgard lineages, including Lokiarchaeia, Sifarchaeia, Thorarchaeia, and Wukongarchaeia3,5,6,11,27,36. We were able to identify both acetogenesis and homo-acetogenesis genes in the two lake sediment Atabeyarchaeia MAGs and Freyarchaeia MAGs extracted from deep-sea (Guaymas Basin) and hot spring sediments (Tibet Plateau, Yunnan Province, and Radiata Pool)4,12. The ability to fix CO2 and produce acetate via the WLP could provide a competitive advantage for Asgard archaea in marine and terrestrial environments, where competition may limit labile organic carbon sources. Acetate production in today’s Asgard archaea from various environments may support the growth of other microbial community members, including acetoclastic methanogens.
Similar to other Asgard archaea, they have methyltransferase complexes potentially involved in the catabolism of quaternary amines (or yet unknown methylated substrates). These methylotrophic Asgard archaea may compete with methanogens and other anaerobic methylotrophic groups (Brockarchaeia and Culexarchaeia) identified within these California soil samples for methylated compounds (Fig. 6). These results align with recent studies suggesting a broader presence of methylotrophic metabolisms among archaea10,47,48. It also opens up avenues for exploring the environmental impact of these metabolisms, particularly in relation to carbon cycling and greenhouse gas emissions68.
Of particular interest is the predicted metabolic capability of Atabeyarchaeia and Freyarchaeia to degrade aldehydes. Aldehydes in soils come from several sources, including the microbial breakdown of methanol potentially produced from methane oxidation, degradation of plant and animal compounds, and products of industrial combustion and wildfires (e.g., volatile organic compounds). In fact, the California wetland soil that hosts these archaea contains charcoal, likely produced by wildfires. They are also predicted to be capable of growing on glycerol under anaerobic conditions capacity previously undescribed in Asgard archaea.
The growth of soil-associated Atabeyarchaeia and Freyarchaeia could be supported by hydrogen oxidation in tandem with the Wood-Ljungdahl pathway (WLP), as evidenced by the expression of genes for group 3c [NiFe]-hydrogenase and the WLP. Given that both sets of genes were actively expressed in situ, we infer that the growth of these Asgard archaea reduces the concentration of H2 available for methane production by coexisting methanogens. The presence of syntenic blocks encoding heterodisulfide reductase complexes, [NiFe] hydrogenases and ATP synthase suggests a sophisticated apparatus for energy transduction, resembling mechanisms previously characterized in other archaeal groups35. In addition, our results suggest the existence of an electron bifurcation mechanism in both Asgard archaea lineages, where electrons can be transferred from H2 to ferredoxin and an unidentified heterodisulfide intermediate27. Atabeyarchaeia and Freyarchaeia also have membrane-bound group 4 [NiFe]-hydrogenases that likely facilitate the oxidation of reduced ferredoxin generated through fermentative metabolism. However, this complex is novel in that it includes an HL helix on the L-like subunit and two antiporters, neither of which are present in other structurally characterized group 4 respiratory hydrogenases. The functional modeling of these complexes reveals structural congruences with known respiratory enzymes, hinting at a potential for chemiosmotic energy conservation that may be a widespread feature among the Asgard clade. The findings indicate a potential evolutionary connection between hydrogenases and complex I, aligning with the hypothesis that complex I may have evolved from ancestral hydrogenases31,69.
We manually curated three complete genomes for Asgard archaea from wetland soils, uncovering bidirectional replication and an unexpected abundance of introns in tRNA genes. These features suggest another facet of the evolutionary relationship between archaea and eukaryotes. Metabolic reconstruction and metatranscriptomic measurements of in situ activity revealed a non-methanogenic, acetogenic lifestyle and a diverse array of proteins likely involved in energy conservation. The genome analysis uncovered some metabolic similarities between soil and sediment-associated Asgard archaea9,27. In addition, the unusual genomic features, such as high intron prevalence and the presence of ESPs call for deeper investigations into their biological significance and contribution to the adaptability and evolution of Asgard archaea. Overall, the findings point to metabolic flexibility and adaptation to soil conditions of wetlands. Finally, they contribute to the cycling of carbon compounds that are relevant for methane production by coexisting methanogenic archaea. Future studies should aim to explore the functional roles of the identified genes and pathways in situ, which will further elucidate the ecological impacts and evolutionary history of these enigmatic microorganisms.
Methods
Sample acquisition, nucleic acid extraction, and sequencing
We collected soil cores from a seasonal vernal pool in Lake County, California, in October 2018, October 2019, November 2020, and October 2021. Samples were frozen in the field using dry ice and kept at − 80 C until extraction. The Qiagen PowerSoil Max DNA extraction kit was used to extract DNA from 5-10 g of soil, and the Qiagen AllPrep DNA/RNA extraction kit was used to extract RNA from 2 g of soil. Samples were sequenced by the QB3 sequencing facility at the University of California, Berkeley on a NovaSeq 6000. Read lengths for the 2018 DNA samples, and the RNA samples were 2 × 150 bp and 2 × 250 bp for the 2019–2021 DNA samples. A sequencing depth of 10 Gb was targeted for each of the 2018, 2020, and 2021 samples, and 20 Gbp for each of the 2019 samples. A subset of deep soil samples from 2021 were sequenced using PacBio and Oxford Nanopore technologies. For Oxford Nanopore sequencing, the Ligation Sequencing Kit (LSK114) was used to prepare native DNA libraries and the Qiagen Repli-G mini kit and LSK114 were used to prepare amplified DNA libraries. Samples were sequenced on FLO-PRO114M flowcells on the PromethION for 72 hours.
Sample preparation and metatranscriptomic analysis
To address potential technical variations and ensure robustness in our findings, we implemented several strategies during sample preparation. Homogenization of soil samples: Prior to RNA extraction, soil samples were thoroughly homogenized to ensure the representativeness of each environmental sample, which is crucial for minimizing variability due to sample heterogeneity. Pooling of Samples: Given the challenges in obtaining sufficient RNA from deep soil samples, we pooled replicate samples before sequencing. This not only facilitated adequate RNA yields for robust metatranscriptomic analysis but also helped average out individual sample variability, thereby reducing the impact of technical variation on our results. Despite the absence of technical replicates, these preparatory measures should help minimize technical variation.
Illumina metagenomic assembly, binning, and annotation and genome statistics
Metagenomic sequencing reads were trimmed for adapter sequences and quality using sickle70. The filtered metagenomic reads from 2018 were assembled using the IDBA_UD assembler71. The reads from 2019-2020 using metaSPAdes72 Contigs greater than 2.5 Kb were retained and sequencing reads from all samples were cross-mapped against each resulting assembly using Bowtie273. The resulting differential coverage profiles were filtered at a 95% read identity cutoff and then used for genome binning with MetaBAT274, VAMB75, and MaxBin276. The resulting genome bins were assessed for completeness and contamination using CheckM277 and were manually curated using taxonomic profiling with GGKBase (www.ggkbase.berkeley.edu). The first iteration of taxonomy was assigned to genome bins with GTDB-Tk v2.3.078 and further validated with phylogenetic trees of single-copy marker genes. We also downloaded all the publicly available Asgard genomes from BV-BRC and used CheckM2 to estimate genome completeness.
PacBio metagenomic sequencing and assembly
Samples from September 9, 2021, from 140 cm and 75 cm, were sequenced using Sequel II to generate PacBio HiFi reads. Reads were quality trimmed using BBDuk (bbduk.sh minavgquality = 20 qtrim = rl trimq = 20)79 and assembled with hifiasm-meta80.
Oxford Nanopore metagenomic assembly
Reads were base called with Guppy using the dna_r10.4.1_e8.2_sup@v3.5.1 model. Reads were filtered for an average quality > 10 and a minimum length of 1 kb using fastp (v0.23.2). Adapters were trimmed using porechop (v0.2.4). Branching artifacts caused by multiple displacement amplification were removed by aligning amplified reads to themselves with mappy (v2.24). Reads with non-diagonal self-alignment were removed. Both amplified and native reads were jointly assembled with metaFlye (v2.9)81, long-read polished with medaka consensus (v1.7.1), and short-read polished with Hapo-G (v1.3.1).
Manual genome curation of short-reads genomes and validation using long-reads
The manually curated genomes were de novo reconstructed from high-quality Illumina metagenomic data as described previously22. From soil samples taken at various depths, we recovered draft Illumina-based MAGs corresponding to Atabeyarchaeia-1, Atabeyarchaeia-2, and Freyarchaeia. The curation process involved the identification and removal of obvious chimeric regions, which were indicated by abrupt changes in GC content or by insufficient Illumina read mapping support. We also corrected sequences in regions with imperfect read alignment, allowing no single nucleotide polymorphisms (SNPs), by mapping reads at a reduced stringency threshold (allowing for up to 3% SNPs). This was followed by manual curation of the consensus sequence, including insertion, deletion, or substitution of individual base pairs. The extension of contig ends was conducted using unplaced Illumina reads. High read coverage was interpreted as indicative of the genomic termini. A genome was deemed complete when it displayed uninterrupted support from Illumina reads. The final assessment of genome completeness was performed by examining the cumulative GC skew and ensuring alignment with known complete genomes from related taxa. The Average Amino Acids of the new genomes were performed using AAI: Average Amino acid Identity calculator tool (http://enve-omics.ce.gatech.edu/aai/) and using compareM (v.0.0.23) with the ‘aai_wf’ at default settings (https://github.com/dparks1134/CompareM). Replichores of complete genomes were predicted according to the GC skew and cumulative GC skew calculated by the iRep package (gc_skew.py).
Metabolic pathway reconstruction of complete genomes and phylogenetic analysis of key functional genes
We utilized Prodigal (v2.6.3) to predict the genes for Atabeyarchaeia and Freyarchaeia genomes in this study82. A suite of databases, including KofamKOALA (v1.3.0)83 and InterProScan (5.50-84.0)84 were combined to begin annotation. Complete genomes were also annotated with HydDB85, METABOLIC (v4.0)86, PROKKA (v1.14.6), DRAM87, and MEBS (v2.0). Our methods also incorporated a microbial genome annotation workflow as previously described in the Undinarchaeota study88.
For each gene/protein of interest, references were compiled by 1) BLASTing the corresponding gene against the NCBI nr database and their top 50 hits clustered by CD-HIT using a 90% similarity threshold, 2) obtaining sequences from the high-quality manually annotated UniProtKB/Swiss-Prot/InterPro/PFAM (reviewed) (catalases), or 3) using previously published alignment and creating hmm models to extract Atabeyarchaeia and Freyarchaeia sequences (AOR, catalases, and MttB-like homologs). The final set genes/proteins were aligned using MAFFT v.7.407 (AOR and catalases)/v7.490 (ADH and MttB-like homologs), trimmed with trimAl v1.4.rev15 (AOR, ADH, catalases, and MttB-like homologs), and a phylogenetic tree was inferred using IQ-TREE v.1.6.6 (AOR and catalases)/v2.0.7 (ADH and MttB-like homologs) using automatic model selection. More detailed descriptions for each phylogeny, including model selection are discussed in the supplemental figure captions.
Environmental distribution of Atabeyarchaeia and Freyarchaeia
The analysis of metagenomic data to determine the presence and relative abundance of LC30 and Jordarchaeia was performed using the Sandpiper tool, which is available at https://sandpiper.qut.edu.au/. Sandpiper was utilized to search across a wide range of publicly available metagenomes to identify the occurrences of these organisms. The tool provides a streamlined workflow for uploading metagenomic data, selecting specific targets (in this case, Atabeyarchaeia and Freyarchaeia), and analyzing the data based on relative abundance and environmental metadata.
Identification of [NiFe]-hydrogenase and phylogenetic analyses
We extracted [NiFe]-hydrogenase sequences from complete genomes as well as publicly available Asgardarchaeota genomes from NCBI and BCWT using custom Hidden Markov Models (HMMs). To confirm the accuracy of the candidate [NiFe]-hydrogenase sequences, we checked the identified sequences by looking at nearby genes and ensuring the presence of essential hydrogenase accessory genes and confirmed the preliminary classification using HydDB85. We combined our [NiFe]-hydrogenase sequences with those from the HydDB database. Then, we aligned all the sequences using MAFFT (parameters:–localpair –maxiterate 1000). After alignment, we looked for the N-terminal and C-terminal CxxC conserved motifs. We then used TrimAl v (parameters: -gt 0.5) to clean up the alignment, removing parts where more than 50% of the sequences had gaps. We used IQ-TREE version 1.6.12 with the best-fit model according to the Bayesian information criterion (BIC) and ultrafast 1000 bootstrap method to estimate support values for the tree branches. The output tree was visualized using ItoL, and the subgroups were classified based on known HYDB references85.
Identification of Selenocysteine machinery and Selenoproteins, tRNA, and intron predictions
The Sec machinery (pstK, EFsec, SecS, and SPS) was identified using HMMER v3.3.189 via hmmscan against the TIGRFAM v15.0 database90 and also confirmed via Selenoprofiles v4.4.891, using the -p machinery option. Selenoprofiles v4.4.8 and the Seblastian- SECIS3 software92 was accessed to identify potential selenoproteins, which were manually curated via alignment to known selenoproteins as previously reported93. SECIsearch3 was used to identify SECIS for selenoproteins.
For identification of predicted pre-tRNA, we used tRNAscan-SE v.2.0.12 (default settings and -A for archaea model)94. The R2DT software95 was used to predict and visualize the tRNA secondary structure, which also led to the identification of additional introns that were not detected by tRNAscan. We identified Sec tRNA with the Secmarker v0.4 webserver96 with a minimum Infernal score of 35.
Metatranscriptome sequencing and analysis
A subset of the deep SRVP soil samples (60 cm, 70 cm, 80 cm, and 100 cm) were used for metatranscriptomics. Asgard genomes were first dereplicated using deep v 3.497, resulting in a set of representative genomes. A Bowtie2 index was constructed from the dereplicated genomes to expedite the sequence alignment process. Post alignment, a custom-built Python script98 was utilized to scrutinize the paired reads from the alignment (BAM) files and deliver statistical insights on gene expression. This script initiates by creating a Pysam ‘AlignmentFile’ object, enabling a systematic traversal through the ‘.bam’ files by parsing the complex binary alignments. It then proceeds to iterate over each gene listed in a predefined set. For each gene, the script tallies the reads that satisfy user-defined thresholds for Average Nucleotide Identity (ANI) and Mapping Quality (MAPQ). These thresholds, designed to uphold a high level of alignment quality and sequence identity, are critical in enhancing the reliability of the data. Upon the completion of this process, the script outputs the filename, the gene name, and the count of reads that conform to the defined parameters. This pipeline permits an in-depth investigation of RNA hits within the dereplicated genomes, thereby illuminating gene expression profiles across diverse environmental scenarios.
To quantify the number of reads per gene, we further utilized a Python script (filter_counts.py). This script operates based on gene predictions produced by the Prodigal software for the genomes of interest. With the required gene predictions secured through a rerun of Prodigal, the script was set in motion on the BAM files. The command executed was as follows: python filter_counts.py -q 10 -m 0.97 derep_genomes.faa. This command sets the minimum MAPQ score (-q 10) and the minimum ANI (-m 0.97) to ensure high-quality alignments and a high degree of sequence identity. This adherence to stringent criteria bolsters the reliability and robustness of our analytical process.
Mapped mRNA reads were standardized by the calculation of reads per kilobase of transcript per million reads mapped (RPKM), which allows for the direct comparison of the level of transcription of all genes in the de novo assembly (Supplementary Data 6). RPKM is calculated in Rockhopper as RPKM = ((number of mapped reads/length of transcript (gene) in kilobase)/million mapped reads))99.
Abundance and distribution of archaea
The dereplicated Atabeyarchaeia, Freyarchaeia, and other archaeal genomes present in the 36 metagenome samples were mapped to all the metagenome reads independently using BBMap79 (parameters: nodisk = t pigz = t unpigz = t ambiguous = random). We set the minimum identity for mapping to 0.9 and handled ambiguous mappings randomly. The percentage of relative abundance of each genome and the unmapped read percentage values were calculated using coverM (parameters: -m relative_abundance –min-read-percent-identity 95).
Phylogenetic analyses of Asgard genomes
We used three different sets of markers for the phylogenetic analysis. 47 arCOGs (Archaeal cluster of orthologous genes), a subset of previously described marker set100, were extracted with hmmsearch (v3.1b2) from the 9 MAGs in this study, 303 additional Asgard MAGs, and 36 TACK publicly available MAGs. Korarchaeales MAGs were excluded from the analysis due to the hyperthermal-sensitive marker genes4. We chose to exclude arCOG01183 from the previous set as it was found in less than 50% of the MAGs. mafft (v7.310) and BMGE (v1.12) were used to align and trim the concatenated alignments. Maximum likelihood phylogenetic trees were generated using IQ-TREE (v1.6.1) to test various models and topologies to obtain bootstrap values and the LG + F + R10 model was selected (Fig. 1c). In addition, we compared this phylogeny with a set of 37 single-copy marker genes from Phylosift (v1.0.1)101. The sequences were manually accessed, aligned with mafft auto (v7.490), trimmed with BMGE (v2.0), and a maximum likelihood phylogeny generated with IQ-TREE (v2.0.7), using the LG + F + R10 model (Supplementary Fig. 25). All marker sets resulted in the same phylogeny for the 9 added MAGs. 16S ribosomal proteins were extracted from 439 Asgard and 39 TACK MAGs with barrnap 0.9, using options –kingdom arc –lencutoff 0.2 –reject 0.3 –evalue 1e-05. Sequences were manually curated, aligned with mafft auto (v7.490) and masked with Geneious Prime, and ran with IQ-Tree (v2.0.7). 16 S phylogeny separates Atabeyarchaeia MAGs from other Asgard clades (Supplementary Fig. 2). Amino acid identity (AAI) was determined with CompareM (v0.0.23) to distinguish taxonomic level. We assigned numbers from 1-9 as candidate groups within this new Asgardarchaeota class.
Identification and analysis of genomic ESPs
PSI-BLAST was used to query the Asgard proteomes against the AsCOGs (Asgard database) and arCOGs databases3. Hits were only considered if the absolute difference between the subject and query sequence was less than 75% of the length of the query sequence. Hits were then dereplicated by taking the best hit for each query.
Protein structure prediction using AlphaFold-multimer and ColabFold
Structural predictions for group 4 [NiFe]-hydrogenases, CoxLMS, MtrA, and MtrAH fusion proteins were performed using ColabFold, incorporating AlphaFold-multimer capabilities102. These predicted structures were then visualized and aligned using ChimeraX103, with corroborating reference structures obtained from the Protein Data Bank104. This structural modeling was integral in reinforcing our phylogenetic and metabolic inference.
Nomenclature, etymology, and proposal of type material
Description of Atabey- taxonomy
Based on the phylogenetic placement and metabolic traits uncovered in our research, we propose new taxonomic nomenclature for the identified Asgardarchaeota organisms. The chosen nomenclature encapsulates various facets that resonate with the organisms’ nature and their ecological habitat. The term ‘Atabey’ is a homage to the Mother Goddess, often referred to as the Earth Mother in the Taíno mythology, symbolizing fertility, abundance, and the nurturing aspects of nature. We suggest the taxonomic hierarchy of class ‘Ca. Atabeyarchaeia’, order ‘Ca. Atabeyarchaeales’, family ‘Ca. Atabeyarchaeaceae’, genus ‘Ca. Atabeyarchaeum’, and species ‘Ca. Atabeyarchaeum deaterra’, which have been registered via SeqCode (seqco.de/r:igzul8qd).
Description of Atabeyarchaeum deiteterre sp. nov
(de.i.ter’rae L. fem. n. dea, deity; L. fem. n. terra, Earth; N.L. gen. n. deiterrae, of an Earth deity)
A soil-associated uncultured lineage, represented by the complete genome “Atabeyarchaeia-1” (2.81 Mbp), NCBI BioSample SAMN40176692 in BioProject PRJNA1050611, recovered from California soil. According to SeqCode Registry v1.0.3, the submitted complete genome passes data quality standards, including estimated completeness at 97.2%, contamination at 1.87%, and the presence of 16 S rRNA genes (1529 bp) and 40 tRNAs.
Description of Atabeyarchaeum gen. nov
(A.ta.bey.ar.chae’um N.L. fem. n. Atabey, the Mother Goddess in Taíno mythology; N.L. neut. n. archaeum, ancient, an archaeon from Gr. adj. archaios; N.L. neut. n. Atabeyarchaeum, an archaeon of Atabey, the Mother Goddess)
The type species is Atabeyarchaeum deiterrae from the complete genome “Atabeyarchaeia-1” submitted by this study. Seven additional representatives are released by this study, including another complete genome “Atabeyarchaeia-2”.
Description of Atabeyarchaeaceae fam. nov
A.ta.bey.ar.chae.a.ce’ae N.L. neut. n. Atabeyarchaeum, the type genus of the family; -aceae, ending to denote a family; N.L. fem. pl. n. Atabeyarchaeaceae, the Atabeyarchaeum family.
Description of Atabeyarchaeales ord. nov
A.ta.bey.ar.chae’a.les N.L. neut. n. Atabeyarchaeum, the type genus of the order; -ales, ending to denote an order; N.L. fem. pl. n. Atabeyarchaeales, the Atabeyarchaeum order.
Description of Atabeyarchaeia class. nov
A.ta.bey.ar.chae’ia N.L. neut. n. Atabeyarchaeum, the type genus of the class; -ia, ending to denote a class; N.L. neut. pl. n. Atabeyarchaeia, the Atabeyarchaeum class.
Description of Frey- taxonomy
To support the previously published lineage named ‘Ca. Freyarchaeota’, we also submit the complete “Freyarchaeaia” genome as type material for the class ‘Ca. Freyarchaeia’, order ‘Ca. Freyarchaeales’, family ‘Ca. Freyarchaeaceae’, genus ‘Ca. Freyarchaeum’, and species ‘Ca. Freyarchaeum deiteterre, which have been registered via SeqCode (seqco.de/r:igzul8qd).
Description of Freyarchaeum deiteterre sp. nov. (de.i.ter’rae L. fem. n. dea, deity; L. fem. n. terra, Earth; N.L. gen. n. deiterrae, of an Earth deity). A soil-associated uncultured lineage, represented by the complete genome “Freyarchaeia” (3.58 Mbp), NCBI BioSample SAMN40176693 in BioProject PRJNA1050611, recovered from California soil sediments. According to SeqCode Registry v1.0.3, the submitted complete genome passes data quality standards, including estimated completeness at 98.13%, contamination at 4.21%, and the presence of 16 S rRNA genes (1264 bp) and 38 tRNAs.
Description of Freyarchaeum gen. nov
Frey, named after the Norse Goddess most commonly associated with love and fertility; archaeum [from Gr. adj. archaios, -e, -on] ancient, archaeon. Growth condition characteristics. The type species is Freyarchaeum deiteterre from the complete genome “Freyarchaeia” submitted by this study.
Description of Freyarchaeaceae fam. nov
Freyarchaeaceae (Frey.ar.chae.a.ce’ae), named after the type genus Freyarchaeum gen. nov. The description is the same as for Freyarchaeum gen. nov. with the following additions; -aceae ending to denote a family.
Description of Freyarchaeales ord. nov
Freyarchaeales (Frey.ar.chae’a.les), named after the type genus Freyarchaeum gen. nov. The description is the same as for Freyarchaeum gen. nov. with the following additions; -ales ending to denote an order.
Description of Freyarchaeia class. nov
Freyarchaeia (Frey.ar.chae’ia), named after the type genus Freyarchaeum gen. nov. The description is the same as for Freyarchaeum gen. nov. with the following additions; -ia ending to denote a class.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The metagenome-assembled genomes reported in this study can be accessed via the ggKbase database (https://ggkbase.berkeley.edu/SRVP_asgard/organisms). The data used in this study, including all metagenomic and metatranscriptomic reads associated with the reported genomes, have been deposited in the NCBI database under BioProject ID PRJNA1050611, SRA: SRX24933389, SRX24933388, SRX24933387, SRX24933386, SRX23813130, SRX23813131, and SRX23813132. Protein sequences used to generate the alignments and phylogenetic trees, as well as AlphaFold predicted structures, are available via Zenodo (https://doi.org/10.5281/zenodo.11671770). Additional supporting analyses are provided in the Supplementary Information/Source Data file.
Code availability
The software used in this manuscript is publicly available. No new code was generated.
References
Spang, A. et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521, 173–179 (2015).
Zaremba-Niedzwiedzka, K. et al. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541, 353–358 (2017).
Liu, Y. et al. Expanded diversity of Asgard archaea and their relationships with eukaryotes. Nature 593, 553–557 (2021).
Eme, L. et al. Inference and reconstruction of the heimdallarchaeial ancestry of eukaryotes. Nature 618, 992–999 (2023).
Seitz, K. W., Lazar, C. S., Hinrichs, K.-U., Teske, A. P. & Baker, B. J. Genomic reconstruction of a novel, deeply branched sediment archaeal phylum with pathways for acetogenesis and sulfur reduction. ISME J. 10, 1696–1705 (2016).
Liu, Y. et al. Comparative genomic inference suggests mixotrophic lifestyle for Thorarchaeota. ISME J. 12, 1021–1031 (2018).
Seitz, K. W. et al. Asgard archaea capable of anaerobic hydrocarbon cycling. Nat. Commun. 10, 1822 (2019).
Cai, M. et al. Diverse Asgard archaea including the novel phylum Gerdarchaeota participate in organic matter degradation. Sci. China Life Sci. 63, 886–897 (2020).
Imachi, H. et al. Isolation of an archaeon at the prokaryote-eukaryote interface. Nature 577, 519–525 (2020).
Zhang, J.-W. et al. Newly discovered Asgard archaea Hermodarchaeota potentially degrade alkanes and aromatics via alkyl/benzyl-succinate synthase and benzoyl-CoA pathway. ISME J. 15, 1826–1843 (2021).
Farag, I. F., Zhao, R. & Biddle, J. F. ‘Sifarchaeota,’ a novel Asgard Phylum from Costa Rican sediment capable of polysaccharide degradation and anaerobic methylotrophy. Appl. Environ. Microbiol. 87, e02584–20 (2021).
Sun, J. et al. Recoding of stop codons expands the metabolic potential of two novel Asgardarchaeota lineages. ISME Commun. 1, 30 (2021).
Cai, M. et al. Ecological features and global distribution of Asgard archaea. Sci. Total Environ. 758, 143581 (2021).
Xie, R. et al. Expanding Asgard members in the domain of Archaea sheds new light on the origin of eukaryotes. Sci. China Life Sci. 65, 818–829 (2022).
Rodrigues-Oliveira, T. et al. Actin cytoskeleton and complex cell architecture in an Asgard archaeon. Nature 613, 332–339 (2023).
Tamarit, D. et al. A closed Candidatus Odinarchaeum chromosome exposes Asgard archaeal viruses. Nat. Microbiol. 7, 948–952 (2022).
Wu, F. et al. Unique mobile elements and scalable gene flow at the prokaryote-eukaryote boundary revealed by circularized Asgard archaea genomes. Nat. Microbiol. 7, 200–212 (2022).
Sun, J. et al. Correction: Recoding of stop codons expands the metabolic potential of two novel Asgardarchaeota. ISME Commun. 2, 1–1 (2022).
Al-Shayeb, B. et al. Clades of huge phages from across Earth’s ecosystems. Nature 578, 425–431 (2020).
Al-Shayeb, B. et al. Borgs are giant genetic elements with potential to expand metabolic capacity. Nature 610, 731–736 (2022).
Woodcroft, B. J. et al. SingleM and Sandpiper: Robust microbial taxonomic profiles from metagenomic data. Preprint at bioRxiv https://doi.org/10.1101/2024.01.30.578060. (2024).
Chen, L.-X., Anantharaman, K., Shaiber, A., Eren, A. M. & Banfield, J. F. Accurate and complete genomes from metagenomes. Genome Res. 30, 315–333 (2020).
Marck, C. & Grosjean, H. Identification of BHB splicing motifs in intron-containing tRNAs from 18 archaea: evolutionary implications. RNA 9, 1516–1531 (2003).
Sugahara, J. et al. Comprehensive analysis of archaeal tRNA genes reveals rapid increase of tRNA introns in the order thermoproteales. Mol. Biol. Evol. 25, 2709–2716 (2008).
Tocchini-Valentini, G. D., Fruscoloni, P. & Tocchini-Valentini, G. P. Evolution of introns in the archaeal world. Proc. Natl. Acad. Sci. USA 108, 4782–4787 (2011).
Yoshihisa, T. Handling tRNA introns, archaeal way and eukaryotic way. Front. Genet. 5, 213 (2014).
Spang, A. et al. Proposal of the reverse flow model for the origin of the eukaryotic cell based on comparative analyses of Asgard archaeal metabolism. Nat. Microbiol. 4, 1138–1148 (2019).
Yu, H. et al. Structure of an ancient respiratory system. Cell 173, 1636–1649.e16 (2018).
Marreiros, B. C., Batista, A. P., Duarte, A. M. S. & Pereira, M. M. A missing link between complex I and group 4 membrane-bound [NiFe] hydrogenases. Biochim. Biophys. Acta 1827, 198–209 (2013).
Efremov, R. G. & Sazanov, L. A. The coupling mechanism of respiratory complex I - a structural and evolutionary perspective. Biochim. Biophys. Acta 1817, 1785–1795 (2012).
Efremov, R. G. & Sazanov, L. A. Structure of the membrane domain of respiratory complex I. Nature 476, 414–420 (2011).
Baradaran, R., Berrisford, J. M., Minhas, G. S. & Sazanov, L. A. Crystal structure of the entire respiratory complex I. Nature 494, 443–448 (2013).
Nakamaru-Ogiso, E. et al. The membrane subunit NuoL(ND5) is involved in the indirect proton pumping mechanism of Escherichia coli complex I. J. Biol. Chem. 285, 39070–39078 (2010).
Buckel, W. & Thauer, R. K. Energy conservation via electron bifurcating ferredoxin reduction and proton/Na(+) translocating ferredoxin oxidation. Biochim. Biophys. Acta 1827, 94–113 (2013).
Sousa, F. L., Neukirchen, S., Allen, J. F., Lane, N. & Martin, W. F. Lokiarchaeon is hydrogen dependent. Nat. Microbiol. 1, 16034 (2016).
Orsi, W. D. et al. Metabolic activity analyses demonstrate that Lokiarchaeon exhibits homoacetogenesis in sulfidic marine sediments. Nat. Microbiol. 5, 248–255 (2020).
Schoelmerich, M. C. & Müller, V. Energy conservation by a hydrogenase-dependent chemiosmotic mechanism in an ancient metabolic pathway. Proc. Natl. Acad. Sci. USA 116, 6329–6334 (2019).
Graham, D. E. & White, R. H. Elucidation of methanogenic coenzyme biosyntheses: from spectroscopy to genomics. Nat. Prod. Rep. 19, 133–147 (2002).
White, R. H. Biosynthesis of coenzyme M (2-mercaptoethanesulfonic acid). Biochemistry 24, 6487–6493 (1985).
Gottschalk, G. & Thauer, R. K. The Na(+)-translocating methyltransferase complex from methanogenic archaea. Biochim. Biophys. Acta 1505, 28–36 (2001).
Hao, B. et al. A new UAG-encoded residue in the structure of a methanogen methyltransferase. Science 296, 1462–1466 (2002).
Li, J. et al. Insights into pyrrolysine function from structures of a trimethylamine methyltransferase and its corrinoid protein complex. Commun. Biol. 6, 54 (2023).
Ticak, T., Kountz, D. J., Girosky, K. E., Krzycki, J. A. & Ferguson, D. J. Jr. A nonpyrrolysine member of the widely distributed trimethylamine methyltransferase family is a glycine betaine methyltransferase. Proc. Natl. Acad. Sci. USA 111, E4668–E4676 (2014).
Picking, J. W., Behrman, E. J., Zhang, L. & Krzycki, J. A. MtpB, a member of the MttB superfamily from the human intestinal acetogen Eubacterium limosum, catalyzes proline betaine demethylation. J. Biol. Chem. 294, 13697–13707 (2019).
Kountz, D. J., Behrman, E. J., Zhang, L. & Krzycki, J. A. MtcB, a member of the MttB superfamily from the human gut acetogen Eubacterium limosum, is a cobalamin-dependent carnitine demethylase. J. Biol. 295, 11971–11981 (2020).
Ellenbogen, J. B., Jiang, R., Kountz, D. J., Zhang, L. & Krzycki, J. A. The MttB superfamily member MtyB from the human gut symbiont Eubacterium limosum is a cobalamin-dependent γ-butyrobetaine methyltransferase. J. Biol. Chem. 297, 101327 (2021).
De Anda, V. et al. Brockarchaeota, a novel archaeal phylum with unique and versatile carbon cycling pathways. Nat. Commun. 12, 2404 (2021).
Kohtz, A. J., Jay, Z. J., Lynes, M. M., Krukenberg, V. & Hatzenpichler, R. Culexarchaeia, a novel archaeal class of anaerobic generalists inhabiting geothermal environments. ISME Commun. 2, 1–13 (2022).
Dobbek, H., Gremer, L., Kiefersauer, R., Huber, R. & Meyer, O. Catalysis at a dinuclear [CuSMo(O)OH] cluster in a CO dehydrogenase resolved at 1.1-Å resolution. Proc. Natl. Acad. Sci. USA 99, 15971–15976 (2002).
Cordero, P. R. F. et al. Atmospheric carbon monoxide oxidation is a widespread mechanism supporting microbial survival. ISME J. 13, 2868–2881 (2019).
Beam, J. P. et al. Ecophysiology of an uncultivated lineage of Aigarchaeota from an oxic, hot spring filamentous ‘streamer’ community. ISME J. 10, 210–224 (2016).
Bräsen, C., Esser, D., Rauch, B. & Siebers, B. Carbohydrate metabolism in Archaea: current insights into unusual enzymes and pathways and their regulation. Microbiol. Mol. Biol. Rev. 78, 89–175 (2014).
Kato, N., Yurimoto, H. & Thauer, R. K. The physiological role of the ribulose monophosphate pathway in bacteria and archaea. Biosci. Biotechnol. Biochem. 70, 10–21 (2006).
Arndt, F. et al. Characterization of an Aldehyde Oxidoreductase from the Mesophilic Bacterium Aromatoleum aromaticum EbN1, a member of a new subfamily of Tungsten-Containing Enzymes. Front. Microbiol. 10, 71 (2019).
Mukund, S. & Adams, M. W. Glyceraldehyde-3-phosphate ferredoxin oxidoreductase, a novel tungsten-containing enzyme with a potential glycolytic role in the hyperthermophilic archaeon Pyrococcus furiosus. J. Biol. Chem. 270, 8389–8392 (1995).
Bevers, L. E., Bol, E., Hagedoorn, P.-L. & Hagen, W. R. WOR5, a novel tungsten-containing aldehyde oxidoreductase from Pyrococcus furiosus with a broad substrate Specificity. J. Bacteriol. 187, 7056–7061 (2005).
Scott, I. M. et al. A new class of tungsten-containing oxidoreductase in caldicellulosiruptor, a genus of Plant biomass-degrading thermophilic bacteria. Appl. Environ. Microbiol. 81, 7339–7347 (2015).
Tabita, F. R., Satagopan, S., Hanson, T. E., Kreel, N. E. & Scott, S. S. Distinct form I, II, III, and IV Rubisco proteins from the three kingdoms of life provide clues about Rubisco evolution and structure/function relationships. J. Exp. Bot. 59, 1515–1524 (2008).
Lang, K. et al. New mode of energy metabolism in the seventh order of methanogens as revealed by comparative genome analysis of ‘Candidatus methanoplasma termitum’. Appl. Environ. Microbiol. 81, 1338–1352 (2015).
Brioukhanov, A. L. & Netrusov, A. I. Catalase and superoxide dismutase: distribution, properties, and physiological role in cells of strict anaerobes. Biochemistry 69, 949–962 (2004).
Marques, M. C. et al. The direct role of selenocysteine in [NiFeSe] hydrogenase maturation and catalysis. Nat. Chem. Biol. 13, 544–550 (2017).
Stadtman, T. C. Selenocysteine. Annu. Rev. Biochem. 65, 83–100 (1996).
Fomenko, D. E., Xing, W., Adair, B. M., Thomas, D. J. & Gladyshev, V. N. High-throughput identification of catalytic redox-active cysteine residues. Science 315, 387–389 (2007).
Watson, M. & Warr, A. Errors in long-read assemblies can critically affect protein prediction. Nat. Biotechnol. 37, 124–126 (2019).
Tyson, G. W. et al. Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature 428, 37–43 (2004).
MacLeod, F., Kindler, G. S., Wong, H. L., Chen, R. & Burns, B. P. Asgard archaea: Diversity, function, and evolutionary implications in a range of microbiomes. AIMS Microbiol. 5, 48–61 (2019).
Zhao, R. & Biddle, J. F. Helarchaeota and co-occurring sulfate-reducing bacteria in subseafloor sediments from the Costa Rica Margin. ISME Commun. 1, 25 (2021).
Offre, P., Spang, A. & Schleper, C. Archaea in biogeochemical cycles. Annu. Rev. Microbiol. 67, 437–457 (2013).
Friedrich, T. & Scheide, D. The respiratory complex I of bacteria, archaea and eukarya and its module common with membrane-bound multisubunit hydrogenases. FEBS Lett. 479, 1–5 (2000).
Sickle: Windowed Adaptive Trimming for Fastq Files Using Quality. (Github).
Peng, Y., Leung, H. C. M., Yiu, S. M. & Chin, F. Y. L. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28, 1420–1428 (2012).
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. A. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 27, 824–834 (2017).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Kang, D. D. et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7, e7359 (2019).
Nissen, J. N. et al. Improved metagenome binning and assembly using deep variational autoencoders. Nat. Biotechnol. 39, 555–560 (2021).
Wu, Y.-W., Simmons, B. A. & Singer, S. W. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32, 605–607 (2016).
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36, 1925–1927 (2019).
Bushnell, B. BBMap: A Fast, Accurate, Splice-Aware Aligner. https://www.osti.gov/servlets/purl/1241166 (2014).
Feng, X., Cheng, H., Portik, D. & Li, H. Metagenome assembly of high-fidelity long reads with hifiasm-meta. Nat. Methods 19, 671–674 (2022).
Kolmogorov, M. et al. metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat. Methods 17, 1103–1110 (2020).
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinforma. 11, 119 (2010).
Aramaki, T. et al. KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics 36, 2251–2252 (2020).
Paysan-Lafosse, T. et al. InterPro in 2022. Nucleic Acids Res. 51, D418–D427 (2023).
Søndergaard, D., Pedersen, C. N. S. & Greening, C. HydDB: A web tool for hydrogenase classification and analysis. Sci. Rep. 6, 34212 (2016).
Zhou, Z. et al. METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks. Microbiome 10, 33 (2022).
Shaffer, M. et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 48, 8883–8900 (2020).
Dombrowski, N. et al. Undinarchaeota illuminate DPANN phylogeny and the impact of gene transfer on archaeal evolution. Nat. Commun. 11, 3939 (2020).
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Li, W. et al. RefSeq: expanding the prokaryotic genome annotation pipeline reach with protein family model curation. Nucleic Acids Res. 49, D1020–D1028 (2021).
Santesmasses, D., Mariotti, M. & Guigó, R. Selenoprofiles: A computational pipeline for annotation of selenoproteins. Methods Mol. Biol. 1661, 17–28 (2018).
Mariotti, M., Lobanov, A. V., Guigo, R. & Gladyshev, V. N. SECISearch3 and Seblastian: new tools for prediction of SECIS elements and selenoproteins. Nucleic Acids Res. 41, e149 (2013).
Mariotti, M. et al. Lokiarchaeota marks the transition between the Archaeal and eukaryotic selenocysteine encoding systems. Mol. Biol. Evol. 33, 2441–2453 (2016).
Chan, P. P., Lin, B. Y., Mak, A. J. & Lowe, T. M. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res. 49, 9077–9096 (2021).
Sweeney, B. A. et al. R2DT is a framework for predicting and visualising RNA secondary structure using templates. Nat. Commun. 12, 1–12 (2021).
Santesmasses, D., Mariotti, M. & Guigó, R. Computational identification of the selenocysteine tRNA (tRNASec) in genomes. PLoS Comput. Biol. 13, e1005383 (2017).
Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 11, 2864–2868 (2017).
Crits-Christoph, A. Filter_reads.py: Filters BAM Files Created by bowtie2 for Better Read Mapping. Use for Genomes from Metagenomes. (Github).
McClure, R. et al. Computational analysis of bacterial RNA-Seq data. Nucleic Acids Res. 41, e140 (2013).
Baker, B. J. et al. Genomic inference of the metabolism of cosmopolitan subsurface Archaea, Hadesarchaea. Nat. Microbiol. 1, 16002 (2016).
Darling, A. E. et al. PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ 2, e243 (2014).
Mirdita, M. et al. ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682 (2022).
Meng, E. C. et al. UCSF ChimeraX: Tools for structure building and analysis. Protein Sci. 32, e4792 (2023).
Berman, H. M. et al. The protein data bank. Nucleic Acids Res. 28, 235–242 (2000).
Adam, P. S., Kolyfetis, G. E., Bornemann, T. L. V., Vorgias, C. E. & Probst, A. J. Genomic remnants of ancestral methanogenesis and hydrogenotrophy in Archaea drive anaerobic carbon cycling. Sci. Adv. 8, eabm9651 (2022).
Wagner, T., Ermler, U. & Shima, S. MtrA of the sodium ion pumping methyltransferase binds cobalamin in a unique mode. Sci. Rep. 6, 28226 (2016).
Acknowledgements
We thank Basem Al-Shayeb for his contribution to field work and generation of sequence datasets, and Shufei Lei and Jordan Hoff for bioinformatics support. We thank Dr. Pok Leung for the discussions on archaeal hydrogenase classification. We also thank Dr. Adam Panagiotis for the helpful discussion about methyltransferases from methanogens and Tristan Wagner for the helpful discussion. We are grateful to Dr. Luke Oltrogge and Dr. Daniel Gittins for their discussions about the multimeric structures modeled using AlphaFold. We thank M. Palmer for guidance with SeqCode nomenclature registration. This publication is based on research in part funded by the Bill & Melinda Gates Foundation (Grant Number: INV-037174 to J.F.B.). The findings and conclusions contained within are those of the authors and do not necessarily reflect positions or policies of the Bill & Melinda Gates Foundation. University of California Dissertation-Year Fellowship (to L.E.V.A.). Stengl-Wyer Graduate Fellowship and University of Texas at Austin Graduate Continuing Fellowship (to K.E.A.). We thank the Innovative Genomics Institute for financial support. Moore-Simons Project on the Origin of the Eukaryotic Cell, Simons Foundation grant 73592LPI (B.J.B.) and Simons Foundation early career award 687165 (to B.J.B.), and Simon Foundation (Award LI-SIAME-00002001) (to B.J.B.). This work was supported by the U. S. Department of Energy Award Number DE-SC0016240 (to D.F.S).
Author information
Authors and Affiliations
Contributions
The study was designed by L.E.V.A. and J.F.B. Samples collection and nucleic acid extractions were performed by L.E.V.A., M.C.S., J.F.B., A.C.C., and J.W.R. Metagenomic data was generated by L.E.V.A., M.C.S., R.S., A.C.C., J.W.R., and J.F.B. Genome binning was done by J.F.B., L.E.V.A., M.C.S., A.C.C., and J.W.R. Complete Asgard genome curation was conducted by J.F.B. and L.E.V.A. Phylogenetic analyses were conducted by K.E.A. and L.E.V.A. Metabolic annotation and analysis done by K.E.A., L.E.V.A. and V.D.A. Metatranscriptome analyses were performed by L.E.V.A. with input from A.C.C. and M.C.S. provided knowledge on metabolism of archaea. C.G. provided knowledge on hydrogenases and archaeal metabolism. V.K. and L.E.V.A. performed tRNAs and selenoproteins analysis. L.L. generated and analyzed the nanopore datasets. D.F.S. and B.J.B. provided feedback on the study design and methodology. J.F.B., D.F.S., and B.J.B. provided resources and funding. K.E.A. and L.E.V.A. registered the genomes via SeqCode. L.E.V.A. and J.F.B. wrote the manuscript, with significant contributions by K.E.A., V.D.A., and M.C.S., and input from all authors. All authors read and approved the manuscript.
Corresponding authors
Ethics declarations
Competing interests
J.F.B. is a co-founder of Metagenomi. D.F.S. is a co-founder and scientific advisory board member of Scribe Therapeutics. L.L. is an employee of Oxford Nanopore Technologies, Inc., and is a stock or stock option holder of Oxford Nanopore Technologies plc. The other authors declare that they have no competing interests.
Peer review
Peer review information
Nature Communications thanks Meng Li and the other anonymous reviewers for their contributions to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Valentin-Alvarado, L.E., Appler, K.E., De Anda, V. et al. Asgard archaea modulate potential methanogenesis substrates in wetland soil. Nat Commun 15, 6384 (2024). https://doi.org/10.1038/s41467-024-49872-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-49872-z
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.