Potential metabolic and genetic interaction among viruses, methanogen and methanotrophic archaea, and their syntrophic partners

Wang, Long; Wang, Yinzhao; Huang, Xingyu; Ma, Ruijie; Li, Jiangtao; Wang, Fengping; Jiao, Nianzhi; Zhang, Rui

doi:10.1038/s43705-022-00135-2

Download PDF

Article
Open access
Published: 28 June 2022

Potential metabolic and genetic interaction among viruses, methanogen and methanotrophic archaea, and their syntrophic partners

ISME Communications volume 2, Article number: 50 (2022) Cite this article

4011 Accesses
8 Citations
11 Altmetric
Metrics details

Subjects

Microbial ecology

Abstract

The metabolism of methane in anoxic ecosystems is mainly mediated by methanogens and methane-oxidizing archaea (MMA), key players in global carbon cycling. Viruses are vital in regulating their host fate and ecological function. However, our knowledge about the distribution and diversity of MMA viruses and their interactions with hosts is rather limited. Here, by searching metagenomes containing mcrA (the gene coding for the α-subunit of methyl-coenzyme M reductase) from a wide variety of environments, 140 viral operational taxonomic units (vOTUs) that potentially infect methanogens or methane-oxidizing archaea were retrieved. Four MMA vOTUs (three infecting the order Methanobacteriales and one infecting the order Methanococcales) were predicted to cross-domain infect sulfate-reducing bacteria. By facilitating assimilatory sulfur reduction, MMA viruses may increase the fitness of their hosts in sulfate-depleted anoxic ecosystems and benefit from synthesis of the sulfur-containing amino acid cysteine. Moreover, cell-cell aggregation promoted by MMA viruses may be beneficial for both the viruses and their hosts by improving infectivity and environmental stress resistance, respectively. Our results suggest a potential role of viruses in the ecological and environmental adaptation of methanogens and methane-oxidizing archaea.

Viral potential to modulate microbial methane metabolism varies by habitat

Article Open access 29 February 2024

Evolutionary diversification of methanotrophic ANME-1 archaea and their expansive virome

Article Open access 19 January 2023

A compendium of viruses from methanogenic archaea reveals their diversity and adaptations to the gut environment

Article 25 September 2023

Introduction

Methane is a potent greenhouse gas that can significantly influence the Earth’s climate [1], and therefore is a critical component in global carbon cycling. Biogenic methane production is mostly by methanogenic archaea through methanogenesis in anoxic environments [2]. Methane can be oxidized by anaerobic methane-oxidizing archaea (ANME) via a reversed-methanogenesis pathway, usually in anoxic sediments, which significantly reduces methane emission into the atmosphere [3,4,5]. Currently, pure cultured or enriched methanogens and methane-oxidizing archaea (MMA) are only found in eight orders of the Euryarchaeota, although emerging metagenomic evidence indicated that more phyla possibly involved in the anaerobic metabolism of methane, such as Candidatus Verstraetearchaeota, Korarchaeota, Nezhaarchaeota etc [6,7,8]. MMA usually survive in mutualistic ways [1]. Most of ANME rely on syntrophic interactions with sulfate-reducing bacteria (SRB), except for ANME-2d, or occasionally ANME-1 [9, 10]. Many methanogens also benefit, although not obligatory, from syntrophic bacteria, such as SRB from the phylum Firmicutes, carbohydrate-fermenting bacteria from the phylum Chloroflexi, and acetate-oxidizing bacteria from the class Deltaproteobacteria [11,12,13,14].

In most anoxic environments, viruses are the main biological controlling factors of indigenous microbial communities such as MMA [15]. However, our knowledge about viruses that can infect MMA is scarce. Currently, only few viruses that infect methanogens have been isolated, such as Myoviridae ΦFl (infects Methanobacterium sp.) [16], Siphoviridae ψM1 (infects Methanothermobacter marburgensis Marburg) [17], and Tectiviridae MetSV (infects Methanosarcina mazei Gö1) [18]. Generally, most cultured MMA viruses have been isolated from engineered ecosystems (e.g., anaerobic sludge, Supplemental Table 1). Using culture-independent methods, a virus that potentially infects the methanogen Methanosarcina barkeri Fusaro with high abundance was identified from hydrocarbon polluted sediment metagenomes [19]. Paul et al. recovered an ANME virus from deep subsurface virome coding for the diversity-generating retroelements system, which may enhance the genomic diversification of their hosts and confer additional selective advantages in the energy-limited environment [20]. Based on this limited information, we hypothesized that diverse and distinct viral groups are widely distribute in MMA inhabiting environments. Considering the close ecological relationships between MMA and their mutualistic partners, we further hypothesized that interactions exist between them and their viruses. Therefore, to achieve a better understanding of MMA viruses, we investigated metagenomes that may inhabit MMA from a variety of habitats and expanded knowledge of the diversity, distribution, life strategies, and possible ecological roles of MMA viruses.

Results and discussion

To evaluate potential viral attack on MMA, we analyzed all MMA genomes in the NCBI RefSeq database (n = 301; Aug 5, 2020). More than 74% (86 of 116) of the complete MMA genomes contain at least one level 4 CRISPR and Cas cluster (CRISPRCasdb database, version Jan 21, 2021) and the percentage is higher than the average levels of all archaea (70%) and bacteria (36%) [21]. Furthermore, the proportion of MMA genomes with at least one provirus was 41.9% (126 of 301), higher than the average level (~30%) when considering all microbial genomes [22]. These findings indicate that MMAs are under severe threat from viral infection, whereas CRISPR Cas system is a vital method for virus defense.

The gene coding for the α-subunit of methyl-coenzyme M reductase (mcrA), the key enzyme in both the methanogenesis and anaerobic methane-oxidizing pathways, is usually used as a marker gene for the detection and phylogenetic analysis of MMAs [8, 23]. Therefore, we analyzed 74 public mcrA gene-containing metagenomic datasets from diverse natural ecosystems, including marine and lake sediments, hot spring sediments, peatland soil, ground water, and hydrothermal vents (Fig. 1, Supplemental Table 2). The relative abundance of MMA averaged 4.6% across all samples and reached as high as 73.3% in a mud volcano sediment. Methanosarcinales was the dominant order of MMAs, followed by Methanomicrobiales and Methanobacteriales (Supplemental Fig. 1). A total of 2050 high quality metagenomic assembled genomes (MAGs) were recovered from the 74 samples. Deltaproteobacteria (n = 168), Chloroflexi (n = 163), Bacteroides (n = 148), and Gammaproteobacteria (n = 137) were the most widespread bacterial MAGs. Among the 565 archaeal MAGs, 82 affiliated to the eight typical MMA orders. Functional annotation revealed 12 more MAGs belonging to Candidatus Bathyarchaeota, Korarchaeota, and Verstraetearchaeota with methyl-coenzyme M reductase, which were considered as MMA as well (Supplemental Table 3). Among 94 MMA MAGs obtained in this study, 31 MAGs had at least one level 4 CRISPR and Cas cluster. The lower rate of MAGs with CRISPR Cas system compared with reference MMA genomes is probably because the presence of CRISPR spacers can influence the tetranucleotide frequency calculation and interfere binning processes [24].

**Fig. 1: Geographic locations of metagenomic samples analyzed in this study.**

Viruses are widely distributed in MMA inhabiting environments

Using combined searches with the Earth’s virome protocol [25], VirSorter [26], and DeepVirFinder [27], 17,350 viral contigs were retrieved from the 75 datasets. By integrating the ~2.33 million viral sequences from the IMG/VR database (v3.0) [28], all viral contigs were clustered into approximate species level virus operational taxonomic units (vOTUs) at 95% average nucleotide identity (ANI), resulting 988,888 vOTUs in total. After excluding the vOTUs only composed of viral sequences from IMG/VR database, 15,048 vOTUs containing viral contigs identified in the present study were used for further analysis, in which 5714 vOTU representative sequences were >10 kb, with four contigs >200 kb, the latter conforming to the definition of giant viruses (Fig. 2a; Supplemental Table 4). As estimated by CheckV, 577 viruses were complete or with high quality [29]. Compared with the latest IMG/VR database, ~65% (9761) and 9.7% (1463) of the vOTUs identified in this study were novel, or of higher quality, respectively. Moreover, 59.3% of the vOTUs were only found in one of the 74 datasets and >90% of the vOTUs were distributed in <4 samples, suggesting a high diversity and endemic of the viruses within the studied environments (Supplemental Fig. 2a). On the other hand, 78.8% vOTUs could be assigned taxonomy, mostly as tailed viruses (Caudovirales), which was dominated by Myoviridae (31.6%), followed by Siphoviridae (27.7%) and Podoviridae (14.5%) (Fig. 2b). The distribution range of Myoviridae was significantly wider than Siphoviridae and Podoviridae (Mann–Whitney ranked t-test, p < 10⁻⁴; Supplemental Fig. 2b).

**Fig. 2: Quality and taxonomic diversity of viral operational taxonomic units (vOTUs) identified in this study.**

The lifestyle of viruses is a critical factor to evaluate their ecological significance. Lytic infection more likely indicates top-down control of the host community [30], whereas lysogenic infection usually regulates the metabolism of prokaryotic hosts [31]. In this study, both the presence of integrase genes annotated by the Pfam database and the integration of viral regions into host genomes predicted by VirSorter were used as signatures for a lysogenic viral lifestyle [32]. Among the 15,048 vOTUs identified in this study, a total of 978 vOTUs were considered as proviruses (Supplemental Table 4). Among the 577 high quality or complete vOTUs, at least 21.32% (123 of 577) had potential to enter the lysogenic cycle. Recently, a provirus (MFTV1) infecting hyperthermophilic Methanocaldococcus fervens AG 86 ^T, which had been predicted using similar in silico methods with the present study [33], was successfully induced by low temperature stress [34].

The MMA viruses are diverse and unique

Using a modified in silico host prediction pipeline [28], 141 vOTUs were predicted to infect methanogens or methane-oxidizing archaea, expanding by >40% of the current cultured or uncultured MMA viral diversity in public databases. The orders Methanosarcinales (94 vOTUs), Methanobacteriales (18 vOTUs), and Methanomicrobiales (9 vOTUs) were the most prevalent hosts of the predicted MMA viruses (Supplemental Table 5). Among the viruses of Methanosarcinales, 20 vOTUs were predicted to infect ANME-2. For the MMA viruses with taxonomic assignment, more vOTUs were classified as Siphoviridae (39%) than Myoviridae (26%). High relative abundance of MMA viruses can be observed, especially in methane seep and marine sediments (Supplemental Fig. 3). Each MMA virus was generally distributed in same or similar environments, although certain vOTUs were recovered from diverse ecosystems.

To investigate the genomic similarity of the predicted MMA viruses with publicly available sequences, a shared protein content-based network analysis [35] was performed with four datasets to produce genus-level viral clusters (VCs): (1) 3464 prokaryotic viral genomes, including 107 archaeal viruses and 3357 bacterial phages (RefSeq v99); (2) all 15,048 vOTUs mined in this study; (3) 140 provirus regions (Supplemental Table 6) identified from 301 MMA genomes (Supplemental Table 7) from the NCBI RefSeq database; and (4) 349 MMA vOTUs acquired from the IMG/VR database (v3.0). The gene-sharing network revealed that no MMA viruses/proviruses could form VCs with viral genome from RefSeq v99, except for two methanogens viral isolates (Methanothermobacter virus ψM100 and Methanobacterium virus ψM2) (Fig. 3), which reflected the uniqueness of the viruses infecting MMA. By contrast, the MMA viruses derived from the present study and the public databases formed cohesive clusters, suggesting that MMA viruses with various origins share similar core genomic characteristics. However, variation of the pan-genomic traits could be observed between the MMA viruses identified in this study, which are all from natural environments, and those from the IMG/VR database primarily identified from animal-associated (32.3%) or engineered (55.9%) ecosystems [28]. Among the 2640 protein clusters (PCs) encoded by MMA viruses from natural ecosystems (140 and 51 MMA viruses from the present study and the IMG/VR database, respectively), 43.2% (1140) of the PCs are unique (Supplemental Fig. 4). Furthermore, 46.0% of the PCs of MMA viruses from engineered and animal-associated ecosystems are distinct compared with those from the other environments. The lower nutrient concentrations and higher complexity of redox gradients in the investigated natural ecosystems [36, 37] may contribute to the differences in the viral genomic signatures. On the contrary, for the 64 shared PCs, they generally related to some core functions of viruses, such as structure (capsid, portal, and tail), replication (DNA methylase, DNA polymerase, integrase, and transposase), lysis (holin), packaging (terminase), and toxin-antitoxin systems.

**Fig. 3: Relationship between vOTUs identified in this study and reference viral genomes.**

To investigate the phylogenetic variation between MMA viruses and NCBI reference viruses, terminase large subunit were used to construct a maximum likelihood tree (Fig. 4). Clear evolution variations could be observed of MMA viruses and other viruses, as well as the viruses infecting MMA of different orders. Furthermore, terminase large subunit encoded by MMA viruses clustered according to their taxonomy and inhabiting environments. The tetranucleotide frequencies of viruses infecting different orders of MMA were compared (Supplemental Fig. 5). Except for Methanomassiliicoccales and their viruses, Methanomicrobiales, Methanobacteriales, Methanosarcinales, and Methanococcales were illustrated similar tetranucleotide frequencies with their viruses. Moreover, similar tetranucleotide frequencies could be observed for viruses infecting MMA from same order, although their tetranucleotide frequency variations were higher than their hosts.

**Fig. 4: Phylogenetic comparison of MMA viruses.**

MMA viruses may infect mutualistic Deltaproteobacteria

Many proteins encoded by MMA viruses were found having high homology with proteins from Deltaproteobacteria. To uncover the relationships between MMA viruses and Deltaproteobacteria, host prediction was conducted with 386 vOTUs that were identified as potentially infecting Deltaproteobacteria using the same host prediction pipelines (Supplemental Table 8). Interestingly, four vOTUs were predicted to infect both MMA and Deltaproteobacteria (Supplemental Fig. 6). Specifically, 11 and 13 proteins of vOTU SRR3715733_6188278_L48927 were best hits to MMA and Deltaproteobacteria, respectively, in a BLASTP search of the NCBI nr database. Gene cluster I, containing viral structure-related genes showed high similarity with MMA, whereas gene cluster II was homologous to Desulfuromonadales bacterium C00003093 (GCA_001751205.1). The recombination of the viral genome with MMA and Deltaproteobacteria implied a possible cross-domain infection. For vOTU SRR5214151_scaffold00034_L57801, the genomic region encoding tail- and capsid-related genes is homologous to a prophage integrated in a deltaproteobacterial genome (GCA_007280345.1), while the match between the viral protospacer and CRISPR spacer of Methanobrevibacter olleyae (GCF_001563245.1) indicates a previous infection of MMA. Moreover, the anti-recBCD protein 1 (abc1) encoded by this viral contig was proved able to inhibit RecBCD nuclease, a complex with central anti-phage functions in bacteria [38, 39]. Considering the abc1 gene is located adjacent to three type IV CRISPR-Cas protein-encoding genes (csf2, csf3, and csf4), we suspect a possible function of this gene island in viruses in resisting MMA CRISPR-Cas immune systems. A mechanism for resisting host CRISPR-Cas systems may be important for viruses to expand their host-range [40].

Emerging evidence derived from recent ecological and metagenomics studies indicates that viruses with broad host-range are far more prevalent than previously thought [41, 42], yet the underlying mechanism is still poorly understood. From the IMG/VR database v3.0, we found similar evidence of cross-domain infection by MMA viruses, which were also predicted to infect Anaerolineales (Chloroflexi), or Syntrophorhabdus and/or Smithella (Deltaproteobacteria) (Supplemental Fig. 6, Supplemental Table 9). As most MMA benefit from symbiotic heterotrophic bacteria, represented by Deltaproteobacteria and Chloroflexi [1], the close ecological relationships between MMA and their mutualistic partners are probably the reason why they are susceptible to the same viruses, or their viruses are similar in genetic characteristics. Viruses have long been deemed an important medium of horizontal gene transfer [43]. Viruses with broad host-range perhaps significantly contribute to gene transfer from bacteria to archaea, which has been implicated as a primary driver of archaeal metabolic innovation [44, 45], but the actual contribution awaits more experimental evidence.

MMA viruses are associated with organosulfur metabolism

To investigate the mechanisms of MMA viruses interacting with their hosts and affecting biogeochemical cycles, virus-encoded putative auxiliary metabolic genes (AMGs) were predicted using DRAM-v, followed by manual curation, resulting in the identification of 44 putative AMGs (auxiliary score ≤ 3) related to various host metabolic functions, including carbohydrates, nucleotides, cofactors and vitamins, sulfur, and amino acid metabolisms (Supplemental Table 10). One of the most widespread and abundant putative AMGs was cysH, which encodes phosphoadenosine phosphosulfate reductase, a key enzyme of the assimilatory sulfate reduction pathway (Fig. 5a; Supplemental Table 10). cysH has frequently been reported to be carried by viruses from diverse anoxic ecosystems, including rumen [46], sulfidic mine tailings [47], stratified redoxcline [48], and cold seeps [32], but to be absent from oxic ecosystems [48]. As one of the limited electron acceptors in anoxic environments, sulfate plays vital roles in energy metabolism of microorganisms [49]. Meanwhile, sulfur is also an essential constituent of biomass [50], so the enhancement of sulfur uptake through assimilatory sulfate reduction for synthesis of sulfur-containing amino acids (methionine and cysteine) and other organic matters may help MMAs, as well as their viruses, to survive in anoxic ecosystems.

**Fig. 5: The potential influence of viruses on the metabolism of methanogens and methane-oxidizing archaea.**

To explore the potential function of cysH in viruses, the frequency of cysteine and methionine codons was compared between viruses with and without a cysH gene identified in this study. Significant higher cysteine frequency was observed for the viruses containing cysH. However, the methionine frequency was not influenced by the presence of cysH (Supplemental Fig. 7). The widespread presence of DNA (cytosine-5)-methyltransferase (dcm) in viral genomes, which catalyzes the degradation of methionine, may explain this phenomenon. There were 154 vOTUs that contained the dcm gene, seven of which were predicted to infect MMA with one MMA virus carried both cysH and dcm genes. We also found that the viruses with the dcm gene had a higher frequency of codons for cysteine (but not methionine) in their genomes (Supplemental Fig. 7). The dcm gene from viruses may enhance the degradation of methionine to redirect organic sulfur to enter the cysteine biosynthesis pathway, although a potential function of dcm in helping viruses to protect DNA from being cut by native methylation-sensitive restriction enzymes has been reported [51]. Overall, we propose two strategies used by viruses in anoxic ecosystems to synthesize more cysteine, which is the only amino acid that can form disulfide bonds that stabilize viral protein structure [52]. Viruses enhance the assimilatory sulfate reduction pathway, and/or reuse the sulfur in methionine or other organosulfur from the host cell to synthesize cysteine (Fig. 5b).

To help uncover the function of cysteine in viruses, the two MMA viral genomes separately containing cysH and dcm genes were studied in detail (Fig. 5a). In both viruses, the tail spike protein was among the most cysteine-rich proteins. Tail spikes are responsible for attachment to cells, and frequently recognize and hydrolytically cleave bacterial cell surface polysaccharides [53]. The enrichment with cysteine of the tail spike may play a critical role in the disruption of the host membrane via specific and non-specific electrostatic and hydrophobic interactions with cell surface groups [54, 55]. Radical S-adenosyl-L-methionine (SAM) enzyme, which includes a vitamin B₁₂-binding domain and a [4Fe–4 S] cluster, also has 10 cysteine and initiate diverse sets of radical reactions [56]. The three cysteine residues of the CX₃CX₂C motif coordinate three of the four irons of the [4Fe–4 S] cluster at the active site of the enzyme [56]. A huge virus identified in this study encoded 15 radical SAM enzymes (Supplemental Fig. 8), indicating a critical function of this gene for viruses in anoxic ecosystems. Overall, cysteine may be important for both the activity of enzymes and entry into host cells for viruses.

MMA viruses adapt to their hosts and inhabiting ecosystems

The MMA viruses identified in this study encoded genes involved in the biosynthesis of O-antigen (including UDP-glucose 4-epimerase, GDP-mannose 4,6 dehydratase, and dTDP-4-dehydrorhamnose reductase) and other carbohydrate glycosyltransferases (GT2, GT4, and GT9) (Supplemental Fig. 9). The O-antigen is an important component of the outer membrane, but flexible and highly variable even among the closest of relatives [57,58,59]. Temperate phages from the gammaproteobacterial Shigella flexneri code for factors responsible for serotype conversion, which can protect S. flexneri from immune response directed against the O-antigen component of the outer membrane lipopolysaccharide (LPS) [60]. The O-antigen genes encoded by MMA viruses may enrich rhamnose and galactose in the LPS of their hosts by converting GDP-D-mannose to GDP-D-rhamnose, dTDP-4-dehydro-β-L-rhamnose to dTDP-L-rhamnose, and UDP-D-glucose to UDP-D-galactose. Rhamnose-rich O-antigen can contribute to surface adhesion and cell-cell aggregation [61], which is a strategy to protect microorganisms from environmental stresses [62, 63]. On the other hand, as the burst size of MMA viruses is generally small [17, 34], the aggregated cells may make it easier for viruses to encounter potential hosts (Fig. 5c). As well as the synthesis of O-antigen, the bactoprenol glucosyl transferase gene (gtrB) affiliated to GT2 encoded by MMA viruses catalyze the transfer of glucosyl from UDP-glucose to bactoprenol [64]. Moreover, research has revealed that the modification of host O-antigen and LPS by viruses (typically proviruses) probably acts as a protective mechanism to exclude other viruses from infecting by altering viral adsorption sites [65, 66]. Additionally, viruses may help with host defensive systems. A total of 12 MMA viruses identified both in this study and the IMG/VR database containing genes coding for non-toxic nonhemagglutinin type C, which in proviruses of Clostridium botulinum, protects against pH-mediated botulinum neurotoxin type C (BoNT/C) inactivation [67]. MMA viruses also contain diverse genes associated with the purine and pyrimidine metabolism pathways, to shift host metabolism toward nucleotide biosynthesis as an adaptation to viral replication [68]. Previous research found that viruses encoded various carbohydrate-active enzymes affiliated to glycoside hydrolases to augment the breakdown of complex carbohydrates to increase energy production and boost viral replication [32, 69,70,71]. However, the putative AMGs encoded by MMA viruses rarely related to organic matter degradation, consistent with the metabolism of methanogens and ANME, which barely use carbohydrates as carbon and energy sources.

Conclusions

Together, we observed the high diversity and novelty of viruses from various natural environments that potentially inhabit MMA. Distinct genome signatures, wide distribution, and high abundance were observed for the viruses infecting methanogens or methane-oxidizing archaea. Several MMA viruses were predicted to be able to infect mutualistic sulfate-reducing bacteria such as Deltaproteobacteria and Chloroflexi. As adaptations to their inhabiting environment, complex strategies were proposed of MMA viruses to interact with their hosts, such as enhancing assimilatory sulfate reduction to synthesize organosulfur, protecting the defense systems of their hosts, and facilitating cell-cell aggregation to resist environmental stresses. Although intensive laboratorial or in situ experiments are needed to validate these results, the present study may expand our view of MMA viruses and provide clues to the survival strategies of viruses in anoxic environments.

Materials and methods

Data acquirement

Using a previously reported mcrA protein database (n = 153) containing sequences from the known methanogens and ANMEs [8], predicted protein sequences of high-throughput metagenomes were queried to identify metagenomic datasets containing MCR-based alkane metabolism-related genes, using Diamond [72] version 0.8.28.90 (identity ≥ 0.3, coverage ≥ 0.75, e value ≤ 1 × 10^–20). A total of 74 genomic datasets were acquired and download from the NCBI Sequence Read Archive public database (Fig. 1; Supplemental Table 2). Raw reads were trimmed using the Sickle algorithm version 1.33 followed by the assembly conducted using MEGAHIT version 1.0.6-hotfix1 with default parameter. Moreover, to identified high degree of confidence CRISPR spacers and proviruses of MMA, 116 complete and 185 draft genomes of methanogens or ANMEs were download from NCBI reference sequence database (RefSeq). All biological sequence data was downloaded using NCBI datasets command line tools version 10.5.1 (https://www.ncbi.nlm.nih.gov/datasets).

Prokaryotic community, genomic binning, and taxonomic annotation

To explore the prokaryotic composition of each sample, prokaryotic 16S rRNA miTags were extracted from quality-controlled reads using SortMeRNA [73] version 4.2.0 with default parameters. All 16S rRNA miTags were queried to the kraken2 database (pre-built 16S rRNA gene database of SILVA v138) using lowest common ancestor algorithm [74], and then adjusted using Bracken [75]. MetaPhlAn3 was used to evaluate the potential eukaryotic contamination with default parameters [76], which revealed that the eukaryotic reads account for <0.1% of all clean reads for all samples. A hybrid binning approach was employed to cluster metagenomic contigs for each sample. Before the binning, sequencing reads were mapped to a corresponding metagenome by Bowtie2 version 2.3.4 and SAMtools version 1.6 to calculate average sequencing depth for each contig [77, 78]. Then the generated depth profile and metagenomic contigs (≥2 kb) were input into MaxBin2 for a first binning with default parameters [79]. Resultant contig groups were individually imported into MetaBAT2 for a second binning with default parameters [80]. Finally, CheckM was employed to estimate quality of the genome bins, which with contamination ≤10% and completeness ≥50% were retained for subsequent analysis [81]. Since metagenomes were assembled and binned separately, resulting redundant MAGs, which were dereplicated at 99% ANI using dRep v2.6.2 (parameters: -comp 50 -con 10 -sa 0.99) [82]. The phylogenetic affiliations of all non-redundant MAGs were analyzed using the GTDB-Tk genome-based taxonomy (GTDB-Tk version 1.1.1 with GTDB version 89) [83].

Recover and deduplicate viral contigs

Contigs ≥5 kb of all metagenomic samples were pulled for viral contig recovery by stepwise method according to the confidence of prediction strategy. (1) The virus detection pipelines of Earth’s virome [25] resulted in 9892 viral contigs; (2) 6264 more contigs were sorted as virus/provirus category 1 or 2 by VirSorter [26]; (3) Among the category 3 virus/provirus sequences, 1088 were identified as viruses using DeepVirFinder (score ≥ 0.9 and p < 0.05) [27]. For the 301 MMA genomes, proviruses were also predicted using VirSorter (categories 1 and 2) [22]. After merging the viral contigs identified from mcrA-containing metagenomes, provirus regions from 301 MMA genomes, and all viral contigs of IMG/VR database (v3.0) [28], species-rank virus groups “vOTUs” were clustered based on pairwise ANI at the thresholds of 95% identity over 85% alignment fraction (relative to the shorter sequence) using CheckV code [29], and the longest contig of each vOTU were kept as representatives for further analysis. The completeness of viral contigs was then estimated using the CheckV pipeline (end_to_end program) [29]. VirSorter2 version 2.1 was also used to evaluate the prediction of viruses according to their standard operating procedure with manual curation and generate “affi-contigs.tab” files needed by DRAMv to identify AMGs with parameters --prep-for-dramv and --provirus-off [84].

Viral taxonomic assignment and distribution profiles

Two complementary approaches were used for taxonomic classification. Firstly, vConTACT2 was conducted using default parameters resulting in only 1.26% vOTUs (n = 192) taxonomically annotated. Secondly, all vOTU representatives were sorted using CAT version 5.0.4 against the NCBI Viral RefSeq proteins v207 setting default options except “--evalue 1e-5” with 78.8% vOTUs acquiring taxonomic assignment. By comparing the two methods, high consistent rate (97.8%) of the vOTUs classified by both methods at family level can be observed. The relative abundances of vOTUs were quantified using reads per kilobase per million mapped reads, which calculated by mapping the original trimmed reads to viral contigs using BBMap version 38.87 with default parameters (https://github.com/BioInfoTools/BBMap). If the percentage of vOTU contig covered by reads was ≥85%, this vOTU was considered being present in this sample.

Methanogens and methane-oxidizing archaea viral prediction

From the 15,048 vOTUs identified in this study, the viruses possibly infecting methanogens or methanotrophic archaea were predicted using in silico methods based on previous reports [69, 85, 86]. The clustered regularly interspaced short palindromic repeat (CRISPR) spacers and their associated Cas proteins were searched using CRISPRCasFinder from the 301 MMA genomes and 94 MMA MAGs binned in the present study [87]. The searching resulted CRISPR arrays were further sorted into four evidence-levels (level 1–4), which with a higher evidence-level indicating a higher likelihood being a true-positive array [87]. Search_oligodb function of Usearch (v11.0.667) was used to compare all predicted viral sequences to the database of MMA CRISPR spacers (https://drive5.com/usearch). For each pair of viral sequence and putative host genome, a valid matching was confirmed when at least one hit had ≤1 mismatch over the entire spacer length. Similarly, the tRNA genes of vOTUs, MMA genomes and MAGs were identified by tRNAscan-SE (v2.0.3) setting the parameters “-G” [88]. Blastn was used to align viral encoded tRNAs to MMA derived sequences, with complete matches as confidential links between viruses and hosts. A virus meeting the following criterions was linked to MMA: (1) viral contigs matched to evidence level 4 CRISPR spacers (22 virus-host links were identified); (2) identical tRNAs predicted from vOTUs and MMA genomes (one virus-host link was identified); (3) viral contigs sorted as MMA or with at least 5 ORFs sorted as MMA by CAT against the NCBI non-redundant protein database (nr; version 2020.03.04) (80 virus-host links were identified); (4) viral contigs binned into a MMA MAG with at least one ORF sorted as MMA (18 virus-host links were identified). Moreover, same strategy was conducted to predict the phages of Deltaproteobacterial SRB.

Gene-sharing based network analysis

To investigate the relationship between MMA viruses and publicly available viral sequences, vConTACT2 was used to construct a gene-sharing network, including all vOTUs acquired in the present study (n = 15,048), provirus regions identified from 301 MMA genomes from NCBI RefSeq database (n = 140), MMA viruses identified by IMG/VR database v3.0 (n = 349), and prokaryotic viral RefSeq v99 integrated in vConTACT2 (n = 3,464). All above sequences were pooled to call ORFs using Prodigal (parameters: -m, -p meta) [89], and the resulting protein sequences were clustered using vConTACT2 with default parameters [35]. The resulting network was visualized in Cytoscape v3.8.2 using edge-weighted spring-embedded mode [90]. Only the interactions between viruses with a score ≥10 were illustrated in network.

Functional annotations of viral sequences

All ORFs of 15,152 vOTUs called by prodigal were functionally annotated against KEGG database (release 95.0) and Pfam database (release 33.0) using KofamScan version 1.2.0 (E value < 10⁻⁵) [91] and Pfamscan (-as) [92], respectively. For the MMA viruses, the annotation was further conducted using DRAMv [93] version 1.2.0 with AMGs predicted (--max_auxiliary_score 3). All putative AMGs were manual curated by checking the upstream and downstream genes following recent protocol [94]. Genes related to nucleotide metabolism were excluded because of their widespread in viral genomes. The ORFs of viral contigs illustrated in Fig. 5 and Supplemental Fig. 6 were compared with PDB protein data bank using the online service of HHpred [95]. The protein fold recognition of viral encoded O-antigen genes were modelled using PHYRE2 to confirm and further resolve functional predictions [96]. All O-antigen gene structures modelled by PHYRE2 had 100% confidence scores and >80% coverage. The viral genome maps were visualized using Easyfig version 2.2.5 [97].

Phylogenetic and tetranucleotide analyses of MMA viruses

All proteins of MMA viruses/proviruses used in the network analysis with a terminase large subunit N-terminal domain (T4-like virus type) were queried to phylogenetic analysis. Protein sequences were aligned with MAFFT (–localpair – maxiterate 1000) [98] and then adjusted with trimAl (-automated1) [99]. Maximum likelihood tree was built using IQ-Tree v2.0.3 with model auto-detected (LG + G) and an ultrafast bootstrap of maximum iteration of 1000 [100] and visualized using Interactive Tree Of Life (iTOL) with branch length ignored [101]. Tetranucleotide frequencies of MMA and their viruses were calculated, clustered, and visualized using Emergent Self-Organizing Maps [102]. The correlation coefficients of the tetranucleotide frequencies of all MMA viruses were calculated using Python package pyani. The pairwise comparison of viruses infecting the five orders of MMA were conducted using ANOSIM.

Data availability

Metagenomic data are available in the NCBI Sequence Read Archive (https://www.ncbi.nlm.nih.gov/sra) database and detailed in Supplemental Table 2. All other data produced in the present study are all available in Supplemental materials.

References

Evans PN, Boyd JA, Leu AO, Woodcroft BJ, Parks DH, Hugenholtz P, et al. An evolving view of methane metabolism in the Archaea. Nat Rev Microbiol. 2019;17:219–32.
Article CAS PubMed Google Scholar
Reeburgh WS. Oceanic methane biogeochemistry. Chem Rev. 2007;107:486–513.
Article CAS PubMed Google Scholar
Timmers PHA, Welte CU, Koehorst JJ, Plugge CM, Jetten MSM, Stams AJM. Reverse methanogenesis and respiration in methanotrophic Archaea. Archaea. 2017;2017:1–22.
Article Google Scholar
Hallam SJ, Putnam N, Preston CM, Detter JC, Rokhsar D, Richardson PM, et al. Reverse methanogenesis: testing the hypothesis with environmental genomics. Science. 2004;305:1457–62.
Article CAS PubMed Google Scholar
Knittel K, Boetius A. Anaerobic oxidation of methane: progress with an unknown process. Annu Rev Microbiol. 2009;63:311–34.
Article CAS PubMed Google Scholar
Vanwonterghem I, Evans PN, Parks DH, Jensen PD, Woodcroft BJ, Hugenholtz P, et al. Methylotrophic methanogenesis discovered in the archaeal phylum Verstraetearchaeota. Nat Microbiol. 2016;1:16170.
Article CAS PubMed Google Scholar
McKay LJ, Dlakić M, Fields MW, Delmont TO, Eren AM, Jay ZJ, et al. Co-occurring genomic capacity for anaerobic methane and dissimilatory sulfur metabolisms discovered in the Korarchaeota. Nat Microbiol. 2019;4:614–22.
Article CAS PubMed Google Scholar
Wang Y, Wegener G, Hou J, Wang F, Xiao X. Expanding anaerobic alkane metabolism in the domain of Archaea. Nat Microbiol. 2019;4:595–602.
Article CAS PubMed Google Scholar
Wang Y, Wegener G, Ruff SE, Wang F. Methyl/alkyl‐coenzyme M reductase‐based anaerobic alkane oxidation in archaea. Environ Microbiol. 2021;23:530–41.
Article CAS PubMed Google Scholar
Bertram S, Blumenberg M, Michaelis W, Siegert M, Krüger M, Seifert R. Methanogenic capabilities of ANME‐archaea deduced from ¹³C‐labelling approaches. Environ Microbiol. 2013;15:2384–93.
Article CAS PubMed Google Scholar
Sousa DZ, Smidt H, Alves MM, Stams AJM. Syntrophomonas zehnderi sp. nov., an anaerobe that degrades long-chain fatty acids in co-culture with Methanobacterium formicicum. Int J Syst Evol Micr. 2007;57:609–15.
Article CAS Google Scholar
Yamada T, Sekiguchi Y, Hanada S, Imachi H, Ohashi A, Harada H, et al. Anaerolinea thermolimosa sp. nov., Levilinea saccharolytica gen. nov., sp. nov. and Leptolinea tardivitalis gen. nov., sp. nov., novel filamentous anaerobes, and description of the new classes Anaerolineae classis nov. and Caldilineae classis nov. in the bacterial phylum Chloroflexi. Int J Syst Evol Micr. 2006;56:1331–40.
Article CAS Google Scholar
Yamada T, Sekiguchi Y, Imachi H, Kamagata Y, Ohashi A, Harada H. Diversity, localization, and physiological properties of filamentous microbes belonging to Chloroflexi subphylum I in mesophilic and thermophilic methanogenic sludge granules. Appl Environ Microb. 2005;71:7493–503.
Article CAS Google Scholar
Manzoor S, Schnürer A, Bongcam-Rudloff E, Müller B. Complete genome sequence of Methanoculleus bourgensis strain MAB1, the syntrophic partner of mesophilic acetate-oxidising bacteria (SAOB). Stand Genomic Sci. 2016;11:80.
Article PubMed PubMed Central Google Scholar
Engelhardt T, Sahlberg M, Cypionka H, Engelen B. Biogeography of Rhizobium radiobacter and distribution of associated temperate phages in deep subseafloor sediments. ISME J. 2013;7:199–209.
Article CAS PubMed Google Scholar
Nölling J, Groffen A, de Vos WM. φ F1 and φF3, two novel virulent, archaeal phages infecting different thermophilic strains of the genus. Methanobacterium Microbiol. 1993;139:2511–6.
Google Scholar
Meile L, Jenal U, Studer D, Jordan M, Leisinger T. Characterization of ψM1, a virulent phage of Methanobacterium thermoautotrophicum Marburg. Arch Microbiol. 1989;152:105–10.
Article CAS Google Scholar
Weidenbach K, Nickel L, Neve H, Alkhnbashi OS, Künzel S, Kupczok A, et al. Methanosarcina spherical virus, a novel archaeal lytic virus targeting Methanosarcina strains. J Virol. 2017;91:e00955–17.
Article PubMed PubMed Central Google Scholar
Molnár J, Magyar B, Schneider G, Laczi K, Valappil SK, Kovács ÁL, et al. Identification of a novel archaea virus, detected in hydrocarbon polluted Hungarian and Canadian samples. PLOS ONE. 2020;15:e0231864.
Article PubMed PubMed Central Google Scholar
Paul BG, Bagby SC, Czornyj E, Arambula D, Handa S, Sczyrba A, et al. Targeted diversity generation by intraterrestrial archaea and archaeal viruses. Nat Commun. 2015;6:6585.
Article CAS PubMed Google Scholar
Pourcel C, Touchon M, Villeriot N, Vernadet J-P, Couvin D, Toffano-Nioche C, et al. CRISPRCasdb a successor of CRISPRdb containing CRISPR arrays and cas genes from complete genome sequences, and tools to download and query lists of repeats and spacers. Nucleic Acids Res. 2019;48:D535–D544.
PubMed Central Google Scholar
Roux S, Hallam SJ, Woyke T, Sullivan MB. Viral dark matter and virus–host interactions resolved from publicly available microbial genomes. eLife. 2015;4:e08490.
Article PubMed Central Google Scholar
Lever MA, Teske AP. Diversity of methane-cycling Archaea in hydrothermal sediment investigated by general and group-specific PCR primers. Appl Environ Microb. 2015;81:1426–41.
Article Google Scholar
Jian H, Yi Y, Wang J, Hao Y, Zhang M, Wang S, et al. Diversity and distribution of viruses inhabiting the deepest ocean on Earth. ISME J. 2021;15:3094–110.
Paez-Espino D, Pavlopoulos GA, Ivanova NN, Kyrpides NC. Nontargeted virus sequence discovery pipeline and virus clustering for metagenomic data. Nature Protoc. 2017;12:1673–82.
Article CAS Google Scholar
Roux S, Enault F, Hurwitz BL, Sullivan MB. VirSorter: mining viral signal from microbial genomic data. PeerJ. 2015;3:e985.
Article PubMed PubMed Central Google Scholar
Ren J, Song K, Deng C, Ahlgren NA, Fuhrman JA, Li Y, et al. Identifying viruses from metagenomic data using deep learning. Quant Biol. 2020;8:64–77.
Article CAS PubMed PubMed Central Google Scholar
Roux S, Páez-Espino D, Chen I-MA, Palaniappan K, Ratner A, Chu K, et al. IMG/VR v3: an integrated ecological and evolutionary framework for interrogating genomes of uncultivated viruses. Nucleic Acids Res. 2020;49:D764–D775.
Article PubMed Central Google Scholar
Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol. 2021;39:578–85.
Article CAS PubMed Google Scholar
Sandaa R, Gómez‐Consarnau L, Pinhassi J, Riemann L, Malits A, Weinbauer MG, et al. Viral control of bacterial biodiversity – evidence from a nutrient‐enriched marine mesocosm experiment. Environ Microbiol. 2009;11:2585–97.
Article CAS PubMed Google Scholar
Howard-Varona C, Hargreaves KR, Abedon ST, Sullivan MB. Lysogeny in nature: mechanisms, impact and ecology of temperate phages. ISME J. 2017;11:1511–20.
Article PubMed PubMed Central Google Scholar
Li Z, Pan D, Wei G, Pi W, Zhang C, Wang J-H, et al. Deep sea sediments associated with cold seeps are a subsurface reservoir of viral diversity. ISME J. 2021;15:2366–78.
Article CAS PubMed PubMed Central Google Scholar
Krupovič M, Forterre P, Bamford DH. Comparative analysis of the mosaic genomes of tailed archaeal viruses and proviruses suggests common themes for virion architecture and assembly with tailed viruses of bacteria. J Mol Biol. 2010;397:144–60.
Article PubMed Google Scholar
Thiroux S, Dupont S, Nesbø CL, Bienvenu N, Krupovic M, L’Haridon S, et al. The first head‐tailed virus, MFTV1, infecting hyperthermophilic methanogenic deep‐sea archaea. Environ Microbiol. 2021;23:3614–26.
Article CAS PubMed Google Scholar
Jang HB, Bolduc B, Zablocki O, Kuhn JH, Roux S, Adriaenssens EM, et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat Biotechnol. 2019;37:632–9.
Article Google Scholar
Hao L, Bize A, Conteau D, Chapleur O, Courtois S, Kroff P, et al. New insights into the key microbial phylotypes of anaerobic sludge digesters under different operational conditions. Water Res. 2016;102:158–69.
Article CAS PubMed Google Scholar
Bedoya K, Hoyos O, Zurek E, Cabarcas F, Alzate JF. Annual microbial community dynamics in a full-scale anaerobic sludge digester from a wastewater treatment plant in Colombia. Sci Total Environ. 2020;726:138479.
Article CAS PubMed Google Scholar
Murphy KC, Fenton AC, Poteete AR. Sequence of the bacteriophage P22 Anti-RecBCD (abc) genes and properties of P22 abc region deletion mutants. Virology. 1987;160:456–64.
Article CAS PubMed Google Scholar
Millman A, Bernheim A, Stokar-Avihail A, Fedorenko T, Voichek M, Leavitt A, et al. Bacterial retrons function in anti-phage defense. Cell. 2020;183:1551–61.
Article CAS PubMed Google Scholar
Pawluk A, Davidson AR, Maxwell KL. Anti-CRISPR: discovery, mechanism and function. Nat Rev Microbiol. 2018;16:12–7.
Article CAS PubMed Google Scholar
Jonge PA, de, Nobrega FL, Brouns SJJ, Dutilh BE. Molecular and evolutionary determinants of bacteriophage host range. Trends Microbiol. 2018;27:51–63.
Article PubMed Google Scholar
Daly RA, Roux S, Borton MA, Morgan DM, Johnston MD, Booker AE, et al. Viruses control dominant bacteria colonizing the terrestrial deep biosphere after hydraulic fracturing. Nat Microbiol. 2019;4:352–61.
Article CAS PubMed Google Scholar
Salmond GPC, Fineran PC. A century of the phage: past, present and future. Nat Rev Microbiol. 2015;13:777–86.
Article CAS PubMed Google Scholar
Rastogi S, Liberles DA. Subfunctionalization of duplicated genes as a transition state to neofunctionalization. BMC Evol Biol. 2005;5:28.
Article PubMed PubMed Central Google Scholar
Petitjean C, Makarova KS, Wolf YI, Koonin EV. Extreme deviations from expected evolutionary rates in archaeal protein families. Genome Biol Evol. 2017;9:2791–811.
Article CAS PubMed PubMed Central Google Scholar
Anderson CL, Sullivan MB, Fernando SC. Dietary energy drives the dynamic response of bovine rumen viral communities. Microbiome. 2017;5:155.
Article PubMed PubMed Central Google Scholar
Gao S-M, Schippers A, Chen N, Yuan Y, Zhang M-M, Li Q, et al. Depth-related variability in viral communities in highly stratified sulfidic mine tailings. Microbiome. 2020;8:89.
Article PubMed PubMed Central Google Scholar
Mara P, Vik D, Pachiadaki MG, Suter EA, Poulos B, Taylor GT, et al. Viral elements and their potential influence on microbial processes along the permanently stratified Cariaco Basin redoxcline. ISME J. 2020;14:3079–92.
Article CAS PubMed PubMed Central Google Scholar
Pfennig N, Widdel F, Trüper HG. The prokaryotes, A handbook on habitats, isolation, and identification of bacteria. Springer-Verlag, Berlin, Germany. 1981.
Moran MA, Durham BP. Sulfur metabolites in the pelagic ocean. Nat Rev Microbiol. 2019;17:665–78.
Article CAS PubMed Google Scholar
Kumar S, Cheng X, Klimasauskas S, Sha M, Posfai J, Roberts RJ, et al. The DNA (cytosine-5) methyltransferases. Nucleic Acids Res. 1994;22:1–10.
Article CAS PubMed PubMed Central Google Scholar
Ashcroft AE, Lago H, Macedo JMB, Horn WT, Stonehouse NJ, Stockley PG. Engineering thermal stability in RNA phage capsids via disulphide bonds. J Nanosci Nanotechno. 2005;5:2034–41.
Article CAS Google Scholar
Walter M, Fiedler C, Grassl R, Biebl M, Rachel R, Hermo-Parrado XL, et al. Structure of the receptor-binding protein of bacteriophage Det7: a podoviral tail spike in a Myovirus. J Virol. 2008;82:2265–73.
Article CAS PubMed Google Scholar
Shai Y. Mode of action of membrane active antimicrobial peptides. Peptide Sci. 2002;66:236–48.
Article CAS Google Scholar
Thevissen K, Ferket KKA, François IEJA, Cammue BPA. Interactions of antifungal plant defensins with fungal membrane components. Peptides. 2003;24:1705–12.
Article CAS PubMed Google Scholar
Broderick JB, Duffus BR, Duschene KS, Shepard EM. Radical S-adenosylmethionine enzymes. Chem Rev. 2014;114:4229–317.
Article CAS PubMed PubMed Central Google Scholar
Wildschutte H, Preheim SP, Hernandez Y, Polz MF. O‐antigen diversity and lateral transfer of the wbe region among Vibrio splendidus isolates. Environ Microbiol. 2010;12:2977–87.
Article CAS PubMed Google Scholar
Samuel G, Reeves P. Biosynthesis of O-antigens: genes and pathways involved in nucleotide sugar precursor synthesis and O-antigen assembly. Carbohyd Res. 2003;338:2503–19.
Article CAS Google Scholar
Polz MF, Alm EJ, Hanage WP. Horizontal gene transfer and the evolution of bacterial and archaeal population structure. Trends Genet. 2013;29:170–5.
Article CAS PubMed PubMed Central Google Scholar
Markine-Goriaynoff N, Gillet L, Etten JLV, Korres H, Verma N, Vanderplasschen A. Glycosyltransferases encoded by viruses. J Gen Virol. 2004;85:2741–54.
Article CAS PubMed Google Scholar
Clifford JC, Rapicavoli JN, Roper MC. A rhamnose-rich O-antigen mediates adhesion, virulence, and host colonization for the xylem-limited phytopathogen Xylella fastidiosa. Mol Plant-microbe Interac. 2013;26:676–85.
Article CAS Google Scholar
Trueba G, Zapata S, Madrid K, Cullen P, Haake D. Cell aggregation: a mechanism of pathogenic Leptospira to survive in fresh water. Int Microbiol Official J Span Soc Microbiol. 2004;7:35–40.
Google Scholar
Trunk T, Khalil HS, Leo JC. Norway BCSG Section for Genetics and Evolutionary Biology, Department of Biosciences, University of Oslo, Oslb,. Bacterial autoaggregation. Aims Microbiol. 2018;4:140–164.
Article CAS PubMed PubMed Central Google Scholar
Guan S, Bastin DA, Verma NK. Functional analysis of the O antigen glucosylation gene cluster of Shigella flexneri bacteriophage SfX. Microbiology. 1999;145:1263–73.
Article CAS PubMed Google Scholar
Rakhuba DV, Kolomiets EI, Dey ES, Novik GI. Bacteriophage receptors, mechanisms of phage adsorption and penetration into host cell. Pol J Microbiol. 2010;59:145–55.
Article CAS PubMed Google Scholar
Silva JB, Storms Z, Sauvageau D. Host receptors for bacteriophage adsorption. FEMS Microbiol Lett. 2016;363:fnw002.
Article Google Scholar
Tsuzuki K, Kimura K, Fujii N, Yokosawa N, Oguma K. The complete nucleotide sequence of the gene coding for the nontoxic-nonhemagglutinin component of Clostridium botulinum type C progenitor toxin. Biochem Bioph Res Co. 1992;183:1273–9.
Article CAS Google Scholar
Enav H, Mandel-Gutfreund Y, Béjà O. Comparative metagenomic analyses reveal viral-induced shifts of host metabolism towards nucleotide biosynthesis. Microbiome. 2014;2:9.
Article PubMed PubMed Central Google Scholar
Emerson JB, Roux S, Brum JR, Bolduc B, Woodcroft BJ, Jang HB, et al. Host-linked soil viral ecology along a permafrost thaw gradient. Nat Microbiol. 2018;3:870–80.
Article CAS PubMed PubMed Central Google Scholar
Jin M, Guo X, Zhang R, Qu W, Gao B, Zeng R. Diversities and potential biogeochemical impacts of mangrove soil viruses. Microbiome. 2019;7:58.
Article PubMed PubMed Central Google Scholar
Anderson RE, Reveillaud J, Reddington E, Delmont TO, Eren AM, McDermott JM, et al. Genomic variation in microbial populations inhabiting the marine subseafloor at deep-sea hydrothermal vents. Nat Commun. 2017;8:1114.
Article PubMed PubMed Central Google Scholar
Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12:59–60.
Article CAS PubMed Google Scholar
Kopylova E, Noé L, Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28:3211–7.
Article CAS PubMed Google Scholar
Lu J, Salzberg SL. Ultrafast and accurate 16S rRNA microbial community analysis using Kraken 2. Microbiome. 2020;8:124.
Article CAS PubMed PubMed Central Google Scholar
Lu J, Breitwieser FP, Thielen P, Salzberg SL. Bracken: estimating species abundance in metagenomics data. Peerj Comput Sci. 2017;3:e104.
Article Google Scholar
Beghini F, McIver LJ, Blanco-Míguez A, Dubois L, Asnicar F, Maharjan S, et al. Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3. Elife. 2021;10:e65088.
Article CAS PubMed PubMed Central Google Scholar
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat methods. 2012;9:357–9.
Article CAS PubMed PubMed Central Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.
Article PubMed PubMed Central Google Scholar
Wu Y-W, Simmons BA, Singer SW. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics. 2016;32:605–7.
Article CAS PubMed Google Scholar
Kang DD, Li F, Kirton E, Thomas A, Egan R, An H, et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. Peerj. 2019;7:e7359.
Article PubMed PubMed Central Google Scholar
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;3:1043–55.
Article Google Scholar
Olm MR, Brown CT, Brooks B, Banfield JF. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 2017;11:2864–8.
Article CAS PubMed PubMed Central Google Scholar
Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2019;6:1925–7.
Google Scholar
Guo J, Bolduc B, Zayed AA, Varsani A, Dominguez-Huerta G, Delmont TO, et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome. 2021;9:37.
Article PubMed PubMed Central Google Scholar
Roux S, Brum JR, Dutilh BE, Sunagawa S, Duhaime MB, Loy A, et al. Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses. Nature. 2016;537:689–93.
Article CAS PubMed Google Scholar
Paez-Espino D, Eloe-Fadrosh EA, Pavlopoulos GA, Thomas AD, Huntemann M, Mikhailova N, et al. Uncovering Earth’s virome. Nature. 2016;536:425–30.
Article CAS PubMed Google Scholar
Couvin D, Bernheim A, Toffano-Nioche C, Touchon M, Michalik J, Néron B, et al. CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res. 2018;46:W246–W251.
Article CAS PubMed PubMed Central Google Scholar
Lowe TM, Eddy SR. tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic Sequence. Nucleic Acids Res. 1997;25:955–64.
Article CAS PubMed PubMed Central Google Scholar
Hyatt D, Chen G-L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 2010;11:119.
Article Google Scholar
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–504.
Article CAS PubMed PubMed Central Google Scholar
Aramaki T, Blanc-Mathieu R, Endo H, Ohkubo K, Kanehisa M, Goto S, et al. KofamKOALA: KEGG ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics. 2019;36:2251–52.
Article PubMed Central Google Scholar
Mistry J, Bateman A, Finn RD. Predicting active site residue annotations in the Pfam database. BMC Bioinform. 2007;8:298.
Article Google Scholar
Shaffer M, Borton MA, McGivern BB, Zayed AA, La Rosa SL, Solden LM, et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 2020;48:8883–900.
Article CAS PubMed PubMed Central Google Scholar
Pratama AA, Bolduc B, Zayed AA, Zhong Z-P, Guo J, Vik DR, et al. Expanding standards in viromics: in silico evaluation of dsDNA viral genome identification, classification, and auxiliary metabolic gene curation. Peerj. 2021;9:e11447.
Article PubMed PubMed Central Google Scholar
Zimmermann L, Stephens A, Nam S-Z, Rau D, Kübler J, Lozajic M, et al. A completely reimplemented MPI bioinformatics toolkit with a new HHpred server at its core. J Mol Biol. 2018;430:2237–43.
Article CAS PubMed Google Scholar
Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE. The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc. 2015;10:845–58.
Article CAS PubMed PubMed Central Google Scholar
Sullivan MJ, Petty NK, Beatson SA. Easyfig: a genome comparison visualizer. Bioinform Oxf Engl. 2011;27:1009–10.
Article CAS Google Scholar
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.
Article CAS PubMed PubMed Central Google Scholar
Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–3.
Article PubMed PubMed Central Google Scholar
Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, Haeseler Avon, et al. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37:1530–4.
Article CAS PubMed PubMed Central Google Scholar
Letunic I, Bork P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49:gkab301-.
Article Google Scholar
Dick GJ, Andersson AF, Baker BJ, Simmons SL, Thomas BC, Yelton AP, et al. Community-wide analysis of microbial genome sequence signatures. Genome Biol. 2009;10:R85–R85.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The work was supported by the Natural Science Foundation of China (grant numbers 91951209, 42006097, 92051116, and 42188102) and China Postdoctoral Science Foundation Grant (grant number 2020M671942). We thank Drs Xiang Xiao and Kun Zhou for their advice on the analysis and discussion. We also acknowledge all the researchers, organizations, and funding agencies that contributed to the sequencing and offering of metagenomes analyzed in this study.

Author information

Authors and Affiliations

State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, China
Long Wang, Xingyu Huang, Ruijie Ma, Nianzhi Jiao & Rui Zhang
Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
Long Wang & Rui Zhang
State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
Yinzhao Wang & Fengping Wang
State Key Laboratory of Marine Geology, Tongji University, Shanghai, China
Jiangtao Li

Authors

Long Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yinzhao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xingyu Huang
View author publications
You can also search for this author in PubMed Google Scholar
Ruijie Ma
View author publications
You can also search for this author in PubMed Google Scholar
Jiangtao Li
View author publications
You can also search for this author in PubMed Google Scholar
Fengping Wang
View author publications
You can also search for this author in PubMed Google Scholar
Nianzhi Jiao
View author publications
You can also search for this author in PubMed Google Scholar
Rui Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

LW and RZ designed the research, performed the analyses, developed the metabolic models and wrote the paper. YW and FW provided useful advises about the signatures of methanogen and methanotrophic archaea and helped to revise the paper. XH and RM helped in viral prediction and provirus analyses, respectively. JL and NJ helped to revise the paper.

Corresponding author

Correspondence to Rui Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental Tables 1-9

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, L., Wang, Y., Huang, X. et al. Potential metabolic and genetic interaction among viruses, methanogen and methanotrophic archaea, and their syntrophic partners. ISME COMMUN. 2, 50 (2022). https://doi.org/10.1038/s43705-022-00135-2

Download citation

Received: 01 December 2021
Revised: 30 May 2022
Accepted: 15 June 2022
Published: 28 June 2022
DOI: https://doi.org/10.1038/s43705-022-00135-2

This article is cited by

Viral potential to modulate microbial methane metabolism varies by habitat
- Zhi-Ping Zhong
- Jingjie Du
- Matthew B. Sullivan
Nature Communications (2024)

Subjects

Abstract

Similar content being viewed by others

Viral potential to modulate microbial methane metabolism varies by habitat

Evolutionary diversification of methanotrophic ANME-1 archaea and their expansive virome

A compendium of viruses from methanogenic archaea reveals their diversity and adaptations to the gut environment

Introduction

Results and discussion

Viruses are widely distributed in MMA inhabiting environments

The MMA viruses are diverse and unique

MMA viruses may infect mutualistic Deltaproteobacteria

MMA viruses are associated with organosulfur metabolism

MMA viruses adapt to their hosts and inhabiting ecosystems

Conclusions

Materials and methods

Data acquirement

Prokaryotic community, genomic binning, and taxonomic annotation

Recover and deduplicate viral contigs

Viral taxonomic assignment and distribution profiles

Methanogens and methane-oxidizing archaea viral prediction

Gene-sharing based network analysis

Functional annotations of viral sequences

Phylogenetic and tetranucleotide analyses of MMA viruses

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary information

Supplemental Tables 1-9

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Viral potential to modulate microbial methane metabolism varies by habitat

Search

Quick links