Diversity and potential host-interactions of viruses inhabiting deep-sea seamount sediments

Yu, Meishun; Zhang, Menghui; Zeng, Runying; Cheng, Ruolin; Zhang, Rui; Hou, Yanping; Kuang, Fangfang; Feng, Xuejin; Dong, Xiyang; Li, Yinfang; Shao, Zongze; Jin, Min

doi:10.1038/s41467-024-47600-1

Download PDF

Article
Open access
Published: 15 April 2024

Diversity and potential host-interactions of viruses inhabiting deep-sea seamount sediments

Meishun Yu¹^na1,
Menghui Zhang¹^na1,
Runying Zeng¹,
Ruolin Cheng¹,
Rui Zhang²,
Yanping Hou¹,
Fangfang Kuang¹,
Xuejin Feng¹,
Xiyang Dong ORCID: orcid.org/0000-0002-9224-5923¹,
Yinfang Li¹,
Zongze Shao¹ &
…
Min Jin ORCID: orcid.org/0000-0003-0904-0247¹

Nature Communications volume 15, Article number: 3228 (2024) Cite this article

1887 Accesses
6 Altmetric
Metrics details

Subjects

Abstract

Seamounts are globally distributed across the oceans and form one of the major oceanic biomes. Here, we utilized combined analyses of bulk metagenome and virome to study viral communities in seamount sediments in the western Pacific Ocean. Phylogenetic analyses and the protein-sharing network demonstrate extensive diversity and previously unknown viral clades. Inference of virus-host linkages uncovers extensive interactions between viruses and dominant prokaryote lineages, and suggests that viruses play significant roles in carbon, sulfur, and nitrogen cycling by compensating or augmenting host metabolisms. Moreover, temperate viruses are predicted to be prevalent in seamount sediments, which tend to carry auxiliary metabolic genes for host survivability. Intriguingly, the geographical features of seamounts likely compromise the connectivity of viral communities and thus contribute to the high divergence of viral genetic spaces and populations across seamounts. Altogether, these findings provides knowledge essential for understanding the biogeography and ecological roles of viruses in globally widespread seamounts.

Diversity and distribution of viruses inhabiting the deepest ocean on Earth

Article 10 May 2021

Deep sea sediments associated with cold seeps are a subsurface reservoir of viral diversity

Article Open access 01 March 2021

Unexpected myriad of co-occurring viral strains and species in one of the most abundant and microdiverse viruses on Earth

Article 13 November 2021

Introduction

Seamounts can be both isolated and clustered and are ubiquitous and prominent features of the world’s underwater topography, thus forming one of the major biomes of the ocean¹. The geographic features of seamounts exert complex effects on oceanic circulation and mixing at a scale ranging from regional to more local effects². Interactions between seamounts and steady and variable flows have been described, providing a better perspective for understanding the mechanisms underlying processes that influence biology³. As unique ecosystems in the deep ocean, seamounts are generally considered oases of biomass abundance and hotspots of species richness^1,4. Previous studies on seamount fauna showed that seamounts have a diverse trophic architecture and tend to support aggregations of higher consumers, such as fish⁴.

So far, most of our knowledge on seamount biodiversity is derived from studies on seamount fauna, while the diversity and ecology of microbial communities are much less understood in general. Over the past decades, with the application of metagenomics, significant efforts have been made to explore the diversity, function, and ecology of the prokaryotes inhabiting seamount environments^5,6,7,8,9. For example, Jacobson Meyers et al.⁷ explored the extracellular enzyme activity and microbial diversity on seafloor exposed basalts from Lō’ihi Seamount; they suggested that prokaryotes on basaltic rock play a substantial and quantifiable role in benthic biogeochemical processes through transforming organic matter⁶. Huo et al. utilized fosmid sequencing to explore the ecological functions of microbes in a sediment sample collected from the cobalt-rich ferromanganese crust of a seamount region in the central Pacific⁹. They suggested that microbes are involved in the nitrogen cycle, and a high frequency of horizontal gene transfer events, as well as genomic divergence, contributed to the adaption of microbes to their deep-sea environment. Collectively, these studies have suggested that the prokaryotes in seamount ecosystems have extensive diversity and play important roles. However, little is known about viral communities and their roles in seamount ecosystems.

Viruses are the most abundant and ubiquitous biological entities on the planet. The vast majority of environmental viruses are phages that infect bacteria^10,11. Because of their enormous abundance and genetic diversity, viruses are major players in marine ecosystems: (1) Viruses control host abundance and affect the host community structure by killing hosts¹². (2) Viruses influence host diversity, evolution, and environmental adaptation through horizontal gene transfer, resistance selection, and host metabolism programming^13,14,15. (3) Viruses drive biogeochemical cycling by releasing intracellular organic matter from hosts and promoting the transformation of particulate organic matter to dissolved organic matter^16,17. (4) Viruses assist microbial-mediated biogeochemical cycling processes by expressing auxiliary metabolic genes (AMGs)^18,19.

Currently, there are significant gaps in our knowledge regarding the community structure, genetic diversity, and ecological roles of viruses in seamount ecosystems. To date, only one publication used epifluorescence microscopy to count virus-like particles (VLPs) in deep-sea sediments around two seamounts in the Tyrrhenian Sea. The results showed that benthic viral production was much higher in sediments around seamounts than in non-seamount sediments²⁰. Moreover, only one culturable virus has been isolated from seamount environments to date²¹. In view of the central roles viruses play in shaping host communities and mediating biogeochemical cycles, exploring viral communities in seamount ecosystems is essential. In addition, the question of whether seamounts are isolated habitats with highly endemic faunas and related questions of connectivity have been the subject of many studies over the last 30 years^1,3,4. Overall, scientists have concluded that seamounts do not generally support high levels of endemism³. However, this conclusion has been challenged by studies on certain fauna taxa whose life history is characterized by poor dispersal and thus showing low connectivity between seamounts with high endemism at a local level²². In this context, research on the effects of geographic features of seamounts on the resulting viral community may also offer novel insights into this controversial topic as well as deepen our understanding of underlying mechanisms regarding the generation and maintenance of local viral diversity.

Here, we utilized high-depth sequencing and viral-sequence specific bioinformatics tools to explore the diversities, biogeography, and potential ecological roles of viruses in seamount ecosystems. The results show that seamount sediments are reservoirs of extremely diverse and previously unknown viruses. Extensive interactions between viruses and dominant prokaryote lineages, as well as the presence of abundant AMGs in virus genomes, highlight the central roles viruses play in shaping the structure and function of seamount microbiomes and in influencing the biogeochemical processes mediated by seamount microorganisms. Furthermore, the geographical features of seamounts likely compromise the connectivity of viral communities, highlighting the important role of the topography of the deep-sea landscape in shaping local viral communities.

Results and discussion

To explore the diversity, host interaction, and ecological function of viruses inhabiting deep-sea seamount sediments, 16S rRNA genes, metagenomes, and viromes were sequenced from seven sediment samples collected from the deep-sea seamount region in the Northwest Pacific Ocean. To fully consider the effect of the geographic features of seamounts on microbial and viral communities, sediment samples were collected across the seamount region. This region encompasses the C1 basin and three surrounding seamounts (i.e., NA, NLG, and MP4) with varying sampling locations in the bottom, hillside, and summit areas (Fig. 1 and Supplementary Data 1).

**Fig. 1: Geographic distribution of sampling sites.**

Overview of prokaryotic communities

To assess the overall prokaryotic compositions in the sampled seamount sediments, the v3–v4 region of 16S rRNA genes was sequenced and analyzed. As shown in Supplementary Fig. 1, most prokaryotes in these sediments were bacteria, dominated by Chloroflexi (38.0% on average), Proteobacteria (35.7%), Planctomycetota (5.3%), SAR324 clade (Marine group B, 3.6%), Actinobacteriota (2.7%), Acidobacteriota (2.7%), Nitrospinota (1.2%), and Firmicutes (0.8%). Chloroflexi and Proteobacteria are the most abundant in all sediment samples, together accounting for more than 50% of the total relative abundance. Consistently, dominance of Proteobacteria in prokaryotic communities is also observed in a variety of deep-sea sediments, including cold seep, hydrothermal vents, and trenches^23,24,25,26. Mantel’s correlation analysis was performed to explore the effects of environmental physicochemical factors on prokaryotic communities, and the results showed that depth but organic carbon and nitrogen had a significant impact on prokaryotic communities (Supplementary Fig. 2).

De novo assembly and binning of metagenomes resulted in 136 high- or medium-quality microbial metagenome-assembled genomes (MAGs) with completeness ≥50% and contamination ≤10%²⁷. These MAGs were then clustered at 95% average nucleotide identity (ANI) to generate 59 bacterial and three archaeal MAGs, representing species-level groups spanning 18 phyla (Fig. 2 and Supplementary Data 2). Most of the bacterial MAGs belong to dominant lineages, such as Proteobacteria (n = 26), Actinobacteriota (n = 5), Acidobacteriota (n = 3), Planctomycetota (n = 3), Nitrospirota (n = 2), Chloroflexota (n = 1), Nitrospinota (n = 1), and SAR324 (n = 1), while all archaeal MAGs belong to Thermoproteota (n = 3). These results were in line with the results of previous metagenome studies on microbial mat samples collected from Lō’ihi Seamount²⁸, and abyssal crust collected from Takuyo-Daigo Seamount²⁹; in these samples, bacterial MAGs were dominated by Proteobacteria and archaeal MAGs accounted for less than 5% of the total MAGs. Based on the read coverage of MAGs among samples, while most of the MAGs were present in all six sediment samples, some of the MAGs were specific to certain samples; for example, bin.80 (Methylomirabilota), bin.52 (Proteobacteria), and bin.117 (Bacteroidota) were only present at sites NLG_S05, NA_S06, and MP4_S01, respectively (Fig. 2). In addition to 151 medium- and high-quality MAGs assembled from the metagenome of Axial Seamount and Lō’ihi Seamount samples^30,31,32, as well as the publicly available microbial genomes, these MAGs provide a good basis to infer linkages between viruses and prokaryotes.

**Fig. 2: Maximum-likelihood phylogenetic tree of prokaryotic metagenome-assembled genomes (MAGs).**

Viral community of seamount sediments

Currently, two metagenome approaches are available to study environmental viral communities, i.e., viromes of separated ambient viruses and bulk metagenomes containing sequences of diverse origins. Both approaches have powerfully expanded our knowledge of environmental viruses^33,34,35. However, they differ greatly in their recovering efficiency of different viral populations, since viromes are greatly enriched in ambient viruses whereas bulk metagenomes are depleted from certain free viruses and enriched in actively infecting and temperate viruses (cellular fraction)^36,37. Therefore, in this paper, we utilize both bulk metagenome and virome to offer complementary perspectives of viral communities in seamount sediments. Three pipelines were used to identify viral sequences from bulk metagenome and virome datasets (Supplementary Fig. 3a), resulting in 2099 putative viral sequences (Contigs ≥5 kb or ≥2 kb and circular). Small circular Contigs (≥2 kb) were retained because they may represent small circular rep encoding single-stranded virus genomes of 2–25 kb size, such as phages belonging to Microviridae and eukaryotic viruses belonging to Circoviridae and Geminiviridae³⁸. To obtain large viral assemblies, viral sequences were further binned using vRhyme v1.1.0³⁹; the resulting viral Contigs and MAGs were then clustered at 95% identity and 85% coverage to generate 1600 viral operational taxonomic units (vOTUs) that represent approximately species-level taxonomy⁴⁰ (Supplementary Data 3). The GC content and size of these vOTUs ranged from 30.21 to 71.08% and from 2001 to 3,472,510 bp, respectively (Supplementary Fig. 3b). Fifty-nine vOTUs have a length of more than 200 Kb and possibly corresponded to giant viruses. Interestingly, three vOTUs are larger than 2.5 Mb, which is the approximately largest genome size of known isolated Pandoravirus viruses, a group of nucleocytoplasmic large DNA virus (NCLDV) related viruses that infect Ameba^41,42. Further inspection of these three vOTUs showed that they all contain NCLDV marker genes, and a large fraction of their ORFs is homologous to NCLDV genomes (Supplementary Fig. 4). In addition, no known host contamination was found in these vOTUs, suggesting that they are likely bona fide NCLDV genomes. The high occurrence of extremely large viral genomes in our study may be due to the fact that we used Metaviralspades v3.15.5⁴³ for large contig assembly and vRhyme v1.1.0³⁹ for binning. Assessment of the quality of these vOTUs by CheckV v0.9.0⁴⁴ showed that 326 vOTUs (20.3%) were of medium quality and above, including complete (1.3%), high-quality (9.3%), and medium-quality (9.7%) (Supplementary Data 3 and Supplementary Fig. 3c). Mantel’s correlation analysis showed that none of the three measured physicochemical factors (depth, organic carbon, and nitrogen) had a significant impact on viral communities (Supplementary Fig. 2).

Taxonomic affiliations of 1600 vOTUs were determined by comparing predicted ORFs against the NCBI viral_Refseq database (v94) based on the Last Common Ancestor algorithm. As shown in Fig. 3a and Supplementary Data 3, 88% of vOTUs (n = 1413) could be taxonomically affiliated at the phylum level, with primary assignment to Uroviricota (n = 1298, dsDNA tailed prokaryotic virus), Nucleocytoviricota (n = 60, dsDNA NCLDV), and Artverviricota (n = 31, reverse transcriptase-encoding ssRNA or dsDNA virus). Among Uroviricota, a total of 21 families were identified for vOTUs, including Peduoviridae (n = 36), Kyanoviridae (n = 21), Autographiviridae (n = 14), and Mesyanzhinovviridae (n = 12) (Fig. 3b). In all Nucleocytoviricota-affiliated vOTUs, matches associated with the families Phycodnaviridae (n = 26) and Mimiviridae (n = 7) were the most common (Fig. 3d), whereas viruses belonging to Artverviricota are most affiliated with the family Metaviridae (n = 26) (Fig. 3c).

**Fig. 3: Community structure of seamount sediment viruses.**

To examine the viral community structures in seamount sediments, clean reads of virome and bulk metagenome were separately mapped to vOTUs to calculate reads per kilobase per million mapped reads (RPKM) values for each vOTU. As shown in Fig. 3e, although both virome and bulk metagenome identify abundant dsDNA viruses, ssDNA viruses and RNA viruses, they differ in the relative abundance of virus sub-populations. For example, except for Duneviridae and Tectivirida, nearly all detected dsDNA viral families showed higher relative abundance in the metagenome than in the virome. We suggest that this bias is likely caused by the filtration step used to remove hosts in the preparation of the virome, which may also remove certain large dsDNA viruses, in particular giant viruses belonging to NCLDV. In addition, a portion of these dsDNA viruses may be temperate and thus difficult to be detected by viromes. Moreover, virome and metagenome have varying effectiveness levels for detecting ssDNA viruses depending on the target viral groups. For example, Inoviridae and Circoviridae were only detected in the virome and metagenome, respectively. The divergence of virome and bulk metagenome in the detection of different virus sub-populations was also observed previously in a comparative study of human gut virome and bulk metagenome. In that study, Gregory et al. found no significant difference between the bulk metagenome and virome in terms of the number of viral Contigs recovered, but the sub-population of viruses captured by the two approaches clearly differed, and the metagenome outperformed the virome in terms of the viral detection rate³⁶.

Seamount viruses are diverse and novel

In seven seamount sediment samples, a significant portion of vOTUs were classified as members of the Caudoviricetes class, constituting ~81% of the total identified vOTUs. Caudoviricetes form a class of tailed dsDNA phages and are usually the most retrieved dsDNA viruses in the environment^33,34,35. It is worth noting that the viral sequence databases still have very limited diversity represented, and the predicted dominance of Caudoviricetes in viromes is likely due to the high proportion of Caudoviricetes references in current databases. To infer DNA packaging mechanisms of these Caudoviricetes, a phylogenetic tree was generated based on a conserved gene coding for the terminase large subunit (TerL), which is necessary for DNA packaging during the maturation of tailed phages⁴⁵. A total of 414 complete open reading frames (ORFs) encoding TerL were identified from seamount vOTUs and used for phylogenetic analysis along with the reference sequence (Supplementary Data 4). As shown in Supplementary Fig. 5a, seamount Caudoviricetes adopt diverse DNA packaging mechanisms, highlighting a remarkable diversity of Caudoviricetes within seamount sediment ecosystems.

ssDNA viruses are among the smallest and simplest viruses with genome sizes ranging from 2 to 25 kb. With the application of metagenomics and viromics, ssDNA viral sequences have been found to be abundant and diverse across various habitats⁴⁶, including marine sediments^47,48,49. However, in our study, only a small fraction of vOTUs (n = 16) were identified as ssDNA viruses, including Microviridae (n = 11), Circoviridae (n = 2), Inoviridae (n = 1), Spiraviridae (n = 1), and Smacoviridae (n = 1). For Microviridae, eight complete or near-complete genomes (4361–5527 bp) were recovered from the seamount dataset. Blastn results showed that, according to NCBI’s NT database, all of the seamount microviruses shared <93% identities relative to their best matches, highlighting the novelty of this viral family in seamount sediments. To obtain further insights into the evolution of seamount microviruses, their genome structures and gene sequence conservation levels were compared with known Microviridae genomes (Supplementary Fig. 5b). Like other known Microviridae genomes⁵⁰, the characteristic genes encoding well-conserved major capsid protein (VP1), DNA pilot protein (VP2), and replication protein (VP4) were identified in the genomes of all seamount microviruses. Six seamount microviruses that are homologous to Gokushovirinae possess an additional gene coding for internal scaffolding protein (VP3) and, in most cases, also a DNA binding protein (VP5). Nevertheless, certain seamount microviruses are quite different from known Microviridae genomes in terms of their gene organization and the content of unconserved genes. For example, the viral genome homologous to group D viruses (genome V_C1_S01_k141_1198016) displays distinct gene organizations from other microviruses and contains a gene downstream of vp4 that exhibits no similarity with other group D viruses.

To further examine the diversity of viral communities in seamount sediments and their relationships with viral sequences identified from other marine habitats, a gene-sharing network was constructed using vConTACT2⁵¹. Such a weighted network can assign viral sequences into viral clusters (VCs) that correspond to genus-level groups⁵¹. The vOTUs identified from GOV 2.0⁵², cold seep³⁴, trench⁵³, and seamount were clustered into 17,518 VCs (Fig. 4a), while taxonomically known viruses from NCBI RefSeq only formed 345 VCs; this vast difference highlights the enormous and as yet undescribed diversity of marine viruses (Fig. 4a). As the largest marine virus database to date, GOV 2.0 datasets contributed the largest number of VCs (n = 14,159), while trench, cold seep, and seamount databases contributed 2171, 801, and 186 VCs, respectively (Fig. 4a and Supplementary Data 5 and 6). In line with previous research^34,35, only five VCs were shared among all marine ecosystems, reflecting a high degree of variations in viral communities across various marine habitats (Fig. 4b). Among the 1600 vOTUs from seamount sediments, only 353 vOTUs were clustered into 186 VCs, the majority of which (77.94%) have no homologs in other marine databases and NCBI RefSeq databases; this suggests that most seamount sediment viruses are unique to the seamount habitat (Supplementary Data 6). Among the 186 seamount sediment VCs, only 18 VCs were shared with the GOV 2.0 database, 73 VCs were shared with the trench database, 4 VCs were shared with the cold seep database, and 8 VCs were shared with the NCBI RefSeq database. The remaining 79 VCs (~42%) of viruses were exclusive to seamount viruses, which may represent candidate novel genera (Fig. 4b). These seamount-specific VCs contained 193 vOTUs, with the majority (n = 161, ~83%) affiliated as Caudoviricetes, followed by Nucleocytoviricota (n = 16), Artverviricota (n = 5), and ssDNA viruses (n = 2). The remaining nine vOTUs could not be taxonomically assigned at the family or even higher level at this time.

**Fig. 4: Comparative analysis of seamount sediment viruses with RefSeq viruses and other viruses found in marine environments.**

Virus-host linkages and viral lifestyles

Viruses affect various microbe-mediated processes through interactions with their hosts^13,54,55. Considering that a large fraction of microbes is infected by viruses at any given time⁵⁶, the interactions between viruses and their hosts must play important roles in the dynamics, evolution, and ecology of microbial communities. To explore virus-host interactions in seamount sediments, potential hosts were predicted for 1600 vOTUs using a combination of four bioinformatic approaches, including CRISPR-spacers matching, tRNA matching, nucleotide sequence homology, and k-mer frequencies⁵⁷. To predict virus-host connections, we used a combination of host databases to infer virus-host linkages, including the Genome Taxonomy database (GTDB-tk), MAGs binned from seamount samples in this study, and seamount samples from previous studies^30,31,32 (Supplementary Data 7). As a result, 3923 virus-host linkages were predicted, most of which were predicted by tRNA-matches (n = 3316), followed by nucleotide sequence homology (n = 587), CRISPR-spacers matches (n = 54), and k-mer frequencies (n = 53) (Fig. 5a and Supplementary Data 8). Among them, 75 virus-host connections were supported by two or more prediction approaches. Consistent with previous studies^34,35, putative hosts were predicted for only a small fraction (n = 253, ~16%) of the 1600 seamount vOTUs. Most of these vOTUs were predicted to infect specific hosts within the same phyla, and only 51 vOTUs were linked to a broader range of hosts across different phyla. These results agree with previous observations showing that most viruses only infect a narrow range of hosts^11,34,35. A total of 2007 prokaryotes were predicted to be potential hosts for seamount vOTUs, most of which (n = 1953, ~97%) were predicted from the GTDB-tk database. Interestingly, all remaining potential hosts were predicted from the MAGs assembled from our study, and no hosts were predicted from MAGs assembled from other seamount metagenomes. This result implies the potential high divergence of viral communities across different seamount habitats.

**Fig. 5: Predicted host-virus interactions.**

Phylogenetic analysis showed that the predicted prokaryotic hosts of seamount viruses spanned two archaeal and 23 bacterial phyla (Fig. 5b). A total of 251 vOTUs were associated with bacteria. Of these, Proteobacteria was the most frequently predicted host phylum (137 associated vOTUs), followed by Actinobacteriota (43 vOTUs), Bacteroidota (26 vOTUs), Planctomycetota (15 vOTUs), and Gemmatimonadota (12 vOTUs). Most of these predicted hosts were among the most abundant bacterial lineages in the sampled seamount sediments, as indicated by the results of 16S rRNA gene profiling (Supplementary Fig. 1). For example, Proteobacteria—the dominant taxa in seamounts—contributed most (n = 2763, 70.4%) of the virus-host linkages. These results are in line with the widely recognized kill-the-winner hypothesis, which suggests that abundant microbes are more likely to be infected and lysed by viruses because a high population density increases the host-virus encounter rate⁵⁸. Further taxonomic analysis showed that the most frequently predicted hosts within Proteobacteria were the subgroup Gammaproteobacteria (69 associated vOTUs) and Alphaproteobacteria (33 vOTUs); both subgroups are widely distributed across marine ecosystems, typically showing high abundance in seamount sediments and other deep-sea sediments^59,60,61. Twenty-one Proteobacteria MAGs were predicted as hosts for 99 vOTUs (Fig. 2 and Supplementary Data 8). Remarkably, according to the host metabolic capability predicted based on the presence of metabolic genes within vOTU-associated MAGs, putative virus-infecting Alphaproteobacteria and Gammaproteobacteria may play important roles in carbon, nitrogen, and sulfur cycles in seamount sediments (Supplementary Fig. 6 and Supplementary Data 9). For example, hexosaminidase-encoding gene, a gene involved in chitin degradation, is widespread in vOTU-associated Alphaproteobacteria and Gammaproteobacteria MAGs, implying potential roles of these MAGs in complex carbon degradation. In addition, several virus-infecting MAGs contain napA and nirBD genes, which are involved in the reductions of nitrate to nitrite and nitrite to ammonia, respectively. This finding is in accordance with previous studies showing that Proteobacteria play an important role in connecting nitrifying and heterotrophic microorganisms, as they can reduce nitrate to ammonia and thereby provide a nitrogen source for other microorganisms^59,62. Finally, previous studies suggest that Gammaproteobacteria, an important sulfur-oxidizing and sulfuric-acid-reducing taxon, provide an important contribution to sulfur biogeochemical cycling by being involved in and even driving sulfur transformations in sediments^60,61,63. Indeed, abundant dsrAB/sdo genes were identified in vOTU-associated Proteobacteria MAGs; these genes are involved in the oxidation of hydrosulphides and are important for both the detoxification and neutralization of hydrogen sulfide in sediments (Supplementary Fig. 6). Collectively, given that Proteobacteria are abundant, highly active, and frequently associated with viruses, their infections and lyses by viruses likely substantially impact the microbial community and biogeochemical cycling in seamount sediments.

As another abundant and ubiquitous group inhabiting marine sediments⁶⁴, Actinobacteriota is the second most frequently predicted host phylum in seamount sediment samples and formed 946 virus-host linkages with 43 vOTUs. Four Actinobacteriota MAGs were predicted as hosts for 40 vOTUs (Fig. 2). Functional annotation of these four MAGs showed that they contain abundant genes involved in complex carbohydrate degradation, such as genes encoding for hexosaminidase, beta-glucuronidase, and isoamylase, which participate in the degradation of chitin, hemicellulose, and amylum, respectively (Supplementary Fig. 6 and Supplementary Data 9). Such complex carbohydrates are major components of crustacean shells, plant cell walls, and intercellular spaces, and they are also very difficult to degrade⁶⁵; however, their biolysis is essential for biomass recycling in deep-sea sediments and critical in local and global carbon cycles. Therefore, infections and lyses of Actinobacteriota by viruses might play important roles in complex carbohydrate biolysis and carbon cycling.

Only four vOTUs were linked to archaea, including members of Thermoproteota (three vOTUs) and Crenarchaeota (one vOTU). The predicted archaeal hosts of Thermoproteota include three seamount MAGs assembled in this study (Fig. 2), which form three novel virus–host linkages (not found in the IMG/VR V3 database) with two vOTUs, including a vOTU from Caudovirales and another vOTU that could not be taxonomically assigned at this time. Based on their taxonomical annotation, these three MAGs were further affiliated with the genus DRGT01 of the phylum Thermoproteota. In the GTDB-tk database, only seven MAGs are affiliated with archaea of DRGT01, all of which are derived from sediment samples, indicating that this group may be endemic to sediments and its connection with viruses has not been disclosed so far. As a ubiquitous group of archaea inhabiting various sediments, Thermoproteota is thought to be relevant to primary production in sediments through chemoenergetic autotrophic interactions and the ammonia oxidative metabolism^66,67,68. Indeed, all of these three vOTU-linked seamount MAGs encode genes involved in the 3-hydroxypropionic acid/4-hydroxybutyric acid cycle that drive energy-efficient carbon fixation (Supplementary Fig. 6 and Supplementary Data 9)⁶⁹.

The lifestyles for 1600 vOTUs were predicted based on lysogeny-specific features, i.e., the presence of lysogeny-specific genes (e.g., genes encoding for integrase, recombinase, and excisionase) and/or location within their host genomes. As a result, approximately one-third (n = 548) of vOTUs were predicted to be lysogenic, 203 of which were predicted by both features. Based on abundances determined by read mapping, the relative abundance of temperate viruses in each sample was calculated for both virome and bulk metagenome datasets. As shown in Fig. 5c, both virome and bulk metagenome showed high occurrences of temperate viruses, accounting for averages of 34% and 27% of relative abundance in viral communities, respectively. As expected, the bulk metagenome exhibited a higher average ratio of temperate viruses than the virome, as it is conventionally enriched for genomes of temperate viruses that integrate into host genomes^36,37. Given that a large fraction of viral genes is still poorly annotated at this time, certain bona fide lysogeny-specific genes may be neglected. Moreover, the incomplete assembly of viral genomes also makes it difficult to detect temperate viruses by identifying either lysogeny-specific genes or flanking host sequences, which further leads to an underestimation of lysogeny signals. Thus, temperate viruses may be even more prevalent in seamount sediments than identified in this study. This is further supported by the fact that 84% of all MAGs assembled here can be linked to vOTUs by nucleotide sequence homology matches; consequently, these may represent sequences that are acquired by the host through phage genome integrations⁵⁷. The prevalence of temperate viruses has also been observed in a variety of deep-sea environments, such as deep water from the South China Sea and the western Pacific Ocean⁷⁰ as well as deep-sea diffuse-flow hydrothermal vents⁷¹. Previous studies have suggested that the high occurrence of temperate viruses in the deep sea possibly promotes virus-mediated gene transfer and exchange, which may be important for the survival and stability of hosts in challenging environments^70,71,72.

Potential impacts of viral AMGs on host metabolisms and biogeochemical cycles

Viruses can reshape the metabolism of their hosts through the expression of virus-encoded AMGs^73,74. To better understand the impacts of viral AMGs on host metabolisms and relevant biogeochemical cycling, AMGs were identified from vOTUs by VIBRANT and DRAMv pipelines; they were further functionally annotated with Pfam, KEGG, and CAZy databases. As a result, after manual curation, a total of 331 genes were identified as putative AMGs (Supplementary Data 10). Because viruses generally obtain and maintain AMGs from their hosts, we further performed sequence homology searches of AMGs against the NCBI NR database to predict their putative hosts. As shown in Supplementary Fig. 7a, a large proportion (n = 112, 33.8%) of these AMGs were probably derived from Proteobacteria, which is consistent with Proteobacteria being the most frequently predicted hosts for vOTUs in seamount sediments (Fig. 5a).

Based on KEGG annotation, the putative AMGs of seamount viruses were involved in diverse metabolic pathways, with a large portion participating in carbohydrate metabolism, cofactors and vitamins metabolism, as well as amino acid metabolism (Supplementary Fig. 7b). Notably, several AMGs were involved in carbon, nitrogen, and sulfur cycling. For example, six AMGs were affiliated with glycoside hydrolases predicted to catalyze the hydrolysis of complex polysaccharides, including trehalase and amylase. In marine sediments, these genes are essential for the recycling of detrital organic matter supplied from the overlying water column and thus may be critical in local and global carbon cycles⁷⁵. Several AMGs were associated with sulfur cycling, including genes encoding for phosphoadenosine phosphosulphate reductase (CysH), cysteine synthase (CysK), sulfate adenylyltransferase (Sat), and methanethiol oxidase (SELENBP1). CysH and CysK participate in assimilatory sulfate reduction, whereas the Sat is involved in dissimilatory sulfur reduction/oxidation; both reactions are important for sulfur cycling⁷⁶. SELENBP1 catalyzes the oxidation of methanethiol, which is a significant step in the sulfur cycle as methanethiol is an intermediate of the metabolism of globally important organosulfur compounds, including dimethylsulphoniopropionate⁷⁷. The most common AMG related to nitrogen cycling is nitronate monooxygenase (ncd2), which catalyzes the oxidation of nitroalkane to nitrite⁷⁸. Seamount microorganisms are involved in carbon degradation and sulfur, nitrogen, and metal cycling^9,79. Therefore, the presence of these AMGs indicates that seamount viruses may extensively participate in local and global biogeochemical cycles by assisting microbes in driving biogeochemical cycles with AMGs.

To explore the relative abundance of AMGs, clean reads of both the virome and bulk metagenome were separately mapped to AMG-carrying vOTUs. As shown in Fig. 6a, virome and bulk metagenome showed different AMG profiles. In general, these results indicated a significantly elevated relative abundance of AMGs associated with processes such as protein folding, sorting, and degradation glycan biosynthesis and metabolism, xenobiotics biodegradation and metabolism, as well as lipid metabolism within the virome in comparison to bulk metagenome. Conversely, AMGs related to energy metabolism exhibited diminished abundance within the virome when compared with the metagenome. In addition, the bulk metagenome exhibited greater AMG diversity in pathways related to carbohydrate metabolism, amino acid metabolism, and metabolism of cofactors and vitamins. This clear dissimilarity in AMG compositions and the abundance between the virome and the bulk metagenome is likely caused by their bias in enrichment for viruses of different lifestyles (Fig. 5c). Luo et al. found that the viral lifestyle was more important than habitat and prokaryotic host in driving viral AMG profiles⁷⁴. They suggested that lytic viruses tended to encode AMGs that could boost progeny reproduction, whereas temperate viruses tended to encode AMGs for host survivability. Consistently, similar trends were also observed in our study. For example, the virome that was more enriched for lytic viruses exhibited higher relative abundances of fabG and queC-E; fabG is involved in fatty acid synthesis, while queC-E plays a role in redirecting host protein synthesis, thus improving host translation efficiency and viral progeny reproduction⁸⁰. In contrast, glycoside hydrolases were more diverse and frequently encoded in the bulk metagenome, which was more enriched for temperate viruses. Such AMGs potentially facilitate the decomposition and utilization of complex carbohydrates in hosts in deep-sea sediments, thereby enhancing host adaptation to their environments.

**Fig. 6: Effects of viral auxiliary metabolic genes (AMGs) on host metabolism.**

To gain deeper insights into the functions of seamount AMGs, the specific impacts of AMGs on the host metabolism were further examined based on predicted host-virus linkages. As shown in Fig. 6b, 16 AMG-carrying vOTUs were predicted to infect 13 seamount MAGs (Supplementary Data 11 and 12), forming 16 host-virus linkages. All of these vOTUs are tailed phages belonging to Caudoviricetes, while the hosts spanned seven phyla, including Bacteroidota, Proteobacteria, and Nitrospirota. Interestingly, all these vOTUs are predicted to be temperate viruses, which tend to encode AMGs that benefit both hosts and viruses⁷⁴. Indeed, among 10 host-virus linkages, viral AMGs exert potential compensatory effects on the host metabolism because their homologs were not found in host MAGs. These AMGs were involved in a variety of host metabolisms, including energy metabolism (cysH and SELENBP1), metabolism of cofactors and vitamins (ubiG, nadM, and hspA), amino acid metabolism (tyr, dnmt1, and phnZ), glycan biosynthesis and metabolism (kdsA, nadM, and hspA), and carbohydrate metabolism (aceB). For instance, in the sulfate reduction step of the sulfur cycle pathway, the MAG of bin.117 encodes two genes (cysC and cysN) that catalyze the conversion of sulfate to phosphoadenylyl sulfate (PAPS) but lack the cysH gene that catalyzes the reduction of PAPS to sulfite (Fig. 6c). However, its associated vOTU (V_MP4_S05_vRhyme_bin_185) contains AMGs encoding for CysH. Therefore, lysogenic viral infection likely compensates for host metabolic capabilities in sulfate reduction, further highlighting the potentially important roles of viral AMGs in sulfur cycling. In the other six virus-host linkages, host MAGs contain the homologs of AMGs that are carried in associated vOTUs. The impacts of these viral AMGs on host metabolisms are unknown to date, but we suspect that they may augment host metabolic flux by overexpressing host metabolic genes⁸¹.

The impacts of seamount geographical features on local viral communities

Several studies on fauna have suggested that isolation of seamount habitats would promote localized speciation, giving rise to high levels of endemism on seamounts^1,82. This so-called “seamount endemism hypothesis” was challenged by accumulating morphological and genetic evidence on the fauna, suggesting that seamounts do not generally support high levels of endemism^83,84. To examine whether geographic features of seamounts cause high divergence in viral communities, we used two levels of information: (1) viral genetic space represented by protein clusters (PCs)⁸⁵ and (2) species-level viral populations represented by vOTUs. Totals of 790,756 PCs and 1600 vOTUs were identified across the seven samples. The accumulation curves based on pan PCs (Fig. 7a above) and vOTUs (Fig. 7a below) showed that the 5th samples and beyond still added PCs and vOTUs, but also depicted a trend to approach a plateau. These results suggest that, although it is impossible to obtain a complete sample, viral genetics and populations (in particular dsDNA viruses) from seamount sediments are relatively well sampled. Comparative analysis showed that 66% (n = 522,201) of PCs were only present within a single sample, and only 3.3% (n = 25,932) of PCs were shared by all seven samples, suggesting a high divergence of viral genetic space in these samples (Fig. 7b above). Similarly, comparative analysis of vOTUs on the population level showed that most vOTUs (n = 1526, 95.4%) were unique to a single sample, further supporting the remarkably high level of divergence among viruses across seamount sediments (Fig. 7b below).

**Fig. 7: The effects of geographical features of seamounts on viral community.**

To further explore whether the geographical features of seamounts contribute to the high divergence of seamount viruses, the connectivity of viral populations between neighboring seamount sampling sites was calculated. As shown in Fig. 7c, neighboring sites not separated by seamounts generally showed strong connectivity among viral populations, whereas neighboring sites across the same seamount (i.e., with the summit in the middle) showed decreased connectivity. For example, NLG_S03 and NLG_S05 sites, which are located on two different seamounts but have no large geographic barrier between them, showed high correlation in their viral communities. However, the MP4_S01 site showed considerably weaker connection with the MP4_S05 site, even though both are located on the same seamount but on opposite slopes. Moreover, regardless of the substantial difference in the depth between MP4_S03 and NLG_S05 sites, both displayed much stronger connectivity than most pairs of neighboring sites, even though some pairs have comparable depths (such as C1_S01 and NLG_S01). This observation suggests that depth is not one of the primary factors causing divergence in viral communities across seamount sites, which was also supported by the result of Pearson correlation analysis (p > 0.05). Collectively, our results suggest that the physical barrier of the seamount rather than the isolation of the seamount or site depth may explain the degree to which viruses are highly divergent across seamounts. Because viruses require host organisms to replicate, we further attempted to examine the impact of seamount geographical features on the connectivity of prokaryotes. Although the connectivity of the prokaryotic community was also generally impaired by seamounts, the prokaryotic community did not follow the same connectivity pattern that was observed for the viral community (Fig. 7c). The different geographic distribution patterns between viral and prokaryotic communities may be explained by the fact that the factors influencing prokaryotic communities are complex and diverse^86,87. While the virus is indeed a significant factor in shaping host communities, other factors such as grazing and environmental conditions (e.g., depth, salinity, temperature, and nutrients) also play important roles in shaping prokaryotic communities^86,87. In addition, although our results showed that virus and host communities in deep-sea seamount sediments are interacting through close associations, a substantial proportion of seamount viruses was revealed to be temperate (Fig. 5c), which tend to coexist with hosts, thus normally don’t directly impact host communities by causing host mortality. Previous studies have suggested that a significant fraction of sediment viruses are derived from sinking within the overlying seawater, either on host cells or on particulate matter^88,89; some of these viruses may even persist in marine sediments for more than thousands of years⁸⁸. The viral population could be passively transported on oceanic currents⁹⁰ and seamounts exert complex effects on deep ocean circulation and mixing (e.g., deflection of major currents)³; therefore, we suggest that the physical barrier of the seamount suppresses the passive dispersal of ambient viruses by comprising the mixing effects of deep currents, thereby leading to a high level of divergence in viral communities. This study provides the first evidence on the effects of seamount geographic features on the assembly of local viral diversity; however, closer integration of molecular, oceanographic, geological, and ecological research with more well-characterized sediment samples is needed in the future to verify our hypothesis.

Methods

Sample collection and physicochemical measurements

Deep-sea seamount sediment samples were collected from the seamount region in the Northwest Pacific (Fig. 1 and Supplementary Data 1) during the COMRA cruise DY45 in July and August 2017. To fully consider the effect of the geographic features of seamounts on microbial and viral communities, sediment samples were collected across seamounts with varying locations in the bottom, hillside, and summit areas (Fig. 1 and Supplementary Data 1). NA sample was collected from the base of the NA Seamount, and C1_S01 sample was collected from the sediment of the C1 basin between the NA and NLG Seamounts. NLG_S01, NLG_S03, and NLG_S05 samples were collected from the base, summit, and hillside of the NLG Seamount, respectively. MP4_S01 and MP4_S05 samples were collected from the base of the MP4 Seamount but on opposite sides. The sediment samples were collected using a multi-tube sampling approach, and only sediment layers at 2–6 cm below the sea floor were selected for further analysis. The detailed geological information on collected sediments is listed in Supplementary Data 1. The collected sediment samples were frozen at −80 °C on board until further analysis. Nutrient concentrations, including total nitrogen and total organic carbon, were determined at the Qingdao Science Standard Testing platform (Qingdao, China) using standard methods.

Taxonomic profiling of microbial communities by 16S rRNA gene analysis

The total DNA was extracted from sediment samples using the Powersoil DNA Isolation Kit (Mo Bio, USA) according to the manufacturer’s instructions. The V3–V4 region of the 16S rRNA gene was amplified from total DNA using the forward primer 341F (5′CCTACGGGNGGCWGCAG3′) and the reverse primer 805R (5′GACTACHVGGGTATCTAATCC3′). The library was prepared using the TruSeqTM DNA Sample Prep Kit (Illumina, USA) and sequenced on the Illumina MiSeq platform (Illumina Inc., San Diego, CA, USA) by Majorbio Bio-Pharm Technology Co., Ltd. (Shanghai, China). The raw data was first imported into the QIIME 2 v2022.2.0 pipeline⁹¹ via the “tools import” command to produce an a.qza format file suitable for downstream analysis. Sequence quality was assessed via the demux plugin, followed by quality control, denoising, and generation of OTU tables using the DADA2 plugin. Taxonomic annotation was performed using the SILVA database (132_99_16S) via the feature-classifier plugin.

Bulk metagenomic sequencing and assembly

For metagenomics sequencing, a paired-end library was generated from total DNA using NEXTFLEX Rapid DNA-Seq Kit (Bioo Scientific, USA) and sequenced on an Illumina NovaSeq 6000 platform (Illumina Inc., San Diego, CA, USA) by Majorbio Bio-Pharm Technology Co., Ltd. (Shanghai, China). The raw reads were trimmed and quality filtered using fastp v0.23.2⁹² to generate clean reads with high quality. The Contigs were then assembled from clean reads using MEGAHIT v1.2.9⁹³ software (--k-list 21, 29, 39, 59, 79, 99, 119, and 141), and subsequently quality assessed using QUAST v5.2.0⁹⁴.

Generation and analysis of prokaryotic metagenome-assembled genomes

The Contigs assembled from the metagenome were binned by the MetaWRAP v1.3.0⁹⁵ binning module based on maxbin2, metabat1, and metabat2 methods. The original bins were refined using the MetaWRAP v1.3.0⁹⁵ bin_refinement module (with parameters -c 50 -m 10), which were then quality checked by CheckM v1.0.12⁹⁶. The high- and medium-quality bins (completeness ≥50% and contamination ≤10%) were then aggregated and dereplicated at 95% ANI using dRrep v3.3.0⁹⁷, resulting in a total of 63 species-level MAGs. MAGs were taxonomically assigned using GTDB-tk v2.1.0⁹⁸ based on classify_wf workflows. Maximum-likelihood phylogeny of MAGs was inferred using IQ-TREE v2.2.0.3⁹⁹ from a concatenation of 120 bacterial or 122 archaeal marker genes produced by GTDB-tk v2.1.0⁹⁸; the generated tree was visualized using iTOL v4 (https://itol.embl.de/). To determine the relative abundance of MAGs in each sample, clean reads were mapped to MAGs using CoverM v0.6.1¹⁰⁰ (with parameters -contig -m rpkm --trim-min 5 - -trim-max 95) to calculate RPKM values. Functional annotation of MAGs was performed using METABOLIC v4.0¹⁰¹ (-m-cutoff 0.75 -kofam-db full) (Supplementary Data 9).

Virus purification

Ambient viruses were purified from seamount sediment samples according to methods described previously³³. Briefly, 20 g of seamount sediment sample was suspended in 30 ml of SM solution (100 mM NaCl, 8 mM MgSO₄·7H₂O, and 50 mM Tris/HCl; pH 7.5), shaken for 30 min at 4 °C and centrifuged at 3000 × g for 15 min at 4 °C. The precipitated sediment particles were then repeatedly extracted with SM solution, and the resulting supernatants from both extractions were combined. After filtering through a 0.45 μm mesh, the viral particles in the supernatant were enriched using 100 kDa centrifugal ultrafiltration tubes by centrifugation at 4000 × g until the final sample volume measured less than 1 ml. We used 0.45 µm filters to enrich ambient viruses instead of 0.22 µm filters, because 0.45-μm filtration offers advantages such as avoidance of missing large viruses crucial for assessing diversity, more comprehensive virus detection, and reduction of processing losses¹⁰².

Virome sequencing and assembly

Prior to viral DNA extraction, virus concentrates were treated with DNase I at 37 °C for 1 h to remove exogenous DNA. Encapsidated viral DNA was then extracted as described by Thurber et al.¹⁰³. The library was prepared using the TruSeq DNA Sample Prep Kit (Illumina, USA) and sequenced on the Illumina HiSeq 2000 platform (Illumina Inc., San Diego, CA, USA) by Majorbio Bio-Pharm Technology Co., Ltd. (Shanghai, China). The raw reads were trimmed and quality filtered using fastp v0.23.2⁹² to generate clean reads with high quality. Contigs were then assembled from clean reads using MEGAHIT v1.2.9⁹³ software (--k-list 21, 29, 39, 59, 79, 99, 119, 141).

Identification of viral sequences

To obtain as many bona fide viral sequences as possible, viral sequences were identified by the following three pipelines: (i) Contigs ≥2 kb from metagenome assemblies, as well as virome assemblies, were used to recover viral sequences by VirSorter2 v2.2.3¹⁰⁴ and VIBRANT v1.2.0¹⁰⁵ using default settings. CheckV 0.9.0⁴⁴ was then used to evaluate the quality of viral sequences and remove host contaminations. Only viral sequences containing at least one of the viral hallmark genes (such as virion morphogenesis gene and terminase gene) were retained. (ii) What the Phage (Wtp) v1.1.0¹⁰⁶ was used to identify viral sequences from virome assemblies and only complete, high-quality, or circular viral sequences recognized by CheckV v0.9.0⁴⁴ were retained. (iii) Metaviral SPAdes v3.15.5⁴³ was used to assemble and identify viral sequences from the clean reads of the virome, and only complete, high-quality, or circular viral sequences recognized by CheckV v0.9.0⁴⁴ were retained. Viral sequences identified by pipelines (i) and (ii) were combined and binned using vRhyme v1.1.0³⁹ to generate vMAGs. The remaining un-binned sequences were combined with viral sequences identified by pipeline (iii) and were regarded as vContigs. The resulting vContigs/vMAGs were de-redundant by vRhyme v1.1.0³⁹ (with parameters --derep_only --method longest --derep_id 0.95 --frac 0.80) to generate vOTUs. An overview of the bioinformatic workflow used for the identification of viral sequences is shown in Supplementary Fig. 3a.

Taxonomic assignments and abundance profiles of viruses

ORFs were predicted from vOTU sequences by Prodigal v2.6.3¹⁰⁷ (-p meta -g 11 -m -c). Predicted ORFs were then mapped against the NCBI viral_Refseq database (2023-04-26) using CAT v5.0.3¹⁰⁸ to determine the taxonomic affiliation of vOTUs based on the Last Common Ancestor algorithm. To determine the relative abundance of vOTUs in each sample, clean reads were mapped to vOTUs using the contig mode and genome mode of CoverM v0.6.1¹⁰⁰ (with parameters -m rpkm --trim-min 5 - -trim-max 95), to calculate RPKM values.

Mantel’s correlation analysis

Pairwise comparisons of environmental variables (organic carbon, depth, and nitrogen) were calculated by Pearson correlation analysis. The correlations between Bray–Curtis dissimilarity of prokaryotic and viral OTU profiles and Euclidean distances of environmental variables were assessed using the Mantel test with the “ggcor” package in R^109,110. Each environmental variable was related to prokaryotic and viral OTU profiles by the Mantel test (function mantel; permutations = 9999 and method = “pearson”). Finally, Pearson correlations and the Mantel test were visualized using R software via the “ggcor” package^109,110.

Prediction of virus-host linkages

Virus-host linkages were predicted based on four different in silico strategies³⁵: (1) CRISPR-spacers match. A CRISPR spacer database was constructed for a set of microbial genomes using MinCED v0.4.2¹¹¹. CRISPR spacers were then queried for exact sequence matches against viral Contigs using BLAST+ v2.9.0¹¹² (with parameters -task blastn-short -word_size 7 -perc_identity 95 -qcov_hsp_ perc 95). Only matches of at least 95% identity over 95% spacer length and only ≤1 mismatch were regarded as highly confident virus-host linkages. (2) tRNA match. tRNAs were identified from microbial genome dataset and vOTUs using ARAGORN v1.2.41¹¹³. Recovered tRNAs were matched using BLASTn v2.9.0¹¹² (blastn -perc_identity 100 -qcov_hsp_perc 100), and only those with exact match (100% identity over 100% coverage) were regarded as highly confident virus-host linkages. (3) Nucleotide sequence homology. Sequences of vOTUs were compared with the dataset of microbial genomes by BLASTn v2.9.0¹¹² (blastn -perc_identity 70 -qcov_hsp_perc 75 -e⁻³). The match requirements were 70% minimum nucleotide identity, 75% minimum coverage of the viral contig length, 50 minimum bit score, and 0.001 maximum e-value. (4) k-mer frequencies. WIsH v1.0¹¹⁴ was run using default parameters to infer a connection between viruses and hosts based on k-mer frequencies, and p ≤ 0.005 was considered as a match (Supplementary Data 8). Whenever multiple hosts were predicted for a specific vOTU, the one supported by multiple approaches was chosen. The microbial genome dataset used above was composed of (1) all reference genomes from GTDB-tk (release207_v2, n = 24,778), (2) All MAGs (≥50% completeness and ≤10% contamination, n = 62) recovered from seamount metagenomes in this study, and (3) other seamount MAGs obtained from IMG/MR (n = 151, Supplementary Data 7).

Life strategy prediction

Life strategies of vOTUs were predicted by the following two pipelines. (1) VIBRANT v1.2.0¹⁰⁵ and CheckV v0.9.0⁴⁴ were used to infer temperate life strategies by identifying vOTUs that contain provirus integration sites or integrase genes. (2) ORFs from all vOTUs were functionally annotated using eggNOG-mapper v2.1.6¹¹⁵, and sequences containing lysogeny-specific genes (i.e., genes encoding for integrases, recombinases, transposases, excisionases, CI/Cro repressor, and parAB) were selected and manually inspected. The vOTUs identified by the above pipelines were considered temperate, while others were considered unknown.

Construction of phylogenetic trees on major virus groups

Phylogenetic trees of Caudovirales and Microviridae were generated based on TerL and VP1, respectively. The Contigs assembled from the seamount sediment metagenome and the virome were searched by HMMER v3.3.2¹¹⁶ (hmmsearch model) using the Hidden Markov Model (PF03237.hmm, PF04466.hmm, and PF05876.hmm for TerL, and PF02305.hmm fro VP1); sequences with an e-value ≤ 0.05 were retained. The obtained sequences and reference sequences (Supplementary Data 4 and 13) were aligned by MUSCLE v5.1¹¹⁷. The alignments were then trimmed using TrimAL v1.4.rev15¹¹⁸ (-automated1). The maximum-likelihood phylogenetic tree was then constructed using IQ-TREE v2.2.0.3⁹⁹ (-bb 1000 -nt AUTO -m MFP), and the support for nodes in the trees was evaluated with 1000 bootstrap replicates. The resulting phylogenetic trees were visualized by iTOL (https://itol.embl.de/).

AMG identification and annotation

Viral AMGs were identified and annotated using both VIBRANT v1.2.0¹⁰⁵ and DRAMv v1.3.5¹¹⁹ pipelines as follows: (1) VIBRANT pipeline. The AMGs in vOTU sequences were identified by VIBRANT v1.2.0¹⁰⁵ using default parameters. (2) DRAMv pipeline. The VirSorter2 v2.2.3¹⁰⁴ (--prep-for-dramv) was run on the vOTU sequences, and the AMGs (auxiliary scores <4) were predicted from the resulting sequences by DRAMv v1.3.5¹¹⁹. The AMGs predicted by the above pipelines were combined, and manual curation was carried out to remove illegal AMGs (such as nucleotide metabolism, DNA-related reactions, modification of viral components, modification of viral components, ribosomal proteins, transcriptional/translational regulators, and viral invasion). Putative AMGs were further annotated using the dbCAN2 server (https://bcb.unl.edu/dbCAN2/) and NCBI CD-search tool (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) with the threshold value of e-value < 10⁻⁵. To infer the host source of AMGs, they were queried against the NR database (2021-01-07) by BLASTp. For the prediction of the AMG 3D structure, Colabfold v1.5.2¹²⁰ was used (Supplementary Data 10 and 11). The relative abundance of AMGs is determined by calculating the relative abundance of vOTUs carrying those AMGs in each sample.

Comparisons to viral sequences from other marine environments and RefSeq database

To compare the viral sequences from seamount sediments with those from other marine environments and the RefSeq database, protein-sharing network analysis of seamount sediment vOTUs, Global Oceans Viromes 2 (GOV 2.0) database⁵² (>10 kb, n = 195,728), and viral Contigs from cold seep (n = 2885)³⁴ and trench (n = 12,700)⁵³ were performed. For each viral Contig, ORFs were predicted using Prodigal v2.6.3¹⁰⁷ (-p meta -g 11 -m -c), and predicted protein sequences were then subjected to all-to-all BLASTp using Diamond v2.0.15¹²¹; the result file was used as input for vConTACT2 v0.11.3⁵¹. Viral RefSeq (v94) was used as reference database to generate the protein-sharing network (Supplementary Data 5 and 6), and Cytoscape v3.9.1¹²² was used to visualize the network.

The effects of geographical features of seamounts on viruses

To generate viral core and pan PCs in the seamount virus dataset, ORFs were called from virome-assembled Contigs using Prodigal v2.6.3¹⁰⁷ and were then aligned to the GOV2.0⁵² and IMG/VR (release version 2022-12-19_7.1)¹²³ database using Diamond v2.0.15¹²¹ with a threshold of e-value ≤ 1e⁻⁵, identity ≥ 30%, and coverage ≥ 50%⁵³. The resulting ORFs were translated into proteins and clustered at 60% identity and 80 coverage using CD-HIT v4.6¹²⁴ to generate non-redundant viral PCs. Pan PCs were obtained by merging PCs from different samples, and core PCs were acquired by identifying the PCs that were shared by all samples.

The connectivity of viral and prokaryotic populations between neighboring seamount sampling sites was calculated based on their similarity in the pOTU and vOTU profiles, respectively. Briefly, pOTU and vOTU matrix tables were used as import files for the vegan package of the R software^110,125; the Bray–Curtis distance matrix between each sample was calculated and further converted to similarity values using the following formula: similarity value = 1/(1 + distance matrix)¹²⁶.

Plotting

Box plots, heat maps, bar stacking plots, and gene maps were drawn using the R packages ggplot2 v4.3.2¹²⁷, pheatmap v1.0.12¹²⁸, ggpubr v0.6.0¹²⁹, and gggenes v0.5.0 (https://cran.r-project.org/web/packages/gggenes), respectively. Venn and upset plots were plotted by Tbtools v1.120¹³⁰.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The raw data of bulk metagenome, virome, and 16S rRNA genes are deposited in the NCBI SRA database under accession numbers SAMN36987747-52, SAMN36987740-46, and SAMN36987753-58, respectively. All processed data generated in this study are provided in Supplementary Data files (Supplementary Data 1–13). Source data for all main and Supplementary Figs. are provided with this paper. The raw Treefiles for phylogenetic trees in Fig. 2 (https://doi.org/10.6084/m9.figshare.25245436), Fig. 5b (https://doi.org/10.6084/m9.figshare.25245730), and Supplementary Fig. 5 (https://doi.org/10.6084/m9.figshare.25245745) are deposited in the Figshare repository. The raw data file for the gene-sharing network in Fig. 4a is deposited in the Figshare repository (https://doi.org/10.6084/m9.figshare.25245358). The links to the databases used in this study are listed below: Silva database (release 132) [https://www.arb-silva.de/documentation/release-132/]; Genome Taxonomy database [https://data.ace.uq.edu.au/public/gtdb/data/releases/release207/]; NCBI RefSeq database [https://ftp.ncbi.nlm.nih.gov/refseq/release/viral/]; NCBI Taxonomy database [https://www.ncbi.nlm.nih.gov/taxonomy]; eggNOG database (release 5.0) [http://eggnog5.embl.de/download/eggnog_5.0/]; Pfam [https://pfam.xfam.org/]; dbCAN2 server [https://bcb.unl.edu/dbCAN2/]; NCBI CD-search tool [https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi]; Global Oceans Viromes 2 (GOV 2.0) database [https://datacommons.cyverse.org/browse/iplant/home/shared/iVirus/GOV2.0]; viral Contigs from cold seep [https://doi.org/10.6084/m9.figshare.12922229]; viral Contigs from trench (OEP001086 and OEP001087) [https://www.biosino.org/node/]; IMG/VR database (release 2022-12-19_7.1) [https://genome.jgi.doe.gov/portal/IMG_VR/IMG_VR.home.html]; IMG/MR [https://img.jgi.doe.gov/]. Source data are provided with this paper.

References

Rowden, A. A. et al. Paradigms in seamount ecology: fact, fiction and future. Mar. Ecol. 31, 226–241 (2010).
Article ADS Google Scholar
Lavelle, J. W. & Mohn, C. Motion, commotion, and biophysical connections at deep ocean seamounts. Oceanography 23, 90–103 (2010).
Article Google Scholar
Rogers, A. D. The biology of seamounts: 25 years on. Adv. Mar. Biol. 79, 137–224 (2018).
Article PubMed Google Scholar
Clark, M. R. et al. The ecology of seamounts: structure, function, and human impacts. J. Annu. Ann. Rev. Mar. Sci. 2, 253–278 (2010).
Article ADS Google Scholar
Mendonça, A. et al. Is there a seamount effect on microbial community structure and biomass? The case study of Seine and Sedlo Seamounts (Northeast Atlantic). PLoS ONE 7, e29526 (2012).
Article ADS PubMed PubMed Central Google Scholar
Jacobson Meyers, M. E., Sylvan, J. B. & Edwards, K. J. Extracellular enzyme activity and microbial diversity measured on seafloor exposed basalts from Loihi seamount indicate the importance of basalts to global biogeochemical cycling. Appl. Environ. Microbiol. 80, 4854–4864 (2014).
Article ADS PubMed PubMed Central Google Scholar
Khandeparker, R., Meena, R. M. & Deobagkar, D. Bacterial diversity in deep-sea sediments from Afanasy Nikitin seamount, equatorial Indian Ocean. Geomicrobiol. J. 31, 942–949 (2014).
Article Google Scholar
Li, H., Zhou, H., Yang, S. & Dai, X. Stochastic and deterministic assembly processes in seamount microbial communities. Appl. Environ. Microbiol. 89, e00701–e00723 (2023).
Article ADS PubMed PubMed Central Google Scholar
Huo, Y. et al. Ecological functions of uncultured microorganisms in the cobalt-rich ferromanganese crust of a seamount in the central Pacific are elucidated by fosmid sequencing. Acta Oceanol. Sin. 34, 92–113 (2015).
Article CAS Google Scholar
Suttle, C. A. Viruses in the sea. Nature 437, 356–361 (2005).
Article ADS CAS PubMed Google Scholar
Paez-Espino, D. et al. Uncovering Earth’s virome. Nature 536, 425–430 (2016).
Article ADS CAS PubMed Google Scholar
Sime-Ngando, T. Environmental bacteriophages: viruses of microbes in aquatic ecosystems. Front. Microbiol. 5, 355 (2014).
Article PubMed PubMed Central Google Scholar
Breitbart, M. Marine viruses: truth or dare. Ann. Rev. Mar. Sci. 4, 425–448 (2012).
Article PubMed Google Scholar
De Smet, J. et al. High coverage metabolomics analysis reveals phage-specific alterations to Pseudomonas aeruginosa physiology during infection. ISME J. 10, 1823–1835 (2016).
Article PubMed PubMed Central Google Scholar
Hurwitz, B. L., Westveld, A. H., Brum, J. R. & Sullivan, M. B. Modeling ecological drivers in marine viral communities using comparative metagenomics and network analyses. Proc. Natl Acad. Sci. 111, 10714–10719 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Rohwer, F. & Thurber, R. V. Viruses manipulate the marine environment. Nature 459, 207–212 (2009).
Article ADS CAS PubMed Google Scholar
Zhang, R., Wei, W. & Cai, L. The fate and biogeochemical cycling of viral elements. Nat. Rev. Microbiol. 12, 850–851 (2014).
Article CAS PubMed Google Scholar
Anantharaman, K. et al. Sulfur oxidation genes in diverse deep-sea viruses. Science 344, 757–760 (2014).
Article ADS CAS PubMed Google Scholar
Roux, S. et al. Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses. Nature 537, 689–693 (2016).
Article CAS PubMed Google Scholar
Danovaro, R. et al. Prokaryote diversity and viral production in deep-sea sediments and seamounts. Deep Sea Res. Part II Top. Stud. Oceanogr. 56, 738–747 (2009).
Article ADS Google Scholar
Guo, X., Zhang, T., Jin, M. & Zeng, R. Characterization of Bacillus phage Gxv1, a novel lytic Salasvirus phage isolated from deep-sea seamount sediments. Mar. Life Sci. Tech. 3, 13–19 (2021).
Article ADS CAS Google Scholar
Beeston, M. A., Cragg, S. M. & Linse, K. Hydrological features above a Southern Ocean seamount inhibit larval dispersal and promote speciation: evidence from the bathyal mytilid Dacrydium alleni sp. nov.(Mytilidae: Bivalvia). Polar Biol. 41, 1493–1504 (2018).
Article Google Scholar
Liu, R. et al. Novel Chloroflexi genomes from the deepest ocean reveal metabolic strategies for the adaptation to deep-sea habitats. Microbiome 10, 1–17 (2022).
Article Google Scholar
Zhang, Y. et al. Abundance and diversity of candidate division JS1-and Chloroflexi-related bacteria in cold seep sediments of the northern South China Sea. Front. Earth Sci. 6, 373–382 (2012).
Article ADS CAS Google Scholar
Dombrowski, N., Seitz, K. W., Teske, A. P. & Baker, B. J. Genomic insights into potential interdependencies in microbial hydrocarbon and nutrient cycling in hydrothermal sediments. Microbiome 5, 1–13 (2017).
Article Google Scholar
Marimuthu, J. et al. Deep-sea sediment metagenome from Bay of Bengal reveals distinct microbial diversity and functional significance. Genomics 114, 110524 (2022).
Article CAS PubMed Google Scholar
Bowers, R. M. et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat. Biotechnol. 35, 725–731 (2017).
Article CAS PubMed PubMed Central Google Scholar
Fullerton, H., Hager, K. W., McAllister, S. M. & Moyer, C. L. Hidden diversity revealed by genome-resolved metagenomics of iron-oxidizing microbial mats from Lō’ihi Seamount, Hawai’i. ISME J. 11, 1900–1914 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kato, S., Hirai, M., Ohkuma, M. & Suzuki, K. Microbial metabolisms in an abyssal ferromanganese crust from the Takuyo-Daigo Seamount as revealed by metagenomics. PLoS ONE 14, e0224888 (2019).
Article CAS PubMed PubMed Central Google Scholar
Fortunato, C. S., Larson, B., Butterfield, D. A. & Huber, J. A. Spatially distinct, temporally stable microbial populations mediate biogeochemical cycling at and below the seafloor in hydrothermal vent fluids. Environ. Microbiol. 20, 769–784 (2018).
Article CAS PubMed Google Scholar
McAllister, S. M. et al. Validating the Cyc2 neutrophilic iron oxidation pathway using meta-omics of Zetaproteobacteria iron mats at marine hydrothermal vents. Msystems 5, e00553-19 (2020).
Article PubMed PubMed Central Google Scholar
Monaghan, M. Whole Genome Sequencing of Aquatic Fungi Responsible for the Degradation of Recalcitrant Substrates in Liquid Environments (USDOE Joint Genome Institute (JGI), 2014).
Jin, M. et al. Diversities and potential biogeochemical impacts of mangrove soil viruses. Microbiome 7, 1–15 (2019).
Article CAS Google Scholar
Li, Z. et al. Deep sea sediments associated with cold seeps are a subsurface reservoir of viral diversity. ISME J. 15, 2366–2378 (2021).
Article CAS PubMed PubMed Central Google Scholar
Cheng, R. et al. Virus diversity and interactions with hosts in deep-sea hydrothermal vents. Microbiome 10, 1–17 (2022).
Article Google Scholar
Gregory, A. C. et al. The gut virome database reveals age-dependent patterns of virome diversity in the human gut. Cell Host Microbe 28, 724–740.e728 (2020).
Article CAS PubMed PubMed Central Google Scholar
Trubl, G. et al. Soil viruses are underexplored players in ecosystem carbon processing. MSystems 3, 00076-18 (2018).
Article Google Scholar
Krupovic, M. et al. Cressdnaviricota: a virus phylum unifying seven families of rep-encoding viruses with single-stranded, circular DNA genomes. J. Virol. 94, 00582–00520 (2020).
Article Google Scholar
Kieft, K. et al. vRhyme enables binning of viral genomes from metagenomes. Nucleic Acids Res. 50, e83–e83 (2022).
Article CAS PubMed PubMed Central Google Scholar
Roux, S. et al. Minimum information about an uncultivated virus genome (MIUViG). Nat. Biotechnol. 37, 29–37 (2019).
Article CAS PubMed Google Scholar
Mönttinen, H. A., Bicep, C., Williams, T. A. & Hirt, R. P. The genomes of nucleocytoplasmic large DNA viruses: viral evolution writ large. Microb. Genom. 7, 9 (2021).
Google Scholar
Philippe, N. et al. Pandoraviruses: amoeba viruses with genomes up to 2.5 Mb reaching that of parasitic eukaryotes. Science 341, 281–286 (2013).
Article ADS CAS PubMed Google Scholar
Antipov, D., Raiko, M., Lapidus, A. & Pevzner, P. A. Metaviral SPAdes: assembly of viruses from metagenomic data. Bioinformatics 36, 4126–4129 (2020).
Article CAS PubMed Google Scholar
Nayfach, S. et al. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat. Biotechnol. 39, 578–585 (2021).
Article CAS PubMed Google Scholar
Rao, V. B. & Feiss, M. Mechanisms of DNA packaging by large double-stranded DNA viruses. Annu. Rev. Virol. 2, 351–378 (2015).
Article CAS PubMed PubMed Central Google Scholar
Székely, A. J. & Breitbart, M. Single-stranded DNA phages: from early molecular biology tools to recent revolutions in environmental microbiology. FEMS Microbiol. Lett. 363, fnw027 (2016).
Article PubMed Google Scholar
Bryson, S. J. et al. A novel sister clade to the enterobacteria microviruses (family M icroviridae) identified in methane seep sediments. Environ. Microbiol. 17, 3708–3721 (2015).
Article CAS PubMed Google Scholar
Yoshida, M. et al. Quantitative viral community DNA analysis reveals the dominance of single-stranded DNA viruses in offshore upper bathyal sediment from Tohoku. Jpn. Front. Microbiol. 9, 75 (2018).
Article Google Scholar
Yoshida, M. et al. Metagenomic analysis of viral communities in (hado) pelagic sediments. PLoS ONE 8, e57271 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Quaiser, A. et al. Diversity and comparative genomics of Microviridae in Sphagnum-dominated peatlands. Front. Microbiol. 6, 375 (2015).
Article PubMed PubMed Central Google Scholar
Bin Jang, H. et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat. Biotechnol. 37, 632–639 (2019).
Article Google Scholar
Gregory, A. C. et al. Marine DNA viral macro-and microdiversity from pole to pole. Cell 177, 1109–1123. e14 (2019).
Article CAS PubMed PubMed Central Google Scholar
Jian, H. et al. Diversity and distribution of viruses inhabiting the deepest ocean on Earth. ISME J. 15, 3094–3110 (2021).
Article CAS PubMed PubMed Central Google Scholar
Suttle, C. A. Marine viruses—major players in the global ecosystem. Nat. Rev. Microbiol. 5, 801–812 (2007).
Article CAS PubMed Google Scholar
Zhang, M. et al. The life cycle transitions of temperate phages: regulating factors and potential ecological implications. Viruses 14, 1904 (2022).
Article CAS PubMed PubMed Central Google Scholar
Wommack, K. E. & Colwell, R. R. Virioplankton: viruses in aquatic ecosystems. Microbiol. Mol. Biol. Rev. 64, 69–114 (2000).
Article CAS PubMed PubMed Central Google Scholar
Edwards, R. A. et al. Computational approaches to predict bacteriophage–host relationships. FEMS Microbiol. Rev. 40, 258–272 (2016).
Article CAS PubMed Google Scholar
Thingstad, T. F. & Lignell, R. Theoretical models for the control of bacterial growth rate, abundance, diversity and carbon demand. Microb. Ecol. 13, 19–27 (1997).
Article Google Scholar
Franco, N. R. et al. Bacterial composition and diversity in deep-sea sediments from the Southern Colombian Caribbean Sea. Diversity 13, 10 (2020).
Article Google Scholar
Franco, D. C. et al. High prevalence of gammaproteobacteria in the sediments of admiralty bay and north bransfield Basin, Northwestern Antarctic Peninsula. Front. Microbiol. 8, 153 (2017).
Article PubMed PubMed Central Google Scholar
Dyksma, S., Lenk, S., Sawicka, J. E. & Mußmann, M. Uncultured gammaproteobacteria and desulfobacteraceae account for major acetate assimilation in a coastal marine sediment. Front. Microbiol. 9, 3124 (2018).
Article PubMed PubMed Central Google Scholar
Tully, B. J. & Heidelberg, J. F. Potential mechanisms for microbial energy acquisition in oxic deep-sea sediments. Appl. Environ. Microbiol. 82, 4232–4243 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Dyksma, S. et al. Ubiquitous Gammaproteobacteria dominate dark carbon fixation in coastal sediments. ISME J. 10, 1939–1953 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zhao, S. et al. Biodegradation of polyethylene terephthalate (PET) by diverse marine bacteria in deep‐sea sediments. Environ. Microbiol. 25, 2719–2731 (2023).
Boliang, G., Min, J. & Li, L. Genome sequencing reveals the complex polysaccharide-degrading ability of novel deep-sea bacterium Flammeovirga pacifica WPAGA1. Front. Microbiol. 8, 600 (2017).
Google Scholar
Mara, P. et al. Metagenomic profiles of archaea and bacteria within thermal and geochemical gradients of the Guaymas Basin deep subsurface. Nat. Commun. 14, 7768 (2023).
Garritano, A. N. et al. Species-specific relationships between deep sea sponges 24 and their symbiotic Nitrosopumilaceae. ISME J. 1–3 (2023).
Google Scholar
Garritano, A. N. et al. Species-specific relationships between deep sea sponges and their symbiotic Nitrosopumilaceae. ISME J. 17, 1517–1519 (2023)
Hawkins, A. S. et al. Role of 4-hydroxybutyrate-CoA synthetase in the CO2 fixation cycle in thermoacidophilic archaea. J. Biol. Chem. 288, 4012–4022 (2013).
Article CAS PubMed Google Scholar
Jin, M. et al. Prevalence of temperate viruses in deep South China Sea and western Pacific Ocean. Deep Sea Res. Part I Oceanogr. Res. Pap. 166, 103403 (2020).
Article CAS Google Scholar
Williamson, S. J. et al. Lysogenic virus–host interactions predominate at deep-sea diffuse-flow hydrothermal vents. ISME J. 2, 1112–1121 (2008).
Article CAS PubMed Google Scholar
Mizuno, C. M. et al. Genomes of abundant and widespread viruses from the deep ocean. MBio 7, 00805-16 (2016).
Article Google Scholar
Hurwitz, B. L. & U’Ren, J. M. Viral metabolic reprogramming in marine ecosystems. Curr. Opin. Microbiol. 31, 161–168 (2016).
Article CAS PubMed Google Scholar
Luo, X.-Q. et al. Viral community-wide auxiliary metabolic genes differ by lifestyles, habitats, and hosts. Microbiome 10, 1–18 (2022).
Article Google Scholar
Klippel, B. et al. Carbohydrate-active enzymes identified by metagenomic analysis of deep-sea sediment bacteria. Extremophiles 18, 853–863 (2014).
Article CAS PubMed Google Scholar
Kieft, K. et al. Ecology of inorganic sulfur auxiliary metabolism in widespread bacteriophages. Nat. Commun. 12, 3503 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Eyice, Ö. et al. Bacterial SBP56 identified as a Cu-dependent methanethiol oxidase widely distributed in the biosphere. ISME J. 12, 145–160 (2018).
Article CAS PubMed Google Scholar
Tu, Q. et al. Metagenomic reconstruction of nitrogen cycling pathways in a CO2-enriched grassland ecosystem. Soil Biol. Biochem. 106, 99–108 (2017).
Article ADS CAS Google Scholar
Liao, L. et al. Microbial diversity in deep-sea sediment from the cobalt-rich crust deposit region in the Pacific Ocean. FEMS Microbiol. Ecol. 78, 565–585 (2011).
Article CAS PubMed Google Scholar
Sabri, M. et al. Genome annotation and intraviral interactome for the Streptococcus pneumoniae virulent phage Dp-1. J. Bacteriol. 193, 551–562 (2011).
Article CAS PubMed Google Scholar
Rosenwasser, S., Ziv, C., Van Creveld, S. G. & Vardi, A. Virocell metabolism: metabolic innovations during host–virus interactions in the ocean. Trends Microbiol. 24, 821–832 (2016).
Article CAS PubMed Google Scholar
Richer de Forges, B., Koslow, J. A. & Poore, G. Diversity and endemism of the benthic seamount fauna in the southwest Pacific. Nature 405, 944–947 (2000).
Article ADS CAS Google Scholar
Schlacher, T. A., Rowden, A. A., Dower, J. F. & Consalvey, M. Seamount science scales undersea mountains: new research and outlook. Mar. Ecol. 31, 1–13 (2010).
Article ADS Google Scholar
Samadi, S. et al. Seamount endemism questioned by the geographic distribution and population genetic structure of marine invertebrates. Mar. Biol. 149, 1463–1475 (2006).
Article Google Scholar
Yooseph, S. et al. The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families. PLoS Biol. 5, e16 (2007).
Article PubMed PubMed Central Google Scholar
Ambati, M. & Kumar, M. S. Microbial diversity in the Indian Ocean sediments: an insight into the distribution and associated factors. Curr. Microbiol. 79, 115 (2022).
Article CAS PubMed Google Scholar
Mojica, K. D. & Brussaard, C. P. Factors affecting virus dynamics and microbial host–virus interactions in marine environments. FEMS Microbiol. Ecol. 89, 495–515 (2014).
Article CAS PubMed Google Scholar
Cai, L. et al. Active and diverse viruses persist in the deep sub-seafloor sediments over thousands of years. ISME J. 13, 1857–1864 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hewson, I. & Fuhrman, J. Viriobenthos production and virioplankton sorptive scavenging by suspended sediment particles in coastal and pelagic waters. Microb. Ecol. 46, 337–347 (2003).
Article ADS CAS PubMed Google Scholar
Brum, J. R. et al. Patterns and ecological drivers of ocean viral communities. Science 348, 1261498 (2015).
Article PubMed Google Scholar
Bolyen, E. et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 37, 852–857 (2019).
Article CAS PubMed PubMed Central Google Scholar
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Article PubMed PubMed Central Google Scholar
Li, D. et al. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31, 1674–1676 (2015).
Article CAS PubMed Google Scholar
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
Article CAS PubMed PubMed Central Google Scholar
Uritskiy, G. V., DiRuggiero, J. & Taylor, J. MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome 6, 1–13 (2018).
Article Google Scholar
Parks, D. H. et al. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Article CAS PubMed PubMed Central Google Scholar
Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 11, 2864–2868 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36, 1925–1927 (2019).
Article PubMed PubMed Central Google Scholar
Minh, B. Q. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. 37, 1530–1534 (2020).
Google Scholar
Woodcroft, B. & Newell, R. CoverM: Read coverage calculator for metagenomics. Github (2017).
Zhou, Z. et al. METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks. Microbiome 10, 33 (2022).
Article CAS PubMed PubMed Central Google Scholar
Göller, P. C. et al. Uncovering a hidden diversity: optimized protocols for the extraction of dsDNA bacteriophages from soil. Microbiome 8, 1–16 (2020).
Article Google Scholar
Thurber, R. V. et al. Laboratory procedures to generate viral metagenomes. Nat. Protoc. 4, 470–483 (2009).
Article CAS PubMed Google Scholar
Guo, J. et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome 9, 1–13 (2021).
Article Google Scholar
Kieft, K., Zhou, Z. & Anantharaman, K. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome 8, 1–23 (2020).
Article Google Scholar
Marquet, M. et al. What the phage: a scalable workflow for the identification and analysis of phage sequences. GigaScience 11, giac110 (2022).
Article PubMed PubMed Central Google Scholar
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 11, 1–11 (2010).
Article Google Scholar
Von Meijenfeldt, F. B. et al. Robust taxonomic classification of uncharted microbial sequences and bins with CAT and BAT. Genome Biol. 20, 1–14 (2019).
Google Scholar
Huang, H., Zhou, L., Chen, J., & Wei, T. ggcor: Extended tools for correlation analysis and visualization. R package version 09 7 (2020).
Olive, D. J. Software for data analysis: programming with R. Technometrics 52, 261 (2010).
Google Scholar
Bland, C. et al. CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinform. 8, 1–8 (2007).
Article Google Scholar
Ye, J., McGinnis, S. & Madden, T. L. BLAST: improvements for better sequence analysis. Nucleic Acids Res. 34, W6–W9 (2006).
Article CAS PubMed PubMed Central Google Scholar
Laslett, D. & Canback, B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 32, 11–16 (2004).
Article CAS PubMed PubMed Central Google Scholar
Galiez, C. et al. WIsH: who is the host? Predicting prokaryotic hosts from metagenomic phage contigs. Bioinformatics 33, 3113–3114 (2017).
Article CAS PubMed PubMed Central Google Scholar
Cantalapiedra, C. P. et al. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol. Biol. 38, 5825–5829 (2021).
Article CAS Google Scholar
Finn, R. D., Clements, J. & Eddy, S. R. HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 39, W29–W37 (2011).
Article CAS PubMed PubMed Central Google Scholar
Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 5, 1–19 (2004).
Article Google Scholar
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Article PubMed PubMed Central Google Scholar
Shaffer, M. et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 48, 8883–8900 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mirdita, M. et al. ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682 (2022).
Article CAS PubMed PubMed Central Google Scholar
Buchfink, B., Reuter, K. & Drost, H.-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods 18, 366–368 (2021).
Article CAS PubMed PubMed Central Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Paez-Espino, D. et al. IMG/VR v. 2.0: an integrated data management and analysis system for cultivated and environmental viral genomes. Nucleic Acids Res. 47, D678–D686 (2019).
Article CAS PubMed Google Scholar
Fu, L. et al. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dixon, P. VEGAN, a package of R functions for community ecology. J. Veg. Sci. 14, 927–930 (2003).
Article Google Scholar
Bloom, S. A. Similarity indices in community studies: potential pitfalls. Mar. Ecol. Prog. Ser. 5, 125–128 (1981).
Gómez-Rubio, V. ggplot2-elegant graphics for data analysis. J. Stat. Softw. 77, 1–3 (2017).
Article Google Scholar
Kolde, R. & Kolde, M. R. Package ‘pheatmap’. R. package 1, 790 (2015).
Kassambara, A. ggpubr: “ggplot2” based publication ready plots. R package version 04 0 438, (2020).
Chen, C. et al. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13, 1194–1202 (2020).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was financially supported by the Scientific Research Foundation of Third Institute of Oceanography, MNR (2024025, M.J.), the National Natural Science Foundation of China (41976084, M.J.), the Innovation Group Project of Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai) (311021006, M.J.), and the Deep Sea Habitats Discovery Project (DY-XZ-04, M.J.).

Author information

These authors contributed equally: Meishun Yu, Menghui Zhang.

Authors and Affiliations

State Key Laboratory Breeding Base of Marine Genetic Resource and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Third Institute of Oceanography, Ministry of Natural Resources, Xiamen, 361000, China
Meishun Yu, Menghui Zhang, Runying Zeng, Ruolin Cheng, Yanping Hou, Fangfang Kuang, Xuejin Feng, Xiyang Dong, Yinfang Li, Zongze Shao & Min Jin
Institute for Advanced Study, Shenzhen University, Shenzhen, Guangdong, China
Rui Zhang

Authors

Meishun Yu
View author publications
You can also search for this author in PubMed Google Scholar
Menghui Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Runying Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Ruolin Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Rui Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yanping Hou
View author publications
You can also search for this author in PubMed Google Scholar
Fangfang Kuang
View author publications
You can also search for this author in PubMed Google Scholar
Xuejin Feng
View author publications
You can also search for this author in PubMed Google Scholar
Xiyang Dong
View author publications
You can also search for this author in PubMed Google Scholar
Yinfang Li
View author publications
You can also search for this author in PubMed Google Scholar
Zongze Shao
View author publications
You can also search for this author in PubMed Google Scholar
Min Jin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.J., Z.S., and R.Y.Z. conceived the project. M.J. performed the generation of virome and bulk metagenome. M.Y., M.Z., M.J., and R.C. performed data analyses. M.Y. and M.J. wrote the manuscript with contributions from Z.S., R.Z., Y.H., F.K., X.F., X.D, and Y.L. All authors read and approved the manuscript.

Corresponding authors

Correspondence to Zongze Shao or Min Jin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplemental Data files

Reporting Summary

Source data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yu, M., Zhang, M., Zeng, R. et al. Diversity and potential host-interactions of viruses inhabiting deep-sea seamount sediments. Nat Commun 15, 3228 (2024). https://doi.org/10.1038/s41467-024-47600-1

Download citation

Received: 18 October 2023
Accepted: 04 April 2024
Published: 15 April 2024
DOI: https://doi.org/10.1038/s41467-024-47600-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.