Abstract
The present study illustrates the genetic diversity of four uncultured viral communities from the surface waters of Cochin Estuary (CE), India. Viral diversity inferred using Illumina HiSeq paired-end sequencing using a linker-amplified shotgun library (LASL) revealed different double-stranded DNA (dsDNA) viral communities. The water samples were collected from four stations PR1, PR2, PR3, and PR4, during the pre-monsoon (PRM) season. Analysis of virus families indicated that the Myoviridae was the most common viral community in the CE followed by Siphoviridae and Podoviridae. There were significant (pā<ā0.05) spatial variations in the relative abundance of dominant families in response to the salinity regimes. The relative abundance of Myoviridae and Podoviridae were high in the euryhaline region and Siphoviridae in the mesohaline region of the estuary. The predominant phage type in CE was phages that infected Synechococcus. The viral proteins were found to be involved in major functional activities such as ATP binding, DNA binding, and DNA replication. The study highlights the genetic diversity of dsDNA viral communities and their functional protein predictions from a highly productive estuarine system. Further, the metavirome data generated in this study will enhance the repertoire of publicly available dataset and advance our understanding of estuarine viral ecology.
Similar content being viewed by others
Introduction
Viruses are integral components of the marine microbial loop and are numerically most abundant biological entities in aquatic ecosystems1,2,3. They play significant roles in ecosystem functioning4,5. Apart from their direct impact on ocean biogeochemistry, viral infection significantly alters the structure and function of their prokaryotic and eukaryotic hosts3. Viruses exhibit high levels of host specificity in aquatic environments and are highly diverse in terms of their morphotypes and genotypes6. Despite their numerical abundance and ecological significance, very little is known about estuarine phage biodiversity and biogeography. Studies on marine viral diversity indicate that virioplankton diversity varies with seasonal and spatial variations in physico-chemical parameters7,8. In the past, viral diversity studies were limited compared to their microbial host community diversity studies as there is no single genetic element that is shared in all phage genomes like the bacterial 16S rDNA gene9. Further, the small size and low DNA content of viruses pose significant barriers to microscopic and molecular studies of diversity. However, metagenomics approach allows an in-depth characterization of molecular diversity, genome content, and structure of uncultured viruses, thereby delivering unique insights into the main viral families and their function in marine environments10,11,12,13.
Next generation DNA sequencing has been widely employed in the study of viral metagenomes (viromes) in different aquatic environments including fresh water14,15, oceans10,16,17,18,19 and reused wastewater20. It provides an in-depth and thorough analysis of genomics and proteomics of aquatic viruses from diverse habitats and decipher the role of viruses in aquatic ecology and biogeochemistry. This will give an estimation of the actual size of the viriosphere, information on virus infecting various hosts (both prokaryotic and eukaryotic hosts) and will subsequently contribute to the better understanding of the genetic diversity of life. However, two-thirds of the genes within the reported metavirome cannot be assigned a biological function or taxonomic affiliation, which makes viral species distribution similar among viromes from different environments21. In aquatic systems, the major factors influencing viral community structures are the trophic status, microbial diversity and their abundance. The viral communities change in response to various environmental factors such as temperature, dissolved oxygen, and chlorophyll a22. The viral communities from near shore waters, sediments and deeper oceans have been examined, but only a few reports have addressed the genomes of virioplankton in highly productive estuarine systems.
Earlier reports on viral ecology point out a unique distribution of viruses in the Cochin estuary (CE) in response to changes in host abundance and salinity23. The viral-mediated prokaryotic mortality showed significant seasonal variations with maximum viral-mediated mortality during the dry pre-monsoon season (PRM)24. The viral shunt was highest in the PRM, especially in the mesohaline regions of the estuary24. Previous studies on diversity of phytoplankton and zooplankton suggest that the species diversity, richness and evenness were high during the dry pre-monsoon season in CE due to high water temperature and reduced river run off25,26. Based on this, we chose to study the viral diversity during the dry pre-monsoon season from four different salinity regimes in the CE. We hypothesize that the spatial variations in viral activity could be due to variations in viral communities in the estuary. The present study was carried out to understand the genetic and functional diversity of viruses during the pre-monsoon season when the viral activity is high. This study presents a detailed report on the metaviriome analysis from a highly productive estuarine system.
Results
Environmental Parameters
The samples were collected during the dry pre-monsoon period when the system was highly stratified. The four sampling locations lie in a longitudinal transect between 76Ā°15ā² to 76Ā°25ā²E and in a latitudinal transect between 9Ā°30ā² to 10Ā°10ā²N, along the Cochin estuary in India (Supplementary Fig. 1). Temperature and salinity values ranged from to 31āĀ°C to 32.78āĀ°C and 4.6 to 30 ppt respectively (Table 1). The maximum temperature was noted at PR4 (32.78āĀ°C) and minimum at PR3 (31āĀ°C). The 4 stations exhibited different salinities with PR1 falling in the euryhaline region (29.9 ppt), followed by PR2 (23.36 ppt), PR3 (16.5 ppt), and PR4 (4.7 ppt). At all the locations, the highest/lowest salinity coincided with the highest/lowest tidal amplitude. The average tidal height in the inlet region (PR1) was 0.7ām, which decreased toward the upstream region (PR4) (0.5ām). The Chlorophyll a ranged from 2.36ā12.69āmg/m3 with a maximum Chl a (33.11āmg/m3) at PR2 and least at PR4 (2.36āmg/m3). The dissolved oxygen concentration was high during the study period throughout the estuary with higher values at PR4 (5.51āĀµM). The spatial differences in NO2, NO3, PO4, and SiO4 were significant between all the stations (Table 1).
Biological Parameters
The abundance of prokaryotes and viruses exhibited a distinct spatial pattern in their distribution (Table 1). The viral abundance (VA) ranged from 1.9ā3.2āĆā107 virus-like particles per mL (VLPs mLā1) whereas the prokaryotic abundance was one order of magnitude lesser (2.7ā3.1āĆā106 cells mLā1) than the viral abundance. The values of VA, PA and TVC in the study were comparable with previous reports from the Cochin estuary (CE)23,24. The virus to prokaryote abundance ratio (VPR) was used to examine the relationship between the viral and prokaryotic populations. The VPR ranged from 2.1 to 10.6. The highest and lowest VPR were recorded in the high saline and freshwater regions of the estuary, respectively (Fig. 1).
Analysis of sequences in the Cochin estuary
The denovo assembly generated 1,18,872 contigs at PR1, 1,89,912 contigs at PR2, 2,86,178 contigs at PR3, and 3,02,030 contigs at PR4. The contigs with length >=300ābp were considered for further downstream analysis. The contig lengths for the annotations of each sample are illustrated in Table 2. The viral diversity of the four estuarine samples were different, but it was interesting to see that the PR1 inlet station had the highest alpha diversity comprising 172 unique viruses, followed by PR3 (163 unique viruses), PR4 (158 unique viruses), and PR2 (129 unique viruses), respectively. Detailed information regarding sequencing metadata, assembly metrics, and BLASTx searches is summarized in Table 2.
Taxonomic diversity of viruses in the Cochin estuary
The taxonomic distribution of the assignable sequences greatly diverged among the four estuarine samples. Based on the relative occurrence values (GG PLOT2 R-package) at the family level, 18 families of double-stranded DNA (dsDNA) viruses were obtained (Fig. 2, Supplementary Figs 2 and 3). The order Caudovirales, known as tailed bacteriophages, was the most dominant order among the viruses annotated in this study in all the stations. Myoviridae was the most abundant family in all four stations, with highest dominance at PR1 (72.10% at PR1, 40.84% at PR2, and 48.05% at PR3) and was the least abundant in the freshwater region (23.69% at PR4). Siphoviridae was the second most abundant family, constituting about 6.45% at PR1, 36.27% at PR2, followed by 24.6% at PR3, and 20.43% at PR4. Family Podoviridae was least represented (7.68%, 3.26%, 3.21%, and 2.5% at PR1, PR2, PR3, and PR4, respectively) compared to Siphoviridae (Supplementary Fig. 3aād). However, more than 60% of the sequences at PR4, 18.8% at PR3, 7.8% at PR2, and 12.1% at PR1 did not show any similarity to any of the known virus species, which were considered as viral dark matter.
In the present study, the dominant families were similar in all four locations. An RDA plot was generated to understand the factors influencing the abundances of viruses and prokaryotes, TVC, and distribution of major families such as Myoviridae, Podoviridae, Siphoviridae, Poxviride, and Phydodnaviridae, along with various physico-chemical variables (Fig. 3). The salinity was superimposed onto this RDA plot to determine the impact of salinity on the distribution of different viral families in the CE. Virus to prokaryote ratio (VPR) was high in the euryhaline region of the estuary. There were spatial variations in the relative abundances of dominant families, with Myoviridae being the most dominant family in all locations. However, the relative abundance of Podoviridae and Siphoviridae varied in different regions of the estuary. The highest abundance of Siphoviridae was recorded at PR2 where the salinity was 23 ppt, while the lowest abundance was recorded in the euryhaline region (PR1) of the estuary. In contrast, Podoviridae was more abundant in the euryhaline region (PR1) and least abundant in the freshwater region (PR4) of the estuary.
There was also smaller representation of sequences in the four estuarine viromes belonging to families Baculoviridae (insects and other arthropods), Marseilleviridae (amoebal viruses, also found in humans), Mimiviridae (gaint marine protists viruses), and Phycodnaviridae (large double stranded DNA viruses that infect marine or freshwater eukaryotic algae). However, minor families showed unique spatial distribution, for e.g. Nudiviridae (viruses of insects and marine crustaceans) in PR3, and Ascoviridae (viruses of invertebrates) in PR4. Herpesvirales, responsible for causing diseases in animals and humans, were also found in the CE. However, in lower resolution, Poxviridae, Alloherpesviridae, and Iridoviridae, were also present (Supplementary Fig. 3)
The Synechococcus phage was the most dominant phage at all the four locations contributing to 70.3% in PR1, 42.8% in PR2, 41.1% in PR3, and 39.6% in PR4, respectively. The other major phages at PR1 were Prochlorococcus phage (8.5%), Cyanophage (6.9%), Pelagibacter phage (4.5%), and Pseudomonas phage (1.9%), whereas at PR2, Pseudomonas phage (5.9%), Cyanophage (3.9%), Bacillus phage (3.6%), and Mycobacterium phage (3.6%) were dominant. Similarly, in PR3, Pseudomonas phage (7.1%), Cyanophage (4.41%), Prochlorococcus phage (3.2%), and Mycobacterium phage (3%) were dominant, whereas, in PR4, Pseudomonas phage (6.3%), Erwinia phage (4.3%), Mycobacterium phage (4.3%), Prochlorococcus phage (3.8%), and Cyanophage (3.6%) were dominant (Fig. 4). The maximum hit reads of the top ten viruses are represented in Fig. 4. Synechococcus phages S-SM2 and S-SKS1 were dominant in PR1 and PR2, respectively, whereas Synechococcus phage S-CAM9 was dominant in both PR3 and PR4 (Fig. 4 and Supplementary Fig. 4). There were 66 unique viruses in PR1, 26 in PR2, 35 in PR3 and 47 in PR4. A total of 45 viruses were shared by all the four stations (Fig. 5).
Functional predictions of viral proteins
The functional prediction of the metavirome of four estuarine samples was performed based on structural and functional genes. The total percentage of annotated proteins was 25.32% for PR1 but was comparatively low for other stations (12.03%, 11.4%, and 15.41% for PR2, PR3, and PR4, respectively). The functional categories were assigned mainly to three major groups such as molecular, biological, and cellular functions.
The major molecular functions of viral genes included DNA binding (13.47%, 11.9%, 14.08%, and 9.01% for PR1, PR2, PR3, and PR4, respectively of annotated functions), followed by ATP binding (14.94%, 20.63%, 17.89%, and 16.68% for PR1, PR2, PR3, and PR4, respectively). Additionally, genes involved in activities such as ATPase, DNA polymerase, hydrolase, helicase activity, nucleotide binding, endo and exo nuclease activity, RNA binding, nucleic acid binding, etc. were also detected in different viral groups. The major biological functions were DNA replication (contributing to 11.7%, 7.14%, 9.68%, and 11.26% for PR1, PR2, PR3, and PR4 respectively of annotated functions), followed by oxidation reduction process (7.84%, 3.57%, 3.23%, and 7.21% for PR1, PR2, PR3, and PR4, respectively of annotated functions). The other biological functions performed by viruses included DNA repair, DNA integration, RNA processing, protein folding, glycine biosynthetic process, dTMP biosynthetic process, and photosynthetic electron transport in photosystem II. The major cellular functions included genes involved in integral component of membrane, viral capsid, host cell cytoplasm, host chromosome, and host cell nucleus (Fig. 6).
Spatial changes of viruses in CE
Heat map analysis of viral genes indicated that the spatial variation in the dominant families did not demonstrate significant changes, but the less abundant viruses varied with stations (Supplementary Fig. 5). The most dominant viruses were cyanophages such as Synechococcus phage in all the stations, followed by Prochlorococcs phage in PR1, Pseudomonas phage in PR2 and PR3, and Erwinia phage in PR4. Puniceispirillum phage, Chrysochromulina ericina virus, and Yellowstone lakemimivirus were present only in PR1. Xanthomonas citri phage, Simbu virus, Azospirillum phage were present in PR2, Orgyia pseudotsugata, Acanthamoeba polyphaga mimivirus, Actinoplanes phage were present in PR3, and Bromus catharticus striate mosaic virus, Choristoneura occidentalis, and Cyprinid herpes virus were present in PR4. The PCoA and heat map analysis showed that PR1 and PR2 formed a different cluster from PR3 and PR4 (Fig. 5, Supplementary Fig. 5).
Global comparison of viral metagenomes from different environments
The metavirome from the CE was compared with metaviromes from both similar and dissimilar environments, including marine, estuarine, fresh water, hyper saline water, fish pond water, and waste water based on previously published datasets. Marine and estuarine samples included metavirome data from the Chesapeake Bay, Tampa Bay, Skan Bay, Bay of British Columbia, Red Sea, Sargasso Sea, Mediterranean Sea, Arctic Sea, Gulf of Mexico, Indian Ocean, Atlantic Ocean, and Pacific Ocean. Overall, we found that global metaviromes showed a major proportion of Myoviridae, followed by Siphoviridae and Podoviridae. Within the dsDNA viruses, members of rare taxonomic groupings such as the genera Alloherpesviridae, Alphabaculovirus, Betabaculovirus, Betanudivirus, Cavemovirus, Chordopoxvirinae, Herpesviridae, Chlorovirus, Chordopoxvirinae, Phaeovirus, Prasinovirus, Prymnesiovirus, and Ranavirus were detected in the CE. These minor groups were not detected in other MG-RAST metavirome sequence datasets used for comparison. The principle component analysis (PCA) revealed the viral communities consistently clustered according to their similarity, significantly separating with dissimilar viral communities (Fig. 7). The metavirome from the Cochin estuary were quite similar to each other. The PCA indicated that CE viromes were closely related to metavirome of Salton Sea, California and Skan Bay. Salton Sea is a eutrophic lake with salinities ranging from freshwater, brackish to hypersaline waters. This lake is also characterized by high nutrient loading resulting in algal blooms throughout the year. Metavirome from coastal California and Skan Bays also represented from nutrient rich coastal environments. The results suggest similarity in metavirome data from similar environments.
Discussion
The biogeography of specific viruses or viral sequences is widely unknown. As a result of methodological limitations, only a limited number of studies, especially from estuarine environments are reported for viral metagenome. The present study investigates the viral diversity using the metavirome approach with a linker amplified shotgun library (LASL). In this method, the sheared viral DNA is ligated with an adapter or linker, which can ligate only to double āstranded DNA (dsDNA). Hence LASL is used for the amplification of double stranded DNA and has been widely employed for deciphering the marine viral metagenomes11,12,27. However, another method, known as the multiple displacement amplification (MDA) has also been applied in marine viral metagenome studies to specifically amplify single stranded viruses. Therefore, these two amplification methods have been employed in metaviriome studies to reveal different aspects of viral diversity. Majority of metavirome studies have focused on ds-DNA containing viruses, although other nucleic acid type viruses are also studied28,29,30. However, since MDA has several disadvantages such as the production of biases through the formation of chimeras31 and quantitative biases32, we resorted to the use of LASL for our study, which gives information on dsDNA viruses alone. Further studies should be performed to estimate the relative abundance of ssDNA and dsDNA by using unamplified viral metagenomes or by using unbiased amplification methods in this estuary.
The results of our study illustrate that over 80% of the metavirome sequences were similar to the tailed bacteriophages belonging to the order Caudovirales. Previous reports on aquatic viriomes suggested high prevalence of Caudovirales in the Sargasso Sea33, Iquique- Chile34, Lake Pavin and Lake Bourget in France35. However, the Tara Oceans Expedition36 and Southern Indian Ocean Expedition37 report the dominance of non-tailed viruses using metagenomic approaches. Myoviridae was the most dominant family throughout the CE, followed by Siphoviridae and Podoviridae. There were spatial variations in the relative abundance of these major families in the CE. The dominance of Myoviridae indicated that bacteria were the most important host species. This correlates with our earlier reports from the CE on the high prokaryotic abundance, high contribution of viral lysis/viral shunt to the dissolve organic carbon pool during the high saline pre-monsoon season24. High abundance of bacteriophages in CE suggests that they are not only predators of bacteria but also play significant roles in the ecology and biogeochemistry of this ecosystem24. The rich organic pollutants in this estuary support high bacterial respiration and a low bacterial growth efficiency. An increase in ionic strength during the pre-monsoon season also alters the organic matter-inorganic matter association/interaction. The resultant increase in the bioavailability of the organic matter increases the bacterial metabolism and bacterial respiration5. Thus, viral lytic activity, being host-dependent, is high during the highly saline pre-monsoon season in the CE. This substantiates the dominance of Myoviridae, the virulent broad host-range viruses, throughout this estuary. Our results were comparable with the viral diversity at the Chesapeake Bay, where Myoviridae was the most dominant family followed by Podoviridae, and Siphoviridae12.
The spatial variation among dominant families seemed to be strongly influenced by salinity regimes in the CE. Myoviridae was dominant at all the locations, Podoviridae was relatively more abundant in the euryhaline region and Siphoviridae in the mesohaline region of the estuary. The prokaryotic abundance was high ranging from 2.7āāā3.1āĆā106 cells mLā1. Previous studies from this estuary have reported high prokaryotic abundance and our recent study demonstrated prokaryotic abundance as the best predictor variable determining the abundance of viruses in the CE23,24,38. Hence, it is possible that bacteriophages are the most dominant families in this estuary. Bacteriophages are the overwhelming viral types captured in metavirome investigations of water samples39,40. Previous reports on aquatic viromes suggested high prevalence of Caudovirales in the marine environments of Sargasso Sea33, and Iquique- Chile34. Recent reports from other aquatic environments also suggest a high dominance of Myoviridae, such as from Goseong Bay (South Sea, Korea)41, and some freshwater lakes42,43. Similar dominance of bacteriophages making up most of the viral fraction has been reported previously11,34,35,44,45,46.
In the present study, the relative contribution of the dominant families Myoviridae, Podoviridae, and Siphoviridae varied spatially. Siphoviridae formed the second dominant family after Myoviridae indicating the presence of temperate phages. Though some metagenomics studies report that āsiphophages are the most abundant genome arrangement on earthā11,47,48, studies from Chesapeake Bay report on very low percentages of siphophages, especially during warm productive summer months12. Both sipho- and podo-phages infect a narrow range of host species49 and are also referred to as āspecialist phagesā. However, podoviruses were least represented in the present study, probably because of their small genome sizes. Metavirome analysis from four oceanic regions suggests that the marine viral āspeciesā are globally distributed, but the relative abundance of viral genotypes fluctuates between specific ecosystems10. However, the metagenomics studies during the Global Ocean Sampling (GOS) indicated that among Caudovirales, myoviruses are ubiquitously distributed, whereas, podo- and siphoviruses are more geographically isolated50. In addition, the distribution of tailed bacteriophages is also influenced by environmental factors in the worldās oceans. The abundance of podoviruses is positively correlated with salinity, whereas abundance of myoviruses was more dependent on temperature36.
Classification of metagenomics sequences indicated that Synechococcus phage was the most commonly detected phage throughout the estuary. Recent studies from CE reported the abundance of Synechococcus species during pre-monsoon period51. The ācyanophagesā, especially Synechoccus phage, are more abundant in many aquatic environments52,53. They play significant roles by participating in the maintenance of community diversity, abundance, and seasonal succession of their hosts54,55,56,57,58 mostly by ākilling the winner hypothesisā59, and through the movement of genes throughout the host population60,61,62. Phylogenetic and metagenomics studies also demonstrate the presence of endemic populations of Synechoccus phages from Chesapeake Bay12. In Chesapeake Bay, cyanophage assemblage was dominated by small-genome, narrow host range cyanopodophages.
Determination of functional activity is important for understanding and manipulating ecosystems. A wide range of molecular, biological, and cellular functions were found in viromes in the CE. The major annotated molecular functions were ATP binding and DNA binding. A recent report of viral genetic diversity from a mangrove origin found that viruses there possessed a high number of genes for molecular function, such as ATPase, single-stranded DNA-binding protein, DNA ligases, helicase and several nucleases; these functions were required for viral replication inside the host cell63. The environment in which the organisms live determines the metabolism or functional activity of viruses in that particular environment. Most of the viral functional diversities are similar for all the communities, but their relative occurrence varied based on the biogeochemical conditions of the environment19. One of the main biological functions performed by viruses in the CE included photosynthetic electron transport in photosystem II. Many studies have reported on the presence of photosystem II core reaction center protein DI, encoded by psbA gene in marine cyanophages12,61,64. Psb A encoding genes are known to be transcribed during lytic infection65. It is reported that 88% of the cyanophages in Chesapeake Bay carry psbA gene mainly for maintaining host photosystem functionality during infection. The susceptibility of cyanophages to carry these genes concur with the host specificity and/or genome size of a given strain62. Both broad- host-range cyanomyoviruses and narrow host range cyanopodo- and cyanosipho-viruses possess photosystem II core reaction center protein.
Conclusion
Metavirome sequencing from the Cochin estuary provides a fundamental insight into the viral diversity in a highly productive tropical monsoonal estuary. Our study demonstrated that the most dominant family in this estuary were Myoviridae, Siphoviridae, and Podoviridae. Functional predictions of viral proteins suggested important molecular, cellular and biological functions such as ATP binding, DNA binding, ATPase, DNA polymerase, hydrolase, helicase activity, endo and exo nuclease activity, DNA repair, DNA integration, and photosynthetic electron transport in photosystem II. However, a large percentage of viral sequences were unclassified especially in the freshwater region, PR4 of the estuary. Our study also demonstrated spatial variability in the relative abundances of dsDNA viruses in relation to the different salinity regimes of the estuary. This data is immensely valuable in enhancing our understanding about viruses in a tropical highly productive estuarine environment. However, the results of this study are limited by only dsDNA viruses (LASL method) and large amount of unknown sequences without any similarity with the known sequences in the database. Future studies in CE must include additional approaches to target ssDNA and RNA viruses.
Materials and Methodology
Study site and sampling
Cochin estuary (CE) is an oxbow-shaped and one of the largest tropical estuaries in India (256 km2). It receives ~2āĆā1010 m3 yearā1 of fresh water from six rivers (Periyar, Pamba,Achankovil, Manimala, Meenachil and Muvattupuzha) and salinity incursion from the Arabian Sea66. The estuary opens to the Arabian Sea through two inlets- Munambam inlet (150ām wide) and the Cochin inlet (450ām) (Fig. 1). The annual rainfall of the region is around 320āmm, of which nearly 60% occurs during the southwest monsoon (JuneāSeptember). During the pre-monsoon season (FebruaryāMay), the increased tidal activity modifies the flushing characteristics of the estuary67. The average tidal range of the estuary is 1ām. During this season high saline waters from the Arabian Sea enter the estuary through the inlets and lower reaches of the estuary acts an extension of the Arabian Sea68. Along the entire stretch of CE, water quality varies depending upon the region-specific human activities. Previous studies on biodiversity suggest that the species diversity, richness and evenness were high during the dry pre-monsoon season in CE25,26. This could be due to reduced freshwater flow during the pre-monsoon season resulting in a warmer estuary with reduced turbidity (due to low run off) and high solar radiation, eventually facilitating high biological production. Accordingly, samples were collected during pre-monsoon from four stations (Supplementary Fig. 1). Station PR1 was located at the Cochin inlet which represented a highly dynamic region receiving high saline waters from the Arabian Sea. Station PR2 was located ā¼8ākm to the south of the inlet, where lot of industrial wastes are released. Station PR3 was located on the northern side of the estuary adjacent to the vast area of aquaculture activities, whereas station PR4 was located at the southern end of the estuary which receives substantial amount of agricultural wastes. Water samples were collected from 0.5ām depth using Niskin water samplers in the month of March, transferred to sterile acid-washed bottles, and brought to the laboratory within 2āhours of collection to be analyzed for various parameters.
Environmental parameters
The physicochemical parameters such as temperature and salinity were measured using a conductivity temperature density profiler (SBE Seabird 19 CTD, Seabird Scientific, USA) with accuracy of Ā±0.001āĀ°C for temperature and Ā±0.001āS/m for conductivity. Salinity was also measured using an Autosal (Guild line) for correcting the CTD salinity. The CTD profiler is pre-calibrated and calibrated periodically by the manufacturer, Seabird. Water samples were brought to the laboratory within one hour of collection and analyzed for dissolved inorganic nutrients, such as nitrate (NO3-N), nitrite (NO2-N), ammonia (NH4-N), phosphate (PO4-P), and silicate (SiO4-Si), spectrophotometrically following standard procedures69. The dissolved oxygen (DO) was estimated by Winklerās method. Chlorophyll a was measured by filtering 500āml water samples through GF/F filters. The pigments concentrated on the filters were extracted with 90% acetone for 24āh in the dark at 4āĀ°C70, and the fluorescence was measured using a fluorometer (Model 7200-000, Turner Designs, Trilogy, USA). The fluorometer was calibrated using known standards, twice a month (Sigma, USA).
Enumeration of viruses (VA), prokaryotes (PA) and Total viable count (TVC)
Water samples collected using 5-liter Niskin bottles were immediately transferred into 50-ml centrifuge tubes and stored on ice; prior to microscopy, these samples were fixed with formaldehyde (final volume, 2%). Viral particles and bacterial cells were filtered from 1āmL of water sample by gentle vacuum filtration onto a 25-mm diameter, 0.02 Ī¼m pore-size Anodisc (Whatman) and stained with SYBR green I fluorescent dye (Invitrogen, CA, USA) as previously described71. The filter was air dried on absorbent paper and mounted between a slide and a glass coverslip with a special antifading mountant [50% glycerol, 50% PBS - phosphate buffered saline (0.05āM Na2HPO4, 0.85% NaCl, pH 7.5), 0.1% p-phenylene diamine]. When not analyzed immediately, slides were stored at ā20āĀ°C until counting under an epifluorescence microscope (Olympus BX 41, Olympus, Japan). Prokaryotes were distinguished from virus-like particles (VLPs) on the basis of their relative size and brightness71. A blank (sterile 0.02 Ī¼m filtered double distilled water), was routinely examined as a control to check for contamination of the equipment and reagents.
TVC was measured to estimate the physiologically active bacteria72. Briefly, 5āml of water sample was mixed with 50āĀµl of 0.05% yeast extract and 50āĀµl of an antibiotic cocktail (nalidixic acid, pipemidic acid, piromidic acid, and cephalexin). After incubation in the dark for 6āhours, the samples were fixed in 2% formalin, filtered through 0.2āĀµm pore-sized 25āmm diameter black nucleopore filter (Whatman), stained with 100āĀµl of acridine orange (0.1āg/100āml), and enumerated using an epifluorescence microscope.
Viral concentration and processing
A 200-L of water sample was collected using a 10āL Niskin sampler (operated multiple times) to concentrate the viruses for viral metagenomic analysis. Briefly, the water samples were prefiltered through a 5-micron nitex mesh and concentrated to approximately 300āml using a tangential flow filter (TFF) (CDUF001LT-Millipore, 30-kDa cut off). During filtration, pressure was kept below 0.6ābar (10āpsi) to ensure that the microbial cells were not destroyed. The samples were stored at 4āĀ°C until further processing12,34,73.
Sample processing for DNA isolation and sequencing
The TFF viral concentrates were filtered through a 0.22āĀµm sterivex filter (Millipore, USA) to remove any bacterial contamination. The viral fractions were treated with DNase I (20 U/ml at 37āĀ°C for 30āminutes) to eliminate free DNA. DNase-treated samples were further concentrated using a centrifugal concentration filters (Amicon Ultra, Millipore, USA) before DNA extraction. The DNA extraction was performed using DNeasy Power Soil Kit (Qiagen, Germany) as per the manufacturerās instructions. The metagenomic DNA was quantified using a genomic DNA quantification kit and purity was determined using a Nanodrop Spectrophotometer (Nanodrop 2000, Thermofischer Scientific, USA). The integrity of genomic DNA was verified on a 0.8% agarose gel (Sigma-Aldrich, USA) and the gel image was documented using Multi Doc-ITTM Imaging System (Ultraviolet products Ltd, Analytik-jena, USA). DNA was stored at ā20āĀ°C until further downstream processing (https://www.uvp.com/manuals/81021401.pdf). Potential contamination due to prokaryotic and eukaryotic DNA in the viral DNA samples was verified by PCR targeting 16S and 18S rRNA genes. Samples which passed this quality check were subjected to further sequencing analysis.
NGS library preparation
Next generation sequencing (NGS) libraries were prepared by Illumina HiSeq paired-end sequencing by the linker amplified shotgun library (LASL) method using an Illumina-compatible NEXTflex Rapid DNA sequencing kit which targets only dsDNA viruses (BIOO Scientific, Texas, U.S.A.) at Genotypic Technology Pvt. Ltd., Bangalore, India. Briefly, genomic DNA was sheared using Covaris S2 sonicator (Covaris, Massachusetts, USA) to generate approximate fragment size distribution from 150ābp to 400ābp. Fragment size distribution was checked on Agilent 2200 Tape Station and subsequently purified using High Prep magnetic beads (MagBio Genomics, Inc.USA). The purified fragments were end-repaired, adenylated and ligated to Illumina multiplex barcode adaptors as per NEXTFlex Rapid DNA sequencing kit protocol74 http://www.biooscientific.com/Next-Gen-Sequencing/Illumina-DNA-Library-Prep-Kits/DNA-Seq.
The adapters used in the study were the Illumina Universal Adapter: 5ā² AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT-3ā² and Index Adapter: 5ā²-GATCGGAAGAGCACACGTCTGAACTCCAGTCAC [INDEX] ATCTCGTATGCCGTCTTCTGCTTG-3ā². The adapter-ligated DNA was purified using High Prep beads. The resultant fragments were PCR amplified for 12 cycles using Illumina-compatible primers provided in the NEXTFlex Rapid DNA sequencing kit. The final PCR product (i.e. sequencing library) was purified with High Prep beads, followed by library quality control check. The sequencing libraries were quantified by Qubit fluorometer (Thermo Fisher Scientific, MA, USA) which yielded a concentration range of about 1.8ā2.5āng/Āµl. The library fragment size distribution was analyzed on Agilent 2200 TapeStation (Illumina, USA) which showed a range of 200ā700ābp for all libraries74.
Illumina sequencing
The sequencing libraries were molar-normalized and then pooled into a single tube. The pooled sample was then diluted to 4ānM final concentration using resuspension Buffer (RSB ā Illumina, CA, USA). The sample was denatured for 5āminutes using 0.2āN NaOH and neutralized by HT1 Buffer (Illumina, CA, USA). It was then pooled with other libraries prepared for NGS in a ratio dependent on amplicon size/total panel size, desired sequencing depth, and the number of samples pooled in each sub-library. Pooled libraries were further diluted down to a final 12 pM library. Samples were then loaded into an Illumina HiSeq cartridge (Illumina, CA, USA) and run in 2*150 mode on an Illumina HiSeq next generation sequencer (HiSeq. 4000 sequencer, Illumina,CA,USA) (https://www.illumina.com/content/dam/illuminamarketing/documents/products/datasheets/hiseq-3000-4000-specification-sheet-770-2014-057.pdf).
Initial processing of sequence reads
Demultiplexing was completed using bcl2fastq Conversion Software that was embedded in the HiSeq. The automated FASTQC Tool Kit application on Illumina Base Space Labs was used to filter out quality reads, in which quality reads above Q30 were kept for downstream analysis (https://www.illumina.com/documents/products/technotes/technote_Q-Scores.pdf).
Data analysis
The raw reads with adapter sequences and low-quality bases were removed using ABLT perl script (proprietary tool of Genotypic Technology Pvt. Ltd) to trim adapter sequences, low quality bases (phred score <30) and blocks of Ns. The processed high-quality reads (with more than 75% bases having phred score greater than 30 were considered significant for further downstream analysis. The high-throughput metavirome sequencing analysis was computed using the MetaSPADES program. Paired-end reads were assembled using the MetaSPADES-3.7.1 assembler75. Based on the denovo assembly, contigs with length >=300ābp were first aligned against the Refseq bacterial genome database and the unaligned contigs were aligned against NCBI RefSeq viral genome sequences (release version 80) using GBLASTN with 80% sequence identity and 85% average query coverage with an E-value cutoff of 10ā5 for taxonomic classification41,76,77. The unaligned sequences against viruses were further aligned against archeal and fungal Refseq genome sequences downloaded from NCBI with the same release version and with similar cutoff parameters to ensure that only viral sequences were considered for analysis.
Functional diversity of estuarine viruses
The contigs with length >=300ābp were used for viral protein prediction using MetaGeneMark prediction software78. The predicted proteins were homology searched using BLAST based method against a viral specific protein sequences obtained from the Uniprot database and were annotated with a minimum sequence identity 30% (30ā100%), e-value cutoff of 10ā3 with an average sequence coverage of 50%79,80,81.
Statistical analysis
A one-way ANOVA was used to understand significant variations in biological parameters with respect to stations. Redundant analysis (RDA) was used to elucidate the interrelationships between the viral components and their environmental variables. Initially, the data were processed using detrended correspondence analysis (DCA) to select the suitable ordination technique. DCA resulted in an axis gradient length of <2, suggesting that linear multivariate RDA was suitable for the present data82,83, with species correlation scaling as ordination scores. The biological variables were log transformed prior to the analysis. Partial RDA was carried out to identify the environmental parameters contributing more to the explained variation in the biological components. The ordination significance was tested with Monte Carlo permutation tests (499 unrestricted permutations) (pā<ā0.05). The results of the RDA are presented in the form of triplots with stations as points and environmental variables by arrows83.
Venn diagram was generated for all identified taxaās across the samples and the common and the unique organisms were reported. The Bray-Curtis dissimilarity distance matrix (https://docs.scipy.org/doc/scipy/reference/generated/scipy.spatial.distance.pdist.html) was used to generate principal coordinate analysis (PCoA) plots using QIIME84. A clustered row-wise heatmap was generated using the R package NMF, for all the identified species across the samples based on their relative abundance values. The color slab was generated based on the maximum and minimum values in the matrix. Cochin Estuary metaviriome were compared to previously published globally distributed metaviromes. These included datasets from different oceans, namely, Atlantic ocean (MG RAST id; 4722276.3 to 4722285.3), Gulf of Mexico (4440304.3, 4441623.3 to 4441629.3), Salton Sea (4440327.3, 4440328.3), Sargasso Sea (4441624.3, 4440322.3), Arctic sea (4440306.3), Line Islands 4440036.3, 4440038.3, 4440040.3, 4440280.3), Western Sea, Korea (4464802.3, 4464804.3, 4464805.3), Indian Ocean (4722282.3), Red Sea (4722283.3) bays (4440102, 4440330.3, 4440102.3), and Fish pond (4440424.3, 4440412.3, 4440439.3, 4440414.3). We took the top 50 identified identified taxa at family level from the public dataset (available in MG-RAST) and compared with our identified taxaās for PR1 to PR4 samples. A common master table of the identified families was prepared and their corresponding values were fetched. PCA was plotted using R package ggplot2 using log-based normalization on these corresponding values with princomp function. (http://metagenomics.anl.gov/)85.
Nucleotide sequence accession number
The sequence data from this study was submitted to the NCBI Sequence Read Archive (SRA) under accession number, SUB2990896.
References
Azam, F. et al. The ecological role of water-column microbes in the sea. Mar. Ecol. Prog. Ser. 10, 257ā263 (1983).
Suttle, C. A., Chan, A. M. & Cottrell, M. T. Infection of phytoplankton by viruses and reduction of primary productivity. Nature 347, 467ā469 (1990).
Wilhelm, S. W. & Suttle, C. A. Viruses and nutrient cycles in the sea. Bioscience 49, 781ā788 (1999).
Suttle, C. A. Viruses in the sea. Nature 437, 356 (2005).
Weinbauer, M. G. Ecology of prokaryotic viruses. FEMS Microbiol. Rev. 28, 127ā181 (2004).
Paul, J. H. & Sullivan, M. B. Marine phage genomics: what have we learned? Curr Opin Biotechnol. 16, 299ā307 (2005).
Wommack, K. E., Ravel, J., Hill, R. T., Chun, J. & Colwell, R. R. Population dynamics of Chesapeake Bay virioplankton: Total-Community Analysis by Pulsed-Field Gel Electrophoresis. Appl Environ Microbiol 65, 231ā240 (1999a).
Wommack, K. E., Ravel, J., Hill, R. T. & Colwell, R. R. Hybridization analysis of Chesapeake Bay virioplankton. Appl Environ Microbiol 65, 241ā250 (1999b).
Rohwer, F. & Edwards, R. The Phage Proteomic Tree: a genome-based taxonomy for phage. J. Bacteriology 184, 4529ā4535 (2002).
Angly, F. E. et al. The marine viromes of four oceanic regions. PLoS Biol 4(e), 368 (2006).
Breitbart, M. et al. Genomic analysis of uncultured marine viral communities. Proc. Natl. Acad. Sci 99, 14250ā14255 (2002).
Bench, S. R. et al. Metagenomic characterization of Chesapeake Bay virioplankton. Appl. Environ. Microbiol. 73, 7629ā7641 (2007).
Kristensen, D. M., Mushegian, A. R., Dolja, V. V. & Koonin, E. V. New dimensions of the virus world discovered through metagenomics. Trends Microbiol. 18, 11ā9 (2010).
Djikeng, A., Kuzmickas, R., Anderson, N. G. & Spiro, D. J. Metagenomic analysis of RNA viruses in a fresh water lake. PLOS one 4, 7264 (2009).
LĆ³pez-Bueno, A. et al. High diversity of the viral community from an Antarctic lake. Science 326, 858ā861 (2009).
Hewson, I. & Fuhrman, J. A. Covariation between viral parameters with bacterial assemblage richness and diversity in the water column and sediments. Deep Sea Res. 54, 811ā830 (2007).
Hewson, I., Steele, J. A., Capone, D. G. & Fuhrman, J. A. Remarkable heterogeneity in mesoāand bathypelagic bacterioplankton assemblage composition. Limnol Oceanogr. 51, 1274ā1283 (2006).
Breitbart, M., Thompson, L. R., Suttle, C. A. & Sullivan, M. B. Exploring the vast diversity of marine viruses. Oceanogr. 20, 135ā139 (2007).
Dinsdale, E. A. et al. Functional metagenomic profiling of nine biomes. Nature 452, 629ā632 (2008).
Rosario, K., Nilsson, C., Lim, Y. W., Ruan, Y. & Breitbart, M. Metagenomic analysis of viruses in reclaimed water. Environ Microbiol 11, 2806ā2820 (2009).
Zablocki, O., Adriaenssens, E. M. & Cowan, D. Diversity and ecology of viruses in hyperarid desert soils. Appl. Environ. Microbiol 82, 770ā777 (2016).
Brum, J. R. et al. Patterns and ecological drivers of ocean viral communities. Science 348, 1261498 (2015).
Parvathi, A. et al. Effects of hydrography on the distribution of bacteria and virus in Cochin estuary, India. Ecol Res 30, 85ā92 (2015).
Jasna, V. et al. Viral-induced mortality of prokaryotes in a tropical monsoonal estuary. Front Microbiol 8 (2017).
J Jyothibabu, R. et al. Impact of freshwater influx on microzooplankton mediated food web in a tropical estuary (Cochin backwatersāIndia). Estuar. Coast.Shelf. S. 69, 505ā518 (2006).
Madhu, N. V. et al. Monsoonal impact on planktonic standing stock and abundance in a tropical estuary (Cochin backwatersāIndia). Estuar. Coast. Shelf. S. 73, 54ā64 (2007).
Breitbart, M., Miyake, J. H. & Rohwer, F. Global distribution of nearly identical phage-encoded DNA sequences. FEMS microbiol let. 236, 249ā256 (2004).
Culley, A. I., Lang, A. S. & Suttle, C. A. Metagenomic analysis of coastal RNA virus communities. Science 312, 1795ā1798 (2006).
Ng, T. F. F. et al. Discovery of a novel single-stranded DNA virus from a sea turtle fibropapilloma by using viral metagenomics. J Virol 83, 2500ā2509 (2009).
Ng, T. F. F., Suedmeyer, W. K., Wheeler, E., Gulland, F. & Breitbart, M. Novel anellovirus discovered from a mortality event of captive California sea lions. J Gen Virol. 90, 1256ā1261 (2009).
Lasken, R. S. & Stockwell, T. B. Mechanism of chimera formation during the Multiple Displacement Amplification reaction. BMC biotech. 7, 19 (2007).
Yilmaz, S., Allgaier, M. & Hugenholtz, P. Multiple displacement amplification compromises quantitative analysis of metagenomes. Nat Methods 7, 943ā944 (2010).
Angly, F. E. et al. Diuron tolerance and potential degradation by pelagic microbiomes in the Great Barrier Reef lagoon. Peer J 4, e1758 (2016).
Cassman, N. et al. Oxygen minimum zones harbour novel viral communities with low diversity. Environ. Microbiol. 14, 3043ā3065 (2012).
Roux, S., Krupovic, M., Poulet, A., Debroas, D. & Enault, F. Evolution and diversity of the Microviridae viral family through a collection of 81 new complete genomes assembled from virome reads. PloS one 7, e40418 (2012).
Brum, J. R., Schenck, R. O. & Sullivan, M. B. Global morphological analysis of marine viruses shows minimal regional variation and dominance of non-tailed viruses. The ISME journal 7, 1738ā1751 (2013).
Williamson, S. J. et al. Metagenomic exploration of viruses throughout the Indian Ocean. PLoS One 7, e42047 (2012).
Jasna, V. et al. Differential impact of lytic viruses on prokaryotic morphopopulations in a tropical estuarine system (Cochin estuary, India). PloS one 13, e0194020 (2018).
Kim, M. et al. Analytical tools and databases for metagenomics in the next-generation sequencing era. Genomics Inform 11, 102ā113 (2013).
Hatfull, G. F. Dark matter of the biosphere: the amazing world of bacteriophage diversity. J. Virol. 89, 8107ā8110 (2015).
Hwang, J., Park, S. Y., Park, M., Lee, S. & Lee, T. K. Seasonal Dynamics and Metagenomic Characterization of Marine Viruses in Goseong Bay, Korea. PloS one 12, e0169841 (2017).
Cai, L., Zhang, R., He, Y., Feng, X. & Jiao, N. Metagenomic analysis of Virioplankton of the subtropical Jiulong river estuary, China. Viruses 8, 35 (2016).
Tseng, C. H. et al. Microbial and viral metagenomes of a subtropical freshwater reservoir subject to climatic disturbances. ISME journal 7, 2374ā2386 (2013).
Rodriguez-Brito et al. Viral and microbial community dynamics in four aquatic environments. ISME 4, 739 (2010).
Anderson, R. E., William, J. B. & John, A. B. Is the genetic landscape of the deep subsurface biosphere affected by viruses. Front. Microbiol. 2, 219 (2011).
Kauffman, K. M. et al. A major lineage of non-tailed dsDNA viruses as unrecognized killers of marine bacteria. Nature 554, 118 (2018).
Breitbart, M. et al. Metagenomic analyses of an uncultured viral community from human feces. J. Bacteriol. 185, 6220ā6223 (2003).
Edwards, R. A. & Rohwer, F. Viral metagenomics. Nat Rev Microbiol. 3, 504 (2005).
Suttle, C. A. & Chan, A. M. Dynamics and distribution of cyanophages and their effect on marine Synechococcus spp. Appl. Environ. Microbiol. 60, 3167ā3174 (1994).
Williamson, S. J. et al. The Sorcerer II Global Ocean Sampling Expedition: metagenomic characterization of viruses within aquatic microbial samples. PloS one 3, e1456 (2008).
Arya, P. M., Jyothibabu, R., Jagadeesan, L., Lallu, K. R. & Karnan, C. Summer monsoon onset-induced changes of autotrophic pico-and nanoplankton in the largest monsoonal estuary along the west coast of India. Environ. Monit. Assess. 188, 93 (2016).
Lu, J., Chen, F. & Hodson, R. E. Distribution, isolation, host specificity, and diversity of cyanophages infecting marine Synechococcus spp. in river estuaries. Appl. Environ. Microbiol. 67, 3285ā3290 (2001).
Frederickson, C. M., Short, S. M. & Suttle, C. A. The physical environment affects cyanophage communities in British Columbia inlets. Microb Ecol 46, 348ā357 (2003).
Waterbury, J. B. & Valois, F. W. Resistance to co-occurring phages enables marine Synechococcus communities to coexist with cyanophages abundant in seawater. Appl. Environ. Microbiol 59, 3393ā3399 (1993).
Suttle, C. A. Cyanophages and their role in the ecology of cyanobacteria. In The ecology of cyanobacteria Springer, Dordrecht, 563ā589 (2000).
Marston, M. F. & Sallee, J. L. Genetic diversity and temporal variation in the cyanophage community infecting marine Synechococcus species in Rhode Islandās coastal waters. Appl. Environ. Microbiol. 69, 4639ā4647 (2003).
Sullivan, M. B., Waterbury, J. B. & Chisholm, S. W. Cyanophages infecting the oceanic cyanobacterium Prochlorococcus. Nature 424, 1047ā1051 (2003).
Sullivan, M. B., Waterbury, J. B. & Chisholm, S. W. Corrigendum: Cyanophages infecting the oceanic cyanobacterium Prochlorococcus. Nature 426, 584ā584 (2003).
Thingstad, T. F. Elements of a theory for the mechanisms controlling abundance, diversity, and biogeochemical role of lytic bacterial viruses in aquatic systems. Limnol Oceanogr. 45, 1320ā1328 (2000).
Lindell, D. et al. Transfer of photosynthesis genes to and from Prochlorococcus viruses. Proc. Natl. Acad. Sci. USA 101, 11013ā11018 (2004).
Coleman, M. L. et al. Genomic islands and the ecology and evolution of Prochlorococcus. Science 311, 1768ā1770 (2006).
Sullivan, M. B. et al. Prevalence and evolution of core photosystem II genes in marine cyanobacterial viruses and their hosts. PLoS biology 4, e234 (2006).
Philosof, A. et al. Novel abundant oceanic viruses of uncultured marine group II Euryarchaeota. Curr. Biol 27, 1362ā13682017 (2017).
Mann, N. H., Cook, A., Millard, A., Bailey, S. & Clokie, M. Marine ecosystems: bacterial photosynthesis genes in a virus. Nature 424, 741 (2003).
Lindell, D., Jaffe, J. D., Johnson, Z. I., Church, G. M. & Chisholm, S. W. Photosynthesis genes in marine viruses yield proteins during host infection. Nature 438, 86 (2005).
Srinivas, K., Revichandran, C., Maheswaran, P. A., Asharaf, T. M. & Murukesh, N. Propagation of tides in the Cochin estuarine system, southwest coast of India (2003).
Balachandran, K. K., Laluraj, C. M., Martin, G. D., Srinivas, K. & Venugopal, P. āEnvironmental analysis of heavy metal deposition in a flow-restricted tropical estuary and its adjacent shelf. Enviro Forensics 7, 345ā351 (2006).
Madhupratap, M. Status and strategy of zooplankton of tropical Indian estuaries: a review. B Plankton Soc Japan (1987).
Grasshoff, K. Determination of nitrite, nitrate, oxygen, thiosulphate. In K. Grasshoff, M. Ehrhardt and K. Kremling (Eds). Methods of seawater analysis, Verlag Chemie Weinheim, New York, p. 139ā142, 143ā150, 61ā72, 81ā84 (1999).
Parsons, T. R., Maita, Y. & Lalli, C. M. A manual for biological and chemical methods for seawater analysis. 173. Oxford: Pergamon. Oceanogr. 51, 2157ā2169 (1984).
Patel, A. et al. Virus and prokaryote enumeration from planktonic aquatic environments by epifluorescence microscopy with SYBR Green I. Nat Protoc 2, 269ā276 (2007).
Joux, F. & Lebaron, P. Ecological implications of an improved direct viable count method for aquatic bacteria. Appl Environ Microbiol. 63(3643), 3647 (1997).
Wommack, K. E., Sime-Ngando, T., Winget, D. M., Jamindar, S. & Helton, R. R. Filtration-based methods for the collection of viral concentrates from large water samples. MAVE, 110ā117 (2010).
Solonenko, S. A. & Sullivan., M. B. Preparation of metagenomic libraries from naturally occurring marine viruses. Methods Enzymol. 531, 143ā165 (2013).
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. meta SPAdes: a new versatile de novo metagenomics assembler. arXiv preprint arXiv 1604, 03071 (2016).
Daniel et al. Metagenomic analysis of lacustrine viral diversity along a latitudinal transect of the Antarctic Peninsula. FEMS Micro. Ecol. 92 (2016).
Segobola, J., Adriaenssens, E., Tsekoa, T., Rashamuse, K. & Cowan, D. Exploring viral diversity in a unique South African soil habitat. Sci Rep. 8, 111 (2018).
Zhu, W., Lomsadze, A. & Borodovsky, M. Ab initio gene identification in metagenomic sequences. Nucleic Acids Res 38, e132 (2010).
Barrientos-Somarribas, M. et al. Discovering viral genomes in human metagenomic data by predicting unknown protein families. Sci Rep. 8 (2018).
Han, L. L., Yu, D. T., Zhang, L. M., Shen, J. P. & He, J. Z. Genetic and functional diversity of ubiquitous DNA viruses in selected Chinese agricultural soils. Sci Rep. 7, 45142 (2017).
Imchen, M. et al. Searching for signatures across microbial communities: Metagenomic analysis of soil samples from mangrove and other ecosystems. Sci Rep. 71, 8859 (2017).
Birks, H. J. B. Numerical tools in palaeolimnology āprogress, potentialities, and problems. J Paleolimnol. 20, 307ā332 (1998).
Leps, J. & Smilauer, P. S. Multivariate analysis of ecological data using CANOCO. Cambridge, UK: CambridgeUniversity Press (2003).
Caporaso, J. G. et al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods. 7, 335ā336 (2010).
Meyer, F. et al. The metagenomics RAST serverāa public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC bioinform. 9, 386 (2008).
Acknowledgements
The authors are grateful to the Director, NIO, Goa and Dr. T. Pankajakshan, the Scientist- in-charge, NIO (RC), Cochin for their support and advice. Financial support from supra-institutional project SIP 1302 funded by Council for Scientific and Industrial Research (CSIR) is gratefully acknowledged. JV is grateful to Council of Scientific and Industrial Research (CSIR), New Delhi, for financial support for senior research fellowship grant. This is NIO contribution number 6299.
Author information
Authors and Affiliations
Contributions
V.J. and A.P. have collected the samples, performed sample analysis, data analysis and prepared the manuscript. A.D. helped for performing bioinformatics analysis.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisherās note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleās Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleās Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Jasna, V., Parvathi, A. & Dash, A. Genetic and functional diversity of double-stranded DNA viruses in a tropical monsoonal estuary, India. Sci Rep 8, 16036 (2018). https://doi.org/10.1038/s41598-018-34332-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-018-34332-8
Keywords
This article is cited by
-
ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data
Nature Communications (2023)
-
Microbial Diversity in the Indian Ocean Sediments: An Insight into the Distribution and Associated Factors
Current Microbiology (2022)
-
Optimized Metavirome Analysis of Marine DNA Virus Communities for Taxonomic Profiling
Ocean Science Journal (2022)
-
Interaction dynamics and virusāhost range for estuarine actinophages captured by epicPCR
Nature Microbiology (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.