Abstract
Microbes in marine sediments constitute a large percentage of the global marine ecosystem and function to maintain a healthy food web. In continental shelf habitats such as the Gulf of Maine (GoM), relatively little is known of the microbial community abundance, biodiversity, and natural product potential. This report is the first to provide a time-series assessment (2017–2020) of the sediment microbial structure in areas open and closed to fishing within the Stellwagen Bank National Marine Sanctuary (SBNMS). A whole metagenome sequencing (WMS) approach was used to characterize the sediment microbial community. Taxonomic abundance was calculated across seven geographic sites with 14 individual sediment samples collected during the summer and fall seasons. Bioinformatics analyses identified more than 5900 different species across multiple years. Non-metric multidimensional scaling methods and generalized linear models demonstrated that species richness was inversely associated with fishing exposure levels and varied by year. Additionally, the discovery of 12 unique biosynthetic gene clusters (BGCs) collected across sites confirmed the potential for medically relevant natural product discovery in the SBNMS. This study provides a practical assessment of how fishing exposure and temporal trends may affect microbial community structure in a coastal marine sanctuary.
Introduction
Microbial communities on the ocean seafloor contain vast numbers of undiscovered bacterial species1 and harbor more biodiversity than previously recognized2, potentially in response to multiple stressors3. Compared to bacteria living in the water column, sediment bacteria at the surficial layer tend to be more diverse in deep-sea versus shallow continental regions4. These microbes also drive essential benthic processes such as nutrient regeneration, organic matter oxidation, removal of toxins, and production of biosynthetic compounds5. Continental shelf sediment ecosystems are unique since they are exposed to terrestrial-based sources of nutrients and contaminants as well as variations in regional climate conditions6. The Stellwagen Bank National Marine Sanctuary (SBNMS), located off the coast of Massachusetts, offers an ideal model to explore sediment microbial taxa, since it is a vital habitat for threatened marine species and serves as a working sanctuary, providing access to commercial fisheries. The sanctuary is a patchwork of benthic substrate types that are associated with a diverse group of ecosystems. The benthic habitat supports a variety of fish and invertebrate species, such as sand lance, cod, haddock, cephalopods, crustaceans, and annelids7.
The Gulf of Maine (GoM) system is located in the Atlantic continental region of the United States. The GoM is a vital ecosystem that warrants greater understanding. It lies within a geologically passive margin, which has allowed large areas of thick sediment to accumulate, resulting in a relatively shallow shelf. Biodiversity conservation is a key management priority for the SBNMS and for marine sanctuaries worldwide7, yet time-series assessments of microbial taxa are lacking8. There is an urgent need to characterize this vital habitat, especially in protected and fished areas, by reconstructing community profiles to better understand the effects of anthropogenic and environmental changes. Monitoring studies of food web populations in the SBNMS over the past ten years suggest that there are measurable impacts on habitat quality conditions that may threaten benthic communities, including microbial species9, possibly related to the impacts of bottom gear used in commercial fishing. Mobile, bottom-tending fishing gear can directly affect the surface sediment where the more recent microbial communities reside by mechanically altering the structural characteristics and biological components of the seabed, which is a concern for maintaining habitat integrity10,11.
Marine sediment microbial communities near continental shelf regions and similar deep-seafloor sediment ecosystems (below 200 m) are not well documented12,13. Previous reports evaluating the benthic community on a small spatial scale suggest that a relatively high level of biodiversity supports the food web within the SBNMS14, especially when compared to similar habitat in the deep sea15. A key distinction between these two habitats is that nearshore sediment communities can experience more frequent exposure to fishing, and the use of trawls and dredges is considered one of the primary activities that adversely influences the physical and biological characteristics of benthic habitat integrity16. The present report focuses on improving current knowledge of the sediment microbial community structure in the SBNMS in relation to fishing exposure and temporal trends, which may impact abundance and diversity.
According to a prior report, there are diverse species of benthic invertebrates within SBNMS, including communities of sponges and anemones considered rare within the GoM.7 However, sanctuary regulations allow recreational and commercial fishing in some areas, designating the SBNMS as a working sanctuary. Intermittent exposure to commercial fishing activity, including many types of gear (dredges, trawlers, fixed traps), may alter benthic habitats within the SBNMS17. Reports suggest that the biodiversity of the food web remains relatively healthy18; however, concerns have been raised about this status relative to the sediment habitat.
Advances in sequencing of microbial species has enabled the study of metagenomes from environmental samples19. Research has shown that microbial communities harbor biosynthetic gene clusters (BGCs) with potential utility for medicinal purposes20,21. Marine sanctuaries such as the SBNMS serve as a living resource, supporting productive fisheries and providing health-promoting ecosystem services through their potential to host medically relevant bacterial strains that can be used in natural products. Approaches such as 16S ribosomal RNA sequencing show that sediment type may influence microbial community composition22. Traditional clonal cultures and gene sequencing, such as 16S rRNA, produce a profile of diversity in an environmental sample, although these methods can exclude a vast majority of biodiversity23. Conversely, innovative whole metagenomic sequencing (WMS) techniques are valuable in assembling partial genomes to assess less abundant taxa24, which improves diversity estimates across an entire community25. To a considerable degree, metagenomics provides greater coverage in the estimation of the organisms in a sample, thereby providing a powerful tool to characterize unculturable marine sediment bacteria and establish a baseline for ecological monitoring studies26.
The objectives of this study were to characterize the taxonomic relative abundance and biodiversity of sampling sites within the SBNMS that have been exposed to different levels of fishing activity. A metagenomic database was constructed consisting of microbial taxa by geographic site and fishing exposure status from 2017 to 2020. Sites within the SBNMS designated as closed to fishing were compared to those exposed primarily to fishing with fixed gear, designated ‘minimal’ to ‘moderate’ fishing activity according to recent reports9. An additional aim was to assess bacterial species with the highest potential for harboring medically relevant biosynthetic gene clusters (BGCs) encoding secondary metabolites such as nonribosomal peptide synthetases (NRPS), which have antibacterial and antitumor properties.
Methods
Study area and sampling sites
The SBNMS study area is under permit from the National Oceanic and Atmospheric Administration (NOAA). The sanctuary is an 842-square-mile federal marine protected area that stretches from Cape Ann to Cape Cod, Massachusetts, USA. Sampling locations included sites 1 (42.566889°, − 70.486222°), 2 (42.550362° − 70.483694°), 3 (42.533667°, − 70.466763°), 4 (42.525639°, − 70.219722°), 5 (42.572306°, − 70.249528°), 6 (42.583333°, − 70.240278°), and 7 (42.518889°, − 70.219167°) (Fig. 1). The sampling scheme was designed in collaboration with SBNMS scientists. Sites 1, 2 and 3 were designated open to recreational and commercial fishing activities. Sites 4, 5, 6, and 7 were designated as closed to fishing and are part of the Western Gulf of Maine Groundfish Closure Area, where fishing activities are prohibited27. The depth ranged from 80 to 120 m from the sea surface to the benthic surface below the photic zone.
Map of sampling sites. Black symbols indicate open to fishing. Red symbols indicate closed to fishing, 30 m within the closure boundary. Note: Map was created by the authors using ArcGIS Pro 2.8 based from the NOAA Coastal Survey Data (https://www.nauticalcharts.noaa.gov/data/gis-data-and-services.html).
Sediment collection and processing
From 2017–2020, sediment samples were collected using a Van Veen grab from the surficial benthic layer (top 20 cm at the water column interface). The surficial layer at the sediment-water interface represents the more recent microbial community12,28. During the summer season, we sampled sites 1 (2017, 2019) 2 (2017), 3(2017,2018,2019,2020), 4 (2018), 5 (2018), 6 (2019), and 7 (2019). One sediment grab from each site was collected and measured for grain size by particle sieve analysis. From each grab collected across the seven geographic sites, a total of 14 sediment samples were analyzed. Approximately 30 g of sediment were transferred from the grab to a sterile 50 mL Falcon™ tube. The collection tubes were immediately placed on ice in a container onboard until returned to the laboratory. DNA was then extracted from 0.30 g of sediment samples using a PureLink® DNA kit (Thermo Fisher Scientific™). The DNA extract was pooled based on sample replicates, and concentrations were quantified using a Qubit 3.0 fluorometer (Thermo Fisher Scientific). The sample DNA was stored at − 20 °C until sequencing.
Sequencing
Whole metagenome sequencing for taxonomic profiling was performed at the University of New Hampshire, Hubbard Center for Genome Studies (Durham, NH). This method delineates a majority of random fragments from a diverse community sediment sample in a shotgun approach. This technique results in greater sequencing coverage of a wide range of organisms. Library preparation included ligation adapters designed to target short paired ends (2 × 250 bp) and performed on an Illumina™ HiSeq® 2500 using a Cluster Kit v4 (San Diego, CA, United States). Large insert (500 bp) libraries were then created to account for the diversity in the samples and to maximize high-quality assembly of all microbial taxa.
Metagenomic data processing, taxonomic classification, and abundance
All raw sample data underwent initial processing using the metaWRAP pipeline v1.3.229, employing the metaWRAP-Read_qc module. Read quality control (QC) was first carried out using FASTQC v0.11.8 to generate pre-QC reports30. All reads were then subjected to trimming index removal using TrimGalore v0.5.031, and a post-QC report was generated for comparison. Replicates of trimmed and decontaminated forward and reverse reads from each sample site, collected on the same date, were combined for all downstream analysis resulting in a forward and reverse FASTQ file for each. Metagenomic assemblies were then carried out using the metaWRAP-Assembly module employing SPADES v3.13.032, utilizing the metaSPAdes option flag, and assembly reports were generated using the program QUAST v5.0.233. The results of the Kraken2 analysis were processed with Bracken v2.6 (Bayesian re-estimation of abundance after classification with Kraken)34, which allows for the estimation of abundance at multiple taxonomic levels. The abundance of the top five groups at the phylum, class, and genus levels for each sample site was plotted using the ggplot2 v3.3.5 package in R35. The metaWRAP-Kraken2 module was then run on both the combined trimmed and decontaminated reads and the assembly produced by SPADES. Running Kraken on the reads was carried out to determine the taxonomic composition of the communities, while running Kraken on the assemblies was carried out to determine which taxonomic groups were assembled better than others. This was done for each sample site at each date to classify reads at multiple taxonomic levels utilizing the complete NCBI RefSeq bacterial genome library. Subsampling was carried out at the level of 3,500,000 reads to account for differences in the number of reads among sample sites. Additionally, combined abundance across all sample sites was calculated using the combine_bracken_outputs.py script and pooled based on the combined trimmed reads only. The output was visualized in a Sankey diagram with the web application Pavian36.
Genome binning and the detection of BGCs
To identify high-quality metagenomes (MAGs) across sample sites, reads and coassemblies were binned using the metaWRAP Binning module employing 3 different commonly used algorithms: MaxBin 2.037, CONCOCT38, and MetaBAT 239. The resulting bins for each sample site were consolidated into a final bin set with metaWRAP’s Bin_refinement module, retaining only bins that were at least 50% complete exhibiting less than 10% contamination as determined by CheckM40. The resulting MAGs were concatenated and analyzed utilizing Kraken2 to determine the taxonomic classification across all sample sites and visualized as a kronagram using KronaTools41. High-quality MAGs at the level of each sample site were also functionally annotated using the program Prokka42. Finally, annotated genomes were screened for BGCs using the antiSMASH™ pipeline43, and the results were plotted using ggplot244. A radar plot was also created using ggRadar to visualize the proportion of BGCs detected at each site in each year45. Computations were carried out using the University at Albany’s high-performance computing cluster using the default program parameters unless otherwise specified above.
Analysis of species diversity and abundance in the context of fishing exposure
Diversity analyses were first carried out using the R package vegan v2.5–746, and plots were made with ggplot2. The observed, Shannon-Weiner diversity index (H'), Simpson (measure of the probability that randomly selected individuals from a sample will be the same) InvSimpson and Fisher alpha diversity measures were calculated across all samples using the abundance estimates produced by Bracken. These measures were then plotted in the context of open and closed fishing status. The Shannon index measures the relative abundance of species present in a community12. Additional plots were created to illustrate the level of fishing exposure measured on an ordinal scale (delineated as 0 (none), 1 (low), and 2 (moderate)). Next, species richness (total number in a community) was calculated and plotted in the context of these variables. Nonmetric multidimensional scaling (NMDS) using R47,48 and generalized linear models (GLMs)49 were used to elucidate the relationships of species richness and abundance with fishing exposure over time, adjusting for sediment grain type. NMDS and GLM were chosen because these methods are robust to departures from normality and can account for nonlinear assumptions. Generalized linear models introduce a link function around the combination of the explanatory variables, so that non-normal and discrete distributions of Y can be fitted within this model class. In GLMs, the response variable yi is assumed to follow an exponential family distribution with mean μi, which can be a linear or nonlinear function of xiTβ. In our GLM, a logit link function was used with the zero-inflated negative binomial distribution to model abundance. The inverse gaussian distribution with a log link was used in models of biodiversity.
Fishing exposure levels
For the current study, an ordinal fishing exposure level variable was derived from NOAA fisheries vessel monitoring system (VMS) data in a report7, which uses codes to categorize a range of gear types50. The VMS data provide a spatial representation of commercial fishing, which is used as a standard method to account for mobile bottom-tending gear51. VMS uses integrated global positioning systems installed on vessels to transmit, via satellite, the vessel’s location. The classifications were labeled according to the following scale: 0 = no fishing exposure/closed to fishing; 1 = low fishing exposure; 2 = moderate fishing exposure.
Water quality
Water quality parameters (i.e., temperature, salinity, dissolved oxygen, and chlorophyll a) were documented from the Northeastern Regional Association of Coastal Ocean Observing Systems (NERACOOS) proximal to the open-to-fishing sampling sites52. These data provide contextual information relevant to the sampling area as a whole, although the location of the buoys does not permit granular site-level analysis. Based on the NERACOOS data, we examined the daily average standard deviations of 15-min temperature recordings against the times that we sampled, and verified that the value was within 1 SD of the seasonal mean for that year. This allowed us to confirm that there were no outliers or extremely hot or cool sampling days with respect to temperature variations.
Results
Metagenomic data summary and community composition
Protobacteria was the dominant phylum across all sites, representing 54 to 63% of the bacteria identified (Fig. 2). Actinobacteria was also prevalent, representing 18% to 25% of the bacteria identified, whereas Firmicutes, Bacteroidetes and Planctomycetes made up a smaller proportion (ranging from 4–8%, 4–8%, and 4–5%, respectively). At the class level, Gammaproteobacteria was the most prevalent across all sites, representing from 22–30% of the bacteria identified, followed closely by Actinomycetia (17–24%) and then Alphaproteobacteria (14–20%). At the genus level, abundance estimates exhibited far more diversity, although all sites were dominated by a relatively small proportion of Pseudomonas (5–6%) and Streptomyces (4–5%) (Fig. 2). Abundance estimates produced by Bracken combinations were visualized across phyla, classes, orders, families, genera and species (Fig. 3). The abundance estimates detected 5986 species across the 14 sample sites with more than 10 reads, ranging from 5123–5441 species depending on the site examined. The mean number of species with more than 10 reads identified was 5268. Analyses showed that sites 7 and 4 exhibited the highest abundance of species, while site 5 exhibited the lowest (Table S1).
Comparisons of species diversity by site and fishing exposure
Alpha diversity measures across sites and years varied minimally, ranging from 5113–5433 (observed), 7.75–8.06 (Shannon), 0.995–0.999 (Simpson), 221–1378 (InvSimpson), and 782–833 (Fisher) (Fig. 4). When viewing the measures plotted in the context of binary fishing exposure status (open vs. closed), no clear pattern emerged. Species richness ranged from 5178–5386 among sites open to fishing and from 5123–5441 among sites closed to fishing. Sites were classified according to the year and season collected, fishing exposure level and sediment type (Table 1). The NMDS Shepard plot displays the statistics for goodness of fit between ordination distances and observed dissimilarity (Fig. S1). A nonmetric fit R2 value of 0.988 and linear fit R2 value of 0.945 for fishing status were observed.
The results from the multivariable models assessing associations between the Shannon H' Index and fishing exposure using the ordinal variable (none, low, moderate) suggested that higher levels of fishing exposure were marginally associated with lower richness after adjusting for temporal trends (n = 14; β = − 0.006 (SE 0.003); P = 0.0483) (Table 2). Time (year) was inversely associated with the Shannon H' index (β = − 0.006 (SE 0.002); P = 0.0207). Descriptive box plots of the ordinal fishing variable showed modest variation across fishing exposure levels (Fig. 5).
Boxplot of Shannon H'—Index by fishing exposure level (n = 14). The descriptive graphic of fishing exposure level is measured on an ordinal scale delineated as 0 (none), 1 (low), and 2 (moderate). Mean Shannon H' Index (from Table 1) is designated by the red circle.
The results from the GLM assessing counts of species abundance in relation to the ordinal fishing exposure variable over time suggested that higher species abundance was associated with lower levels of fishing exposure (n = 83,804; β = − 0.057, SE 0.013, P < 0.0001 (95% confidence interval (CI) − 0.082, − 0.032); Table S2a. The model was adjusted for year, sediment type, and season. An inverse relationship was observed between year and abundance (β = − 0.075, SE 0.014, P < 0.0001 (95% CI − 0.102, − 0.048)). Generalized Linear Models are appropriate for skewed and clustered data structures because they specify the distribution of the observations, the linear predictor(s), a variance function, and a link function that can be specified by the user53. These features provide more efficiency and reduce bias in the variance parameters. In this study, models of species abundance (a count variable) utilized the zero-inflated negative binomial distribution with a logit link function (Table S2b)54.
Binning of high-quality metagenomes and detection of BGCs
Binning and refinement carried out with the metaWRAP binning module resulted in high-quality bins from 0 to 9 across the 14 samples, resulting in a total of 39,487 contigs. Site 4 collected in 2018 was the only sample that did not produce any bins with high-quality metagenomes (> 50% read complete, < 10% contamination). Combined analysis of the high-quality MAGs with Kraken2 resulted in the identification of 4402 different species across 1756 genera, 79 classes, and 41 phyla, with proportions largely reflecting those produced by abundance estimates (Fig. 6a). The results of the BGC analysis across all high-quality metagenomes resulted in the detection of 121 BGC sequences representing 12 different secondary metabolite clusters (Fig. 6b). Terpene was the most prevalent BGC, with 36 sequences identified, followed by RiPP-like BGCs (27 sequences) and RRE-containing BGCs (22 sequences). All other BGCs ranged from 1–13 sequences identified (overall mean = 10). The radar plot produced with ggRadar exhibits a wide range of proportions of each metabolite identified at each location, with no distinct trends (Fig. 6c). Nevertheless, RiPP-like secondary metabolites appear to make up a comparatively large proportion of BGCs identified at site 1 in 2017, while terpene exhibits the same pattern at site 5 in 2018 and site 7 in 2019.
Taxonomic composition of high-quality metagenomes (MAGs) collected across all sample sites in this study, estimated with Kraken2 and visualized with KronaTools (a). Lollipop plots describing the number of BGCs detected across all sample sites (b). Radar plots exhibiting the proportion of secondary metabolites identified at each sample site (c).
Environmental indicators
Sediment samples were classified into four main categories based on the Barnhardt scale55, which consisted of ≥ 90% of a particular sediment type. Descriptive indicators of water quality trends were evaluated in relation to metagenomic results by site, collection year, fishing exposure, season, and sediment type. Proximal to the open sites, seasonal trends in water temperature and salinity were observed, as both parameters were higher in the fall than in the summer (Table S3). More research is needed to determine whether this result reflects environmental stress to the seafloor habitat, particularly if the levels are above threshold limits for extended periods of time56.
Discussion
The spatial distribution of microbial diversity in marine sediment within a continental shelf region is not well documented.1 For SBNMS managers, a primary focus of habitat quality is the assessment of benthic habitats. The present study is unique as the first time-series report that provides a metagenomic assessment of abundance, biodiversity, and fishing exposure for sediment microbial communities in a National Marine Sanctuary. As a working sanctuary, the SBNMS is a critical fisheries resource, providing ecosystem services for conservation management57. The results underscore the need to improve our understanding of microbial habitat quality in marine protected areas. For example, the trends observed in time and fishing exposure levels highlight the fact that MPAs should be monitored more frequently to assess potentially significant changes in structure and function to support a robust food web58. Environmental stressors such as fishing activity may reduce biodiversity over time59; however, more study is needed to elucidate these complex trends within the benthic invertebrate habitat.
In terms of taxonomic abundance trends, classes Gamma-, Alpha-, Beta-, and Deltaprotobacteria were the dominant taxa, followed by Actinomycetia across all sites. Similar to previous SBNMS reports evaluating coastal sediment communities,7,9 these taxa serve to maintain habitat quality by cycling carbon and processing nutrients60. In terms of genera, Pseudomonas and Streptomyces were the two most dominant types observed. Actinobacteria was also prevalent and has great potential to produce abundant BGCs associated with antimicrobial agents61. The results are consistent with previous research within the SBNMS, which identified class Actinobacteria and species Streptomyces within the open fishing area62. Within the SBNMS, there were 4 sites that showed 50% or greater key secondary metabolites, such as terpene, RiPP-like and T3PKS. These sites were sites 1 and 5, 6 and 7. Streptomyces species were relatively abundant, with 236 different strains identified. The combined use of MAGS with antiSMASH exhibited high per-sequence accuracy and enabled BGC screening that showed a wide distribution of potential medically relevant secondary metabolites. The BGC analysis was performed on binned genomes that were nearly complete. Future research to detect diversity and further BGC characterization within strains is needed, as improved computational methods can inform metagenomic studies63.
The time-series assessment of the taxonomic structure illustrated in the present report fill a gap in the literature for the GoM. Compared to metagenomic studies of oceanic sediments, the biodiversity results from this SBNMS study are relatively similar to continental shelf sediment studies in temperate and subtropical zones at similar depths64. However, relative abundance and diversity measures should be taken in context with respect to surficial versus subseafloor habitat and bioinformatics methods.
Environmental factors such as water temperature suggested a potential restructuring of the microbial community from summer to fall as bottom temperatures increased. Although the seasonal conditions comply with an observable trend for this region65, it is difficult to assess the status of benthic communities due to the lack of consistent water quality monitoring data. Potential water quality impacts on benthic habitats may be associated with variable occurrences of vessel traffic, which can impact microbial communities by trawling activities that resuspend soft sediments and increase turbidity in bottom water habitats66. Although warmer water from the Gulf Stream current regularly influences the GoM during the fall, observations of increased warming and higher salinity levels may contribute to shifts in microbial communities.66 Further study is recommended to assess potential shifts in bacterial assemblages since indirect effects of water temperature67,68 and ocean currents may be influential.
In the present study, species diversity as measured by the Shannon H' index was marginally higher among sites that were closed to fishing than among sites with minimal or moderate levels of fishing activity. Multivariable models assessing counts of species abundance in relation to the ordinal fishing expsoure variable over time suggested that higher species abundance was associated with lower levels of fishing exposure after adjusting for year and sediment type. However, more detailed assessments of fishing activity using continuous variables from VMS data should also be conducted as more data become available. These results underscore the need to monitor the microbial community structure and diversity across the full range of fishing activities, including areas where heavier fishing vessel traffic is more consistent.
The present assessment was limited by the relatively small number of sampling units analyzed and the fact that areas with heavy fishing activity were not sampled. Resource capacity limitations precluded sampling in heavier fishing activity zones that may be exposed to more disruptive bottom gear. Furthermore, the intensity of fishing activity may vary by season and location. Sampling additional sites would have afforded a more powerful assessment of biodiversity trends by site, season, and fishing activity. Another limitation is that the VMS data do not capture data from recreational fishing vessels. Additionally, water quality data from in situ sensors at depth would provide information to assess potential abundance and biodiversity shifts with respect to fishing impact.
Prior research conducted in the northwest corner of Stellwagen Bank documented periodic impacts from fishing exposure6,7. However, it is unclear whether reductions in fishing effort have resulted in reduced risk to healthy populations of higher trophic level organisms (i.e., seabirds, cod, and right whales). Multiple stressors such as pollution and climate change continue to affect these habitats69. In the present study, the data trended toward higher species abundance and higher average biodiversity values in areas with lower levels of fishing activity. This is consistent with NOAA reports documenting lower levels of direct impact in the Western Gulf of Maine Closure Area, suggesting that closure has supported recovery in some areas70. Further analyses with a wider range of sites proximal to monitoring stations and inclusive of areas exposed to heavier fishing activity are needed to assess whether statistically significant trends in microbial community abundance and biodiversity can be consistently observed.
In conclusion, closure areas are critical to evaluating the biodiversity of the microbial benthic habitat, since long-term monitoring is not routinely conducted. The present study revealed important public health implications with respect to protecting fragile marine ecosystems. As recognized by the United Nations Sustainable Development Goals (SDG #17, Life Below Water), marine sanctuaries represent a vast oceanic resource and provide myriad ecosystem services that are important for human health71. These services include the potential for harboring secondary metabolites for natural products and drug discovery72. Collectively, this research underscores the importance of raising awareness about the life-supporting functions of marine sanctuaries and the possibility of engaging citizen scientists and health professionals in the stewardship of these resources.
Data availability
All the metagenomic raw sequencing data from this study has been deposited in the NCBI Sequence Read Archive (SRA). The BioProject accession number for the 2017 data are PRJNA524407 (BioSample SAMN10999726-SAMN10999731) and PRJNA837619 (BioSample SAMN28209579-28209594) for 2018–2020.
References
Pace, N. R. The small things can matter. PLoS Biol. 16(8), e3000009. https://doi.org/10.1371/journal.pbio.3000009 (2018).
Hoshino, T. et al. Global diversity of microbial communities in marine sediment. PNAS 117, 27587–27597 (2020).
Baker, B. J., Appler, K. E. & Gong, X. New microbial biodiversity in marine sediments. Ann. Rev. Mar. Sci. 13, 161–175. https://doi.org/10.1146/annurev-marine-032020-014552 (2021).
Zinger, L. et al. Global patterns of bacterial beta-diversity in seafloor and seawater ecosystems. PLoS ONE 6, e24570. https://doi.org/10.1371/journal.pone.0024570 (2011).
Jiao, N. et al. Microbial production of recalcitrant dissolved organic matter: Long-term carbon storage in the global ocean. Nat. Rev. Microbiol. 8, 593–599 (2010).
Ward, N. D. et al. Representing the function and sensitivity of coastal interfaces in earth system models. Nat. Commun. 11, 2458 (2020).
Cook, R., & Auster, P. J. Developing alternatives for optimal representation of seafloor habitats and associated communities in Stellwagen Bank National Marine Sanctuary. Marine Sanctuaries Conservation Series ONMS-06–02. U.S. Department of Commerce, National Oceanic and Atmospheric Administration, Office of National Marine Sanctuaries, Silver Spring, MD (2006).
Wauchope, H. S. Evaluating impact using time-series data. Trends Ecol. Evol. 36, 3. https://doi.org/10.1016/j.tree.2020.11.001 (2021).
Stellwagen Bank National Marine Sanctuary (SBNMS) Condition Report. Office of National Marine Sanctuaries National Oceanic and Atmospheric Administration. doi:https://doi.org/10.25923/48ZK-BB07. pp. 1–263. (2020).
Grieve, C., Brady, D. C. & Polet, H. Best practices for managing, measuring and mitigating the benthic impacts of fishing—Part 1. Mar. Stewardship Council Sci. Ser. 2, 18–88 (2014).
Watling, L. & Norse, E. A. Disturbance of the seafloor by mobile fishing gear: A comparison to forest clear cutting. Conserv. Biol. 12, 1180–1197 (1998).
Snelgrove, P. V. R. et al. The importance of marine sediment biodiversity in ecosystem processes. Ambio 26, 578–583 (1997).
Grassle, J. F. & Maciolek, N. J. Deep-sea species richness: Regional and local diversity estimates from quantitative bottom samples. Am. Nat. 139, 313–341 (1992).
Polinski, J. M., Bucci, J. P., Gasser, M. & Bodnar, A. G. Targeted metagenomic assessment of biodiversity across prokaryotic and eukaryotic taxa in sediments from the Stellwagen Bank National Marine Sanctuary. Sci. Rep. 9, 14820 (2019).
Petro, C. et al. Microbial community assembly in marine sediments. Aquat. Microb. Ecol. 79, 177–195 (2017).
Cook, R. et al. The substantial first impact of bottom fishing on rare biodiversity hotspots: A dilemma for evidence-based conservation. PLoS ONE 8, e69904 (2013).
Grabowski, J. H. et al. Assessing the vulnerability of marine benthos to fishing gear impacts. Rev. Fisheries Sci. Aquacult. 22, 142–155 (2014).
Silva, T. L. State of the science report: An addendum to the Stellwagen Bank National Marine Sanctuary 2020 Condition Report 1–20 (U.S. Department of Commerce, 2021).
Parks, D. H. et al. Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life. Nat. Microbiol. 2, 1533–1542 (2017).
Bech, P. K. et al. Marine sediments hold an untapped potential for novel taxonomic and bioactive bacterial diversity. MSystems 5, e00782-e820 (2020).
Newman, D. J. & Cragg, G. M. Natural products as sources of new drugs from 1981 to 2014. J. Nat. Prod. 79, 629–661 (2016).
Hou, Z. Geochemical and microbial community attributes in relation to hyporheic zone geological facies. Sci. Rep. 7, 12006 (2017).
Hugenholtz, P., Goebel, B. M. & Pace, N. R. Impact of culture-independent studies on the emerging phylogenetic view of bacterial diversity. J. Bacteriol. 180, 4765–4774 (1998).
Durazzi, F. et al. Comparison between 16S rRNA and shotgun sequencing data for the taxonomic characterization of the gut microbiota. Sci. Rep. 11, 3030 (2021).
Asnicar, F. et al. Precise phylogenetic analysis of microbial isolates and genomes from metagenomes using PhyloPhlAn 3.0. Nat. Commun. 11, 2500 (2020).
Dance, A. The search for microbial dark matter. Nature 582, 301–303. https://doi.org/10.1038/d41586-020-01684-z (2020).
Fishing Restrictions. Magnuson Fishery Conservation and Management Act (MFCMA) (16 U.S.C. Part 1801 et seq.) (1990).
Begon, M., Harper, J. L. & Townsend, C. R. Ecology: Individuals, Populations, and Communities 3rd edn. (Blackwell Science Ltd., 1996).
Uritskiy, G. V., DiRuggiero, J. & Taylor, J. MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome 6, 158 (2018).
Andrews, S. Babraham bioinformatics-FastQC a quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc (2010).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet Next Gen. Sequencing Data Anal. 17, 1 (2011).
Bankevich, A. et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: Quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
Lu, J., Breitwieser, F. P., Thielen, P. & Salzberg, S. L. Bracken: estimating species abundance in metagenomics data. PeerJ Comput. Sci. 2, e104 (2017).
Wickham, H. ggplot2. WIREs Comput. Stat. 3, 180–185 (2011).
Breitwieser, F. P. & Salzberg, S. L. Pavian: Interactive analysis of metagenomics data for microbiome studies and pathogen identification. Bioinformatics 36, 1303–1304 (2020).
Wu, Y. W. et al. MaxBin 2.0: An automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32, 605–607 (2016).
Alneberg, J. et al. CONCOCT: Clustering cONtigs on COverage and ComposiTion. ArXiv 1312, 4038 (2013).
Kang, D. et al. MetaBAT 2: An adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7, e7359 (2019).
Parks, D. H. et al. CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Ondov, B. D., Bergman, N. H. & Phillippy, A. M. Interactive metagenomic visualization in a web browser. BMC Bioinform. 12, 385 (2011).
Seemann, T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).
Blin, K. et al. antiSMASH 6.0: Improving cluster detection and comparison capabilities. Nucleic Acids Res. 49, W29–W35 (2021).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. ISBN 978–3–319–24277–4. (2016).
Moon, K. W. Interactive Plot. In Learn ggplot2 Using Shiny App (ed. Moon, K.-W.) 295–347 (Springer International Publishing, 2016).
Oksanen, J., et al. Package ‘vegan’. Community ecology package, version 2, 1-295 (2013).
Wilkinson, L. SYSTAT. In Wiley Interdisciplinary Reviews: Computational Statistics, Multidimensional Scaling (eds Wegman, E. & Said, Y. H.) (John Wiley & Sons, New York, 2010).
Dexter, E., Rollwagen-Bollens, G. & Bollens, M. The trouble with stress: A flexible method for the evaluation of nonmetric multidimensional scaling. Limnol. Oceanogr. Methods 16, 434–443 (2018).
Longford, N. T. Longitudinal and time-series analysis. In Studying Human Populations. Springer Texts in Statistics (Springer, 2008). https://doi.org/10.1007/978-0-387-73251-0_11.
NOAA Office of Law Enforcement. Speed-filtered vessel monitoring system (VMS) data from Greater. Atlantic VMS Program (2019).
Palmer, M. C., & Wigley, S. E. Validating the stock apportionment of commercial fisheries landings using positional data from vessel monitoring systems (VMS). Northeast Fisheries Science Center Reference Document 07–22. U.S. Department of Commerce, National Oceanic and Atmospheric Administration, National Marine Fisheries Service, Northeast Fisheries Science Center, Woods Hole, MA. (2007).
Northeastern Regional Association of Coastal Ocean Observing Systems Buoy (NERACOOS) Monitoring Program. Portsmouth, NH. www.neracoos.org (2021).
Stroup, W. Generalized Linear Mixed Models: Modern Concepts (Methods and Applications. Taylor & Francis Group, 2013).
Ridout, M. S., Hinde, J. P., & Demétrio, C. G. B. “Models for Count Data with Many Zeros,” in Proceedings of the 19th International Biometric Conference, 179–192, Cape Town. (1998).
Barnhardt, W. A., Kelley, J. T., Dickson, S. M. & Belknap, D. F. Mapping the Gulf of maine with side-scan sonar: A new bottom-type classification for complex seafloors. J. Coast. Res. 14, 646–659 (1998).
Carrier-Belleau, C. et al. Environmental stressors, complex interactions and marine benthic communities’ responses. Sci. Rep. 11, 4194. https://doi.org/10.1038/s41598-021-83533-1 (2021).
Auster, P., Joy, K. & Valentine, P. C. Fish species and community distributions as proxies for seafloor habitat distributions: the Stellwagen Bank National Marine Sanctuary example (Northwest Atlantic, Gulf of Maine). Environ. Biol. Fishes 60, 331–346 (2001).
Solan, M., Raffaelli, D. G., Paterson, D. M., White, P. C. L. & Pierce, G. J. Marine biodiversity and ecosystem function: Empirical approaches and future research needs. Mar. Ecol. Prog. Ser. 311, 175–178 (2006).
Worm, B. et al. Impacts of biodiversity loss on ocean ecosystem services. Science 314, 787–790 (2006).
Dyksma, S. et al. Ubiquitous Gammaproteobacteria dominate dark carbon fixation in coastal sediments. ISME J. 10, 1939–1953 (2016).
Tuttle, R. N. et al. Detection of natural products and their producers in ocean sediments. Appl. Environ. Microbiol. 85, e02830-e2918 (2019).
Heinrichs, L., Aytur, S. A. & Bucci, J. P. Whole metagenomic sequencing to characterize the sediment microbial community within the Stellwagen Bank National Marine Sanctuary and preliminary biosynthetic gene cluster screening of Streptomyces scabrisporus. Mar. Genom. 50, 100718 (2020).
Belknap, K. C. et al. Genome mining of biosynthetic and chemotherapeutic gene clusters in Streptomyces bacteria. Sci. Rep. 10, 2003. https://doi.org/10.1038/s41598-020-58904-9 (2020).
Sánchez-Soto Jiménez, M. F., Cerqueda-García, D., Montero-Muñoz, J. L., Aguirre-Macedo, M. L. & García-Maldonado, J. Q. Assessment of the bacterial community structure in shallow and deep sediments of the Perdido Fold Belt region in the Gulf of Mexico. PeerJ 6, e5583. https://doi.org/10.7717/peerj (2018).
Pershing, A. J. et al. Slow adaptation in the face of rapid warming leads to collapse of the Gulf of Maine cod fishery. Science 62, 809–812 (2015).
Pittman, S. J. Relevance of the Northeast Integrated Ecosystem Assessment for the Stellwagen Bank National Marine Sanctuary Condition Report (2007–2018) Marine Sanctuaries Conservation Science Series ONMS-19–08. U.S. Department of Commerce, National Oceanic and Atmospheric Administration, Office of National Marine Sanctuaries, Silver Spring, MD. (2019).
Bucci, J. P., Szempruch, A. J., Caldwell, J. M., Ellis, C. & Levine, J. F. Seasonal changes in microbial community structure in freshwater stream sediment in a North Carolina River Basin. Diversity 6, 18–32 (2014).
Won, N. I., Kim, K. H., Kang, J. H., Park, S. R. & Lee, H. J. Exploring the impacts of anthropogenic disturbance on seawater and sediment microbial communities in korean coastal waters using metagenomics analysis. Int. J. Environ. Res. Public Health 14, 130 (2017).
Zinger, L. et al. Global patterns of bacterial beta-diversity in seafloor and seawater ecosystems. PLoS ONE 6(9), e24570 (2011).
Auster, P., Lindholm, J., Cramer, A., Nenandovic, M., Prindle, C., & Tamsett, A. The seafloor habitat recovery monitoring project (SHRMP) at Stellwagen Bank National Marine Sanctuary. Final Project Report. (2013b).
UN General Assembly, Transforming our world: The 2030 Agenda for Sustainable Development, 21 October, A/RES/70/1, available at: https://www.refworld.org/docid/57b6e3e44.html. (2015).
Malve, H. Exploring the ocean for new drug developments: Marine pharmacology. J. Pharm. Bioall. Sci. 8, 83–91. https://doi.org/10.4103/0975-7406.171700 (2016).
Acknowledgements
We thank Dave Wiley, Ben Cowie-Haskell and Mike Thompson at NOAA, SBNMS for their assistance with sample site location. We thank Kent Hubbard and John Puleo for their assistance with sample collection. We also thank Kelley Thomas and Steve Simpson at the Hubbard Center for Genome Studies, University of New Hampshire, for their assistance with metagenomic sequencing methods. Technical support was provided by the Marine Microverse Institute (MMI).
Author information
Authors and Affiliations
Contributions
S.A.B. performed all bioinformatics analyses and assisted in manuscript writing. S.A.A. performed biostatistical analysis and contributed to data interpretation and manuscript writing. C.P.A. assisted with data interpretation and manuscript writing. J.P.B. designed the original project idea, led the data collection and prepared samples for molecular sequencing, and contributed to data analysis, interpretation and writing. All authors read, discussed, edited and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bruce, S.A., Aytur, S.A., Andam, C.P. et al. Metagenomics to characterize sediment microbial biodiversity associated with fishing exposure within the Stellwagen Bank National Marine Sanctuary. Sci Rep 12, 9499 (2022). https://doi.org/10.1038/s41598-022-13409-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-13409-5
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.