Carbohydrates and carbohydrate degradation gene abundance and transcription in Atlantic waters of the Arctic

Priest, Taylor; Vidal-Melgosa, Silvia; Hehemann, Jan-Hendrik; Amann, Rudolf; Fuchs, Bernhard M.

doi:10.1038/s43705-023-00324-7

Download PDF

Article
Open access
Published: 09 December 2023

Carbohydrates and carbohydrate degradation gene abundance and transcription in Atlantic waters of the Arctic

ISME Communications volume 3, Article number: 130 (2023) Cite this article

1670 Accesses
3 Citations
15 Altmetric
Metrics details

Subjects

Abstract

Carbohydrates are chemically and structurally diverse, represent a substantial fraction of marine organic matter and are key substrates for heterotrophic microbes. Studies on carbohydrate utilisation by marine microbes have been centred on phytoplankton blooms in temperate regions, while far less is known from high-latitude waters and during later seasonal stages. Here, we combine glycan microarrays and analytical chromatography with metagenomics and metatranscriptomics to show the spatial heterogeneity in glycan distribution and potential carbohydrate utilisation by microbes in Atlantic waters of the Arctic. The composition and abundance of monomers and glycan structures in POM varied with location and depth. Complex fucose-containing sulfated polysaccharides, known to accumulate in the ocean, were consistently detected, while the more labile β-1,3-glucan exhibited a patchy distribution. Through ‘omics analysis, we identify variations in the abundance and transcription of carbohydrate degradation-related genes across samples at the community and population level. The populations contributing the most to transcription were taxonomically related to those known as primary responders and key carbohydrate degraders in temperate ecosystems, such as NS4 Marine Group and Formosa. The unique transcription profiles for these populations suggest distinct substrate utilisation potentials, with predicted glycan targets corresponding to those structurally identified in POM from the same sampling sites. By combining cutting-edge technologies and protocols, we provide insights into the carbohydrate component of the carbon cycle in the Arctic during late summer and present a high-quality dataset that will be of great value for future comparative analyses.

Biogeographic response of marine plankton to Cenozoic environmental changes

Article 17 April 2024

Unveiling unique microbial nitrogen cycling and nitrification driver in coastal Antarctica

Article Open access 12 April 2024

Lineage dynamics of the endosymbiotic cell type in the soft coral Xenia

Article Open access 17 June 2020

Introduction

Marine carbohydrates are chemically and structurally diverse, and represent a substantial fraction of characterised organic matter [1]. The diversity of glycans emerges from the alternative linkage types, alpha and beta, of carbon atoms between more than ten available monomers along with substitutions by a range of other chemical groups [2]. Micro- and macro-algae are the primary synthesisers of glycans in the ocean, wherein they serve structural, storage and protective functions. Glycans can constitute between 13 – 90% of algal carbon [3]. Marine glycans range from low-molecular weight (LMW) oligosaccharides to complex high-molecular weight (HMW) polysaccharides, with varying composition across taxa, life-cycle stage and environmental conditions [4, 5]. Through exudation, cell death and lysis, various glycans are released and integrated into the particulate and dissolved organic matter pools (POM and DOM) [6,7,8], which are separated based on the size of particles. Once released or outside the cell, glycans can become substrates for heterotrophic microbes.

Carbohydrate utilisation is common in bacteria and archaea, but the mechanisms employed and the degradative capabilities vary [9,10,11]. Many species take up mono-, di- and trisaccharides into the cell through porins or transporters, while longer oligosaccharides require specialised systems, such as TonB-dependent transporters (TBDTs) or other outer membrane proteins. Usually these have a high specificity to discrete glycan structures [12]. For polysaccharides, microbes must first depolymerise the structure extracellularly with excreted or outer membrane-bound glycoside hydrolases (GHs) or polysaccharide lyases (PLs), followed by uptake of the oligosaccharides and subsequent cleavage in the periplasm [12, 13]. These enzyme classes, together with carbohydrate-binding modules (CBMs) and carbohydrate esterases (CEs), are collectively referred to as carbohydrate-active enzymes (CAZymes). CAZymes are classified into families based on protein sequence similarity, with each family also containing at least one biochemically characterised protein [14, 15]. Many families are monospecific to certain glycosidic linkage types within polysaccharides while others are divided into sub-families based on specificity of target linkages [16]. Unlike those from land plants, algal glycans are often decorated with sulphate esters, which require sulfatase enzymes for complete degradation. The CAZyme, sulfatase and transporter gene profiles thus acts as the blueprint for the glycan degradation potential of microbes [10, 17].

Carbohydrate utilisation by microbial populations exhibit spatial and temporal variations. The rate of hydrolysis and the substrate spectrum of extracellular CAZymes decreases and narrows with depth [18] and with distance to the coast [19]. In addition, a broader spectrum of CAZyme activities is measureable in temperate compared to high-latitude waters, indicating latitudinal gradients [20]. Temporal shifts in CAZyme, sulfatase and transporter gene profiles are also evident following spring phytoplankton blooms [21]. These patterns are congruent with dynamic changes in microbial community composition. In particular, community-level patterns are shaped by the presence and composition of specialised carbohydrate degraders, such as Bacteroidetes. Bacteroidetes typically harbour large CAZyme repertoires [10, 17] and exhibit successional dynamics following spring phytoplankton blooms [21]. These dynamics indicate glycan-based niche partitioning [22, 23]. Detailed assessments on microbial carbohydrate utilisation have been focused on spring phytoplankton blooms in temperate ecosystems, while far less is known about later seasonal stages and higher latitude waters.

In this study, we combine analytical techniques with meta’omics approaches to explore the distribution of glycans and their potential utilisation by microbes in Atlantic waters of the Arctic during late summer. The carbohydrate composition of particulate organic matter (POM) in the upper euphotic zone was characterised using glycan microarrays and high-performance anion-exchange chromatography. Concurrently, we applied PacBio HiFi read metagenomics and Illumina short-read metatranscriptomics to assess the abundance and transcription of carbohydrate degradation-related genes in microbial communities. By combining cutting-edge techniques and protocols, we aim to provide insights into the carbohydrate component of the carbon cycle in high-latitude waters during late summer.

Methods

Sample collection

Seawater samples were collected from ten stations located in the eastern Fram Strait and around the Svalbard archipelago in September 2020 during the MSM95 Maria S. Merian research cruise. A map of the sampling locations (Fig. 1) was generated using publically available bathymetric data from the International Bathymetric Chart of the Arctic Ocean (IBCAO) [24] and the QGIS v3.14.16-Pi [25] software. Seawater was collected using a CTD-rosette sampler from surface water (SRF), typically 2 m depth, and the bottom of the surface mixed layer (BML). The BML depth was defined by the beginning of the thermo- and halocline and drop in surface fluorescence values (Fig. 1). One location was sampled twice over a two day period, with additional samples collected at 100 and 200 m depth, these are labelled as S1 and S6 (Supplementary Table S1). Of the water collected, 4 L was filtered sequentially through a 3 and 0.2 µm pore-size polycarbonate membrane filter (142 mm diameter) and immediately stored at −80 °C for ‘omics analysis. A second 4 L of seawater was filtered through a 0.7 µm pre-combusted Whatman Grade GF/F filter (47 mm diameter) and immediately stored at -80 °C for carbohydrate analysis.

**Fig. 1: Bathymetric map with sampling locations, types of samples collected and vertical profiles from CTD casts.**

Monosaccharide and polysaccharide analysis

The GF/F filters from all samples (ten stations and two depths) were cut into ten equally-sized circular pieces, with a diameter of 10 mm. The monosaccharide composition of and polysaccharide structures present on the filter pieces was analysed as described previously [26] and detailed in Supplementary Methods. First, two of the filter pieces were hydrolysed using acid (1 M HCI for 24 h at 100 °C) and the resulting monosaccharides were analysed using High-Performance Anion Exchange Chromatography with Pulsed Amperometric Detection along with monosaccharide standards. Acidic monosaccharides could not be detected due to a problem with the detector. The remaining filter pieces were subject to polysaccharide extraction using a sequential solvent protocol, with 1) MilliQ water, 2) 0.2 M EDTA (pH 7.5) and 3) 4 M NaOH with 0.1% w/v NaBH₄. The identification and semi-quantitative analysis of polysaccharide compounds was performed using a microarray and antibody-based approach. Polysaccharide extracts were printed in quadruplicates onto nitrocellulose membranes (0.45 µm pore-size) using a microarray robot (Sprint, Arrayjet, Roslin, UK). The printed arrays were blocked for 1 h in 1 x PBS with 5% (w/v) non-fat milk powder (MPBS), followed by incubation for 2 h with polysaccharide-specific monoclonal antibodies (Supplementary Table S2). After incubation, the arrays were washed in PBS and incubated for 2 h in anti-rat, anti-mouse or anti-His tag secondary antibodies conjugated to alkaline phosphatase. Arrays were thoroughly washed in PBS and deionized water before being developed in a solution containing 5-bromo-4-chloro-3-indolylphosphate and nitro blue tetrazolium in alkaline phosphatase buffer (100 mM NaCl, 5 mM MgCl₂, 100 mM Tris-HCl, pH 9.5). Developed arrays were scanned and the binding of each probe against each spotted sample was quantified using Array-Pro Analyser 6.3 (Media Cybernetics). Signal intensities for each extract against each antibody were quantified. The highest signal value in the data set (which corresponded to a standard control) was set to 100 and all other values were normalised accordingly. Only antibodies with a signal >=5 in at least one sample were retained. Mean signal intensities (from the four replicates) were then determined for each antibody in each sample.

Metagenome and metatranscriptome sequencing

Filtered seawater samples of the 0.2 – 3 µm fraction from SRF and BML depths of four different stations (S8, S10, S25, S26) were subject to a dual nucleic acid isolation protocol using the DNA/RNA Mini Prep Plus kit from Zymo Research (Irvine, CA, USA), according to the manufacturer’s instructions. The quality of extracted DNA was assessed using capillary electrophoresis with a FEMTOpulse (Agilent), whilst RNA quality was assessed using a PicoChip on a Bioanalyser (Agilent, CA, USA). Ultra-low DNA libraries were prepared from the eight samples without further fragmentation by the protocol “Procedure & Checklist - Preparing HiFi SMRTbell® Libraries from Ultra-Low DNA Input” of PacBio (CA, USA). Libraries were sequenced on 4 x 8 M SMRT cells on a Sequel II platform for 30 h with sequencing chemistry 2.0 and binding kit 2.0 (two samples multiplexed per SMRT cell). Four of the samples were additionally selected for metatranscriptome sequencing (S10_SRF, S25_SRF, S25_BML, S26_BML). Illumina-compatible libraries were produced from extract RNA using the Universal Prokaryotic RNA-Seq library preparation kit, incl. Prokaryotic AnyDeplete® (Tecan Genomics, CA, USA). Libraries were sequenced on a HiSeq 3000 platform with 2 ×150 bp paired-end read mode.

HiFi read taxonomic classification

A custom pipeline was employed to taxonomically classify metagenomic HiFi reads against a GTDB-based protein database. A Diamond blast (v0.9.14) [27] database was generated from the gene amino acid sequences of all GTDB species-representatives (release 207) after clustering at 99% sequence identity, to remove redundancy. NCBI-style taxdump files (nodes.dmp, names.dmp and accession2taxid) were then generated using scripts from https://github.com/nick-youngblut/.

Open reading frames were predicted on raw HiFi reads using FragGeneScan v1.31. Gene sequences were aligned to the generated GTDB protein database using Diamond blastp (parameters: --id 50 --top 5 --fast). A secondary filtering step was applied to the output including identity threshold >65%, e-value < 1E-10 and query-cover >50%. Using the remaining hits, a single taxonomic classification for each gene was determined using a last common ancestor approach, lca command from TaxonKit [28]. To further increase the number of genes taxonomically classified, the last common ancestor algorithm was then applied to all genes within a single HiFi read, resulting in a single taxonomic classification for each HiFi read, and its containing genes.

HiFi read functional annotation

For functional characterisation, the predicted gene sequences (see above) from HiFi reads were subject to a custom annotation pipeline, modified from Priest et al. [29]. In brief, genes were annotated against the Pfam database (release 35.0) using HMMsearch v3.3.2 (parameters: cut_ga), UniProtKB database (05.2022) using Diamond blastp v2 (parameters: -k 1 --evalue 1e-10 --query-cover 50 --id 40 --sensitive) and the KEGG database (07.2022) using kofam_scan (https://github.com/takaram/kofam_scan; parameters: -E 0.0001). Additional annotations were obtained by searching more specialised databases using HMMScan (Transporter Classification database; obtained 11.2021, TonB HMM profiles from TIGRFAM, and dbCAN; v10) and Diamond blastp (CAZyDB; release 09242021, SulfAtlas; v1.3, MEROPS; v12.1) using the same settings as described above, except for HMMScan against the dbCAN database (parameters: -E 1E-15).

Single-copy ribosomal protein (SC-RBP) gene analysis

From the gene annotations, 16 single-copy ribosomal protein (SC-RBP) genes [30] were identified and extracted from each metagenome. The average number of SC-RBP was used as a proxy for the number of genomes recovered in each metagenome. A subset of four SC-RBP genes (RBP L3, L4, L6 and S8) were clustered at previously defined gene-specific ANI thresholds [31] and the average number of clusters across the four was used as a proxy for the number of species captured. The composition of metagenomes and metatranscriptomes was compared based on the species clusters from the RBP L6 gene, selected due to its high recoverability and species delineation accuracy [31]. The taxonomy of each cluster was determined based on a majority vote between the taxonomy of the contained genes, derived from original HiFi read classifications.

Assembly, binning and metagenome-assembled genome recovery

The assembly of Hifi reads was performed using MetaFlye v2.8 [32] (parameters: --meta –pacbio-hifi –hifi-error 0.01 –keep-haplotypes). Coverage information was obtain through mapping HiFi reads to assembled contigs using Minimap2 v2.1 (parameters: -x map-hifi –MD). Contigs were binned using Metabat2 [33]. The resulting bins were subject to manual refinement using the Anvi’o v7 [34] interactive interface to generate metagenome-assembled genomes (MAGs). MAGs were dereplicated at a 99% ANI threshold using dRep v3.2.2 (parameters: --comp 50 --con 5 --sa 0.99 --nc 0.6). The completeness and contamination of representative MAGs was estimated using CheckM v1.1.2 [35]. A two-pronged approached was used for taxonomic classification of MAGs, the classify_wf pipeline of GTDB-tk v1.0.2 [36] (Release 207) and the extraction of 16 S rRNA gene sequences using Barrnap v0.9 [37] and classification against the SILVA_SSU_Ref138.1_NR99 database, following the same process described in ‘Phylogenetic characterisation of communities’. Of the species-representative MAGs, 84% contained a complete 16 S rRNA gene and thus received dual taxonomies.

MAG relative abundance estimation

The relative abundance of representative MAGs was determined using a similar approach to Orellana et al. [38]. In brief, reads were competitively recruited from each metagenome to the MAG representatives. Mapped reads were converted into depth values using Genomecov (-bga option) from the Bedtools package [39] and the 80% central truncated average of the sequencing depth (TAD) was determined using the ‘BedGraph.tad.rb’ script (option range 80) from the enveomics collection [40]. The relative abundance was then determined as the quotient between the TAD value and the number of microbial genomes captured in each metagenome, determined from SC-RBP genes.

MAG functional characterisation

The functional characterisation of MAGs was performed following the same procedure described for the HiFi reads above except for an additional process of polysaccharide utilisation loci (PULs) detection. PULs were defined as genetic loci containing a SusC/SusD gene pair with two or more degradative CAZymes or the presence of at least three degradative CAZymes in close proximity (maximum six genes apart). PULs were manually inspected and visualised at BioRender.com.

Transcription level of genes at the community- and MAG-level

Adaptors and low quality reads were removed from the metatranscriptomes using BBDuk of the BBtools programme v38.73 [41] (parameters: ktrim=r, k = 29, mink=12, hdist=1, tbo=t, tpe=t, qtrim=rl, trimq=20, minlength=100). Although an rRNA depletion step was performed prior to sequencing, it is expected that 5 – 15% of reads would still be related to rRNA. As such, SortMeRNA v2.0 [42] was used to filter out rRNA sequences from the dataset, with the SILVA SSU Ref 138 NR99 database as a reference. The transcription level of genes was determined by read recruitment of transcripts to the predicted gene sequences from the HiFi reads using BBmap (v35; parameters: minid=98 idfilter=98). Mapped read values were converted to transcripts per million (TPM), according to Wagner et al. [43]. For MAGs, transcripts were competitively recruited to all MAG genes, with same procedure as above, and the values were converted to TPM using the total number of transcripts recruited to the whole metagenome-predicted genes as the total transcript values. In order to compare the transcription level of MAGs across samples, we determined the average TPM value of the 16 SC-RBPs for each MAG in each sample, and took the quotient of this and the average TPM value of the 16 SC-RBPs in the whole sample – providing proportional transcription of all genomes recovered. To place MAG CAZyme gene family transcription into the context of the whole community, we performed an additional read recruitment step. First, we concatenated genes from all MAGs into files based on CAZyme gene family or sub-family annotations. Then, we identified all transcripts that mapped to the metagenomic read-predicted genes for each of these families and subsequently recruited them to the concatenated MAG gene files, using the same parameters assigned above. Based on the number of transcripts mapped, the relative proportion of CAZyme gene family transcription was determined for each MAG.

Results & discussion

Seawater samples were collected from surface waters (SRF) and the bottom of the surface mixed layer (BML) in the Eastern Fram Strait region to investigate the distribution of carbohydrates and their utilisation by microbial communities. The ten sites were grouped into three categories based on the underlying seafloor topography (above-slope, above-shelf and open-ocean), which also corresponded to differences in hydrographic conditions. The main water mass in this region is of North Atlantic origin. The West Spitsbergen Current (WSC) transports Atlantic water northward into the Arctic Ocean, with the main branch flowing above the continental slope. At the shelf break, a temperature-salinity front occurs, whilst above the West Spitsbergen shelf, Atlantic water (AW) converges and mixes with Arctic water and freshwater from land, resulting in intra-annual variability in hydrographic properties [44]. Based on temperature and salinity values, the main water masses in this region can be distinguished, with AW characterised by >34.9 psu and >4.1 °C [45]. The temperature of SRF and BML depths during sampling in this study were indicative of AW, ranging from 4.1 – 7.7 °C. However, the salinity values in SRF waters of above-shelf (S10) and three above-slope stations (S1, S6 and S8) were below the AW-defining thresholds (Fig. 1 and Supplementary Table S1). These observations suggest an influence of either polar-derived water or freshwater from Spitsbergen at these stations.

Carbohydrate analysis of POM samples

The monosaccharide and glycan composition of carbohydrates in POM ( > 0.7 µm) was analysed in SRF and BML depths at nine stations. The monosaccharide composition of carbohydrates in POM varied with depth and location (Fig. 2). Total neutral and amino monosaccharide concentrations ranged from 1.4 – 13.8 µg per L of seawater (hereon µg l⁻¹). Higher values were typically observed in SRF, average of 8.8 µg l⁻¹, compared to BML depths, average of 5.2 µg l⁻¹ (Supplementary Table S3). However, the magnitude of change between the two depths was station-dependent, from a negligible difference at station S8 to a threefold decrease from SRF to BML depths at station S6 (Fig. 2a). The decrease in monosaccharide concentrations with depth is in agreement with previous observations from the Pacific Ocean [46]. In addition, above-slope stations contained higher monosaccharide concentrations than open-ocean stations (Fig. 2 and Supplementary Figure S1). This spatial heterogeneity resembles that of chlorophyll a and dissolved organic compounds during early summer in this region, which reach highest concentrations in SRF depths above the continental slope ( ~ 8 °E) [47]. These patterns likely reflect hydrographic processes, such as the frontal zone situated above the shelf break.

**Fig. 2: Total concentration and relative abundance of monosaccharides in carbohydrates from POM fraction.**

The most abundant monosaccharide detected in all samples was glucose. Glucose represented a larger relative proportion of POM carbohydrates in SRF ( ~ 49%) than BML ( ~ 43%) depths and in above-slope ( ~ 47%) compared to open-ocean ( ~ 44%) samples (Fig. 2b). These values are within the range of those previously reported from oceanic surface waters (31 – 55%) [48,49,50] but lower than during phytoplankton blooms, wherein glucose can constitute >70% of POM carbohydrates [51]. Furthermore, in contrast to findings from the high North Pacific [48], the relative proportion of glucose decreased with depth, concurrent with an increase in all other monosaccharides. In particular, xylose increased 5% in relative proportion to the other monomers from SRF to BML depths (Fig. 2b). The relative decrease in glucose with depth could result from a number of factors, including recent production and retention of glucose-containing glycans in surface waters as well as the selective utilisation of these compounds.

Combining carbohydrate microarrays with monoclonal antibodies and carbohydrate binding modules resulted in the detection of 16 distinct glycan epitopes in POM (Supplementary Figure S2 and Supplementary Table S4). This structure-based detection provides semi-quantitative presence and abundance of distinct epitopes, where changes in antibody binding signal correlates to epitope concentration in the sample [26, 52]. Variations in the abundance of glycan epitopes exposed location- and sample-specific patterns. The glycan epitopes observed most frequently included glucuronoxylan in 91% and fucose-containing sulfated polysaccharide (FCSP) in 82% of samples (Fig. 3). FCSPs are unique to brown algae, wherein they serve important structural roles [53] and formulate part of the secreted carbon pool [54], while glucuronoxylans are common features of land plants [55]. Recently, we discovered both of these complex polysaccharides in microalgal blooms [26] as well as the secretion of FCSPs by diatoms in culture [56]. FCSPs synthesised by diatoms accumulate in POM over a period of weeks during a spring phytoplankton bloom, indicating stability against bacterial degradation [26], and can contribute to long-term carbon sequestration in sediments [57]. This observed stability contrasts β-1,3-glucans, such as laminarin, that are also synthesised by brown algae and diatoms. Laminarin is structurally simpler than FCSPs, with fewer unique linkages and without sulphate esters. Concurrently, the presence and activity of laminarases is observed more frequently in marine microbes compared to those targeting FCSPs, suggesting higher consumption and turnover [10, 26, 58]. In our samples, laminarin was absent at some open-ocean stations, but present in all above-slope sites (Fig. 3). In addition to a potentially more rapid utilisation by microbes, the heterogeneous distribution of laminarin may also result from variations in the distribution of phytoplankton, reflecting previous observations from this region [59].

**Fig. 3: Diverse glycan structures occur in Atlantic waters of the Arctic.**

Sampling conducted at the same location over a two day period (stations S1 and S6) and into deeper waters (down to 200 m) showed additional differences over time and with depth in the abundance and diversity of POM carbohydrates (Supplementary Figure S4). S1 samples were retrieved at 08:00 whilst S6 samples were collected one day later at 16:30. The absolute concentration of monosaccharides in POM carbohydrates was 1.1 – 3.6x higher in S6 compared to S1 samples. In addition, SRF depths in S6 samples contained more α-1,5-arabinan and β-1,3-glucan as well as β-1,4-xylan epitopes and xylosyl residues. The only epitopes more abundant in S1 samples were alginate and glucuronoxylan. It is important to note that the alginate-targeting antibody, BAM7, has cross reactivity with FCSP [60]. The variations in glucose and β-1,3-glucan between the two sampling time points may reflect diurnal fluctuations in laminarin production by diatoms [61], with production during the day and a partial consumption at night. Such diel periodicity has also been shown at the particulate organic carbon level [62], with accumulations during the day to a maximum concentration at dusk [63]. Furthermore, the higher abundance of xylosyl residues and β-1,4-xylan epitopes at S6 could suggest a change in the primary producers between the time points, resulting from shifts in water mass dynamics. It is important to note that these are observations derived from sampling only two time points, and we recognise that the limited replication possibility during sampling and capacity to further sample over temporal scales limits the ecological conclusions that can be drawn.

Sampled microbial communities were indicative of summer in high-latitude Atlantic waters

To assess microbial carbohydrate utilisation potential, eight PacBio HiFi read metagenomes were generated from SRF and BML depths of two above-slope (S8 and S25), one open-ocean (S26) and one above-shelf (S10) station. From four of the samples, metatranscriptomes were also generated (Supplementary Table S5 and S6). The above-shelf station S10 was chosen for sequencing even though POM carbohydrate samples were unavailable, due to the lower salinity values observed (psu < 33) that indicates influence from polar marine/freshwater and thus provides a contrast for comparison. Despite the limited scope of our sampling scheme and inability to obtain replicates, the employment of two complimentary, cutting-edge sequencing technologies and protocols resulted in the generation of a high-quality ‘omics dataset that can be used to gain valuable insights into microbial carbohydrate utilisation potential.

To place the sampled communities into context, we performed a taxonomy-independent comparison to previously published metagenomes from the Fram Strait [29] and Arctic Ocean [64]. Based on sequence composition dissimilarity, our metagenomes were most closely related to those previously generated from WSC, high North Atlantic and Barents Sea samples in June and July and most dissimilar to those from the polar water mass of the western Fram Strait (Supplementary Figure S5). This indicates that the communities captured in our metagenomes are representative of Atlantic waters of the Arctic during summer.

As our focus was on microbial communities, we removed metagenomic reads not classified as Bacteria or Archaea. Despite the size fractionation employed during sampling (0.2 – 3 µm), 22– 49% of the metagenomics reads were classified as Eukarya. Although our analysis was concentrated on the prokaryotic fraction, we also extracted and analysed 18 S rRNA genes to provide insights into the eukaryotic taxa present in samples (Supplementary Figure S7). Using the average sequencing depth of single-copy ribosomal protein (SC-RBP) genes, we determined the number of microbial genomes sequenced to range from 761 in S8_SRF to 1467 in S10_BML (Supplementary Figure S6b). The number of genomes detected is used to normalise the abundance of functional genes.

Composition of metagenome and metatranscriptome microbial communities

The composition and structure of microbial communities varied across samples. The most prominent families observed in the metagenomes were Flavobacteriaceae (4 – 14%), Rhodobacteraceae (3 – 13%), D2472 (SAR86; 5 – 12%) and Poseidoniaceae (2 – 12%) except for in S25_BML, which was enriched in Alteromonadaceae (25%) (Supplementary Figure S8). These bacterial families were also substantial contributors to community transcription. However, the relative proportions of abundance and transcription were not consistent, e.g. Flavobacteriaceae represented a two- to threefold higher proportion of transcription. Twenty genera were identified as reaching >2.5% relative abundance and together, constituted ~42% of the microbial communities (Fig. 4). These 20 genera included Pseudoalteromonas (0 – 29%), MGIIa-L1 ( < 0.1 - 10%), Amylibacter (<0.1 – 6%), D2472 (SAR86; 3 – 6%), ASP10-02a (<0.1 – 6%) and HTCC2207 (SAR92; 1 – 6%). However, only some of these genera were substantial contributors to community transcription. The maximum relative proportion of transcription was observed in Pseudoalteromonas (45%), Vibrio (29%), Flavobacterium (13%), Amylibacter (9%), SAR86 (9%) and Pseudothioglobus (9%). Discrepancies between metagenomes and metatranscriptomes highlights the disconnection between gene abundance and transcription, as could be expected. The magnitude of discrepancy varied across taxa, with the MGIIa-L1 (Marine Group II Archaea) representing a larger proportion of gene abundance than transcription, while genera within Bacteroidia, such as Flavobacterium and MAG121220-bin8 (NS4), showed the opposite trend. Such patterns are not uncommon in marine microbial communities [65] and can be influenced by lifestyle, fitness and differences in metabolism.

**Fig. 4: Composition of microbial communities in metagenomes and metatranscriptomes were sample-specific.**

Sample S25_BML was evidently an outlier in the dataset, harbouring a community dominated by Pseudoaltermonas and Vibrio. These genera have previously been observed under nutrient-rich conditions and are known to be associated with eukaryotic hosts and phytoplankton blooms [66, 67]. Considering the high proportion of eukaryotic reads captured in some of the metagenomes, the observed pattern in S25_BML may represent signals of processes occurring in the larger size fraction. In support of this, the eukaryotic community of this sample (Supplementary Figure S7) contained a large proportion of 18 S rRNA genes affiliated with copepods, whose microbiomes are often enriched with Pseudoalteromonas and Vibrio [68, 69]. Therefore, we assume that sample S25_BML was influenced by the presence of copepod faecal pellets on the filter, and thus it represents an outlier that ship-based, point-sampling schemes, such as that employed here, are susceptible to.

In general, the microbial community compositions observed here more closely resemble those previously described from summer [70, 71] than from the sampling time period (late September) in this region [72]. The difference likely reflects inter-annual variability in seasonal transitions, as September separates summer from the beginning of winter. However, methodological differences also play a major role. Methodological influence is particularly evident with respect to the high proportions of MGIIa-L1 in our samples, which has not been observed before in the WSC, and is typical of late summer communities in temperate coastal ecosystems [73]. The previous employment of bacterial-specific 16 S rRNA gene based primers [71] has likely contributed to the MGII, and Archaea more generally, being overlooked in the WSC, wherein they could represent an important fraction of the microbial community.

Carbohydrate utilisation potential of microbial communities varied across samples

Microbial carbohydrate utilisation potential was assessed through the abundance and transcription of CAZymes. In particular, we focused on those involved in degradative processes (glycoside hydrolase, GH; carbohydrate-binding module, CBM; carbohydrate esterase, CE; polysaccharide lyase, PL).

CAZyme genes represented a minor proportion of community gene content but a higher proportion of gene transcription. The number of CAZyme genes ranged from 7 – 19 per microbial genome, which corresponded to 0.3% of community gene content, on average. In contrast, CAZyme genes represented between 1.5% and 3.0% of community gene transcription. Employing a dissimilarity-based approach, we observed that the CAZyme compositions of samples from S8 and S26 were grouped together based on station (location) whereas those of S10 and S25 showed no coherent clustering (Supplementary Figure S9).

The CAZyme gene profiles comprised a core backbone of universally abundant and transcribed gene families. The most abundant gene families in all samples were those involved in peptidoglycan synthesis and degradation, which is in agreement with previous observations and reflects the core machinery required for bacterial cell membrane construction and maintenance [21]. These core CAZyme gene families included CE11, GH23, GH103 and GH73 that together represented, on average, 27% of CAZyme gene abundance (average of 0.83 PMG) and 35% of CAZyme gene transcription (Fig. 5 and Supplementary Tables S7–S10). Several CAZyme gene families involved in the degradation of algal-derived glycans also represented a high proportion of gene abundance and transcription. Most notable was the GH16_3 gene family, which contains enzymes that degrade laminarin [74], that constituted 3.1% of CAZyme gene abundance and 2.7% of CAZyme gene transcription. Other prominent gene families included those known to target sialic acids (GH33 [75]; ~1.4% of CAZyme abundance and 1.7% of CAZyme transcription), α-mannans (GH92 [76]; ~1.7% CAZyme abundance and ~1.3% CAZyme transcription) and alpha-linked fucose that is common in FCSPs [77] (GH29; ~1.3% CAZyme abundance and 1.2% CAZyme transcription) (Fig. 5 and Supplementary Tables S7–S10). In general, the abundance and transcription of CAZyme gene families showed a positive linear relationship, except in S25_BML (Supplementary Figure S10). However, for some CAZyme families, relative transcription was twofold higher than relative abundance, such as the alginate-targeting PL7_5 [78] in S10_SRF and the galacturonan-targeting GH28 [79] in S26_BML (Supplementary Figure S11).

**Fig. 5: Abundance, transcription and taxonomic information of the CAZyme gene families with the highest proportional transcription across samples.**

Although the microbial community analysis was concentrated on the free-living fraction, the target glycans of transcribed CAZyme gene families corresponded to those detected in the POM fraction. For example, the widespread presence of FCSPs and alginate and the transcription of alpha-fucosidases (GH29) and alginate lyases (PL7_5). Glycans that are part of the algae cell belong to the POM pool, but can become part of DOM through cell lysis, viral infection and grazing (a likely process given the presence of copepods in the extracted 18 S rRNA gene data). The same glycan epitopes can thus be present in both POM and DOM, as has been evidenced during phytoplankton blooms [26]. Therefore, the glycans detected in POM here, were likely also available to free-living heterotrophic microbes.

Distinct microbial taxa were responsible for the transcription of CAZyme families in each sample. For CAZyme gene families involved in bacterial glycan recycling, which are universally present in microbes, the transcription was dominated by the most abundant taxa in the community, such as Pelagibacter that accounted for ~45% of the peptidoglycan-targeting GH23 family transcription (Fig. 5). In contrast, the taxa contributing the most to transcription of algal glycan-targeting CAZyme families were affiliated with Bacteroidia, including Polaribacter and NS4 for laminarin (GH16_3), Formosa for α-mannan (GH92) and NS4 and UBA8316 for FCSP (GH29). Several of these genera are well known as carbohydrate-degrading specialists in temperate coastal ecosystems and are among the main microbial responders to spring phytoplankton blooms that annually re-occur [10, 21, 80]. The presence and activity of these genera in our samples suggests that they may also be key players in carbohydrate cycling in high-latitude waters and at later seasonal stages.

Recovery of metagenome-assembled genomes

To investigate carbohydrate utilisation potential at higher resolution, we next focused on the recovery and analysis of metagenome-assembled genomes (MAGs). A total of 83 population-representative MAGs were recovered, delineated at a 99% ANI threshold (Supplementary Information S1 and Supplementary Table S8). The MAGs captured a substantial fraction of microbial metagenomic reads, 48–88% (Supplementary Table S12), and metatranscriptomic reads, 11–37% (Supplementary Table S13) (Fig. 6). Analogous to the community-level patterns, the coupling of relative abundance and relative proportion of transcription varied across MAGs and taxa. MAGs affiliated with MGIIa-L1 (Marine Group II Archaea) exhibited high relative abundance (up to 12%) but low relative transcription (up to 1.5%) while the Aurantivirga-affiliated MAG showed the opposite trend, with ~0.7% relative abundance but ~2.7% relative proportion of transcription. A MAG (S25_BML_bin_129) that shares 97.1% ANI to Pseudoalteromonas primoryensis was also recovered. The Pseudoalteromonas primoryensis MAG constituted 22.8% relative abundance of sample S25_BML, indicating that the pattern observed at the community-level, and likely resulting from copepod faecal pellet influence, was driven by a single species.

**Fig. 6: Microbial populations show distinct patterns in distribution and activity across samples.**

Carbohydrate degradation gene transcription profiles of microbial populations

The carbohydrate utilisation potential of populations was assessed through the abundance and transcription of CAZymes in combination with TonB-Dependent Transporters (TBDTs), sulfatases and peptidases. Peptidases were included for comparison as proteins are another key substrate used by heterotrophic microbes. The largest CAZyme gene repertoires were observed in Verrucomicrobiae- and Bacteroidia-affiliated MAGs, with an average of 15 and 14 CAZymes per Mbp, respectively (Fig. 7). In contrast, Poseidoniia-affiliated MAGs harboured few CAZymes, 1 per Mbp, but high peptidase:CAZyme ratios, ~3.7:1. The high peptidase content of Poseidoniia- MAGs indicates a preference for proteinaceous substrates, in line with previous observations for this taxon [73]. With respect to sulfatases, Verrucomicrobiae-affiliated MAGs harboured the most extensive repertoires, with an average of 24 per Mbp. The observed differences in carbohydrate utilisation potential for these taxa are in accordance with previous findings, with large CAZyme repertoires reported for Bacteroidia and a specialisation on sulfated polysaccharides in Verrucomicrobiae [38, 77].

**Fig. 7: Count and transcription level of carbohydrate utilisation genes for population-representative MAGs.**

In each sample we observed a unique assemblage of populations that dominated carbohydrate-degradation related gene transcription (Fig. 7). In the above-shelf sample S10_SRF, CAZyme, sulfatase and TBDT transcription values were dominated by only a few Bacteroidia representatives, particularly Formosa (25_SRF_bin_51_1) and NS2b (S10_SRF_bin_49_1). In contrast, the above-slope sample S25_SRF was characterised by a larger number and diversity of populations of the Bacteroidia and Gammaproteobacteria that exhibited comparable transcription values. The key contributors to CAZyme transcription in S25_SRF included two distinct populations affiliated with the NS4 Marine Group (S10_SRF_bin_85_1 and S10_BML_bin_54_1) along with one of Formosa (S25_SRF_bin_51_1), SAR92 (S25_SRF_bin_75_1) and SAR86 (S10_BML_bin_72_1). In addition, high sulfatase transcription in S25_SRF was observed for two populations of Roseibacillus (Verrucomicrobiae). In sample S26_BML, comparably high CAZyme gene transcription levels were observed across numerous populations assigned to Bacteroidia, Gammaproteobacteria, Poseidoniia, Alphaproteobacteria and Verrucomicrobiae that were less active in the other samples, which may reflect differences in depth. The largest contributors to CAZyme transcription in S26_BML that were less active in other samples included those assigned to Cand. Arcticimaribacter (S26_BML_bin_6_1) [81], Flavobacteriaceae (S26_BML_bin_6_1) and Planktomarina (S10_BML_bin_107_1). As could be expected, the TPM values for the focal gene groups in S25_BML were strongly dominated by the Pseudoalteromonas representative, and thus we did not include this sample in the visualisation. The MAG-based analysis revealed that population’s exhibit sample-specific transcription profiles and suggest that the community-level profiles may be underpinned by only a few populations.

Disentangling transcription profiles of dominant populations

To place population transcription into the context of the communities, we determined the proportion of community transcription of each CAZyme gene family by each MAG (see Methods). We further focused on the top six MAGs contributing to transcription of each gene family and within those, only the gene families that were more transcribed than single-copy ribosomal protein genes – considered up-transcribed in relation to the genome (Fig. 8).

**Fig. 8: Comparison of CAZyme gene family transcription by selected MAGs across the four metatranscriptome samples.**

Amongst the populations contributing the most to CAZyme gene transcription, we observed gene families that were universally transcribed along with those uniquely transcribed by a single, or few discrete populations. The potential use of communal substrates was evidenced by the similar transcription level of a CAZyme gene family by multiple populations in each sample (Fig. 8). The most notable example was laminarin (GH16_3 and GH17), which is a structurally simple glycan that has previously been shown to be widely accessible to carbohydrate-degrading microbes [10, 17]. In contrast, the community transcription of other CAZyme gene families was dominated by only a few populations that typically varied across samples, including those that target FCSP (GH29 and GH95) and β-xylan (GH10 [82]). For some predicted glycan targets, only a single population was identified as dominating community transcription of the respective CAZyme gene families, such as rhamnogalacturonan (GH28 and GH78) by an NS5 population. The populations showing unique CAZyme gene family transcription patterns are taxonomically related to those that are primary responders to spring phytoplankton blooms in temperate ecosystems and are documented as glycan specialist degraders, such as Formosa and the NS4 and NS5 Marine Groups [21]. Therefore, although our limited sampling scope inhibits confident ecological conclusions, it suggests that similar microbial players may be important in glycan degradation at higher latitudes and later seasonal stages, which warrants further investigation.

Microbes that rely on carbohydrates as a main substrate source often have specific genomic arrangements to optimise utilisation, namely polysaccharide utilisation loci (PULs). As such, PULs are often used in targeted studies to assess microbial glycan utilisation and glycan-based niche partitioning in environmental samples [10, 17, 22]. In line with this, we investigated the presence and transcription of PULs within MAGs, but gained little additional insights into potential glycan utilisation by doing so. In populations harbouring multiple PULs, negligible differences in transcription were observed despite their predicted glycan targets being distinct, suggesting little transcriptional regulation. Only in the case of the NS2b representative were distinct PUL transcription levels observed, with a xylan-targeting PUL exhibiting twofold higher transcription than a mannan-targeting PUL in S26_BML (Supplementary Table S14 and S15). The potentially low level of regulation could be indicative of a priming effect, where the detection of one substrate results in transcription of all PULs, however this would need to be investigated over a much larger dataset and complimented by substrate-based incubations.

Conclusion

In Atlantic waters of the Arctic during late summer, the distribution of POM carbohydrates and their potential utilisation by microbes exhibit variations over spatial scales. The monomeric and glycan composition of POM carbohydrates varied across locations and depths. Typically, higher abundances were observed above the continental slope compared to open-ocean locations. Monosaccharide compositions were dominated by glucose, which decreased in proportion with depth, suggesting preferential utilisation of glucose-based glycans in surface waters. Structurally complex glycans, such as FCSPs that accumulate in POM during phytoplankton blooms, were widely detected, while those with more simple structures, such as laminarin, exhibited patchy distributions. The observed distributions of POM carbohydrates is likely a result of spatial heterogeneity in primary production, as has been described from early summer in this region [47], along with variations in microbial utilisation. Through metatranscriptome analysis, we identified the active fraction of microbial communities and observed variations in carbohydrate degradation-related gene transcription at the community- and population- level across samples. Although gene transcription cannot prove substrate utilisation, we observed that the dominant populations transcribing CAZyme genes belong to the same lineages as microbes that are known to be primary responders to phytoplankton blooms and glycan degraders in temperate ecosystems, suggesting their importance at higher latitudes and later seasonal stages. Furthermore, we observed the up-regulation (transcription higher than that of cellular maintenance machinery) of gene families by specific populations that corresponded to glycans that we could structurally detect in the same sample. In combination, these results provide insights into carbohydrate distribution and potential utilisation patterns by microbes in Atlantic waters of the Arctic during a late summer period. Although our limited sampling scope and inability to replicate samples limits ecological conclusions, the high-quality dataset generated in this study, employing state of the art technologies and protocols, will be of great value in future analysis of marine carbohydrates and the carbon cycle.

Data availability

The measurements of several abiotic parameters from sensors mounted on the CTD have been published under the PANGAEA accession 943220 [83]. The monosaccharide concentrations have been deposited under the PANGAEA accession 957737. The metagenomic raw reads, assemblies and metagenome-assembled genomes along with the metatranscriptomic raw reads were deposited at ENI-EBA, under the project accession PRJEB58071 (Supplementary Table S16). Tables containing the functional and taxonomic annotation of genes from metagenomic reads and MAGs are provided at https://doi.org/10.17617/3.DZSEAN. Detailed information on how to reproduce our results and generate the figures presented in this manuscript is provided at https://github.com/tpriest0/FRAM_STRAIT_WSC20_data_analysis.

References

Benner R, Pakulski JD, Mccarthy M, Hedges JI, Hatcher PG. Bulk chemical characteristics of dissolved organic matter in the ocean. Science. 1992;255:1561–4.
Article CAS PubMed Google Scholar
Hofmann J, Hahm HS, Seeberger PH, Pagel K. Identification of carbohydrate anomers using ion mobility–mass spectrometry. Nature. 2015;526:241–4.
Article CAS PubMed Google Scholar
Myklestad S. Production of carbohydrates by marine planktonic diatoms. I. Comparison of nine different species in culture. J Exp Mar Biol Ecol. 1974;15:261–74.
Article CAS Google Scholar
Bellinger B, Abdullahi A, Gretz M, Underwood G. Biofilm polymers: relationship between carbohydrate biopolymers from estuarine mudflats and unialgal cultures of benthic diatoms. Aquat Microb Ecol. 2005;38:169–80.
Article Google Scholar
Abdullahi AS, Underwood GJC, Gretz MR. Extracellular matrix assembly in diatoms (Bacillariophyceae). V. Environmental effects on polysaccharide synthesis in the model diatom, Phaeodactylum Tricornutum1. J Phycol. 2006;42:363–78.
Article Google Scholar
Wetz MS, Wheeler PA. Release of dissolved organic matter by coastal diatoms. Limnol Oceanogr. 2007;52:798–807.
Article CAS Google Scholar
Møller EF. Production of dissolved organic carbon by sloppy feeding in the copepods Acartia tonsa, Centropages typicus, and Temora longicornis. Limnol Oceanogr. 2007;52:79–84.
Article Google Scholar
Biddanda B, Benner R. Carbon, nitrogen, and carbohydrate fluxes during the production of particulate and dissolved organic matter by marine phytoplankton. Limnol Oceanogr. 1997;42:506–18.
Article CAS Google Scholar
Reintjes G, Arnosti C, Fuchs B, Amann R. Selfish, sharing and scavenging bacteria in the Atlantic Ocean: a biogeographical study of bacterial substrate utilisation. ISME J. 2019;13:1119–32.
Article CAS PubMed Google Scholar
Krüger K, Chafee M, Ben Francis T, Glavina del Rio T, Becher D, Schweder T, et al. In marine Bacteroidetes the bulk of glycan degradation during algae blooms is mediated by few clades using a restricted set of genes. ISME J. 2019;13:2800–16.
Article PubMed PubMed Central Google Scholar
Arnosti C, Wietz M, Brinkhoff T, Hehemann J-H, Probandt D, Zeugner L, et al. The Biogeochemistry of marine polysaccharides: sources, inventories, and bacterial drivers of the carbohydrate cycle. Annu Rev Mar Sci. 2021;13:81–108.
Article CAS Google Scholar
Cuskin F, Lowe EC, Temple MJ, Zhu Y, Cameron EA, Pudlo NA, et al. Human gut Bacteroidetes can utilize yeast mannan through a selfish mechanism. Nature. 2015;517:165–9.
Article CAS PubMed PubMed Central Google Scholar
Reintjes G, Arnosti C, Fuchs BM, Amann R. An alternative polysaccharide uptake mechanism of marine bacteria. ISME J. 2017;11:1640–50.
Article CAS PubMed PubMed Central Google Scholar
Henrissat B. A classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem J. 1991;280:309–16.
Article CAS PubMed PubMed Central Google Scholar
Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B. The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009;37:D233–8.
Article CAS PubMed Google Scholar
Stam MR, Danchin EGJ, Rancurel C, Coutinho PM, Henrissat B. Dividing the large glycoside hydrolase family 13 into subfamilies: towards improved functional annotations of α-amylase-related proteins. Protein Eng Des Sel. 2006;19:555–62.
Article CAS PubMed Google Scholar
Kappelmann L, Krüger K, Hehemann J-H, Harder J, Markert S, Unfried F, et al. Polysaccharide utilization loci of North Sea Flavobacteriia as basis for using SusC/D-protein expression for predicting major phytoplankton glycans. ISME J. 2019;13:76–91.
Article CAS PubMed Google Scholar
Hoarfrost A, Arnosti C Heterotrophic extracellular enzymatic activities in the Atlantic Ocean follow patterns across spatial and depth regimes. Front Mar Sci. 2017;4:200.
D’Ambrosio L, Ziervogel K, MacGregor B, Teske A, Arnosti C. Composition and enzymatic function of particle-associated and free-living bacteria: a coastal/offshore comparison. ISME J. 2014;8:2167–79.
Article PubMed PubMed Central Google Scholar
Arnosti C, Steen AD, Ziervogel K, Ghobrial S, Jeffrey WH. Latitudinal gradients in degradation of marine dissolved organic carbon. PLOS ONE. 2011;6:e28900.
Article CAS PubMed PubMed Central Google Scholar
Teeling H, Fuchs BM, Bennke CM, Krüger K, Chafee M, Kappelmann L, et al. Recurring patterns in bacterioplankton dynamics during coastal spring algae blooms. eLife. 2016;5:e11888.
Article PubMed PubMed Central Google Scholar
Avcı B, Krüger K, Fuchs BM, Teeling H, Amann RI. Polysaccharide niche partitioning of distinct Polaribacter clades during North Sea spring algal blooms. ISME J. 2020;14:1369–83.
Article PubMed PubMed Central Google Scholar
Teeling H, Fuchs BM, Becher D, Klockow C, Gardebrecht A, Bennke CM, et al. Substrate-controlled succession of marine bacterioplankton populations induced by a phytoplankton bloom. Science. 2012;336:608–11.
Article CAS PubMed Google Scholar
Jakobsson M, Mayer LA, Bringensparr C, Castro CF, Mohammad R, Johnson P, et al. The international bathymetric chart of the Arctic Ocean version 4.0. Sci Data. 2020;7:176.
Article PubMed PubMed Central Google Scholar
QGIS. QGIS Geographic Information System. QGIS Association. 2021. https://qgis.org.
Vidal-Melgosa S, Sichert A, Francis TB, Bartosik D, Niggemann J, Wichels A, et al. Diatom fucan polysaccharide precipitates carbon during algal blooms. Nat Commun. 2021;12:1150.
Article CAS PubMed PubMed Central Google Scholar
Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12:59–60.
Article CAS PubMed Google Scholar
Shen W, Ren H. TaxonKit: A practical and efficient NCBI taxonomy toolkit. J Genet Genomics. 2021;48:844–50.
Article PubMed Google Scholar
Priest T, Orellana LH, Huettel B, Fuchs BM, Amann R. Microbial metagenome-assembled genomes of the Fram Strait from short and long read sequencing platforms. PeerJ. 2021;9:e11721.
Article PubMed PubMed Central Google Scholar
Hug LA, Baker BJ, Anantharaman K, Brown CT, Probst AJ, Castelle CJ, et al. A new view of the tree of life. Nat Microbiol. 2016;1:1–6.
Article Google Scholar
Olm MR, Crits-Christoph A, Diamond S, Lavy A, Matheus Carnevali PB, Banfield JF. Consistent metagenome-derived metrics verify and delineate bacterial species boundaries. mSystems. 2020;5:e00731–19.
Article CAS PubMed PubMed Central Google Scholar
Kolmogorov M, Bickhart DM, Behsaz B, Gurevich A, Rayko M, Shin SB, et al. metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat Methods. 2020;17:1103–10.
Article CAS PubMed PubMed Central Google Scholar
Kang DD, Li F, Kirton E, Thomas A, Egan R, An H, et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ. 2019;7:e7359.
Article PubMed PubMed Central Google Scholar
Eren AM, Esen ÖC, Quince C, Vineis JH, Morrison HG, Sogin ML, et al. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ. 2015;3:e1319.
Article PubMed PubMed Central Google Scholar
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–55.
Article CAS PubMed PubMed Central Google Scholar
Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2020;36:1925–7.
Article CAS Google Scholar
Seeman T. barrnap 0.9: rapid ribosomal RNA prediction. 2018. https://github.com/tseemann/barrnap.
Orellana LH, Francis TB, Ferraro M, Hehemann J-H, Fuchs BM, Amann RI Verrucomicrobiota are specialist consumers of sulfated methyl pentoses during diatom blooms. ISME J. 2022;16:30–641.
Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–2.
Article CAS PubMed PubMed Central Google Scholar
Rodriguez-R LM, Konstantinidis KT. The enveomics collection: a toolbox for specialized analyses of microbial genomes and metagenomes. PeerJ Prepr. 2016;4:e1900v1.
Google Scholar
Bushnell B. BBTools software package. http://bbtools.jgi.doe.gov.
Kopylova E, Noé L, Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28:3211–7.
Article CAS PubMed Google Scholar
Wagner GP, Kin K, Lynch VJ. Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci. 2012;131:281–5.
Article CAS PubMed Google Scholar
Nilsen F, Cottier F, Skogseth R, Mattsson S. Fjord–shelf exchanges controlled by ice and brine production: The interannual variation of Atlantic Water in Isfjorden, Svalbard. Cont Shelf Res. 2008;28:1838–53.
Article Google Scholar
Richter ME, von Appen W-J, Wekerle C. Does the East Greenland Current exist in the northern Fram Strait? Ocean Sci. 2018;14:1147–65.
Article CAS Google Scholar
Skoog A, Benner R. Aldoses in various size fractions of marine organic matter: Implications for carbon cycling. Limnol Oceanogr. 1997;42:1803–13.
Article CAS Google Scholar
von Jackowski A, Grosse J, Nöthig E-M, Engel A. Dynamics of organic matter and bacterial activity in the Fram Strait during summer and autumn. Philos Trans R Soc Math Phys Eng Sci. 2020;378:20190366.
Google Scholar
Tanoue E, Handa N. Monosaccharide composition of marine particles and sediments from the Bering Sea and northern North Pacific. Oceanol Acta. 1987;10:91–9.
CAS Google Scholar
Liebezeit G, Bölter M. Water-extractable carbohydrates in particulate matter of the Bransfield Strait. Mar Chem. 1991;35:389–98.
Article CAS Google Scholar
Compiano A-M, Romano J-C, Garabetian F, Laborde P, de la Giraudièrea I. Monosaccharide composition of particulate hydrolysable sugar fraction in surface microlayers from brackish and marine waters. Mar Chem. 1993;42:237–51.
Article CAS Google Scholar
Ittekkot V, Brockmann U, Michaelis W, Degens E. Dissolved free and combined carbohydrates during a phytoplankton bloom in the northern North Sea. Mar Ecol Prog Ser. 1981;4:299–305.
Article CAS Google Scholar
Øbro J, Sørensen I, Moller I, Skjøt M, Mikkelsen JD, Willats WGT. High-throughput microarray analysis of pectic polymers by enzymatic epitope deletion. Carbohydr Polym. 2007;70:77–81.
Article Google Scholar
Deniaud-Bouët E, Hardouin K, Potin P, Kloareg B, Hervé C. A review about brown algal cell walls and fucose-containing sulfated polysaccharides: Cell wall context, biomedical properties and key research challenges. Carbohydr Polym. 2017;175:395–408.
Article PubMed Google Scholar
Buck-Wiese H, Andskog MA, Nguyen NP, Bligh M, Asmala E, Vidal-Melgosa S, et al. Fucoid brown algae inject fucoidan carbon into the ocean. Proc Natl Acad Sci. 2023;120:e2210561119.
Article CAS PubMed Google Scholar
Ebringerová A, Heinze T. Xylan and xylan derivatives – biopolymers with valuable properties, 1. Naturally occurring xylans structures, isolation procedures and properties. Macromol Rapid Commun. 2000;21:542–56.
Article Google Scholar
Huang G, Vidal-Melgosa S, Sichert A, Becker S, Fang Y, Niggemann J, et al. Secretion of sulfated fucans by diatoms may contribute to marine aggregate formation. Limnol Oceanogr. 2021;66:3768–82.
Article CAS Google Scholar
Vidal-Melgosa S, Lagator M, Sichert A, Priest T, Pätzold J, Hehemann J-H Not digested: algal glycans move carbon dioxide into the deep-sea. 2022. bioRxiv. 2022.03.04.483023
Sidhu C, Kirstein IV, Meunier CL, Rick J, Fofonova V, Wiltshire KH, et al. Dissolved storage glycans shaped the community composition of abundant bacterioplankton clades during a North Sea spring phytoplankton bloom. Microbiome. 2023;11:77.
Article CAS PubMed PubMed Central Google Scholar
Trudnowska E, Gluchowska M, Beszczynska-Möller A, Blachowiak-Samolyk K, Kwasniewski S. Plankton patchiness in the Polar Front region of the West Spitsbergen Shelf. Mar Ecol Prog Ser. 2016;560:1–18.
Article Google Scholar
Torode TA, Siméon A, Marcus SE, Jam M, Le Moigne M-A, Duffieux D, et al. Dynamics of cell wall assembly during early embryogenesis in the brown alga Fucus. J Exp Bot. 2016;67:6089–100.
Article CAS PubMed PubMed Central Google Scholar
Becker S, Tebben J, Coffinet S, Wiltshire K, Iversen MH, Harder T, et al. Laminarin is a major molecule in the marine carbon cycle. Proc Natl Acad Sci. 2020;117:6599–607.
Article CAS PubMed PubMed Central Google Scholar
Granum E, Kirkvold S, Myklestad SM. Cellular and extracellular production of carbohydrates and amino acids by the marine diatom Skeletonema costatum: diel variations and effects of N depletion. Mar Ecol Prog Ser. 2002;242:83–94.
Article CAS Google Scholar
Boysen AK, Carlson LT, Durham BP, Groussman RD, Aylward FO, Ribalet F, et al. Particulate metabolites and transcripts reflect diel oscillations of microbial activity in the surface ocean. mSystems. 2021;6:e00896–20.
Article CAS PubMed PubMed Central Google Scholar
Salazar G, Paoli L, Alberti A, Huerta-Cepas J, Ruscheweyh H-J, Cuenca M, et al. Gene expression changes and community turnover differentially shape the global ocean metatranscriptome. Cell. 2019;179:1068–83.e21.
Article CAS PubMed PubMed Central Google Scholar
Rinta-Kanto JM, Sun S, Sharma S, Kiene RP, Moran MA. Bacterial community transcription patterns during a marine phytoplankton bloom. Environ Microbiol. 2012;14:228–39.
Article CAS PubMed Google Scholar
Pontiller B, Martínez-García S, Joglar V, Amnebrink D, Pérez-Martínez C, González JM, et al. Rapid bacterioplankton transcription cascades regulate organic matter utilization during phytoplankton bloom progression in a coastal upwelling system. ISME J. 2022;16:2360–72.
Holmström C, Kjelleberg S. Marine Pseudoalteromonas species are associated with higher organisms and produce biologically active extracellular agents. FEMS Microbiol Ecol. 1999;30:285–93.
Article PubMed Google Scholar
Moisander PH, Sexton AD, Daley MC. Stable associations masked by temporal variability in the marine copepod microbiome. PLOS ONE. 2015;10:e0138967.
Article PubMed PubMed Central Google Scholar
Shoemaker KM, Moisander PH. Microbial diversity associated with copepods in the North Atlantic subtropical gyre. FEMS Microbiol Ecol. 2015;91:fiv064.
Article PubMed Google Scholar
Wietz M, Bienhold C, Metfies K, Torres-Valdés S, von Appen W-J, Salter I, et al. The polar night shift: seasonal dynamics and drivers of Arctic Ocean microbiomes revealed by autonomous sampling. ISME Commun. 2021;1:1–12.
Article Google Scholar
Fadeev E, Salter I, Schourup-Kristensen V, Nöthig E-M, Metfies K, Engel A, et al. Microbial communities in the east and west Fram Strait during sea ice melting season. Front Mar Sci. 2018;5.
von Jackowski A, Becker KW, Wietz M, Bienhold C, Zäncker B, Nöthig E-M, et al. Variations of microbial communities and substrate regimes in the eastern Fram Strait between summer and fall. Environ Microbiol. 2022;24:4124–36.
Article Google Scholar
Orellana LH, Ben Francis T, Krüger K, Teeling H, Müller M-C, Fuchs BM, et al. Niche differentiation among annually recurrent coastal Marine Group II Euryarchaeota. ISME J. 2019;13:3024–36.
Article CAS PubMed PubMed Central Google Scholar
Becker S, Scheffel A, Polz MF, Hehemann J-H. Accurate quantification of laminarin in marine organic matter with enzymes from marine microbes. Appl Environ Microbiol. 2017;83:e03389–16.
Article CAS PubMed PubMed Central Google Scholar
Friebolin H, Brossmer R, Keilich G, Ziegler D, Supp M. 1H-NMR-spectroscopic evidence for the release of N-acetyl-alpha-D-neuraminic acid as the first product of neuraminidase action (author’s transl). Hoppe Seylers Z Physiol Chem. 1980;361:697–702.
Article CAS PubMed Google Scholar
Zhu Y, Suits MDL, Thompson AJ, Chavan S, Dinev Z, Dumon C, et al. Mechanistic insights into a Ca2+-dependent family of α-mannosidases in a human gut symbiont. Nat Chem Biol. 2010;6:125–32.
Article CAS PubMed Google Scholar
Sichert A, Corzett CH, Schechter MS, Unfried F, Markert S, Becher D, et al. Verrucomicrobia use hundreds of enzymes to digest the algal polysaccharide fucoidan. Nat Microbiol. 2020;5:1026–39.
Article CAS PubMed Google Scholar
Thomas F, Lundqvist LCE, Jam M, Jeudy A, Barbeyron T, Sandström C, et al. Comparative characterization of two marine alginate lyases from Zobellia galactanivorans reveals distinct modes of action and exquisite adaptation to their natural substrate. J Biol Chem. 2013;288:23021–37.
Article CAS PubMed PubMed Central Google Scholar
Biely P, Benen J, Heinrichová K, Kester HCM, Visser J. Inversion of configuration during hydrolysis of α-1,4-galacturonidic linkage by three Aspergillus polygalacturonases. FEBS Lett. 1996;382:249–55.
Article CAS PubMed Google Scholar
Francis B, Urich T, Mikolasch A, Teeling H, Amann R. North Sea spring bloom-associated Gammaproteobacteria fill diverse heterotrophic niches. Environ Microbiome. 2021;16:15.
Article CAS PubMed PubMed Central Google Scholar
Priest T, Heins A, Harder J, Amann R, Fuchs BM. Niche partitioning of the ubiquitous and ecologically relevant NS5 marine group. ISME J. 2022;16:1570–82.
Article CAS PubMed PubMed Central Google Scholar
Withers SG, Dombroski D, Berven LA, Kilburn DG, Miller RC, J. Warren RA, et al. Direct 1H N.M.R. determination of the stereochemical course of hydrolyses catalysed by glucanase components of the cellulase complex. Biochem Biophys Res Commun. 1986;139:487–94.
Article CAS PubMed Google Scholar
Priest T, Merten V, Purser A. Physical oceanography during RV MARIA S. MERIAN cruise MSM95. 2022. PANGAEA. https://doi.org/10.1594/PANGAEA.943220.

Download references

Acknowledgements

We would like to thank the captain and crew of the RV. Maria. S. Merian for their support throughout all sampling aspects of this project. We thank Alek Bolte for his assistance with HPAEC-PAD. We thank Tina Trautmann for her assistance with the carbohydrate microarray quantification. We thank Bruno Hüttel and the team at the Max Planck Genome Centre in Cologne for their work in generating the metagenomes and metatranscriptomes. We thank A. Murat Eren for his valuable comments and feedback with respect to data analysis. We thank the Max Planck Society for funding, and Hehemann further acknowledges funding from the DFG Heisenberg programme HE 7217/5-1 and the DFG Exzellenzcluster 2077.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute of Microbiology, ETH Zurich, Zurich, Switzerland
Taylor Priest
Max Planck Institute for Marine Microbiology, Bremen, Germany
Silvia Vidal-Melgosa, Jan-Hendrik Hehemann, Rudolf Amann & Bernhard M. Fuchs
University of Bremen, MARUM, Bremen, Germany
Silvia Vidal-Melgosa & Jan-Hendrik Hehemann

Authors

Taylor Priest
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Vidal-Melgosa
View author publications
You can also search for this author in PubMed Google Scholar
Jan-Hendrik Hehemann
View author publications
You can also search for this author in PubMed Google Scholar
Rudolf Amann
View author publications
You can also search for this author in PubMed Google Scholar
Bernhard M. Fuchs
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

TP wrote the manuscript, performed the metagenomic, metatranscriptomic and carbohydrate data analysis. TP and SVM extracted the carbohydrates and SVM subsequently carried out the microarray analysis. SVM, JHH, BMF and RA contributed to the interpretation of the results and the formulation of the story. BMF, RA and TP planned the work and devised the project. All authors contributed to reviewing and improving the manuscript.

Corresponding authors

Correspondence to Taylor Priest or Bernhard M. Fuchs.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Methods

Supplementary Figures

Supplementary Tables

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Priest, T., Vidal-Melgosa, S., Hehemann, JH. et al. Carbohydrates and carbohydrate degradation gene abundance and transcription in Atlantic waters of the Arctic. ISME COMMUN. 3, 130 (2023). https://doi.org/10.1038/s43705-023-00324-7

Download citation

Received: 11 August 2023
Revised: 17 October 2023
Accepted: 25 October 2023
Published: 09 December 2023
DOI: https://doi.org/10.1038/s43705-023-00324-7