Introduction

Microbes in aquatic environments are drivers of many biogeochemical cycles. Many temperate freshwater ecosystems experience dramatic seasonal variation, with some environments remaining ice-covered for most of the winter1. These strong perturbations have the potential to dramatically alter the microbial community composition during times of ice cover. Recent advances in sequencing technologies have provided unprecedented insights into microbial community diversity and dynamics. However, much remains unknown about the dynamics of microbial communities during the formation and persistence of lake ice.

While lakes and other freshwater systems represent a small portion of the total water on earth, they are integral to ecosystem function and biogeochemical cycling. Microbial community composition, activity, and function have been shown to change seasonally in freshwater settings2. Despite the dramatic change in environmental conditions from ice-free to ice-covered times, there is limited information regarding the impact of ice cover on microbial community composition in freshwater environments. Changes in microbial activity during ice-covered periods have implications on the ecosystem function and biogeochemical processes in these settings2,3,4.

Factors such as temperature, salinity, and dissolved oxygen may all act as limiting constraints on microbial activity and thus impact microbial diversity. During times of ice cover there are decreased temperatures, limited nutrient inputs, and low light levels1. Temperature is known to affect rates of biological reactions5. As temperatures are lower during times of ice cover compared to ice-free periods, it is expected that microbial activity would be lower. However, many microbes including psychrophilic and psychrotolerant microbes have adapted for growth under cold conditions and maintain high enzymatic activities and growth rates6.

As nutrients are often the limiting factor for microbial growth in these systems, the altered nutrient regime that occurs during times of ice cover may select for distinct populations capable of growth under nutrient limited conditions1,7. Additionally, as ice and snow accumulate at the surface of the lake, the extent of penetration of photosynthetically active radiation (PAR) into the underlying water is decreased8. This decrease in PAR can impact the activity of phototrophs during times of ice cover, and alter the primary productivity during ice cover. Despite lower light levels during ice cover, many systems experience routine under ice algal blooms, which have been shown to dramatically alter the microbial community composition following the algal bloom9,10. Additional work has shown that the extent of ice cover can strongly affect the phytoplankton and bacterial communities11.

Microbial communities are also known to be highly dynamic, with community changes on the timescales of days to months to seasons. Seasonal changes in microbial community composition have been studied in the oceans12,13,14,15, lakes2,16, rivers and streams17,18. These reports indicate that there was a distinct microbial community composition in different seasons. In many cases, this seasonal variation exhibits reproducible and predictable patterns in both the oceans and lakes12,13,15,17,18,19. Higher resolution variability on a day-to-day basis has been more finely investigated in marine settings15,20 with a few time series studies in freshwater systems21. These studies in the oceans have shown that there were dramatic and rapid shifts in the marine microbial community composition following seasonal change and in the days following phytoplankton blooms15,20. Time series of microbial community composition in freshwater lakes has shown that there is surprising variation in the microbial community between years21. These studies have also demonstrated that extrinsic factors such as environmental conditions play a role in determining the microbial community composition and providing some synchrony to the system22. In many freshwater and coastal systems, seasonal differences have been linked to differences in organic matter and nutrient inputs17. Biological interactions were also shown to be important in controlling the bacterial community composition of several temperate lakes during a time series study22. Despite our growing knowledge of temporal variation in freshwater microbial communities, very little is known about the stability of freshwater microbial communities on daily timescales, especially during the formation and persistence of lake ice.

The goal of this study was to understand the temporal dynamics of microbial community diversity before, during, and after ice cover using high-resolution time series sample collection in a freshwater lake. The freshwater system chosen was the Keweenaw Waterway, located in the Upper Peninsula of Michigan adjacent to Michigan Technological University. The Keweenaw Waterway is primarily fed by Portage Lake with some inputs from Lake Superior23. The Keweenaw Waterway experiences ice cover for approximately 3 months out of the year. Water samples were collected on a daily basis, enabling a detailed look at the extent of change that a microbial community undergoes during this dramatic seasonal change. We expected there would be limited change in the microbial community during times of ice cover due to the stability of environmental conditions. We additionally expected that microbial abundance and diversity would decrease during winter months. This study advances our understanding of the microbial community dynamics and diversity in freshwater systems that experience seasonal ice cover and opens the way for more targeted studies investigating the biogeochemical implications of these changes under lake ice.

Results

Environmental conditions stabilize during ice cover

Physical and chemical measurements (temperature, conductivity, total dissolved solids (TDS), dissolved oxygen, pH, and ORP) were taken from surface water at the time of sampling throughout this time series (Supplemental Materials). The temperature dropped steadily until ice formation, at which point the surface water temperature remained constant throughout the winter at approximately 0 °C (Fig. 1A). A slight increase in surface water temperature was observed in mid-December 2015. Dissolved oxygen in the surface waters also remained high throughout the study period (Fig. 1B). Conductivity remained fairly constant, with a peak occurring at the time of ice melt and a rapid decline after ice melt (Fig. 1C). Other conditions including pH and ORP remained fairly constant throughout the study (Supplemental Materials).

Figure 1

Environmental conditions. (A) Temperature changes observed throughout the time series. Ice-free samples are shown in blue and ice-covered samples are shown in white. (B) Dissolved oxygen measurements across the time series. (C) Specific conductivity across the time series.

Changes in microbial abundance across the transition from fall to winter to spring

To determine the impact of ice cover on microbial abundance, qPCR with primers targeting the 16S rRNA gene was used to quantify the abundance of bacteria and archaea across this time series (Fig. 2 and Supplemental Materials). The average abundance of archaeal 16S rRNA genes across the time series was 7.24 × 10⁶ copies per ml of lake water, and the average abundance of bacterial 16S rRNA genes was 8.52 × 10⁷ copies per ml. The average abundance of archaea during ice-free times was 6.35 × 10⁶ copies ml⁻¹, whereas in ice-covered samples it was 8.62 × 10⁶ copies ml⁻¹. The average bacterial abundance in ice-free conditions was 1.26 × 10⁸ copies ml⁻¹ and during ice cover was 3.3 × 10⁷ copies ml⁻¹. A Kruskal-Wallis test was performed on log-transformed abundance data to determine if there was a significant difference in the bacterial and archaeal abundance in ice-free compared to ice-covered samples (Table 1). The average bacterial abundance varied significantly between ice-free and ice-covered conditions (Kruskal-Wallis test p-value < 2.2 × 10⁻¹⁶, chi-squared = 50.86, degrees of freedom = 1) (Fig. 2). Abundance of archaea was not significantly different between ice-free and ice-covered time periods. Despite the statistical significance of the bacterial difference, there was less than a one-log change in the copies of the bacterial 16S rRNA gene between ice-covered and ice-free samples. Bacterial and archaeal abundance followed different trends, with bacterial abundance decreasing during times of ice cover whereas archaeal abundance increased slightly. In particular, bacterial abundance decreased most substantially in the month of February and rebounded with an increase of almost one log by April (Supplementary Figs 1 and 2). In contrast, archaeal abundance was lowest in November and peaked in March.

Figure 2

Bacterial and archaeal 16S rRNA gene abundance. (A) Abundance of bacterial 16S rRNA genes across the time series. (B) Abundance of archaeal 16S rRNA genes across the time series. Ice-free samples are shown in blue and ice-covered samples are shown in white.

Table 1 Statistics comparing prokaryotic abundance between ice-free and ice-covered conditions using Kruskal-Wallis test.

Increase in phylogenetic diversity during ice cover

The 16S rRNA gene was sequenced for each sample collected during this time series. The relative abundance of various phyla varied across the time series, with a distinct community present during times of ice cover (Fig. 3A). Faith’s Phylogenetic Diversity was used to assess the phylogenetic diversity within each sample. Phylogenetic diversity increased during the fall, with a maximum occurring at the same time as ice formation (Fig. 3B). This high phylogenetic diversity persisted throughout the time of ice cover and then decreased upon ice melt to levels seen during the pre-ice period. Phylogenetic diversity during times of ice cover was significantly higher than during ice-free periods (Kruskal-Wallis p-value < 0.00001, chi-squared = 150.33, degrees of freedom = 1) (Table 2). Additionally, there was a significant increase in the Observed Species and Shannon metrics during times of ice cover (Supplementary Figs 3, 4 and 5 and Supplementary Table 3).

Figure 3

Microbial diversity across the timeseries. (A) Taxonomic area chart of microbial orders. Relative abundance of microbial orders are shown as different colors across the timeseries. Orders with abundance of less than 0.1% were grouped together in the other category. The corresponding colors for each microbial order are shown in the legend. (B) Faith’s Phylogenetic Diversity as a function of time. Samples collected during ice-free times are shown in blue squares. Samples collected during ice-covered times are shown in white circles.

Table 2 ANCOVA analysis of rates of change for time-decay analysis.

A distinct microbial community is established during times of ice cover

Multivariate analysis was performed to understand changes in the microbial community that occur during times of ice cover. Non-metric multidimensional scaling (NMDS) of weighted UniFrac distances indicated that a distinct community of microbes existed in the water during times of ice cover relative to ice-free samples (Fig. 4A). PERMANOVA analysis on weighted UniFrac distances indicated that there was a significant difference in the microbial community composition during times of ice cover relative to ice-free conditions (PERMANOVA p-value = 0.001, F statistic = 44.927, R² = 0.1488, degrees of freedom = 1). Furthermore, there were temporal changes in the microbial community throughout the year, with a significant difference in the microbial community composition between months of the year (PERMANOVA p-value = 0.001, F statistic = 41.923, R² = 0.45311, degrees of freedom = 5). To better understand the role of environmental conditions in explaining the microbial diversity, we fit the measured environmental variables to the NMDS of weighted UniFrac distances (Fig. 4A). Temperature, conductivity, dissolved oxygen, and Total Dissolved Solids (TDS) all fit the data significantly (p-value < 0.01). The temperature and conductivity vectors pointed toward the fall ice-free samples, whereas the dissolved oxygen vector pointed toward the ice-covered samples and the TDS vector toward the spring ice-free samples. CCA analysis showed a similar impact of these environmental factors on the microbial community composition (Supplementary Fig. 6). Analysis of the R² values for various comparisons indicated that ice cover alone was able to explain about 7.7% of the variance in the microbial community composition (adjusted R² = 0.0773). The environmental conditions (temperature, conductivity, TDS and dissolved oxygen) explained about 17% of the variance (adjusted R² = 0.174).

Figure 4

Microbial community composition changes during the time series. (A) Non-metric multidimensional scaling (NMDS) of weighted UniFrac distances. The shape of the points corresponds to the ice condition and the color corresponds to month of collection. Stress for this plot is 0.1878. Environmental factors were fit to the NMDS and factors that significantly fit the data are shown as vectors. (B) Time-decay analysis of weighted UniFrac distances. Pairwise comparison of community distances versus number of days between samples. Data were fit to a linear regression. Confidence regions are represented as gray shading. Ice-covered samples are shown in orange. Ice-free communities in the fall are shown in gray, and ice-free communities in the spring are shown in blue.

The clustering of the samples on the NMDS plot indicated that during ice-free periods there was a high amount of day-to-day variability in community composition (Fig. 4A). This finding was also supported by time-decay analysis (Fig. 4B). Furthermore, the microbial community from samples collected during times of ice cover clustered more tightly together, which suggests that the microbial community is more stable from day to day during times of ice cover. Despite the apparent stability of the microbial community during ice cover, there were significant distinctions in the microbial community composition from month to month during ice-covered periods. Time-decay analysis demonstrated that ice-covered communities show a very slow rate of change relative to the ice-free samples (Fig. 4B). The rates of change were significantly different between ice-covered and ice-free conditions (ANCOVA p-value < 2.2 × 10⁻¹⁶, F value = 224.29, degrees of freedom = 2). During the ice-free periods, there were a few times when there were abrupt changes in the microbial community composition. These changes to some extent coincided with times of prolonged high winds (Supplementary Fig. 8).

To determine which species were differentially abundant between ice-free and ice-covered samples, a statistical analysis known as MetagenomeSeq was applied24,25. This test is designed to normalize the sequencing data and determine taxa that are differentially abundant between different conditions. Eighteen orders were significantly enriched in samples collected from ice-free conditions, whereas thirty-eight orders were enriched in ice-covered conditions relative to ice-free conditions (Tables 3 and 4).

Table 3 Microbial orders enriched in ice-free conditions with greater than 2-fold enrichment.
Table 4 Microbial orders enriched in ice-covered conditions with greater than 2-fold enrichment.

Members of the Burkholderiales, Verrucomicrobiales, and Caulobacterales were some of the most significantly enriched orders under ice-free conditions. Verrucomicrobiales were enriched 5-fold under ice-free conditions, while Caulobacterales were enriched 3-fold, and Burkholderiales were enriched 1.9-fold. Both ammonia oxidizing bacteria (AOB, Nitrosomonadales) and ammonia oxidizing archaea (AOA, Thaumarchaeota) were enriched during ice cover. Additionally, relatives of nitrite oxidizing bacteria of the order Nitrospirales were enriched more than 6-fold during times of ice cover (Fig. 5).

Figure 5

Differentially abundant taxa. Select taxa that are significantly different between ice-free and ice-covered states. (A) Relative abundance of sequences classified as Nitrospira as a function of time. (B) Relative abundance of sequences classified as Nitrosomonadales as a function of time. (C) Relative abundance of sequences classified as Chloroplasts as a function of time.

Under-ice algal bloom

A substantial increase in reads classified as chloroplast sequences was observed during times of ice cover leading up to ice melt (Fig. 5). Reads assigned to chloroplasts were typically less than 5% of the recovered reads in November and December. However, there was a steady increase in chloroplast sequences during times of ice cover. The percentage of chloroplast sequences during this under-ice bloom peaked at 18% just after ice melt in March and subsequently dropped to pre-ice levels in April. For the most part, these sequences could not be classified further to identify the photosynthetic eukaryotes from which these chloroplasts were derived. This finding suggests that an under-ice algal bloom occurred in this water body during the ice-covered season.

Discussion

Here we sought to investigate the dynamics of the microbial community of surface water across dramatic seasonal changes in the Keweenaw Waterway. The Keweenaw Waterway is a highly dynamic system where movement of water can be influenced by wind-driven currents23. The environmental conditions of the surface water exhibited dramatic changes leading up to ice cover, but stabilized during times of ice cover. This stability is similar to the trends in temperature that were observed in Churchill et al.23. Temperature can affect microbial activity and abundances. There was a slight but statistically significant decrease in the bacterial abundance during times of ice cover (p-value < 2.2 × 10⁻¹⁶). Ice cover was present from early January to mid-March during the time of sampling in this study. During ice cover the temperature of the surface water stabilized at approximately 0 °C and the dissolved oxygen of the surface water stabilized at approximately 12.5 mg/L. This indicated that despite the limited gas exchange during times of ice cover, the surface water remained oxygenated, allowing for dominance of aerobic physiologies during the ice-covered time period.

Our results suggest that microbial biomass in the surface water showed only limited change between ice-free and ice-covered conditions. Previous studies have shown that bacterial cell numbers remain fairly constant throughout the year, with blooms associated with thermal mixing26. Others have reported that bacterial biomass is typically lower during ice cover1. The decrease in bacterial abundance observed in this study was slight (less than one log), suggesting that microbes in this lake are well adapted to cold conditions. This could be due to the consistent long periods of ice cover and the limited warming that occurs in this lake during the summer.

Despite the conditions found under ice, previous studies suggest that these microbes are actively growing1. This active growth suggests the possibility of a rapid response to brief or more gradual changes in environmental conditions or inputs of nutrients. This can be observed in the gradual changes that occur throughout the ice-covered times. Our results suggest that the microbial community present during times of ice cover was more stable on a day-to-day basis compared to the ice-free periods. However, the microbial community present under ice exhibited gradual changes, which is in line with previous studies that demonstrate an actively growing microbial community during times of ice cover and changes in the bacterial community composition during ice cover26. Part of this actively growing community appears to be algae. We observed a substantial increase in chloroplast sequences beginning in the middle of ice cover and continuing until ice melt. This under-ice algal bloom matches other previously described blooms of under-ice algae9. This bloom was quite slow in forming, as the build-up lasted for almost 2 months, starting in mid-January not long after ice formation and continuing to increase until ice melt in mid-March.

Interestingly, we observed an increase in the phylogenetic diversity of the microbes during the ice-covered conditions. Our initial hypothesis, that the diversity would decrease, was based on the previous observations that microbial biomass decreases during times of ice cover coupled with the fact that the conditions found under ice cover are much more selective for growth (low temperature, low light, limited inputs of nutrients). Therefore, it was surprising to observe an increase in both richness and evenness during times of ice cover. This increase in diversity could be explained in several ways. The increased diversity could be due to limited grazing that occurs during times of ice cover27. The decreased grazing would relieve some of the top-down controls on the microbial community composition and allow a more diverse community to proliferate without the control of grazing that occurs during warmer months. The increase in phylogenetic diversity could also be explained by alteration in the nutrient inputs into the system. It is possible that during warmer months the increased temperatures or nutrients allow particular taxa to dominate the system. The seasonal changes that occur during ice cover could select against these warm-adapted microbes and may allow for more access to the available nutrients, thus enabling the proliferation of a more diverse community. Alternatively, this increased diversity could be explained by the persistence of a large number of dormant microbes in this system. Our work was done on the level of DNA, which often does not represent the actively metabolizing microbes in the system28. Additionally, there are studies that have suggested that microbial dormancy can be a mechanism by which microbial diversity is maintained in low nutrient environments29. In Jones and Lennon (2010), a dormant taxon is considered to be one which is detectable on the DNA level, but is not detectable at the RNA level. It is therefore possible that the increased diversity reflects an expanding number of detectable but dormant taxa that are now observable due to the decrease in the dominant taxa found in warmer months. Previous work has demonstrated that the percent of dormant organisms in a lake was in part affected by the nutrient loads29. This previous work demonstrated that, in some nutrient-poor systems, up to 40% of the taxon richness is derived from dormant cells. However, we did not measure the active community in these samples and thus cannot confirm that this observed increase in diversity is indeed due to dormancy. Furthermore, the dynamic nature of the Keweenaw Waterway leads to mixing of water from Lake Superior and Portage Lake. While it is possible that this mixing could contribute to some of the increased diversity, the stability of this elevated diversity over many months suggests that this is not a local or ephemeral phenomenon impacted by transient changes in mixing and water movement. Additionally, Churchill et al. (2004) observed little mixing during times of ice cover, which supports the concept that the microbial community compositional changes are stable and not impacted by wind-driven mixing23. Therefore, it is possible that this increased diversity could be due to a number of factors including decreased grazing, competition, or dormancy, which should be further investigated in future studies.

To parse out the differences observed in the overall community, differentially abundant operational taxonomic units (OTUs) were identified. A number of taxa were shown to be differentially abundant between ice-free and ice-covered periods. Many of the taxa that showed the highest fold change during times of ice cover were related to microbes known to be involved in the nitrogen cycle. In particular, sequences related to microbes involved in the process of nitrification were enriched during times of ice cover. Members of the Nitrospirales, Nitrosomonadales, and Thaumarchaeota were all significantly enriched in the ice-covered conditions. A recent study suggested that ice duration drives accumulation of nitrate in north temperate lakes30. This increased nitrate was proposed to be associated with nitrification, which could decrease oxygen levels in deeper waters under ice31. In our study, we observed an increase in sequences related to nitrifying bacteria in the surface water below the ice. However, we did not measure nitrification as part of this study. While other studies have shown increased nitrification under ice, more work is needed to confirm the role of these organisms related to nitrifiers in nitrogen cycling under ice in the Keweenaw Waterway. Our work only examined the microbial community in the surface water using 16S rRNA gene sequencing. We did not observe decreased oxygen levels, which may be more prevalent in deeper waters. It would be important to follow up this work to link these putative nitrifiers with the increased rates of nitrification observed in other under-ice lake systems.

Conclusion

This study aimed to understand how microbial abundance and diversity change over drastic seasonal transitions, and how ice cover affects microbial abundance and diversity. Molecular techniques and next generation sequencing provided a detailed survey of the microbial community in the Keweenaw Waterway through a period of ice cover. Our results suggest that ice formation may cause shifts in the microbial community composition. The increase in diversity during times of ice cover suggests that the conditions during ice formation may be a strong constraint on the growth of some species and in turn remove the constraint on the growth of other species. Additionally, an increase in sequences related to taxa involved in nitrification during times of ice cover suggests that ice cover may impact biogeochemical cycling, particularly the nitrogen cycle. A better understanding of the impact of ice cover on biogeochemical cycling will be important for characterizing the impact of changes in the extent and duration of lake ice cover on biogeochemical fluxes. In the long term there is a need to characterize the temporal changes in microbial community composition and biogeochemical cycling in order to better understand how seasonal variations and ice cover impact ecosystem function.

Materials and Methods

Water Sampling

Triplicate water samples were collected five days per week from the surface of the Keweenaw Waterway. During times of ice cover, these samples were collected from directly below the ice. During sampling, in situ environmental variables were measured with a YSI proDSS sonde which measured temperature, dissolved oxygen (DO), conductivity, Total Dissolved Solids (TDS), ORP, and pH (Supplemental Table 1). Samples were collected from 2 November 2015 to 20 April 2016. Water samples were immediately transported to the lab (<10 minutes). Samples for microbial community analysis were collected by filtering 600 ml of water through a 0.2 µm PES filter, using a vacuum pump system. Filters were immediately stored at −80 °C.

DNA Extraction

DNA was extracted from half of each filter using a modified Miller protocol32. Half of each filter was placed into a Lysing Matrix E tube (MP Biomedicals) along with 300 µl of Miller Phosphate buffer, 300 µl of Miller SDS lysis buffer, and 600 µl of Phenol:Chloroform:Isoamyl alcohol (25:24:1). A process blank was also set up and subjected to all following steps of extraction without a filter. The tubes were homogenized in a FastPrep-24 bead-beater (MP Biomedicals) for 45 seconds at a speed of 5.5 m/s. To remove cell debris and filter material, the tubes were centrifuged at 10,000 × g for 5 minutes. 600 µl of aqueous supernatant was transferred to a new 2 ml tube and extracted with one volume of chloroform. Tubes were centrifuged at 10,000 × g for 5 minutes, then the aqueous phase was kept. Purification and concentration of recovered DNA was performed by adding two volumes of MoBio solution C4 to the aqueous phase. This was passed over a MoBio spin filter. The spin filters were then washed with 400 µl of MoBio C5 solution. Residual C5 was removed by centrifuging the empty tubes at 10,000 × g for two minutes. DNA was eluted by two 30 µl additions of MoBio C6 solution. Final eluted environmental DNA was stored in a −80 °C freezer. DNA concentrations were determined using a NanoDrop spectrophotometer.

qPCR

The abundance of bacterial and archaeal 16S rRNA genes was determined using qPCR. qPCR was performed on a StepOnePlus instrument (Applied Biosystems, Foster City, CA). Six-point standard curves were performed in triplicate with concentrations ranging from 2 × 10⁻⁴ pM to 20 pM. The equation for the standard curve to convert CT to pM of 16S rRNA genes was y = 16736 e^(−0.515x), where x is the CT value and y is the concentration in pM. The R² of our standard curves was 0.97. Environmental DNA was diluted 1:10 to limit the impact of inhibitors from environmental DNA. For each sample, 1 µl of the diluted environmental DNA was used in a 20 µl qPCR reaction. The copy numbers of bacterial and archaeal 16S rRNA genes in environmental DNA were determined in duplicate for each sample. For bacterial 16S rRNA gene quantification, Bact341 and Uni519R primers were used according to the protocol described in Jorgenson et al.33. Archaeal 16S rRNA abundance was determined using Uni519R and Arch908R as described in Jorgenson et al.33. Standards and PCR reaction setup were similar to those described in Techtmann et al.34. Normality of the qPCR data was tested with the Shapiro-Wilk test for normality as implemented in R35. Since the data were not normally distributed, the Kruskal-Wallis one-way analysis of variance was used to determine if there was a significant difference in the microbial abundances between ice-free and ice-covered periods.
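The standard-curve conversion described above can be illustrated with a short sketch. It applies the reported equation y = 16736 e^(−0.515x) to a CT value and then converts pM to molecules per µl using Avogadro's number. The function names are hypothetical and chosen for illustration only. Scaling the result to copies per ml of lake water additionally depends on the template dilution, elution volume, and volume of water filtered, which are not modeled here.

```python
import math

AVOGADRO = 6.02214076e23  # molecules per mole


def ct_to_pm(ct, a=16736.0, b=-0.515):
    """Convert a qPCR threshold cycle (CT) to pM using y = a * exp(b * ct).

    The defaults are the coefficients reported for the standard curve.
    """
    return a * math.exp(b * ct)


def pm_to_copies_per_ul(pm):
    """Convert a concentration in pM to molecules (gene copies) per microliter.

    1 pM = 1e-12 mol/L = 1e-18 mol/µl.
    """
    return pm * 1e-18 * AVOGADRO


if __name__ == "__main__":
    for ct in (15.0, 20.0, 25.0):  # illustrative CT values, not measured data
        pm = ct_to_pm(ct)
        print(f"CT {ct:4.1f} -> {pm:.4g} pM -> {pm_to_copies_per_ul(pm):.4g} copies/µl")
```

Because the curve is a decaying exponential, each additional cycle lowers the estimated concentration by a constant factor of e^(−0.515), roughly 0.6.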

16S rRNA sequencing

The V4 and V5 hypervariable regions of the 16S rRNA gene were amplified using Phusion DNA polymerase (Thermo Scientific, Waltham, MA) with universal primers 515F-Y and 926R36. These primers are able to amplify bacterial and archaeal 16S rRNA genes as well as many eukaryotic 18S rRNA genes. The 16S rRNA gene was amplified for 25 cycles. The amplicon was purified using AxyPrep PCR Clean up magnetic beads (Axygen). Index sequences and sequencing adapters were added to each sample by an additional 8 cycle PCR as described in the Illumina 16S rRNA Metagenomic Library Preparation protocol. Sequencing was performed on the Illumina MiSeq according to the Illumina protocol for amplicon sequencing. The resulting DNA sequences were analyzed using the QIIME version 1.9.0-dev pipeline37. Paired-end raw reads were assembled using fastq-join38. The assembled sequences were demultiplexed and quality filtered in QIIME to remove reads with Phred scores below 20 (-q 19). Chimera detection was then performed on assembled reads using VSEARCH39. The taxonomy for each read was assigned using RDP classifier40 retrained with SILVA release 123. Data were rarefied to the lowest number of sequences in a sample (3052). Alpha diversity was calculated using the alpha_diversity.py command in QIIME. Faith’s Phylogenetic Diversity, Observed Species, and Shannon diversity were calculated from the rarefied OTU table. Weighted and unweighted UniFrac distances as well as Bray-Curtis dissimilarity were calculated from the rarefied OTU table using the beta_diversity.py command in QIIME. Raw sequencing reads have been deposited in the SRA as accession number SRP159623.
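To make the rarefaction and alpha diversity steps concrete, the following is a minimal sketch in plain Python rather than the QIIME commands used in the study. It subsamples a sample's reads without replacement to a fixed depth, then computes Observed Species and Shannon diversity. The function names, the toy counts, and the use of log base 2 for Shannon are assumptions for illustration. Faith's Phylogenetic Diversity is omitted because it requires a phylogenetic tree.

```python
import math
import random
from collections import Counter


def rarefy(counts, depth, seed=0):
    """Subsample reads without replacement to `depth` reads.

    counts: dict mapping OTU id -> read count for one sample.
    """
    pool = [otu for otu, n in counts.items() for _ in range(n)]
    if len(pool) < depth:
        raise ValueError("sample has fewer reads than the rarefaction depth")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    return dict(Counter(rng.sample(pool, depth)))


def observed_species(counts):
    """Number of OTUs with at least one read."""
    return sum(1 for n in counts.values() if n > 0)


def shannon(counts, base=2.0):
    """Shannon diversity H = -sum(p * log(p)) over OTUs with nonzero counts."""
    total = sum(counts.values())
    props = [n / total for n in counts.values() if n > 0]
    return -sum(p * math.log(p, base) for p in props)


if __name__ == "__main__":
    sample = {"otu_a": 50, "otu_b": 30, "otu_c": 20}  # toy counts, not study data
    sub = rarefy(sample, depth=40)
    print(sub, observed_species(sub), round(shannon(sub), 3))
```

Rarefying every sample to the same depth (3052 reads in this study) means that richness estimates are compared at equal sequencing effort.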

Statistical Analysis on 16S rRNA sequencing data

Normality of the alpha diversity data was tested with the Shapiro-Wilk test. The Shapiro-Wilk test for normality of the Faith’s Phylogenetic Diversity returned a p-value of less than 2.2 × 10⁻¹⁶, which indicates that the data are not normally distributed. Since the Faith’s Phylogenetic Diversity data were not normally distributed, we chose to use a Kruskal-Wallis test to test the hypothesis that there were significant differences in the alpha diversity between ice-free and ice-covered samples. To determine if there was a significant difference in the alpha diversity between months, a Kruskal-Wallis test was performed comparing differences in the Faith’s Phylogenetic Diversity between months. A Dunn test was used as a post-hoc test to determine between which months there was a statistically significant difference. The Dunn test was performed using the dunn.test package in R41. P-values for the Dunn test were adjusted for multiple comparisons using the Bonferroni method.
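For readers unfamiliar with the Kruskal-Wallis test, the sketch below shows how the H statistic is computed from ranks for the two-group case (for example, ice-free versus ice-covered). It is a plain-Python illustration, not the R implementation used in the study, and the function name and example values are hypothetical. With two groups there is one degree of freedom, so the chi-squared p-value can be written with the complementary error function.

```python
import math
from collections import Counter


def kruskal_wallis_two_groups(x, y):
    """Return (H, p) for a Kruskal-Wallis test on two groups.

    Ties receive average ranks and H is tie-corrected. With two groups,
    df = 1 and the chi-squared survival function is erfc(sqrt(H / 2)).
    Raises ZeroDivisionError if every observation is identical.
    """
    data = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    n = len(data)
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and data[j + 1][0] == data[i][0]:
            j += 1
        avg_rank = (i + j) / 2.0 + 1.0  # average of 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[k] = avg_rank
        i = j + 1

    rank_sums = [0.0, 0.0]
    for (_, group), rank in zip(data, ranks):
        rank_sums[group] += rank
    sizes = [len(x), len(y)]

    h = 12.0 / (n * (n + 1)) * sum(rs * rs / m for rs, m in zip(rank_sums, sizes))
    h -= 3.0 * (n + 1)

    tie_counts = Counter(v for v, _ in data)
    correction = 1.0 - sum(t ** 3 - t for t in tie_counts.values()) / float(n ** 3 - n)
    h = max(h / correction, 0.0)

    p = math.erfc(math.sqrt(h / 2.0))
    return h, p


if __name__ == "__main__":
    ice_free = [1.0, 2.0, 3.0]      # toy values, not study data
    ice_covered = [4.0, 5.0, 6.0]
    print(kruskal_wallis_two_groups(ice_free, ice_covered))
```

Because the test operates on ranks, it does not assume normality, which is why it suits data that fail the Shapiro-Wilk test.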

To visualize differences in the community structure between ice states and between months, non-metric multidimensional scaling (NMDS) was performed on the weighted UniFrac distance matrix using the metaMDS command implemented in the vegan package42. The lowest stress configuration of 50 iterations was chosen. NMDS analysis with unweighted UniFrac distances was also performed and is included in Supplementary Fig. 7. Environmental variables were fit to the NMDS plots using the envfit function in vegan. Factors that significantly fit the data were plotted as vectors. PERMANOVA analysis was used to identify if there were significant differences in community structure between ice states and months using the adonis function in the vegan package with 999 permutations42. To further understand the role of environmental variables in explaining the data, a CCA was performed on the weighted UniFrac distances. Co-linear environmental factors were removed. The role of temperature, conductivity, TDS, and dissolved oxygen was tested using the CCA. The CCA plot is shown in Supplementary Fig. 6. Redundancy analysis (RDA) was used to understand the amount of variance explained by ice cover and the other environmental variables. RDA was performed in R using the vegan package. The rarefied OTU table was normalized using the Hellinger method. RDA was then performed against ice cover or a table of scaled environmental variables. The R² was then adjusted using the RsquareAdj function in vegan.
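Two of the steps above are simple enough to sketch directly. The Hellinger method takes the square root of each OTU's relative abundance within a sample, and the adjusted R² penalizes R² for the number of predictors. The code below is a plain-Python illustration with hypothetical function names, not the vegan implementation. The adjustment shown is the standard Ezekiel formula, which is assumed here to be the form that applies to the RDA results.

```python
import math


def hellinger(row):
    """Hellinger-transform one sample: sqrt(count / sample total) per OTU."""
    total = sum(row)
    if total == 0:
        return [0.0 for _ in row]
    return [math.sqrt(c / total) for c in row]


def adjusted_r2(r2, n_samples, n_predictors):
    """Ezekiel adjustment: 1 - (1 - R2) * (n - 1) / (n - m - 1)."""
    return 1.0 - (1.0 - r2) * (n_samples - 1) / (n_samples - n_predictors - 1)


if __name__ == "__main__":
    otu_row = [1, 3, 0, 12]  # toy counts for one sample, not study data
    print([round(v, 3) for v in hellinger(otu_row)])
    print(round(adjusted_r2(0.5, n_samples=11, n_predictors=1), 4))
```

After the transform, the squared values in each sample sum to 1, which down-weights highly abundant OTUs before the constrained ordination.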

To determine how ice cover impacted rates of change in community composition, a time-decay analysis was performed. This approach was used to compare the rates of community composition change during times of ice cover with those in ice-free conditions. Weighted UniFrac distances were determined for each pair of samples. The log of these pairwise dissimilarities was then plotted against the time between the two samples in each comparison, and a regression line was generated for each case. Rates of change can be determined from the slopes of these regression lines. ANCOVA analysis was performed to determine if the differences in rates of change were significant between ice-free and ice-covered conditions.
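The time-decay calculation can be sketched as follows. For every pair of samples, the log of the pairwise distance is regressed on the number of days separating the samples, and the slope is taken as the rate of community change. This is a plain-Python illustration with a hypothetical function name and synthetic distances. The natural log is an assumption, since the base of the logarithm is not stated.

```python
import math


def time_decay_slope(days, distances):
    """Least-squares fit of ln(distance) against days between samples.

    days: sampling day for each sample.
    distances: square matrix of pairwise distances (e.g. weighted UniFrac).
    Returns (slope, intercept). Pairs with zero distance are skipped.
    """
    xs, ys = [], []
    n = len(days)
    for i in range(n):
        for j in range(i + 1, n):
            d = distances[i][j]
            if d > 0:
                xs.append(abs(days[j] - days[i]))
                ys.append(math.log(d))
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


if __name__ == "__main__":
    days = [0, 1, 2, 3]
    # Synthetic distances that grow exponentially with time between samples.
    dist = [[0.1 * math.exp(0.2 * abs(a - b)) if a != b else 0.0 for b in days]
            for a in days]
    print(time_decay_slope(days, dist))
```

A steeper slope for one set of samples than another indicates faster community turnover, which is what the ANCOVA comparison between ice states evaluates.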

A finer scale analysis on the level of OTUs was undertaken to examine how particular OTUs are affected by ice state. The MetagenomeSeq package24,25, as implemented in QIIME, was used to identify which OTUs were differentially abundant between ice states. This analysis was done on the non-rarefied OTU table. OTUs that were significantly different (corrected p-value < 0.05) in one state were considered to be differentially abundant.