Introduction

Molecular biology approaches have proven useful for describing the diversity and gene content of microorganisms in Antarctic lakes and for inferring the functional roles of the taxa present (Laybourn-Parry and Pearce, 2007; Wilkins et al., 2013b). However to date, only a few large-scale shotgun metagenome studies have been performed on the Antarctic continent and in the surrounding Southern Ocean (reviewed in Wilkins et al., 2013b). In the Vestfold Hills, metagenomics and metaproteomics have been used to study Ace Lake (Ng et al., 2010; Lauro et al., 2011) and Organic Lake (Yau et al., 2011).

Owing to the polar light cycle and low overall levels of photosynthetically active radiation, phototrophic growth and biomass production are restricted, being relatively high in summer and negligible in winter (Laybourn-Parry et al., 2005). In Ace Lake, phototrophic algae, particularly phytoflagellates, engage in mixotrophy, thereby supplementing their carbon and nutrient requirements to enable them to remain active during winter and poised to photosynthesize in summer (Laybourn-Parry et al., 2005). Resourcefulness has also been demonstrated by heterotrophic marine bacteria that exploit light energy, either directly through photoheterotrophic processes involving aerobic anoxygenic photosynthesis (AAnP), or indirectly through lithoheterotrophic processes utilizing inorganic compounds (for example, CO) formed from dissolved organic carbon reacting with light (Moran and Miller, 2007).

Organic Lake is located in the Vestfold Hills, on the eastern shore of Prydz Bay, East Antarctica (Supplementary Figure S1). It is shallow (6.8 m) and has variable surface water temperatures (−14 to +15 °C) while remaining sub-zero throughout most of its depth (Franzmann et al., 1987b; Gibson et al., 1991; Roberts et al., 1993; Gibson, 1999). The lake has a high organic load generated from autochthonous production, and input from penguins and terrestrial algae (for example, in soaks, shallow ephemeral streams and depressions), and nutrient turnover is expected to be slow due to the constraints imposed on microbial activity by the lake’s hypersalinity (230 g l−1 maximum salinity) and low temperature (Franzmann et al., 1987b; Gibson et al., 1991; Roberts et al., 1993; Gibson, 1999). The salt and marine biota in the lake originate from seawater that was trapped in a basin 3000 years BP when the continental ice sheet receded and the land rose above sea level (Bird et al., 1991; Zwartz et al., 1998; Gibson, 1999). The bottom water of Organic Lake is unusual due to the absence of hydrogen sulfide and the high concentration of the gas dimethylsulfide (DMS), dimethylsulfoniopropionate (DMSP) and polysulfides (Deprez et al., 1986; Franzmann et al., 1987b; Gibson et al., 1991; Roberts and Burton 1993; Roberts et al., 1993). Concentrations of DMS as high as 5000 nM have been recorded in Organic Lake (Gibson et al., 1991), 100 times the maximum concentration recorded from seawater in the adjacent Prydz Bay and at least 1000 times that of the open Southern Ocean (Curran and Jones, 1998).

While it has been 40 years since DMS was identified as a carrier of sulfur from the oceans to the atmosphere (Lovelock and Maggs, 1972) and later proposed to have a regulatory effect on the global cloud cover (Charlson et al., 1987), the genes that encode for the enzymes involved in DMS production were only identified in the last 6 years (Todd et al., 2007). Rapid progress has been made in the short period since then, and the pathways and organisms involved in DMS transformations have been extensively reviewed (Johnston et al., 2008; Schäfer et al., 2010; Curson et al., 2011; Reisch et al., 2011; Moran et al., 2012). The main source of DMS in the marine environment is from the breakdown of DMSP. Eucaryal phytoplankton, in particular sea-ice diatoms, and dinoflagellates (for example, Prymnesiophytes such as Phaeocystis spp.) and haptophytes, produce DMSP, an organosulfur compound that is thought to function principally as an osmolyte (Yoch, 2002; Stefels et al., 2007; Hatton et al., 2012). DMSP is released due to cell lysis, grazing or leakage and follows two known fates: DMSP cleavage by DMSP lyases (DddD, -L, -P, -Q, -W and -Y) or demethylation by DMSP demethylase (DmdA). Both pathways are associated with diverse microorganisms that can utilize DMSP as a sole carbon and energy source. However, it is only the cleavage pathway that releases volatile DMS that can lead to sulfur loss through ventilation to the atmosphere.

The very high levels of DMS in Organic Lake make it an ideal system for identifying the microorganisms and the processes involved in DMS accumulation. The previous metagenome study of Organic Lake examined viruses from the 0.1-μm fraction of surface water that was collected from Organic Lake in December 2006, and November and December 2008 (Yau et al., 2011). In the present study, we focused on the cellular population rather than viruses in these samples. Our study determined the composition and functional potential of Organic Lake microbiota and, in conjunction with historic and contemporary physicochemical data, generated an integrative understanding of the whole-lake ecosystem.

Materials and methods

Metagenomic analyses (2.4 Gbp) were performed on biomass captured by sequential filtration through a 20-μm pre-filter onto 3.0, 0.8 and 0.1 μm filters based on the design of the Global Ocean Sampling expedition (Rusch et al., 2007). Water was sampled (including for epifluorescence microscopy, and physical and chemical measurements) from a depth profile (1.7, 4.2, 5.7, 6.5 and 6.7 m) taken in November 2008 (Supplementary Figures S1–S3 and Supplementary Tables S1–S3), and cellular diversity and functional potential were determined based on approaches developed for studying Antarctic aquatic microbial communities (Ng et al., 2010; Lauro et al., 2011; Yau et al., 2011; Brown et al., 2012; Wilkins et al., 2013a; Williams et al., 2013). Full details of Materials and methods are provided in Supporting Information.

Results and discussion

Abiotic properties and water column structure

In situ physicochemical profiles (Supplementary Figure S2) were measured at the deepest point in the lake (Supplementary Figure S3), and samples were collected from the upper mixed (1.7, 4.2 and 5.7 m) and suboxic deep (6.5 and 6.7 m) zones (Figure 1a). All nutrients, except for nitrate and nitrite, reached maximum concentrations at 6.5 m (Table 1), suggesting a layer of high biological activity above the lake bottom. Consistent with this, cell and virus-like particle counts were highest at 6.5 m. However, turbidity was lowest at this depth, demonstrating that turbidity was not principally determined by cell density (Figure 1b). Microscopy images did not show a shift in cell morphology that could account for the large drop in turbidity (Supplementary Figure S4), which suggests that particulate matter primarily contributed to turbidity readings. The low turbidity and peak in cell counts and nutrients at the oxycline at 6.5 m may be caused by an active microbial community degrading particulate matter. This inference is supported by the report of high concentrations of dissolved organic acids and free amino acids in the deep zone (Gibson et al., 1994), as these nutrients are indicative of the breakdown of high molecular weight carbohydrates, lipids and proteins. Furthermore, the C:N and C:P ratios throughout the lake were high compared with the Redfield ratio (Redfield et al., 1963) except at 6.5 m, indicating that this was the only depth where dissolved nitrogen and phosphorus were not relatively limited (Table 1). Principal component analysis of physicochemical parameters showed all samples, except the 6.5 m sample, separated with depth along the principal component 1 axis (Supplementary Figure S5). Accordingly, turbidity, total sulfur and cell density were the strongest explanatory variables for the separation of the 6.5 m sample from the other deep sample, indicating that increased activity at 6.5 m was related to the breakdown of particulate matter and sulfur chemistry.

Figure 1
figure 1

Vertical structure of Organic Lake. (a) Parameters that varied unimodally with depth showed two zones: an aerobic mixed zone above 5.7 m and a denser suboxic deep zone below. (b) Additional factors that revealed stratification within the deep zone. The peak in concentration at 6.5 m for ammonia was also observed for all other nutrients assayed except nitrate and nitrite (see Table 1). σT=(1000−density); cond, conductivity; DO, dissolved oxygen; turb, turbidity.

Table 1 Physicochemical properties, cell counts and VLP counts of Organic Lake samples

Variation of microbial composition according to size and depth

SSU rRNA genes (3959 reads) that were retrieved from the metagenome data grouped into 983 operational taxonomic units (OTUs). OTUs for Bacteria comprised 76.2% and, Eucarya 16.3%, whereas 7.5% of SSU rRNA gene sequences could not be classified (Figure 2). Only two reads were assigned to a deep-sea hydrothermal clade of Halobacteriales (Supplementary Table S4), indicating that Archaea were rare in Organic Lake.

Figure 2
figure 2

Diversity of Bacteria and Eucarya in Organic Lake. (a) Bacteria and (b) Eucarya from each size fraction (0.1, 0.8 and 3.0 μm) at each sample depth (1.7, 4.2, 5.7, 6.5 and 6.7 m) aggregated according to class. The x-axis shows counts of SSU rRNA gene sequences normalized to average reads acquired per sample filter. Taxa that belong to the same higher rank are shown grouped with a square bracket in the legend. Abundant taxa are labeled in the plot with a number that corresponds to the numbered boxes in the legend. (c) Composition of abundant bacterial classes across all size fractions combined. See Supplementary Table S4 for lower taxonomic rank assignments.

Community composition varied with size fraction and depth. This was supported by seriation analysis that showed samples clustered according to size fraction, and those clusters further separated into upper mixed and deep-zone groups (Figure 3). A significant difference in genus-level composition between the upper mixed and deep zones was supported by the analysis of similarity test (ρ: 0.53, significance: 0.1%). Taxa were differentially distributed consistent with vertical, physical and chemical stratification, indicating that ecological functions were partitioned in the lake.

Figure 3
figure 3

Heatmap and biclustering plot of the SSU rRNA gene composition in Organic Lake. Samples are shown according to size fractionate (3.0, 0.8 and 0.1 μm) and depth (1.7, 4.2, 5.7, 6.5 and 6.7 m). SSU rRNA genes were classified to the lowest taxonomic rank that gave bootstrap confidence>85% until the rank of genus. SSU rRNA gene counts were normalized and square root transformed. Taxa that comprised <2% of the sample were not included.

20–3.0-μm fraction community composition

The upper mixed zone samples had a relatively high OTU abundance of Dunaliella chloroplasts and chlorophyte algae, consistent with the large, active photosynthetic organisms concentrated near surface light. They are likely the main source of primary production in Organic Lake and have previously been reported to be the dominant algae (Franzmann et al., 1987b). The SSU rRNA gene sequences for these algae at the bottom of the lake are likely to be due to sedimentation of dead cells or resting cysts.

Psychroflexus OTUs were over-represented in the surface and 6.7 m samples. Consistent with enrichment on the 3.0 μm filters, Psychroflexus (formerly Flavobacterium) gondwanensis (Bowman et al., 1998) isolated from Organic Lake (Franzmann et al., 1987b) had cells 1.5–11.5 μm in length (Dobson et al., 1991). Flavobacteria associate with phytoplankton blooms in the Southern Ocean (Abell and Bowman 2005a, 2005b; Williams et al., 2013) and have specialized abilities to degrade polymeric substances from algal exudates and detritus (reviewed in Kirchman, 2002; Williams et al., 2013). It is likely that Organic Lake Psychroflexus fills a similar ecological role. In support of this, Psychroflexus OTUs cluster with Dunaliella chloroplasts in the seriation analysis (Figure 3) and P. gondwanensis abundance in Organic Lake has been correlated with average hours of sunshine per day, indicating population dynamics that are related to summer algal blooms (James et al., 1994). The Psychroflexus OTUs in the deep zone are most likely due to sedimentation, as P. gondwanensis is nonmotile and strictly aerobic (Dobson et al., 1991; Dobson et al., 1993).

Roseovarius OTUs were enriched at 4.2 and 6.5 m, suggesting that different ecotypes may be present in the upper mixed zone compared with the deep zone. Roseovarius tolerans, an isolate from Ekho Lake in the Vestfold Hills, Antarctica, has a cell size (1.1–2.2 μm; Labrenz et al., 1999) that would be expected to be captured on the 0.8-μm filter. The Roseovarius captured on the 3-μm filter may, therefore, be a different species, or a strain similar to R. tolerans from Ekho Lake that exhibits different growth characteristics (that is, larger cell size or forms aggregates). A strain of R. tolerans from Ekho Lake is capable of microaerobic growth (Labrenz et al., 1999). Over-representation at 6.5 m may, therefore, be indicative of growth at that depth rather than sedimentation because sinking cells would be more abundant close to the lake bottom at 6.7 m. Roseovarius OTUs cluster with Dunaliella chloroplast and Psychroflexus OTUs in the seriation analysis (Figure 3), suggesting that Organic Lake Roseovarius may be utilizing compounds released from algal-derived particulate matter, or made available by processing of complex organic matter by Psychroflexus. Roseovarius is a member of the Roseobacter clade, which is inferred to have an opportunistic ecology frequently associated with nutrient-replete plankton aggregates, including by-products of flavobacterial exoenzymatic attack (Moran et al., 2007; Teeling et al., 2012). Additionally, the diverse metabolic capabilities of the Roseobacter clade include DMSP degradation, AAnP and CO oxidation (reviewed in Wagner-Döbler and Biebl, 2006). All of these capabilities should facilitate growth in both the upper mixed and deep zones of Organic Lake (see Carbon resourcefulness in dominant heterotrophic bacteria below).

3–0.8-μm size fraction community composition

On the 0.8-μm filter, OTUs for Marinobacter dominated at all depths except 6.5 m. Their capture on this size fraction is consistent with the cell size of isolates (1.2–3 μm) (Gauthier et al., 1992). The genus is metabolically versatile, which likely permits it to occupy the entire water column. Marinobacter is heterotrophic and the genus includes hydrocarbon-degrading strains (e.g., Gauthier et al., 1992; Huu et al., 1999), although deep-sea metal-oxidizing autotrophs have also been reported (Edwards et al., 2003). Marinobacter isolates from Antarctic lakes are capable of anaerobic respiration using dimethylsulfoxide (DMSO) (Matsuzaki et al., 2006) or nitrate (Ward and Priscu, 1997). Analysis of functional potential linked to Marinobacter revealed additional metabolic capabilities potentially related to its dominance in Organic Lake (see Carbon resourcefulness in dominant heterotrophic bacteria and Molecular basis for unusual sulfur chemistry below).

OTUs for RF3 and Halomonas were over-represented at 6.5 m, and RF3 sequences were more abundant (Figures 2 and 3). Their relative abundance in the deep zone indicates a role in microaerobic processes. The majority of RF3 sequences retrieved to date are from anaerobic environments including mammalian gut (Tajima et al., 1999; Ley et al., 2006; Samsudin et al., 2011), sediment (Yanagibayashi et al., 1999; Röske et al., 2012), municipal waste leachate (Huang et al., 2005), anaerobic sludge (Chouari et al., 2005; Goberna et al., 2009; Rivière et al., 2009; Tang et al., 2011), a subsurface oil well head (Yamane et al., 2011), and the anaerobic zone of saline lakes (Bowman et al., 2000; Humayoun et al., 2003; Schmidtova et al., 2009). However, some members have been found in surface waters (Demergasso et al., 2008; Xing et al., 2009; Yilmaz et al., 2012) suggesting that not all members are strict anaerobes. Several Halomonas isolates have been sourced from Organic Lake including two described species Halomonas subglaciescola and H. meridiana, both of which grow as rods with dimensions consistent with capture on this size fraction (Franzmann et al., 1987a; James et al., 1990). Despite these isolates being aerobic, Halomonas has been reported to be enriched at the oxycline in Organic Lake (James et al., 1994) indicating Halomonas in the lake plays an ecological role in the suboxic zone. This capacity may be linked to the ability of free amino acids and organic acids (which are abundant in the deep zone) to stimulate the growth of isolates (Franzmann et al., 1987a).

0.8–0.1-μm size fraction community composition

A large number of eucaryal sequences were evident in the 0.1-μm size fraction. The upper zone was over-represented by OTUs for Pedinellales (silicoflagellate algae) that co-varied with chloroplasts (Figures 2 and 3). Pedinellales have only been detected in Antarctic lakes from molecular studies (Unrein et al., 2005; Lauro et al., 2011), including Organic Lake (Yau et al., 2011), and light microscopy studies of Antarctic Peninsula freshwater lakes reported 5–8-μm diameter cells resembling Pseudopedinella (Unrein et al., 2005). It is possible that in Organic Lake, small (0.8–0.1 μm) free-living members or chloroplast-containing cyst forms (Thomsen, 1988) exist. However, without evidence to support this (for example, by microscopy), it seems more likely that the lake sustains a relatively small number of active photosynthetic cells and the sequences detected arise from cysts or degraded cellular material.

OTUs for Candidatus ‘Aquiluna’, in the Luna-1 cluster of Actinobacteria, (Hahn et al., 2004; Hahn, 2009) were most abundant at 1.7 m. The genus has small cells (<1.2 μm; Hahn, 2009), accounting for their concentration on this size fraction. Although originally described in freshwater lakes, the same clade was detected in abundance in Ace Lake (Lauro et al., 2011) and surface Arctic seawater (Kang et al., 2012), demonstrating that they have ecological roles in polar saline systems. In Ace Lake surface waters, they were associated with utilization of labile carbon and nitrogen substrates (Lauro et al., 2011), and in Organic Lake surface waters, they probably perform similar functions. The presence of this clade in the deep zone implies a facultative anaerobic lifestyle or sedimented cells.

The bottom of the water column was distinguished by the presence of OTUs for candidate divisions OD1 and TM7. OD1 was more abundant, and its prevalence on this size fraction is consistent with similar findings for size fractionation of ground water (Miyoshi et al., 2005). OD1 is consistently associated with reduced, sulfur-rich, anoxic environments (Harris et al., 2004; Elshahed et al., 2005). OD1 from Zodletone Spring, Oklahoma, was reported to possess enzymes related to those from anaerobic microorganisms (Elshahed et al., 2005). Genomic analyses identified OTUs for OD1 in the anoxic zone of Ace Lake (Lauro et al., 2011). The distribution of OD1 in Organic Lake is consistent with an anaerobic metabolism and potential involvement in sulfur chemistry.

Organic Lake functional potential

To determine the potential for functional processes in Organic Lake, gene markers for carbon, nitrogen and sulfur conversions (Figure 4) were retrieved from metagenomic reads. BEST analysis, which examines at all the abiotic variables in combination and finds a subset sufficient to ‘best’ explain the biotic structure, showed that variation in the population structure was significantly correlated (ρ: 0.519, significance: 0.3%) with the abiotic parameters, dissolved oxygen, temperature, total sulfur and total nitrogen. The dissolved oxygen gradient has an obvious effect of separating aerobic from anaerobic taxa, and allows oxygen-sensitive nitrogen and sulfur processes to occur in the deep zone.

Figure 4
figure 4

Carbon, nitrogen and sulfur cycles in Organic Lake. Vertical profiles of genetic potential for (a) carbon (b) nitrogen and (c) sulfur conversions for each size fraction. The y-axis shows sample depths (m) and the x-axis shows counts of marker genes normalized to 100 Mbp of DNA sequence. The 0.1, 0.8 and 3.0 μm size fractions are shown as blue, red and green, respectively. Counts for marker genes for the same pathway or enzyme complex were averaged and those from different pathways were summed. For marker gene descriptions see Supplementary Tables S2 and S3. AAnP, aerobic anoxygenic photosynthesis; rTCA, reverse TCA; WL, Wood-Ljungdahl; DNRA, dissimilatory nitrate reduction to ammonia; DSR, dissimilatory sulfate reduction; DMSO, dimethylsulfoxide; ASR, assimilatory sulfate reduction; DMSP, dimethylsulfoniopropionate.

Carbon resourcefulness in dominant heterotrophic bacteria

In both the upper mixed and deep zones, potential for carbon fixation was much lower than for degradative processes, indicating the potential for net carbon loss (Figure 4a). Potential for carbon fixation via the oxygen-tolerant Calvin cycle (Figure 4a) was originally assessed by presence of the marker genes ribulose-bisphosphate carboxylase oxygenase (RuBisCO) and phosphoribulokinase (prkB) (Hügler and Sievert, 2011). The majority of RuBisCO homologs were related to Viridiplantae (Table 2) supporting the ecological role of green algae as the principle photosynthetic organisms. RuBisCO was only associated with a small proportion of Gammaproteobacteria (Table 2), principally from sulfur-oxidizing Thiomicrospira, indicating some Gammaproteobacteria are autotrophs. However, the majority of prkB matched to Gammaproteobacteria (Table 2), predominantly Marinobacter. Although deep-sea, iron-oxidizing autotrophic members of Marinobacter have been isolated (Edwards et al., 2003), all genomes reported for Marinobacter have prkB, but lack RuBisCO genes. Across Marinobacter genomes, the prkB homolog is consistently adjacent to a gene for a putative phosphodiesterase, suggesting that the enzymes expressed by these genes may be involved in pentose phosphate metabolism unrelated to carbon fixation. Albeit exceptional, this decoupling of prkB from RuBisCO involved in carbon fixation (forms I and II), also observed in Ammonifex (Hügler and Sievert, 2011), undermines the utility of prkB as a marker gene for the Calvin cycle within certain groups. Thus, there is no evidence for autotrophy in Organic Lake mediated by Marinobacter.

Table 2 Contribution of different taxonomic groups to counts of marker genes involved in carbon, nitrogen and sulfur conversions

Evidence for carbon fixation via the reverse tricarboxylic acid (rTCA) cycle was also indicated (Figure 4a), with genes for ATP citrate lyase (aclAB) linked to sulfur-oxidizing Epsilonproteobacteria (Table 2). In general, the rTCA cycle is restricted to anaerobic and microaerobic bacteria (Hügler and Sievert, 2011), which is consistent with the detection of Epsilonproteobacteria in the lake bottom where oxygen is lowest, and the microaerophilic/anaerobic metabolisms characteristic of the group (Campbell et al., 2006). Anaerobic carbon fixation was represented by potential for the Wood-Ljungdahl (or reductive acetyl-CoA) pathway (Figure 4a). Wood-Ljungdahl-mediated carbon fixation, for which CO dehydrogenase/acetyl-CoA synthase is the key enzyme, was linked to Firmicutes and Deltaproteobacteria that are known to grow autotrophically using this pathway (Hügler and Sievert, 2011).

Potential for carbon loss by respiration was indicated by an abundance of cytochrome c oxidase genes (coxAC) throughout the water column. In the deep zone, potential for fermentation was greatest at 6.5 m (Figure 4a) and likely the main biological activity that was occurring at that depth. Fermentation was indicated by the marker gene lactate dehydrogenase (ldh). These genes were linked to Firmicutes (Table 2), which was only present at 6.5 m and represented by the classes Clostridia and Bacilli (Figure 2a). As the related candidate division RF3 (Tajima et al., 1999) also has relatively high abundance in this zone (Figure 2a) (see 0.8–3.0-μm size fraction community composition above), there is circumstantial evidence that RF3 possesses a fermentative metabolism and may, therefore, have an important ecological role in Organic Lake by degrading high molecular weight compounds to organic acids that other organisms can utilize. Assimilation of fermentation products appears to have a greater role in Organic Lake rather than complete anaerobic oxidation involving methanogens or sulfate-reducing bacteria; the former were absent and the latter were present in low abundance (Figures 2a and c).

Alphaproteobacteria, predominantly Roseovarius (Figure 2c), were implicated in CO oxidation (Table 2), which is used to generate energy for lithoheterotrophic growth (Moran and Miller, 2007), although CO oxidation may also be involved in anaplerotic C fixation (Moran et al., 2007). The CO oxidation capacity was at a maximum at 6.5 m (Figure 4a), and therefore, associated with the putative deep-zone Roseovarius ecotype of Organic Lake. CO oxidation can function as a strategy to limit oxidation of organic carbon for energy so that a greater proportion can be directed towards biosynthesis (Moran and Miller, 2007).

Photosynthesis reaction center genes pufLM, involved in photoheterotrophy via AAnP, were abundant in Organic Lake (Figure 4a; Table 2). These were linked to the Roseobacter clade of Alphaproteobacteria (Table 2), major contributors to AAnP in ocean surface waters (Béjà et al., 2000; Béjà et al., 2002; Moran et al., 2007). This is consistent with the known metabolic potential of bacteriochlorophyll-a-producing R. tolerans from Ekho Lake (Labrenz et al., 1999). Photoheterotrophy can also be rhodopsin-dependent, with proteorhodopsins of marine Flavobacteria and Vibrio previously linked to light-dependent energy generation to supplement heterotrophic growth, particularly during carbon limitation (Gómez-Consarnau et al., 2007, 2010). However, the function(s) of rhodopsins are diverse, and PRs are also hypothesized to be involved in light or depth sensing (Fuhrman et al., 2008).

Rhodopsin genes were abundant in Organic Lake (Figure 4a), and were associated with all the dominant Organic Lake aerobic heterotrophic lineages (Supplementary Figure S6). Phylogenetic analysis revealed six well-supported Organic Lake rhodopsin groups (Supplementary Figure S6). All groups had an L or M residue at position 105 (versus the SAR86 proteorhodopsin), denoting tuning to surface green light (Man et al., 2003; Gómez-Consarnau et al., 2007), and is characteristic of oceanic coastal samples (Rusch et al., 2007). Four of the groups clustered with homologs of genera detected in the lake, namely Marinobacter, Psychroflexus, Octadecabacter and ‘Ca. Aquiluna’ (Supplementary Figure S6 and Supplementary Table S4). Another group (SAL-R group) originates from the sphingobacterium Salinibacter ruber, which produces xanthorhodopsin (Balashov et al., 2005); it is therefore likely that Organic Lake Sphingobacteria (Supplementary Table S4) were the origin of this rhodopsin group. The most abundant group (OL-R1; Supplementary Figure S6) had no close homologs from GENBANK, but it was abundant on the 3.0-μm fraction and has a distribution, suggesting that it originates from Organic Lake members of the Roseobacter clade (Figure 4a). All ORFs adjacent to OL-R1 rhodopsin-containing scaffolds were related to Octadecabacter, further supporting their Roseobacter clade provenance (Supplementary Figure S7). Genes downstream of OL-R1 were involved in carotenoid synthesis, indicating that OL-R1 is a xanthorhodopsin occurring as a retinal protein or in a carotenoid complex (Balashov et al., 2005).

Photoheterotrophic potential of Organic Lake was compared with other aquatic environments including nearby Ace Lake, Southern Ocean and Global Ocean Sampling expedition samples. The Organic Lake 0.1-μm fraction had the lowest rhodopsin counts and percentage of cells containing rhodopsin genes (estimated from the ratio of rhodopsin genes to the single copy of recA gene) of all size-matched samples surveyed (Table 3). Non-marine Global Ocean Sampling samples from the 0.1-μm fraction have been noted to have lower rhodopsin abundance (Sharma et al., 2008), which was similarly evident from our analysis (Table 3). In contrast, the 3.0-μm Organic Lake size fractions had higher rhodopsin counts than Ace Lake and comparable counts to the Southern Ocean samples, although the percentage of cells containing rhodopsin genes was still lower than that of the Southern Ocean. The paucity of rhodopsins in the Organic Lake 0.1-μm fraction is likely due to the lack of SAR11 clade, which is expected to be the main source of rhodopsin genes in Ace Lake and marine samples. This indicates that although Organic Lake has an overall lower frequency of rhodopsin genes compared with sites for which size fraction-matched metagenomes are available, the rhodopsins associated with larger or particle-associated cells are as abundant as in the marine environment.

Table 3 Counts of genes involved in DMSP catabolism and photoheterotrophy

Counts of pufLM genes in the Organic Lake 0.1-μm size fraction were similar to Global Ocean Sampling samples, except for Punta Cormorant hypersaline lagoon that had the highest pufLM counts and percentage of cells containing AAnP genes (Table 3). However, the highest overall counts of pufLM were from the 3.0-μm size fraction of Organic Lake, likely due to the high proportion of members of the Roseobacter clade. Notably, pufLM genes were not detected in high abundance in Ace Lake or the Southern Ocean samples, indicating that AAnP is a unique adaptation in Organic Lake among these polar environments. The similarly high abundance of pufLM genes in Punta Cormorant hypersaline lagoon indicates that AAnP may be advantageous in environments with salinity above marine levels.

The contribution of light-driven energy generation processes to the carbon budget is difficult to infer from genetic potential alone. For example, the relative abundance of AAnP and proteorhodopsin genes in Arctic bacteria has been reported to be the same in winter and summer (Cottrell and Kirchman, 2009). Furthermore, regulation of pigment synthesis is complex; for example, bacteriochlorophyll a expression in R. tolerans occurs in the dark but is inhibited by continuous dim light (Labrenz et al., 1999). However, it is possible that the apparent negative balance in carbon conversion potential could be ameliorated by photoheterotrophy performed by bacterial groups that are abundant in Organic Lake. In particular, the Organic Lake Psychroflexus could have a particular role, as it has a proteorhodopsin related to Dokdonia, which was shown to function under carbon limitation (Gómez-Consarnau et al., 2007). Furthermore, the detection of higher AAnP potential in Organic Lake than other aquatic environments, and the presence of taxa known to be capable of AAnP, suggests that AAnP may have a relatively important role in the carbon budget of Organic Lake.

Regenerated nitrogen is predominant in the nitrogen cycle

Nitrogen cycling potential throughout the lake was dominated by assimilation and mineralization/assimilation pathways (Figure 4b). Glutamate dehydrogenase (GDH) genes (gdhA) were abundant (Figure 4b), and linked predominantly to Alpha- and Gammaproteobacteria, and to a lesser extent Bacteroidetes (Table 2). However, the functional significance of the readily reversible GDH depends on its origin; Bacteroidetes are likely to use GDH in the oxidative direction for glutamate catabolism (Takahashi et al., 2000; Williams et al., 2013), whereas the use of GDH in the oxidative or reductive directions by Proteobacteria is likely to depend upon the available source of reduced nitrogen (ammonia versus amino acids). Glutamine synthetase (glnB) and glutamate synthase genes (gltBS) were predominantly linked to Alpha- and Gammaproteobacteria (Table 2), indicating the potential for high-affinity ammonia assimilation by these groups in Organic Lake. The high ammonia concentration in the deep zone (Figure 1b; Table 1) would result from a higher rate of mineralization (ammonification) than assimilation. This is consistent with abundant OTUs for Psychroflexus (Bacteroidetes) in this zone, and due to either turnover of organic matter or lysis of Bacteroidetes cells after sedimentation in anoxic water. In addition, the gene for ammonia-generating nitrite reductase (nrfA) was linked to Bacteroidetes and Planctomycetes (Table 2), indicating that ammonia may also be produced by these putative aerobic heterotrophs. Overall, the data suggest that ammonia is actively assimilated in the aerobic upper mixed zone, but is permitted to accumulate in the anaerobic deep zone.

Potential for nitrogen conversions typically found in other aquatic environments was greatly reduced in Organic Lake. There was a very low potential for nitrogen fixation that was confined to the deep zone (Figure 4b) and principally linked to anaerobic Epsilonproteobacteria (Table 2). This diazotrophic potential may not be realized by Epsilonproteobacteria, given the high ammonia concentration present in the deep zone. No ammonia monooxygenase genes (amoA) were detected. The potential for ammonia oxidation was only represented by hydroxylamine/hydrazine oxidase-like (hao) genes, which were in low abundance and linked to Deltaproteobacteria (Table 2). hao genes are present in non-ammonia-oxidizing bacteria (Bergmann et al., 2005), and those from Organic Lake belong to a family of multiheme cytochrome c genes present in sulfate-reducing Deltaproteobacteria that have no proven role in ammonia oxidation. In the genomes of sulfate-reducing Deltaproteobacteria, the hao gene is invariably situated adjacent to a gene for a NapC/NirT protein, which suggests a role in dissimilatory nitrate reduction. Collectively, these data indicate an inability for nitrification to occur in the upper mixed zone and no potential for ammonia loss in the deep zone.

Denitrification genes (norCB and nozB) and genes for nitrate assimilation (nasA) were present throughout the water column (Figure 4b), and were linked primarily to Gammaproteobacteria (Table 2). One hypothesis is that low nitrate and nitrite in the deep zone (Figure 1b Table 1) is the result of oxidized nitrogen being depleted by dissimilatory or assimilitory reduction by heterotrophic Gammaproteobacteria. Denitrification genes are phylogenetically widespread and usually induced by low oxygen or oxidized nitrogen species (Kraft et al., 2011), and thus expected to be active in the deep zone or oxycline. However, denitrification may be inhibited even if conditions appear appropriate. For example, in Lake Bonney, Antarctica, denitrification occurs in the west lobe, but not in the east lobe of the lake despite the presence of anoxia, nitrate and denitrifying Marinobacter species (Ward and Priscu, 1997; Ward et al., 2005). Moreover, in the absence of nitrification, denitrification and nitrate assimilation would be limited by the lack of potential to reform oxidized nitrogen. While exogenous input may augment the carbon and nitrogen budget (see Conclusion below), the preponderance of assimilation/mineralization pathways geared towards reduced nitrogen appears to reflect a ‘short circuit’ of the typical nitrogen cycle that would conserve nitrogen in a largely closed system. Therefore, the predominant nitrogen source is the regenerated fixed nitrogen. Similar findings were also made for Ace Lake, although in this system the presence of a dense layer of green sulfur bacteria with the potential to fix nitrogen augments the nitrogen cycle (Lauro et al., 2011).

Molecular basis for unusual sulfur chemistry

Several meromictic hypersaline lakes in the Vestfold Hills, including Organic Lake, with practical salinity greater than 150 are characterized by an absence of hydrogen sulfide and photoautotrophic sulfur bacteria (Burke and Burton, 1988). Although sulfate is present (Franzmann et al., 1987b), geochemical conditions of these lakes are not conducive to dissimilatory sulfur cycling between sulfur-oxidizing and sulfate-reducing bacteria typical of other stratified systems such as Ace Lake (Ng et al., 2010; Lauro et al., 2011). Consistent with this, potential for dissimilatory sulfate reduction represented by dissimilatory sulfite reductase (dsrAB) and adenylylsulfate reductase (aprAB) linked to sulfate-reducing Deltaproteobacteria (Table 2) was low in Organic Lake. Sulfate reduction potential was confined to the 6.7 m sample (Figure 4c) where oxygen concentration was lowest and Deltaproteobacteria were present (Figure 2a).

Capacity for oxidation of reduced sulfur compounds, represented by the sulfur oxidation multienzyme genes (soxAB), was present throughout the water column (Figure 4c) and linked primarily to Alpha- and Gammaproteobacteria (Table 2). Sulfur-oxidizing Gammaproteobacteria and Alphaproteobacteria are known to oxidize sulfur compounds, such as thiosulfate, aerobically. Although a small proportion of Gammaproteobacteria had the capacity for autotrophy (see Carbon resourcefulness in dominant heterotrophic bacteria), the majority of sulfur oxidizers were likely chemolithoheterotrophs as they were related to heterotrophic Marinobacter and Roseobacter clade. The sulfur dehydrogenase genes soxCD linked to Alpha- and Gammaproteobacteria were similarly present throughout the water column. soxCD are accessory components of the Sox enzyme system without which complete oxidation of thiosulfate cannot occur (Friedrich et al., 2005). Thus, the presence of soxCD indicates that complete oxidization likely occurs, although the different distribution of soxAB and soxCD in the water column (Figure 4c) suggests that a proportion of the community may lack soxCD and deposit sulfur. Sulfur-oxidizing Epsilonproteobacteria possessing soxAB genes (Table 2) were present only in the deep zone of Organic Lake (Figures 2a and c) and were related to autotrophic deep-sea sulfur-oxidizers, some members of which are capable of anaerobic sulfur oxidation using nitrate (Yamamoto and Takai, 2011). It is unlikely that appreciable sulfur oxidation occurs in the deep zone as the known terminal electron acceptors, oxygen and nitrate are depleted and the abundance of sulfur-oxidizing Epsilonproteobacteria is low (Figure 2a). Epsilonproteobacteria were also linked to a capacity for oxidation of sulfide to elemental sulfur by utilizing sulfide:quinone oxidoreductase (sqrA) (Figure 4a, Table 2). In this pathway, sulfur is released as polysulfides, which is a potential biological source of the abundant polysulfides that have been detected in the lake (Roberts et al., 1993).

It is likely that the limited anaerobic dissimilatory sulfur cycle contributes to the accumulation of DMS in Organic Lake in the deep zone. In the upper mixed zone, DMS could potentially be oxidized as a carbon and energy source or utilized as an electron donor by sulfur-oxidizing bacteria (Schäfer et al., 2010). In anoxic zones, methanogenic Archaea or sulfate-reducing bacteria are the main organisms known to break down DMS (Schäfer et al., 2010). As the genes that encode enzymes involved in DMS catabolism have not been identified, potential for DMS breakdown in the deep zone was assessed from the taxonomic composition. Methanogens and genes involved in methanogenesis were not detected, nor has methane been detected (Gibson et al., 1994), leaving sulfate reduction the most likely route of DMS catabolism. The low dissimilatory sulfate reduction potential in the deep zone coupled with the relatively stagnant water would likely minimize DMS oxidation and loss by ventilation. DMS would therefore be expected to accumulate in the deep zone if production rates were higher than breakdown.

To determine the source of high DMS in the bottom water of Organic Lake, the genes involved in DMS formation were surveyed (Supplementary Table S3). Genes for DMSP lyases dddD, dddL and dddP were detected in Organic Lake at levels comparable to other dominant processes such as respiration and fermentation (Figure 4c), indicating that DMSP is an important carbon and energy source in Organic Lake. The other known DMSP lyases, dddQ, dddW and dddY (Todd et al., 2012), were not detected. dddD was the most abundant of the Organic Lake DMSP lyases (Table 3) and comprised two main types: MAR-dddD and OL-dddD (Supplementary Figure S8). Neither of these types clustered with the non-functional Dinoroseobacter shibae DFL 12 and Ruegeria pomeroyi DSS-3 dddD homologs (Todd et al., 2011) or carnitine coenzyme A transferase outgroups, thereby providing support for their proposed role as functional DMSP lyases. The MAR-dddD type includes the Marinobacter sp. ELB17 dddD homolog, and MAR-dddD sequences were most abundant on the 0.8-μm fraction where Marinobacter OTUs were also abundant, indicating that MAR-dddD derives from Organic Lake Marinobacter (Supplementary Figure S8). OL-dddD had 83% identity to the dddD of Halomonas sp. HTNK1, but was designated as a separate type due to poor support for the Organic Lake dddD and Halomonas sp. HTNK1 dddD cluster (Supplementary Figure S8). Although Halomonas OTUs were detected in Organic Lake (Figure 3, Supplementary Table S4), they were only detected on the 0.1 and 0.8 μm size fractions, which does not correlate with the distribution of the OL-dddD genes (data not shown). The abundance of OL-dddD on the 3.0-μm fraction suggests that it originates from Alphaproteobacteria. OL-dddD containing contigs carried genes of mixed Alpha- and Gammaproteobacterial origin supporting its provenance from one of these classes and consistent with the ‘pick ‘n’ mix’ arrangement of genes found beside sequenced dddD regions (Johnston et al., 2008). Adjacent to OL-dddD was dddT (Supplementary Figure S9), a betaine/choline/carnitine transporter family protein that likely functions in DMSP import, demonstrating that OL-dddD forms an operon-like structure, similar to Halomonas sp. HTNK1 (Todd et al. 2010). DMSP lyase DddD converts DMSP to 3-OH-propionate and DMS. A dddA gene was also located next to dddD (Supplementary Figure S9); this encodes an alcohol dehydrogenase (DddA; originally annotated as ‘choline dehydrogenase’) for the conversion of 3-OH-propionate to malonate semialdehyde in the second step of DMSP catabolism (Todd et al., 2010).

Two dddL groups were detected in Organic Lake: SUL-dddL and MAR-dddL (Supplementary Figure S10). The former includes the Sulfitobacter sp. EE-36 dddL and the latter the Marinobacter manganoxydans MnI7-9 homolog, indicating that they originate from Roseobacter clade and Gammaproteobacteria, respectively. Sulfitobacter sp. EE-36 has demonstrated DMSP lyase activity and the dddL gene alone is sufficient for DMS generation (Curson et al., 2008). These data indicate that the Organic Lake members of the SUL-dddL group perform the same functional role. The MAR-dddL clade appears to be an uncharacterized branch of the dddL family. dddP was detected as the least abundant of the DMSP lyases (Table 3). Phylogenetic analyses showed that Organic Lake dddP likely originate from Roseovarius (Supplementary Figure S11). The Organic Lake sequences formed a clade with the functionally verified Roseovarius nubinhibens ISM dddP (Todd et al., 2009).

A single type of DMSP demethylase, dmdA, was identified. It clustered with Roseobacter clade dmdA (Supplementary Figure S12), corresponding to the marine clade A (Howard et al., 2008), and includes the functionally verified R. pomeroyi DSS-3 homolog. These data indicate that the Organic Lake sequences correspond to true DMSP demethylases and not the related glycine cleavage T proteins or other aminomethyltransferases (Howard et al., 2008).

DMSP cleavage appears to be a significant source of DMS in Organic Lake. DMSP likely originates from Bacillariophyta or Dinoflagellida as Organic Lake Dunaliella have been reported not to produce DMSP in culture (Franzmann et al., 1987b). Based on the abundance of marker genes, DMSP cleavage is predicted to occur at highest levels in the deep zone (Figure 4c) where the DMS concentration has been measured to be highest (Deprez et al., 1986; Franzmann et al., 1987b; Gibson et al., 1991; Roberts and Burton, 1993; Roberts et al., 1993). Consistent with DMS levels, the concentration of DMSP (Gibson et al., 1991) and DMSO (J Gibson, unpublished results) in the bottom zone of Organic Lake has also been found to be high, although DMSP levels reported in a separate study were not as high (Roberts et al., 1993), possibly reflecting technical difficulties measuring methylated sulfur compounds. In these studies (Gibson et al., 1991; Roberts et al., 1993), chlorophyll a levels were not measured.

DMS can also be produced in anoxic environments from the reduction of DMSO, degradation of sulfur-containing amino acids and sulfide methylation (Schäfer et al., 2010). Our data indicate that some DMSO reduction linked to Firmicutes could occur, but is not likely a major pathway because DMSO reductase genes were not abundant (Table 2, Figure 4c). The potential for the other DMS-yielding processes could not be determined because the enzymes involved in these pathways have not been established. When cultivated, Halomonas isolates from Organic Lake produced DMS from cysteine (Franzmann et al., 1987b), providing some evidence that DMS production from anaerobic degradation of amino acids can occur. Abiotic pathways for anaerobic production of DMS have also been proposed (Roberts et al., 1993).

The potential for DMSP cleavage was more than twice that of DMSP demethylation (Figure 4c). This is unusual compared with the marine environment or Ace Lake where DMSP demethylation potential is much higher than cleavage (Table 3). Previous estimates have similarly shown marine environments to have demethylation potential up to two orders of magnitude higher than cleavage (Howard et al., 2008; Todd et al., 2009; Reisch et al., 2011; Todd et al., 2011). The frequency of DMSP lyase genes dddD and dddL in Organic Lake exceeded those of all other environments examined, except Punta Cormorant hypersaline lagoon, where dddL abundance was comparable (Table 3). This suggests the selection in Organic Lake for DMSP cleavage due to functional advantage and/or selection for taxa that carry DMSP lyase genes. There is evidence that high DMSP cleavage potential is adaptive in hypersaline systems, as a high proportion of ddd genes were similarly detected in Punta Cormorant hypersaline lagoon and saltern ponds (Raina et al., 2010). The accumulated DMS in Organic Lake suggests that conditions in Organic Lake favor the relatively inefficient lysis pathway, where both sulfur and carbon are lost to the organism performing the DMSP lysis, over the more ‘thrifty’ demethylation pathway. This is particularly pertinent to the Roseobacter lineages that can also perform either process.

Conclusion

Through the use of shotgun metagenomics and size partitioning of samples, we discovered that the Organic Lake system is dominated by heterotrophic bacteria related to Psychroflexus, Marinobacter and Roseovarius, with primary production provided largely by chlorophyte algae related to Dunaliella. Genetic potential for oxidation of fixed carbon by heterotrophic bacteria occurs greatly in excess of the potential for carbon fixation, suggesting possible net carbon loss. However, by linking key metabolic processes to the dominant heterotrophic lineages we uncovered processes that were unusually abundant in Organic Lake that may serve to maximize exploitation of limited resources and minimize loss. Recalcitrant polymeric algal material and particulate matter are likely remineralized by Psychroflexus in the upper mixed zone and by Firmicutes in the deep zone to provide labile substrates for use by other heterotrophic bacteria. The generalist Marinobacter and Roseovarius lineages were associated with abundant genes involved in rhodopsin-mediated and AAnP photoheterotrophy, the latter of which was more abundant in Organic Lake than any other system surveyed in an analogous manner. Potential for chemolithoheterotrophy, sulfur oxidation and CO oxidation was also high, and along with photoheterotrophy, may provide a supplementary energy source if organic carbon becomes limiting.

In addition to being able to describe the functional capacities and potential importance of poorly understood microbial processes occurring in the lake (for example, photoheterotrophy by Alphaproteobacteria), we were able to answer targeted questions about the biology of the unusual lake sulfur chemistry. The low potential for dissimilatory sulfur cycling (both sulfur oxidation and DSR) and relatively stable waters of the deep zone, combined with the generation of DMS from DMSP, facilitate the accumulation of a high level of DMS in the lake. It appears Marinobacter and Roseovarius have a key role in DMS formation by cleaving DMSP generated by upper mixed zone eucaryal algae. The remarkable abundance of DMSP lyase genes suggests that DMSP is a significant carbon source in Organic Lake, and the cleavage pathway provides a selective advantage under the unique constraints of the Organic Lake environment.

In view of the minimal capacity for biological fixation of carbon and nitrogen, and yet organic richness, including high levels of DMS in Organic Lake, we evaluated what input the lake may have received throughout its relatively brief 3000 year history. The volume of the lake is small (6 × 104 m3), and exogenous input may occur from guano deposited in a small penguin rookery nearby the lake, through giant petrel or skua grazing and defecation, and/or by decaying animal carcasses such as elephant seals, which can weigh on the order of 1 ton and are present near the lake. Precipitation in Antarctica tends to be high in nitrate and relatively high levels have been reported in snow, ice, inland drainage systems and isolated water bodies (Vincent and Howard-Williams, 1994; Webster-Brown et al., 2010). It is also possible that during isolation from the ocean, the base of the water column in the marine basin that formed the lake may have acted as a sump for organic material. Phytoplankton blooms and benthic mats (for example, fixation of nitrogen by cyanobacteria and oxidation of nitrogen compounds) tend to make coastal marine basins very productive, and organic matter that sediments out of the surface waters will become trapped in the denser, more saline bottom layers (Bird et al., 1991). Retention of captured organic matter in the lake may also have been facilitated by Organic Lake having become highly saline quickly (Bird et al., 1991). Studies in the future that experimentally determine exogenous input and historical lake dynamics (for example, stable isotope and biomarker analyses of lake sediment), the role of benthic communities (for example, in reoxidation of reduced nitrogen compounds), and metaproteogenomic analyses of interannual community composition and function, will provide improved knowledge of the unusual biogeochemistry of Organic Lake and better enable predictions to be made about how the lake may be affected by ecosystem changes.