A shared core microbiome in soda lakes separated by large distances

Zorz, Jackie K.; Sharp, Christine; Kleiner, Manuel; Gordon, Paul M. K.; Pon, Richard T.; Dong, Xiaoli; Strous, Marc

doi:10.1038/s41467-019-12195-5

Download PDF

Article
Open access
Published: 17 September 2019

A shared core microbiome in soda lakes separated by large distances

Nature Communications volume 10, Article number: 4230 (2019) Cite this article

10k Accesses
76 Citations
31 Altmetric
Metrics details

Subjects

Abstract

In alkaline soda lakes, concentrated dissolved carbonates establish productive phototrophic microbial mats. Here we show how microbial phototrophs and autotrophs contribute to this exceptional productivity. Amplicon and shotgun DNA sequencing data of microbial mats from four Canadian soda lakes indicate the presence of > 2,000 species of Bacteria and Eukaryotes. We recover metagenome-assembled-genomes for a core microbiome of < 100 abundant bacteria, present in all four lakes. Most of these are related to microbes previously detected in sediments of Asian alkaline lakes, showing that common selection principles drive community assembly from a globally distributed reservoir of alkaliphile biodiversity. Detection of > 7,000 proteins show how phototrophic populations allocate resources to specific processes and occupy complementary niches. Carbon fixation proceeds by the Calvin-Benson-Bassham cycle, in Cyanobacteria, Gammaproteobacteria, and, surprisingly, Gemmatimonadetes. Our study provides insight into soda lake ecology, as well as a template to guide efforts to engineer biotechnology for carbon dioxide conversion.

Holocene life and microbiome profiling in ancient tropical Lake Chalco, Mexico

Article Open access 05 July 2021

Depth-discrete metagenomics reveals the roles of microbes in biogeochemical cycling in the tropical freshwater Lake Tanganyika

Article Open access 09 February 2021

Metagenomics datasets of water and sediments from eutrophication-impacted artificial lakes in South Africa

Article Open access 06 May 2024

Introduction

Soda lakes are among the most alkaline natural environments on earth, as well as among the most productive aquatic ecosystems known^1,2. The high productivity of soda lakes is due to a high bicarbonate concentration. Tens to hundreds of millimolars of bicarbonate are typically available for photosynthesis using carbon concentrating mechanisms^3,4, compared to generally <2 mM in the oceans⁵. This can lead to the formation of thick, macroscopic microbial mats with rich microbial biodiversity⁶. Because of the high pH, alkalinity, and high sodium salinity of these environments, the microorganisms that reside in soda lakes are considered extremophiles⁷. Using conditions of high pH and alkalinity is also a promising option to improve the cost-effectiveness of biotechnology for biological carbon dioxide capture and conversion^8,9,10.

Soda lakes have contributed to global primary productivity on a massive scale in Earth’s geological past¹¹. Currently, groups of much smaller soda lakes exist, for example, in the East African Rift Zone, rain-shadowed regions of California and Nevada, and the Kulunda steppe in South Russia¹². Many microorganisms have been isolated from these lakes. These include cyanobacteria^13,14,15, chemolithoautotrophic sulfide oxidizing bacteria^16,17,18, sulfate reducers^19,20, nitrifying^21,22, and denitrifying bacteria²³, as well as aerobic heterotrophic bacteria^24,25, methanotrophs²⁶, fermentative bacteria^27,28, and methanogens²⁹. Recently, almost one thousand metagenome-assembled-genome sequences (MAGs) were obtained from sediments of Kulunda soda lakes³⁰.

In the present study we investigate the microbial mat community structure of four alkaline soda lakes located on the Cariboo Plateau in British Columbia, Canada. This region has noteworthy geology and biology due to the diversity in lake brine compositions within a relatively small region³¹. There are several hundred shallow lakes on the Cariboo Plateau and these range in size, alkalinity, and salinity. Underlying basalt in some areas of the plateau, originating from volcanic activity during the Miocene and Pliocene eras, offers ideal conditions for forming soda lakes, as it provides little soluble calcium and magnesium^6,32,33. Some of these lakes harbor seasonal microbial mats that are either dominated by cyanobacteria or eukaryotic green algae. However, beyond this little is currently known about these systems in terms of microbiology.

We use a combination of shotgun metagenomes, and 16 S and 18 S rRNA amplicon sequencing to establish a microbial community structure for the microbial mats of four soda lakes. We perform proteomics to show how specific populations allocate resources to specific metabolic pathways, focusing on photosynthesis, and carbon, nitrogen, and sulfur cycles. Through the use of metagenomics and metaproteomics, this study provides a comprehensive molecular characterization of a phototrophic microbial mat microbiome. Specifically, we offer evidence in support of widespread phototrophy and niche differentiation among populations inhabiting these alkaline microbial mats, as well as the unexpected potential for mixotrophy in a member of the Gemmatimonadetes phylum. Also, by comparing metagenomic reads between the present study and a recent study from soda lake sediments 8000 km away in central Asia³⁰, we find the presence of a core soda lake microbiome with some strikingly similar populations, potentially the result of recent dispersal events.

Results and discussion

Soda lake geochemistry and community composition

The Cariboo Plateau contains hundreds of lakes of different size, alkalinity and salinity. Here we focused on four alkaline soda lakes (Fig. 1) that feature calcifying microbial mats with similarities to ancient stromatolites or thrombolites^6,34,35. Between 2014 and 2017, the total alkalinity in these lakes was between 0.20–0.65 mol L⁻¹ at pH 10.1–10.7 (Supplementary Table 1). Four years of amplicon sequencing data (16 S and 18 S rRNA) showed the microbial mats contain at least 1662 bacterial and 587 eukaryotic species-level operational taxonomic units (OTUs, clustered at 97% similarity) (Supplementary Data 1). The mat communities from different lakes were similar, but distinct, and relatively stable over time (Fig. 1). Probe, Deer and Goodenough Lakes harbored predominantly cyanobacterial mats, whereas the mats of more saline Last Chance Lake contained mainly phototrophic Eukaryotes. This was shown with proteomics (see below), because it was impossible to compare abundances of Eukaryotes and Bacteria using amplicon sequencing. Bacterial species associated with 340 OTUs were found in all four lakes. These species accounted for 20.5% of the region’s species richness and 84% of the total sequenced reads, suggesting that there is a common and abundant core microbiome shared among the alkaline lakes of the Cariboo Plateau. Despite the high proportion of eukaryotic biomass and phototrophs, the core alkaline lake, prokaryotic microbiome was still present in Last Chance Lake (although at lower relative abundances).

Metagenomes reveal similarity between distant lakes

After amplicon sequencing had outlined the core microbiome of the Cariboo soda lake microbial mats, shotgun metagenome sequencing, assembly, and binning were used to obtain the provisional whole-genome sequences, or metagenome-assembled-genomes (MAGs), of its key microbiota. We selected 91 representative, de-replicated MAGs for further analysis (Supplementary Data 2). Most of these MAGs were near-complete (>90% for 85 MAGs), and contained relatively few duplicated conserved single-copy genes (<5%, for 83 MAGs). For fifty-six MAGs, we independently assembled and binned 2–5 nearly identical (>95% average nucleotide identity) versions, indicating the presence of multiple closely related strains. 40–60% of quality-controlled reads were mapped to the 91 MAGs, showing that the associated bacteria accounted for approximately half of the DNA extracted. Most of the remaining reads were mapped to MAGs of lower quality and coverage, associated with a much larger group of less abundant bacteria. This was not surprising because amplicon sequencing had already indicated the presence of >2000 different bacterial and eukaryotic OTUs. Full length 16 S rRNA gene sequences (Supplementary Data 3) were reconstructed from shotgun metagenome reads. Fifty-seven of those could be associated with a MAG based on taxonomic classifications and abundance profiles. Perfect alignment of full length 16 S rDNA gene sequences to consensus OTU amplicon sequences showed that almost all these MAGs were core Cariboo microbiome members, present in each lake (Supplementary Table 2).

Figure 2 shows the taxonomic affiliation and average relative sequence abundances for the bacteria associated with the MAGs. For taxonomic classification we used the recently established GTDB taxonomy³⁶. We also used the GTDB toolkit to investigate the similarity of the Cariboo mat genomes to >800 MAGs recently obtained from sediments of the Central Asian soda lakes of the Kulunda Steppe³⁰. The distance between the two systems of alkaline lakes is approximately 8000 km. Yet, 56 of the Cariboo MAGs were clustered together with Kulunda MAGs and defined family or genus level diversity in the context of the GTDB database (release 86, >22,000 whole-genome sequences). This degree of similarity between geographically distant lake systems was surprising, especially because DNA was obtained from Kulunda sediments, not mats. It suggests that the core microbiome defined here for Cariboo lake mats, also applies to at least one other, well described system of soda lakes.

Interestingly, the genetic distance between the most similar MAGs from each of the two regions decreased with increasing abundance in Cariboo mats (Pearson correlation −0.49, p: 0.0003, n = 48, Fig. 2b, Supplementary Data 2), but not with abundance in Kulunda sediments. For example, the most abundant Cariboo cyanobacterium (C1—affiliated with Nodosilinea, relative abundance >7%) displayed 99% average nucleotide identity over 85% of its genome with Kulunda MAG GCA_003550805. The latter displayed <0.1% relative abundance in Kulunda sediments. Mapping of Kulunda sequencing reads directly to Cariboo genomes (Supplementary Data 2) did not provide any evidence for the presence of previously undetected bacteria/MAGs in Kulunda sediments that were more similar to Cariboo bacteria/MAGs than those presented by Vavourakis et al.³⁰.

These results suggest that when the Cariboo lakes formed ~10,000 years ago after the last ice age⁶, their microbiomes assembled from a much older, global reservoir of alkaliphile biodiversity. The striking relationship between Cariboo abundance and Kulunda-Cariboo relatedness might be explained by increased rates of successful dispersal/colonization for more abundant populations. Identification of vectors for dispersal still awaits future research, but bird migration is an obvious candidate. For example, the Northern Wheatear, which migrates between Northern Canada and Africa via Central Asia, could potentially link many known soda lakes worldwide. Abundance in sediments, located below mats, might not explain dispersal well, because sediments are less exposed to dispersal vectors than mats.

In any case, the genetic distances separating related bacteria were generally large, indicating that successful colonization by invading bacteria from a different lake system must be extremely rare. Possibly, only a single bacterium (MAG C1) traveled between and successfully colonized another lake system since the last ice age. A strong degree of isolation was also observed for other ecological islands, such as hot springs³⁷. Thus, the observed similarities of the microbiota between distant lake systems indicate shared outcomes of community assembly for microbial mat microbiomes in two distant soda lake environments. Future studies will indicate whether the core microbiota of Kulunda and Cariboo soda lakes has also assembled in other soda lakes.

Dispersal between Cariboo soda lakes, separated by at most 40 km, was very effective. For all 56 sets of 2–5 nearly identical MAG variants (average nucleotide identity >95%) we detected co-occurrence of all variants (Supplementary Data 4). This also showed that competitive exclusion was irrelevant, even for these nearly identical bacteria. Comparison of ratios of synonymous and non-synonymous mutations among the most rapidly evolving core genes—genes present in all genome variants, Supplementary Data 5—showed that diversifying selection acted on 775 genes, including many transporters and genes involved in cell envelope biogenesis. Accessory genes—not encoded on all variant genomes—and CRISPRs could display many more ecologically relevant differences, which could prevent competitive exclusion.

Proteomics reveals niche partitioning of cyanobacteria

The processes that dictate assembly of effective phototrophic microbial mat communities are well understood, with ecological adaptations and responses to dynamic light, oxygen, sulfide, pH, and carbon dioxide gradients³⁸. But, to what extent do these known rules of engagement also apply to alkaline soda lake microbial mats, where primary productivity has access to unlimited inorganic carbon², as was previously shown for Cariboo Soda lakes⁶? We performed environmental proteomics and connected protein expression to abundant MAGs to answer this question for the Cariboo Plateau soda lake mats (Supplementary Data 6).

Over 7000 expressed proteins were identified, with high confidence, in daytime mat samples from each of the lakes. For comparison, the most comprehensive environmental proteomes obtained so far have identified up to ~10,000 proteins³⁹. Given the high diversity and extremely complex nature of the mat samples, identification of 7217 proteins is an excellent starting point for ecophysiological interpretation. Approximately half of the expressed proteins could be attributed to the 91 MAGs, consistent with abundance estimates inferred from amplicon and shotgun data. This enabled us to investigate how the bacteria associated with the MAGs distributed their resources over different ecophysiological priorities⁴⁰. Given that a substantial amount of cellular energy goes towards manufacturing proteins, the relative proportion of a proteome dedicated to a particular function provides an estimate of how important that function is to the organism. Proteomic data were also used to estimate the ¹³C content of some abundant species, providing additional information on which carbon source they used and to what extent their growth was limited by carbon availability⁴¹. Brady et al. (2013) previously showed that microbial mat organic matter had δ¹³C values of −19 to −25‰, up to 27‰ depleted in ¹³C compared to bulk dissolved carbonates, consistent with non-CO₂-limited photosynthesis⁶. Overall protein δ¹³C values for the four lakes inferred from the proteomics data in the present study were between −19 and −25‰, in line with previous results for mat organic matter.

Consistent with their reputation as productive ecosystems with virtually unlimited access to inorganic carbon, the most abundant bacteria were large, mat-forming (filamentous) cyanobacteria, related to Nodosilinea and Phormidium. Pigment antenna proteins and photosynthetic reaction center proteins accounted for the largest fraction of detected proteins overall. The organism with the highest presence in the metaproteome was the cyanobacterial MAG C1, affiliated with Nodosilinea and accounting for up to 42% of mat metaproteomes. Remarkably, we were able to identify 1103 proteins from this MAG, 27% of its predicted proteome (Fig. 3). This level of detection is comparable to the proteomics results from other studies of pure cultures of cyanobacteria, such as Arthrospira, 21%, and Cyanothece, 47%^42,43. Nine cyanobacterial MAGs were assembled in total, and proteins from all nine were detected in the metaproteomes of all four lakes (Fig. 3, Supplementary Data 6). It is clear that the presence of so many cyanobacteria provides functional redundancy and contributes to functional robustness and resiliency^44,45. However, we also detected strong evidence for niche differentiation for those cyanobacteria with larger numbers of proteins detected, in particular MAG C1 (Nodosilinea), and MAG C5 (Phormidium A) (Fig. 4).

Phycobilisomes, the large, proteinaceous, light harvesting complexes of cyanobacteria, contain an assortment of pigments, which absorb at different wavelengths of light, and re-emit that light at longer wavelengths, around 680 nm, compatible with the reaction center of Photosystem II. Phycobilisome pigment composition varied among the cyanobacterial populations, leading to niche differentiation based on light quality, as was also observed in the marine environment⁴⁶. C1 and most other cyanobacterial populations expressed high amounts of phycocyanin, maximum absorbance 620 nm, and allophycocyanin, maximum absorbance 650 nm. In contrast, C5 uniquely expressed the pigment phycoerythrocyanin, with a maximum absorbance at 575 nm (Fig. 4). Phycoerythrocyanin would enable this population to absorb shorter wavelengths of light, in comparison to its cyanobacterial neighbours, and expands the spectral reach of photosynthesis for these mat communities, increasing productivity. The absence of expression of phycoerythrin, which has a maximum absorbance at 495 and 560 nm, is consistent with the light attenuation profile of aquatic environments with high dissolved organic matter, such as productive alkaline lakes, where wavelengths <500 nm are rapidly attenuated^47,48.

Shorter wavelength light (blue/green light) has higher energy, and high energy photons can damage photosynthetic machinery in cyanobacteria. If C5 would be exposed to these photons, as its pigment profile suggests, this could lead to more photodamage. Consistently, this population displayed higher expression of proteins like thioredoxin, for scavenging reactive oxygen species, and orange carotenoid protein for photoprotection (Fig. 4).

Inorganic carbon fixation and acquisition are central to realizing high primary productivity and the associated enzymes were highly expressed. The rate-limiting, Calvin-Benson-Bassham Cycle (CBB) enzyme RuBisCO accounted for ~1% of the expressed proteomes of cyanobacterial MAGs, a large fraction for a single enzyme (Fig. 4). In contrast, the expression of the carbon concentrating mechanism (CCM, needed for bicarbonate uptake) varied greatly among cyanobacteria. In C1 and C8, CCM proteins accounted for less than 0.2% of the proteomes. In C5, CCM proteins accounted for almost 3% of the expressed proteomes. C5 was the only population to express CCM proteins to a greater level than RuBisCO proteins, suggesting that this population might, to some extent, deplete bicarbonate in its micro-environment. Indeed, C5’s δ¹³C value was −20.6 ± 2.7‰, compared to −25.2 ± 0.8‰ for C1. A decrease in isotopic fractionation during photosynthesis is usually associated with CO₂ (or bicarbonate) limitation⁴⁹. We might conclude that C5’s access to higher energy radiation leads to a higher rate of photosynthesis, increased oxygen production, a higher need for protection against free radicals, a higher growth rate and a need for active import of bicarbonate. At a relative abundance of up to 2.3%, C5 was not the most abundant cyanobacterium, so if it had a higher growth rate, it must also have had a higher decay rate. This would make this organism an ecological R strategist, prioritizing cell growth over cell conservation. Because of the high dissolved bicarbonate concentration in these lakes (Supplementary Table 1), it is unlikely that bicarbonate was persistently limiting growth. It is more likely that limitation occurred occasionally, in thick mats or after dilution of dissolved bicarbonate after rain or snow melt.

Proteomics consistent with low nutrient concentrations

Nitrogen is a commonly limiting nutrient for primary production in soda lakes globally⁵⁰. The Cariboo Plateau lakes also display low or undetectable concentrations of ammonium and nitrate in lake waters (Supplementary Table 1). Consistently, no expression was detected for any proteins involved in nitrogen loss processes, such as nitrification or denitrification, or for assimilatory nitrate reductases or nitrate transporters.

Many bacteria, including the cyanobacteria C1, C5, and C8, expressed the key genes for the energetically expensive process of nitrogen fixation (Fig. 3, Supplementary Data 6). All cyanobacteria further expressed glutamine synthetase, for the assimilation of ammonia under nitrogen limiting conditions⁵¹, and the urea transporter. Dinitrogen, urea and, possibly, ammonia, were apparently the main nitrogen sources supporting photosynthesis. Parallel performance of nitrogen fixation by different bacteria provided functional redundancy, contributing to functional robustness and resiliency.

Phosphate can also be a limiting nutrient in soda lakes⁵⁰, and this appeared to be the case for Deer Lake in the present study, where phosphate was undetectable in lake waters (Supplementary Table 1). Cyanobacterium C8 (Gloeocapsa) was the most abundant population in Deer Lake (12.9% of Deer Lake metaproteome), and expressed a high-affinity phosphate transport system at higher levels (1.5% of C8 expressed proteome) than the other cyanobacteria. Phosphate potentially limited primary production in Deer Lake, as anoxygenic photoheterotrophs were 4–40× more abundant here than in the other lakes (Fig. 3, Supplementary Data 2 and 6).

Diversity of phototrophs in lake mats

The microbial mats of the Cariboo region display steep oxygen and sulfide gradients⁶, providing opportunities for photoheterotrophic bacteria that use any remaining light, which penetrates beyond the oxic layer created by cyanobacteria^38,52. Puf or Puh photosystem reaction center proteins were expressed by purple non-sulfur bacteria affiliated with Rhodobacteraceae, MAG A4, and Geminicoccales, MAG A7, as well as autotrophic purple sulfur bacteria, affiliated with Thiohalocapsa, MAG G8. Both photoheterotrophs were relatively abundant in phosphate-limited Deer Lake, at 3.2% and 2.8%, respectively. In addition to puhA, MAG A4 expressed all three subunits of carbon monoxide dehydrogenase (coxSML). Carbon monoxide could be produced by photooxidation of organic material⁵³, and could serve as an alternative energy source for these bacteria. Organic substrates supporting photoheterotrophic growth likely consist of cyanobacterial fermentation products, glycolate from photorespiration³⁸ or could originate from biomass decay. By re-assimilation of organic matter or re-fixation of bicarbonate using light energy, these organisms enhance the overall productivity of the mats.

Most unexpected among photoheterotrophs was population Ge1, a representative of an uncultured family within the recently defined phylum Gemmatimonadota. This particular population expressed the pufC subunit of the photosynthetic reaction center and contains the remaining photosystem genes in its genome (pufLMA, puhA, acsF). The ability for members of this phylum to use light energy was only recently discovered⁵⁴, and the capacity for phototrophy appears to be widespread among members of that phylum⁵⁵.

The Gemmatimonadetes bacterium isolated by Zheng and colleagues is heterotrophic, without evidence for a carbon fixation pathway. Interestingly, all genes required for a complete carbon-fixing CBB cycle are present in the genome of MAG Ge1. Genes homologous to the functional RuBisCO Form 1 C large subunit (rbcL), and RuBisCO small subunit (rbcS) were identified, as well as a copy of the CBB cycle-specific enzyme Phosphoribulokinase (prk). These genes were arranged sequentially in the genome: rbcS, rbcL, and prk, an arrangement that points at facultative autotrophy⁵⁶. Upon further investigation of the published MAGs from the Kulunda Steppe soda lakes in Central Asia, we found five additional Gemmatimonadetes MAGs (Fig. 5), that encoded these three CBB cycle genes with the same synteny, and with 88–98% amino acid identity, to the genes of Ge1. All identified rbcL genes are functional Form 1 C rbcL sequences (Fig. 5b). To our knowledge no other sequenced representatives from the Gemmatimonadetes phylum, apart from these six MAGs, contain the full suite of CBB cycle genes. Given the large number of amino acids (>90%) shared with homologuous genes encoded in Alphaproteobacteria (e.g., Rhizobiales bacterium YIM 77505 rbcL), it seems likely that the last common ancestor of these Gemmatimonadetes populations acquired the CBB genes via horizontal gene transfer from an Alphaproteobacterium, prior to the dispersal and speciation of the clade into the Kulunda Steppe and Cariboo Plateau populations. Although assembly and binning of genomes from metagenomic data sometimes lead to artefactual inference of a horizontal gene transfer event, detection of six sets of phylogenetically congruent genes in six different MAGs from two independent datasets, is unlikely to be artifact. We did not detect expression for these genes and were not able to estimate the δ¹³C value for this bacterium (too few high quality MS1 spectra) so it remains unknown to what extent this bacterium used bicarbonate as a carbon source.

Sulfur cycle in lake mats identified in proteomes

The presence of the autotrophic purple sulfur bacterium G8, affiliated with Thiohalocapsa, indicated active sulfur cycling within the mats, as expected based on the previous detection of sulfide within the mats⁶. Indeed, MAG D1, affiliated with Desulfonatronum^20,57 expressed aprAB, sat, and dsrAB, indicating that at least part of the sulfide was produced inside the mats. It also expressed an alcohol dehydrogenase, a formate dehydrogenase, and a hydrogenase, indicating that it oxidized compounds such as ethanol, formate, and hydrogen. These could be derived from dark fermentation by cyanobacteria or from decaying biomass. Sulfide produced by D1 was likely re-used by MAGs G8 and G4, the latter affiliated with Thioalkalivibrionaceae^18,58. G4 expressed soxX, soxC, dsrA, and fccB, suggesting sulfide oxidation through both the sox pathway and the reverse dsr pathway. Expression of sox and fcc was also detected for other unbinned populations, affiliated with Alphaproteobacteria, Chromatiales, and other Gammaproteobacteria.

In conclusion, we used metaproteomes and metagenomes to address fundamental questions on the microbial ecology of soda lake mats. We obtained 91 metagenome-assembled-genomes and showed that part of these taxa define a core microbiome, a group of abundant bacteria present in all samples over space (four lakes) and time (4 years). We showed that a very similar community assembled independently in Central Asian soda lakes. The similarity between some of the microbial genomes found in these soda lake regions, incredible in the light of their vast physical separation, suggests that vectors for dispersal are generally ineffective, but can sometimes distribute abundant community members at the global scale. We also showed both functional redundancy and existence of complemental niches among cyanobacteria, with evidence for K and R strategists living side by side. Cyanobacterium C1 was always most abundant but appeared to grow more slowly than C5, based on expression and isotopic signatures. C5 appeared to grow sufficiently fast to occasionally deplete bicarbonate in its surroundings, inconsistent with the prevailing paradigm of unlimited access to bicarbonate in alkaline soda lakes. The nature and origin of carbon sources for photoheterotrophs, including potentially mixotrophic Gemmatimonadetes is an exciting avenue for future research. The presented core microbiome provides a blueprint for design of a productive and robust microbial ecosystem that could guide effective biotechnology for carbon dioxide conversion.

Methods

Study site and sample collection

Samples from benthic microbial mats were collected from four lakes in the Cariboo Plateau region of British Columbia, Canada in May of 2014, 2015, 2016, and 2017. Microbial mats from Last Chance Lake, Probe Lake, Deer Lake, and Goodenough Lake were sampled (coordinates in Supplementary Table 1). Mats were homogenized, immediately frozen, transported on dry ice, and stored at −80 °C within 2 days of sampling. In 2015 and 2017, water samples for aqueous geochemistry were also taken and stored at −80 °C until analysis.

Aqueous geochemistry

Frozen lake water samples were thawed and filtered through a 0.45 µm nitrocellulose filter (Millipore Corporation, Burlington, MA) prior to analysis. Carbonate/bicarbonate (HCO₃⁻) alkalinity analysis was conducted using an Orion 960 Titrator (Thermo Fisher Scientific, Waltham, MA), and concentrations were calculated via double differentiation using EZ 960 software. Major cations (Ca²⁺, Mg²⁺, K⁺, and Na⁺) were analyzed using a Varian 725-ES Inductively Coupled Plasma Optical Emission Spectrophotometer (ICP-OES). Major anions (Cl⁻, NO₃⁻, PO₄³⁻, and SO₄²⁻) were analyzed using a Dionex ICS 2000 ion chromatograph (Dionex Corporation, Sunnyvale, CA), with an Ion Pac AS18 anion column (Dionex Corporation, Sunnyvale, CA).

Water for reduced nitrogen quantification was filtered through a 0.2 µm filter (Pall Life Sciences, Port Washington, NY). Concentrations were measured using the ortho-phthaldialdehyde fluorescence assay⁵⁹, with excitation at 410 nm, and emission at 470 nm.

Amplicon sequencing and data processing

DNA extraction and amplicon sequencing were performed, with primer sets TAReuk454FWD (565 f CCAGCASCYGCGGTAATTCC) and TAReukREV3 (964b ACTTTCGTTCTTGATYRA), targeting Eukaryota, and S-D440 Bact-0341-a-S-17 (b341, TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCTACGGGAGGCAGCAG), and S-D-Bact-0785-a-A-21 (805 R, GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGACTA CHVGGGTATCTAATCC) targeting Bacteria¹⁰. Sequencing was performed using the MiSeq Personal Sequencer (Illumina, San Diego, CA) using the 2 × 300 bp MiSeq Reagent Kit v3. The reads were processed with MetaAmp⁶⁰. After merging of paired-end reads (>100 bp overlap and <8 mismatches in the overlapping region), primer trimming and quality filtering (<2 mismatches in primer regions and at most 1 expected error), trimming to 350 bp, reads were clustered into operational taxonomic units (OTUs) of >97% sequence identity. Non-metric multidimensional scaling (NMDS) was performed in R, using the package vegan⁶¹. For NMDS, OTUs <1% abundant in all samples were excluded, as were those affiliated with Metazoa, because of large variations in rRNA copy and cell numbers.

Shotgun metagenome sequencing and data processing

Metagenomes of the 2015 mat samples were sequenced⁶². Briefly, DNA was sheared into fragments of ~300 bp using a S2 focused-ultrasonicator (Covaris, Woburn, MA). Libraries were created using the NEBNext Ultra DNA Library Prep Kit (New England Biolabs, Ipswich, MA) according to the manufacturer’s protocol, which included a size selection step with SPRIselect magnetic beads (Beckman Coulter, Indianapolis, IN) and PCR enrichment (eight cycles) with NEBNext Multiplex Oligos for Illumina (New England Biolabs, Ipswich, MA). DNA concentrations were estimated using qPCR and the Kapa Library Quant Kit (Kapa Biosystems, Wilmington, MA) for Illumina. 1.8 pM of DNA solution was sequenced on an Illumina NextSeq 500 sequencer (Illumina, San Diego, CA) using a 300 cycle (2 × 150 bp) high-output sequencing kit at the Center for Health Genomics and Informatics in the Cumming School of Medicine, University of Calgary. Raw, paired-end Illumina reads were filtered for quality⁶³. After that, the reads were coverage-normalized with BBnorm (sourceforge.net/projects/bbmap) with target = 100 min = 4. Overlapping reads were merged with BBMerge with default settings. All remaining reads were assembled separately for each library with MetaSpades version 3.10.0⁶⁴, with default parameters. Contigs of <500 bp were not further considered. tRNA, ribosomal RNA, CRISPR elements, and protein-coding genes were predicted and annotated using MetaErg (sourceforge.net/projects/metaerg/). Per-contig sequencing coverage was estimated and tabulated by read mapping with BBMap, with default settings and “jgi_summarize_bam_contig_depths”, provided with MetaBat⁶⁵. Each assembly was binned into metagenome-assembled-genomes (MAGs) with MetaBat with options “-a depth.txt –saveTNF saved_2500.tnf –saveDistance saved_2500.dist -v –superspecific -B 20–keep”. MAG contamination and completeness was estimated with CheckM⁶⁶. MAGs were classified with GTDBtk (version 0.2.2, database release 86)³⁶, together with MAGs previously obtained from Kulunda soda lakes³⁰. fastANI was used to compare MAGs across libraries/assemblies⁶⁷. Relative sequence abundances of MAGs were estimated as (MAG contig sequencing coverage) × (MAG genome size) / (total nucleotides sequenced). 16 S rRNA gene sequences were obtained with Phyloflash2⁶⁸ and were associated with MAGs based on phylogeny and sequencing coverage covariance across samples, and to OTUs based on sequence identity (Supplementary Table 2). Core genes of MAG variants were identified using blast and these genes were used to determine the abundances of variants across samples using BBMap, with parameters minratio = 0.9 maxindel = 3 bwr = 0.16 bw = 12 fast ambiguous = toss. To identify diversified core genes, variants were aligned with mafft⁶⁹ and only genes with >50 single nucleotide polymorphisms (SNPs), >1% of positions with a SNP, and with a fraction of non-synonymous SNPs of >0.825 were kept.

Protein extraction and metaproteomics

Protein was extracted and analyzed from 2014 mat samples⁶². Briefly, lysing matrix bead tubes A (MP Biomedicals) containing mat samples and SDT-lysis buffer (0.1 M DTT) in a 10:1 ratio were bead-beated in an OMNI Bead Ruptor 24 for 45 s at 6 m s⁻¹. Next, tubes were incubated at 95 °C for 10 min, spun down for 5 min at 21,000 × g and tryptic peptides were isolated from pellets by filter-aided sample preparation (FASP)⁷⁰. Peptides were separated on a 50 cm × 75 µm analytical EASY-Spray column using an EASY-nLC 1000 Liquid Chromatograph (Thermo Fisher Scientific, Waltham, MA) and eluting peptides were analyzed in a QExactive Plus hybrid quadrupole-Orbitrap mass spectrometer (Thermo Fisher Scientific). Each sample was run in technical quadruplicates, with one quadruplicate run for 260 min with 1 µg of peptide loaded, and the other three for 460 min each, with 2–4 µg of peptide loaded.

Expressed proteins were identified and quantified with Proteome Discoverer version 2.0.0.802 (Thermo Fisher Scientific), using the Sequest HT node. The Percolator Node⁷¹ and FidoCT were used to estimate false discovery rates (FDR) at the peptide and protein level respectively. Peptides and proteins with DFR >5% were discarded. Likewise, proteins without protein-unique-peptides, or <2 unique peptides were discarded. Relative protein abundances were estimated based on normalized spectral abundances⁷². Abundances of MAGs in the metaproteome were estimated by dividing the sum of the relative abundances for all of its expressed proteins by the sum of the relative protein abundances for all expressed proteins. The identification database was created using predicted protein sequences of binned and unbinned contigs, after filtering out highly similar proteins (>95% amino acid identity) with cd-hit⁷³, while preferentially keeping proteins from binned contigs. Sequences of common contaminating proteins were added to the final database (http://www.thegpm.org/crap/), which is available under identifier PXD011230 in ProteomeXchange. In total, 3,014,494 MS/MS spectra were acquired, yielding 298,187 peptide spectral matches, and 7217 identified proteins. Per population stable isotope fingerprints were estimated based on spectra obtained for all samples⁴¹.

Phylogenetic analysis

For the MAG phylogenetic tree (Fig. 5), a set of 16 ribosomal genes⁷⁴ plus the RNA polymerase genes rpoABC (TIGR02013, TIGR02027, TIGR02386) were identified and aligned as previously described⁷⁴. After removing poorly aligned regions with gblocks (⁷⁵, used with option “−b5 = h”), the alignments were concatenated (5053 positions total) and bootstrapped maximum likelihood phylogeny was estimated with RaxML, using model PROTGAMMALG, as described in ref. ⁷⁴. All Gemmatimonadota genomes present in GTDB were included as reference sequences. Supplementary Table 3 shows all reference sequences used as well as their geographical origin. The RuBisCO tree was made in the same manner as the MAG phylogenetic tree (636 positions), and RuBisCO reference sequences can be found in Supplementary Table 4. Expanded Gemmatimonadetes tree can be found in Supplementary Fig. 1 and the expanded RuBisCO tree can be found in Supplementary Fig. 2.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Amplicon sequences can be found under the Bioproject PRJNA377096. The 16 S rRNA sequence Biosamples are: SAMN06456834, SAMN06456843, SAMN06456852, SAMN06456861, SAMN09986741-SAMN09986751, and the 18 S rRNA sequence Biosamples are: SAMN09991649-SAMN09991660. The metagenome raw reads and metagenome-assembled-genomes can also be found under the Bioproject PRJNA377096. The Biosamples for the metagenome raw reads are SAMN10093821-SAMN10093824, and the Biosamples for the MAGs are SAMN10237340-SAMN10237430. The metaproteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository⁷⁶ with the dataset identifier PXD011230.

References

Melack, J. M. & Kilham, P. Photosynthetic rates of phytoplankton in East African alkaline, saline lakes. Limnol. Oceanogr. 19, 743–755 (1974).
Article ADS CAS Google Scholar
Talling, J. F., Wood, R. B., Prosser, M. V. & Baxter, R. M. The upper limit of photosynthetic productivity by phytoplankton: evidence from Ethiopian soda lakes. Freshw. Biol. 3, 53–76 (1973).
Article Google Scholar
Raven, J. A., Cockell, C. S. & De La Rocha, C. L. The evolution of inorganic carbon concentrating mechanisms in photosynthesis. Philos. Trans. R. Soc. Lond. B Biol. Sci. 363, 2641–2650 (2008).
Article CAS Google Scholar
Price, G. D., Badger, M. R., Woodger, F. J. & Long, B. M. Advances in understanding the cyanobacterial CO2-concentrating- mechanism (CCM): functional components, Ci transporters, diversity, genetic regulation and prospects for engineering into plants. J. Exp. Bot. 59, 1441–1461 (2008).
Article CAS Google Scholar
Fabry, V. & Seibel, B. Impacts of ocean acidification on marine fauna and ecosystem processes. ICES J. Mar. Sci. 65, 414–432 (2008).
Article CAS Google Scholar
Brady, A. L., Druschel, G., Leoni, L., Lim, D. S. S. & Slater, G. F. Isotopic biosignatures in carbonate-rich, cyanobacteria-dominated microbial mats of the Cariboo Plateau, B.C. Geobiology 11, 437–456 (2013).
Article CAS Google Scholar
Grant, W. D. Alkaliphiles: ecology, diversity and applications. Fems. Microbiol. Rev. 75, 255–269 (1990).
Article CAS Google Scholar
Canon-Rubio, K. A., Sharp, C. E., Bergerson, J., Strous, M. & De la Hoz Siegler, H. Use of highly alkaline conditions to improve cost-effectiveness of algal biotechnology. Appl. Microbiol. Biotechnol. 100, 1611–1622 (2016).
Article CAS Google Scholar
Daelman, M. R. J., Sorokin, D., Kruse, O., van Loosdrecht, M. C. M. & Strous, M. Haloalkaline bioconversions for methane production from microalgae grown on sunlight. Trends Biotechnol. 34, 450–457 (2016).
Article CAS Google Scholar
Sharp, C. E. et al. Robust, high-productivity phototrophic carbon capture at high pH and alkalinity using natural microbial communities. Biotechnol. Biofuels 10, 1–13 (2017).
Article Google Scholar
Tutolo, B. M. & Tosca, N. J. Experimental examination of the Mg-silicate-carbonate system at ambient temperature: Implications for alkaline chemical sedimentation and lacustrine carbonate formation. Geochim. Cosmochim. Acta 225, 80–101 (2018).
Article ADS CAS Google Scholar
Grant, W. D. & Sorokin, D. Y. in Extremophiles Handbook, 27–54 (Springer, Tokyo, 2011).
Dadheech, P. K., Mahmoud, H., Kotut, K. & Krienitz, L. Haloleptolyngbya alcalis gen. et sp. nov., a new filamentous cyanobacterium from the soda lake Nakuru, Kenya. Hydrobiologia 691, 269–283 (2012).
Article CAS Google Scholar
Duckworth, A. W., Grant, S., Grant, W. D., Jones, B. E. & Meijer, D. Dietzia natronolimnaios sp. nov., a new member of the genus Dietzia isolated from an East African soda lake. Extremophiles 2, 359–366 (1998).
Article CAS Google Scholar
Florenzano, G., Sili, C., Pelosi, E. & Vincenzini, M. Cyanospira rippkae and Cyanospira capsulata (gen. nov. and spp. nov.): new filamentous heterocystous cyanobacteria from Magadi lake (Kenya). Arch. Microbiol. 140, 301–306 (1985).
Article Google Scholar
Sorokin, D. Y., Banciu, H., Van Loosdrecht, M. & Kuenen, J. G. Growth physiology and competitive interaction of obligately chemolithoautotrophic, haloalkaliphilic, sulfur-oxidizing bacteria from soda lakes. Extremophiles 7, 195–203 (2003).
Article Google Scholar
Sorokin, D. Y. et al. Thioalkalimicrobium aerophilum gen. nov., sp. nov. and Thioalkalimicrobium sibericum sp. nov., and Thioalkalivibrio versutus gen. nov., sp. nov., Thioalkalivibrio nitratis sp.nov., novel and Thioalkalivibrio denitrificancs sp. nov., novel obligately alkaliphilic and obligately chemolithoautotrophic sulfur-oxidizing bacteria from soda lakes. Int. J. Syst. Evol. Microbiol. 51, 565–580 (2001).
Article CAS Google Scholar
Sorokin, D. Y. & Kuenen, J. G. Haloalkaliphilic sulfur-oxidizing bacteria in soda lakes. FEMS Microbiol. Rev. 29, 685–702 (2005).
Article CAS Google Scholar
Foti, M. et al. Diversity, activity, and abundance of sulfate-reducing bacteria in saline and hypersaline soda lakes. Appl. Environ. Microbiol. 73, 2093–2100 (2007).
Article CAS Google Scholar
Pikuta, E. V. et al. Desulfonatronum thiodismutans sp. nov., a novel alkaliphilic, sulfate-reducing bacterium capable of lithoautotrophic growth. Int. J. Syst. Evol. Microbiol. 53, 1327–1332 (2003).
Article CAS Google Scholar
Sorokin, D. et al. Isolation and properties of obligately chemolithoautotrophic and extremely alkali-tolerant ammonia-oxidizing bacteria from Mongolian soda lakes. Arch. Microbiol. 176, 170–177 (2001).
Article CAS Google Scholar
Sorokin, D. Y. et al. Nitrolancea hollandica gen. nov., sp. nov., a chemolithoautotrophic nitrite-oxidizing bacterium isolated from a bioreactor belonging to the phylum Chloroflexi. Int. J. Sys. Evol. Microbiol. 64, 1859–1865 (2014).
Article CAS Google Scholar
Shapovalova, A. A., Khijniak, T. V., Tourova, T. P., Muyzer, G. & Sorokin, D. Y. Heterotrophic denitrification at extremely high salt and pH by haloalkaliphilic Gammaproteobacteria from hypersaline soda lakes. Extremophiles 12, 619–625 (2008).
Article CAS Google Scholar
Sorokin, D. Y., Van Pelt, S., Tourova, T. P. & Evtushenko, L. I. Nitriliruptor alkaliphilus gen. nov., sp. nov., a deep-lineage haloalkaliphilic actinobacterium from soda lakes capable of growth on aliphatic nitriles, and proposal of Nitriliruptoraceae fam. nov. and Nitriliruptorales ord. nov. Int. J. Sys. Evol. Microbiol. 59, 248–253 (2009).
Article CAS Google Scholar
Sorokin, D. Y., Muntyan, M. S., Toshchakov, S. V., Korzhenkov, A. & Kublanov, I. V. Phenotypic and genomic properties of a novel deep-lineage haloalkaliphilic member of the phylum Balneolaeota from soda lakes possessing Na+-translocating proteorhodopsin. Front. Microbiol. 9, 2672 (2018).
Article Google Scholar
Lin, J.-L. et al. Molecular diversity of methanotrophs in Transbaikal soda lake sediments and identification of potentially active populations by stable isotope probing. Environ. Microbiol. 6, 1049–1060 (2004).
Article CAS Google Scholar
Kevbrin, V. V., Zhilina, T. N., Rainey, F. A. & Zavarzin, G. A. Tindallia magadii gen. nov., sp. nov.: An alkaliphilic anaerobic ammonifier from soda lake deposits. Curr. Microbiol. 37, 94–100 (1998).
Article CAS Google Scholar
Sorokin, D. Y. et al. Syntrophic associations from hypersaline soda lakes converting organic acids and alcohols to methane at extremely haloalkaline conditions. Environ. Microbiol. 18, 3189–3202 (2016).
Article CAS Google Scholar
Sorokin, D. Y. et al. Methanogenesis at extremely haloalkaline conditions in the soda lakes of Kulunda Steppe (Altai, Russia). Fems. Microbiol. Ecol. 91, 1–11 (2015).
Article Google Scholar
Vavourakis, C. D. et al. A metagenomics roadmap to the uncultured genome diversity in hypersaline soda lake sediments. Microbiome 6, 1–18 (2018).
Article Google Scholar
Hammer, U. T. Saline Lake Ecosystems of the World. (Springer, The Netherlands, 1986).
Renaut, R. W. Recent carbonate sedimentation and brine evolution in the saline lake basins of the Cariboo Plateau, British Columbia, Canada. Hydrobiologia 197, 67–81 (1990).
Article CAS Google Scholar
Renaut, R. W. & Long, P. R. Sedimentology of the saline lakes of the Cariboo Plateau, Interior British Columbia, Canada. Sediment. Geol. 64, 239–264 (1989).
Article ADS CAS Google Scholar
Wilson, S. E., Cumming, B. F. & Smol, J. P. Diatom-salinity relationships in 111 lakes from the Interior Plateau of British Columbia, Canada: the development of diatom-based models for paleosalinity reconstructions. J. Paleolimnol. 12, 197–221 (1994).
Article ADS Google Scholar
Bos, D., Cumming, B. F., Watters, C. E. & Smol, J. P. The relationship between zooplankton, conductivity and lake-water ionic composition in 111 lakes from the Interior Plateau of British Columbia, Canada. Int. J. Salt. Lake. Res. 5, 1–15 (1996).
Article Google Scholar
Parks, D. H. et al. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat. Biotechnol. 36, 996–1004 (2018).
Article CAS Google Scholar
Valverde, A., Tuffin, M. & Cowan, D. Biogeography of bacterial communities in hot springs: a focus on the actinobacteria. Extremophiles 6, 669–679 (2012).
Article Google Scholar
Stal, L. Physiological ecology of cyanobacteria in microbial mats and other communities. N. Phytol. 131, 1–32 (1995).
Article CAS Google Scholar
Hinzke, T., Kouris, A., Hughes, R. A., Strous, M. & Kleiner, M. More is not always better: evaluation of 1D and 2D-LC-MS/MS methods for metaproteomics. Front. Microbiol. 10, 1–13 (2019).
Article Google Scholar
Bagnoud, A. et al. Reconstructing a hydrogen-driven microbial metabolic network in Opalinus Clay rock. Nat. Commun. 7, 1–10 (2016).
Article Google Scholar
Kleiner, M. et al. A metaproteomics method to determine carbon sources and assimilation pathways of species in microbial communities. Proc. Natl Acad. Sci. USA 115, E5576–E5584 (2018).
Article CAS Google Scholar
Aryal, U. K. et al. Dynamic proteomic profiling of a unicellular cyanobacterium Cyanothece ATCC51142 across light-dark diurnal cycles. Bmc. Syst. Biol. 5, 194 (2011).
Article CAS Google Scholar
Matallana-Surget, S. et al. Proteome-Wide analysis and diel proteomic profiling of the cyanobacterium Arthrospira platensis PCC 8005. PLoS ONE. 9, e99076 (2014).
Article ADS Google Scholar
Allison, S. D. & Martiny, J. B. H. Resistance, resilience, and redundancy in microbial communities. Proc. Natl Acad. Sci. USA 105, 11512–11519 (2008).
Article ADS CAS Google Scholar
Shade, A. et al. Fundamentals of microbial community resistance and resilience. Front. Microbiol. 3, 417 (2012).
Article Google Scholar
Ting, C. S., Rocap, G., King, J. & Chisholm, S. W. Cyanobacterial photosynthesis in the oceans: the origins and significance of divergent light-harvesting strategies. Trends Microbiol. 10, 134–142 (2002).
Article CAS Google Scholar
Croce, R. & Van Amerongen, H. Natural strategies for photosynthetic light harvesting. Nat. Chem. Biol. 10, 492–501 (2014).
Article CAS Google Scholar
Markager, S. & Vincent, W. Of UV and blue light in natural spectral light attenuation and the absorption waters. Limnol. Oceanogr. 45, 642–650 (2000).
Article ADS CAS Google Scholar
Pearson, A. in Handbook of Hydrocarbon and Lipid Microbiology. (Springer, Berlin, 2010).
Melack, J. M., Kilham, P. & Fisher, T. R. Responses of phytoplankton to experimental fertilization with ammonium and phosphate in an African soda lake. Oecologia 52, 321–326 (1982).
Article ADS Google Scholar
Harper, C. J., Hayward, D., Kidd, M., Wiid, I. & van Helden, P. Glutamate dehydrogenase and glutamine synthetase are regulated in response to nitrogen availability in Myocbacterium smegmatis. BMC Microbiol. 10, 138 (2010).
Article Google Scholar
Li, T. et al. Microscale profiling of photosynthesis-related variables in a highly productive biofilm photobioreactor. Biotechnol. Bioeng. 113, 1046–1055 (2016).
Article CAS Google Scholar
Wilson, D. F., Swinnerton, J. W. & Lamontagne, R. A. The ocean: a natural source of carbon monoxide. Science 167, 984–986 (1970).
Article ADS Google Scholar
Zeng, Y., Feng, F., Medova, H., Dean, J. & Koblížek, M. Functional type 2 photosynthetic reaction centers found in the rare bacterial phylum Gemmatimonadetes. Proc. Natl Acad. Sci. USA 111, 7795–7800 (2014).
Article ADS CAS Google Scholar
Zeng, Y. et al. Metagenomic evidence for the presence of phototrophic Gemmatimonadetes bacteria in diverse environments. Environ. Microbiol. Rep. 8, 139–149 (2016).
Article CAS Google Scholar
Scott, K. M. et al. The genome of deep-sea vent chemolithoautotroph Thiomicrospira crunogena XCL-2. PLoS Biol. 4, e383 (2006).
Article Google Scholar
Sorokin, D. Y., Kuenen, J. G. & Muyzer, G. The microbial sulfur cycle at extremely haloalkaline conditions of soda lakes. Front. Microbiol 2, 44 (2011).
Article CAS Google Scholar
Ahn, A. C. et al. Genomic diversity within the haloalkaliphilic genus Thioalkalivibrio. PLoS ONE. 12, 1–23 (2017).
Holmes, R. M., Aminot, A., Kérouel, R., Hooker, B. A. & Peterson, B. J. A simple and precise method for measuring ammonium in marine and freshwater ecosystems. Can. J. Fish. Aquat. Sci. 56, 1801–1808 (1999).
Article CAS Google Scholar
Dong, X. et al. Fast and simple analysis of MiSeq amplicon sequencing data with MetaAmp. Front. Microbiol. 8, 1461 (2017).
Article Google Scholar
Dixon, P. VEGAN, a package of R functions for community ecology. J. Veg. Sci. 14, 927–930 (2003).
Article Google Scholar
Kleiner, M. et al. Assessing species biomass contributions in microbial communities via metaproteomics. Nat. Commun. 8, 1558 (2017).
Article ADS Google Scholar
Saidi-Mehrabad, A. et al. Methanotrophic bacteria in oilsands tailings ponds of northern Alberta. ISME J. 7, 908–921 (2013).
Article CAS Google Scholar
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. A. MetaSPAdes: a new versatile metagenomic assembler. Genome Res. 27, 824–834 (2017).
Article CAS Google Scholar
Kang, D. D., Froula, J., Egan, R. & Wang, Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 3, e1165 (2015).
Article Google Scholar
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Article CAS Google Scholar
Jain, C., Rodriguez-R, L. M., Phillippy, A. M., Konstantinidis, K. T. & Aluru, S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat. Commun. 9, 5114 (2018).
Article ADS Google Scholar
Gruber-Vodicka, H. R., Seah, B. K. B. & Pruesse, E. phyloFlash—Rapid SSU rRNA profiling and targeted assembly from metagenomes. bioRxiv. 521922 (2019). https://www.biorxiv.org/content/10.1101/521922v1
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS Google Scholar
Wisniewski, J. R., Zougman, A., Nagaraj, N. & Mann, M. Universal sample preparation method for proteome analysis. Nat. Methods 6, 359–362 (2009).
Article CAS Google Scholar
Spivak, M., Weston, J., Bottou, L., Käll, L. & Stafford, W. Improvements to the Percolator Algorithm for peptide identification from shotgun proteomics data sets. J. Proteome Res. 8, 3737–3745 (2009).
Article CAS Google Scholar
Zybailov, B. et al. Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae. J. Proteome Res. 5, 2339–2347 (2006).
Article CAS Google Scholar
Li, W. & Godzik, A. Cd-Hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
Article CAS Google Scholar
Hug, L. A. et al. A new view of the tree of life. Nat. Microbiol 1, 16048 (2016).
Article CAS Google Scholar
Talavera, G. & Castresana, J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst. Biol. 56, 564–577 (2007).
Article CAS Google Scholar
Vizcaíno, J. A. et al. 2016 Update of the PRIDE database and its related tools. Nucleic Acids Res. 44, D447–D456 (2016).
Article Google Scholar

Download references

Acknowledgements

We thank the University of Calgary’s Center for Health Genomics and Informatics for sequencing and informatics services. We thank Michael Nightingale and Agasteswar Vadlamani for help with analysis of aqueous geochemistry. We also thank Timber Gillis, Hayley Todesco, Harsimrit Lakhyan, Zachary Urquhart, Oliver Horanszky, Virginia Hermanson, Christopher Chow, Peter Zhao, Tong Wang, and Sydney Urschel for help with sample collection and DNA extractions. We would like to thank Dan Liu and Angela Kouris for help with metaproteomics sample preparation and analysis. We thank Carmen Li for help with MiSeq sequencing, and Maryam Ataeian for help with metagenome analysis. This study was supported by the Natural Sciences and Engineering Research Council (NSERC), Canada Foundation for Innovation (CFI), Canada First Research Excellence Fund (CFREF), Genome Canada, Western Economic Diversification, the International Microbiome Center (Calgary), Alberta Innovates, the Government of Alberta, and the University of Calgary.

Author information

Authors and Affiliations

Department of Geoscience, University of Calgary, Calgary, AB, T2N 1N4, Canada
Jackie K. Zorz, Christine Sharp, Xiaoli Dong & Marc Strous
Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, 27695, USA
Manuel Kleiner
Centre for Health Genomics and Informatics, University of Calgary, Calgary, AB, T2N 2T9, Canada
Paul M. K. Gordon & Richard T. Pon

Authors

Jackie K. Zorz
View author publications
You can also search for this author in PubMed Google Scholar
Christine Sharp
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Kleiner
View author publications
You can also search for this author in PubMed Google Scholar
Paul M. K. Gordon
View author publications
You can also search for this author in PubMed Google Scholar
Richard T. Pon
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoli Dong
View author publications
You can also search for this author in PubMed Google Scholar
Marc Strous
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Z. collected samples, analyzed data, made figures, and wrote manuscript. C.S. conceived study, collected samples, extracted DNA, and prepared libraries for sequencing. M.K. extracted protein, performed metaproteomics, and analyzed data. P.G. and R.P. performed metagenomics sequencing. X.D. analyzed data, wrote manuscript, and developed pipelines used in metagenomics data analysis. M.S. conceived study, analyzed data, made figures, and wrote manuscript. All authors provided feedback to the manuscript.

Corresponding author

Correspondence to Jackie K. Zorz.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Brett Baker, Trinity Hamilton and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zorz, J.K., Sharp, C., Kleiner, M. et al. A shared core microbiome in soda lakes separated by large distances. Nat Commun 10, 4230 (2019). https://doi.org/10.1038/s41467-019-12195-5

Download citation

Received: 01 May 2019
Accepted: 16 August 2019
Published: 17 September 2019
DOI: https://doi.org/10.1038/s41467-019-12195-5

This article is cited by

Globally distributed marine Gemmatimonadota have unique genomic potentials
- Xianzhe Gong
- Le Xu
- Brett J. Baker
Microbiome (2024)
Lake microbiome composition determines community adaptability to warming perturbations
- Xiaotong Wu
- Qixing Zhou
- Xiangang Hu
Ecological Processes (2024)
Biogeochemical explanations for the world’s most phosphate-rich lake, an origin-of-life analog
- Sebastian Haas
- Kimberly Poppy Sinclair
- David C. Catling
Communications Earth & Environment (2024)
Revealing the hierarchical structure of microbial communities
- Beatrice Ruth
- Stephan Peter
- Peter Dittrich
Scientific Reports (2024)
Large scale exploration reveals rare taxa crucially shape microbial assembly in alkaline lake sediments
- Zhiguang Qiu
- Shuhang He
- Ke Yu
npj Biofilms and Microbiomes (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.