Sabkhas are hypersaline, mineral-rich, supratidal mudflats that harbor microbes that are adapted to high salt concentration. Sabkha microbial diversity is generally studied for their community composition, but less is known about their genetic structure and heterogeneity. In this study, we analyzed a coastal sabkha for its microbial composition using 16S rDNA and whole metagenome, as well as for its population genetic structure. Our 16S rDNA analysis show high alpha diversity in both inner and edge sabkha than outer sabkha. Beta diversity result showed similar kind of microbial composition between inner and edge sabkha, while outer sabkha samples show different microbial composition. At phylum level, Bacteroidetes (~ 22 to 34%), Euryarchaeota (~ 18 to ~ 30%), unclassified bacteria (~ 24 to ~ 35%), Actinobacteria (~ 0.01 to ~ 11%) and Cyanobacteria (less than 1%) are predominantly found in both inside and edge sabkha regions, whereas Proteobacteria (~ 92 to ~ 97%) and Parcubacteria (~ 1 to ~ 2%) are predominately found in outer sabkha. Our 225 metagenomes assembly from this study showed similar bacterial community profile as observed in 16S rDNA-based analysis. From the assembled genomes, we found important genes that are involved in biogeochemical cycles and secondary metabolite biosynthesis. We observed a dynamic, thriving ecosystem that engages in metabolic activity that shapes biogeochemical structure via carbon fixation, nitrogen, and sulfur cycling. Our results show varying degrees of horizontal gene transfers (HGT) and homologous recombination, which correlates with the observed high diversity for these populations. Moreover, our pairwise population differentiation (Fst) for the abundance of species across the salinity gradient of sabkhas identified genes with strong allelic differentiation, lower diversity and elevated nonsynonymous to synonymous ratio of variants, which suggest selective sweeps for those gene variants. We conclude that the process of HGT, combined with recombination and gene specific selection, constitute the driver of genetic variation in bacterial population along a salinity gradient in the unique sabkha ecosystem.
Sabkhas (Arabic) are salt flats that dominate the terrestrial landscape in arid tropical and subtropical regions1. They are distributed across North Africa, the Gulf States, Mexico, the USA, and Australia, and the United Arab Emirates (UAE) contains the highest percentage of sabkha area per country2. Sabkhas are of two types: coastal and inland sabkhas (i.e., continental sabkha). Coastal sabkhas are generally found along coastline areas of arid regions, formed by seawater evaporation and contain siliciclastics or carbonate sediments, whereas inland sabkhas are usually flanked by towering dunes3. Coastal sabkha in the Gulf coast of the Arabian peninsula are Pleistocene dunes that has developed over the past 7000 years through wind erosion of pre-existing dunes and progradation of inter- and supratidal carbon sediments4.
Sabkha soil is near neutral in pH and has soluble and non-soluble salt deposits5,6, that includes high concentrations of Na, Mg, Ca, K, SO4, HCO3, and especially Cl−7; A salinity gradient is marked from the internal areas of sabkhas to their outer edges. The high concentrations of Cl− are remnants of past hydrological activity and mirror Cl− deposits found in the analogous salt flat sites of Mars8,9,10. In addition, ionic composition of soils, mineralogical, and structural, sabkhas are considered excellent models for where petroleum is formed11,12.
Despite the extreme environmental conditions within sabkhas, they provide a habitat for various microbial life forms13. Microbial communities interactions of coastal sabkhas are expected to play biogeochemical roles in the maintenance of this ecosystem14. Earlier studies using 16S, 18S rDNA sequencing to profile microorganisms in sabkhas (culturable and unculturable) have been conducted in Saudi Arabia and other Middle Eastern countries1,5,6,13,15,16. Although, these studies are useful at elucidating microbial communities and their ecological role, they were not sufficient to examine genetic variation within sabkha microbial populations17. Since most of these identified sabkha microorganisms from earlier studies lack genomic information, there is a need for whole genome metagenomic analysis to further understand the genomic structure of the sabkha ecosystem18,19,20,21. Sequence variation within assembled metagenomes will help elucidate the evolutionary forces operating in the natural populations of sabkhas.
In this study, we examined the abundance of microbial communities in coastal sabkha and sabkha-shore regions in Abu Dhabi, UAE using 16S rDNA, and whole-genome metagenome analysis to elucidate the genetic heterogeneity of microbial species. Based on the 16S rDNA based prokaryotic microbial profile, we identified unusual coastal sabkha-specific microbial communities, and it is consistent with the whole genome metagenome analysis. Out of 225 assembled microbial metagenomes, we analyzed 82 of the most abundant assembled genomes at the order and class taxonomic levels. We observed higher diversity for some microbial populations on the inner regions of the sabkha as well as the outside it. However, the overall population diversity was higher within the sabkha. Our results show genetic structure over local spatial scales, different level of homologous recombination for different species as well as gene-specific selective sweeps. The results enhance the understanding of the ecological roles, salt stress tolerance mechanisms, and potential applications of sabkha microbial genes.
Sabkha soil sample collection, DNA isolation and sequencing
In this study, soil samples were collected from the Abu Dhabi sabkha (24°04′01.1″N 53°22′28.5″E, Al Mirfa region), UAE, in February 2018 (18/02/2018) (Fig. 1A). We sampled three sites—one in the inner sabkha area (I site; S1), one at the edge of the sabkha (E site; S2) and one outside sabkha (O site; S3). All samples consists of duplicates. Each sample consisted of vertical three layers (upper, middle, and lower layer) as indicated in Fig. 1B. In total, 18 samples were collected from the sabkha for DNA isolation (16S rDNA analysis) as well as soil analysis.
From the collected soil samples, DNA isolation was carried out using Qiagen (Hilden, DE) Power Soil kit as per the manufacturer’s instruction. Isolated DNA quality was confirmed using agarose gel electrophoresis, and DNA concentration was measured with NanoDrop spectrophotometer (ThermoFisher Scientific, Waltham, MA) and Qubit 2.0 fluorometer (ThermoFisher Scientific). In addition, DNA samples were amplified using universal V3–V4 primers (16S rDNA primer 341F: CCTACGGGNGGCWGCAG and 805R: GACTACHVGGGTATCTAATCC), and sequencing libraries were prepared for the PCR amplicons and sequenced using Illumina MiSeq platform (300 bp PE chemistry).
16S rDNA-based (V3–V4 region) prokaryotic microbial profiling
Quality of the 16S rDNA raw Illumina data was confirmed using the FastQC tool22. The sequencing adapter and low quality regions found in the reads were removed using Scythe v0.923 and Sickle v1.3324 programs. Trimmed reads were further analyzed using DADA2 metagenomic analysis pipeline25. During DADA2 analysis, filterAndTrim (truncLen = c (250, 230), maxN = 0, maxEE = c (2,2), truncQ = 2) function was used for the PE reads length trimming. During trimming, forward and reverse reads were trimmed to 250 and 230 bps, respectively. Reads with lengths less than 250 bps (forward reads) and length less than 230 bps (reverse reads) were removed from the analysis, and the mergePairs function was used to merge forward and reverse reads. Function assignTaxonomy was used to map the merged reads against the 16S RNA database (silva_nr_v138)26, and biom file was created. The biom file with metadata file was used in a web-based tool MicrobiomeAnalyst27. Alpha diversity was generated with the Chao1, and Shannon diversity index and beta diversities was generated using the Bray–Curtis dissimilarity distance distribution. ANOVA test for alpha diversity across sampled layers was used to test for significance between inside and outside the sabkha, while PERMANOVA test was used for beta diversity analysis. Taxonomic identity was generated as a heatmap of abundance at the phylum and order level inside and at the edge of the sabkha, and outside, as well as at different depths between the upper (upper and lower), middle (upper and lower) and lower (upper and lower).
Sabkha soil property and mineral analysis
One hundred grams of sabkha soil from the collected sample was dissolved by adding 200 ml of distilled water and stirred for an hour, filtered and salinity, pH, TDS and conductivity was measured using a multiparameter Hack instrument (USA). For mineral analysis, sabkha soil samples were dried, and the minerals present in the samples were measured using XRD PANalytical and XRF Epsilon instruments.
Metagenome DNA isolation, sequencing and analysis
For whole genome metagenome analysis, 8 different sabkha sites were sampled (s1, s3, s8, s12 for inner) and (s13, s14, s17, s18 for outer) (Fig. 1B). Each site is a pool of 3 replicate samples. From the collected soil samples, DNA isolation was carried out using Qiagen (Hilden, DE) power soil kit as per manufacturer’s instruction. Eight shotgun whole genome metagenome libraries were generated using NEBNext® UltraTM DNA Library Prep Kit (New England Bioloabs, Ipswich, Massachusetts, United States) according to the prescribed protocol. Each of the 8 sampled sites were spaced equally with half of meter from inside and from outside (Fig. 1B). Furthermore, libraries were sequenced with Illumina NovaSeq 6000 (150-bp paired end reads).
Metagenome read quality and trimming were performed in FastQC and Trimmomatic28. Metagenome analysis was carried out using the MetaWRAP v1.3.2 pipeline29. Metagenome assembly was performed using Meta-SPAdes v3.13 assembler30. The initial read and contig level species diversity was estimated using kraken2 module implemented in MetaWRAP pipeline (-t 90 -s 50,000,000). Furthermore, we used binning, bin_refinement, quant_bins and reassemble bins modules to separate the contigs into separate bacterial contig bins (minimum completeness > 50% and contamination level < 10%). The gene annotation of the assembled genome was performed using Prokka v1.14.531. Biosynthetic gene clusters as well as transporters found in the assembled bacterial genomes were identified using antiSMASH v5.1.1 tool32. For downstream population genetic analysis of whole metagenome, bin assemblies were dereplicated using dRep v3.2.233 and clustering threshold of –sa 0.97 was used and then were filtered with ChekM34 to have a dereplicated species level of 97% Average Nucleotide Identity (ANI) for read mapping with 74% completeness, less than 10% contamination and a minimum of 1% abundance for each pooled triplicate sample/per site on inner and outer sabkha. The final set of contigs used in the downstream analysis is reliably and independently assembled for a species in multiple samples/sites. We looked at the individual microbiome contribution to community functions, using a replication index iRep v1.1.1435. The estimate was performed on the realigned reads to the assembled meta-genomes bins. This method uses an algorithm that assumes a bi-directional replication from a single origin accounting for coverage changes in fragmented genomes. For instance an iRep of 2 means that the population of cells are all actively replicating, whereas an iRep of 1.25 implies that a quarter of them are active35. Viral and fungal community profile of the sabkha was identified using MiCoP tool36. MetaPhlAn v3.0 pipeline37 was used for taxonomic profiling of the meta-genome shot gun reads with and without inclusion of viral community. Pavian38 was used to visualize the abundance and diversity at various taxonomic levels.
Genes involved in metabolic pathways and biogeochemical cycle were identified from the assembled metagenome using METABOLIC39 pipeline. The characterization of markers was identified using KEGG40 and curated custom protein HMM databases with motif confirmation using validated conserved residues in proteins. Diagrams representing metabolic pathways are generated for individual genomes as well as summary of the biogeochemical cycling processes.
Horizontally transferred gene regions using whole meta-genome assemblies
To detect horizontal gene transfers (HGTs) inside and outside sabkha, we used the HGT MetaChip v1.10.3 pipeline (metagenomics community-level HGT identification pipeline)41, which uses a combination of best-match and phylogeny for HGT detection. The set of metagenome assemblies (bins) were used as an input. We used GTDB-Tk tool42, for taxonomic classification of our input genomes using phylogenetically calibrated the Genome Taxonomy Database (GTDB)43. We defined the grouping of our taxonomic analysis of HGT at the order level. Enrichment analysis of PFAMs of horizontally transferred genes were scanned for enriched biological, cellular and molecular categories using an online tool (https://supfam.mrc-lmb.cam.ac.uk/SUPERFAMILY/cgi-bin/dcenrichment.cgi). Plots of PFAM relationship terms are generated with R using Revigo44.
All metagenomic reads per sample/site were mapped using bwa45 to indexed databases of dereplicated genomes obtained from inner and outer sabkha. Uniquely mapped reads to the representative genomes were used for analysis. The generated BAM files were filtered for uniqueness in which both reads mapped to the same scaffold with mappability of 96% match and mapping quality of mapq > 1 to their representative reference genomes. To achieve this, we used published custom script filter_reads.py available from (https://github.com/alexcritschristoph/soil_popgen). We have conducted quality check comparison where we compared ANI genes with 96% to a 98% ANI cutoff. SNPs were called for each population for the pooled samples/site at frequencies > 5% using a simple null model that assumes a FDR < ~ 10–6.
Population diversity analysis was carried out using reproducible custom scripts that were fitted to our data from https://github.com/alexcritschristoph/soil_popgen. Nucleotide diversity was calculated as the expected frequency difference between two reads at each separate position with minimum 20 × coverage within each sample and averaged across genes. We quantified if there is any bias in changing coverage on nucleotide diversity, where we calculated nucleotide diversity at subsampled coverage for each genomic position. SNPs were assigned as synonymous and nonsynonymous using reproducible scripts from https://github.com/alexcritschristoph/soil_popgen on gene calls annotated with Prodigal v2.6.146.
Linkage disequilibrium, population dynamics and selection tests
Linkage was calculated for all pairs of segregating sites of mapped reads with minimum spanning of 30 high quality base pair reads. R2 and D′ was calculated using equation from VanLiere47. We used mcorr package48 on synonymous third position codon sites to calculate the rate of recombination to mutation (δ/μ) for each population. Population structure, Fst, which measures the difference in allele frequencies between two populations, was calculated on segregating sites across inside, outside and across edge of sabkha. This was done using 20 × coverage per sampled site using the method by Hudson49, as recommended by Bhatia et al.50 using the python package scikit-allel51. Genes with coverage exceed two standard deviations were excluded from the analysis. Mean Fst was determined as the ratio of averages for each gene. The average nucleotide diversity of high Fst compared to genomic average of each species was done using t tests and similarly the average linkage of high Fst compared to genomic linkage average was calculated using t test in R (https://www.r-project.org/).
Sabkha soil analysis
Soil biochemical analysis shows higher concentration of salt in the middle (average of 200 ppt) which decreases to 20 ppt outside of the sabkha (Fig. 1B). Soil conductivity and total dissolved solids (TDS) were positively correlated with salinity (r = 0.8, p < 0.05). Cl−; Na2O was only abundant in the middle, whereas Cu, Fe2O3, P2O5, SiO2 and Al2O3 were abundant in the inner and edge of the sabkhas compared to the outside. In contrast, SO3 was more pronounced on the outside (Fig. 1C, Supplementary Tables 1, 2).
Dynamic shift in metagenomic profile across sabkha salinity gradient
Species diversity within the samples (alpha-diversity) were measured by Shannon diversity index method. The highest diversity was identified in the high salt regions Site I (diversity: 4–4.8) and the edge of sabkha (E) samples (diversity: 4.1–4.6) (Fig. 1D). The lowest microbial diversity (2.5–3.8) was observed in the samples collected from the combined sand and overflow sabkha region (outside). Alpha diversity measures for upper, lower and middle across the eighteen samples show no significant difference among layers (ANOVA F-value = 0.86572, p = 0.53) (Supplementary Fig. 1). The difference in the diversity is only observed between inside and outside the sabkha, which correlates with the salt gradient (ANOVA F-value = 18.962, p-value = 7.8 × 10–5). Diversity between the samples (Beta diversity) analysis of the sabkha microbial composition along the three different sites (I, E, O) identified similar microbial populations between inside and edge sabkha samples, clustered together in a PCoA analysis. Samples collected from outside sabkha region formed a separate cluster in the PCoA analysis (Fig. 1E,F) (PERMANOVA F-value = 45.062, R2 = 0.857, p-value < 0.001).
Taxonomic identity shows that there is shift in abundance between inside and outside the sabkha (Fig. 2A,B) and it is correlated with the salt gradient (R = 0.85, p < 0.05). At phylum level, Bacteroidetes (~ 22 to 34%), Euryarchaeota (~ 18 to ~ 30%), unclassified bacteria (~ 24 to ~ 35%), Actinobacteria (~ 0.01 to ~ 11%) and Cyanobacteria (less than 1%) are predominantly found in both inside and edge sabkha regions, whereas Proteobacteria (~ 92 to ~ 97%) and Parcubacteria (~ 1 to ~ 2%) are predominant in outer sabkha, Cyanobacterial classes were found to be over-represented inside the sabkha compared to the outside. Several archaeal taxa, as well as some unclassified bacteria, were rich in the inner and edge sabkha samples, while halophilic bacteria were overrepresented on the inside hypersaline region of the sabkha (Fig. 2A). Halobacteriaceae, Actinobacteria, Planctomycetaceae, Bacteroidetes and Planctomycetes are found more abundant in inside sabkha. Gammaproteobacteria, Rhodobacteraceae, Alphaproteobacteria, Rhizobiales and Oceanospirillales are found more abundant in the hyposaline outer sabkha. We see a shift in the Proteobacteria structure (Fig. 2B, Supplementary Fig. 2), where Delta/Epsilon Proteobacteria were more abundant in the hypersaline region inside the sabkha, while Gammaproteobacteria, and Alphaproteobacteria were more abundant in the hyposaline outer sabkha (Fig. 2B). Interestingly, we see similar taxonomic groups in the middle and lower layer of inside sabkha and less groups in the upper salt crust of each section (Fig. 2B), but not on the outside the sabkha.
For the sabkha whole metagenome assembly, the 4 sites sampled on inside, edge (s1, s3, s8, s12) and the 4 sites outside (s13, s14, s17, s18) (Fig. 1B) were used for assembly to resolve the extent of genome completeness. In total ~ 830 million PE reads (150 bp) were generated (Supplementary Table 3) for the metagenome assembly, resulting in 664,455 contigs (final genome assembly length ~ 2.3 Gb) with the N50 value of 3.945 Kb (Supplementary Table 4). All the assembled contigs were taxonomically classified (at phylum level) based on the sample location. The contigs related to the phyla Euryarchaeota, Bacteroidetes, Actinobacteria, Planctomycetes, Cyanobacteria, and Firmicutes are predominantly found in the inside sabkha samples, whereas phyla Proteobacteria is predominant in outside sabkha region (Fig. 3A,B). In addition, contigs were binned into 225 genomes across the sabkha (Fig. 4A). Assembled genome bacterial size ranges from ~ 0.5 to ~ 9.3 Mb, with the N50 of the assembled genome from ~ 2 Kb to ~ 1 Mb (Supplementary Table 5). From the assembled bins, we annotated CDS, tRNA, rRNA and tmRNA sequences (Supplementary Table 5). In total, 651,373 protein coding (CDS) and 6698 noncoding genes were annotated from the assembled genome. The phylogenetic tree generated using assembled genomes is shown in (Fig. 4B).
Taxonomically resolved genomes show high nucleotide diversity (π) levels on the inside compared to the outside of the sabkha (Fig. 4C). We observed comparable taxonomic microbial profiles using whole metagenome and 16S rDNA analysis. Interestingly, Halobacteria have the highest diversity on the inside followed by Rhodotermales, whereas Rhizobiales and Pseudomonadales are highest from the outside (Fig. 4C).
The pairwise genome-wide ANI (Average Nucleotide Identity) and alignment coverage of 125 dereplicated but quality-filtered genomes across the sabkha showed a decrease of pairwise ANI at ~ 97% (Supplementary Fig. 3), which is comparable to other reported thresholds in bacterial species52. The clustering of genomes using dRep into groups of species-like populations using ANI = 97% (Supplementary Fig. 4) was done and genomes with > 74% completeness and less than 10% contamination kept for further analysis. The focus on abundant populations/sites with more than 1% abundance resulted in 129 dereplicated genomes with a least 3 genomes per population summarized in Fig. 4D. Using metagenome-assembled genomes, the measure of Irep index show that the inside and outside is fully replicating (Fig. 4D). There is statistically significant difference (t test p = 2.43 × 10−5) between Irep for overall taxonomic order group from hypersaline inside (1.744 ± 0.274) compared to hyposaline outside (1.527 ± 0.145) sabkha (Supplementary Table 6). On the inside, we observe Halobacteriale with an average of 1.754 ± 0.184, while on the outside, Pseudomonadales has 1.554 ± 0.124, which is significantly lower (t test p = 0.02) than inside sabkha. Rhizobiales on the outside is replicating more than the overall outside with Irep around 1.632 ± 0.218 (Fig. 4E). We found no correlation between Irep and Pi for the different taxonomic order (Pearson R = − 0.12, p = 0.53) (Supplementary Fig. 5, Supplementary Table ,6), which suggest involvement of replication of these bacteria in nutrient cycling of sabkha (Supplementary Fig. 7).
Nucleotide diversity in the sabkha
The overall combined microbial population nucleotide diversity (π) was highly significant for all inner microbial communities compared to outside the sabkha (Fig. 5A) as is evident for each sampling site on the inner (s1, s3, s8, s12) compared to outer (s13, s14, s17, s18) (t test Bonferroni p < 0.05) (Supplementary Fig. 8). Clearly, Halobacteriales and Pseudomonadales populations have high diversity from inside the sabkha, while Rhizobiales and Pseudomonadales have highest diversity from outside the sabkha (Supplementary Fig. 8). The comparable diversity of combined as well as per site samples suggests that diversity persists within soils at even finer scale. We also found comparable ranges of per gene nucleotide diversity values (Pearson R = 0.9, p < 0.05) (Supplementary Fig. 9) for different populations at 96% and 98% ANI (Supplementary Fig. 10).
We found in almost all sabkha populations that ribosomal genes have lower nucleotide diversities that the average for genes in the genome (Fig. 5A), consistent with purifying selection53. An exception is for the inner sabkha Halobacteriales population, and outer sabkha populations such as Propionibacteriales, Pseudomonadales, Rhizobiales, and Rhodobacteriales, which have higher ribosomal nucleotide diversities, suggesting that they play an adaptive role in diverse ecological niches and microenvironment. Moreover, biosynthetic genes annotated with antiSMASH have significantly higher diversity than the genome average (pairwise t test Bonferroni p < 0.05), except for Halobacteriales, Thermoplasmatota, Pseudomonadales, and Rhizobiales.
When we looked at the diversity of different classes of biosynthetic genes from inside and outside the sabkha, we find unique patterns of biosynthetic gene class counts (Supplementary Fig. 11, Fig. 5B). Overall shared classes of inside and outside the sabkha include terpene, bacteriocin, betalactone, hserlactone, T3PKS, T1PKS, siderophores and ectoine (Supplementary Fig. 11, Fig. 5B). Inside the sabkha, however, we observe for Halobacteriales LAP (linear azol (in) e-containing peptides) that are synthetized in ribosomes, lanthipeptide and butyrolactones. On the other hand, we observed in outside of the sabkha, we see biosynthetic genes from different unique populations classes, such arylpolyene, CDPS, furan, lassopeptide, nucleoside, NRPS (NonRibosomal Peptide-Synthetase), phenazine, phosphonate, resorcinol and TfuA-related (Supplementary Fig. 11, Fig. 5B).
Horizontal gene transfer, homologous recombination and sabkha microbial diversity
Our analysis at the order level show that there is extensive horizontal gene transfer on the inside and outside the sabkha (Fig. 6A). On the inside, there is an exchange of genetic information between different levels especially Halobacteriales–Rhodotermales, reaching close to 42 HGT events (Fig. 6A, Supplementary Fig. 12). On the other hand, our analysis shows 30–47 HGT events between Rhizobiales–Rhodobacteriales, Kiloniellales–Rhizobiales and Pseudomonadales–Nevskiales outside the sabkha (Fig. 6A, Supplementary Fig. 12).
Using PFAM domains, we observed enriched transfer of abiotic stress genes, detoxification, biosynthetic of small molecules, transport, homeostasis and transport genes associated with HGT events (hypergeometric test, p < 0.05) (Fig. 6B). Two examples of HGT are highlighted in Fig. 6C. Interestingly, we see a gene for social cooperative development inside the sabkha. Cooperation among bacteria is a common strategy of interaction via cell–cell signaling, termed ‘quorum sensing’ (QS), which is an evolutionary stable strategy under extreme abiotic or biotic stress54.
Measuring linkage disequilibrium (LD) of SNPs spanned by paired end reads48,55 in different bacterial population can tell us on the impact of homologous recombination and its effect on genetic diversity. Our results are consistent with the expectation that natural microbial population experience a large amount of recombination, and this is observed with linkage disequilibrium (r2) that decay as the distance increases between two SNPs, both inside and outside the sabkha (Fig. 7A, Supplementary Fig. 13). The estimate of the ratio of the neutral rate of recombination to mutation in synonymous third position (Fig. 7B) shows a range for different species on the inside and outside of the sabkha. High rates of recombination is observed in some Halobacteriales, Balneolales, Enterobacteriales, Pseudomonadales, Rhizobiales and Rhodobacteriales species. On other hand, we observe species like Bacillales on the inside and some Pseudomonadales on the outside with very low rates of recombination (Fig. 7B). We also note, the ratio of recombination of nonsynonymous to synonymous (r2N/r2S) increases with higher diversity population (Fig. 7C; Supplementary Fig. 14).
We also measured the LD measure, namely, D′, which is less than 1 suggesting all possible combinations of a pair of biallelic sites, an serves as indication that recombination is present. Our results for inside the sabkha shows that D′ is correlated with mean r2 (R2 = 0.78, p = 0.005) (Fig. 7D); this correlation is similarly found outside the sabkha (Supplementary Fig. 15). D′ was less than 1 in 9–21% of all site pairs in inside sabkha populations, while D′ < 1 in 2–12% outside the sabkha (Supplementary Fig. 16; Supplementary Table 7). Given that D′ less than one is observed in a significant fraction of loci, as well as the decay of linkage with genomic distance suggests that homologous recombination in the sabkha microbiome.
Population dynamics and gene-specific selective sweeps across salinity gradient of sabkha
Changes in factors such as soil geochemistry, particle structure, salinity, and many other cues make soil ecosystems heterogeneous, as these factors can change over millimeter scales56. Given the complexity in examining the effect of changing abiotic environmental factors over small spatial scales, we estimated the pairwise fixation index Fst for each gene between inside sites (s1, s3, s8, s12) and outside sites (s13, s14, s17, s18); these display a gradual decrease in salinity between sabkha sites (Fig. 8A). Our results show that Fst per populations between sites increases as salinity increase among sites (Fig. 8A), consistent with geographic population structure at most loci across the genome (Supplementary Fig. 17). In contrast, a fraction of Fst values were low (< 5%), consistent with dispersal of alleles among sites.
Population-specific selective pressures acting on specific loci have a signature of high Fst compared to the average genome background57. We identified loci with high Fst using a sliding window of 5 genes and tested if these regions had Fst values greater than 1.5 standard deviations above the mean genome average (Supplementary Table 8). The length of genomic region with high Fst was defined when the mean Fst dropped below this threshold. We identified 72 loci with high Fst, in different microbial genomes (e.g., Halobacteriales 166 s1 vs. s3) despite having low average Fst values (Fig. 8B, Supplementary Fig. 18).
However, the low nonsynonymous to synonymous ratios in these genes that suggests a signal of purifying selection (Supplementary Fig. 19), as well as the reduction in sequencing coverage at some genomic regions, are both correlated with nucleotide diversity (Supplementary Fig. 20). This suggests that we cannot rely solely on identifying selective sweeps by a significant reduction in nucleotide diversity (Fig. 8C), therefore we looked for a significant increase in LD at these loci compared to the genomic background. Using this criterion, we identified 18 loci with an observed significant reduction in nucleotide diversity and increased LD for different population (Halobacteriales, Pseudomonadales, Nitriliruptorales, Bipolaricaulia) within inner and outside sabkha sites (Fig. 8C, Supplementary Fig. 21) suggesting selective sweep events at these genes at different sites in sabkhas. Several of these genes are related to uptake, movement and transport while others have functions related to abiotic stress sensing and response regulation (Supplementary Table 9). Selective sweeps are also evident between sabkha edge and outside sabkha sites (Fig. 8B) (e.g., Pseudomonadales 194), which together suggests that selection is driving some of the differences in microbial population structure across this ecosystem.
Diverse metabolic potential of microbial communities of sabkha
We investigated the ability of microbial communities of sabkha in harboring specific metabolic activity, and nutrient cycling (e.g.; carbon cycle, Nitrogen cycle, sulfur cycle) (Supplementary Fig. 22). We found, Perchlorate reduction and Chlorite reduction which are enriched on the inside of sabkha (Supplementary Table 10). Interestingly, when we look at the number of genes for combined metabolic pathway; on the inside extreme Halophilic microbes (e.g., Halobacteriales, Cyanobacteriales) have more than 50 percent of genes (t test p < 0.05), while outside, moderately halophilic microbes (e.g., Pseudomonadales, Rhizobiales, Nitriliruptorales) have them (t test p < 0.05) (Supplementary Fig. 6, Supplementary Tables 10, 11).
We found evidence for multiple carbon fixation pathways on the inside sabkha, including the Calvin–Benson–Bassham (CBB) cycle in Cyanobacteriales, Halobacteriales, Kiloniellales and Thiohalorhabdales, the 3-hydroxypropionate bicycle (3HP bicycle) in Enterobacterales and Pseudomonadales and the reductive citric acid cycle (rTCA) only found in Nanosalinales. In addition to the latter, carbon fixation pathways that are found in most of the outside metagenomes, the outside of sabkha harbors 3HP/4HB cycle, which is only found in Acidimicrobiales, Burkholderiales, Nitriliruptorales and Polyangiales (Supplementary Tables 10, 11, Supplementary Fig. 7). The key enzyme methyl-coenzyme M reductase (mcr) for methanogenesis was absent from sabkha (Supplementary Tables 10, 11, Supplementary Fig. 7). However, methanotrophy was present through soluble methane monooxygenase (mmoB) gene in Halobacteriales and Pseudomonadales from inside and in Pseudomonadales, Mycobacteriales, and Solirubrobacterales from outside. We see three abundant genes encoding aerobic monoxide dehydrogenase (coxSML) on the inside of sabkha and found only in our Halobacteriales, Kiloniellales and Bipolaricaulia, whereas it is only in Rhizobiales, Rhodobacteriales and Nitriliruptorales from outside (Supplementary Tables 10, 11).
In term of Nitrogen cycling, we did not observe any metagenomic evidence for nitrogen fixation for inside or outside sabkha. On the other hand, we found evidence for the first step of nitrification and denitrification process, using nitrite oxidoreductase (nxrAB), which is involved in nitrite to nitrate and nitrate reductase (narGH), for the reverse reaction (Supplementary Tables 10, 11; Supplementary Fig. 7). The second step of denitrification involves 2 genes (nirSK, norBC) and nirK is much abundant on the inside in Halobacteriales, Kiloniellales and Bipolaricaulia than nirS. nirK genes are found only in 6 metagenomes from the outside, such as in Rhodobacteriales, Pseudomonadales. As for the last step of dissimilatory nitrate reduction to ammonia mediated by nitrite reductase, we see abundance of genes encoding the large and small subunit of the NADH-dependent for of the enzyme (nirB, nirD) on the outside of sabkha and less abundant on the inside and this shift is the same genes encoding the cytochrome c for of the enzyme (nrfAH) (Supplementary Tables 10, 11, Supplementary Fig. 7).
In terms of Sulfur cycling, genes encoding sulfide-quinone reductase (sqr) are not abundant in Sabkha, with 3 genomes from the inside (Cyanobacteriales, Enterobacteriales and Pseudomonadales) and 3 genomes on the outside (Nitrosococcales, Gammaproteobacteria, Enterobacteriales). Instead, others genes such as Sulfur dioxygenase (sdo) and Sulfate adenylyltransferase (sat) are equally abundant on both sides of sabkha. Genes associated with thiosulfate oxidation (soxBY) were most abundant in the outer sabkha and included in metagenomes such as Burkholderiales, Rhizobiales, Rhodobacteriales and Rhodospirillales. The inside has only soxY and soxB genes in Thiohalorhabdales and Kiloniellales respectively (Supplementary Tables 10, 11; Supplementary Fig. 7).
We see evidence for aromatic compounds, complex carbon and fatty acid biodegradation pathways for bacteria and archaea metagenomes. Other pathways, such Arsenic cycling, Urea utilization, oxidative phosphorylation, fermentation and Amino acid utilization are metabolic functions that sabkha ecosystem is harboring (Supplementary Tables 10, 11).
Our results from a UAE coastal sabkha using 16S and whole metagenome analysis shows a shift in archaeal and bacterial taxa composition, with unexpected diversity inside compared to outside the sabkha. The shift of total microbial genetic composition inside vs. outside coastal sabkhas (Figs. 2A,B, 3B, 4C,D), suggests that salinity may act as an important driver of metabolic microbial activity. There is an increased number of halophiles (e.g., Halobacteriales, Cyanobacteriales) in the hypersaline (brine > 200 ppt) conditions within a sabkha compared to the less saline environment (brackish < 35 ppt sea water) outside the sabkha where other microbes exists (e.g., Pseudomonadales, Rhizobiales, Nitriliruptorales) (Supplementary Tables 10, 11, Supplementary Fig. 6, Fig. 4D). This result is consistent with the replication index Irep across all assembled metagenomes where we observed more replication on the inside (> 1.7) compared to outside for the order taxonomic level (Fig. 4E).
The significant biogeochemical consequences can be classified for inside sabkha to sensing/interaction, osmotic stress tolerance and photosynthesis, while outside mostly are involved in osmotic tolerance. Although they have a limited influx of freshwater and little to no surface vegetation, sabkhas have a high potential for diverse carbon sequestration (e.g., CBB cycle RuBisCo pathway as well as others). In our study, we observed Halobacteriales, Cyanobacteriales, and Pseudomonadales taxonomic order from inside sabkha that are likely to contribute to the carbon sequestration of the coastal sabkha (Supplementary Tables 10, 11). Cyanobacteriales species are at the heart of light-driven metabolic processes in sabkhas; we note a high abundance of Chloroflexi in our samples, similar to the study carried out by Ward et al.58. The cyanobacteria, Rubidibacter from inside sabkha have rod-like morphology that is similar to Synechocystis59. It was discovered that betaine is synthesized in cyanobacteria, through Choline as a precursor and its concentration is salt dependent in halophilic bacteria60,61. Betaine is a common osmoprotectants in halotolerant and halophilic microorganisms61,62 and an important feature of halotolerant Cyanobacteria as well as phototrophic bacteria63. The release of high concentrations of homoserine betaine upon lysing likely impacts the biogeochemical cycling of carbon and nitrogen in oligotrophic subtropical and tropical oceans64. Rubidibacter species may function as solar umbrellas that enhance the viability of more radiation-susceptible microbes, through the accumulation of the UV-blocking pigments zeaxanthin, beta-carotene, and echinenone to high levels59. A similar hypothesis was made by Di Loretta et al.65 in their description of filamentous anoxygenic photosynthetic bacteria (FAPB) mats and their dominance of the uppermost photo-oxic layers in the Khor al-Adaid sabkha in Qatar65. Thus, large-scale interdependence of cyanobacteria in juxtaposition with other microbial assemblages may be a crucial factor in thriving, dynamic sabkha development.
Interestingly, in addition to the light harvesting photosynthetic cyanobacteria, Halophitic Archaea (e.g., Halobacteriales, Nanosalinales) found inside sabkha expand the diversity of known Carboxydovores (CO-oxidizers). These CO-oxidizing groups have the ability to use light-driven proton pump bacteriorhodopsin to supply energy66. Different mechanisms such as CO-oxidation, light-harvesting complexes to sequester energy is very important for heterotrophs inside sabkha subjected to extreme saline environment, which impose extra costs for osmoregulation. This is also consistent with the noticeable presence of perchlorate reduction on the inside more than outside as well as more Chlorite reduction for these groups, which is an evidence for plausible microbe reaction on Martian brines and the reaction will emit oxygen, which will strongly contribute to facilitating the process of Mars colonization66,67.
Consistent with the CO-oxidizers, nitrogen cycling via denitrification (nitrate to nitrite) is an abundant portion of the inside (Halobacteriales, Nanosalinales) and outside sabkha (Pseudomonadales, Rhizobiales), which is consistent with other studies68,69. This abundance of denitrification genes contrasts with the absence of nitrogen fixation genes (Supplementary Fig. 7). For instance, Cyanobacteria are known nitrogen-fixing bacteria70, however, they have only norB genes, which is required for nitric oxide reduction during denitrification. The lack of nitrogenase sequences could be the lack of more sampling of deeper anoxic sediments.
The presence of a dynamic sulfur cycle via sulfide oxidation is prominent in inside and outside sabkha. This process takes place via aerobic and anaerobic pathways, similar to the denitrification process, using alternative electron acceptors71. The sulfide quinone reductase (sqr) is known to be involved in energy conservation, where it is linked to ATP production via sulfide oxidation or sulfide detoxification, where the sulfide oxidation is only used to protect microbial cell from damage72,73,74.
Other strategy for carbon fixation is via methylated carbon source in methanogenesis, which is a known strategy to overcome competition for CO2 in saline environment75,76. Sequences encoding hydrogenases for methanogenesis is absent on the inside, and outside sabkha, however Methanotrophy is present in both side of sabkha (Supplementary Fig. 7). The lack of methanogenesis genes may require deeper sediments sampling.
Ribosomal genes, which are in general conserved and subject to purifying selection, show lower diversity across sabkhas; however, we observe a significant increase in the diversity of ribosomal genes in Halobacteriales (p < 0.05) (Fig. 5A,B). Interestingly, ribosomal biosynthetic genes LAP and lanthipeptide have the highest diversity inside sabkhas (Fig. 5B; Supplementary Fig. 10) for Halobacteriales. LAP and lanthipeptide are known to coordinate cooperative social interaction among bacteria, including toxin secretion, acquisition of metals such as iron for all bacteria and signaling antimicrobials molecules77,78,79.
We observe extensive horizontal gene transfer (HGT) between microbial species in the sabkha environment (Fig. 6A). This process has been demonstrated in the soil microbiome, where gene gain and loss are evidenced by the difference in genome sizes among closely related isolates80. Horizontal gene transfer is an important process in sharing beneficial genes across populations and communities. Enrichment PFAM analysis (Fig. 6B) show that horizontally transferred genes related to abiotic stress tolerance, including loci for metabolic/catabolic process, temperature response, detoxification and transport of solutes, lipid modification have higher diversity than the genome average (Supplementary Tables 12, 13). Interestingly, we observed HGT inside sabkhas as a source of social cooperative development genes among the microbiome. This is consistent with what the LAP and lanthipeptide gene we observe in Halobacteriales, which is known to be involved in sensing and communicating via an ABC transporter81.
The relative rate of recombination and levels of LD confirm that homologous recombination is common in sabkha. We see a large variation in recombination for greater than 90 percent of the populations (Fig. 7B), and a significant fraction of D′ less than 1 with LD decay as a function of distance across the genome. This provides a good evidence for homologous recombination in sabkha populations. Moreover, the increase of the ratio of r2N/r2S with nucleotide diversity (Fig. 7C), suggest an increase of the ratio of beneficial to slightly deleterious nonsynonymous SNPs. The presence in different genomes of genes that confer natural competence for DNA uptake and recombination, such as ComEC and archaeal pili IV, suggest that they are likely to play a role in this process (Supplementary Table 14).
Our results show that a change in salt concentration at short length scales (within centimeters) contributes to divergence of alleles across the sabkha salt gradient. We observed allele frequencies for genes consistent with changes in salinity concentration (Fig. 8A) as shown in Fst comparisons between sites inside and outside the sabkhas (Fig. 8A; Supplementary Fig. 18). Specific loci with high Fst (Fig. 8B) show decreased nucleotide diversity compared to the average background genome (Fig. 8C) as well as high LD compared to the average genome background (Supplementary Fig. 21), consistent with selective sweeps events along the salinity gradient across sabkha. In addition, these genes have higher nonsynonymous to synonymous ratios, suggesting recent selection on beneficial nonsynonymous mutations. The differentially selected genes across sabkha, which includes ABC transporters, abiotic stress genes, quorum sensing in Halobacteriales, osmoprotection in Pseudomonadales and Rhizobiales (Supplementary Table 9), suggest that gene-specific selection is another evolutionary force driving genetic structure in sabkha population. The hydrocarbon-degrading bacteria Proteobacteria, Bacteroidetes, Actinobacteria and Halobacteria have previously been documented in high saline soil environments82, and further study of members from these clades may provide insight into novel industrial applications. Therefore, the study of the whole metagenomes assembled genomes and protein-coding sequences of these microbes, and other undiscovered species (cultured and uncultured) through different method of isolation, can provide foundations for new green chemistry applications. Salt stress tolerance genes isolated from Halobacteria, Proteobacteria, Actinobacteria, Gemmatimonadetes, Bacteroidetes, Firmicutes, and Acidobacteria have been previously cloned, and new osmotolerance mechanisms were revealed that might provide benefits for biosaline agriculture83. We managed to isolate 12 species from inside and outside sabkha, and some important physiological traits have been measured and are ready for more detailed functional studies related to biosaline agriculture applications (Supplementary Fig. 23, Supplementary Table 15).
This study is the first to examine the sabkha ecosystem not only for its microbial composition using 16S rDNA analysis13,65, but with whole metagenomic data from sabkha sites that vary across a salinity gradient. This provides a cohesive picture of the genetic structure of sabkha soil population within a relatively short spatial scale. We conclude that the sabkha environment, both inside this high salinity ecosystem as well as the adjacent environment outside the sabkha, is a dynamic ecosystem, containing eubacterial and archaebacterial species. Interestingly, despite its stressful environment, sabkhas appear to harbor even greater microbial diversity compared to the area outside. Moreover, our metagenomic analysis suggests that some of the evolution in the sabkha environment is driven by the horizontal transfer of genes, as well as recombination and gene-specific selection, which shape the population genetic structure in this ecosystem. In addition, this study shed the light on the importance and potential of these bacterial consortium in alleviating and enhancing abiotic stress for agricultural application.
The 16S rDNA (V3–V4) and Whole metagenomic data generated during this study were deposited in NCBI-SRA database under the Bioproject id PRJNA753691. Metagenome resolved assembled genomes are located in https://github.com/KCGEB-UAEU/Sabkha-dataanalysis/tree/main/Bins_genome. Additional information is located at this github (https://github.com/KCGEB-UAEU/Sabkha-data-analysis). SNPs, linkage, diversity, and pairwise Fst comparison for inner and outer sabkha data is hosted at: https://doi.org/10.5281/zenodo.6381683. All horizontal genes transfer between different orders for inner and outer sabkha: https://github.com/KCGEB-UAEU/Sabkha-data-analysis.
Average Nucleotide Identity
Auxiliary metabolic genes
Horizontal gene transfer
Operational taxonomic units
Single nucleotide polymorphism
Kyoto encyclopedia of genes and genomes
False discovery rate
Total dissolved solids
Al-Amoudi, O. S. B. Studies on Soil-Foundation Interaction in the Sabkha Environment of Eastern Province of Saudi Arabia (King Fahd University of Petroleum and Minerals, 1992).
Kirkham, A. & Evans, G. Sabkha Ecosystems 15–40 (Springer, 2019).
Abuelgasim, A. & Ammad, R. Mapping Sabkha Land surfaces in the United Arab Emirates (UAE) using Landsat 8 data, principal component analysis and soil salinity information. Int. J. Eng. Manuf. 7, 1 (2017).
Evans, G., Schmidt, V., Bush, P. & Nelson, H. Stratigraphy and geologic history of the sabkha, Abu Dhabi, Persian Gulf. Sedimentology 12, 145–159 (1969).
Arifuzzaman, M., Habib, M. A., Al-Turki, M. K., Khan, M. & Ali, M. Improvement and characterization of sabkha soil of Saudi Arabia: A review. Jurnal Teknologi 78, 1–11 (2016).
Al-Jaloud, A. A. & Hussain, G. Sabkha Ecosystems 1–7 (Springer, 2006).
Basyoni, M. H. & Aref, M. A. Composition and origin of the sabkha brines, and their environmental impact on infrastructure in Jizan area, Red Sea Coast, Saudi Arabia. Environ. Earth Sci. 75, 1–17 (2016).
Cousin, A. et al. Classification of igneous rocks analyzed by ChemCam at Gale crater, Mars. Icarus 288, 265–283 (2017).
Lasue, J. et al. Martian eolian dust probed by ChemCam. Geophys. Res. Lett. 45, 10968–910977 (2018).
Hecht, M. H. et al. Detection of perchlorate and the soluble chemistry of Martian soil at the Phoenix lander site. Science 325, 64–67 (2009).
Barbieri, R. & Cavalazzi, B. Microterracettes in Sabkha Oum Dba (Western Sahara, Morocco): Physical and biological interactions in the formation of a surface micromorphology. Astrobiology 18, 1351–1367 (2018).
Benison, K. C. & LaClair, D. A. Modern and ancient extremely acid saline deposits: Terrestrial analogs for Martian environments?. Astrobiology 3, 609–618 (2003).
Alotaibi, M. O. et al. Microbial diversity of some sabkha and desert sites in Saudi Arabia. Saudi J. Biol. Sci. 27, 2778–2789 (2020).
Fierer, N. Embracing the unknown: Disentangling the complexities of the soil microbiome. Nat. Rev. Microbiol. 15, 579–590 (2017).
Al-Enazi, N. M., Awaad, A. S., Al-Othman, M. R., Al-Anazi, N. K. & Alqasoumi, S. I. Isolation, identification and anti-candidal activity of filamentous fungi from Saudi Arabia soil. Saudi Pharm. J. 26, 253–257 (2018).
Albokari, M., Cinar, S. & Mutlu, M. Microbial characterization of Jazan sabkha in Saudi Arabia. Appl. Ecol. Environ. Res. 15, 1069–1077 (2017).
Chevrette, M. G. et al. Taxonomic and metabolic incongruence in the ancient genus Streptomyces. Front. Microbiol. 10, 2170 (2019).
Hultman, J. et al. Multi-omics of permafrost, active layer and thermokarst bog soil microbiomes. Nature 521, 208–212 (2015).
White, R. A. III. et al. Moleculo long-read sequencing facilitates assembly and genomic binning from complex soil metagenomes. Msystems 1, e00045-00016 (2016).
Ji, M. et al. Atmospheric trace gases support primary production in Antarctic desert surface soil. Nature 552, 400–403 (2017).
Woodcroft, B. J. et al. Genome-centric view of carbon processing in thawing permafrost. Nature 560, 49–54 (2018).
Andrews, S. Babraham Bioinformatics (Babraham Institute, 2010).
Buffalo, V. Scythe—A Very Simple Adapter Trimmer (Github, 2014).
Joshi, N. & Fass, J. Sickle: A Sliding-Window, Adaptive, Quality-Based Trimming Tool for FastQ Files (Github, 2011).
Callahan, B. J. et al. DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods 13, 581–583 (2016).
Quast, C. et al. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Res. 41, D590–D596 (2012).
Dhariwal, A. et al. MicrobiomeAnalyst: A web-based tool for comprehensive statistical, visual and meta-analysis of microbiome data. Nucleic Acids Res. 45, W180–W188 (2017).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Uritskiy, G. V., DiRuggiero, J. & Taylor, J. MetaWRAP—A flexible pipeline for genome-resolved metagenomic data analysis. Microbiome 6, 1–13 (2018).
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. A. metaSPAdes: A new versatile metagenomic assembler. Genome Res. 27, 824–834 (2017).
Seemann, T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).
Blin, K. et al. AntiSMASH 5.0: Updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res. 47, W81–W87 (2019).
Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. dRep: A tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 11, 2864–2868 (2017).
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Brown, C. T., Olm, M. R., Thomas, B. C. & Banfield, J. F. Measurement of bacterial replication rates in microbial communities. Nat. Biotechnol. 34, 1256–1263 (2016).
LaPierre, N. et al. MiCoP: Microbial community profiling method for detecting viral and fungal organisms in metagenomic samples. BMC Genomics 20, 1–10 (2019).
Truong, D. T. et al. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat. Methods 12, 902–903 (2015).
Breitwieser, F. P. & Salzberg, S. L. Pavian: Interactive analysis of metagenomics data for microbiome studies and pathogen identification. Bioinformatics 36, 1303–1304 (2020).
Zhou, Z., Tran, P., Liu, Y., Kieft, K. & Anantharaman, K. METABOLIC: A scalable high-throughput metabolic and biogeochemical functional trait profiler based on microbial genomes. Biorxiv 10, 761643 (2019).
Okuda, S. et al. KEGG Atlas mapping for global analysis of metabolic pathways. Nucleic Acids Res. 36, W423–W426 (2008).
Song, W., Wemheuer, B., Zhang, S., Steensen, K. & Thomas, T. MetaCHIP: Community-level horizontal gene transfer identification through the combination of best-match and phylogenetic approaches. Microbiome 7, 1–14 (2019).
Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36,1925–1927 (2020).
Parks, D. H. et al. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat. Biotechnol. 36, 996–1004 (2018).
Supek, F., Bošnjak, M., Škunca, N. & Šmuc, T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One 6, e21800 (2011).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics 26, 589–595 (2010).
Hyatt, D. et al. Prodigal: Prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 11, 1–11 (2010).
VanLiere, J. M. & Rosenberg, N. A. Mathematical properties of the r2 measure of linkage disequilibrium. Theor. Popul. Biol. 74, 130–137 (2008).
Lin, M. & Kussell, E. Inferring bacterial recombination rates from large-scale sequencing datasets. Nat. Methods 16, 199–204 (2019).
Hudson, R. Gene trees, species trees and the segregation of ancestral alleles. Genetics 131, 509 (1992).
Bhatia, G., Patterson, N., Sankararaman, S. & Price, A. L. Estimating and interpreting FST: The impact of rare variants. Genome Res. 23, 1514–1521 (2013).
Miles, A. & Harding, N. cggh/scikit-allel: v1. 1.8, July 2017. https://doi.org/10.5281/zenodo822784 (2017).
Jain, C., Rodriguez-R, L. M., Phillippy, A. M., Konstantinidis, K. T. & Aluru, S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat. Commun. 9, 1–8 (2018).
Rosen, M. J., Davison, M., Fisher, D. S. & Bhaya, D. Probing the ecological and evolutionary history of a thermophilic cyanobacterial population via statistical properties of its microdiversity. PLoS One 13, e0205396 (2018).
Asfahl, K. L. & Schuster, M. Additive effects of quorum sensing anti-activators on Pseudomonas aeruginosa virulence traits and transcriptome. Front. Microbiol. 8, 2654 (2018).
Rosen, M. J., Davison, M., Bhaya, D. & Fisher, D. S. Fine-scale diversity and extensive recombination in a quasisexual bacterial population occupying a broad niche. Science 348, 1019–1023 (2015).
Holsinger, K. E. & Weir, B. S. Genetics in geographically structured populations: Defining, estimating and interpreting FST. Nat. Rev. Genet. 10, 639–650 (2009).
Oksanen, J. et al. Package ‘vegan’. Community ecology package, version 2, 1–295 (2013).
Ward, L. M., Hemp, J., Shih, P. M., McGlynn, S. E. & Fischer, W. W. Evolution of phototrophy in the Chloroflexi phylum driven by horizontal gene transfer. Front. Microbiol. 9, 260 (2018).
Choi, J. S. et al. Photomovement of the gliding cyanobacterium Synechocystis sp. PCC 6803. Photochem. Photobiol. 70, 95–102 (1999).
Cánovas, D., Vargas, C., Calderón, M. I., Ventosa, A. & Nieto, J. J. Characterization of the genes for the biosynthesis of the compatible solute ectoine in the moderately halophilic bacterium Halomonas elongata DSM 3043. Syst. Appl. Microbiol. 21, 487–497 (1998).
Roberts, M. F. Organic compatible solutes of halotolerant and halophilic microorganisms. Saline Syst. 1, 1–30 (2005).
Oren, A. Microbial life at high salt concentrations: Phylogenetic and metabolic diversity. Saline Syst. 4, 1–13 (2008).
Welsh, D. T. Ecological significance of compatible solute accumulation by micro-organisms: From single cells to global climate. FEMS Microbiol. Rev. 24, 263–290 (2000).
Pade, N. et al. Trimethylated homoserine functions as the major compatible solute in the globally significant oceanic cyanobacterium Trichodesmium. Proc. Natl. Acad. Sci. 113, 13191–13196 (2016).
DiLoreto, Z. A. et al. Microbial community composition and dolomite formation in the hypersaline microbial mats of the Khor Al-Adaid sabkhas, Qatar. Extremophiles 23, 201–218 (2019).
Myers, M. R. & King, G. Halobacterium bonnevillei sp. nov., Halobaculum saliterrae sp. nov. and Halovenus carboxidivorans sp. nov., three novel carbon monoxide-oxidizing Halobacteria from saline crusts and soils. Int. J. Syst. Evol. Microbiol. 70, 4261–4268 (2020).
Ren, C. et al. A bioinspired molybdenum catalyst for aqueous perchlorate reduction. J. Am. Chem. Soc. 143, 7891–7896 (2021).
Lynch, K. L. et al. Discrete community assemblages within hypersaline Paleolake sediments of Pilot Valley, Utah. bioRxiv, 634642 (2019).
McGonigle, J. M., Bernau, J. A., Bowen, B. B. & Brazelton, W. J. Robust archaeal and bacterial communities inhabit shallow subsurface sediments of the Bonneville Salt Flats. Msphere 4, e00378-00319 (2019).
Li, Y. et al. Reconstruction of the functional ecosystem in the high light, low temperature union glacier region, Antarctica. Front. Microbiol. 10, 2408 (2019).
Shao, M.-F., Zhang, T. & Fang, H.H.-P. Sulfur-driven autotrophic denitrification: Diversity, biochemistry, and engineering applications. Appl. Microbiol. Biotechnol. 88, 1027–1042 (2010).
Schütz, M., Maldener, I., Griesbeck, C. & Hauska, G. N. Sulfide-quinone reductase from Rhodobacter capsulatus: Requirement for growth, periplasmic localization, and extension of gene sequence analysis. J. Bacteriol. 181, 6516–6523 (1999).
Marcia, M., Ermler, U., Peng, G. & Michel, H. The structure of Aquifex aeolicus sulfide: Quinone oxidoreductase, a basis to understand sulfide detoxification and respiration. Proc. Natl. Acad. Sci. 106, 9625–9630 (2009).
Van Der Meer, M. T. et al. Cultivation and genomic, nutritional, and lipid biomarker characterization of Roseiflexus strains closely related to predominant in situ populations inhabiting Yellowstone hot spring microbial mats. J. Bacteriol. 192, 3033–3042 (2010).
McGenity, T. J. & Sorokin, D. Y. Methanogens and methanogenesis in hypersaline environments. in Handbook of Hydrocarbon and Lipid Microbiology 665–680 (2010).
Sorokin, D. Y. & McGenety, T. Biogenesis of Hydrocarbons 1–24 (Springer, 2019).
Cornforth, D. M. & Foster, K. R. Competition sensing: The social side of bacterial stress responses. Nat. Rev. Microbiol. 11, 285–293 (2013).
Hider, R. C. & Kong, X. Chemistry and biology of siderophores. Nat. Prod. Rep. 27, 637–657 (2010).
Hibbing, M. E., Fuqua, C., Parsek, M. R. & Peterson, S. B. Bacterial competition: Surviving and thriving in the microbial jungle. Nat. Rev. Microbiol. 8, 15–25 (2010).
Iranzo, J., Wolf, Y. I., Koonin, E. V. & Sela, I. Gene gain and loss push prokaryotes beyond the homologous recombination barrier and accelerate genome sequence divergence. Nat. Commun. 10, 1–10 (2019).
Li, Y. & Rebuffat, S. The manifold roles of microbial ribosomal peptide-based natural products in physiology and ecology. J. Biol. Chem. 295, 34–54 (2020).
Beazley, M. J. et al. Microbial community analysis of a coastal salt marsh affected by the Deepwater Horizon oil spill. PLoS One 7, e41305 (2012).
Ahmed, V., Verma, M. K., Gupta, S., Mandhan, V. & Chauhan, N. S. Metagenomic profiling of soil microbes to mine salt stress tolerance genes. Front. Microbiol. 9, 159 (2018).
We thank the Ministry of Presidential Affairs, United Arab Emirates for the generous funding of Khalifa Center for Genetic Engineering and Biotechnology. We would like to thank the Unit of Genome studies in Biology Department, College of Science, United Arab Emirates University.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Hazzouri, K.M., Sudalaimuthuasari, N., Saeed, E.E. et al. Salt flat microbial diversity and dynamics across salinity gradient. Sci Rep 12, 11293 (2022). https://doi.org/10.1038/s41598-022-15347-8