Permafrost underlies approximately one quarter of Northern Hemisphere terrestrial surfaces and contains 25–50% of the global soil carbon (C) pool. Permafrost soils and the C stocks within are vulnerable to ongoing and future projected climate warming. The biogeography of microbial communities inhabiting permafrost has not been examined beyond a small number of sites focused on local-scale variation. Permafrost is different from other soils. Perennially frozen conditions in permafrost dictate that microbial communities do not turn over quickly, thus possibly providing strong linkages to past environments. Thus, the factors structuring the composition and function of microbial communities may differ from patterns observed in other terrestrial environments. Here, we analyzed 133 permafrost metagenomes from North America, Europe, and Asia. Permafrost biodiversity and taxonomic distribution varied in relation to pH, latitude and soil depth. The distribution of genes differed by latitude, soil depth, age, and pH. Genes that were the most highly variable across all sites were associated with energy metabolism and C-assimilation. Specifically, methanogenesis, fermentation, nitrate reduction, and replenishment of citric acid cycle intermediates. This suggests that adaptations to energy acquisition and substrate availability are among some of the strongest selective pressures shaping permafrost microbial communities. The spatial variation in metabolic potential has primed communities for specific biogeochemical processes as soils thaw due to climate change, which could cause regional- to global- scale variation in C and nitrogen processing and greenhouse gas emissions.
Permafrost is a crucial part of ecosystem biogeochemistry in high latitude soils. The microbial communities within permafrost slowly transform organic matter though geologic time, shaping the permafrost environment, and contribute to greenhouse gas emissions upon thaw. Microbial communities vary over space and time and are strongly affected by ecosystem and soil characteristics such as pH, vegetation, temperature, and precipitation across broad environmental gradients [1,2,3]. Yet in permafrost, microbial communities are often disconnected from modern ecosystems, drawn from communities that were present prior to permafrost formation and modified over time [4, 5].
Permafrost has formed at multiple times in the past, and its presence is dependent on geography and local site conditions. Much of the permafrost in northern latitude soils was formed during the late Pleistocene (11,700–129,000 years) . While Holocene (present – 11,700 years) aged permafrost can be hundreds to thousands of years old, rates of formation were greatest during the Little Ice Age (~1300 to 1850 CE) . Permafrost soils may be derived from wetlands, glaciofluvial deposits, loess, or weathered bedrock . Due to differences in glaciation, loess deposition rates, and thaw events driven by disturbances such as wildfire, there is often a strong landscape-scale variation in permafrost age, mode of formation, and physical structure that is likely reflected in the microbial communities that reside there.
Permafrost microbial communities are influenced by the age of permafrost aggradation , sedimentation, thaw history, marine exposure [10, 11], permafrost origin , and mineral composition . Prominent members of the permafrost community include Proteobacteria and Actinobacteria whereas Firmicutes, Bacteroidetes, and Chloroflexi can vary substantially with age and other physicochemical factors [9, 14]. Most permafrost Archaea belong to methanogens from Euryarchaeota , though recent studies have identified Thaumarchaeota, Bathyarchaeota, and Heimdallarchaeota [10, 11, 15]. Consequently, permafrost microbial communities typically reflect local climates, vegetation, and soil properties present at the time of formation. But over geologic time (with little influx of water, resources, or microbial immigrants), diversity declines and genetic potential changes [4, 5, 14]. Although we know something about the composition and diversity of permafrost microbial communities, we know much less about the genetic composition that gives rise to community function in situ.
Examining large scale biogeography of microbial communities allows us to understand features common to all permafrost communities, as well as factors that vary most among communities. We analyzed 133 permafrost metagenomes from the panarctic, including the United States, Canada, Russia, Sweden, and Svalbard (Fig. 1A, Table S1, Table S2) to (1) identify the dominant permafrost taxa, (2) ascertain the environmental variables shaping the occurrence of taxa and potential functions defined by the relative abundance of functional genes, and (3) identify highly variable genes that reflect selective pressures driving differences among permafrost microbial communities. We determined the distribution of taxa and functional genes across the Arctic and we related them to environmental metadata derived from landform, landform history, and soil chemical attributes. This study provides unprecedented insights into factors structuring microbial communities and their functional potential in this climate-critical ecosystem.
Materials and methods
We analyzed 133 permafrost metagenomes from samples collected across the panarctic, including the United States, Canada, Russia, Sweden, and Svalbard (Fig. 1A, Table 1). We sequenced 91 samples and combined these results with data from 42 samples obtained from previously published studies (Tables S1 and S2) [9, 10, 16,17,18,19,20,21,22,23,24,25]. Permafrost soils were collected in different ways depending on their depth and access limitations. Within a few meters of the surface, we used a Snow, Ice and Permafrost Research Establishment (SIPRE) core barrel (7.62-cm inner diameter, up to 120 cm in length ). For samples tens of meters deep we used a truck-mounted deep core auger which operated without drilling fluid . When we had access to permafrost from tunnels or cliff faces, we collected samples from the sidewalls using a large hole saw attached to a hand drill. Where possible, at least three replicates were collected per site. In all cases, methods were used to ensure soils were free from surface microbial contamination as described previously [9, 17]. Details specific to each location sampled are contained in the Supplementary Information.
Permafrost age dating, chemical analyses, and core processing
We collected geographic and geomorphic data for each site using their GPS locations. These data included continental region (Siberia, Interior Alaska, Alaskan Tundra, High Arctic Canada, Spitzbergen, and Sweden), permafrost type (continuous versus discontinuous and yedoma versus non-yedoma [28, 29], origin (lacustrine, aeolian, alluvium), and elevation. Permafrost age (e.g., Holocene versus Pleistocene or kya) was derived from existing data or inferred from published literature (Table S2). Soil chemical attributes were collected from existing data or measured as needed (Table 1).
Core sub-sectioning, DNA extraction, library preparation, and sequencing
Prior to DNA extraction, the outer 1-2 cm of permafrost cores were removed using sterile tools to expose the uncontaminated interior. Cores were further subsectioned with sterile knives and chisels and DNA was extracted from 0.5 g to 2 g using the FastDNA SPIN Kit for soil (MP Biomedicals, Santa Ana, CA, USA) or the DNeasy PowerSoil Kit, Qiagen, Germantown, MD, USA). Sequencing libraries were prepared using either emulsion PCR  or the Illumina TruSeq DNA sample preparation kit v2 following the low-throughput protocol and sequenced on a HiSeq 4000 or 2500 instrument (Illumina Inc, San Diego, CA). Additional details describing contamination precautions, library preparation, and sequencing specific to each site are in the Supplementary Information.
Statistical analyses of the full KEGG dataset
Reads were annotated through comparison to the KEGG database as described in the Supplementary Information. The resulting data are available at ScienceBase. Statistical analyses of the environmental dataset and the full KEGG dataset were carried out in R  and Primer-E version 6 . The environmental dataset had 133 samples and 12 continuous variables (latitude, longitude, elevation, age, depth, total C, total N, C/N, organic C, pH, ice content, and EC) for a total of 1596 possible observations. Because our multivariate analysis methods exclude a sample if there is even a single missing observation, we used an expectation maximization (EM) algorithm to impute the missing values, as implemented in the Amelia R package . We normalized all data to meet the assumptions of the EM algorithm (see Supplementary Information).
The imputed data were imported into Primer-E for PCA and PERMANCOVA analyses. A PCA of the environmental data was performed on the correlation matrix. PC1 primarily represented age, depth, TN, TC, OC, and pH (eigenvector coefficients: −0.31, −0.36, 0.45, 0.45, 0.44, −0.36) and explained 33.4% of the variation. PC2 primarily represented latitude, elevation, and C/N (eigenvector coefficients: 0.55, −0.43, −0.47) and explained 25.8% of the variation. The z-scores for PC1 and PC2 for each sample were output and used as covariates in subsequent PERMANCOVAs.
For PERMANCOVA analyses, we applied an abundance filter retaining only the genes observed more than 10 times in at least 10% of samples. Hypothesis testing was carried out using PERMANCOVA on KEGG gene relative abundance data, which were square root transformed and used to create a Bray-Curtis dissimilarity matrix. Two PERMANCOVA runs were performed testing the effects of PC1 and PC2 on gene distribution. An additional PERMANCOVA was also performed testing latitude as an a priori contrast fit to linear, quadric, or cubic orthogonal polynomial models. For all three PERMANCOVA tests, we used a 3-factor mixed, nested model with continent as a fixed factor and the random factors were nested (site within region and region within continent).
Identification of highly variable functional genes
We identified KEGG genes showing the highest variability in abundance across sites using the DESeq2 package  in R . Prior to this, we applied an abundance filter where genes observed more than 100 times in at least 10% of samples were retained. The p values were adjusted for multiple hypothesis testing using the Benjamini and Hochberg method. Sites with at least one replicate were compared to all other replicated sites, and the number of times each gene differed significantly (p < 0.01) between pairs of sites was counted. We defined highly variable genes as the top 5% differing most often between pairs of sites (ScienceBase Data). These genes were clustered using the ward.D2 method and visualized in the pheatmap package .
To determine if specific KEGG pathways, modules, or functional categories were significantly enriched in groups of highly variable genes identified by ward.D2 clustering, we used the hypergeometric distribution. We defined N as the total number of highly variable genes, n was the total number of genes in the ward.D2-identifed group, K was the total number of variable genes in a particular pathway/module/category, and k was the number of variable genes within the subgroup that were part of the particular pathway/module/category. The p value of finding k genes within a group was determined using the phyper function and corrected for multiple hypothesis testing using the Benjamini-Hochberg method in R.
Relating taxonomic composition and potential function to biophysical parameters
Taxonomic classification was performed by extracting shotgun metagenome reads matching the V4 hypervariable region of the 16 S rRNA gene as described in the Supplementary Information. To understand the biophysical parameters that explained variation at the class level we performed constrained redundancy analysis (RDA) using the vegan  package in R . For biophysical measurements, we used imputed and transformed data as described above. Taxonomic abundance data were Hellinger transformed. Linear dependencies between variables were assessed using the variation inflation factor (VIF). Multicollinearity was indicated by VIF values exceeding ten, so forward selection of explanatory variables was performed using the ordiR2step function. Significance of the remaining predictors was assessed using the anova.cca function and then corrected for multiple testing using the false discovery rate. Principal component analysis (PCA) on the transformed taxonomic abundance data was performed to compare constrained and unconstrained ordinations. We also regressed the relative abundance of the most abundant taxa against environmental variables (JMP 16, SAS Institute, Cary NC) including latitude, depth, pH, TC, OC, TN, electrical conductivity, and ice content.
We performed k-means cluster analysis on the highly variable KEGG genes and 16 S rRNA gene data (JMP 16, SAS Institute, Cary NC). For both KEGG and 16 S rRNA genes, we used a Johnson transformation and within cluster standard deviations to improve cluster assignments. We then quantified the optimal number of clusters using the elbow method, which minimized the sums of squares within clusters relative to the number of clusters examined. Clusters with four or fewer samples were removed from downstream comparative analysis using ANOVA. To examine whether different clusters were associated with different biophysical parameters, we utilized a one-way ANOVA with cluster as the main factor and biophysical parameters as the dependent variables. Prior to ANOVA we ensured that data were normally distributed by testing using a Shapiro-Wilk W Goodness of Fit test followed by normalization if necessary. Significant one-way ANOVA was followed by a Tukey-Kramer HSD test with an accepted p value of 0.05. Data are presented as means and one standard deviation unless otherwise stated.
Soil physicochemical properties
In our panarctic investigation of 133 permafrost samples physicochemical characteristics ranged broadly. Our dataset represented samples from subarctic to high Arctic latitudes (Table 1). Permafrost age ranged from approximately one thousand years before present (kyr) to 880 kyr (ScienceBase). Most soils represent near-surface permafrost the upper several meters, but several locations contained deeper permafrost (>10 m) from tunnels or borehole samples. Pleistocene aged samples were deeper (11.9 ± 0.79 m vs 2.6 ± 0.9 m (standard error (s.e.)) than Holocene aged samples (F1,131 = 56.7, p < 0.001). Organic C ranged from approximately 1% to over 40% in frozen wetland soils. Holocene age permafrost soils had higher % OC compared to Pleistocene permafrost soils (9.2 ± 1.2 vs 2.6 ± 1.0 %) due to a large number of organic rich wetland soils contained in Holocene permafrost (F1,128 = 26.6, p < 0.0001). Similarly, %N was higher in Holocene aged permafrost (0.7 ± 0.07 vs 0.2 ± 0.06, F1,126 = 19.5, p < 0.0001 (s.e.)). C/N ratios varied widely, sometimes as low as 2 to as high as 23, with a mean of 13.2 ± 8.0 and was higher in Holocene than Pleistocene aged permafrost (16.0 ± 1.1 vs 10.8 ± 0.9, F1,124 = 14.5, p = 0.0002). Permafrost also ranged in soil pH. Soil pH differed by soil age (F1,125 = 37.5; p < 0.0001), with Pleistocene aged permafrost having a higher pH (7.3 ± 0.11) than Holocene aged permafrost (6.28 ± 0.13). Ice content ranged from 0.14 g/g to over 1.0, and did not differ by permafrost age (Table 1).
Microbial taxonomic composition in permafrost across the Arctic
Analysis of 16 S rRNA gene sequences extracted from shotgun metagenomes demonstrated that permafrost bacterial and archaeal communities were highly variable in composition and diversity (Fig. 1B). Sequences affiliated with Proteobacteria and Actinobacteria dominated permafrost samples across the Arctic, averaging 41% and 20% of the total community, respectively. The distribution of other highly abundant taxa (Firmicutes, Bacteroidetes, and Chloroflexi) showed substantial variability across sites. For example, Firmicutes represented nearly 50% relative abundance in some interior Alaska samples (e.g. Fox Tunnel) but were vanishingly rare in others (Fig. 1B).
Shannon diversity was unimodally related to soil pH, where highest alpha diversity was observed in circumneutral pH soils and lower in acidic or alkaline soils (Fig. 1C). Several samples from the Vault Creek Tunnel, Wadepiper Lake, and Kolyma showed very low alpha diversity (Fig. 1C). Outlier samples from Vault Creek Tunnel tended to be deeper than much of the rest of the dataset, and the outlier from Kolyma was the oldest sample in our dataset (880 kyr).
Constrained redundancy analysis (RDA) demonstrated that environmental gradients explained variation in taxonomic structure at the class level. Permafrost microbial compositional variability was significantly explained by depth, latitude, elevation, pH, age, and yedoma classification (all p < 0.002, Fig. S1). We examined the distribution of the most abundant classes in relation to latitude, depth, and other factors. Regression statistics were corrected for multiple comparisons using the Benjamini and Hochberg method. With increasing latitude, Actinobacteria (R2 = 0.13, p < 0.008) and Bacteroidia (R2 = 0.05, p < 0.03) increased in relative abundance while Alphaproteobacteria declined (R2 = 0.08, p = 0.007). With increasing depth, Gammaproteobacteria declined (R2 = 0.06, p = 0.03). Permafrost age (Pleistocene vs Holocene) also affected the relative abundance of different taxa. Bacilli, Alphaproteobacteria, Chlamydiia, Planctomycetia, and Chloroflexia were more abundant in Pleistocene permafrost whereas Gammaproteobacteria and Deltaproteobacteria were more abundant in Holocene aged permafrost (Figure S2). At the level of phylum, Acidobacteria were also more abundant in Holocene permafrost. Soil pH, ice content, %OC, %N, and C/N did not consistently affect the relative abundance of many of these taxa across the panarctic.
Variation in microbial functional gene in permafrost across the Arctic
To determine how functional genes were distributed across the Arctic, we annotated metagenome reads through comparison to the Kyoto Encyclopedia of Genes and Genomes (KEGG) database and evaluated the relationships between the relative abundances of KEGG genes, location, and biophysical factors. Permutational analysis of covariance (PERMANCOVA) tests indicated that interactions between biophysical factors and sampling location drove functional assemblages (Table S3). Covariates such as OC, N, and pH did not explain significant variation in the gene abundance data on their own but were significant when interacting with sample site. For the PERMANCOVA analysis that included the linear combination of %N, total C, % OC, age, depth, and pH as a single covariate (the first principal component (PC1) of the biophysical data), site nested within region explained 26.5% of the variation in gene relative abundance data (Pseudo-F = 6.36, p < 0.0001) and the interaction between PC1 and site within region explained 56.0% (Pseudo-F = 2.25, p = 0.0002). For PERMANCOVA analysis with the second principal component (PC2 representing a linear combination of latitude, elevation, and C/N) as a covariate, site within region explained 13.7% of the variation (Pseudo-F = 6.65, p < 0.0001) and the interaction between PC2 and site within region explained 77.3% (Pseudo-F = 2.48, p < 0.001). Tests to detect covariation between gene relative abundances and latitude (and whether patterns were linear, quadratic, or cubic) were not significant (p > 0.05).
Clustering of samples based on highly variable functional genes and their relationship to environmental conditions
We performed a selection procedure, identifying a subset of 327 genes from the original set that had the greatest variability among sites (top 5%, ScienceBase Data), which likely reflect the strongest selective pressures driving differences among microbial communities. We clustered samples based on the abundances of highly variable genes to determine whether clusters related to microbial functional potential exist across the Arctic and whether they are related to broad environmental gradients. This resulted in fifteen distinct clusters of samples. Six clusters had between six and 43 samples, which were retained for downstream analyses. Six clusters, each with an individual sample, and three clusters containing between 2 and 4 samples were removed from further comparisons. We used ANOVA to determine differences in biophysical properties among the remaining six clusters, which revealed that permafrost properties among clusters differed most significantly with respect to pH, latitude, and soil depth (significant one-way ANOVA, p < 0.0001, R2 > 0.26, Table S4, Fig. 2). Though we conservatively required clusters to have greater than six samples, this trend was similar even when all clusters were considered (Fig. S3). Permafrost age, %OC, and %N differed among clusters, but the variance explained was low (p < 0.05, R2 ≤ 0.19, Table S4). Clusters did not vary by continent, region, permafrost type (continuous vs discontinuous), C/N ratio, EC, or ice content. Nitrate + nitrite in all samples was relatively low, averaging 0.14 ± 0.26 mg kg−1 soil, and did not differ among sample clusters.
Clusters 1 through 6 were named and arranged according to environmental gradients (Table S4). For example, permafrost soils from low numbered clusters were more acidic, shallower, higher latitude, and richer in C and nitrogen (N) whereas permafrost soils from higher numbered clusters were from more alkaline, deeper, lower latitude, and lower C and N. Cluster 1 was above 70° N (Arctic) and Cluster 2 was from the low Arctic (between 67 and 70° N, Fig. 2). Clusters 3 and 4 were both from a more neutral pH and similar latitude (low Arctic) but differed by depth (Fig. 2). Clusters 5 and 6 from the subarctic (below 67° N), were the most alkaline of all clusters (pH > 7.0), and differed by soil depth (Fig. 2, Table S4). But clusters 5 and 6 also differed strongly in the relative abundance of archaea (15.3% versus 2.2%, respectively), which we used to label these two clusters (Fig. 2).
Comparison of taxonomic composition and functional genes
We compared clusters of samples generated with 16 S rRNA gene abundance data with clusters of samples generated with functional gene profiles to determine whether taxonomic clusters corresponded to functional clusters. A contingency analysis indicated taxonomic clusters were not associated with functional gene-based clusters. The Agreement Statistic, based on the frequency of co-occurrence between taxonomic and functional gene clusters was very low (Kappa statistic = −0.01, p > 0.05) indicating that the observed agreement was less than by chance.
Characterization of highly variable functional genes
We evaluated the distribution of highly variable genes among sample clusters, revealing that clusters had distinct metabolic and functional themes, which were associated with differences in energy metabolism, C assimilation, and substrate utilization (Table S5). Hierarchical clustering partitioned highly variable genes into two primary groups (A and B) with two sub-groups in each main group (A1, A2, B1, B2; Fig. 3, data available in ScienceBase). We refer to these as groups of functional genes rather than clusters to distinguish them from the clusters of samples described earlier. Group A1 (dark purple in Fig. 3), which contained genes that were highly abundant in Clusters 4 and 5, was significantly enriched in genes related to metabolism under reducing anaerobic conditions, specifically methanogenesis (16% of genes in A1, p < 0.0001) and fermentation (7.5% of genes in A1, p < 0.0001). Concomitant with the high abundance of methanogenesis genes (and therefore a high abundance of archaea), group A1 was significantly enriched in archaeal genes (e.g., archaea-specific elongation factors, polymerases, and translation initiation factors) (23% of A1 genes, p < 0.0001). The fermentation pathways in Group A1 produce end-products that fuel methanogenesis—formate, acetate, and CO2. These data indicate that the oxidative conversion of pyruvate to acetyl-CoA and CO2 is catalyzed by pyruvate ferredoxin oxidoreductase, formate is produced by pyruvate formate lyase, and acetate is produced by acetate-CoA ligase. Methanogenesis genes included both hydrogenotrophic and acetoclastic pathways (Fig. 3). The second set of genes in Group A (Group A2, indicated by green in Fig. 3) did not reveal a central metabolic theme. Instead, we observed a significant enrichment of genes related to toxin/antitoxin systems and host defense mechanisms (p = 0.0022), and transposases (p = 0.042). There is substantial uncertainty and debate over the function of toxin/antitoxin systems but they are often found on mobile genetic elements and may be involved in competition between cells .
Functional gene groups varied in the abundance of genes related to the metabolism of amino acids and other nitrogenous molecules, aromatic hydrocarbons, fatty acids, and carbohydrates (Table S5). The most notable trend in Group A was a greater number of genes related to carbohydrate degradation (p = 0.048) and, in Group A2, an enrichment of peptidases (p = 0.0022). Both of which were absent in Group B (Table S5).
Genes from Group A were highly abundant in the “subarctic archaea-enriched” and the “low Arctic deep” clusters of samples (Clusters 5 and 4, respectively) (Fig. 3). Clusters 4 and 5 contained samples from Blacksheep Pond, the oldest and deepest (~90 kya, 40 m) Vault Creek Tunnel samples, 19 kya section of the Fox Tunnel, Holocene permafrost from Bol’shoy Lyakhovsky Island, and Utqiaġvik polygons. Methanogens and methanogenic activity have been previously observed in 19 kya Fox Tunnel samples and Utqiaġvik polygons [9, 37, 38] and the Holocene Bol’shoy Island samples originate from ancient lake sediments, which may account for the high abundance of methanogenic pathways . Shallow Blacksheep Pond samples are likely derived from a large ancient thermokarst lake system that existed at the site approximately 6 kya, which thawed through Pleistocene-aged yedoma. Prior to draining, methanogenic communities were likely established and subsequently incorporated into permafrost when it re-froze. Neither of the acidic sample clusters (Clusters 1 & 2) contained a high abundance of methanogenesis-related genes, which is consistent with the inhibitory effect low pH has on methanogenesis .
The genes in Group B are suggestive of more favorable redox conditions than genes in Group A. Nitrogen cycle genes were significantly enriched in Group B (p = 0.019) and are predicted to support anaerobic respiration using nitrate and nitrite as terminal electron acceptors through nitrate reduction to nitrite (narG, narH, and napA) and dissimilatory nitrate reduction to ammonia (DNRA) (nirB). We also observed evidence of the ability to conduct assimilatory nitrate reduction (nasA) and nitrate transport (nrtA) (Fig. 3).
Group B was also enriched in genes associated with anabolic pathways that use substrates entering central metabolism via acetyl-CoA (e.g., fatty acids, waxes, alkenes, ketogenic amino acids, methylated compounds, or C2 compounds such as acetate). This stands in contrast to molecules that enter through pyruvate or phosphoenolpyruvate, such as carbohydrates and most amino acids [41, 42]. Potential increased use of these substrates is represented by the significant enrichment of two anaplerotic pathways, the glyoxylate cycle and the ethylmalonyl-CoA pathway (p = 0.0022), which replenish citric acid cycle intermediates that have been drained for biosynthetic processes (Fig. 3) . This observation is supported by the presence of fatty acid metabolism genes in Group B and their corresponding absence in Group A (Table S5).
Group B genes were associated with managing osmotic stress, such as the uptake and synthesis of osmoprotectants and compatible solutes (p = 0.0051) (Fig. 3). These compounds include glycine betaine, choline, and proline, and osmolytes that balance osmotic pressure, maintain turgor pressure, inhibit protein aggregation, and maintain membrane fluidity at low temperatures . Group B also contained a transporter gene for spermidine/putrescine, which are small polyamines that protect cells against diverse stressors such as temperature, osmotic pressure, and reactive oxygen species .
Genes from Groups B were highly abundant in the “sub Arctic non-archaeal” cluster of sites (Cluster 6). These sites include most Fox Tunnel samples from the late Pleistocene 27 kya and 33 kya age categories, Eemian (125 kya) and some late Pleistocene (30 kya) permafrost samples from Bol’shoy Lyakhovsky Island in Russia, and some samples from Kaktovik, Alaska. Clusters 1, 2, and 3 (Arctic acidic, low Arctic acidic, and low Arctic shallow), were defined by a high abundance of genes from Groups A2 and B1. Metabolically, these clusters are more like Cluster 6 than Clusters 4 & 5 because of the low abundance of methanogenesis genes and high abundance of a subset of the nitrate reduction and anaplerosis genes. Notable samples from these clusters are from Svalbard, Kaktovic, Stordalen Mire, most late Pleistocene samples from Bol’shoy Lyakhovsky Island, and the youngest Vault Creek Tunnel samples (25 kya). Many samples from this metabolically similar group of clusters (i.e., 1, 2, 3, and 6) were previously found to contain a high abundance of acetate and/or other short chain fatty acids. Specifically, late Pleistocene (30 to 54 kya) Bol’shoy Lyakhovsky Island samples had high concentrations of acetate compared with Holocene samples . Similarly, 27 kya and 33 kya Fox Tunnel permafrost samples had higher concentrations of acetate, butyrate, and isovalerate than 19 kya tunnel samples , which is consistent with our observations of an increased ability to assimilate these molecules.
Across the panarctic, permafrost microbial communities displayed significant site to site variability in both taxonomic and functional gene composition, much of which could be explained by latitudinal, soil chemical, and soil depth gradients. We observed latitudinal gradients in community composition and functional genes that are likely related to local climate, which we associate with a decline in mean annual air temperature with increasing latitude. Cold ground surface temperatures correspond to lower permafrost temperatures generally , with concomitant declines in substrate diffusion and microbial activity .
Soil pH had an impact on both the diversity of permafrost organisms and their functional genes. Soil pH strongly influences the structure and function of microbial communities in Arctic active layer soils and many other ecosystems worldwide [3, 48]. Significant variation in soil pH exists across the Arctic , and has been an important factor in structuring plant communities as well . Soil pH may affect microbial diversity and their functional characteristics by modifying resource availability, energetics of reactions, and the presence of inhibitory substances . We observed that in slightly acidic soils (<pH 6.25), functional genes associated with methanogenesis and fermentation were in relatively low abundance. For methanogens, this result could be due to the direct inhibitory effect of pH, particularly on acetoclastic methanogens .
Soil depth was a major influence on the taxonomic composition and functional genes of permafrost microbial communities as well. Increasing soil depth likely adds stress to microbial communities due to lower resource availability . Deeper permafrost soils (10 s of meters) had lower C and N concentrations compared to shallow permafrost. But other factors such as lower permafrost temperatures, lower rates of diffusion, and older permafrost age may also be an important contributor to this depth dependent pattern [53, 54].
Several other factors, particularly permafrost age, soil OC, and soil N, had a significant influence on permafrost taxonomic composition and functional genes, but the amount of variability explained was lower than we expected. We had hypothesized that these factors, including the soil C/N ratio, would have more of an influence over taxonomic or functional clusters, as observed in another study  and given the importance of C/N ratios in governing rates of C fluxes from thawed soils . This incongruity may come from the fact that permafrost microbial communities only access a small portion of the permafrost soil C and N pool because they survive and grow in narrow water films . Thus, bulk chemical measurements may not represent the fraction of permafrost C and N being accessed by the community prior to thaw. Permafrost age is particularly complex because it covaries with both soil depth and soil pH. Our Pleistocene samples were deeper and more alkaline than Holocene aged permafrost. This could be in part because Pleistocene permafrost is more often associated with loess hills while Holocene permafrost is more often located near lowlands. Regardless, once Pleistocene permafrost thaws, soil pH could become more acidic as carbonates are removed . Thus, Pleistocene permafrost that thawed and reformed in the Holocene could be more acidic on average, indirectly producing the pH-dependent functional gene clusters observed in this study.
Though we found that environmental parameters had a significant effect on both taxonomic and functional genes, the strengths of these effects differed among data types. The strongest patterns were observed in the subset of highly variable functional genes. There are likely several explanations why highly variable genes show the strongest relationships with biophysical attributes of soils. First, they represent genes experiencing the strongest selective pressures. Second, by excluding genes that did not vary (e.g., housekeeping genes) we likely enhanced detectable signals of variation across the landscape. Furthermore, differences in association with environmental parameters between taxonomic and functional genes could be partially explained by functional redundancy . In this case, functional genes may be strongly associated with environmental parameters, with little change in taxonomic composition. We also acknowledge the limitations of short reads in performing taxonomic identifications. We conservatively chose to limit analyses to class, which is a somewhat coarse level of resolution. It is possible that genus or “species” level classifications would yield a more complete view of the relationships between environment, genes, and taxonomy.
Conceptual model of permafrost microbial community assembly
Based on the patterns of taxonomic composition and functional gene abundances, we propose a conceptual model of permafrost microbial community assembly (Fig. 4). Just as environmental variation creates differences in microbial community structure and functional gene distribution in non-permafrost soils [1,2,3, 60], the paleoenvironment prior to and during permafrost formation shaped communities in the soils that eventually become permafrost [4, 61, 62]. The permafrost formation process itself, which is driven by the coupled effects of paleoclimate (e.g., warm and wet, cold and dry) and paleoecosystem (e.g., forest, grassland, lake sediment), also affects OM characteristics [63, 64] and likely the corresponding microbial communities . For example, syngenetic permafrost generally experiences fewer freeze-thaw cycles than epigenetic permafrost, which may lead to a more labile C pool because of reduced exposure to microbial processing prior to formation . On the other hand, epigenetic permafrost, which can take millennia to form , may contain a more decomposed C pool .
Once entrained, frozen conditions are an environmental filter as a barrier to the import of resources, water, and microbial immigrants. But this filter also varies because microorganisms may remain active, especially in permafrost where temperatures approach 0 °C , transforming the community and the resources surrounding it [4, 9, 67, 68]. The longer microorganisms are entrained in permafrost, the further these compositional and functional transformations can occur. Therefore, contemporary permafrost communities are a subset of ancient communities that have passed through selective filters or survived through stochastic processes (e.g., drift) and may have diversified since entrainment depending on permafrost age, resource availability, and temperature (which affect cell division rates and growth).
Functional genes differentiating permafrost microbial communities
The most highly variable genes across all permafrost samples were overwhelmingly related to energy acquisition and substrate availability suggesting that soil redox conditions, OM pools, and thermodynamic constraints on microbial metabolism are some of the strongest selective pressures affecting permafrost microbial communities. Co-occurrence of fermentation and methanogenesis genes underline that fermenters degrade labile organic C and produce the acetate, formate, and CO2 that supply methanogens. On the other hand, genes and pathways abundant in the non-methanogenic clusters indicate more favorable redox conditions. DNRA and nitrate to nitrite reduction genes indicate nitrate may serve as a thermodynamically favorable terminal electron acceptor to fuel microbial oxidation of OM through anaerobic respiration. While nitrate reduction functional genes show significant site to site variability, other abundant N cycle genes were more uniform in their distribution. For example, the relative abundance of norB (responsible for nitric oxide reduction to nitrous oxide (N2O)) was similar to the highly variable narG (nitrate reduction to nitrite) but norB relative abundance was comparatively stable across sites (see Supplementary Information). N2O is a potent greenhouse gas and thawing permafrost may be a significant source in the coming decades and centuries . Our data suggest that microorganisms with the potential to produce N2O are widely distributed in permafrost. This, coupled with the observation that nosZ (catalyzes the final step in denitrification–N2O to N2) is rare in our samples indicates that permafrost microorganisms are poised to produce N2O and may currently be doing so.
Differences in functional gene groups also suggest pathways of OM transformations in permafrost may vary across sites. The high relative abundance of genes in the ethylmalonyl-CoA pathway, the glyoxylate shunt, amino acid and fatty acid metabolism in non-methanogenic permafrost could indicate microorganisms are unable to access carbohydrate-rich OM and are required to use less energetically favorable substrates. The high abundance of the glyoxylate shunt may also be an indicator of quiescence, where cells are viable and metabolically active but persist in a non-replicating state . Another recent study from the coastline of the Eastern Siberian Sea similarly found that metabolic strategies were key to adaptation to cold anoxic permafrost environments and even identified many of the same genes (e.g., nitrate reduction, peptidases, and fermentation) that differentiated permafrost communities from distinctive depths and ages .
Because functional gene profiles indicate potential function, we do not know how these relate to actual function in intact permafrost and post thaw, particularly because microbial communities can quickly change after transition from a frozen to thawed state [16, 71]. Furthermore, we did not differentiate between active, dead, or dormant microorganisms so we are limited in our view of who is active or who may become active. However, we did not observe substantial differences between live and dead populations within intact permafrost in a previous study . Cluster analysis, although useful to understand similarities among samples, does not perfectly aggregate samples within sites. We observed instances where samples from the same site were associated with different clusters, which likely indicates that sometimes local scale variability outweighs regional differences. Furthermore, the biophysical descriptors associated with each cluster are averages and individual samples may be from differing environments. Lastly, much of the available data on permafrost microbial communities, including data from this study, are biased towards Alaska and even particular sites within Alaska. Future efforts should increase representation from Canada, Europe, and Asia and systematically target geographically, geologically, and ecologically distinct permafrost environments using well-replicated sampling designs.
Though we identified environmental factors that contribute to shaping permafrost microbial community functional potential, there are still gaps in our understanding that limit our ability to make predictions about the distribution of functional gene groups based on permafrost characteristics. Incorporating information about paleoenvironment and site history may be important keys to predictive understanding of the distribution of microbial communities and their functions in permafrost. For example, understanding paleovegetation may fill some of our knowledge gaps. Permafrost microbial communities should show adaptations related to characteristics of the plant community present at the time of formation. Prior studies from the Fox Permafrost Tunnel metagenomes provide support for this argument. Paleovegetation (inferred from metagenomic sequences derived from plant detrital DNA) was correlated with historical climate, dissolved organic C characteristics, and the abundance of genes encoding carbohydrate active enzymes [4, 9]. In the future, expanding studies to diversify permafrost environmental locations coupled with functional and biogeochemical measurements and site history investigations promises to enable predictive understanding of global patterns and will enable us to address increasingly complex questions about the principles governing organization of microbial life in permafrost and the effects of these communities on crucial biogeochemical cycles, both currently and in response to climate change.
In soils, approximately 40% of DNA is from cells that are no longer intact . In permafrost, the proportion of recovered DNA from dead cells may be even greater because frozen conditions act as a preservative . In samples from the Fox Permafrost Tunnel, depletion of exogenous DNA decreased yields by 24–62% depending on sample age . The extent to which DNA from dead cells alters conclusions in soil DNA-based studies is an open question. For this study, we suggest that the overarching conclusions would likely not change, though some of the smaller details might. Previous work in permafrost suggests that removal of relic DNA does not substantially alter microbial community structure . This result is expected when the death rate and degradation of detrital DNA is similar across taxa. Similar observations have been made in deep marine sediments  which, like permafrost, host communities that must survive in a cold environment with limited resources.
Permafrost is poised to be one of the largest biological feedbacks to future climate change . However, our limited understanding of permafrost and its microbial inhabitants may hinder our understanding of system feedbacks and representation in climate models, jeopardizing efforts to predict the consequences of permafrost thaw in a changing climate. The fact that permafrost soils are not a monolith is a hurdle to our understanding: they are incredibly diverse in their paleoecology, geology, soil chemistry, and soil microbiology. Permafrost soil microorganisms have the unique feature of containing information about both the paleoenvironment and the modern environment, which can be interpreted through their genetic composition. The microorganisms in thawed permafrost soils may be derived from the permafrost community, thereby having an outsized effect on soil function. Despite enormous diversity in permafrost environments and variation in community structure and functional genes, we identified large scale patterns. The entrainment of microorganisms in permafrost during its formation, coupled with frozen static conditions and barriers to immigration, likely makes paleoenvironment a crucial yet overlooked factor in the structure of contemporary communities . These microorganisms are poised to act as “first responders” during thaw, controlling turnover of the most labile and accessible substrates , and greenhouse gas fluxes. In this way, past legacies will uniquely shape future ecosystem responses.
Datasets including biophysical data, taxonomic data, and KEGG data are available on ScienceBase at https://doi.org/10.5066/P9ZDRJ3K. Sequence data are available at the National Center for Biotechnology Information (NCBI) under BioProject accession number PRJNA830449.
Bahram M, Hildebrand F, Forslund SK, Anderson JL, Soudzilovskaia NA, Bodegom PM, et al. Structure and function of the global topsoil microbiome. Nature. 2018;560:233–7.
Waldrop MP, Holloway JM, Smith DB, Goldhaber MB, Drenovsky RE, Scow KM, et al. The interacting roles of climate, soils, and plant production on soil microbial communities at a continental scale. Ecology. 2017;98:1957–67.
Delgado-Baquerizo M, Oliverio AM, Brewer TE, Benavent-González A, Eldridge DJ, Bardgett RD, et al. A global atlas of the dominant bacteria found in soil. Science. 2018;359:320–5.
Leewis M-C, Berlemont R, Podgorski DC, Srinivas A, Zito P, Spencer RGM, et al. Life at the frozen limit: microbial carbon metabolism across a late Pleistocene permafrost chronosequence. Front Microbiol. 2020;11:1753.
Abramov A, Vishnivetskaya T, Rivkina E. Are permafrost microorganisms as old as permafrost? FEMS Microbiol Ecol. 2021;97:fiaa260.
Strauss J, Schirrmeister L, Grosse G, Fortier D, Hugelius G, Knoblauch C, et al. Deep Yedoma permafrost: A synthesis of depositional characteristics and carbon vulnerability. Earth-Sci Rev. 2017;172:75–86.
Treat CC, Jones MC. Near-surface permafrost aggradation in Northern Hemisphere peatlands shows regional and global trends during the past 6000 years. Holocene. 2018;28:998–1010.
Ping CL, Jastrow JD, Jorgenson MT, Michaelson GJ, Shur YL. Permafrost soils and carbon cycling. Soil. 2015;1:147–71.
Mackelprang R, Burkert A, Haw M, Mahendrarajah T, Conaway CH, Douglas TA, et al. Microbial survival strategies in ancient permafrost: insights from metagenomics. ISME J. 2017;11:2305–18.
Mitzscherling J, Horn F, Winterfeld M, Mahler L, Kallmeyer J, Overduin PP, et al. Microbial community composition and abundance after millennia of submarine permafrost warming. Biogeosciences. 2019;16:3941–58.
Liang R, Li Z, Lau Vetter MCY, Vishnivetskaya TA, Zanina OG, Lloyd KG, et al. Genomic reconstruction of fossil and living microorganisms in ancient Siberian permafrost. Microbiome. 2021;9:110.
Rivkina E, Petrovskaya L, Vishnivetskaya T, Krivushin K, Shmakova L, Tutukina M, et al. Metagenomic analyses of the late Pleistocene permafrost – additional tools for reconstruction of environmental conditions. Biogeosciences. 2016;13:2207–19.
Bottos EM, Kennedy DW, Romero EB, Fansler SJ, Brown JM, Bramer LM, et al. Dispersal limitation and thermodynamic constraints govern spatial structure of permafrost microbial communities. FEMS Microbiol Ecol. 2018;94:fiy110.
Jansson JK, Taş N. The microbial ecology of permafrost. Nat Rev Microbiol. 2014;12:414–25.
Winkel M, Mitzscherling J, Overduin PP, Horn F, Winterfeld M, Rijkers R, et al. Anaerobic methanotrophic communities thrive in deep submarine permafrost. Sci Rep. 2018;8:1291.
Mackelprang R, Waldrop MP, DeAngelis KM, David MM, Chavarria KL, Blazewicz SJ, et al. Metagenomic analysis of a permafrost microbial community reveals a rapid response to thaw. Nature. 2011;480:368–71.
Hultman J, Waldrop MP, Mackelprang R, David MM, McFarland J, Blazewicz SJ, et al. Multi-omics of permafrost, active layer and thermokarst bog soil microbiomes. Nature. 2015;521:208–12.
Chauhan A, Layton AC, Vishnivetskaya TA, Williams D, Pfiffner SM, Rekepalli B, et al. Metagenomes from thawing low-soil-organic-carbon mineral cryosols and permafrost of the canadian high Arctic. Genome Announc. 2014;2:e01217–14.
Woodcroft BJ, Singleton CM, Boyd JA, Evans PN, Emerson JB, Zayed AAF, et al. Genome-centric view of carbon processing in thawing permafrost. Nature. 2018;560:49–54.
Liang R, Lau M, Vishnivetskaya T, Lloyd KG, Wang W, Wiggins J, et al. Predominance of anaerobic, spore-forming bacteria in metabolically active microbial communities from ancient Siberian permafrost. Appl Environ Microbiol. 2019;85:e00560–19.
Wagner R, Zona D, Oechel W, Lipson D. Microbial community structure and soil ph correspond to methane production in Arctic Alaska soils. Environ Microbiol. 2017;19:3398–410.
Yergeau E, Hogues H, Whyte LG, Greer CW. The functional potential of high Arctic permafrost revealed by metagenomic sequencing, qPCR and microarray analyses. ISME J. 2010;4:1206–14.
Taş N, Prestat E, McFarland JW, Wickland KP, Knight R, Berhe AA, et al. Impact of fire on active layer and permafrost microbial communities and metagenomes in an upland Alaskan boreal forest. ISME J. 2014;8:1904–19.
Ward CP, Nalven SG, Crump BC, Kling GW, Cory RM. Photochemical alteration of organic carbon draining permafrost soils shifts microbial metabolic pathways and stimulates respiration. Nat Commun. 2017;8:772.
Xue Y, Jonassen I, Øvreås L, Taş N. Bacterial and archaeal metagenome-assembled genome sequences from Svalbard permafrost. Microbiol Resour Announc. 2019;8:e00516–19.
Rand J, Mellor M. Ice-coring augers for shallow depth sampling. CRREL Rep. 1985;85–21:1–22.
Schwamborn G, Wetterich S. Russian-German Cooperation CARBOPERM: Field campaigns to Bol’shoy Lyakhovsky Island in 2014. Berichte zur. Polar- und Meeresforsch = Rep. Polar Mar Res. 2015;686:1–104.
Strauss J, Laboor S, Schirrmeister L, Fedorov AN, Fortier D, Froese DG, et al. Database of ice-rich yedoma permafrost ersion 2 (IRYP v2). 2022. PANGAEA. https://doi.org/10.1594/PANGAEA.940078.
Strauss J, Laboor S, Schirrmeister L, Fedorov AN, Fortier D, Froese D, et al. Circum-Arctic map of the yedoma permafrost domain. Front Earth Sci Chin. 2021;9:758360.
R Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing, 2021. https://www.R-project.org/.
Anderson MJ, Gorley RN, Clarke RK. Permanova+ for Primer: guide to software and statistical methods. PRIMER-E: Plymouth, UK, 2008.
Honaker J, King G, Blackwell M. AmeliaII: A program for missing data. J Stat Softw. 2011;45:1–47.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
Kolde R. pheatmap: Pretty Heatmaps. R package version 1.0.12, https://CRAN.R-project.org/package=pheatmap. 2019.
Oksanen J, Kindt R, Legendre P, O’Hara B, Stevens MHH, Oksanen MJ, et al. The vegan package. Community Ecol package. 2007;10:719.
Jurėnas D, Van Melderen L. The variety in the common theme of translation inhibition by type II toxin-antitoxin systems. Front Genet. 2020;11:262.
Waldrop MP, Wickland KP, White RIii, Berhe AA, Harden JW, Romanovsky VE. Molecular investigations into a globally important carbon pool: permafrost-protected carbon in Alaskan soils. Glob Chang Biol. 2010;16:2543–54.
Taş N, Prestat E, Wang S, Wu Y, Ulrich C, Kneafsey T, et al. Landscape topography structures the soil microbiome in Arctic polygonal tundra. Nat Commun. 2018;9:777.
Stapel JG, Schwamborn G, Schirrmeister L, Horsfield B, Mangelsdorf K. Substrate potential of last interglacial to Holocene permafrost organic matter for future microbial greenhouse gas production. Biogeosciences. 2018;15:1969–85.
Jain SR, Mattiasson B. Acclimatization of methanogenic consortia for low pH biomethanation process. Biotechnol Lett. 1998;20:771–5.
Lorenz MC, Fink GR. Life and death in a macrophage: role of the glyoxylate cycle in virulence. Eukaryot Cell. 2002;1:657–62.
Dolan SK, Welch M. The glyoxylate shunt, 60 years on. Annu Rev Microbiol. 2018;72:309–30.
Schneider K, Peyraud R, Kiefer P, Christen P, Delmotte N, Massou S, et al. The ethylmalonyl-CoA pathway is used in place of the glyoxylate cycle by Methylobacterium extorquens AM1 during growth on acetate. J Biol Chem. 2012;287:757–66.
Gregory GJ, Boyd EF. Stressed out: Bacterial response to high salinity using compatible solute biosynthesis and uptake systems, lessons from Vibrionaceae. Comput Struct Biotechnol J. 2021;19:1014–27.
Rhee HJ, Kim E-J, Lee JK. Physiological polyamines: simple primordial stress molecules. J Cell Mol Med. 2007;11:685–703.
Obu J, Westermann S, Bartsch A, Berdnikov N, Christiansen HH, Dashtseren A, et al. Northern Hemisphere permafrost map based on TTOP modelling for 2000-16 at 1 km2 scale. Earth Sci Rev. 2019;193:299–316.
Rivkina EM, Friedmann EI, McKay CP, Gilichinsky DA. Metabolic activity of permafrost bacteria below the freezing point. Appl Environ Microbiol. 2000;66:3230–3.
Malard LA, Anwar MZ, Jacobsen CS, Pearce DA. Biogeographical patterns in soil bacterial communities across the Arctic region. FEMS Microbiol Ecol. 2019;95:fiz128.
Hollingsworth TN, Walker MD, Chapin FS III, Parsons AL. Scale-dependent environmental controls over species composition in Alaskan black spruce communities. Can J Res. 2006;36:1781–96.
Gray ND, McCann CM, Christgen B, Ahammad SZ, Roberts JA, Graham DW. Soil geochemistry confines microbial abundances across an Arctic landscape; implications for net carbon exchange with the atmosphere. Biogeochemistry. 2014;120:307–17.
Ye R, Jin Q, Bohannan B, Keller JK, McAllister SA, Bridgham SD. pH controls over anaerobic carbon mineralization, the efficiency of methane production, and methanogenic pathways in peatlands across an ombrotrophic–minerotrophic gradient. Soil Biol Biochem. 2012;54:36–47.
Tripathi BM, Kim M, Kim Y, Byun E, Yang J-W, Ahn J, et al. Variations in bacterial and archaeal communities along depth profiles of Alaskan soil cores. Sci Rep. 2018;8:504.
Varsadiya M, Urich T, Hugelius G, Bárta J. Microbiome structure and functional potential in permafrost soils of the Western Canadian Arctic. FEMS Microbiol Ecol. 2021;97:fiab008.
Koven CD, Lawrence DM, Riley WJ. Permafrost carbon-climate feedback is sensitive to deep soil carbon decomposability but not deep soil nitrogen dynamics. Proc Natl Acad Sci USA. 2015;112:3752–7.
Zhang X, Xu S, Li C, Zhao L, Feng H, Yue G, et al. The soil carbon/nitrogen ratio and moisture affect microbial community structures in alkaline permafrost-affected soils with different vegetation types on the Tibetan plateau. Res Microbiol. 2014;165:128–39.
Schädel C, Schuur EAG, Bracho R, Elberling B, Knoblauch C, Lee H, et al. Circumpolar assessment of permafrost C quality and its vulnerability over time using long-term incubation data. Glob Chang Biol. 2014;20:641–52.
Gilichinsky D, Rivkina E, Shcherbakova V, Laurinavichuis K, Tiedje J. Supercooled water brines within permafrost-an unknown ecological niche for microorganisms: a model for astrobiology. Astrobiology. 2003;3:331–41.
Zolkos S, Tank SE. Experimental evidence that permafrost thaw history and mineral composition shape abiotic carbon cycling in thermokarst-affected stream networks. Front Earth Sci. 2020;8:eaav0550.
Nemergut DR, Schmidt SK, Fukami T, O’Neill SP, Bilinski TM, Stanish LF, et al. Patterns and processes of microbial community assembly. Microbiol Mol Biol Rev. 2013;77:342–56.
Crowther TW, van den Hoogen J, Wan J, Mayes MA, Keiser AD, Mo L, et al. The global soil community and its influence on biogeochemistry. Science. 2019;365:6525.
Holm S, Walz J, Horn F, Yang S, Grigoriev MN, Wagner D, et al. Methanogenic response to long-term permafrost thaw is determined by paleoenvironment. FEMS Microbiol Ecol. 2020;96:fiaa021.
Ernakovich JG, Barbato RA, Rich VI, Schädel C, Hewitt RE, Doherty SJ, et al. Permafrost thaw and microbiome assembly. Glob Change Biol. 2022;28:5007–26.
Jorgenson MT, Romanovsky V, Harden J, Shur Y, O’Donnell J, Schuur EAG, et al. Resilience and vulnerability of permafrost to climate change. Can J Res. 2010;40:1219–36.
Hugelius G, Loisel J, Chadburn S, Jackson RB, Jones M, MacDonald G, et al. Large stocks of peatland carbon and nitrogen are vulnerable to permafrost thaw. Proc Natl Acad Sci USA. 2020;117:20438–46.
Jorgenson MT, Douglas TA, Liljedahl AK, Roth JE, Cater TC, Davis WA, et al. The roles of climate extremes, ecological succession, and hydrology in repeated permafrost aggradation and degradation in fens on the Tanana flats, Alaska. J Geophys Res Biogeosci. 2020;125:e2020JG005824.
Trubl G, Kimbrel JA, Liquet-Gonzalez J, Nuccio EE, Weber PK, Pett-Ridge J, et al. Active virus-host interactions at sub-freezing temperatures in Arctic peat soil. Microbiome. 2021;9:208.
Drake TW, Wickland KP, Spencer RGM, McKnight DM, Striegl RG. Ancient low-molecular-weight organic acids in permafrost fuel rapid carbon dioxide production upon thaw. Proc Natl Acad Sci USA. 2015;112:13946–51.
James SR, Minsley BJ, McFarland JW, Euskirchen ES, Edgar CW, Waldrop MP. The biophysical role of water and ice within permafrost nearing collapse: Insights from novel geophysical observations. J Geophys Res Earth Surf. 2021;126:e2021JF006104.
Marushchak ME, Kerttula J, Diáková K, Faguet A, Gil J, Grosse G, et al. Thawing yedoma permafrost is a neglected nitrous oxide source. Nat Commun. 2021;12:7107.
Chodkowski JL, Shade A. Exometabolite Dynamics over Stationary Phase Reveal Strain-Specific Responses. mSystems. 2020;5:e00493–20.
Coolen MJL, Orsi WD. The transcriptional response of microbial communities in thawing Alaskan permafrost soils. Front Microbiol. 2015;6:197.
Burkert A, Douglas TA, Waldrop MP, Mackelprang R. Changes in the active, dead, and dormant microbial community structure across a Pleistocene permafrost chronosequence. Appl Environ Microbiol. 2019;85:e02646–18.
Carini P, Marsden PJ, Leff JW, Morgan EE, Strickland MS, Fierer N. Relic DNA is abundant in soil and obscures estimates of soil microbial diversity. Nat Microbiol. 2016;2:16242.
Hansen AA, Herbert RA, Mikkelsen K, Jensen LL, Kristoffersen T, Tiedje JM, et al. Viability, diversity and composition of the bacterial community in a high Arctic permafrost soil from Spitsbergen, Northern Norway. Environ Microbiol. 2007;9:2870–84.
Ramírez GA, Jørgensen SL, Zhao R, D’Hondt S. Minimal influence of extracellular DNA on molecular surveys of marine sedimentary communities. Front Microbiol. 2018;9:2969.
IPCC. Climate Change 2022: Impacts, Adaptation, and Vulnerability. Contribution of Working Group II to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (Pörtner H-O, Roberts DC, Tignor M, Poloczanska ES, Mintenbeck K, Alegría A, eds.). Cambridge University Press: Cambridge University Press, Cambridge, UK and New York, NY, USA, 2022;3056. https://doi.org/10.1017/9781009325844.
Research was made possible by the NASA exobiology grant NNX15AM12G to RM and NNH15AB58I to MPW. RM was also supported by the National Science Foundation (NSF) award DEB2029573. The U.S. Geological Survey Climate and Land Use Research and Development Program provided funding to MPW and MCL. The National Institute of General Medical Sciences (NIGMS) Building Infrastructure Leading to Diversity (BUILD) Initiative, grant number RL5GM118975, provided support to RM. Funding for this work was provided to NT by the Office of Biological and Environmental Research in the DOE Office of Science – Early Career Research Program. SL was supported by the European Commission’s Marie Sklowdowska Curie Actions Program (grant #675546) and the German Ministry of Education and Research (grant #03G0836B). TAD was funded by the Department of Defense’s Strategic Environmental Research and Development Program (Project RC‐2110) and the U.S. Army Corps of Engineers Engineer Research and Development Center’s Basic Research (6.2) Program under 0602182 A/Network C3I Enabling Technologies. Special thanks to Chris Conaway for assistance with permafrost chemical analyses. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Waldrop, M.P., Chabot, C.L., Liebner, S. et al. Permafrost microbial communities and functional genes are structured by latitudinal and soil geochemical gradients. ISME J (2023). https://doi.org/10.1038/s41396-023-01429-6