Introduction

Permafrost underlies 26% of terrestrial ecosystems (Williams and Smith, 1989) and is estimated to contain around 15% of the world's soil carbon (much of which is frozen as poorly decomposed plant remains) (Post et al., 1982). Recent modeling of permafrost indicates that as much as 90% of the near-surface Arctic permafrost could thaw by 2100 (Lawrence and Slater, 2005), which could lead to an increase in atmospheric CO2 of roughly 25–50 p.p.m (Ping et al., 2008). The fate of the carbon stocked in permafrost-affected soils will partly depend on the local environmental conditions, as the relative production of methane and carbon dioxide varies according to soil water/oxygen content. The genetic potential of the microorganisms present in permafrost is also likely to determine whether permafrost environments will become net sources or sinks of greenhouse gases following warming. However, the microbiology of permafrost remains relatively unexplored (reviewed in Steven et al. 2006, 2009; Wagner, 2008). The concerns about potential large-scale positive feedback loops on global warming make it crucial to gain in-depth knowledge of the microbes inhabiting permafrost environments.

Processes that will have direct influences on greenhouse gas emissions, such as methane production, methane oxidation and organic matter decomposition, are all mainly carried out by soil microorganisms. The production of CH4 is solely driven by a small group of strictly anaerobic organisms belonging to the Euryarchaeota, which were proven to be viable inhabitants of permafrost (Wagner et al., 2005; Ganzert et al., 2007; Rivkina et al., 2007). The biological oxidation of methane by methane-oxidizing bacteria (methanotrophs) was shown to be a major sink for methane in the permafrost habitats (Trotsenko and Khmelenina, 2005). However, most of the previous microbiological studies of permafrost focused on 16S rRNA genes, on a few functional genes, on the measurement of general processes or on the isolation and discovery of new microbial species (reviewed in Steven et al. 2006, 2009; Wagner, 2008). Highly parallelized methods such as next-generation sequencing can target all functional genes present in a given sample at once, providing an in-depth assessment of the genetic potential of the microorganisms of an environment.

The main objectives of this study were (1) to describe the genetic potential of indigenous permafrost microorganisms and (2) to identify the functional and phylogenetic differences between permafrost and its overlying active layer soil. To attain these objectives, we sequenced the metagenomes of two samples from a single core from Ellesmere Island in the Canadian high Arctic (core Eur3, Steven et al., 2008) using 454 GS FLX Titanium pyrosequencing. We complemented these results with quantitative PCR (qPCR) analyses targeting a range of genes from different microbes and with microarray analyses of the 16S rRNA genes of microorganisms frequently found in cold environments.

Materials and methods

Permafrost samples

The core (Eur3, Steven et al., 2008) consisted in a continuous core from the surface soil up to 14-m deep permafrost from a single-drill hole, which was collected in May 2003 near Eureka, in the Canadian high Arctic (80°0.029 N, 85°50.367 W). Coring equipment, control of microbiological contamination during coring and subsampling was as previously described (Juck et al., 2005). Core samples were kept frozen during transport and stored at −20 °C until further analysis. The core was sectioned at 1-m depth intervals, and the surface (active layer) and 2-m deep (2-m permafrost) subsections were used in this study. Soil subsamples for molecular analyses were collected under sterile conditions from the center of the core. The active layer soil and the permafrost had the following characteristics (active and 2-m permafrost): pH: 7.9 and 7.8; salinity: 1.5 and 3.1 mEq per 100 g soil dry weight; gravimeteric moisture content: 12.7% and 27.0%; total C: 3% and 1.4%; total N: 0.3% and 0.1% (Steven et al., 2008). The depth of the summer thaw in Eureka is 50–70 cm and the estimated age of the permafrost is 5000–6000 years.

DNA extraction, amplification and 454 sequencing

DNA was extracted from a 10 g subsample using the MoBio PowerMax Soil DNA Isolation kit (MoBio Laboratories, Carlsbad, CA, USA). From the DNA quantities extracted from 10 g of the material, it was calculated that to gather enough DNA for direct 454 shotgun library preparation (that is, 4–5 μg DNA), it would have required >1 kg of the material for the 2-m permafrost sample. This amount of material was not available, hence DNA amplification was necessary. Extracted DNA was then subjected to multiple displacement amplification (MDA) using the illustra GenomiPhi V2 DNA amplification kit (GE Healthcare, Piscataway, NJ, USA) following the manufacturer's instructions for chemical denaturation, except that volumes were increased to 50 μl and amplification was carried out for 4 h. Several amplification reactions were pooled to obtain sufficient DNA for sequencing. Amplified DNA was purified using the QiaEx II Gel Extraction kit (Qiagen, Valencia, CA, USA). Roche 454 GS FLX Titanium sequencing (454 Life Sciences, Branford, CT, USA) was performed at The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, ON, Canada.

Bioinformatic analyses

Replicated sequences were first removed from the data set using the method of Gomez-Alvarez et al. (2009). After this first cleanup step, sequences were assembled in contigs using the Phrap software (http://www.phrap.org/). All contigs and singletons were then annotated using the metagenomics (MG)-RAST server (Meyer et al., 2008). For comparison purposes, the raw (unfiltered and unassembled) sequence data sets were also annotated in MG-RAST. The maximum e-value for a significant match was set to 10−5 and the minimum alignment length was set to 50 bp.

PMA treatments

Propidium monoazide (PMA) intercalates into DNA and can be covalently cross-linked to DNA with light exposure, which inhibits subsequent enzymatic reactions. Intact cell membranes are impermeable to PMA and this molecule can thus be used to differentiate naked DNA and dead cells from bacteria with an intact cell membrane (Nocker et al., 2007). To test whether the DNA extracted from the 2-m permafrost and the active layer soil was from viable cells, the samples were treated with PMA following the concentrations and conditions given for soils in previous studies (Pisz et al., 2007; Bae and Wuertz, 2009). The material (10 g) was mixed with 10 ml sterile ultrapure water and 50 μl of 20 mM PMA. This mixture was incubated in the dark for 5 min, shaking manually every minute. The samples were then placed on ice and exposed for 5 min to a 500 W halogen lamp placed 20 cm from the tubes, shaking manually every minute. No-PMA controls were subjected to the same procedure but without adding PMA. DNA was then extracted directly from the samples as described above. The effect of PMA treatments was evaluated by qPCR analyses at the phylum or class level, as described below.

Real-time qPCR

qPCR was performed on unamplified DNA samples (unless stated otherwise) in 25 μl volumes using the iQ SYBR green Supermix (Bio-Rad Laboratories, Hercules, CA, USA) on a Rotor-Gene 3000 apparatus (Corbett Life Science, Sydney, NSW, Australia) as previously described (Yergeau et al., 2009a). The qPCR conditions and primers are given in Supplementary Table S1. Standards were made from 10-fold dilutions of linearized plasmids containing the gene fragment of interest that was cloned from amplified soil DNA. Phage λ DNA was used to evaluate the presence of PCR inhibitors as previously described (Yergeau et al., 2009a). The inhibition was comparably low for both DNA extracts (active: 1.50%, 2-m permafrost: 2.50%). qPCR values for all other assays were corrected accordingly.

16S rRNA genes microarray

A microarray platform targeting the 16S rRNA genes of Bacteria and Archaea found in cold environments was used as previously described (Yergeau et al., 2009a). To summarize, the 16S rRNA genes were amplified from the raw DNA extracts, chemically labeled and hybridized overnight onto the microarray. Details about the microarray and probe design are available at the National Center for Biotechnology Information (NCBI) GEO database under the platform accession number GPL8953. The presence–absence of the different taxa targeted by the microarray was then used as a community profile to evaluate the similarity between the active layer soil and 2-m permafrost, and to contrast it with previously assessed pristine Arctic surface soil samples from the same region (Yergeau et al., 2009a).

Statistical analyses

All statistical analyses were performed in R (version 2.9.0, The R Foundation for Statistical Computing). Distance matrices were calculated using the ‘vegdist’ function of the ‘vegan’ library and principal coordinate analyses were carried out using the ‘cmdscale’ function based on the square root of 1-Jaccard (16S rRNA gene microarray) or on the square root of Bray-Curtis (MG-RAST subsystems). For the MG-RAST subsystems, the sums of the relative abundance of selected level 2 subsystems were added to the ordination as supplementary variables that were not involved in the calculation of the ordination.

Data deposition

The microarray data discussed in this publication have been deposited in NCBI GEO (Edgar et al., 2002) and are accessible through GEO series accession number GSE20073 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE20073). The sequence data generated in this study was deposited in the Sequence Read Archive of NCBI and are accessible through accession number SRA010889.6.

Results

Metagenomic sequence statistics

Sequencing resulted in 1 103 48 and 1 024 347 sequences for the active layer soil and 2-m permafrost, respectively (Supplementary Table S2). Relatively large proportions (10.3–65.6%) of these sequences were replicates, which were removed from the data set, as they arise from the attachment of DNA to empty beads during emulsion PCR and were not derived independently from the community DNA (Gomez-Alvarez et al., 2009). Only the longest representative of each replicate group was kept for downstream analyses. For the active layer soil, assembly resulted in 125 722 contigs and 276 150 singletons for a total of 401 872 sequences. For the 2-m permafrost, assembly resulted in 8601 contigs and 60 834 singletons for a total of 69 435 sequences. These data sets (contigs+singletons) were used in most downstream analyses.

Phylogenetic community composition

There were significant matches from the active layer (50.76% of all sequences) and the 2-m permafrost (53.50% of all sequences) to organisms present in the MG-RAST database (based on all genes). These sequences mapped to 929 (active layer) and 761 (2-m permafrost) different organisms. Among these, 711 organisms were detected in both samples with the remaining 268 organisms being unique to one sample or the other. Owing to biases related to MDA (see below), the community composition was not calculated from the number of sequences present in the metagenomic data sets. The 16S rRNA gene sequences were not used for community composition assessment primarily not only for the same reason but also because of the low number of 16S rRNA genes recovered (86 bacterial genes for the active layer and 11 bacterial sequences for the 2-m permafrost, with no Archaea sequences for both layers). To have an idea of the phylogenetic community composition in the permafrost samples, a range of phyla or classes was assessed using qPCR. According to qPCR analyses, bacteria dominated the microbial communities of the active layer soil and of the 2-m permafrost (Table 1). Archaea and Fungi were detected in much lower abundance (200–10 000 times less; Table 1). Among the bacterial phyla or classes measured, Actinobacteria was dominant in both samples (10–25% of the total community), with a codominance of Betaproteobacteria (15%) in the 2-m permafrost (Figure 1a and Table 1). The next most abundant groups were Bacteroidetes and Alphaproteobacteria (5%), whereas Acidobacteria and Firmicutes were the least abundant groups measured (1–2%) (Figure 1a and Table 1).

Table 1 Real-time PCR quantification results for the active layer soil and the 2-m permafrost
Figure 1
figure 1

Phylogenetic community composition of the active layer soil and 2-m permafrost. (a) Percentage of total bacterial abundance for different phyla or classes based on qPCR quantification for MDA-amplified and unamplified DNA extracted from the active layer soil or 2-m permafrost. Alpha-: Alphaproteobacteria, Beta-: Betaproteobacteria. (b) Principal coordinate analysis based on Jaccard similarity calculated from hybridization patterns of DNA extracted from the active layer soil (), the 2-m permafrost (Δ), four Eureka surface soils (•) and from four Alert surface soils (▪) using a 16S rRNA gene microarray targeting microorganisms found in cold environments. The hybridization patterns of the Eureka and Alert soils are from a previous study (Yergeau et al., 2009a).

A microarray targeting the 16S rRNA genes of microorganisms frequently found in cold environments was used as a ‘cold-adapted’ community fingerprinting method. Even though some genera were retrieved using both methods, the microarray platform is completely independent of the metagenomic data set and was used to confirm the similarity in community structure observed between the two samples. The hybridization patterns of other pristine surface soil samples from the same location (Eureka) and from another location on Ellesmere Island (Alert) from a previous study (Yergeau et al., 2009a) were used here as a comparison for the active layer soil and 2-m permafrost. The active layer DNA hybridized to 33 probes and the 2-m permafrost DNA hybridized to 23 probes. Of these probes, 12 were scored as present in the two samples. The hybridization patterns were used to construct a principal coordinate analysis ordination in which the second axis separated samples from Eureka (including the active layer soil and 2-m permafrost) from the Alert soils (Figure 1b). Among the Eureka soils, the active layer soil and 2-m permafrost did not cluster together and were distributed among the other soils from that location (Figure 1b).

Functional community composition

On the basis of the relative abundance of the different subsystems (sets of related functional genes) represented in MG-RAST, we created an ordination of the active layer and permafrost samples (both the assembled-filtered and the raw sequence data sets) together with several other environmental metagenomic data sets publicly available in MG-RAST (Figure 2). In the ordination, the permafrost samples all clustered closely together, apart from all other samples. This clustering was principally because of a higher relative abundance of subsystems related to the response to different stresses, to pathogenicity, toxicity and to the degradation of simple sugars (Figure 2). The only other soil sample (Waseca farm soil) included in the analysis was the most closely related to our samples. Other samples were clearly separated from the active layer and permafrost samples on the first or second ordination axis. This analysis revealed that active layer soil and 2-m permafrost had highly similar levels of functional genes as compared with other samples, and that the filtering and assembly procedures did not cause significant shifts in the functional community composition. These similarities were well mirrored in the number of subsystems detected in each of the samples. For instance, 635 and 620 subsystems were detected (after assembly) in the active layer soil and 2-m permafrost, respectively, among which 576 susbsystems were found in both samples (representing 90% of the detected subsystems). Similarly, in the active layer data set before assembly, 598 subsystems were detected, among which 589 (98.5%) were also found in the assembled data set. For the 2-m permafrost, 613 subsystems were detected in the metagenomic data set before assembly, among which 598 subsystems (97.6%) were also found in the assembled data set.

Figure 2
figure 2

Functional community composition of the active layer soil and the 2-m permafrost. Principal coordinates analysis based on the relative abundance of all MG-RAST subsystems for assembled and raw active layer soil and 2-m permafrost as compared to other publicly available metagenomes. The arrows represent the relative abundance of level 2 subsystems and were added as supplementary variables after the analysis. EBPR, enhanced biological phosphorus removal bioreactors; AMD, acid mine drainage; A, active layer soil; 2 m, 2-m permafrost.

Among the many genes detected in the metagenomic libraries, we focused our analyses on methane generation and oxidation, organic matter decomposition and nitrogen cycle genes that might have particular importance in permafrost related to climatic changes. Methanogens were detected in both samples. In the active layer, among the 6763 sequences that mapped to Archaea, 2035 (30.1%) were related to the Methanomicrobia, 207 (3.1%) to the Methanobacteria, 204 (3.0%) to the Methanococci and 38 (0.6%) to the Methanopyri. For the 2 m permafrost, 633 sequences were related to the Archaea, among which 242 (38.2%) were related to the Methanomicrobia, 30 (4.7%) to the Methanobacteria, 23 (3.6%) to the Methanococci and 2 (0.3%) to the Methanopyri. Several genes related to the generation of methane from the different genera mentioned above were detected (Table 2). The central gene in methane generation, methyl-coenzyme M reductase (mcr) was not detected in the metagenomic libraries. However, qPCR analyses revealed that this gene was present at low concentrations in both samples (Table 1).

Table 2 Number of different gene variants retrieved in the metagenomic data sets for different functions in the active layer soil and the 2-m permafrost

Type I methanotrophs were detected in metagenomic libraries: 269 and 56 genes were related to Methylococcaceae (type I) in the active layer and 2-m permafrost samples, respectively. However, no genes related to type II methanotrophs were detected in the metagenomic libraries. Although type II methanotrophs were detected in large numbers by qPCR, quantification of the 16S rRNA gene of types I and II methanotrophs indicated that type I methanotrophs were dominant in the active layer and 2-m permafrost samples (Table 1). No pmoA sequences were detected in the metagenomic data set, but pmoA-specific qPCR assays detected this gene in the active layer soil and 2-m permafrost (Table 1).

Several genes related to the degradation of carbon compounds were detected in the permafrost and active layers, including trehalase, chitinase, β-glucosidase and β-galactosidase (Table 2). These genes were from a diverse range of organisms (10–70 different organisms), indicating that several members of the microbial community can carry out the functions corresponding to these genes. qPCR confirmed the presence of chitinase genes, but because of the lack of a reliable standard, the exact copy number of this gene could not be evaluated. The cycle threshold (Ct) values were of 18.03 and 26.78 cycles for the active layer soil and 2-m permafrost, respectively. As mentioned above, the Actinobacteria, known for their capacity to degrade complex organic matter, were detected in high numbers in both samples. As compared to other published metagenomic data sets, active layer and permafrost habitats contained relatively more genes related to the degradation of sugar alcohols, aminosugars, mono-, di- and oligosaccharides (Figure 2).

Almost all the genes related to the different steps of the N-cycle were detected in both samples (Table 2). A single type of nitrogenase (nifH), belonging to Mesorhizobium loti (Alphaproteobacteria, active layer) or to Anabaena variabilis (Cyanobacteria, 2-m permafrost) was detected in each of the two samples. qPCR quantification of the nifH gene confirmed the presence of this gene in the active layer soil and 2-m permafrost (Table 1). Although ammonia monooxygenase, the key gene in ammonia oxidation, was only detected in the active layer metagenomic data set, several recognized ammonia-oxidizing Bacteria and Archaea were detected in both samples: Nitrosomonas (Betaproteobacteria), Nitrospira (Betaproteobacteria), Nitrosococcus (Gammaproteobacteria) and Nitrosopumilus (Creanarcheota). qPCR analysis of bacterial and archaeal amoA genes confirmed the presence of these genes in both samples (Table 1). The archaeal-to-bacterial amoA ratio (measured by qPCR) was 31.3 for the active layer and 5.1 for the 2-m permafrost.

Evaluation of biases related to MDA and the presence of naked DNA

The low biomass present in the 2-m permafrost sample made it necessary for 454 pyrosequencing to amplify the extracted DNA using MDA. The representational biases associated with such a procedure were evaluated to assess the quantitative power of the data set following amplification. MDA-amplified DNA extracts were compared with the unamplified DNA extracts using qPCR with primers targeting the 16S rRNA genes of six different phyla or classes. The resulting quantitative results indicated that MDA amplification caused major biases in the community composition at the phylum or class level. For instance, MDA decreased the proportion of Actinobacteria, whereas it increased the proportion of Bacteroidetes for both samples (Figure 1a). For the 2-m permafrost sample, MDA amplification also increased the proportion of Alphaproteobacteria and Firmicutes, whereas it decreased the proportion of Betaproteobacteria (Figure 1a). To evaluate what part of the analyzed DNA was extracted from cells with an intact membrane compared with naked DNA, we treated samples with PMA before DNA extraction. Using the same set of qPCR analyses at the phylum or class level, the comparison of PMA-treated samples with untreated samples did not reveal any major differences (Supplementary Figure S1).

Discussion

This study represents the first metagenomic analysis of permafrost samples. We deliberately choose to focus our sequencing effort on a single core represented by two samples, which allowed us to achieve in-depth comparisons between permafrost and its overlying active layer soil. Owing to the low biomass available in these samples, whole-community genome amplification was carried out before sequencing and caused a representational bias in our data set. This representational bias was seen as an acceptable compromise to access the metagenomic information of permafrost, and was partly counterbalanced by complementary analyses that were performed on the raw DNA extracts.

Phylogenetic and functional community composition

On the basis of qPCR quantification of the unamplified DNA, Actinobacteria appeared as the dominant phylum in the active layer soil, whereas Actinobacteria and Betaproteobacteria codominated the 2-m permafrost. This is in very good agreement with the 16S rRNA genes clone libraries that were constructed from the same samples (Steven et al., 2008) and from a neighboring 9-m deep permafrost sample (Steven et al., 2007). These results also generally agree with the community composition reported from other polar soils (Kobabe et al. (2004); Liebner et al. (2008); Wagner et al. (2009); Yergeau et al. (2007, 2009b)). Non-spore forming Actinobacteria were reported to dominate ancient permafrost potentially because of their metabolic activity at low temperature and the presence of DNA-repair mechanisms (Johnson et al., 2007). Their abundance in the samples assessed here seems to confirm that Actinobacteria are well adapted to the conditions occurring in permafrost habitats.

Compared with other publicly available metagenomic data sets (downloaded from MG-RAST) or to other surface soils from the same region (Yergeau et al., 2009a), the active layer soil and the 2-m permafrost had highly similar functional and phylogenetic community compositions. It therefore appears that the microorganisms present in the permafrost layer are a representative subset of the microorganisms present in the active layer and that, consequently, the active layer could be a good proxy to study the microbiology of the near-surface permafrost layers of this site. The long-term exposure to low temperature, low water and nutrient availability probably reduced microbial diversity and abundance in the permafrost, also causing some shifts in the community composition. The larger environmental fluctuations occurring in the active layer soil as compared with the permafrost might also have caused some of the differences observed in community composition. For instance the Actinobacteria, Alphaproteobacteria and Bacteroidetes were less abundant in permafrost in comparison with the active layer and, in contrast, some other groups were more abundant (for example, the Betaproteobacteria and the Firmicutes) in the permafrost. These latter groups might better resist to long-term exposure to subzero temperatures, take better advantage of the limited availability of both nutrients and water, and/or, in the case of the Firmicutes, form spores. In summary, the microorganisms inhabiting the 2-m permafrost sample studied here are most likely originating from the overlying active layer, as was previously suggested for Antarctic permafrost (Gilichinsky et al., 2007). Although it would have to be confirmed using a single core, similarities in community composition (that is, dominance by Actinobacteria and Proteobacteria) that were observed with a neighboring 9-m deep permafrost sample (Steven et al., 2007) indicate that this relationship between the active layer soil and the permafrost might also hold for deeper permafrost layers.

Functional genes and associated communities

The microorganisms that degrade organic matter in soil are normally highly diverse and the permafrost-affected soils studied here were no exception. The dominant phylum, Actinobacteria, contains members that are recognized as degraders of cellulose, chitin and other complex C compounds, which gives them a central role in the carbon cycle and in the turnover of organic matter. In general, the active layer soil and permafrost microorganisms observed in our core seemed particularly well adapted to the degradation of soil carbon, especially when compared with other environments. This indicates that permafrost thawing at this site could rapidly lead to CO2 emissions after the aerobic degradation of soil carbon.

Recognized methanogens and methanotrophs were detected in the active layer soil and permafrost, but they only formed a minor part of the total microbial community. Changes in environmental conditions could however increase the activity and abundance of these groups and lead to CH4 emissions from this site. Indeed, methanogens are influenced by the quality of soil organic carbon (Wagner et al., 2005; Ganzert et al., 2007), by water regime (Hoj et al., 2006) and temperature (Wagner et al., 2007). In this study, both qPCR and metagenomic analyses showed that type I methanotrophs were more abundant than type II methanotrophs at the two depths of the core analyzed. This is in line with previous studies from the Canadian Arctic, where type II methanotrophs were not detected among the active microbial community (Martineau et al., 2010). Type II methanotrophs are regularly isolated and can be dominant in Siberian permafrost (Liebner and Wagner, 2007), but PLFA-SIP showed that the communities that were active at low in situ temperatures were dominated by type I methanotrophs (reviewed by Wagner, 2008).

Previous studies suggested that Arctic terrestrial environments are generally nitrogen limited (Shaver and Chapin, 1980; Martineau et al., 2010). In that context, the microorganisms responsible for nitrogen fixation and nitrification in permafrost might have an important role in the microbial response to global warming. Nitrogen deficiency could indeed limit the decomposition of organic matter and the subsequent release of greenhouse gases, but this limitation could be relieved if nitrogen fixers and ammonia oxidizers increase their activities after warming. Improved soil nitrogen status in Arctic soils could also increase plant growth and C fixation, counterbalancing a part of the soil C emissions (Mack et al., 2004). Both the nitrogen fixation and ammonia oxidation genes exhibited low diversity in the metagenomic libraries, with one N-fixing species and four ammonia-oxidizer species being detected in each of the samples. This indicates a potentially low redundancy in these functions, which could be vulnerable following acute changes in environmental conditions at this site. However, nifH and amoA genes were relatively abundant in the core sampled following qPCR assays, indicating that organisms harboring these genes are already present in large numbers and could rapidly increase nitrogen content of the core if provided with favorable environmental conditions. Interestingly, archaeal ammonia oxidation genes were dominant (10-fold greater), as previously reported in a range of other soils (Leininger et al., 2006). This dominance was lower in the 2-m permafrost as compared with the active layer soil, indicating some differences in the ammonia-oxidizing community between the two samples.

Experimental biases

We used PMA to differentiate naked DNA from intracellular DNA in the 2-m permafrost and the active layer soil. This is a critical issue when looking at permanently cold environments, as naked DNA can survive for up to 400 000 years in permafrost (Willerslev et al., 2004). Our results indicated that the presence of naked DNA did not cause strong biases in the community composition observed. It therefore appears probable that the majority of the DNA that was extracted from the 2-m permafrost and the active layer soil was from viable cells. It should be stressed that PMA treatment does not indicate that the organisms are active, but that they are viable.

Permafrost environments have low microbial biomass and consequently, it is very difficult, if not impossible, to collect large amounts of DNA from permafrost core samples. The representational biases caused by MDA were therefore seen as a tolerable concession to access, for the first time, the genomic information of permafrost. We accordingly adjusted our metagenomic data analysis to be conservative and to focus only on the presence or absence of different genes, that is, the genetic potential contained within the samples. We also performed complementary analyses on the unamplified DNA to confirm and quantify the trends observed in the metagenomic libraries. Recent studies reported that MDA caused relatively little biases for the sequencing of single genomes (Pinard et al., 2006), but indicated that MDA caused biases in community composition when used on mixed communities (Abulencia et al., 2006; Bodelier et al., 2009). qPCR analyses indicated here that the bias in community composition introduced by MDA is quite strong. In fact, it markedly changed the community composition: from an Actinobacteria-dominated community in the unamplified samples to a Bacteroidetes-dominated community in the amplified samples. It is uncertain whether these biases are specific to permafrost samples or whether they will be present in other types of environmental samples. It has already been suggested that these representational biases could be partly related to the GC content of the amplified genomes (Bodelier et al., 2009), which is in line with the fact that Actinobacteria (high G+C) were systematically underestimated and Firmicutes (low G+C) were overestimated following MDA in our study. These results clearly call for more caution in interpreting quantitative results from metagenomic studies that use whole-community genome amplification. More methodological research is needed in this field, as many environmental samples of interest are too small or too precious to yield the large quantities of DNA required for next-generation sequencing.