Gut microbiome comparability of fresh-frozen versus stabilized-frozen samples from hospitalized patients using 16S rRNA gene and shotgun metagenomic sequencing

Ilett, Emma E.; Jørgensen, Mette; Noguera-Julian, Marc; Daugaard, Gedske; Murray, Daniel D.; Helleberg, Marie; Paredes, Roger; Lundgren, Jens; Sengeløv, Henrik; MacPherson, Cameron

doi:10.1038/s41598-019-49956-7

Download PDF

Article
Open access
Published: 16 September 2019

Gut microbiome comparability of fresh-frozen versus stabilized-frozen samples from hospitalized patients using 16S rRNA gene and shotgun metagenomic sequencing

Emma E. Ilett¹,
Mette Jørgensen¹,
Marc Noguera-Julian ORCID: orcid.org/0000-0002-6194-1395²,
Gedske Daugaard³,
Daniel D. Murray¹,
Marie Helleberg¹,
Roger Paredes ORCID: orcid.org/0000-0002-6553-691X^2,4,
Jens Lundgren¹,
Henrik Sengeløv⁵^na1 &
…
Cameron MacPherson¹^na1

Scientific Reports volume 9, Article number: 13351 (2019) Cite this article

6229 Accesses
12 Citations
16 Altmetric
Metrics details

Subjects

Abstract

Collection of faecal samples for microbiome analysis in acutely sick patients is logistically difficult, particularly if immediate freezing is required (i.e. fresh-frozen, or FF sampling). Previous studies in healthy/non-hospitalized volunteers have shown that chemical stabilization (i.e. stabilized-frozen, or SF sampling) allows room-temperature storage with comparable results to FF samples. To test this in a hospital setting we compared FF and SF approaches across 17 patients undergoing haematopoietic stem cell transplantation (HSCT) using both 16S rRNA gene and shotgun metagenomic sequencing. A paired (same stool specimen) comparison of FF and SF samples was made, with an overall comparable level in relative taxonomic abundances between the two sampling techniques. Though shotgun metagenomic sequencing found significant differences for certain bacterial genera (P < 0.001), these were considered minor methodological effects. Within-sample diversity of either method was not significantly different (Shannon diversity P_16SrRNA = 0.68 and P_shotgun = 0.89) and we could not reject the null hypothesis that between-sample variation in FF and SF were equivalent (P_16SrRNA = 0.98 and P_shotgun = 1.0). This indicates that SF samples can be used to reliably study the microbiome in acutely sick patient populations, thus creating and enabling further outcomes-based metagenomic studies on similarly valuable cohorts.

Critical evaluation of faecal microbiome preservation using metagenomic analysis

Article Open access 05 May 2021

Human faecal collection methods demonstrate a bias in microbiome composition by cell wall structure

Article Open access 14 November 2019

Minor compositional alterations in faecal microbiota after five weeks and five months storage at room temperature on filter papers

Article Open access 12 December 2019

Introduction

It is becoming increasingly clear that analysis of the human gut microbiome can reveal critical aspects of human disease pathogenesis, as well as the variation of treatment responses among a variety of patient groups^1,2,3. As a result, a growing number of hospitals are trying to establish clinical biobanks of stool samples for use in downstream research projects.

Currently, the considered “gold standard” for microbiome studies suggests that stool samples be collected and then undergo immediate DNA-extraction or at least immediate freezing at −20/−80 °C. However, it is very difficult to comply with these conditions in a hospital setting, where even refrigeration of samples is difficult due to limited resources, irregular sampling time-points or patient visits, as well as the need to prioritize patient care. Additionally, complete capture and follow-up of many patient populations would require at-home sampling of out-patients, adding further variability to the time between sampling and freezing. Therefore, the ideal sample collection method within hospitals would rather allow for samples to be kept at room temperature prior to freezing. This would enable at-home sample collection for out-patients, no need for urgent hospital carriers for sample delivery, and no requirement for same-day DNA-extraction.

Although chemical stabilization followed by a delayed time to freezing (stabilized-frozen freeze (stable-freeze, or SF) appears to be a good alternative to direct freezing (fresh-freeze, or FF) or fresh DNA-extraction (fresh-extract, or FE), all studies to date assessing its use are based on cohorts of non-hospitalized/healthy volunteers^{4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22}. In particular, there is an open question as to the use of SF protocols when microbiomes are expected to have low diversity⁸. It is therefore currently uncertain whether or not results from previous method studies can be applied to samples collected in a hospital, where low diversity gut microbiomes are likely. Additionally, there have been little data evaluating the effect of SF methods using shotgun metagenomic sequencing^4,7,14, which, unlike 16S rRNA gene sequencing data, provides detailed species level information. As bacterial species can differ in diseased populations, it is essential to collect this information and evaluate whether SF affects their compositions or not.

To address these remaining aspects of collection-associated bias within gut microbiomes of the diseased, we have conducted a study comparing FF samples to those collected using a SF protocol. This is the first study (to our knowledge) assessing the differences between these protocols using both 16S rRNA gene and shotgun metagenomic sequencing in a hospital setting. Our study was performed in a cohort of patients undergoing haematopoietic stem cell transplantation (HSCT) at Rigshospitalet, University of Copenhagen, Denmark. Samples were delivered to the Centre of Excellence for Personalised Medicine for Infectious Complications in Immune Deficiency (PERSIMUNE)²³ biobank, also located at Rigshospitalet. HSCT serves as a useful disease proxy as it is reserved for severely diseased patients and, critically, has been shown to heavily impact microbiome structure²⁴.

Results

A total of 26 paired stool samples were generated from 17 haematology patients undergoing myeloablative or non-myeloablative HSCT. Each stool sample pair consisted of 1 FF sample (i.e. freshly frozen at time of sampling), and 1 SF sample (i.e. chemically stabilized prior to freezing after a variable period no greater than 16 days). An overview of the study design, summarized baseline patient and sample information, as well as time from sampling to freezing can be found in Fig. 1. Specific patient/sample numbers with time from HSCT to sampling and freezing can be found in Supplementary Table S1.

Data quality

To ensure that data from both sampling methods were acceptable for analysis, all samples underwent quality control steps as described in the methods section.

16S rRNA gene sequencing data

One sample was removed due to a severe lack of overlap between paired-ends (a fundamental requirement of our analysis pipeline). Another sample was lost due to low sequence content (<10,000 unfiltered reads). We observed a minor batch effect across sample runs that slightly biased mean quality and read numbers by affecting reverse-end sequences. However, our pipeline largely removed this bias by merging paired ends and filtering (see Methods). All filtered samples had mean quality scores of over 34 with >10,000 reads and batch effects did not influence which samples passed quality control steps. To summarize, high quality data was collected for 24 of the 26 stool samples.

Shotgun metagenomic sequencing data

Seven samples were removed as their sequenced data did not contain sufficient information for further study, either due to poor quality, human contamination, or sequencing issues (threshold of 1 million reads). All samples had final mean quality scores of over 37 with > 3.5 million reads. To summarize, high quality data was collected for 21 of the 26 stool samples, resulting in a paired sample overlap of 76% (i.e. 76% of paired samples had both 16S rRNA gene and shotgun metagenomic sequencing data after quality control steps). The other 24% of samples had either 16S rRNA gene or shotgun metagenomic sequencing data available for analysis.

Alpha-diversity

We next assessed possible differences between methods for within-sample diversity (i.e. alpha-diversity).

16S rRNA gene sequencing data

We found a general increase of observed genera in the SF versus FF samples. However, this was not significant (Wilcoxon paired test, P = 0.11) and other alpha diversity measures showed no differences (Fig. 2a). The mean Simpson index was 0.78 (IQR 0.68-0.88) vs 0.77 (IQR 0.67-0.87) in FF and SF samples respectively. Similarly, the mean Shannon indices were likewise comparable at 2.30 (IQR 1.82-2.69) vs 2.25 (IQR 1.80-2.69) in FF and SF samples respectively. Indeed, there was no statistically significant difference in either alpha diversity measure (Wilcoxon paired test, P_Simpson = 0.81 and P_Shannon = 0.68).

Shotgun metagenomic sequencing data

As with 16S rRNA gene sequencing, the mean Simpson and Shannon indices were similar between FF and SF samples at 0.50 (IQR 0.13-0.74) vs 0.51 (IQR 0.43-0.70) and 1.15 (IQR 0.38-1.61) vs 1.16 (IQR 0.72-1.55), respectively (Fig. 2b). As expected from the IQRs there was no significant difference in either diversity measure (Wilcoxon paired test, P_Simpson = 0.73 and P_Shannon = 0.89).

Shotgun metagenomics gene richness

As an additional measure to assess diversity we also examined gene richness (the number of unique genes) in the shotgun metagenomic sequencing data. This measure is used to divide samples into those of low versus high gene richness, with those of high gene richness generally considered to have higher diversity.

There was a median of 5,507,430 reads per sample mapping to the IGC (Integrated Gene Catalogue)²⁵ (interquartile range (IQR) 4,460,836–6,047,576). Based on a rarefaction curve of mapped reads and the number of unique genes (Supplementary Fig. S1), samples with <5 million mapped reads were discarded for the gene richness analysis as their diversity was biased by sampling. The 12 remaining sample pairs had a median of 47,180 unique genes per sample (IQR 34,876–69,799). FF and SF samples had similar medians of 49,086 and 44,004, respectively (Fig. 3). There was no evidence of a statistically significant difference in gene richness between FF and SF samples (Wilcoxon paired test, P = 0.91). Two paired samples are notable for their unusually high gene richness (Fig. 3).

Beta-diversity

We next assessed the beta-diversity, or between-sample diversity, looking for changes in variance between paired samples and testing whether potential differences were driven by sampling method.

16S rRNA gene sequencing data

A non-metric multidimensional scaling (NMDS) plot based on Bray-Curtis distances showed a strong overlap of sampling method types and clustering of sample pairs (Fig. 4a). An ADONIS test on the Bray-Curtis distances showed no evidence of differences between SF and FF samples (P = 1.00).

Shotgun metagenomic sequencing data

As with the 16S rRNA gene sequencing data, the NMDS plot showed a strong overlap of sampling methods and clustering of sample pairs (Fig. 4b). The ADONIS again showed no evidence of differences between samples being based on sampling method (P = 0.67).

Taxonomy and relative abundance

To look more specifically at possible compositional differences, i.e. if certain bacteria were affected by sampling methods, we compared taxonomical differences between FF and SF samples by calculating the relative abundances of their genera.

16S rRNA gene sequencing data

Relative abundances of genera between FF and SF sample pairs appeared visually similar when examining bacteria present in the sample cohort (Fig. 5). This was verified statistically by paired DESeq2 analysis, which found no significant difference (α = 5%) in the relative abundances of all genera between FF and SF methods.

Shotgun metagenomic sequencing

As with the 16S rRNA gene sequencing data, relative abundances of genera were visually similar (Fig. 5) but the Wald test of paired DESeq2 highlighted statistically significant differences between FF and SF in the relative abundance of 7 genera (Actinomyces, Brevibacterium, Cryptobacterium, Propionibacteriaceae (unclassified), Propionibacterium, Ralstonia, Ruminococcus, P < 0.001 for all). These differences were, however, found in a maximum of 3/21 sample pairs per bacterial genus, with all but one sample pair having differences in abundancy of < 3% (Supplementary Fig. S2). This highlights the sensitivity and reproducibility of shotgun-based sequencing, but also demonstrates the lack of consistent or predictable change one would expect if driven by sampling methodology.

Discussion

In our study we show that SF is comparable to FF protocols in a diseased and hospitalized cohort using both 16S rRNA gene and shotgun metagenomic sequencing data. When comparing the alpha-diversity between sampling methods we found no significant difference between SF and FF samples. Differences between samples (based on Bray-Curtis distances) in our study were not driven by the different sampling methods. Both findings are in agreement with the current literature based on healthy cohorts^{5,6,7,8,9,16,22}.

One concern with the use of chemical stabilization was the introduction of biases with regard to changes in the relative abundance of certain bacteria. Two studies using 16S rRNA gene sequencing found a significant increase in the Sutterella genus in SF versus FF samples^5,9. While another study found significant differences in the relative abundances of Faecalibacterium, Sporobacter, Clostridium XVIII, and Clostridium XIVa, with differences being exacerbated in samples of low diversity⁸. However, we found no difference in the relative abundance of these or any other bacteria in our 16S rRNA gene sequencing analysis. In our shotgun metagenomic analysis there were minor differences in several bacteria driven only by a few sample pairs. This shows that differences in taxonomical abundance between methods are inconsistent across studies and may be due to background noise. Additionally, in the case of Hill et al., differences could also stem from comparing the SF protocol with directly processed samples (i.e. direct DNA extraction prior to freezing). Moreover, previous studies in healthy subjects support our conclusion being that there is little difference between SF and FF methods^{5,6,7,14,15,16,18}.

Limitations to our study may include the non-homogenisation of samples. It has been shown that sampling from different parts of a stool can give different microbial compositional results²⁰. However, stool sampling is random and therefore the paired design of our study should control for the non-homogenisation. Another potential limitation to our study is the fact that a number of FF samples were not frozen directly (Fig. 1c and Supplementary Table S1). However, all of these samples were kept at 4 °C until frozen. Previous studies have shown that cold storage can preserve microbial composition for a short period of time and up to 1 week^6,8,9,26,27. This difficulty in direct freezing highlights the need for methodologies that allow for more flexibility between sampling and freezing, as immediate freezing is not always possible in a hospital setting. Finally, there was a slight batch effect in our 16S rRNA gene sequencing data due to an uneven distribution of FF versus SF samples in sequencing runs. This mainly affected reverse-end sequence quality and related variance between batches was minimized during processing. All reads went through the same quality control process, as described in our methods and results, and there were no observed microbial compositional differences between samples run on different batches. Importantly, batch-free shotgun metagenomic data yielded markedly similar results to the 16S rRNA gene sequencing data.

In conclusion, chemical stabilization (SF) proved to be comparable to freshly frozen (FF) stool samples in a diseased, low microbial diversity, cohort. We confirmed this by statistical examination of measurements by two independent high-throughput sequencing technologies, namely 16S rRNA gene and shotgun metagenomic sequencing. We further showed that use of stabilization techniques maintains reproducibility in cross-sectional studies by minimizing technical variance in sample handling. Our results support the use of chemical stabilization to increase the feasibility of hospital biobank stool sample collection, which will help facilitate the collection of longitudinal faecal microbiome samples from diseased patient groups.

Methods

Sample collection

Two samples (i.e. aliquots) were taken from each stool by the patient or nursing staff at the same time-point. The EasySampler® Stool Collection Kit was used to hold the stool above toilet water during collection. If patients were bedridden, samples were collected directly from stool in bedpans. No stool was stored prior to sampling (i.e. sampling was done directly after defecation) and stool material was not homogenised. FF samples were placed in a non-sterile tube without stabilization fluid and delivered to the PERSIMUNE biobank by urgent hospital carrier and frozen directly during regular working hours (i.e. Monday-Friday, 8am-3pm). If outside of regular working hours, the FF sample would still be delivered to the biobank by urgent hospital carrier but then placed in the biobank refrigerator at 4 °C until regular working hours when it would then be frozen. SF samples were placed in the OMNIgene.GUT tube (containing stabilization fluid) according to the manufacturer’s instructions and delivered to the PERSIMUNE biobank by non-urgent hospital carrier. Non-urgent hospital carrier meant that samples could be kept at room temperature for up to several days prior to freezing. Due to the need for swift delivery of FF samples, no samples were taken at patients’ homes, i.e. all stool was collected at the hospital.

All samples were handled by personnel at the PERSIMUNE biobank. Half of the SF samples (n = 13) were further aliquoted into two sterile tubes prior to freezing, whereas remaining SF samples and all FF samples were frozen directly without further aliquoting. All samples were frozen at −80 °C.

The OMNIgene.GUT (DNA Stabilized-frozen Inc., Ottawa, Canada) stabilisation tube was chosen as the best candidate for our SF protocol. No commercial influence on this decision was allowed, i.e. this decision was not made to promote a specific brand, we did not consult with marketing holders, and did not receive financial support. The decision was based on a review of available documentation and literature: OMNIgene.GUT was found to have the least effect on gut microbiome composition compared to other stabilization fluids⁹, with results generally being comparable to FF samples^5,8,15 and differences among sampling methods being of a lower scale than those between individuals^6,8,16. The OMNIgene.GUT stabilisation tube should allow samples to be kept at room temperature for up to 60 days (according to manufacturer guidelines) and a previous study (based on a healthy cohort) has shown that the OMNIgene.GUT collection kit gave similar results after 28 days at room temperature as samples with freshly-extracted DNA⁷. For additional information regarding the stabilizer decision process, and an overview of stabilization comparison studies please see Supplementary Methods.

Sample shipment

All samples (26 FF and 26 SF, i.e. 52 paired samples) were transported on dry ice to the IrsiCaixa AIDS Research Institute in Spain for 16S rRNA gene and shotgun metagenomic sequencing.

DNA extraction

All samples were thawed on ice and DNA was extracted from ~200 mg of each faecal sample using the PowerSoil DNA Extraction Kit (MO BIO Laboratories, Carlsbad, CA, US). Extracted DNA was then stored at −80 °C until sequencing.

Library construction and sequencing

16S rRNA gene sequencing

DNA was PCR amplified using primers for targeting regions flanking the variable regions V3-4 of the 16S rRNA gene (expected amplicon size ~460 base pairs). Primers were those described in the “MiSeq rRNA Amplicon Sequencing” protocol from Illumina, which already have the Illumina adapter overhang nucleotide sequences added to the 16S rRNA V3-V4 specific-primers, i.e.: 16S_F 5′-(TCG TCG GCA GCG TCA GAT GTG TAT AAG AGA CAG CCT ACG GGN GGC WGC AG)-3′ and 16S_R 5′-(GTC TCG TGG GCT CGG AGA TGT GTA TAA GAG ACA GGA CTA CHV GGG TAT CTA ATC C)-3′.

Amplifications were performed in triplicate 25 μL reactions, each containing 2.5 μL of non-diluted DNA template, 12.5 μL of KAPA HiFi HotStart Ready Mix (containing KAPA HiFi HotStart DNA Polymerase, buffer, MgCl2, and dNTPs, KAPA Biosystems Inc., Wilmington, MA, USA), and 5 μL of each primer at 1 μM. Thermal cycling conditions consisted of an initial denaturation step (3 min at 95 °C), followed by 30 cycles of denaturation (30 sec at 95 °C), annealing (30 sec at 55 °C) and extension (30 sec at 72 °C). These were followed by a final extension step of 10 min at 72 °C. Once the desired amplicon was confirmed in 1% agarose gel electrophoresis, all three replicates were pooled and stored at −30 °C until sequencing library preparation. Amplified DNA templates were cleaned-up for non-DNA molecules and Illumina sequencing adapters and dual indices were attached using Nextera XT Index Kit (Illumina, Inc.) followed by the corresponding PCR amplification program as described in the MiSeq. 16S rRNA Amplicon Sequencing protocol. After a second round of clean-up, amplicons were quantified using Quant-iT™ PicoGreen® dsDNA Assay Kit (Invitrogen, Carlsbad, MA, USA) and diluted in equimolar concentrations (4 nM) for further pooling. Sequencing was performed on an Illumina MiSeqTM platform according to the manufacturer’s specifications to generate paired-end reads of 300 base-length in each direction. Samples were sequenced in two batches (not all sample pairs were sequenced in the same batch).

Shotgun metagenomic sequencing

Whole faecal DNA was chemically fragmented using the Nextera-XT® Illumina kit. One library of approximately 300-basepair-clone insert size was constructed for each sample. Total faecal DNA was sequenced in five runs on an Illumina Hi-Seq® platform at the Institut de Medicina Predictiva del Cancer (IMPPC), Badalona, Spain.

Pre-processing and quality control

16S rRNA gene sequencing data

The FastQC software²⁸ and the R(v3.5.0)²⁹/DADA2³⁰ plotQualityProfile was used to assess raw sequence quality. Primer trimming and read merging was performed using Cutadapt³¹ and PEAR (Illumina Paired-End reAd merger)³². Non-merging reads and those with unidentified primers were discarded. Resulting reads were put into the R(v3.5.0)²⁹/DADA2³⁰ pipeline and subsequently quality filtered and trimmed. Briefly, reads were trimmed by 6 nucleotides from the start of the read, all were truncated at the first nucleotide with a quality score of ≤5 (if there was one) and those with ≥2 expected errors in the read were discarded. Reads with <350 base pairs after filtering and trimming were removed. One million random filtered reads were used to create the error model in DADA2³⁰. Chimeric sequences were filtered off using a consensus approach. Samples with <10,000 read counts after processing were discarded. If one sample failed quality control the sample pair was removed from further analysis.

Shotgun metagenomic sequencing data

Specific adapters were removed using the Illumina HiSeq software and Trimmomatic³³. Trimmomatic³³ was also used for quality trimming (nucleotides were trimmed from the start or end of the read if they had a quality score <30, then using a sliding window approach, reads were cut if the average quality of 4 nucleotides was <30). Reads <50 base pairs were removed. Paired reads were mapped with bowtie2³⁴ to the human genome (GRCh38). All read pairs where one or both mates mapped to the human genome were discarded. Final read quality was assessed using the R package ShortRead³⁵ and bowtie2³⁴ output files. Samples with <1 million reads were removed from further analysis. If one sample failed quality control the sample pair was removed from further analysis.

Ethical approval

This study was conducted with approval from the Danish Ethical Committee (J.nr.H-16047481). All patients had given informed consent to the donation and storage of stool samples for microbial research. All methods were performed in accordance with the relevant guidelines and regulations.

Taxonomy and diversity measures

16S rRNA gene sequencing data

Taxonomy was assigned through alignment to the RDP (Ribosomal Database Project)³⁶ using the R(v3.5.0)²⁹/DADA2³⁰ program. Taxonomical annotation was used to remove all OTUs assigned as mitochondria or chloroplasts, all OTUs assigned a non-bacterial kingdom and all OTUs present in <2 samples. The filtered OTU table was used in the R(v3.5.0)²⁹/Phyloseq³⁷ package to calculate relative abundances (at genus and species level), alpha diversity measures (Shannon and Simpson indexes) and beta diversity measures (Bray-Curtis dissimilarity; used as input for ordination analysis using non-metric multidimensional scaling (NMDS)). Specifically, relative abundances were based on OTU counts and were normalised as percentages (100 * (x/sum(x))) at species and genera taxonomy levels. Alpha diversity measures were calculated based on OTU counts without prior normalisation. Beta diversity measures were calculated based on normalised (x/sum(x)) OTU counts.

Shotgun metagenomic sequencing data

Taxonomical annotation was assigned using MetaPhlAn v2.0³⁸. The relative proportions calculated from MetaPhlAn v2.0³⁸ on genus and/or species level were used to calculate relative abundances, alpha diversity measures (Shannon and Simpson indexes) and beta diversity measures (Bray-Curtis dissimilarity; used as input for ordination analysis using non-metric multidimensional scaling (NMDS)) in the R(v3.5.0)²⁹/Phyloseq³⁷ package. Relative abundances were calculated as percentages (i.e. relative proportions were multiplied by 100) at both species and genera taxonomy levels. Due to alpha diversity measures often being calculated based on unnormalized data, we used the rounded fraction of the relative proportions of bacterial species from MetaPhlAn v2.0³⁸ (i.e. relative proportion * 10,000 rounded up to the closest integer) to calculate these measures. For beta diversity measures the relative proportion values of bacterial species could be used directly (i.e. values did not need to be re-normalised prior to calculation).

MetaPhlAn v2.0³⁸ also provided estimations of the number of reads that should come from a given species by considering the coverage of the species’ markers and the length of the genome (taken from reference genomes). Using the estimated number of reads per genera a count table was created for shotgun taxonomical data compatible with the R(v3.5.0)²⁹/PhyloSeq³⁷ package. This count table, based on the estimation of reads from bacterial genera, was used for later DESeq2³⁹ analyses.

Gene richness was calculated using bowtie2³⁴ to map filtered reads to the IGC (Integrated Gene Catalogue)²⁵. Based on a rarefaction curve of number of new unique genes and number of mapped reads (counting forward reads only), a cut-off of 5 million mapped reads was decided to calculate gene richness (Supplementary Fig. S1). Only forward reads were used to calculate the number of unique genes, with a maximum of 1 unique gene per read.

Statistical analysis

The following statistical analyses were performed for both 16S rRNA gene and shotgun metagenomic sequencing data. Detection of differentially abundant bacterial genera between methods was performed using the R(v3.5.0)²⁹/PhyloSeq³⁷ and R(v3.5.0)²⁹/DESeq2³⁹ packages. Using the Wald test in DESeq2³⁹, SF and FF methods were compared for potential statistical differences in their geometric means of bacterial genera. PERMANOVA (adonis) tests (via the R(v3.5.0)²⁹/vegan package⁴⁰) were also performed using Bray-Curtis distances to test for a potential association between sampling methods and microbiota composition. For all statistical analyses, P-values < 0.05 were considered significant.

16S rRNA gene sequencing data

The filtered OTU count table created through R(v3.5.0)²⁹/DADA2³⁰ with taxonomy down to genera-level was used for the DESeq2³⁹ analysis. Normalised (x/ (sum (x)) OTU values from the same table were used for the PERMANOVA (adonis) test.

Shotgun metagenomic sequencing data

Values from the created count table (based on the estimated number of reads from bacterial genera calculated by MetaPhlAn v2.0³⁸) was used for the DESeq2³⁹ analysis. For the PERMANOVA (adonis) test, proportional abundances (calculated by MetaPhlAn v2.0³⁸) were used. Gene richness differences between methods (shotgun metagenomic sequencing data only) were assessed using the Wilcoxon signed-rank test.

Data Availability

The datasets generated and analysed during this study are derived from patients treated in Denmark. The datasets contain sensitive patient data governed by GDPR and Danish law. Due to Danish legislation (Act No. 502 of 23 May 2018) and approvals granted by the Danish Data Protection Agency, it is not possible to upload raw data to a publicly available database. However, access to these data can be made available from the corresponding author on reasonable request, provided a data transfer agreement is entered into according to current regulations.

References

Lynch, S. V. & Pedersen, O. The Human Intestinal Microbiome in Health and Disease. The New England journal of medicine 375, 2369–2379, https://doi.org/10.1056/NEJMra1600266 (2016).
Article CAS PubMed Google Scholar
Gilbert, J. A. et al. Microbiome-wide association studies link dynamic microbial consortia to disease. Nature 535, 94–103, https://doi.org/10.1038/nature18850 (2016).
Article ADS CAS PubMed Google Scholar
Young, V. B. The role of the microbiome in human health and disease: an introduction for clinicians. BMJ (Clinical research ed.) 356, j831, https://doi.org/10.1136/bmj.j831 (2017).
Article Google Scholar
Han, M. et al. A novel affordable reagent for room temperature storage and transport of fecal samples for metagenomic analyses. Microbiome 6, 43, https://doi.org/10.1186/s40168-018-0429-0 (2018).
Article PubMed PubMed Central Google Scholar
Panek, M. et al. Methodology challenges in studying human gut microbiota - effects of collection, storage, DNA extraction and next generation sequencing technologies. Scientific reports 8, 5143, https://doi.org/10.1038/s41598-018-23296-4 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Penington, J. S. et al. Influence of fecal collection conditions and 16S rRNA gene sequencing at two centers on human gut microbiota analysis. Scientific reports 8, 4386, https://doi.org/10.1038/s41598-018-22491-7 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Anderson, E. L. et al. A robust ambient temperature collection and stabilization strategy: Enabling worldwide functional studies of the human microbiome. Scientific reports 6, 31731, https://doi.org/10.1038/srep31731 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Hill, C. J. et al. Effect of room temperature transport vials on DNA quality and phylogenetic composition of faecal microbiota of elderly adults and infants. Microbiome 4, 19, https://doi.org/10.1186/s40168-016-0164-3 (2016).
Article PubMed PubMed Central Google Scholar
Choo, J. M., Leong, L. E. & Rogers, G. B. Sample storage conditions significantly influence faecal microbiome profiles. Scientific reports 5, 16350, https://doi.org/10.1038/srep16350 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Mehta, R. S. et al. Stability of the human faecal microbiome in a cohort of adult men. Nature microbiology 3, 347–355, https://doi.org/10.1038/s41564-017-0096-0 (2018).
Article CAS PubMed PubMed Central Google Scholar
Vogtmann, E. et al. Comparison of Collection Methods for Fecal Samples in Microbiome Studies. American journal of epidemiology 185, 115–123, https://doi.org/10.1093/aje/kww177 (2017).
Article PubMed PubMed Central Google Scholar
Mathay, C. et al. Method optimization for fecal sample collection and fecal DNA extraction. Biopreservation and biobanking 13, 79–93, https://doi.org/10.1089/bio.2014.0031 (2015).
Article CAS PubMed Google Scholar
Dominianni, C., Wu, J., Hayes, R. B. & Ahn, J. Comparison of methods for fecal microbiome biospecimen collection. BMC microbiology 14, 103, https://doi.org/10.1186/1471-2180-14-103 (2014).
Article PubMed PubMed Central Google Scholar
Voigt, A. Y. et al. Temporal and technical variability of human gut metagenomes. Genome biology 16, 73, https://doi.org/10.1186/s13059-015-0639-8 (2015).
Article CAS PubMed PubMed Central Google Scholar
Szopinska, J. W. et al. Reliability of a participant-friendly fecal collection method for microbiome analyses: a step towards large sample size investigation. BMC microbiology 18, 110, https://doi.org/10.1186/s12866-018-1249-x (2018).
Article CAS PubMed PubMed Central Google Scholar
Wang, Z. et al. Comparison of Fecal Collection Methods for Microbiome and Metabolomics Studies. Frontiers in cellular and infection microbiology 8, 301, https://doi.org/10.3389/fcimb.2018.00301 (2018).
Article PubMed PubMed Central Google Scholar
Song, S. J. et al. Preservation Methods Differ in Fecal Microbiome Stability, Affecting Suitability for Field Studies. mSystems 1, https://doi.org/10.1128/mSystems.00021-16 (2016).
Wu, G. D. et al. Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags. BMC microbiology 10, 206, https://doi.org/10.1186/1471-2180-10-206 (2010).
Article CAS PubMed PubMed Central Google Scholar
Al, K. F., Bisanz, J. E., Gloor, G. B., Reid, G. & Burton, J. P. Evaluation of sampling and storage procedures on preserving the community structure of stool microbiota: A simple at-home toilet-paper collection method. Journal of microbiological methods 144, 117–121, https://doi.org/10.1016/j.mimet.2017.11.014 (2018).
Article PubMed Google Scholar
Gorzelak, M. A. et al. Methods for Improving Human Gut Microbiome Data by Reducing Variability through Sample Processing and Storage of Stool. PloS one 10, e0134802, https://doi.org/10.1371/journal.pone.0134802 (2015).
Article CAS PubMed PubMed Central Google Scholar
Angebault, C. et al. Combined bacterial and fungal intestinal microbiota analyses: Impact of storage conditions and DNA extraction protocols. PloS one 13, e0201174, https://doi.org/10.1371/journal.pone.0201174 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ezzy, A. C. et al. Storage and handling of human faecal samples affect the gut microbiome composition: A feasibility study. Journal of microbiological methods 164, 105668, https://doi.org/10.1016/j.mimet.2019.105668 (2019).
Article PubMed Google Scholar
Centre of Excellence for Personalised Medicine for Infectious Complications in Immune Deficiency (PERSIMUNE), http://www.persimune.dk/ (2019).
Wang, H. et al. Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: a systematic analysis for the Global Burden of Disease Study. The Lancet 388, 1459–1544, https://doi.org/10.1016/S0140-6736(16)31012-1 (2015).
Article Google Scholar
Li, J. et al. An integrated catalog of reference genes in the human gut microbiome. Nature biotechnology 32, 834–841, https://doi.org/10.1038/nbt.2942 (2014).
Article CAS PubMed Google Scholar
Tedjo, D. I. et al. The effect of sampling and storage on the fecal microbiota composition in healthy and diseased subjects. PloS one 10, e0126685, https://doi.org/10.1371/journal.pone.0126685 (2015).
Article CAS PubMed PubMed Central Google Scholar
Franzosa, E. A. et al. Relating the metatranscriptome and metagenome of the human gut. Proceedings of the National Academy of Sciences of the United States of America 111, E2329–2338, https://doi.org/10.1073/pnas.1319284111 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Andrews, S. FastQC: a quality control tool for high throughput sequence data, http://www.bioinformatics.babraham.ac.uk/projects/fastqc 2010).
R: A Language and Environment for Statistical Computing (2018).
Callahan, B. J. et al. DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods 13, 581–583, https://doi.org/10.1038/nmeth.3869 (2016).
Article CAS PubMed PubMed Central Google Scholar
Martin, M. Cutadapt Removes Adapter Sequences From High-Throughput Sequencing Reads, http://cutadapt.readthedocs.io/en/stable/index.html (2019).
Zhang, J., Kobert, K., Flouri, T. & Stamatakis, A. PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics (Oxford, England) 30, 614–620, https://doi.org/10.1093/bioinformatics/btt593 (2014).
Article CAS Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics (Oxford, England) 30, 2114–2120, https://doi.org/10.1093/bioinformatics/btu170 (2014).
Article CAS Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nature Methods 9, 357, https://doi.org/10.1038/nmeth.1923 https://www.nature.com/articles/nmeth.1923#supplementary-information (2012).
Article CAS Google Scholar
Morgan, M. et al. ShortRead: a bioconductor package for input, quality assessment and exploration of high-throughput sequence data. Bioinformatics (Oxford, England) 25, 2607–2608, https://doi.org/10.1093/bioinformatics/btp450 (2009).
Article CAS Google Scholar
Wang, Q., Garrity, G. M., Tiedje, J. M. & Cole, J. R. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Applied and environmental microbiology 73, 5261–5267, https://doi.org/10.1128/aem.00062-07 (2007).
Article CAS PubMed PubMed Central Google Scholar
McMurdie, P. J. & Holmes, S. phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PloS one 8, e61217, https://doi.org/10.1371/journal.pone.0061217 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Truong, D. T. et al. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat Methods 12, 902–903, https://doi.org/10.1038/nmeth.3589 (2015).
Article CAS PubMed Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome biology 15, 550, https://doi.org/10.1186/s13059-014-0550-8 (2014).
Article CAS PubMed PubMed Central Google Scholar
Oksanen, J. B. F. et al. R package vegan: Community Ecology Package. version 2, 5–2 (2018).
Google Scholar

Download references

Acknowledgements

We thank the members of the IGTP Genomics and Bioinformatics Core Facilities (Maria Pilar Armengol, Lauro Sumoy and Iñaki Martínez) for their contribution to this publication. IrsiCaixa is supported by the RED de SIDA RD16/0025/0041 co-financed by the ISCIII and the European Regional Development Fund (ERDF), “Investing in your future”. We would also like to thank those involved at the PERSIMUNE biobank situated at Rigshospitalet (Erik Sørensen, Betina Poulsen and Charlotte Matthews) for their contribution to this publication. In addition, we thank the staff and patients at the haematological department, Rigshospitalet, for their contribution to sample collection. This study was funded by the Danish Research Foundation [grant number DNRF126], the Svend Andersen Foundation and the Lundbeck Foundation [grant number R218-2016-1482]. None of the funding bodies had any influence on the study design, the collection, analysis or interpretation of data or the writing of the manuscript.

Author information

Henrik Sengeløv and Cameron MacPherson jointly supervised this work.

Authors and Affiliations

PERSIMUNE Centre of Excellence, Rigshospitalet, Blegdamsvej 9, DK-2100, Copenhagen Ø, Denmark
Emma E. Ilett, Mette Jørgensen, Daniel D. Murray, Marie Helleberg, Jens Lundgren & Cameron MacPherson
Institut de Recerca de la SIDA – IrsiCaixa, Hospital Universitari Germans Trias i Pujol, Badalona, Catalonia, Spain
Marc Noguera-Julian & Roger Paredes
Department of Oncology, Rigshospitalet Blegdamsvej 9, DK-2100, Copenhagen Ø, Denmark
Gedske Daugaard
Infectious Diseases Service, Hospital Universitari Germans Trias i Pujol, Badalona, Spain
Roger Paredes
Department of Haematology, Rigshospitalet, Blegdamsvej 9, DK-2100, Copenhagen Ø, Denmark
Henrik Sengeløv

Authors

Emma E. Ilett
View author publications
You can also search for this author in PubMed Google Scholar
Mette Jørgensen
View author publications
You can also search for this author in PubMed Google Scholar
Marc Noguera-Julian
View author publications
You can also search for this author in PubMed Google Scholar
Gedske Daugaard
View author publications
You can also search for this author in PubMed Google Scholar
Daniel D. Murray
View author publications
You can also search for this author in PubMed Google Scholar
Marie Helleberg
View author publications
You can also search for this author in PubMed Google Scholar
Roger Paredes
View author publications
You can also search for this author in PubMed Google Scholar
Jens Lundgren
View author publications
You can also search for this author in PubMed Google Scholar
Henrik Sengeløv
View author publications
You can also search for this author in PubMed Google Scholar
Cameron MacPherson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.S., J.L., M.H. and E.I. designed the study. M.N.-J. and R.P. performed the sequencing. M.J., M.N.-J., C.M. and E.I. wrote the computer pipeline. M.J., M.N.-J., C.M., D.M., R.P., J.L. and E.I. contributed to the framework and statistical plan of the study. E.I. wrote the first draft of the paper, with M.J. and C.M. contributing to all figures. All authors made substantial contributions to the analysis of data, have been involved in drafting the manuscript and have read and approved the final manuscript.

Corresponding author

Correspondence to Emma E. Ilett.

Ethics declarations

Competing Interests

The authors state that there is no conflict of interest in the work presented. OMNIgene.GUT kits were purchased from DNA Genotek and none of the authors owned stock, were under the employment or consulted for the DNA Genotek company.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Table S2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ilett, E.E., Jørgensen, M., Noguera-Julian, M. et al. Gut microbiome comparability of fresh-frozen versus stabilized-frozen samples from hospitalized patients using 16S rRNA gene and shotgun metagenomic sequencing. Sci Rep 9, 13351 (2019). https://doi.org/10.1038/s41598-019-49956-7

Download citation

Received: 02 January 2019
Accepted: 04 September 2019
Published: 16 September 2019
DOI: https://doi.org/10.1038/s41598-019-49956-7

This article is cited by

Long-term taxonomic and functional stability of the gut microbiome from human fecal samples
- Jae Hyun Kim
- Ji-Young Jeon
- Min-Gul Kim
Scientific Reports (2023)
Multicenter evaluation of gut microbiome profiling by next-generation sequencing reveals major biases in partial-length metabarcoding approach
- Hugo Roume
- Stanislas Mondot
- Joël Doré
Scientific Reports (2023)
Critical evaluation of faecal microbiome preservation using metagenomic analysis
- Alena L Pribyl
- Donovan H Parks
- Philip Hugenholtz
ISME Communications (2021)
Parkinson’s disease-associated alterations of the gut microbiome predict disease-relevant changes in metabolic functions
- Federico Baldini
- Johannes Hertel
- Paul Wilmes
BMC Biology (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Data quality

16S rRNA gene sequencing data

Shotgun metagenomic sequencing data

Alpha-diversity

16S rRNA gene sequencing data

Shotgun metagenomic sequencing data

Shotgun metagenomics gene richness

Beta-diversity

16S rRNA gene sequencing data

Shotgun metagenomic sequencing data

Taxonomy and relative abundance

16S rRNA gene sequencing data

Shotgun metagenomic sequencing

Discussion

Methods

Sample collection

Sample shipment

DNA extraction

Library construction and sequencing

16S rRNA gene sequencing

Shotgun metagenomic sequencing

Pre-processing and quality control

16S rRNA gene sequencing data

Shotgun metagenomic sequencing data

Ethical approval

Taxonomy and diversity measures

16S rRNA gene sequencing data

Shotgun metagenomic sequencing data

Statistical analysis

16S rRNA gene sequencing data

Shotgun metagenomic sequencing data

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links