Abstract
Natural sulfide rich deposits are common in coastal areas worldwide, including along the Baltic Sea coast. When artificial drainage exposes these deposits to atmospheric oxygen, iron sulfide minerals in the soils are rapidly oxidized. This process turns the potential acid sulfate soils into actual acid sulfate soils and mobilizes large quantities of acidity and leachable toxic metals that cause severe environmental problems. It is known that acidophilic microorganisms living in acid sulfate soils catalyze iron sulfide mineral oxidation. However, only a few studies regarding these communities have been published. In this study, we sampled the oxidized actual acid sulfate soil, the transition zone where oxidation is actively taking place, and the deepest un-oxidized potential acid sulfate soil. Nucleic acids were extracted and 16S rRNA gene amplicons, metagenomes, and metatranscriptomes generated to gain a detailed insight into the communities and their activities. The project will be of great use to microbiologists, environmental biologists, geochemists, and geologists as there is hydrological and geochemical monitoring from the site stretching back for many years.
Measurement(s) | rRNA_16S • Metagenome • transcription profiling assay |
Technology Type(s) | amplicon sequencing • DNA sequencing • RNA sequencing |
Factor Type(s) | biological replicate • sampling depth |
Sample Characteristic - Environment | soil biocrust |
Sample Characteristic - Location | Finland |
Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.9901265
Similar content being viewed by others
Background & Summary
Naturally occurring deposits containing sulfidic sediments that form acid sulfate soils (ASS) cover over 17 million hectares of coastal areas. These extreme soils occur in North America (e.g. in the South and South West of the U.S) as well as in Europe (e.g. surrounding the Baltic Sea), Asia, and Australia1. Exposure of the sulfidic materials to air initializes chemical reactions that produce sulfuric acid2, thus creating ASS with a pH < 4, while mobilizing large quantities of acidity and leachable toxic metal(loid)s (e.g. Al, As, Cd, Co, Ni, and Zn). The leached metals and acidity are ultimately transported to the surrounding waters where they cause severe environmental problems3, negative economic consequences4, and impact human health5.
Pyrite oxidation is a complex biogeochemical process that involves a series of chemical reactions aided by microbiological catalysis1. Intermediate sulfur species formed during the oxidation process are metabolized by sulfur-oxidizing bacteria6 while ferrous iron can be further oxidized at low pH by iron-oxidizing bacteria7. Ultimately, pyrite oxidation causes acidification and mobilization of trace metals that are leached to recipient waters. These processes have been extensively studied at the Risöfladan experimental field, Vaasa, Finland (Fig. 1) and there is a large amount of available geochemical and microbial 16S rRNA gene sequencing data8,9,10,11. The previous 16S rRNA gene based investigations from the oxidized zone at the site identified a mixed community of acidophilic bacteria and archaea similar to that found in acid mine drainage environments along with sulfate reducing bacteria in the underlying un-oxidized potential acid sulfate soil (PASS) zone10,11. Finally, 16S rRNA genes with similarity to low temperature adapted microbes were identified in all soil layers, reflecting the boreal environment in Finland. However, more detailed investigations into the metabolic potential and activities of the contrasting microbial communities in oxidized ASS, the transition zone containing the oxidative front, and the PASS have not been reported.
In this Data Descriptor, we present triplicate biological replicate data for 16S rRNA gene amplicons, metagenomes, and metatranscriptomes from the oxidized, transition, and un-oxidized soils horizons at the Risöfladan experimental field (total nine samples) along with the accompanying geochemical metadata (Table 1). The 16S rRNA gene amplicon sequencing dataset contains on average 369 027 reads (min 83 731, max 634 072) for the nine samples (Table 2). In addition, the nine metagenomes (three biological replicates from the three soil types designated OX-MG, TR-MG, and UN-MG) contained 3.78 × 109 paired-end reads (2 × 151 bp) of raw sequence data (Table 3). Finally, the corresponding nine metatranscriptomes (designated OX-MT, TR-MT, and UN-MT) to the three soil types generated 1.11 × 1011 paired-end reads (2 × 151 bp) of raw sequence data (Table 3).
The broader goals that motivated the study were to take advantage of multi-omics and geochemical data to gain insights into the metabolic landscape and the molecular mechanisms underlying microbial life in this extreme environment. In addition, based on 16S rRNA gene sequencing many of the microbes in the PASS and ASS are unknown and the data will characterize the structure and function of these microbial populations. Moreover, the microbial metabolic pathways in the actual ASS will be used to answer the question how they catalyze the oxidation process.
The project will be of great use to the research community, as it will generate biological data for which there is hydrological and geochemical monitoring of the site stretching back for many years. Since ASS are extreme environments, the project will be of direct use to researchers, e.g. microbiologists, environmental biologists, geochemists, and geologists. Scientists interested in the iron and sulfur cycles as well as the contaminants released from the ASS during the oxidation process will additionally have a direct use of the project. As sulfidic sediments are affected during dredging, building of infrastructure, and agriculture; researchers involved in these areas will also have use of this work as it may give an explanation in how to minimize the negative impact of these sediments. Finally, the study is relevant to bioinformaticians that study large data sets.
Methods
Soil sampling
Soil was sampled in mid-August 2017 from the Risöfladan experimental field located in Vaasa, Finland (63° 02′ 50.22″N, 21° 42′ 41.85″E; Figs 1 and 2). Ten replicate soil samples of 5 g were taken from 75 cm below ground (oxidized zone, OX), 140 cm below ground (transition zone, TR), and 190 cm below ground (un-oxidized zone, UN). The samples were placed in separate sterile tubes (30 tubes in total), immediately preserved by freezing in liquid nitrogen, and stored at −80 °C until RNA and DNA were extracted within ten days of sampling.
RNA and DNA extraction
RNA and DNA were extracted simultaneously from 3 g of soil using the RNeasy® PowerSoil Total RNA Kit (Qiagen) and the following RNeasy® PowerSoil DNA elution Kit (Qiagen). Briefly, 5 g of frozen soil was thawed and carefully mixed before 3 g of soil was transferred to a Bead Tube provided in the RNeasy® PowerSoil Total RNA Kit and RNA was extracted according to the manufacturer’s instructions. The final RNA pellet was suspended in 25 µL RNase/DNase-free water. After eluting the RNA from the RNA Capture Column provided by the RNeasy® PowerSoil Total RNA Kit, the bound DNA was eluted from the RNA Capture Column using the RNeasy® PowerSoil DNA Elution Kit according to the manufacturer’s instructions. The final DNA pellet was suspended in 50 µL RNase/DNase-Free water.
After extraction, RNA samples were treated twice with RNase-free DNase using the Turbo DNA-free™ Kit (Invitrogen) according to the manufacturer’s instructions and then stored at –80 °C. In order to obtain sufficient samples for the final three replicates per soil zone, two to three of the extracted RNA and DNA samples were pooled. The same replicate extractions were pooled for RNA and DNA in order for the metatranscriptomes to match the metagenomes and the RNA transcripts to be mapped to the metagenomes. Ribosomal RNA was depleted from the pooled RNA samples with the Ribominus™ Transcriptome Isolation Kit (Invitrogen) according to the manufacturer’s instructions. Nucleic acid concentrations were measured using a Qubit® 2.0 Fluorometer (Life Technologies). DNA and RNA samples were stored at −80 °C before being submitted to the U.S. Department of Energy Joint Genome Institute (JGI) for sequencing.
16S rRNA gene amplicon sequencing
Subsamples from the DNA extracts were used for amplifying the V3-V4 region of the microbial 16S rRNA gene by using primers 341F (CCTACGGGNGGCWGCAG) and 805R (GACTACHVGGGTATCTAATCC)12, followed by PCR amplification for Illumina sequencing13 (Fig. 2). Samples were sequenced at the Science for Life Laboratory, Sweden, on the Illumina MiSeq platform14. A summary of the 16S rRNA gene amplicon sequencing reads is shown in Table 2.
Metagenome and metatranscriptome library construction and sequencing
For metagenomes, 100 ng of DNA was sheared to 300 bp using the Covaris LE220 and size selected using SPRI beads (Beckman Coulter). The fragments were treated with end-repair, A-tailing, and ligation of Illumina compatible adapters (IDT, Inc) using the KAPA-Illumina library creation kit (KAPA Biosystems). The prepared libraries were quantified using KAPA Biosystem’s next-generation sequencing library qPCR kit and run on a Roche LightCycler 480 real-time PCR instrument. Sequencing of the flowcell was performed on the Illumina NovaSeq sequencer using NovaSeq XP V1 reagent kits, tbd-sample dependant flowcell, following a tbd-sample dependant indexed run recipe.
For metatranscriptomes, stranded cDNA libraries were generated using the Illumina Truseq Stranded RNA LT kit. Ten ng of total RNA was fragmented using divalent cations and high temperature. The fragmented RNA was reversed transcribed using random hexamers and SSII (Invitrogen) followed by second strand synthesis. The fragmented cDNA was treated with end-pair, A-tailing, adapter ligation, and eight cycles of PCR. The prepared libraries were quantified using KAPA Biosystem’s next-generation sequencing library qPCR kit and run on a Roche LightCycler 480 real-time PCR instrument. Sequencing of the flowcell was performed on the Illumina NovaSeq sequencer using NovaSeq XP V1 reagent kits, tbd-sample dependent flowcell, following a tbd-sample dependant indexed run recipe.
The generated reads and bases of all samples can be seen in Table 3.
Data Records
A summary of the data records included in the study is given in Table 4. The raw Illumina MiSeq 16S rRNA gene sequencing reads are available from the NCBI Sequence Read Archive under the NCBI BioProject accession number PRJNA52414415. The raw Illumina NovaSeq sequencing reads for all metagenomes are available from the NCBI Sequence Read Archive under accession numbers SRP18508216, SRP18508417, SRP18508618, SRP18508719, SRP18508820, SRP18508921, SRP18509222, SRP18509123 and SRP18522324. The raw Illumina NovaSeq sequencing reads for all metatranscriptomes are also available from the NCBI Sequence Read Archive under accession numbers SRP18522225, SRP18509426, SRP18509327, SRP18522528, SRP18522429, SRP18522630, SRP18522731, SRP18522832 and SRP18522933.
Technical Validation
Soil samples were taken aseptically by using sterilized equipment and sterile RNase and DNase-free tubes. RNA and DNA were extracted in an RNase free environment and the quantity of the extracted nucleic acids were measured between every step with a Qubit® 2.0 Fluorometer. The quality of the extracted nucleic acids was analyzed using both a NanoDrop™ 2000 Spectrophotometer and agarose gel electrophoresis using a 1% agarose gel with 0.01% SYBR™ Safe DNA Gel Stain (Invitrogen) and a Thermo Scientific™ GeneRuler™ Ready-to-use 1 kb Plus DNA Ladder.
For 16S rRNA gene amplification, negative and positive controls in the form of RNase/DNase-free water and DNA extracted from Pseudoateromonas citrea, respectively were included in the first amplification step. In addition, negative controls in the form of mastermix with primer-barcodes without templates were included in the second PCR amplification to confirm that both the primers and the barcodes identifying the sequences functioned correctly and that no contamination occurred. The concentration of the 16S rRNA gene amplicons and controls was measured with a Qubit® 2.0 Fluorometer and their quality were analyzed using agarose gel electrophoresis (Fig. 3).
References
Karimian, N., Johnston, S. G. & Burton, E. D. Iron and sulfur cycling in acid sulfate soil wetlands under dynamic redox conditions: A review. Chemosphere 197, 803–816 (2018).
Schippers, A. & Sand, W. Bacterial leaching of metal sulfides proceeds by two indirect mechanisms via thiosulfate or via polysulfides and sulfur. Appl. Environ. Microbiol. 65, 319–321 (1999).
Sutela, T. & Vehanen, T. The effects of acidity and aluminium leached from acid-sulphate soils on riverine fish assemblages. Boreal Env. Res. 22, 385–391 (2017).
Macdonald, B. C. T., Denmead, O. T., White, I. & Melville, M. D. Natural sulfur dioxide emissions from sulfuric soils. Atmos. Environ. 38, 1473–1480 (2004).
Hinwood, A., Horwitz, P. & Rogan, R. Human exposure to metals in groundwater affected by acid sulfate soil disturbance. Arch. Environ. Contam. Toxicol. 55, 538–545 (2008).
Dopson, M. & Johnson, D. B. Biodiversity, metabolism and applications of acidophilic sulfur- metabolizing microorganisms. Environ. Microbiol. 14, 2620–2631 (2012).
Bonnefoy, V. & Holmes, D. S. Genomic insights into microbial oxidation and iron homeostasis in extremely acidic environments. Environ. Microbiol. 14, 1597–1611 (2012).
Åström, M., Österholm, P., Bärlund, I. & Tattari, S. Hydrochemical effects of surface liming, controlled drainage and lime-filter drainage on boreal acid sulfate soils. Water Air Soil Pollut. 179, 107–116 (2007).
Boman, A., Fröjdö, S., Backlund, K. & Åström, M. E. Impact of isostatic land uplift and artificial drainage on oxidation of brackish-water sediments rich in metastable iron sulfide. Geochim. Cosmochim. Acta 74, 1268–1281 (2010).
Wu, X. et al. Impact of mitigation strategies on microbial community from an Ostrobothnian acid sulfate soil. Sci. Tot. Environ. 526, 215–221 (2015).
Wu, X. et al. Microbial community potentially responsible for acid and metal release from an Ostrobothnian acid sulfate soil. FEMS Microbiol. Ecol. 84, 555–563 (2013).
Herlemann, D. P. et al. Transitions in bacterial communities along the 2000 km salinity gradient of the Baltic Sea. ISME J. 5, 1571–1579 (2011).
Hugerth, L. W. et al. DegePrime, a program for degenerate primer design for broad-taxonomic-range PCR in microbial ecology studies. Appl. Environ. Microbiol. 80, 5116–5123 (2014).
Lindh, M. V. et al. Transplant experiments uncover Baltic Sea basin-specific responses in bacterioplankton community composition and metabolic activities. Front. Microbiol. 6, 223 (2015).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP187263 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185082 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185084 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185086 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185087 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185088 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185089 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185092 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185091 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185223 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185222 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185094 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185093 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185225 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185224 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185226 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185227 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185228 (2019).
Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP185229 (2019).
Acknowledgements
(A portion of) this research was performed under the Facilities Integrating Collaborations for User Science (FICUS) initiative and used resources at the DOE Joint Genome Institute and the Environmental Molecular Sciences Laboratory, which are DOE Office of Science User Facilities. Both facilities are sponsored by the Office of Biological and Environmental Research and operated under Contract Nos. DE-AC02-05CH11231 (JGI) and DE-AC05-76RL01830 (EMSL). E.H.-R. acknowledges the Nessling Foundation (grant number 201700273 and 201800502) and Svensk-Österbottniska Samfundet r.f. (grant number 18/0604) for financial support. M.D. acknowledges The Swedish Research Council Formas (grant number 2018-00760) and the Geological Survey of Sweden (grant number 36-1878/2017) for financial support. M.D further acknowledges the Science for Life Laboratory (SciLifeLab), the National Genomics Infrastructure (NGI) for complementary computations that were performed on resources provided by SNIC through Uppsala Multidisciplinary Center for Advanced Computational Science (UPPMAX) under project b2013127. D.L. and D.B. acknowledges the Swedish Research Council infrastructure project Biodiversity Atlas Sweden (VR, grant number 2017-00688) and the marine strategic research environment EcoChange (Formas) for financial support.
Author information
Authors and Affiliations
Contributions
E.H.-R., S.E. and M.D. conceived the study and wrote the manuscript with contributions from all authors. E.H.-R. and S.E. performed the sampling, E.H.-R. extracted the biomolecules together with M.L-F. and S.C. and M.L-F. amplified the 16S rRNA gene for amplicon sequencing. S.C., D.B. and D.L. carried out bioinformatic analysis of the data sets. M.H., A.C., Bri.F., Bry.F., S.R., K.P., N.V., S.M., T.B.K.R., C.D., A.C., I.-M.C., N.N.I., N.C.K., M.H.-S. and E.A.E.-F. carried out the sequencing and bioinformatic analysis of the metagenomic and metatranscriptomic data sets.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
About this article
Cite this article
Högfors-Rönnholm, E., Lopez-Fernandez, M., Christel, S. et al. Metagenomes and metatranscriptomes from boreal potential and actual acid sulfate soil materials. Sci Data 6, 207 (2019). https://doi.org/10.1038/s41597-019-0222-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-019-0222-3