Introduction

Chemical weathering of silicate minerals has conventionally been treated as the most effective abiotic sink for atmospheric CO21. While the fluxes of photosynthesis and respiration are nearly balanced2, the removal of atmospheric CO2 through silicate weathering in line with the burial of biospheric carbon in major river systems can offset a considerable portion of the CO2 emitted from geological sources (e.g., volcanic and metamorphic degassing), thereby regulating climatic fluctuations over geological time scales3. However, weathering of geological materials does not necessarily lead to carbon sequestration but instead contributes to intensifying CO2 emissions (e.g., pyrite-induced carbonate dissolution and oxidation of petrogenic organic carbon)4,5. Growing evidence has shown that pyrite oxidation is much more prevalent than previously thought5,6. Based on the solute contents and sulfur isotopic compositions of sulfate, the model calculation estimates that the export of pyrite-derived sulfate from major rivers could reach 1.3 \(\times\) 1012 moles yr−1, a quantity comparable to the solute budget of silicate weathering7. As sulfuric acid dissolves carbonate minerals more rapidly than carbonic acid, this reaction leads to a net release of CO2 gas at a time scale of less than 107 yrs8, thereby counteracting the removal of CO2 by silicate weathering5.

Pyrite-induced carbonate weathering could have been even more effective than silicate weathering in tectonically active terranes9. This inference is readily illustrated in previous studies on catchments distributed in the Himalayas, Andes, New Zealand, and Taiwan orogens9,10,11,12,13. The area-specific flux of pyrite-induced carbonate weathering in these active mountain belts has been estimated to reach a level (millions of moles km−2 yr−1) several orders of magnitude greater than that in major river basins (see review and compilation in ref. 1). This pattern has been attributed to the enhanced exposure of fine-grained pyrite and rock fragments to rain and groundwater infiltrating through the pore space and fracture of talus deposits and bedrock, a process aided by rapid fluvial incision and hillslope mass wasting associated with rapid tectonic uplift12,14.

Microbial catalysis has been speculated to contribute to the production of sulfuric acid from pyrite in orogenic belts1. However, the potential niche and community assemblage, and the extent to which microbial sulfur respiration is involved remains largely unknown. Addressing such a knowledge gap can provide a basis to unveil how the weathering thermostat of climatic feedback is regulated by and co-evolves with microbially acid production. In this study, we aimed to determine the potential microbial role in pyrite-induced chemical weathering in a rapidly exhuming catchment of eastern Taiwan. Field campaigns were conducted between 2016 and 2018 to sample and analyze river water, sediments in benthic zones, short-path creeks, groundwater, hot springs, soils, and rocks from the Hsin-Wu-Lu River headwaters of the Bei-Nan catchment (Fig. 1). The catchment was chosen by taking the advantage that the bedrock mainly composed of slate, greenschist, and blackschist is rapidly exposed to the surface environment (several to tens of mm yr−1, depending on the time scale15,16,17,18) by tectonic uplift and accompanying river incision and mass wasting, providing vast amounts of reactive pyrite for mineral weathering. The catchment is also characterized by the export of high suspended particulates to the ocean at a flux (20–88 Mt yr−1 or 0.013–0.055 Mt km−2 yr−1)19,20 ranking among the highest in Taiwan. Spatial and temporal variabilities in chemical weathering and microbial community across the catchment were examined to resolve the relationships between hydrological and microbial shifts imposed by event-driven erosion.

Fig. 1: Geological map of the study area overlaid with sampling locations, the contributions of weathering pathways to the solute production, the net DIC production, and the transient CO2 gas emission.
figure 1

The relative fractions of cations from silicate and carbonate weathering were calculated from the data set corrected for precipitation and hot spring inputs using a Bayesian algorithm based on the modeled compositions of silicate and carbonate minerals. The fractions of the solute loads from individual pathways were further deconvoluted to constrain the acid contribution and bulk CO2 production considering the charge balance for reactions that produce both DIC and gaseous CO2 (Supplementary Methods). The fraction of each pathway (“F” value) represents an average for the high-water or low-water periods (sample names attached with “-H” and “-L”, respectively). The net DIC production represents the production of DIC from carbonate dissolution. For comparison, the transient CO2 emission is derived from the subtraction of the gaseous CO2 produced from carbonate dissolution from the dissolution of atmospheric CO2. The red and green arrows mark the net transient emission and sink of gaseous CO2, respectively. The error bars represent one standard deviation. Detailed computation approaches can be found in Supplementary Methods.

Results and discussions

Pyrite-induced chemical weathering in an active orogen

Analyses of 102 river water samples yielded that Ca2+, SO42-, and dissolved inorganic carbon (DIC) were the main solute constituents (Supplementary Table S1). After correcting for rain inputs, the plot of Ca2+/Na+ against Mg2+/Na+ revealed that most samples clustered around an end member represented by carbonate minerals rather than silicate minerals (Supplementary Fig. S1; detailed correction in Supplementary Methods). The predominance of carbonate over silicate weathering was further quantified using a Bayesian approach for the mass balance configured by the model compositions of silicate and carbonate minerals (Supplementary Methods). The results indicated that carbonate weathering produced >90% of the total cation yields in most tributaries (HLBS, LD, and MLL02) and the main stem (CWL01 and 04), regardless of the sampling time (Fig. 1 and Supplementary Figs. S2 and S3).

The only exception occurs for DL (Da-Lun River), along which hot fluids are discharged from a cluster of springs identified 15 km upstream from the confluence with the Hsin-Wu-Lu River. These hot fluids were enriched in Na+ and DIC (Supplementary Table S1), a geochemical pattern modulated primarily by water-rock interactions at high temperatures (<12% of cations in DL water originated from high-temperature reactions). By subtracting this hot-spring component, the DL solute pattern revealed that carbonate weathering contributed 93 ± 6% and 79 ± 8% of the total cation yields during the high- and low-water periods, respectively (Fig. 1 and Supplementary Figs. S2 and S3).

The capacity of carbonate minerals to buffer pyrite-derived acids appears to be insufficient to accommodate the production of DIC for most tributaries and the main stem, as evidenced by the (Ca2+ + Mg2+)/sulfate ratios between one and two (Fig. 2a; except for some samples from HLBS, DL, and CWL04). The stoichiometric disproportionation suggests that pyrite-induced carbonate weathering can lead to the formation of both bicarbonate and CO2 gas. With a approach considering the charge balance, the net emission of gaseous CO2 occurred in all periods for LD, MLL02, and CWL01, and in some periods for CWL04 (Supplementary Methods). In contrast, the net removal of atmospheric CO2 occurred in all periods for HLBS and DL, and some periods for CWL04, and was associated with the lower sulfate (0.5–1.3 mM for HLBS and DL versus >2 mM for other tributaries) and comparable concentrations of divalent ions and DIC. The derived net CO2 directivity does not incorporate the CO2 emissions associated with the export of riverine DIC into the ocean (Fig. 1) and the subsequent precipitation of carbonate minerals over geological time, thereby representing a transient status for the CO2 cycle.

Fig. 2: Sulfate abundances and isotopic compositions of river water.
figure 2

Plots of a (Ca2++Mg2+) versus sulfate concentrations, b sulfur versus oxygen isotopic compositions of sulfate, and c oxygen isotopic compositions of sulfate versus of water. Dashed lines with slopes of 1 and 2 in a represent carbonate weathering by sulfuric acid with the production of DIC (Eq. (S10) in Supplementary Methods) and gas CO2 (Eq. (S11) in Supplementary Methods), respectively. Isotopic compositions of sulfate from seawater (or evaporite), pyrite originally from microbial sulfate reduction during early diagenesis, and atmospheric deposits are plotted for comparison in b. The dashed lines in c represent the source contribution for oxygen in sulfate produced by pyrite oxidation with oxygen incorporated from the atmospheric oxygen or the water molecule. Detailed modeling approaches are described in Supplementary Note 1.

The predominance of pyrite-driven carbonate weathering on solute export is supported by the isotopic composition of sulfate (Fig. 2b; Supplementary Notes 1). The δ34S and δ18O values of sulfate ranged from −12 to +3 ‰ and from −8 to +4 ‰, respectively. Such a wide range of δ34S values and their distinction from marine sulfate composition (+10 to +35‰)21 suggest that sulfur in sulfate is not sourced from marine evaporites but rather from reduced sulfur that experienced diagenesis and subsequent metamorphism. On the other hand, the δ18O values reflect the mixed source compositions of oxygen (atmospherically derived oxygen of +23.8‰ and water-derived oxygen between −16 and −10‰; Supplementary Fig. S4). The mass balance calculation showed that water contributed >50% of oxygen in sulfate for all samples (Fig. 2c; Supplementary Note 1). For some samples from the tributary (HLBS) located farthest upstream, the contribution of water-derived oxygen dominated over the atmospherically derived oxygen at a much greater level (>90%; Fig. 2c). Both ferrihydrite and atmospheric oxygen are important oxidants to drive the production of sulfuric acid. Considering the sluggish reactivity of pyrite oxidation under ambient conditions22,23, microbial enzymatic catalysis with or without oxygen may account for copious sulfate production in the investigated catchment.

Microbial role in pyrite oxidation

Analyses of 16S rRNA gene amplicons yielded diverse community compositions (Fig. 3). These amplicon sequence variants (ASVs) were phylogenetically assigned to 88 phyla across Bacteria (74) and Archaea (14) domains. While 30,765 ASVs were recovered, soil communities appeared to be the most diverse among all habitats (Supplementary Tables S2 and S3). For river water and sediments, the diversity indices for high-water periods were lower than those for low-water periods (p < 0.05, Dunn’s test; Supplementary Tables S2 and S3, and Supplementary Fig. S5). Higher percentages of ASVs ( > 10%) were shared between the water and sediment communities than between the soil and water/sediment communities (Supplementary Fig. S6). Among the diverse community members, ASVs closely affiliated with two genera, Sulfuricurvum within Campylobacteria and Thiobacillus within Gammaproteobacteria, were particularly abundant in the high-water samples. The proportions of these two genera ranged from 1% to 79% and from 3% to 36% of the reads, respectively (Fig. 3a, b; Supplementary Table S4). The summed proportions were greater than 40% in 23 of the 29 samples. Comparable results were obtained from benthic sediments (summed proportions from 24% to 74% of the reads; Supplementary Table S4). In contrast, these two genera were much less abundant (<11% in total) in the majority of the low-water river and sediment samples (17 out of 21 samples; exceptions were some samples collected in May 2017) and in short-path creeks, groundwater, hot springs, soils, and rocks. Instead, Acidobacteriota, Actinobacteriota, Alphaproteobacteria, Bacteroidota, Cyanobacteria, and Gammaproteobacteria constituted the main classes or phyla (Fig. 3a). The contrasting proportions of these two genera between water levels were further supported by quantitative PCR (qPCR) results (Fig. 4). Their 16S rRNA gene copies ranged between 3 × 105 and 3 × 108 copies L−1 for the high-water samples, and were generally greater than those for the low-water samples (from below detection to 1 × 108 copies L−1; p < 0.05, Dunn’s test; Supplementary Table S4). The gene copies also varied positively with the sediment concentrations (Supplementary Fig. S7) and sequence proportions (Supplementary Fig. S8). By multiplying the measured characteristics with the discharges, the annual yields of these two taxa were found to be highly correlated with those of sediment and sulfate (ρ > 0.6 and p < 0.0001; Fig. 4; Supplementary Note 2). Finally, two of the metagenome assembled genomes (MAGs) recovered (Supplementary Note 2, Supplementary Figs. S9 and S10, and Supplementary Table S5) contained a suite of genes affiliated with Sulfuricurvum and Thiobacillus related strains capable of autotrophically metabolizing reduced sulfur under low-oxygen conditions (for Sulfuricurvum) or variable oxygen levels (for Thiobacillus)24,25.

Fig. 3: Community compositions and variance based on 16S rRNA genes for samples collected from rivers, sediments, short-path creeks, groundwater, hot springs, soils, and schists.
figure 3

a Abundances of major taxonomic units (phyla and classes with >3% read proportions), b abundances and taxonomic affiliations of the top 10 most abundant ASVs, and c community variance based on the Bray-Curtis distance ordinated by the non-metric multidimensional scaling (NMDS) approach. Sample affiliations were expressed as either site for river water (LD, MLL02, CWL01, DL, and CWL04) or sample category (“SED” for sediment, “SPR” for hot spring, “GW” for groundwater, “SW” for short path creek, and “Schist” for rock). Samples of rivers and sediments were further categorized with “-H” for the high-water level, and “-L” for the low-water level. Scaled circles in b represent the read proportions of specific ASVs. Abbreviations for taxonomic units in b include “Thio” for Thiobacillus, “Sulfuri” for Sulfuricurvum, “Tend” for Candidatus Tenderia, Burk” for Burkholderiaceae, and “Acine” for Acinetobacter.

Fig. 4: Abundances of Sulfuricurvum and Thiobacillus members and their relationships with the yields of sediment and sulfate.
figure 4

a, b qPCR derived 16S rRNA gene copies, and log-log plots for the yields of 16S rRNA gene copies versus sediment (c, d) and sulfate (e, f) in river samples. The boxes in a and b represent the median, and the first and third quartiles of 16S rRNA gene copies for individual specific sites or sample categories. Whiskers represent 1.5 times the interquartile range. Blue and gold boxes are the results for river and sediment samples collected during high- and low-water periods at specific sites, respectively, whereas gray boxes are the results for different categories (abbreviations the same as Fig. 3). Significant differences for abundances between the high-water versus low-water at specific sites are marked by stars (* for p < 0.05 by Dunn’s test). The yields in cf were calculated by multiplying individual parameters with daily discharges that were proportionated from the discharge at CWL04 using the fraction of catchment area (Supplementary Note 2). The discharge during low-water periods (25 cms) was better approximated using the channel width derived from the satellite image and measured flow velocity (Supplementary Note 2).

The sequence resemblance with the Sulfuricurvum and Thiobacillus affiliated strains (Supplementary Fig. S9), the community variation and functional pattern correlated with the water level and sediment concentration (Figs. 3 and 4, and Supplementary Figs. S7 and S10), and the correlation between the yields of target taxa, sediment, and sulfate (Fig. 4) provide important implications for the microbial role, capacity, and niche in pyrite oxidation. First, these sulfur oxidizers were intimately associated with sediments, suggesting that surface and near-surface environments with erodible materials, such as talus deposits on the hillslope, sediments temporally piled in the bank, or fractured bedrock on the channel wall or as part of the riverbed, would be the most plausible niches for the proliferation of these two genera. Second, these data patterns support the likelihood that the target sulfur-respiring taxa actively oxidized reduced sulfur. The oxidation product, sulfate, along with the solutes derived from weathering processes, could have accumulated in either the pore space of talus deposits and bank sediments or fracture bedrock and migrated toward the river in accordance with hydraulic gradients. Over a period impacted by typhoons or torrential rains, talus deposits, bank sediments, or fractured bedrock are rapidly recharged and saturated with groundwater, becoming highly conductive to groundwater transport14,26,27. In particular, the erodible materials are destabilized by either the failure of the slope base undermined by the turbulent flow or the reduced cohesion associated with the saturation of pore space28, thereby draining the sediment-bound communities into the river. As a result, the taxon yields along with sulfate yields recovered from river water were highly dependent on the amount of sediment or material entrained by high-water events (Fig. 4). During low-water periods, the solute dilution caused by the torrential runoff is essentially absent29,30 Therefore, the groundwater sulfate would constitute a substantial portion of river sulfate when compared with high-water periods. By contrast, only trace amounts of sediment-bound taxa or sessile populations were released into the river through shallow and deep groundwater circulation. Therefore, the river communities would be primarily composed of members that loosely colonized the surface of rocks and sediments exposed to the river water.

Weathering in deep subsurface environments can be a potential contributor to the solute budget during low-water periods. The assertion is supported by nearly coeval variations in precipitation, groundwater level, and river discharge particularly for the extreme events (Supplementary Fig. S11), and elevated sulfate concentrations (~9 mM) of groundwater in a 100-m deep borehole located at ~30 m from river DL (Supplementary Table S1). The observed groundwater is likely connected to the riverbed through the fracture network extended from the subsurface beneath the hillslope, channeling subsurface sulfate to the river water. One notable feature is that the abundances of potential sulfur oxidizers in groundwater were drastically lower (0.91%) than those in river water during high-water periods (Supplementary Fig. S12), which seems to contradict microbially mediated pyrite oxidation. A plausible explanation for such an apparent discrepancy is that these sulfur oxidizers might have colonized the fracture surface, actively converting sulfur to sulfuric acid for energy conservation. This scenario might be intuitively valid because pyrite is visible and prevalent in bank sediments, slate, and schist. The direct contact between sulfur-respiring microorganisms and pyrite surface enables to harness metabolic energy and facilitate electron transfer more efficient potentially through electron carrier or nanowire postulated for extracellular electron transfer during the iron reduction31. Previous studies have also suggested that the abundance of sessile communities in the subsurface can range up to 1000 times higher than those of planktonic communities32. If so, these attached sulfur oxidizers can still produce elevated sulfate, impacting downstream carbonate weathering and other biogeochemical cycling while being exempted from detection through current sample retrieval. Abiotic pyrite oxidation previously reported for shales with small pore throats was noted and discussed33 (Supplementary Discussion).

High chemical flux and CO2 emission in small mountainous catchments

Based on conservative decadal discharge estimates, the mountainous catchment drained by the investigated river in eastern Taiwan exported a total of 1.5 ± 0.3 \(\times\) 109 moles yr−1 sulfate downstream through chemical weathering (Supplementary Discussion). Such a quantity is comparable with the export from the Gao-Ping River in southern Taiwan (7.8 \(\times\) 109 mole yr−1) and the Li-Wu River in eastern Taiwan (7.8–9.3 \(\times\) 108 mole yr−1)1,11,14. The combined export of sulfate from these three catchments with an area of 4331 km2 (0.003% of the global catchment area) results in a total yield of ~1.1 \(\times\) 1010 moles yr−1, which represents 0.85% of the global riverine pyrite-induced sulfate7, and an area-specific flux of 2.5 \(\times\) 106 moles km−2 yr−1. While the area-normalized flux surpasses the global average by two to three orders of magnitude7, the yield of riverine sulfate is comparable to the amount of sulfur in bedrock (1.6 \(\times\) 1010 moles yr−1; Supplementary Discussion) exposed through rapid exhumation and erosion by channel incision and mass wasting16. Therefore, the high fluxes represent the outcome of intensive carbonate weathering driven by pyrite oxidation, which is potentially mediated by Sulfuricurvum and Thiobacillus members at a pace consistent with the exposure of fresh bedrock. The estimates also suggest that microbially driven chemical weathering is supply-limited, converting nearly all exhumed sulfur minerals to soluble sulfate prior to export to the ocean. Furthermore, where the localized acid production exceeds the buffering capacity of carbonate minerals, the intimate microbe-mineral association enables the net, transient emission of gaseous CO2 and produces an overwhelmingly large quantity of DIC, which could ultimately lead to net CO2 generation over geological time scales (Fig. 1). The exported DIC flux from mineral weathering was calculated to be 2.8 \(\times\)109 moles yr−1 (Supplementary Discussion), which is equivalent to 1.4 \(\times\) 109 moles C yr−1, or 1.7 \(\times\) 104 tons C yr−1, or 27 tons C km−2 yr−1 emission of CO2. Nevertheless, the derived conservative CO2 fluxes are either comparable to or greater than the net CO2 fluxes of mineral weathering, export from the oxidation of petrogenic carbon, and sequestration by the burial of biospheric carbon for catchments in Taiwan1,34,35,36, and exceed the values observed for most catchments around the world by up to orders of magnitude (except for site Laval37; see compilation in ref. 1). The investigated catchment represents a hotspot of CO2 emissions potentially rendered by microbe-hydrology-rock interactions in the context of active orogenesis. Finally, high-standing islands around the Pacific Ocean, which constitute only ~3% of the landmass and are prone to convergent tectonics and the impact of meteorological extremes, are estimated to export up to 35% of global sediment flux20,38. While microbial pyrite oxidation is likely sediment-bound, the quantitative extrapolation of the current results to other analogous high-standing islands could be valid only if a transfer function that connects the yields of sulfur oxidizers, sediments, and microbial sulfate is established, tested, and generalized. An extra consideration of sediment residence time would be even more vital when applying this to river systems with wider riparian zones and floodplains. In summary, this study presents a conceptual framework and modern analog to examine whether the engagement of microbial roles in mineral weathering and elemental cycling in global mountain belts is as prevalent and significant, as observed in this study, either on a contemporary time scale or over the geological past.

Methods

Site background and sampling

Taiwan is located at the boundary juxtaposing the Eurasian and Philippine Sea Plates. The northwestward subduction of the Philippine Sea Plate has led to an oblique collision between the Luzon arc and the Eurasian Plate since five million years ago, driving the southward and westward propagation of uplift and exhumation across the island39. Rapid uplift exposes various grades of metamorphic rocks above sea level and enables the formation of deeply incised rivers oriented by structural configuration40. Among numerous river systems in Taiwan, the Bei-Nan River is selected as the study target because of its high sediment export (20–88 million metric ton yr−1)19,20, uplift and erosion rates (3–15 mm yr−1 depending on the time scale)16,41, and the limited impact of human activity. The study catchment, drained by the Hsin-Wu-Lu River and its tributaries, constitutes the upstream part of the Bei-Nan catchment (Fig. 1), and spans an elevation change between 200 and 2900 m above sea level and an area of 639 km2. The bedrock classified as the Backbone Range is composed of the Hsinkao Formation (slate belt) to the west and the Tananao Complex (schist belt) to the east42 (Fig. 1). While the Hsinkao Formation consists mainly of slate/phyllite43 (Fig. 1b), the Tananao Complex is composed of blackschist, light-colored mica schist, greenschist, and minor marble and meta-sandstone. Major mineral constituents of either slate or schist include quartz, mica, and plagioclase (±chlorite and graphite)43. The peak metamorphic temperatures of the schist and slate belts have been estimated to be 550–580 °C and 450 °C, respectively42,44,45,46. A metagranite body in contact with schist and marble is distributed approximately 15 km upstream of the Da-Lun River47. The mineral constituents of the metagranite include quartz, K-feldspar, and plagioclase, with minor amounts of garnet, biotite, zircon, apatite, and pyrrhotite. U-Pb dating on zircon yields ages of ~200 Ma, suggesting arc magmatism related to the subduction of the paleo-Pacific plate. The intrusive body was further rifted from the South China Block during Eocene – Miocene, and accreted into the Tananao Complex by subduction and collision since Miocene47. Talus deposits produced from landslides are distributed in the upstream and midstream regions where topographic gradients are high. Soil development is very limited, primarily because of unstable hillslopes and high erosion rates. Hot springs were identified along the Hsin-Wu-Lu River and one of the tributaries, the Da-Lun River (Fig. 1). These hot springs are enriched with bicarbonate, and sodium (Supplementary Table 1) derived from the water-rock interactions at high temperatures. Visual inspection suggests a low discharge of hot fluids from most springs. The only exception was the spring outcrops distributed along the Da-Lun River. Hot fluids at >90 °C were vigorously released from the interface between the metagranite and schists into the river. Therefore, the correction for river chemistry was made particularly for the samples collected from DL and the downstream section of the Hsin-Wu-Lu River (Supplementary Methods for the detailed correction procedures).

Five main field sites were chosen to cover the major tributaries and possible variations in the chemical characteristics and microbial communities across the catchment. Types of samples include river water from the main stem of the Hsin-Wu-Lu River (sites CWL01 and 04; n = 18, respectively) and its tributaries (sites DL, MLL02, LD, and HLBS; n = 20, 10, 16, and 12, respectively), sediments in the benthic zone (site DL; n = 10), groundwater (near site DL; n = 3), hot springs (LL, LS, LK, SM, and WL; n = 2, 1, 1, 1, and 1, respectively), short-path creeks (n = 5), soils (n = 8), schists (n = 2), and rainwater (near site LD; n = 7). Sample information could be found in Supplementary Table S1. Fieldwork and most sampling activities were conducted between September 2016 and January 2018. The only exception occurred for groundwater collected from a borehole near site DL in September and October of 2018 when the borehole drilled and maintained by the Taiwanese Central Geological Survey was available for this study. Prior to sample collection, some physical characteristics (including temperature, pH, and conductivity) of the water were measured on-site. River water and sediment samples were collected several days after high-water events (typhoons or storms) or during low-water periods (Supplementary Fig. S9). Benthic sediments collected were submerged slightly in river water and distributed near the boundary between the active channel and riparian zone at site DL. They were relatively fine-grained (smaller than the size of coarse sand). The exact depositional age remains unconstrained but is speculated to be on the scale of months as this grain size range is susceptible to mobilization during high-water events. All benthic sediments were collected during the same campaign for sampling of the river water at site DL. Groundwater was collected from a borehole drilled to 100 mbls (meters below land surface) near site DL. The borehole was located approximately 30 meters from the nearby DL River. Groundwater samples were obtained from depth of 67 mbls in September 2018, and 67 mbls and 85 mbls in October 2018. The depth range was used to represent the groundwater from the screen opening at two depth intervals: 64–67 mbls and 88–97 mbls. Rainwater was collected whenever possible. All the other types of samples were retrieved from specific sites once during the investigation period. The attribution of samples to either the high- or low-water period is not straightforward because the water level represents the cumulative effect of precipitation, runoff, groundwater, and events (such as typhoons, monsoons, and storms) that influence the water bodies (Supplementary Fig. S11). Furthermore, the steep topographic variation across the catchment causes rapid variation in water level over an event cycle. Based on field observations, monthly accumulated precipitation, and sediment yield, samples collected in September and October of 2016 (days after typhoons), and June-August and October of 2017 (days after storms) were assigned as high-water samples, whereas the rest were assigned as low-water samples. While most samples were available for geochemical analyses (except that only DNA samples were collected on September 12th, 2016), samples collected on selected dates were used for DNA analyses.

Sample processing

Water samples were filtered using 0.2 µm pore-sized cellulose membranes for different analytical purposes. The particulates on the membranes were frozen in a –20 °C freezer for temporary storage in the field (two days) and in a –80 °C freezer for permanent storage for DNA analyses. Because no additional particulate samples were collected in September and October 2016, wet particulates for DNA analyses were used to infer the sediment concentration. Our in-house test showed that the difference between the dry and wet sediment concentrations was small (<20%) under high sediment conditions (>5 g L−1). Considering that the sediment concentrations of these samples were high (as high as 22 g L−1), wet sediment concentrations were adopted. The filtrates were split into aliquots with or without preservatives for different geochemical analyses. Samples for analyses of isotopic compositions of water were collected by filling polypropylene vials without headspace. Samples for analyses of anion and cation concentrations were collected in polypropylene vials without preservatives and with nitric acid at a final concentration of 2%, respectively. Filtered samples for analyses of isotopic compositions of sulfate were collected in polypropylene bottles, acidified to pH < 4, and preserved with excess BaCl2. Sample processing for DNA and aqueous geochemistry followed similar operational standards and approaches, minimizing the human-induced contamination.

Geochemical analyses

Anions in the filtrates, mainly Cl and SO42−, were analyzed using an ion chromatograph (ICS-3000, Dionex, USA). Cations were analyzed using an inductive coupled plasma mass spectrometry (ICP-MS-7700, Agilent Technologies, USA). Synthetic and river standards were used to calibrate a range of concentrations (depending on elements). Hydrogen and oxygen isotopic compositions of water were measured using an off-axis integrated cavity output spectroscopy (LGR, USA). Sulfur and oxygen isotopic compositions of sulfate were measured using an isotope ratio mass spectrometry (Delta V advantage or a MAT253, Thermo, Germany) connected with an elemental analyzer (Flash 2000 or EA-Isolink, Thermo, Germany) through which BaSO4 precipitated from the sample was combusted at 900 °C with CuO (for sulfur isotope) and at 1100 °C without oxygen (for oxygen isotope) to generate SO2 and CO, respectively. The obtained isotopic compositions were expressed as the δ notation referenced to standards [δ = (Rsample/Rstd ‒1) × 1000‰, where R is the ratio of heavy over light isotopes]. Standards include the Vienna Standard Modern Ocean Water (VSMOW) for hydrogen and oxygen isotopes, and Vienna Canyon Diablo Troilite (VCDT) for sulfur isotopes. Certified reference materials were analyzed along with the samples to monitor their precision. The precisions were ±0.2‰ for δ34S and ±0.4‰ for δ18O for sulfate by the mass spectrometry measurements, ±0.2‰ for δ18O and ±2.0‰ for δ2H‰ by the spectroscopy technology.

Molecular analyses

Crude DNA was extracted from filters, sediments, soils, and rock fragments using the PowerSoil DNA Isolation Kit (Qiagen, Germany). A total of 85 DNA extracts were obtained (Supplementary Table S2) and stored at −80 °C for subsequent analyses. Polymerase chain reaction (PCR) amplification was applied to all DNA extracts using the primers F515 (5’–GTG CCA GCM GCC GCG GTA A–3’) and R806 (5’–CCC GTC AAT TCM TTT RAG T–3’) targeting the V4 region of 16S rRNA genes48. The amplicons were checked on gel electrophoresis, purified, and sequenced on an Illumina Miseq platform (Illumina, USA). For metagenomic analyses, two DNA extracts from river water collected at MLL02 and DL in May 2017 were used to construct libraries. An insert size of 200–600 bp and 50 ng of DNA of each sample were prepared using an Ovation Ultralow System V2 1-96 (Nugen Technologies, USA). Paired-end sequencing was performed on an Illumina MiSeq platform. The schemes and reagents used for PCR were the same as those described in refs. 49,50.

Quantitative PCR (qPCR) was used to analyze the copies of 16S rRNA genes for bacteria, and populations related to genera Sulfuricurvum and Thiobacillus in crude extracts from all sample categories using a MyiQ Real Time PCR Detection System (Bio-Rad, USA) and a QuantStudio 3 Real-Time PCR system (Applied Biosystems, USA). Standard preparations and PCR conditions were the same as those described in refs. 50,51. Primers targeting specific groups included: B27f and EUB338r for bacteria50, Sucl-650f (5’–CTA GAG TTC AGA AGG GGC–3’) and Sucl-839r (5’–GTC ACC GAA AAG ACG AGC–3’) for Sulfuricurvum (in-house design), and Thio3 (5’–AGT TCA AAA CGC CAT TCC CA–3’) and Thio1032 (5’–CCT GTG TTC CGA TTC CCT–3’) for Thiobacillus52. The 16S rRNA gene copies were calculated based on the length of the amplicon, assuming 650 g mol−1 for one base pair of DNA.

Amplicon sequence processing

The bioinformatics packages, Mothur ver.1.40.053 and QIIME2 ver. 2019.1054, were used to process the sequence reads. Quality filtering, dechimerization, and inference of amplicon sequence variants (ASVs) were carried out using the “DADA2” plugin in QIIME255 with the default parameters. The classifier was trained using the SILVA 16S rRNA reference database (release 138) with 99% identity56.

Metagenome assembling

Reads were co-assembled using MEGAHIT (v.1.2.9) with the ‘meta-large’ preset parameter57. Genomic binning was conducted using Metabat 258, MaxBin59, and CONCOCT60, after which genome refinement was performed using MetaWRAP61. The quality of metagenome assembled genomes (MAGs) was measured for the completeness and contamination using CheckM62 and those MAGs with more than 70% completeness and less than 3% contamination were further analyzed (Supplementary Table S5). Taxonomy for MAGs was assigned using GTDB-tk (v1.3.0), based on the Genome Taxonomy Database63. Gene calling was performed using the Prodigal v2.6.364 with translation code 11 within the MetaWRAP pipeline. The PROKKA pipeline65 was used for gene prediction and functional annotation. The sequences of 16S rRNA genes were identified from the MAGs using the ‘ssu_finder’ tool integrated within CheckM. The abundance of MAGs was calculated with Salmon66 built in MetaWRAP67 and expressed as genome copies per million reads.

Abundances of key functional genes in metagenomes

To determine the abundance of genes matching specific metabolic functions, merged and filtered reads were mapped to assembled contigs using bowtie2 with the default settings68. The number of reads mapped to open reading frames (ORFs) of defined functions based on annotations of the JGI KEGG Orthology database67,69,70 were counted. Specific ORFs of interest were manually checked with tablet searches using an e-value cutoff of 10−12. Key metabolic genes were defined according to previously published databases71,72. The count of read hits to specific key genes was divided by the average count of read hits to 35 single-copy COGs to normalize the gene abundance for each sample72,73.

Microbial community analyses

To equalize the sequencing depth for 16S rRNA gene analysis, the read numbers for individual samples were rarefied to 4899 (the lowest sequence number across all samples) through 100 sequence random re-sampling of the original ASV table. The rarefied sequences were used to calculate the alpha diversity indices, including richness (observed-species and Chao174) and evenness (Shannon diversity75 and ACE76), to assess the diversity and completeness of the ASV representation in each sample. For beta diversity, the entire ASV table was normalized using the function cumNorm from the R package metagenomeSeq77. A cumulative-sum scaling method was used to calculate the scaling factors equal to the sum of counts up to a particular quantile to normalize the read counts that takes uneven sequencing depth into account77. The dissimilarity matrix between samples was computed using the Bray-Curtis method78, and visualized through the ordination of non-metric multidimensional scaling (NMDS). Permutational multivariate analysis of variance (PERMANOVA) with 1000 permutations was used to identify changes in microbial community structure between water levels or sample categories79. All statistical analyses were performed in R using packages ggplot2, phyloseq, and vegan80,81,82.

Network analyses

Packages, including igraph, Hmisc, and qvalue libraries in R, were used to construct the co-occurrence network. To reduce the complexity, only ASVs with a proportion of >0.05% across all samples were retained. In this regard, 271 and 276 ASVs from 49 and 63 samples were used for the high- and low-water periods, respectively. The pairwise Spearman’s correlation between ASVs was valid with a correlation coefficient >|0.7| and a p value < 0.01 (Benjamini and Hochberg adjusted). Network visualization and modular analysis were conducted using Gephi83.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.