Knowledge discovery from high-frequency stream nitrate concentrations: hydrology and biology contributions

Aubert, Alice H.; Thrun, Michael C.; Breuer, Lutz; Ultsch, Alfred

doi:10.1038/srep31536

Download PDF

Article
Open access
Published: 30 August 2016

Knowledge discovery from high-frequency stream nitrate concentrations: hydrology and biology contributions

Alice H. Aubert¹^nAff4,
Michael C. Thrun²,
Lutz Breuer^1,3 &
…
Alfred Ultsch²

Scientific Reports volume 6, Article number: 31536 (2016) Cite this article

2367 Accesses
17 Citations
3 Altmetric
Metrics details

Subjects

Abstract

High-frequency, in-situ monitoring provides large environmental datasets. These datasets will likely bring new insights in landscape functioning and process scale understanding. However, tailoring data analysis methods is necessary. Here, we detach our analysis from the usual temporal analysis performed in hydrology to determine if it is possible to infer general rules regarding hydrochemistry from available large datasets. We combined a 2-year in-stream nitrate concentration time series (time resolution of 15 min) with concurrent hydrological, meteorological and soil moisture data. We removed the low-frequency variations through low-pass filtering, which suppressed seasonality. We then analyzed the high-frequency variability component using Pareto Density Estimation, which to our knowledge has not been applied to hydrology. The resulting distribution of nitrate concentrations revealed three normally distributed modes: low, medium and high. Studying the environmental conditions for each mode revealed the main control of nitrate concentration: the saturation state of the riparian zone. We found low nitrate concentrations under conditions of hydrological connectivity and dominant denitrifying biological processes, and we found high nitrate concentrations under hydrological recession conditions and dominant nitrifying biological processes. These results generalize our understanding of hydro-biogeochemical nitrate flux controls and bring useful information to the development of nitrogen process-based models at the landscape scale.

EUSEDcollab: a network of data from European catchments to monitor net soil erosion by water

Article Open access 04 August 2023

Francis Matthews, Gert Verstraeten, … Panos Panagos

Eighteen years of upland grassland carbon flux data: reference datasets, processing, and gap-filling procedure

Article Open access 23 May 2023

Bruna R. Winck, Juliette M. G. Bloor & Katja Klumpp

The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data

Article Open access 09 July 2020

Gilberto Pastorello, Carlo Trotta, … Dario Papale

Introduction

Human activities modify the global nitrogen cycle, particularly through farming¹. These practices have unintended consequences; for example, nitrate lost from terrestrial runoff to streams and estuaries can impact aquatic life². Thus, studying nitrate export, i.e., nitrate concentrations at the outlet of a watershed, is a major concern. Existing labor-intensive monitoring strategies that have been in place for several decades have recently been complemented by the development of in-situ technologies that allow for high-frequency (sub-hourly) sampling. High-frequency monitoring has been shown to be a beneficial addition to the previous lower frequency monitoring schemes³. A decade ago, high-frequency monitoring was expected to bring new insights into watersheds functioning⁴, and indeed, it has helped⁵ identify sources and transport pathways of nutrients⁶ and quantify processes and metabolisms of coupled nutrients⁷ across multiple time scales⁸. This has allowed researchers to disentangle the effects of multiple processes⁹. Now, the availability of several year-long high-frequency datasets invites the application of data mining techniques¹⁰.

Catchments are dynamic systems, and present observations rely on previous hydrological states. In water sciences, data are mostly analyzed with respect to time. Analyses focus on either long-term, seasonal, or short-term variations, including fluctuations resulting from flood events or diurnal cycles. Temporal data structure is regularly analyzed in the time and frequency domain by time series decomposition^11,12 and spectral analysis^8,13,14,15, respectively. These methods are used to identify cycles and variability in the main transfer processes. Time-variant process modelling also allows us to explain the old water paradox^16,17,18.

In this study, we look at nitrate concentration data differently by neglecting its temporal component in the data analysis. This approach is now possible and worth considering given the availability of datasets large enough to mine and expand our general knowledge. Data structures have already been studied independently of time, e.g., plotting a variable of interest against another variable. For example, correlating observed nitrate concentration with a simulated index based on the watershed wetness state¹⁹ refined the flushing hypothesis, and relating production of nitrogen gaseous species to the percent of the soil’s pore volume filled by water²⁰ defined a conceptual model of nitrogen oxide emissions from soil. These non-temporal data structure analyses have brought new insights in watershed functioning and the nitrogen cycle.

Non-temporal data structure analysis was also used to compare high- and low-frequency monitoring data³. There, probability density functions (PDF, a function describing the likelihood a variable can take a given value) were estimated using kernel density estimators. However, as in any PDF estimation method, one of the critical parameters, the kernel width, was left undetermined. If too large of a kernel width is chosen, important structures may be undetected. Likewise, if the kernel width is too small, random fluctuations are overemphasized. To avoid an unclear choice in kernel width, we used the Pareto Density Estimation (PDE), in which kernel width has proved to be particularly suitable for detecting modes in continuous data. PDE is particularly suitable for the discovery of mixtures of Gaussians²¹, but in the case of skewed distributions, transforming the data is required beforehand. In other scientific domains, thorough analysis of data structure focused primarily on the estimation of the PDF using the Pareto Density Estimation (PDE) of a variable of interest^21,22.

The objective of this study is to generalize–or cast doubt on–the current understanding of nitrate fluxes at the catchment scale. At this point, in-stream nitrate concentrations in an agricultural catchment are mainly described in relation to time. Seasonal and event-related variations in nitrogen sources and transport processes throughout the year or during a wetting-drying cycle confer seasonal and short-term fluctuations to nitrate concentrations. To avoid the tendency of reinforcing the understanding of already described relationships, we included all measured variables from the catchment²³. This naïve look at the data is common in data mining. We focused on the shape of the PDFs from high-frequency nitrate concentrations monitored in a 3.7-km² mixed-land use catchment. Environmental variables (discharge, groundwater depth, soil moisture, soil temperature, stream temperature, stream conductivity, rainfall, air temperature and solar radiation) were considered as potential explanatory variables and were used in the process of knowledge discovery to identify the drivers of nitrate fluctuations in the catchment. Particularly, we were interested in whether these drivers are the same for low and high nitrate concentrations, as this result may assist in refining mechanistic models of nitrate fluxes.

Results

The large dataset

The available dataset contained in total 32,196 data points for each of the 14 variables (in total, 4% missing data), making it suitable for data mining (pp. 243 of ²⁴). The raw time series for each year are presented in Fig. 1. For technical reasons, no nitrate data were available during winter, so the actual time span of nitrate monitoring was 05 March 2013 12:45 to 24 September 2013 12:30 and 27 April 2014 00:00 to 23 October 13:15. Data were analyzed as a whole, without differentiating between the hydrological years. To do this, we filtered out the seasonal variation (see methods). Hereafter, when referring to sub-daily high-frequency fluctuations (after seasonal variation removal) a tilde (~) superscript will be added to the variable names.

High-frequency nitrate data: a composite of the three modes

The PDF of the empirical values of nitrate~ concentration from the Vollnkirchener Bach watershed was modelled using PDE, resulting in three distinctive modes (see methods). The estimation of the empirical distribution (black curve) was modelled (red curve) using a Gaussian mixture model (GMM) composed of three Gaussians (blue curves) (Fig. 2). The goodness of fit was visualized with a quantile-quantile plot (Supplementary Fig. S1) and verified statistically with the Xi-Quadrat test (p = 1e-05) and Kolmogorov–Smirnov test (p < 1e-10). The central Gaussian represented typical nitrate~ concentrations, while the left and right Gaussians described the lower and higher concentrations of nitrate~, respectively. Bayes Theorem was used to calculate the class posterior probabilities. This calculation classified nitrate~ into the three classes: low (5% of the data), typical (89% of the data), and high nitrate~ concentrations (6% of the data).

**Figure 2: Component Gaussian Mixture Model (GMM) (blue lines), superposition of the Gaussians (GMM) (red line), and the PDF describing sub-daily nitrate~ in-stream concentrations (black line).**

Modes characterized by environmental conditions

We compared the concurrently measured environmental variables for each mode of nitrate~. Lower nitrate~ was characterized by more superficial groundwater depth (GW32~), higher soil moisture (Smoist24~) and, on average, lower solar radiation (Sol71~) (Fig. 3, Table 1). High nitrate~ was characterized by deeper groundwater depth (GW32~), moderate soil moisture (Smoist24~) and, on average, higher solar radiation (Sol71~) than the low nitrate~ Gaussian curves. The typical nitrate~ class was quite similar to the high nitrate~ class, but the high nitrate~ was associated with more humid soils.

Table 1 The p-values of the differences between environmental conditions corresponding to each nitrate~ mode (low-typical, low-high, typical-high) are calculated by the Bonferroni corrected two-sample t-test with unequal variances, where “n.s.” indicates a non-significant result.

Full size table

Discussion

Variables driving rapid nitrate fluctuations

The three depths of groundwater (GW3~, GW25~ and GW32~) represent the typical range in the spatial variability of the groundwater table conditions. They are located in the mid-reach lowland meadow (GW3), the cultivated land on the hillslope (GW25), and in a riparian meadow where a temporary tributary joins the stream (GW32)²⁵. The riparian meadow groundwater depth was always selected as a driver of nitrate~. This result supports the importance of near-stream zones that are often reported as having a major impact on stream water quality^26,27. GW32~ shows little seasonal fluctuation; most of the time, this groundwater depth remains high, which reflects connectivity to the stream network. This location is also more reactive than the other piezometers (Fig. 1). Conversely, the hillslope and lowland meadow groundwater depths are less reactive and fluctuate seasonally with a high amplitude. These locations have little influence on nitrate~ concentrations. The short-term nitrate fluctuations support the assumption that the constantly connected landscape elements are a major determinant of the high-frequency variability of solutes. Landscape elements for which connectivity exhibits low frequency fluctuation of a high amplitude are not predominant for stream water chemistry at the fine time scale^28,29. Soil moisture (Smoist24~) was determined to be another major driver of nitrate~ concentrations in the high nitrate mode, supporting the results of previous studies^30,31. Discharge also impacted nitrate~. Last, electric conductivity (cond47~) generally follows nitrate~ concentrations: when nitrate~ is low, conductivity (which also accounts for nitrate salts) is low.

Generally, air-, soil- and stream- temperatures~ are not meaningful to explain the high frequency fluctuations in water chemistry. Air, soil and stream temperatures showed primarily low-frequency fluctuations, aligning with variables such as groundwater depth in the agricultural hillslope, and the temperature data formed almost perfect Gaussians. Perfect Gaussians characterize variables with a clear combination of sinusoidal patterns, for both seasonal and diurnal time scales. Rainfall intensity (rain~) was not meaningful to explain the high frequency fluctuations in nitrate~ either. The lack of relation between rainfall and nitrate~ supports the findings of a previous study³² performed in the catchment using isotopic signatures. In the isotopic study, stream water was found to be more similar to groundwater than to rain water or soil water, illustrating the “old water paradox”^16,33 once more, where old water flows during storm events. Rain and high celerity (the speed at which the perturbation wave is transmitted) lead to a reactive stream water level; however, it does not imply that rainwater is transported directly to the stream, where it could affect nitrate. Stream flow velocity, defining chemical transport, is, by definition, lower than celerity³⁴.

Combined effect of hydrology and biology on nitrate~

The low nitrate~ mode (left curve, Fig. 2) is driven by groundwater depth close to the surface and high soil moisture (Fig. 3), indicating the subsurface is saturated and hydrologically connected. Moreover, under such wet conditions, denitrification might be the most active biological process, adding to the importance of the hydrological state. We conclude that the low nitrate~ mode is defined by hydrological connectivity and a dominance of the denitrifying biological activity, that is, by a saturated catchment.

The high nitrate~ mode (right curve, Fig. 2) is driven by high solar radiation and deep groundwater, but soils are still moist. High solar radiation could suggest high evapotranspiration, given that moist soil indicates water is sufficiently available to plants. This behavior is typical for drying conditions. The drying phase has been linked to biological changes in the microbial community of soil aggregates³⁵. Microbial diversity should increase with drying. When nutrient transport is reduced because of limited diffusion and the gaseous phase in the pores becomes important, anaerobic and aerobic communities will likely coexist. Thus, the denitrifying community is no longer the only one active; nitrification can occur. This balance between nitrification and denitrification could lead to the production and build-up of nitrate in the soil. This nitrate can then be mobilized during low intensity rain events. We interpret that high levels of nitrate~ are defined by hydrological recession and biologically active soils, where nitrification dominates.

The tipping-point, threshold or “hot moment”²⁶, when biological drivers over-take hydrological drivers, is still unclear and needs to be determined in future work. The shift from denitrification to nitrification dominance also needs further data-based research. These interpretations align with models based on hydrological storage, distinguishing celerity and velocity^17,36,37,38; however, these hydrological models were developed for conservative tracers, such as chloride. Our work highlights the need to add biological processes to hydrological models to allow for the production and consumption of chemicals, such as nitrate, known to strongly affect our environment in some regions.

A method for initial data exploration

Our goal is to draw attention to the benefits of thorough analyses of large environmental datasets. In this case study, we show that new knowledge can be mined from empirical PDEs, thus we recommend data mining as a first step to understanding the driving forces in a catchment because it can provide a simplified, non-temporal view of solute export. This approach provides a glimpse of the catchment’s behavior, which is the compilation of many processes, by making use of low-flow data as well as storm-flow data²⁷. By considering the variable of interest, in this case nitrate~, as non-temporal, the system was simplified and data structuration was observed²¹.

Data mining revealed a differentiation in nitrate~ modes and differences in underlying conditions. The roles of hydrology and soil microbiology in controlling nitrate~ were highlighted. Low nitrate~ occurred under hydrological connectivity and microbial denitrification. High nitrate~ occurred under hydrological recession and nitrifying conditions. The highly fluctuating component of the nitrate concentrations seems to be influenced by the saturation state of the catchment, although the seasonal component, which is known to be driven by saturation state, was removed. We are confident that other datasets analyzed with the described methods herein will produce strong advances in the interpretation of catchment hydro-biogeochemical processes. In future high frequency monitoring work, it will be important to monitor variables that allow the identification of the various biological processes occurring in the soil and in-stream, as the latter are reported to dampen terrestrial signals³⁹. The difficulty will be to find variables that can be easily and cost-effectively measured at high temporal resolution. Potential biological variables include in-stream measurements of biological oxygen demand (BOD) and soil redox potential. We showed that connectivity plays an important role in nitrate concentrations; therefore, the identification of contributing (or connected) areas as well as the spatial identification of controlling variables would shed further light on solute export⁴⁰. In the future, the development of networks of sensors⁴¹ or the use of high-temporal sensing distributed throughout a catchment⁴² could help to overcome these limitations.

Methods

Nitrate concentration data were collected for two years in the Vollnkirchener Bach watershed, which is nested in the Critical Zone Observatory of the Schwingbachtal, in central Germany (references and data available at http://fb09-pasig.umwelt.uni-giessen.de:8081/). Technical issues and data checking reduced the time span to two growing seasons (05 March 2013 to 24 September 2013, n = 15,475 measurements and 27 April 2014 to 23 October 2014 n = 16,721 measurements, in total n = 32,196 measurements). Land-use is dominated by agricultural land and forests, covering 44 and 48% of the catchment, respectively. An in-situ hyperspectral UV-spectrometer (ProPS, Trios, Rastede, Germany, wavelength range 200–360 nm, path length 5 mm, solar panel supplied) measured absorption spectra every 15 min after a 5 s air blast to prevent the optics from biofouling. Wavelengths of 200–220 nm allowed the calculation of nitrate concentration, using a calibration adapted to the stream water’s baseline chemical composition. More detailed information on the calibration and the nitrate data checking is reported in ref. 43.

Other variables were monitored at high-frequency and used to explain the variations in nitrate, as they depicted the catchment state. Discharge (q, l s⁻¹) and water temperature (Wt, °C) were measured at two gauging stations q13/Wt13 at the outlet and q18/Wt18 upstream and were measured every 5 min by pressure transducers (Diver DCX, Schlumberger Water Services, ON, Canada). Groundwater depth (GW, m) at three wells (GW25 on the hillslope, GW3 in lowland and GW32 in the riparian zone) were measured every 10 min by pressure transducers. Meteorology, i.e., air temperature (At47, °C), solar radiation (Sol71, W m²) and rainfall intensity (rain, mm), was captured every 5 min at a climate station 4 km from the outlet (Campbell Scientific Inc., CR1000 data logger, Loughborough, UK). Soil moisture (Smoist24, m³ m⁻³) and soil temperature (St24, °C) were measured hourly at 0.1 m depth, in the riparian zone, by electromagnetic induction (5TE sensors, EM50 data logger, Decagon, Labcell LTD, Alton, UK) beginning on 14 June 2013. Some of these variables are expected to directly influence nitrate in-stream concentrations, such as groundwater depths or rainfall intensity. Others are considered as proxies for biological activity, such as temperature and soil moisture, and evapotranspiration, the variable solar radiation⁴³.

All time series were detrended to create a joint data set for both years. This process allowed the analysis of rapid fluctuations in the variables and considered both growing seasons at once. A two-component model with the variable-baseline subtracted from the raw time series was applied to obtain the high-frequency component of the variables. The variable-baseline was calculated using a low pass filter as a Fourier Transformation; the filter was set to 50 days. Thus, the high-frequency component is composed of fluctuations below the monthly time scale, down to 15 min. This residual time series is interpreted as a rapid and high-temporal fluctuation and is marked with a tilde (~) throughout the manuscript. Discharges and rainfall, which were typical of a reactive catchment, presented a seasonal baseline set to zero.

We then focused on our variable of interest: nitrate. First, we modelled the nitrate with three distinctive modes using the Adapt Gauss toolbox²² as shown in Fig. 2. The Adapt Gauss¹⁷ toolbox in R package allows for the modelling and verification of possible multimodal distributions as a mixture of Gaussian components. This approach is called Gaussian Mixture Modelling (GMM). Verification of the model was performed visually using a QQplot (Fig. S1A) and statistically with a Xi-Quadrat test (p < 1e-05) and a Kolmogorov–Smirnov test (p < 1e-10). In other words, GMM was constructed to fit the nitrate’s empirical PDF. The number of modes was calculated as the minimum of the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC)⁴⁴. AIC and BIC were computed for the GMM with one to ten modes using an Expectation-Maximization (EM) optimization in the R package mclust⁴⁵ (Fig. S1B). AIC and BIC were both in agreement. EM fitting, using a user-defined starting point in the Adapt Gauss toolbox²², resulted in better values for AIC and BIC. The GMM was supported by goodness-of-fit checks. These checks resulted in three different Gaussians for high-frequency nitrate~ (Fig. S1B). Bayes Theorem was used to calculate the class posterior probabilities.

The data were then mined to address if there were different drivers for the high and low nitrate~ concentrations. All data points for the potentially related variables were grouped according to the synchronous nitrate mode into the same three distributions: low, typical and high with respect to nitrate. We compared the three distributions for each variable visually using the PDE²¹ (Supplementary Information, Fig. S2), using boxplots resulting from the PDE (Fig. 3), and statistically using a Bonferroni corrected two-sample t-test for unequal sample sizes and unequal variances (Table 1). We only considered the variables that showed visually and statistically significant (p-values > 0.01) differences between modes in our interpretation.

Additional Information

How to cite this article: Aubert, A. H. et al. Knowledge discovery from high-frequency stream nitrate concentrations: hydrology and biology contributions. Sci. Rep. 6, 31536; doi: 10.1038/srep31536 (2016).

References

Fowler, D. et al. The global nitrogen cycle in the twenty-first century. Phil. Trans. R. Soc. B 368, 20130164, doi: 10.1098/rstb.2013.0164 (2013).
Article CAS PubMed Google Scholar
Vitousek, P. M. et al. Human alteration of the global nitrogen cycle: sources and consequences. Ecological Applications 7, 737–750, doi: 10.1890/1051-0761(1997)007[0737:HAOTGN]2.0.CO;2 (1997).
Article Google Scholar
Bieroza, M. Z., Heathwaite, A. L., Mullinger, N. J. & Keenan, P. O. Understanding nutrient biogeochemistry in agricultural catchments: the challenge of appropriate monitoring frequencies. Environ. Sci.: Processes Impacts 16, 1676–1691, doi: 10.1039/C4EM00100A (2014).
Article CAS Google Scholar
Kirchner, J. W., Feng, X., Neal, C. & Robson, A. J. The fine structure of water-quality dynamics: the (high-frequency) wave of the future. Hydrological Processes 18, 1353–1359, doi: 10.1002/hyp.5537 (2004).
Article ADS Google Scholar
Rode, M. et al. Sensors in the stream: the high-frequency wave of the present. Environmental Science & Technology (submitted).
Bowes, M. J. et al. Characterising phosphorus and nitrate inputs to a rural river using high-frequency concentration–flow relationships. Science of The Total Environment 511, 608–620, doi: 10.1016/j.scitotenv.2014.12.086 (2015).
Article CAS ADS PubMed Google Scholar
Hensley, R. T., Cohen, M. J. & Korhnak, L. V. Inferring nitrogen removal in large rivers from high-resolution longitudinal profiling. Limnology and Oceanography 59, 1152–1170, doi: 10.4319/lo.2014.59.4.1152 (2014).
Article CAS ADS Google Scholar
Aubert, A. H. et al. Fractal Water Quality Fluctuations Spanning the Periodic Table in an Intensively Farmed Watershed. Environmental Science & Technology 48, 930–937, doi: 10.1021/es403723r (2014).
Article CAS ADS Google Scholar
Read, D. S., Bowes, M. J., Newbold, L. K. & Whiteley, A. S. Weekly flow cytometric analysis of riverine phytoplankton to determine seasonal bloom dynamics. Environ. Sci.: Processes Impacts 16, 594–603, doi: 10.1039/C3EM00657C (2014).
Article CAS Google Scholar
Babovic, V. Data mining in hydrology. Hydrological Processes 19, 1511–1515, doi: 10.1002/hyp.5862 (2005).
Article ADS Google Scholar
Worrall, F. & Burt, T. P. A univariate model of river water nitrate time series. Journal of Hydrology 214, 74–90, doi: 10.1016/S0022-1694(98)00249-2 (1999).
Article CAS ADS Google Scholar
Lee, T. & Ouarda, T. B. M. J. Stochastic simulation of nonstationary oscillation hydroclimatic processes using empirical mode decomposition. Water Resources Research 48, W02514, doi: 10.1029/2011wr010660 (2012).
Article ADS Google Scholar
Kirchner, J. W. Catchment-scale advection and dispersion as a mechanism for fractal scaling in stream tracer concentrations. Journal of Hydrology 254, 82–101 (2001).
Article ADS Google Scholar
Koirala, S. R., Gentry, R. W., Mulholland, P. J., Perfect, E. & Schwartz, J. S. Time and frequency domain analyses of high-frequency hydrologic and chloride data in an east Tennessee watershed. Journal of Hydrology 387, 256–264, doi: 10.1016/j.jhydrol.2010.04.014 (2010).
Article CAS ADS Google Scholar
Gunnerson, C. G. Optimizing sampling intervals in tidal estuaries. J. Sanit. Eng. Div. ASCE 92, 103–125 (1966).
Google Scholar
Kirchner, J. W. A double paradox in catchment hydrology and geochemistry. Hydrological Processes 17, 871–874, doi: 10.1002/hyp.5108 (2003).
Article ADS Google Scholar
Benettin, P., Kirchner, J. W., Rinaldo, A. & Botter, G. Modeling chloride transport using travel time distributions at Plynlimon, Wales. Water Resources Research 51, 3259–3276, doi: 10.1002/2014WR016600 (2015).
Article CAS ADS Google Scholar
Hrachowitz, M., Fovet, O., Ruiz, L. & Savenije, H. H. G. Transit time distributions, legacy contamination and variability in biogeochemical 1/fα scaling: how are hydrological response dynamics linked to water quality at the catchment scale? Hydrological Processes, n/a-n/a, doi: 10.1002/hyp.10546 (2015).
Creed, I. F. & Band, L. E. Exploring functional similarity in the export of Nitrate-N from forested catchments: A mechanistic modeling approach. Water Resources Research 34, 3079–3093, doi: 10.1029/98WR02102 (1998).
Article CAS ADS Google Scholar
Davidson, E. A., Keller, M., Erickson, H. E., Verchot, L. V. & Veldkamp, E. Testing a Conceptual Model of Soil Emissions of Nitrous and Nitric Oxides Using two functions based on soil nitrogen availability and soil water content, the hole-in-the-pipe model characterizes a large fraction of the observed variation of nitric oxide and nitrous oxide emissions from soils. BioScience 50, 667–680, doi: 10.1641/0006-3568(2000)050[0667:TACMO (2000).
Article Google Scholar
Ultsch, A. In Innovations in Classification, Data Science, and Information Systems Studies in Classification, Data Analysis, and Knowledge Organization (eds Daniel Baier & Klaus-Dieter Wernecke ) Ch. 12, 91–100 (Springer Berlin Heidelberg, 2005).
Ultsch, A., Thrun, M., Hansen-Goos, O. & Lötsch, J. Identification of Molecular Fingerprints in Human Heat Pain Thresholds by Use of an Interactive Mixture Model R Toolbox (AdaptGauss). International Journal of Molecular Sciences 16, 25897 (2015).
Article CAS Google Scholar
Hogarth, R. In Judgement and choice, the psychology of decision Ch. 6, 114–131 (John Wiley and sons, 1987).
Lee, J. A. & Verleysen, M. Nonlinear dimensionality reduction. (Springer, 2007).
Orlowski, N., Lauer, F., Kraft, P., Frede, H.-G. & Breuer, L. Linking Spatial Patterns of Groundwater Table Dynamics and Streamflow Generation Processes in a Small Developed Catchment. Water 6, 3085–3117, doi: 10.3390/w6103085 (2014).
Article Google Scholar
McClain, M. E. et al. Biogeochemical Hot Spots and Hot Moments at the Interface of Terrestrial and Aquatic. Ecosystems 6, 301–312, doi: 10.1007/s10021-003-0161-9 (2003).
Article CAS Google Scholar
Lischeid, G. Combining Hydrometric and Hydrochemical Data Sets for Investigating Runoff Generation Processes: Tautologies, Inconsistencies and Possible Explanations. Geography Compass 2, 255–280, doi: 10.1111/j.1749-8198.2007.00082.x (2008).
Article Google Scholar
Aubert, A. H. et al. Solute transport dynamics in small, shallow groundwater-dominated agricultural catchments: insights from a high-frequency, multisolute 10 yr-long monitoring study. HESS 17, 1379–1391, doi: 10.5194/hess-17-1379-2013 (2013).
Article CAS ADS Google Scholar
Herndon, E. M. et al. Landscape heterogeneity drives contrasting concentration–discharge relationships in shale headwater catchments. HESS 19, 3333–3347, doi: 10.5194/hess-19-3333-2015 (2015).
Article CAS ADS Google Scholar
Western, A. W., Blöschl, G. & Grayson, R. B. Toward capturing hydrologically significant connectivity in spatial patterns. Water Resources Research 37, 83–97, doi: 10.1029/2000WR900241 (2001).
Article ADS Google Scholar
Borken, W. & Matzner, E. Reappraisal of drying and wetting effects on C and N mineralization and fluxes in soils. Global Change Biology 15, 808–824, doi: 10.1111/j.1365-2486.2008.01681.x (2009).
Article ADS Google Scholar
Orlowski, N., Kraft, P. & Breuer, L. Exploring water cycle dynamics through sampling multitude stable water isotope pools in a small developed landscape of Germany. Hydrol. Earth Syst. Sci. Discuss. 12, 1809–1853, doi: 10.5194/hessd-12-1809-2015 (2015).
Article ADS Google Scholar
Botter, G., Bertuzzo, E. & Rinaldo, A. Transport in the hydrologic response: Travel time distributions, soil moisture dynamics, and the old water paradox. Water Resources Research 46, n/a–n/a, doi: 10.1029/2009WR008371 (2010).
Article Google Scholar
McDonnell, J. J. & Beven, K. Debates—The future of hydrological sciences: A (common) path forward? A call to action aimed at understanding velocities, celerities and residence time distributions of the headwater hydrograph. Water Resources Research 50, 5342–5350, doi: 10.1002/2013WR015141 (2014).
Article ADS Google Scholar
Ebrahimi, A. & Or, D. Hydration and diffusion processes shape microbial community organization and function in model soil aggregates. Water Resources Research 51, 9804–9827, doi: 10.1002/2015WR017565 (2015).
Article ADS Google Scholar
Harman, C. J. Time-variable transit time distributions and transport: Theory and application to storage-dependent transport of chloride in a watershed. Water Resources Research 51, 1–30, doi: 10.1002/2014WR015707 (2015).
Article CAS ADS Google Scholar
Hrachowitz, M., Savenije, H., Bogaard, T. A., Tetzlaff, D. & Soulsby, C. What can flux tracking teach us about water age distribution patterns and their temporal dynamics? HESS 17, 533–564, doi: 10.5194/hess-17-533-2013 (2013).
Article ADS Google Scholar
Rinaldo, A. et al. Storage selection functions: A coherent framework for quantifying how catchments store and release water and solutes. Water Resources Research 51, 4840–4847, doi: 10.1002/2015WR017273 (2015).
Article ADS Google Scholar
Bernhardt, E. S. et al. Can’t See the Forest for the Stream? In-stream Processing and Terrestrial Nitrogen Exports. BioScience 55, 219–230, doi: 10.1641/0006-3568(2005)055[0219:ACSTFF]2.0.CO;2 (2005).
Article Google Scholar
Ali, G. A. & Roy, A. G. Shopping for hydrologically representative connectivity metrics in a humid temperate forested catchment. Water Resources Research 46, n/a–n/a, doi: 10.1029/2010WR009442 (2010).
Article Google Scholar
Bogena, H. R., Huisman, J. A., Oberdörster, C. & Vereecken, H. Evaluation of a low-cost soil water content sensor for wireless network applications. Journal of Hydrology 344, 32–42, doi: 10.1016/j.jhydrol.2007.06.032 (2007).
Article ADS Google Scholar
Lauer, F., Frede, H.-G. & Breuer, L. Uncertainty assessment of quantifying spatially concentrated groundwater discharge to small streams by distributed temperature sensing. Water Resources Research 49, 400–407, doi: 10.1029/2012WR012537 (2013).
Article ADS Google Scholar
Aubert, A. H. & Breuer, L. New seasonal shift in in-stream diurnal nitrate cycles identified by mining high-frequency data. PLoS ONE in press, doi: 10.1371/journal.pone.0153138 (2016).
Aho, K., Derryberry, D. & Peterson, T. Model selection for ecologists: the worldviews of AIC and BIC. Ecology 95, 631–636, doi: 10.1890/13-1452.1 (2014).
Article PubMed Google Scholar
Fraley, C. & Raftery, A. E. Model-Based Clustering, Discriminant Analysis, and Density Estimation. Journal of the American Statistical Association 97, 611–631 (2002).
Article MathSciNet Google Scholar

Download references

Acknowledgements

Authors warmly thank Ina Plesca, Tobias Houska, Nicole Werstein for their field work (i.e., data collection) and data checking.

Author information

Alice H. Aubert
Present address: Present address: Eawag - Swiss Federal Institute of Aquatic Science and Technology, Ueberlandstrasse 133, 8600 Duebendorf, Switzerland,

Authors and Affiliations

Institute for Landscape Ecology and Resources Management (ILR), Research Centre for BioSystems, Land Use and Nutrition (iFZ), Justus Liebig University Giessen, Heinrich-Buff-Ring 26, Giessen, D-35392, Germany
Alice H. Aubert & Lutz Breuer
Databionics, Mathematics and Computer Science, Philipps University Marburg, Hans-Meerwein-Strasse 6, Marburg, D-35032, Germany
Michael C. Thrun & Alfred Ultsch
Centre for International Development and Environmental Research (ZEU), Justus Liebig University Giessen Goethestrasse 58, D-35390 Giessen, Germany. ,
Lutz Breuer

Authors

Alice H. Aubert
View author publications
You can also search for this author in PubMed Google Scholar
Michael C. Thrun
View author publications
You can also search for this author in PubMed Google Scholar
Lutz Breuer
View author publications
You can also search for this author in PubMed Google Scholar
Alfred Ultsch
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.H.A. and L.B. developed the broad goals of this study. M.C.T. and A.U. completed the computational analyses, supporting the methodological component. A.H.A. and L.B. interpreted the results. A.H.A., M.C.T., L.B. and A.U. wrote the paper.

Corresponding author

Correspondence to Alice H. Aubert.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (PDF 875 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Aubert, A., Thrun, M., Breuer, L. et al. Knowledge discovery from high-frequency stream nitrate concentrations: hydrology and biology contributions. Sci Rep 6, 31536 (2016). https://doi.org/10.1038/srep31536

Download citation

Received: 23 February 2016
Accepted: 21 July 2016
Published: 30 August 2016
DOI: https://doi.org/10.1038/srep31536

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.