Environmental DNA reveals that rivers are conveyer belts of biodiversity information

Deiner, Kristy; Fronhofer, Emanuel A.; Mächler, Elvira; Walser, Jean-Claude; Altermatt, Florian

doi:10.1038/ncomms12544

Article
Open access
Published: 30 August 2016

Nature Communications volume 7, Article number: 12544 (2016) Cite this article

Abstract

DNA sampled from the environment (eDNA) is a useful way to uncover biodiversity patterns. By combining a conceptual model and empirical data, we test whether eDNA transported in river networks can be used as an integrative way to assess eukaryotic biodiversity for broad spatial scales and across the land–water interface. Using an eDNA metabarcode approach, we detect 296 families of eukaryotes, spanning 19 phyla across the catchment of a river. We show for a subset of these families that eDNA samples overcome spatial autocorrelation biases associated with the classical community assessments by integrating biodiversity information over space. In addition, we demonstrate that many terrestrial species are detected; thus suggesting eDNA in river water also incorporates biodiversity information across terrestrial and aquatic biomes. Environmental DNA transported in river networks offers a novel and spatially integrated way to assess the total biodiversity for whole landscapes and will transform biodiversity data acquisition in ecology.

Introduction

While rivers cover <1% of the landmasses on earth, they are invaluable for biodiversity and ecosystem services, such as drinking water and energy production¹. Rivers, because of their characteristic dendritic network structure, also integrate information about the landscape through the collection and transport of sediments, organic matter, nutrients, chemicals and energy^2,3. For example, information contained in sediments allows us to understand how river drainages form and change in time as a result of climate and tectonic forces⁴. Rivers also act as the lung of the landscape by releasing large fluxes of CO₂ derived from terrestrial plant macromolecules, such as lignin and cellulose, through the breakdown and transport of coarse and fine particulate organic matter⁵. River networks additionally play an important role in shaping patterns of genetic and species diversity for many organisms across the landscape by dictating dispersal pathways^6,7.

Organic matter in the form of DNA is produced from organisms and is also transported through rivers via cells, tissues, gametes or organelles, and is termed environmental DNA (eDNA)^8,9,10. DNA can be isolated from these organismal remains in the water, sequenced and assigned back to the species of origin through the method of eDNA metabarcoding^10,11. This elegant process of collection and detection of a species' DNA is becoming highly valuable for sampling biodiversity in ecology and conservation^10,11,12,13,14,15,16,17. The spatial signal of eDNA has only recently been explored and shows that in rivers eDNA can be transported over larger distances^8,18. Therefore, we hypothesized that rivers, through the aggregation and transport of eDNA, act as conveyer belts of biodiversity information that can be used to estimate species richness over broad spatial scales and potentially across the land–water interface.

The relevance of biodiversity sampling with eDNA found in river water is twofold. First, identifying biodiversity hotspots is invaluable for prioritizing global and regional conservation efforts¹⁹. Richness estimates used to decide whether a place is a hotspot have suffered from under-sampling²⁰. Under-sampling of biodiversity has many causes (and consequences) in conservation and ecology in general, but mainly comes from the sampling methods used for estimating richness in a way that is aggregated with respect to space²¹. For example, a classical method for estimating richness of aquatic macroinvertebrates in rivers is to use a kicknet method, where all individuals in a certain defined area of a stream are collected in a net²². Many such samples are then taken and subsequently pooled to represent richness for an entire river stretch or catchment. The pooling of spatially autocorrelated samples such as this causes an underestimation of biodiversity compared with sampling each species independently. Because it is typically infeasible to sample all species independently, statistical removal of the sampling artefact is recommended²¹. Estimating biodiversity through eDNA is a potential way to sample each species independent of space via their DNA becoming aggregated and transported through a river’s network.

Second, an eDNA method of biodiversity monitoring in rivers has several advantages in that it is non-lethal for most classically sampled taxonomic groups, minimizes habitat disruption and can assess diversity across the tree of life with a single-field sampling protocol making it extremely cost effective. Therefore, demonstrating the power of this tool to monitor biodiversity of important indicator groups in rivers will provide a fast, non-lethal and inexpensive alternative tool compared with classically used methods.

Whole community detection with eDNA has been called a ‘game changer’ for biodiversity sampling¹⁶, and in this study, we move this idea from theory into practice. We test the hypothesis that transported eDNA in rivers can be used in an unprecedented way to assess biodiversity of eukaryotes. We validate the ability of an eDNA metabarcoding method in vitro and in situ to assess globally important macroinvertebrate communities and produce taxonomic richness estimates that reflect the biodiversity of a river’s catchment. Lastly, we demonstrate that a large number of eukaryotic phyla from both aquatic and terrestrial taxa can be assessed from eDNA in river water and provide support for the hypothesis that rivers are conveyer belts of biodiversity information for landscapes.

Results

eDNA detection of metazoan eukaryotes

We detected a total of 296 families that span 19 eukaryotic phyla from the Glatt river catchment in Switzerland (Fig. 1). All families were independently geographically verified as known to occur in Switzerland or the four neighbouring countries (Fig. 2a; Supplementary Data 1). The majority of the families detected were Arthropoda (N=196). Diversity in number of families detected was not proportional to read count and smaller organisms represented a much higher proportion of the sequences obtained (Rotifera; Fig. 2b). For example, two species in the phylum Rotifera accounted for 39% (92,907 sequences) of our data set. The majority of families were represented by >10 sequences (N=140; Supplementary Data 1). The largest data reduction step in the bioinformatic workflow was in linking a taxonomic name with our sequences (Supplementary Fig. 1; step E), resulting in only 4% (240,340 sequences) of acquired sequences that could be used for inferences in our study (Table 1). Of the sequences that were identified to species and that were independently geographically verified as occurring in Switzerland, many are terrestrial (N=255; Fig. 3; Supplementary Data 2).

**Figure 1: Study area and location of sampling sites where environmental DNA samples and classical sampling methods were carried out.**

**Figure 2: Total eukaryotic diversity detected from the river Glatt using environmental DNA metabarcoding.**

**Table 1: Sequences remaining after each bioinformatic filtering step and taxonomic assignment.**

**Figure 3: Percent terrestrial or freshwater species for the subset of each phylum detected in eDNA.**

eDNA detection of macroinvertebrates

Of the 296 families detected with eDNA for eukaryotes, 65 are used in the Swiss biomonitoring program²³. Thirteen additional families were detected by kicknet samples only, totalling 78 macroinvertebrate families detected among our sampling sites of the river Glatt (Supplementary Fig. 2). From eDNA, we recovered between 23 and 40 families at each site (Supplementary Fig. 2). With the classical kicknet method, we sampled 17–24 families at each site (Supplementary Fig. 2). Of the total 78 families detected, 33 were detected by both methods, and often at the same location (Supplementary Fig. 2). Of the remaining 45 families, 32 were only detected with eDNA and 13 were only detected with the kicknet sample. Eleven of these 13 families only detected with the kicknet were detected in the eDNA data set, but did not meet bioinformatic thresholds used for filtering assignment values (for example, were below a 90% sequence similarity or an alignment length <100 base pairs (bp), Supplementary Table 1). The two undetected families (Potamanthidae and Aphelocheiridae) likely had insufficient sequence data on GenBank for the identification of their DNA sequence from eDNA (Supplementary Table 1). Of the 32 families only detected with eDNA, 8 have been found in previous sampling events over the 18 years of monitoring (Supplementary Table 2) and an additional 2 (Molannidae, Notonectidae) are known to occur in lake Greifensee, which feeds into the river Glatt, but are not known from the river Glatt (Supplementary Table 2).
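The family counts in this paragraph follow from simple set arithmetic. As a minimal sketch (the variable names are illustrative; only the counts come from the text), they can be cross-checked as follows:

```python
# Counts reported in the text for macroinvertebrate families in the river Glatt.
both_methods = 33   # detected by both eDNA and kicknet
edna_only = 32      # detected only with eDNA
kicknet_only = 13   # detected only with kicknet

edna_total = both_methods + edna_only         # all families detected with eDNA
kicknet_total = both_methods + kicknet_only   # all families detected with kicknet
all_families = both_methods + edna_only + kicknet_only

print(edna_total, kicknet_total, all_families)  # 65 46 78
```

The totals agree with the 65 biomonitoring families detected with eDNA and the 78 families detected overall.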

Family richness (α-diversity) increased as a function of cumulative catchment area sampled for eDNA, whereas this was not observed for kicknet samples (F_1,6=5.45, P=0.058, r²=0.95, eDNA; F_1,6=0.0001, P=0.99, r²=0.92, kicknet; Fig. 4a). The slopes of the family–area relationship were different (slope_kicknet=0.0006; slope_eDNA=0.1077; F_1,12=29.87, P=0.0001), and the y intercept was higher for eDNA compared with kicknet (F_1,13=25.99, P=0.0002; Fig. 4a). β-diversity in the form of community dissimilarity did not increase as a function of distance for eDNA (r=0.02, P=0.44), whereas for kicknet sampling, we observed an increase in dissimilarity (β-diversity) as a function of distance between sampling sites (r=0.52, P=0.005; Fig. 4b).

**Figure 4: Difference of benthic macroinvertebrate family richness and community dissimilarity estimated between the environmental DNA and kicknet sampling.**
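Community dissimilarity between two sites can be computed directly from family presence lists. The sketch below assumes the Jaccard index as the dissimilarity measure (this section does not name the index used) and uses made-up site contents purely for illustration:

```python
from itertools import combinations

def jaccard_dissimilarity(a, b):
    """Return 1 - |A ∩ B| / |A ∪ B| for two sets of family names."""
    union = a | b
    if not union:
        return 0.0  # two empty sites are treated as identical
    return 1.0 - len(a & b) / len(union)

# Hypothetical presence data (not the study's data).
sites = {
    "site_1": {"Baetidae", "Gammaridae", "Chironomidae"},
    "site_2": {"Baetidae", "Gammaridae", "Simuliidae"},
    "site_3": {"Chironomidae", "Elmidae"},
}

# Dissimilarity for every unordered pair of sites.
pairwise = {
    (s, t): jaccard_dissimilarity(sites[s], sites[t])
    for s, t in combinations(sorted(sites), 2)
}
for pair, d in pairwise.items():
    print(pair, round(d, 2))
```

Plotting these pairwise values against the river distance between the same site pairs gives the kind of relationship summarized in Fig. 4b.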

In vitro test using a mock community

In total, we recovered 57,641 sequences from the mock community after the bioinformatic filtering and these sequences were identified to 25 of the 33 invertebrate taxa included in the mock community (Table 1; Supplementary Table 3). Of these sequences, 99.97% were correctly assigned to one of these taxa included in the mock community (Supplementary Table 4; Supplementary Table 5). The number of incorrectly assigned sequences was 0.03% (20/57,641) and all of these sequences belonged to two taxa (Tabanidae and Leuctridae; Supplementary Table 4). This resulted in a false-positive rate of 8% (2/25). Increasing the stringency of our bioinformatics thresholds set for accepting an assignment to a level that removes all false positives in the mock community (for example, increasing assignment similarity to >92%) introduces a false-absence rate of 16% (4/25) in the mock community, that is, the exclusion of taxa that were present in the mock community, but had an assignment similarity <92% (Supplementary Table 4). Using a similarity threshold of 92% on our data derived from eDNA in water samples to account for possible false presences, however, did not change our main findings (Supplementary Fig. 3).
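The error rates quoted for the mock community can be reproduced from the reported counts. This is a small worked check; the variable names are illustrative and only the numbers come from the text:

```python
# Counts reported for the mock community.
total_sequences = 57_641
misassigned_sequences = 20
taxa_recovered = 25
false_positive_taxa = 2   # Tabanidae and Leuctridae
taxa_lost_at_92 = 4       # taxa excluded when similarity must exceed 92%

misassignment_pct = 100 * misassigned_sequences / total_sequences
correct_pct = 100 - misassignment_pct
false_positive_pct = 100 * false_positive_taxa / taxa_recovered
false_absence_pct = 100 * taxa_lost_at_92 / taxa_recovered

print(f"{misassignment_pct:.2f}% misassigned, {correct_pct:.2f}% correct")
print(f"{false_positive_pct:.0f}% false positives, {false_absence_pct:.0f}% false absences")
```

The two rates trade off against each other: the stricter threshold removes both false-positive taxa but doubles the share of affected taxa, from 2 of 25 to 4 of 25.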

Discussion

We demonstrate that rivers, through their collection and transport of eDNA, can be used to sample catchment-level biodiversity across the land–water interface. For aquatic macroinvertebrates, we found a greater richness in the number of families detected with eDNA compared with the classical kicknet method at the same sample location (Fig. 4a). This increased sensitivity is hypothesized to come from the process of transport of DNA through the network of a river. Transport of DNA through a river network decreases the biases associated with the spatial autocorrelation (or limited scale of inference) inherent to the classical kicknet community sampling. The evidence from our work supports that eDNA found in rivers is a spatially integrated measure of biodiversity and this finding offers ecologists a new and unprecedented tool to sample landscape biodiversity with less sampling effort and potentially estimate richness of eukaryotic communities across biomes.

We hypothesize the following conceptual model as the explanation of our data. Typically, sampling methods for communities only capture a fraction of local α-diversity due to the imperfect detection and sampling bias (Fig. 5):

$$\hat{\alpha}_x^{\mathrm{classical}} = \delta^{\mathrm{classical}}\,\alpha_x$$

with $\hat{\alpha}_x^{\mathrm{classical}}$ representing the measured α-diversity at a spatial location x in a river network using classical sampling methods, $\alpha_x$ as the real α-diversity at this location and $\delta^{\mathrm{classical}}$ as the detection rate of the sampling method. To comprehensively estimate the biodiversity of a river catchment, a large number of such samples are required. If samples are spatially autocorrelated, pooling of community samples will result in an underestimation of the real local richness²¹.

**Figure 5: Conceptual model of environmental DNA dynamics in a hypothetical river network.**

Riverine networks have the potential to collect this information for us^2,3 if we use an appropriate sampling method not biased by spatial autocorrelation for the area under study. Characteristic properties of rivers, such as the specific distribution of biodiversity²⁴ and transport of eDNA by the flow of water⁸ are the mechanisms that enable an eDNA metabarcoding method to estimate the catchment-level biodiversity, while sampling at only one or very few locations:

$$\hat{\alpha}_x^{\mathrm{eDNA}} = \delta^{\mathrm{eDNA}} \sum_{y} \alpha_y\, N_y\, \beta_y\, \beta_{x,y}\, \tau_{x,y}$$

with $\hat{\alpha}_x^{\mathrm{eDNA}}$ as the integrated measure of catchment α-diversity (Fig. 5). The sum captures the information integrated by the riverine system for all locations y (Strahler stream order) upstream of the sampling location x. The local diversity at a site of Strahler stream order y ($\alpha_y$) has to be weighted according to Horton's Law to capture the number of streams of this Strahler stream order ($N_y$)³, as well as by the Strahler stream order-characteristic β-diversity ($\beta_y$). The estimate of catchment-level biodiversity increases with increasing β-diversity between the sampling point and all upstream locations ($\beta_{x,y}$), as well as with increasing transport distance ($\tau_{x,y}$; net rate including shedding and degradation). Note that the eDNA specific detection probability ($\delta^{\mathrm{eDNA}}$) tends to be high as, in principle, only very few DNA molecules are needed for successful detection.
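To make the contrast between the two estimates concrete, the sketch below assumes a simple multiplicative form: the classical estimate is the detection rate times the real local diversity, and the eDNA estimate is the eDNA detection probability times a sum, over upstream stream orders, of local diversity weighted by N_y, β_y, β_x,y and τ_x,y. The class, the function names and all numeric values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass

@dataclass
class StreamOrder:
    """Hypothetical inputs for one upstream Strahler stream order y."""
    alpha: float      # real local alpha-diversity at sites of this order
    n_streams: float  # N_y, number of streams of this order (Horton's Law)
    beta: float       # beta_y, order-characteristic beta-diversity weight
    beta_to_x: float  # beta_{x,y}, dissimilarity between x and this order
    tau: float        # tau_{x,y}, net transport weight (shedding, degradation)

def classical_estimate(alpha_x, delta_classical):
    # Local sampling sees only a fraction of the local diversity.
    return delta_classical * alpha_x

def edna_estimate(upstream, delta_edna):
    # Assumed multiplicative weighting, summed over upstream stream orders.
    weighted = sum(
        o.alpha * o.n_streams * o.beta * o.beta_to_x * o.tau for o in upstream
    )
    return delta_edna * weighted

# Made-up values, not fitted to any data.
upstream = [
    StreamOrder(alpha=10, n_streams=4, beta=0.5, beta_to_x=0.5, tau=0.5),
    StreamOrder(alpha=20, n_streams=2, beta=0.5, beta_to_x=0.5, tau=1.0),
]
print(classical_estimate(alpha_x=40, delta_classical=0.5))
print(edna_estimate(upstream, delta_edna=0.9))
```

Setting any weight to zero for a stream order removes its contribution, which mirrors the point that the integrated signal depends on both the distribution of species and the transport of their DNA.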

Our conceptual model identifies three important messages for the utility of eDNA as a genomic tool for the biodiversity assessment. First, eDNA detection of species from river water decouples the presence of a species from its physical location in a habitat through downstream transport. Transport distance in empirical systems has been measured between 240 m and 12 km (refs 8, 25), and thus allows for the increased sensitivity in the detection of patchily or elusively distributed species. In addition, transport of eDNA allows for richness estimates with less sampling effort because of the integrated signal over space. Second, eDNA will likely represent a sample of higher diversity compared with the classical sampling methods at any given site, but this depends on the local distribution of species and on factors affecting transport and degradation of eDNA. Third, the interpretation of the species presence inferred from an eDNA sample in a river is different from that of the classical sampling methods. Namely, eDNA detection of species should be interpreted as an integrated signal of presence and the spatial scale that is relevant is determined based on the potential transport distance for a system. Thus, our model suggests that eDNA in rivers is an efficient tool for broad scale biodiversity assessments, and depending on the distance between water samples, less authoritative for very localized richness estimates.

Our data comparing eDNA with kicknet samples at each site highlight several important factors that illustrate both the power and current limitations of using eDNA for the biodiversity assessment. Many families of macroinvertebrates were detected by both methods, with a great degree of overlap in the sites at which families were co-detected. For all sites, however, eDNA recovered more macroinvertebrate families compared with kicknet samples. We hypothesize this is likely due to the integrated signal from transported DNA, which is evident from the fact that community composition does not change much (that is, β-diversity remaining constant over distance), compared with kicknet estimated β-diversity that increased over the same river distance in our study area. This difference means that the two sampling methods give different information at the same site. Classical sampling methods give information that is localized, whereas the eDNA metabarcoding method in rivers measures presence of species on broader spatial scales. Scaling up of the classical community sampling method will likely always underestimate diversity²¹; eDNA offers an empirical method to overcome this limitation and is an unparalleled way to estimate richness for larger areas. This novel finding is of great importance because in many cases estimating diversity for a large area is the goal, such as that for biodiversity hotspots¹⁹, conservation preserves or entire river catchments²⁶.

Much of the current degradation of river habitat is at the catchment scale and cannot be attributed to a single point or source¹. Biomonitoring currently relies on the costly and lethal sampling of macroinvertebrates across many sites to understand the ecosystem health of rivers²⁷ and tracking these changes in space and time is of high interest²⁸. Biomonitoring is entering a new era and the demand in its use has generated an undue burden on resource agencies. For example, the United States, England and Switzerland combined spend ∼117.4–206.6 million US dollars annually on biomonitoring of aquatic systems (Supplementary Table 6). This number represents only a small fraction of what countries spend on biomonitoring at more local levels, but characterizes the value we place on using species in their environment to monitor the health of aquatic ecosystems. Biomonitoring is costly because of the different methods and expertise required to collect information about each targeted taxonomic group (for example, Supplementary Table 6)^22,27. An eDNA signal of macroinvertebrates can be used to estimate the diversity of a catchment more accurately with much less sampling effort and would therefore decrease the cost associated with biomonitoring when the goal is to measure the ecosystem health on large scales in river systems.

By contrast, understanding local changes in richness at a restoration site, for example, may still require classical sampling with kicknets. Interestingly, however, transport distances of eDNA are on a similar scale at which local species’ pools are recognized to be important for recolonization of restored patches in a river system (0–5 km)²⁹. Therefore, eDNA could be used as a way to measure the species’ pool available for recolonization. The scale of inference for eDNA, however, can be >5 km due to long-distance transport within basins and between basins due to other vectors such as faeces from predators. The complementarity between methods will aid in prioritizing river restoration efforts by identifying regions that have high recolonization potential of target species and possibly set expectations for the magnitude of change expected for restoration sites already in recovery.

Our results also identify a way of empirically measuring transport of community eDNA in rivers. Our analysis of β-diversity in this study system shows that community eDNA is likely transported and detected over a scale <12 km. To determine the scale of transport for community eDNA in a river system, one subsequently needs to detect the scale at which there is a positive spatial autocorrelation with β-diversity (for example, Fig. 4b). This empirical measure of transport is needed because, as shown by our conceptual model, eDNA detection of biodiversity is a function of the transport distance, but also a function of the distribution of species within the network. Transport itself is furthermore affected by local factors, such as degradation of eDNA due to ultraviolet radiation, pH and temperature³⁰, as well as discharge rates²⁵. Therefore, eDNA may not necessarily be transported and detected over the same distance for all river systems or consistently in time due to extreme events like heavy rainfall or drought. However, by using the correlation between an eDNA estimate of β-diversity and the river distance between sampling points, an in situ test can be performed to uncover the scale of transport for community eDNA in any system, and the test can be repeated over time to check whether eDNA transport distance is stable in a system.
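One way to run such an in situ test is to correlate pairwise community dissimilarity with pairwise river distance and assess the correlation by permuting site labels. The sketch below assumes a Mantel-type permutation test with a Pearson correlation; the function names, the data layout and the example values are hypothetical and not taken from the study.

```python
import random
from itertools import combinations

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def mantel(dissim, dist, sites, permutations=999, seed=0):
    """Return (r, one-sided p) for dissimilarity increasing with distance.

    dissim and dist map frozenset({site_a, site_b}) to a number.
    """
    pairs = list(combinations(sites, 2))
    y = [dist[frozenset(p)] for p in pairs]
    observed = pearson([dissim[frozenset(p)] for p in pairs], y)

    rng = random.Random(seed)
    at_least_as_large = 1  # the observed arrangement counts once
    for _ in range(permutations):
        shuffled = sites[:]
        rng.shuffle(shuffled)
        relabel = dict(zip(sites, shuffled))  # permute site labels jointly
        x = [dissim[frozenset((relabel[a], relabel[b]))] for a, b in pairs]
        if pearson(x, y) >= observed:
            at_least_as_large += 1
    return observed, at_least_as_large / (permutations + 1)

# Hypothetical sites at river kilometres 0, 2, 5 and 9.
position = {"A": 0.0, "B": 2.0, "C": 5.0, "D": 9.0}
sites = sorted(position)
dist = {frozenset(p): abs(position[p[0]] - position[p[1]])
        for p in combinations(sites, 2)}
# Made-up dissimilarities that grow linearly with distance.
dissim = {pair: d / 10 for pair, d in dist.items()}

r, p = mantel(dissim, dist, sites)
print(round(r, 3), p)
```

A clearly positive correlation would indicate that samples are resolving local differences at that spacing, whereas a correlation near zero, as reported here for eDNA, would be consistent with an integrated signal over the distances sampled.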

There are still important limitations of the eDNA metabarcoding method. These challenges are related to factors such as the importance of primer or marker choice, the amplicon sequence length and the biodiversity detected as a function of the reference data available for identification of sequences^31,32. For example, fish, flatworms and diatoms in our data set are underrepresented relative to what we know occurs in the studied system. This is most likely due to the choice of primers, the genetic marker and the reference database. The primers used in this study are the universal Folmer primers for the 5′ end of cytochrome c oxidase I (COI)³³, and it is known that these primers do not amplify DNA from fish and flatworms very well^34,35, respectively. In addition, for diatoms it is known that COI is not the most suitable genetic marker for species level identification³⁶. Therefore, it is clear that more than one marker and/or primer set is needed to adequately assess biodiversity for the tree of life³⁷. However, use of an eDNA metabarcode method does not require additional sampling in the field. Rather it creates a single-field sampling method, whereby careful amplification of many genetic markers in the laboratory will enable an integrated detection for total biodiversity from a single sample³⁸.

An additional challenge faced by the further application of this approach is the need for continued development of diverse, but curated databases with taxonomically classified sequences. Our mock community analysis corroborated that we had a high accuracy in assignment of sequences, when compared with the reference sequence generated from the DNA used for the mock community (96.4–99.9% similarity). The range of assignment similarity widened (90.1–99.8%) when compared with NCBI’s nucleotide database. The gaps in NCBI’s nucleotide database for targeted groups, such as macroinvertebrates used for biomonitoring, will need to be augmented and assessed before the tool can be more widely applied in management. Because of uncertainty in the database, we removed many sequences that could not be confidently assigned to the family taxonomic level. At the current filtering level, we are already accepting a false-absence rate of 14% (Supplementary Table 1). Reducing our data set further using more stringent criteria increased type II error by creating many more false absences for taxa we actually collected in our kicknet samples at the time of sampling (Supplementary Note 1). Therefore, at this stage in deployment of an eDNA metabarcoding approach, researchers need to strive to reduce false absences and false presences, while understanding that the tool is in rapid development and error rates for macroinvertebrates related to this method are still unknown beyond the estimates given here. In comparison with morphological assessments of macroinvertebrates at the family level, however, identification error is reported to range between 22.1% (ref. 39) and 33.8% (ref. 40), suggesting that the only alternative used in regulatory monitoring settings already has a high false-positive-presence/absence rate. Most of the sequences from our data set were removed because the taxonomic assignment failed. The solution for this is to increase the deposition of sequences in curated databases such as The Barcode of Life Database⁴¹ through continued collaboration between molecular ecologists and taxonomists. Digitizing specimens in the form of sequences is an essential step that will vastly improve our ability to accurately identify DNA found in the environment.

We have demonstrated that rivers convey, through the collection and transport of eDNA, an unprecedented amount of information on biodiversity in landscapes. Our study shows that eDNA can be used to sample community structure of river catchments and do so even across the land–water interface. As such, detection of eukaryotic fauna with DNA found and transported in rivers may unite historically separated research fields of aquatic and terrestrial ecology, and provide an integrated measure of total biodiversity for rapid assessment for one of the most highly impacted biomes of the world.

Methods

eDNA sampling and library preparation for next generation sequencing

Water samples were collected from eight sites along the Glatt river network, a subcatchment of the Rhine river in Switzerland (Fig. 1). The study sites were chosen because they represent nodes in the river network, where water from the major subcatchment tributaries combine and flow into the main stem of the river Glatt. They also have a known history of monitoring macroinvertebrates for the past 15 years⁴². At each site, DNA was isolated from between 840 and 900 ml of river water sampled. The method for sampling, capture and extraction of DNA followed that of Deiner et al.⁴³, where the capture method of filtration was coupled with a phenol–chloroform isoamyl DNA extraction. Strict adherence to contamination control was followed using a controlled lab for eDNA isolation and pre-PCR preparations⁴³. Three independent extractions of 280–300 ml were carried out, and then pooled to equal DNA captured and purified from 840 to 900 ml of water. Total volume of water filtered for each extraction replicate depended on the suspended solids in the sample, which clogged the filter. Water for this study was collected minutes before collecting aquatic macroinvertebrates using the classical kicknet sampling method (for a description see below and refs 24, 42), and therefore allowed for a comparison between the kicknet and eDNA methods for the detection of aquatic macroinvertebrate communities within the same watershed at the same time point.

PCRs were carried out for the target gene, COI, using the standard COI primers³³ on pooled eDNA extractions for each of the eight sites and amplified a fragment of 658 bp excluding primer sequences. PCRs were carried out in 15 μl volumes with final concentrations of 1 × supplied buffer (Faststart TAQ, Roche, Inc., Basel, Switzerland), 1,000 ng μl⁻¹ bovine serum albumin (New England Biolabs, Inc., Ipswich, MA, USA), 0.2 mMol dNTPs, 2.0 mMol MgCl₂, 0.05 U μl⁻¹ Taq DNA polymerase (Faststart TAQ, Roche, Inc., Basel, Switzerland) and 0.50 μMol of each forward and reverse primer³³. A measure of 2 μl of the pooled extracted eDNA was added. The thermal-cycling regime was 95 °C for 4 min, followed by 35 cycles of 95 °C for 30 s, 48 °C for 30 s and 72 °C for 1 min. A final extension of 72 °C for 5 min was carried out, and the PCR was cooled to 10 °C until removed and stored at –20 °C until confirmation of products occurred. PCR products were confirmed by gel electrophoresis on a 1.4% agarose gel stained with GelRed (Biotium Inc., Hayward, CA, USA). Three PCR replicates were performed on each of the eight eDNA samples from our study sites and products from the three replicates were pooled. Negative filtration, extraction and PCR controls were used to monitor any contamination during the molecular workflow, and were also replicated three times. Reactions were then cleaned using AMPure XP beads following recommended manufacturer’s protocol except 0.6 × bead concentration was used instead of 1.8 × based on recommended protocol for fragment size retention of >500 bp (p. 31, Nextera XT DNA 96 kit, Illumina, Inc., San Diego, CA, USA). We quantified each pooled reaction using the Qubit (1.0) fluorometer following recommended protocols for the dsDNA high-sensitivity DNA assay that has an accuracy for double stranded DNA between 0.005 and 0.5 pg μl⁻¹ (Agilent Technologies, Santa Clara, CA, USA). 
At this step negative controls showed no quantifiable DNA and we therefore did not process them further.

The eight reactions were then each diluted with molecular grade water (Sigma-Aldrich, Co. LLC. St. Louis, MO, USA) to 0.2 ng μl⁻¹ following the recommended protocol for library construction (Nextera XT DNA 96 kit, Illumina, Inc., San Diego, CA, USA). Libraries for the eight sites were prepared using the Nextera XT DNA kit following the manufacturer’s recommended protocols and dual indexed using the Nextera XT index kit A (Illumina, Inc., San Diego, CA, USA). In brief, this protocol uses a process called tagmentation whereby the amplicon is cleaved preferentially from the 5′ and 3′ ends, and the index and adaptor are ligated onto the amplicon. The tagmentation process produces an amplicon pool for each site (that is, library) with randomly cleaved fragments averaging 300 bp in length that are subsequently dual indexed. The library constructed for each site was then pooled and paired-end sequenced (2 × 250 bp) on an Illumina MiSeq at the Genomic Diversity Center at the ETH, Zurich, Switzerland following the manufacturer’s run protocols (Illumina, Inc., San Diego, CA, USA). The MiSeq Control Software Version 2.2 including MiSeq Reporter 2.2 was used for the primary analysis and the de-multiplexing of the raw reads.

Bioinformatic analysis

The workflow is presented in Supplementary Fig. 1. Run quality was assessed using FastQC version 0.10.1. Forward and reverse sequences were merged with a minimum overlap of 25 bp and minimum length of 100 bp using SeqPrep⁴⁴. Sequences that could not be merged were excluded from further analysis. Merged sequences with quality scores less than a mean of 25 were removed. Merged sequences were then de-replicated by removing exact duplicates, were de-noised using a sequence identity threshold of 99%, and were quality trimmed left and right by 28 bp using PrinSeq Lite version 0.20.3 to remove any primer sequence⁴⁵. Sequences were then mapped to the COI Barcode of Life Database (iBOL phase 4.00)⁴¹ using a map_reads_reference.py script with the minimum per cent identity to consider a match as 50% and the minimum sequence length match to a reference of 50% to remove any sequences not likely of COI origin. Subsequent sequences were then chimera checked using usearch version 6 (ref. 46). Remaining sequences ≥100 bp in length were then taxonomically identified using customized Blast searches against the NCBI non-redundant nucleotide database using the package blast 2.2.28, built on 12 March 2013 16:52:31 (ref. 47). Taxonomic assignment of a sequence was done using the best blast hit based on a bit score calculated using the default blastn search of a −3 penalty for a nucleotide mismatch and a reward of +1 for a nucleotide match. Sequences that did not match eukaryotes, were <90.0% sequence similarity, had <100 bp overlap with query, had a taxonomic name not assigned below the level of family, matched best with unknown environmental samples and/or had a bit score <100 were excluded from biodiversity detection analysis for all sites. These parameters were used because they removed likely taxonomic identification errors or excluded data that were unidentified at the family level used for analysis^43,48.
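The exclusion criteria for taxonomic assignments can be expressed as a single predicate applied to each best hit. The following is a minimal sketch: the `BlastHit` record and its field names are hypothetical (they are not a BLAST output schema), and only the thresholds come from the text.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class BlastHit:
    """Hypothetical summary of the best hit for one query sequence."""
    is_eukaryote: bool
    percent_identity: float        # sequence similarity, in per cent
    alignment_length: int          # overlap with the query, in bp
    bit_score: float
    family: Optional[str]          # None if no usable family-level name
    is_environmental_sample: bool  # best match is an unknown environmental sample

def passes_filters(hit: BlastHit) -> bool:
    """Keep a hit only if it meets every threshold described in the text."""
    return (
        hit.is_eukaryote
        and hit.percent_identity >= 90.0
        and hit.alignment_length >= 100
        and hit.bit_score >= 100
        and hit.family is not None
        and not hit.is_environmental_sample
    )

# Made-up example hits.
good = BlastHit(True, 97.5, 250, 380.0, "Baetidae", False)
low_identity = BlastHit(True, 89.9, 250, 380.0, "Baetidae", False)
short_overlap = BlastHit(True, 97.5, 99, 150.0, "Baetidae", False)

print([passes_filters(h) for h in (good, low_identity, short_overlap)])
```

Keeping the thresholds in one function makes it straightforward to rerun the analysis with stricter values, such as the 92% similarity cut-off explored with the mock community.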

After identification of sequences with the NCBI nucleotide sequence database, each uniquely identified taxon from any site was geographically verified as known to be present in Switzerland to the lowest level of taxonomy, or if no data was available for Switzerland, it was also considered present when the taxon was known to be present in Austria, France, Germany and Italy. We excluded the one and very rare case (that is, Culicoides fascipennis), where it is known for sure that a species is not in Switzerland, but found in all four neighbouring countries. Geographic verification was done in consultation with 25 expert taxonomists for various groups, primary literature and through database repositories as described in Supplementary Tables 1 and 2. If the species could be confidently confirmed as being present in Switzerland or in all four neighbouring countries, their known habitat use was identified as being freshwater (defined as having at least one life stage inhabiting water) or terrestrial (which included species that inhabit riparian or wet habitats or typically feed in aquatic habitats, but do not have full life stages or reproduce in the water; Supplementary Table 2). In addition, because we used bovine serum albumin as an additive in PCR, we cannot rule out that detections of Bos taurus or Bos indicus were due to this reagent and therefore excluded them from analysis.
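The geographic verification rule described above reduces to a small decision function. The sketch below is an assumption-laden illustration, not the procedure followed with the expert taxonomists: the function name, the `known_absent_in_switzerland` flag and the example country sets are all hypothetical.

```python
# The four neighbouring countries named in the text.
NEIGHBOURS = frozenset({"Austria", "France", "Germany", "Italy"})


def is_geographically_verified(countries, known_absent_in_switzerland=False):
    """Apply the verification rule to the set of countries where a taxon is recorded.

    A taxon is accepted if it is recorded in Switzerland, or, failing that,
    if it is recorded in all four neighbouring countries. A taxon known for
    certain to be absent from Switzerland is rejected even if it occurs in
    all four neighbours (the single exception described in the text).
    """
    countries = set(countries)
    if "Switzerland" in countries:
        return True
    if known_absent_in_switzerland:
        return False
    return NEIGHBOURS <= countries


# Hypothetical records for illustration.
print(is_geographically_verified({"Switzerland", "Germany"}))
print(is_geographically_verified({"Austria", "France", "Germany", "Italy"}))
print(is_geographically_verified({"Austria", "France", "Germany"}))
print(is_geographically_verified(NEIGHBOURS, known_absent_in_switzerland=True))
```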

Mock community analysis

A mock community approach was used to verify that our laboratory methods and bioinformatics pipeline were capable of correctly detecting the taxa of interest. We composed a mock community of invertebrate taxa from 33 different families spanning three phyla (all known to be present in our study area, Supplementary Table 3). We individually extracted their DNA, pooled and sequenced the mock community in accordance with the same methods used for analysis of eDNA samples from the river Glatt (see Supplementary Note 1 for complete methods). We additionally Sanger sequenced all 33 DNA extractions from taxa following that of Mächler et al.⁴⁹ to generate a sequence reference database to assess the assignment errors when using NCBI’s nucleotide database⁴⁷.

Kicknet sampling and identification

Macroinvertebrates were detected using a standard kicknet sampling design described for federal and cantonal guidelines in Switzerland^23,24 and represent our positive control for each site. In brief, we took eight independent kicknet samples per site on 29 October 2012. Large inorganic and organic debris was removed, and samples were pooled into a single collection jar with 70% EtOH. Jars were then stored at room temperature until morphological identification. This method and time of year have been shown to reflect the different microhabitats and provide a robust presence measure for many macroinvertebrates in Switzerland²³. Since eDNA has been shown to decay over short time periods of a few days to a few months³⁰, using a single time point from a kicknet sample to compare with what is detected in the eDNA is valid. However, it is known that kicknet samples taken at different times of year, such as in the spring, can detect different species, either because of morphological constraints in the identification of specimens at young life stages or because their physical presence in the water is limited by the timing of their life cycle²³. Specimens from each site were sorted to the lowest taxonomic level possible (family, genus or species level), using dichotomous keys agreed upon by the Swiss Federal Office of the Environment²³. Specimens that could not be identified to at least the taxonomic rank of family were excluded from further analysis.

Comparison of eDNA and kicknet macroinvertebrate detection

For each site, we summarized the number of eDNA detected families of macroinvertebrates and number of families observed for the classical kicknet method, including only aquatic taxa on the standardized list of macroinvertebrates for biomonitoring of Swiss waters by the Federal Office for the Environment²³. Using this standardized list, we calculated each site’s observed α-diversity (local richness) for macroinvertebrates and visualized it on a heatmap of incidence. The estimated catchment area sampled for each position in the network was calculated as the cumulative sum of the area of all subcatchments into which all surface waters (excluding the lake) drain above the sampling point (Fig. 1). Topological distance between the sampling sites was calculated along the river’s path. Catchment area and distance between the sampling sites were calculated using Quantum Geographic Information System (QGIS) version 2.8 (ref. 50). The number of families detected (considered here as α-diversity) by each sampling method (eDNA and kicknet) was log₁₀ transformed and regressed against the log₁₀ of the river area to test for the taxon–area relationship. We were interested in whether the two sampling methods differ in the magnitude of diversity detected due to the transport of DNA (y intercept of the taxon–area relationship), and whether the rate of increase in number of taxa for a given area was faster for eDNA compared with the kicknet (slope of the regression lines), as predicted from our conceptual model. Slopes and y intercepts of the two regressions for the taxon–area relationship were compared using an analysis of covariance.
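The taxon–area relationship described above is a straight-line fit on log–log axes, log₁₀(S) = c + z·log₁₀(A), where S is the number of families and A the catchment area. The following Python sketch shows one way such a fit could be computed with ordinary least squares. It is illustrative only: the function names, the km² unit and all numbers are hypothetical and are not data from this study, and the analysis of covariance used to compare the two lines is not reproduced.

```python
import math


def ols(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def taxon_area_fit(area, n_families):
    """Regress log10(number of families) on log10(area); returns (z, c)."""
    log_area = [math.log10(a) for a in area]
    log_richness = [math.log10(s) for s in n_families]
    return ols(log_area, log_richness)


# Hypothetical values for illustration only (not data from the study).
area_km2 = [5.0, 12.0, 30.0, 75.0, 160.0]
edna_families = [18, 24, 31, 40, 52]
kicknet_families = [9, 10, 12, 13, 15]

for label, richness in (("eDNA", edna_families), ("kicknet", kicknet_families)):
    z, c = taxon_area_fit(area_km2, richness)
    print(f"{label}: slope z = {z:.3f}, intercept c = {c:.3f}")
```

In this framing, a larger intercept c corresponds to more families detected at a given area, and a larger slope z to a faster accumulation of families with increasing area.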

To test for spatial autocorrelation between community dissimilarity (β-diversity, using the Jaccard dissimilarity index) and distance between sampling locations, we used a Mantel test with 9,999 permutations. Here we excluded the tributaries as it is not possible for eDNA to flow into these locations (for example, cd into a). The Jaccard measure of β-diversity was used as it has been shown to estimate community dissimilarity for incidence data with less bias arising from nestedness, which is expected for the eDNA estimate of β-diversity due to transport⁵¹. All statistical analyses were performed in R version 3.1.0 (ref. 52).
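For readers unfamiliar with the approach, the sketch below shows a Jaccard dissimilarity for incidence data and a basic one-sided permutation Mantel test in plain Python. It is a minimal illustration and not the R code used for the analyses: the site names, family lists and river distances are hypothetical, and the one-sided p-value convention (counting the observed statistic as one permutation) is an assumption.

```python
import random
from itertools import combinations


def jaccard_dissimilarity(a, b):
    """1 - |A ∩ B| / |A ∪ B| for two sets of detected families."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    return sxy / (sxx * syy) ** 0.5


def mantel(d1, d2, permutations=9999, seed=0):
    """One-sided Mantel test between two square symmetric distance matrices.

    Rows and columns of d1 are permuted together; the p-value is the share
    of permutations (plus the observed one) with r >= the observed r.
    """
    n = len(d1)
    pairs = list(combinations(range(n), 2))
    y = [d2[i][j] for i, j in pairs]

    def statistic(order):
        x = [d1[order[i]][order[j]] for i, j in pairs]
        return pearson(x, y)

    identity = list(range(n))
    r_obs = statistic(identity)
    rng = random.Random(seed)
    count = 1  # the observed arrangement counts as one permutation
    for _ in range(permutations):
        order = identity[:]
        rng.shuffle(order)
        if statistic(order) >= r_obs:
            count += 1
    return r_obs, count / (permutations + 1)


# Hypothetical incidence data and along-river positions (km), illustration only.
sites = {
    "s1": {"Baetidae", "Gammaridae"},
    "s2": {"Baetidae", "Gammaridae", "Chironomidae"},
    "s3": {"Gammaridae", "Chironomidae", "Simuliidae"},
    "s4": {"Chironomidae", "Simuliidae", "Elmidae"},
}
river_km = {"s1": 0.0, "s2": 2.0, "s3": 5.0, "s4": 9.0}
names = sorted(sites)
beta = [[jaccard_dissimilarity(sites[a], sites[b]) for b in names] for a in names]
dist = [[abs(river_km[a] - river_km[b]) for b in names] for a in names]

r, p = mantel(beta, dist, permutations=999)
print(f"Mantel r = {r:.3f}, p = {p:.3f}")
```

With only a handful of sites the number of distinct permutations is small, so the attainable p-values are coarse; the example is meant to show the mechanics rather than to support inference.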

Data availability

All raw data associated with this study have been deposited on the NCBI’s Sequence Read Archive (SRA) under the BioProject PRJNA291617. Details for each individual file are given in Supplementary Table 7. All other intermediate processed data files are available from the authors upon request.

Additional information

How to cite this article: Deiner, K. et al. Environmental DNA reveals that rivers are conveyer belts of biodiversity information. Nat. Commun. 7:12544 doi: 10.1038/ncomms12544 (2016).

References

Vörösmarty, C. J. et al. Global threats to human water security and river biodiversity. Nature 467, 555–561 (2010).
Willett, S. D., McCoy, S. W., Perron, J. T., Goren, L. & Chen, C.-Y. Dynamic reorganization of river basins. Science 343, 1248765 (2014).
Rodríguez-Iturbe, I. & Rinaldo, A. Fractal River Basins: Chance and Self-Organization (Cambridge University Press, 1997).
Clift, P. D. & Blusztajn, J. Reorganization of the western Himalayan river system after five million years ago. Nature 438, 1001–1003 (2005).
Ward, N. D. et al. Degradation of terrestrially derived macromolecules in the Amazon River. Nat. Geosci. 6, 530–533 (2013).
Altermatt, F. Diversity in riverine metacommunities: a network perspective. Aquat. Ecol. 47, 365–377 (2013).
Mari, L., Casagrandi, R., Bertuzzo, E., Rinaldo, A. & Gatto, M. Metapopulation persistence and species spread in river networks. Ecol. Lett. 17, 426–434 (2014).
Deiner, K. & Altermatt, F. Transport distance of invertebrate environmental DNA in a natural river. PLoS ONE 9, e88786 (2014).
Turner, C. R. et al. Particle size distribution and optimal capture of aqueous macrobial eDNA. Methods Ecol. Evol. 5, 676–684 (2014).
Taberlet, P., Coissac, E., Hajibabaei, M. & Rieseberg, L. H. Environmental DNA. Mol. Ecol. 21, 1789–1793 (2012).
Ji, Y. et al. Reliable, verifiable and efficient monitoring of biodiversity via metabarcoding. Ecol. Lett. 16, 1245–1257 (2013).
Bohmann, K. et al. Environmental DNA for wildlife biology and biodiversity monitoring. Trends Ecol. Evol. 29, 358–367 (2014).
Cristescu, M. E. From barcoding single individuals to metabarcoding biological communities: towards an integrative approach to the study of global biodiversity. Trends Ecol. Evol. 29, 566–571 (2014).
Goldberg, C. S., Strickler, K. M. & Pilliod, D. S. Moving environmental DNA methods from concept to practice for monitoring aquatic macroorganisms. Biol. Conserv. 183, 1–3 (2015).
Kelly, R. P. et al. Harnessing DNA to improve environmental management. Science 344, 1455–1456 (2014).
Lawson Handley, L. How will the ‘molecular revolution’ contribute to biological recording? Biol. J. Linn. Soc. 115, 750–766 (2015).
Rees, H. C., Maddison, B. C., Middleditch, D. J., Patmore, J. R. & Gough, K. C. REVIEW: The detection of aquatic animal species using environmental DNA–a review of eDNA as a survey tool in ecology. J. Appl. Ecol. 51, 1450–1459 (2014).
Laramie, M. B., Pilliod, D. S. & Goldberg, C. S. Characterizing the distribution of an endangered salmonid using environmental DNA analysis. Biol. Conserv. 183, 29–37 (2015).
Myers, N., Mittermeier, R. A., Mittermeier, C. G., Da Fonseca, G. A. & Kent, J. Biodiversity hotspots for conservation priorities. Nature 403, 853–858 (2000).
Noss, R. F. et al. How global biodiversity hotspots may go unrecognized: lessons from the North American Coastal Plain. Divers. Distrib. 21, 236–244 (2015).
Gotelli, N. J. & Colwell, R. K. Quantifying biodiversity: procedures and pitfalls in the measurement and comparison of species richness. Ecol. Lett. 4, 379–391 (2001).
Barbour, M. T., Gerritsen, J., Snyder, B. & Stribling, J. Rapid bioassessment protocols for use in streams and wadeable rivers: periphyton, benthic macroinvertebrates, and fish. USEPA 339, 1–340 (1999).
Stucki, P. Methoden zur Untersuchung und Beurteilung der Fliessgewässer: Makrozoobenthos-Stufe F (Bundesamt für Umwelt-Vollzug, 2010).
Altermatt, F., Seymour, M. & Martinez, N. River network properties shape α-diversity and community similarity patterns of aquatic insect communities across major drainage basins. J. Biogeogr. 40, 2249–2260 (2013).
Jane, S. F. et al. Distance, flow and PCR inhibition: eDNA dynamics in two headwater streams. Mol. Ecol. Resour. 15, 216–227 (2015).
Sheldon, F. et al. Identifying the spatial scale of land use that most strongly influences overall river ecosystem health score. Ecol. Appl. 22, 2188–2203 (2012).
Stein, E. D., Martinez, M. C., Stiles, S., Miller, P. E. & Zakharov, E. V. Is DNA barcoding actually cheaper and faster than traditional morphological methods: results from a survey of freshwater bioassessment efforts in the United States? PLoS ONE 9, e95525 (2014).
Heino, J. et al. A comparative analysis reveals weak relationships between ecological factors and beta diversity of stream insect metacommunities at two spatial levels. Ecol. Evol. 5, 1235–1248 (2015).
Sundermann, A., Stoll, S. & Haase, P. River restoration success depends on the species pool of the immediate surroundings. Ecol. Appl. 21, 1962–1971 (2011).
Strickler, K. M., Fremier, A. K. & Goldberg, C. S. Quantifying effects of UV-B, temperature, and pH on eDNA degradation in aquatic microcosms. Biol. Conserv. 183, 85–92 (2015).
Deagle, B. E., Jarman, S. N., Coissac, E., Pompanon, F. & Taberlet, P. DNA metabarcoding and the cytochrome c oxidase subunit I marker: not a perfect match. Biol. Lett. 10, 20140562 (2014).
Elbrecht, V. & Leese, F. Can DNA-based ecosystem assessments quantify species abundance? Testing primer bias and biomass-sequence relationships with an innovative metabarcoding protocol. PeerJ PrePrints 3, e1258 (2015).
Folmer, O., Black, M., Hoeh, W., Lutz, R. & Vrijenhoek, R. DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol. Mar. Biol. Biotechnol. 3, 294–299 (1994).
Ivanova, N. V., Zemlak, T. S., Hanner, R. H. & Hebert, P. D. Universal primer cocktails for fish DNA barcoding. Mol. Ecol. Notes 7, 544–548 (2007).
Moszczynska, A., Locke, S. A., McLaughlin, J. D., Marcogliese, D. J. & Crease, T. J. Development of primers for the mitochondrial cytochrome c oxidase I gene in digenetic trematodes (Platyhelminthes) illustrates the challenge of barcoding parasitic helminths. Mol. Ecol. Resour. 9, 75–82 (2009).
Zimmermann, J., Glöckner, G., Jahn, R., Enke, N. & Gemeinholzer, B. Metabarcoding vs. morphological identification to assess diatom diversity in environmental studies. Mol. Ecol. Resour. 15, 526–542 (2015).
Gibson, J. et al. Simultaneous assessment of the macrobiome and microbiome in a bulk sample of tropical arthropods through DNA metasystematics. Proc. Natl Acad. Sci. 111, 8007–8012 (2014).
Drummond, A. J. et al. Evaluating a multigene environmental DNA approach for biodiversity assessment. GigaScience 4, 1 (2015).
Stribling, J. B., Pavlik, K. L., Holdsworth, S. M. & Leppo, E. W. Data quality, performance, and uncertainty in taxonomic identification for biological assessments. J. N. Am. Benthol. Soc. 27, 906–919 (2008).
Haase, P., Pauls, S. U., Schindehütte, K. & Sundermann, A. First audit of macroinvertebrate samples from an EU Water Framework Directive monitoring program: human error greatly lowers precision of assessment results. J. N. Am. Benthol. Soc. 29, 1279–1291 (2010).
Ratnasingham, S. & Hebert, P. D. N. BOLD: The Barcode of Life Data System. Mol. Ecol. Notes 7, 355–364 (2007).
AWEL. Zürcher Gewässer 2012: Entwicklung - Zustand - Ausblick. Amt für Abfall, Wasser, Energie und Luft, Zürich, Switzerland. Available at http://www.awel.zh.ch/internet/baudirektion/awel/de/wasser/gewaesserschutz/wasserqualitaet.html (2012).
Deiner, K., Walser, J.-C., Mächler, E. & Altermatt, F. Choice of capture and extraction methods affect detection of freshwater biodiversity from environmental DNA. Biol. Conserv. 183, 53–63 (2015).
SeqPrep. Available at https://github.com/jstjohn/SeqPrep (2011).
Schmieder, R. & Edwards, R. Quality control and preprocessing of metagenomic datasets. Bioinformatics 27, 863–864 (2011).
Edgar, R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–2461 (2010).
Benson, D. A. et al. GenBank. Nucleic Acids Res. 40, D48–D53 (2012).
Deiner, K., Knapp, R. A., Boiano, D. M. & May, B. Increased accuracy of species lists developed for alpine lakes using morphology and cytochrome oxidase I for identification of specimens. Mol. Ecol. Resour. 13, 820–831 (2013).
Mächler, E., Deiner, K., Steinmann, P. & Altermatt, F. Utility of Environmental DNA for Monitoring Rare and Indicator Macroinvertebrate Species. Freshw. Sci. 33, 1174–1183 (2014).
QGIS Geographic Information System. Available at http://qgis.osgeo.org (Open Source Geospatial Foundation Project, 2015).
Cardoso, P., Borges, P. A. & Veech, J. A. Testing the performance of beta diversity measures based on incidence data: the robustness to undersampling. Divers. Distrib. 15, 1081–1090 (2009).
R: A language and environment for statistical computing. Version 3.0.1 (R Foundation for Statistical Computing, 2013).

Acknowledgements

We thank Patrick Steinmann for assistance with kicknet samples and taxonomic identification, Katharina Kaelin for assistance with our study area map and geographic information used in this study, and Peter Penicka for contributing illustrations to cover art. We thank the 25 taxonomists who donated their time to help with the geographic verification of taxa identified through eDNA (all are listed in Supplementary Data 3 and 4). We thank Florian Leese, Vasco Elbrecht, Simon Creer and Michael Pfrender, and two anonymous reviewers for providing insightful feedback on previous versions of the manuscript. Data analysed in this paper were generated in collaboration with the Genetic Diversity Centre (GDC), ETH Zurich, and the University of Notre Dame’s Genomics and Bioinformatics Core Facility. This work was funded by Eawag: Swiss Federal Institute of Aquatic Science and Technology, and the Swiss National Science Foundation (grant no. PP00P3_150698 to F.A.).

Author information

Authors and Affiliations

Department of Aquatic Ecology, Eawag: Swiss Federal Institute of Aquatic Science and Technology, Überlandstrasse 133, Dübendorf, CH-8600, Switzerland
Kristy Deiner, Emanuel A. Fronhofer, Elvira Mächler & Florian Altermatt
Department of Biological Sciences, University of Notre Dame, 290B Galvin Life Sciences, Notre Dame, 46556, Indiana, USA
Kristy Deiner
Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, Zürich, CH-8057, Switzerland
Emanuel A. Fronhofer, Elvira Mächler & Florian Altermatt
Swiss Federal Institute of Technology (ETH), Zürich, Genetic Diversity Centre, CHN E 55 Universitätstrasse 16, Zürich, 8092, Switzerland
Jean-Claude Walser

Contributions

K.D. and F.A. designed the study, K.D. and E.M. collected the data, J.-C.W. developed the pipeline and conducted the bioinformatics analysis, E.F., F.A. and K.D. developed the conceptual model, and all authors contributed to the analysis and writing of the manuscript.

Corresponding authors

Correspondence to Kristy Deiner or Florian Altermatt.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-3, Supplementary Tables 1-7, Supplementary Note 1 and Supplementary References (PDF 405 kb)

Supplementary Data 1

Families that were geographically verified as occurring in Switzerland or known from all four neighboring countries. For each family their higher order classification (Phylum, Class and Order) is given. Furthermore, we give for each macroinvertebrate family recorded the number of sequences, the average number of identical base pairs matched (and standard deviation s.d.) to Genbank sequences (in percent), the average (and standard deviation s.d.) alignment length with the Genbank sequence (# number of base pairs), as well as the geographic confirmation source. For a detailed list of all confirmation sources, see sheet "Confirmation sources" of this Excel file. Not applicable is abbreviated as "na". (XLSX 40 kb)

Supplementary Data 2

Confirmation sources used for identifying families that were geographically verified as occurring in Switzerland or known from all four neighboring countries. (XLSX 11 kb)

Supplementary Data 3

Species that were geographically verified as occurring in Switzerland or known from all four neighboring countries. For each species their higher order classification (Phylum, Class, Order, Family, and Genus) is given. Furthermore, we give for each macroinvertebrate species their habitat, number of sequences, the average number of identical base pairs matched (and standard deviation s.d.) to Genbank sequences (in percent), the average (and standard deviation s.d.) alignment length with the Genbank reference (base pairs), as well as the geographic confirmation source. For a detailed list of all confirmation sources, see sheet "Confirmation sources" of this Excel file. (XLSX 34 kb)

Supplementary Data 4

Confirmation sources used for identifying species that were geographically verified as occurring in Switzerland or known from all four neighboring countries. (XLSX 11 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Cite this article

Deiner, K., Fronhofer, E., Mächler, E. et al. Environmental DNA reveals that rivers are conveyer belts of biodiversity information. Nat Commun 7, 12544 (2016). https://doi.org/10.1038/ncomms12544

Received: 06 November 2015
Accepted: 12 July 2016
Published: 30 August 2016
DOI: https://doi.org/10.1038/ncomms12544
