Mapping hydrologic alteration and ecological consequences in stream reaches of the conterminous United States

McManamay, Ryan A.; George, Rob; Morrison, Ryan R.; Ruddell, Benjamin L.

doi:10.1038/s41597-022-01566-1

Download PDF

Data Descriptor
Open access
Published: 28 July 2022

Mapping hydrologic alteration and ecological consequences in stream reaches of the conterminous United States

Scientific Data volume 9, Article number: 450 (2022) Cite this article

3661 Accesses
6 Citations
88 Altmetric
Metrics details

Subjects

Abstract

Environmental flows are critical for balancing societal water needs with that of riverine ecosystems; however, data limitations often hinder the development of predictive relationships between anthropogenic modifications to streamflow regimes and ecological responses – these relationships are the basis for setting regional water policy standards for rivers. Herein, we present and describe a comprehensive dataset of modeled hydrologic alteration and consequences for native fish biodiversity, both mapped at the stream-reach resolution for the conterminous U.S. Using empirical observations of reference conditions and anthropogenically altered streamflow at over 7000 stream gauges, we developed a predictive model of hydrologic alteration, which was extended to >2.6 million stream reaches. We then used a previous nationwide assessment of ecological responses to hydrologic alteration to predict fish biodiversity loss in stream reaches resulting from streamflow modification. Validation efforts suggested hydrologic alteration models had satisfactory performance, whereas modeled ecological responses were susceptible to compounded errors. The dataset could ameliorate regional data deficits for setting environmental flow standards while providing tools for prioritizing streamflow protection or restoration.

Measurement(s)	hydrologic alteration, human alteration of streamflow regimes • fish biodiversity
Technology Type(s)	Random forest • Quantile regression techniques
Factor Type(s)	human disturbances in streams (land use, dam storage, water use)
Sample Characteristic - Organism	Freshwater fish
Sample Characteristic - Environment	stream
Sample Characteristic - Location	contiguous United States of America

Over 200,000 kilometers of free-flowing river habitat in Europe is altered due to impoundments

Article Open access 09 October 2023

Piotr Parasiewicz, Kamila Belka, … Wiesław Wiśniewolski

Global prevalence of non-perennial rivers and streams

Article 16 June 2021

Mathis Loïc Messager, Bernhard Lehner, … Thibault Datry

Predicting climate heating impacts on riverine fish species diversity in a biodiversity hotspot region

Article Open access 01 September 2023

Toktam Makki, Hossein Mostafavi, … Brian W. Coad

Background & Summary

Freshwater ecosystems are facing a global biodiversity crisis¹. Despite only 0.01% of the Earth’s surface water occurring as freshwater ecosystems, these systems harbor 7% of the world’s described species and one third of all vertebrates^2,3. Yet these same ecosystems are facing alarming rates of habitat and biodiversity loss. Even among freshwater ecosystems, rivers and streams are considered numerically rare despite providing disproportionate services to society, ultimately leading to their over-exploitation⁴. Much of the exploitation of lotic systems is directly related to the manipulation of natural streaflow variability. For instance, roughly 50% of rivers and streams across the world are hydrologically altered from their natural state^5,6, with as much as 80% of streams in Canada⁷ and 80% of streams in the conterminous US⁸ displaying signs of alteration in streamflow regimes. By 2050, climate change is expected to have altered streamflow regimes in as much as 80% of the global terrestrial land surface aside from that already altered from human infrastructures^9,10 - only exacerbating decades of declines in freshwater fauna populations¹¹. The monumental challenge faced by modern society is “bending the curve” of freshwater biodiversity loss, all while addressing climate change mitigation and human population growth¹.

In their Emergency Recovery Plan for dampening or reversing declines in freshwater biodiversity, Tickner et al.¹ lay out several major actions, the first being the widespread implementation of environmental flows, i.e., the quantity and timing of streamflow volumes required to sustain the ecological integrity of river systems. With competing demands for limited water quantities, the inevitable question that follows is “How much water do river [ecosystems] need?”^12,13. Prescribing environmental flows for rivers requires understanding dependencies or linkages between essential stream flow components (e.g., magnitude, duration, frequency of flows) and ecological processes. Even more critical is understanding ecological responses to human-induced alterations to naturally variant streamflow regimes – generally termed “flow-ecology relationships”¹⁴. At a regional level, flow-ecology relationships, especially thresholds or tipping points of streamflow alteration, can be used as general guidelines for environmental flow implementation and preventatives for over-extraction – this general process has been formalized as the Ecological Limits of Hydrologic Alteration (ELOHA) framework¹⁵. The ELOHA framework has been implemented in at least 20 US states and several countries¹⁶; however, a practical demand of developing empirical flow-ecology relationships include identifying datasets with sufficient empirical observations. A commonly reported shortcoming of regional ELOHA studies is the limited availability of information to support the development of flow-ecology relationships, either from the scientific literature¹⁴ or from pre-existing inventories of biological surveys and hydrologic records¹⁷.

Imposing restrictive criteria on observation datasets dramatically reduces sample sizes and generalizations that can be drawn from the data. For example, in a nationwide study evaluating flow-ecology relationships, only 237 (3%) of 7,000 US Geological Survey gaging locations had co-occurring macroinvertebrate or fish sampling surveys⁸. One solution to ameliorate the data-shortage issue is to increase the volume and density of systematic biological surveys; however, US federal and state-funded systematic biological surveys are already considered regionally comprehensive (>1.5 million sampling events) yet provide high-quality survey data for only 13% of US streams¹⁸. Even marginal increases in the percentage of surveyed streams would come at great cost to resources. A more economical solution is to model expected hydrologic and ecological conditions in ungauged stream reaches to fully leverage the wealth of existing hydrologic and biological data¹⁹. For instance, George et al.¹⁷ used modeled estimates of hydrologic conditions in streams to yield a nationwide flow-ecology dataset of 6,452 observations – a value almost 9 times the observations reported in a comparable study relying on overlapping biological surveys and hydrological data²⁰. Obviously, a limitation of such an approach is the uncertainty of modeled versus observed hydrologic conditions and subsequent error propagation that arises in developing flow-ecology relationships¹⁹.

Given the importance of ensuring region-wide extensions of the ELOHA framework, especially in data-limited regions, a comprehensive dataset of modeled hydrologic conditions or hydrologic alteration values and ecological consequences, mapped to the stream-reach resolution, would be valuable to the research and practitioner community. Not only could such a dataset address limitations of contemporary flow-ecology relationship development, but it expedites the entire 4-step scientific components embodied within the ELOHA process and provides substantial time-savings for researchers and managers. Herein, we present a dataset of modeled hydrologic (streamflow) alteration, summarized in 43 metrics, and resulting estimates of losses in fish species richness for 2.6 million stream reaches within the conterminous US.

Methods

Overview of hydrologic and ecological mapping protocol

Mapping hydrologic and ecological alteration at the stream reach level followed a 7-step process that builds upon several previously published methods (Fig. 1). The steps include: (1) compiling a nationwide dataset of streamflow gauges from the US Geological Survey (USGS) and distinguishing reference and non-reference gages and associated records^21,22,23, (2) assembling stream flow records and calculating hydrologic indices²³, (3) quantifying hydrologic alteration for stream gages²², (4) developing models to predict hydrologic alteration from human disturbance variables²⁴, (5) using models to extrapolate hydrologic alteration to ungauged stream reaches²⁴, (6) developing empirical models of fish species richness responses to hydrologic alteration¹⁷, and (7) mapping fish richness responses to ungauged stream reaches based on modeled estimates of hydrologic alteration. Methodological details are provided in each of the publications cited above; however, an overview of the steps is provided here. We elaborate more fully on the detailed methodology starting at step 3, as this reflects more of the focus of the technical validation of the dataset (Fig. 1).

Step 1 - Compiling a nationwide streamflow dataset

We assembled streamflow information for 7,088 USGS stream gages with at least 15 years of daily discharge data as of 2010. We only included gages with at least 15 years of complete annual records (i.e., those with <= 30 days of missing daily data). The influence of climate variability on hydrologic statistic values dampens as periods of records increase, and typically, at least 15 years of record are required to stabilize variation in indices, at least to acceptable levels for spatial analyses²⁵. However, a noted limitation of our analysis is that we did not formally control for climate variation in the calculation of hydrologic statistics from stream records varying in length and duration, as compared to Eng et al.²⁶. However, Eng et al.²⁶ found that climate variability had minimal influence on hydrologic alteration at stream gages relative to human land and water management.

Gages were partitioned into reference (n = 2,249) and non-reference (n = 4,839) condition based on geographical evaluations of human disturbance regimes in basins upstream of each gage, reviews of USGS water reports, and visually inspecting flow duration curves^21,22. For ease of terminology, we use the term “reference” to indicate the “least disturbance” condition for a region, as defined by Stoddard et al.²⁷. For streams currently regulated by dams and with at least 15-yr records extending prior to dam regulation (n = 250), streamflow records were partitioned into pre- and post-dam construction time periods²². Except for pre-dam construction records, periods of record for reference conditions displayed considerable temporal overlap (at least 50% of overlap in records)²³, as did periods of record for non-reference conditions²².

Step 2- Assembling streamflow records and calculating hydrologic statistics

Daily streamflow records were obtained from the USGS National Water Information System (NWIS) website (https://waterdata.usgs.gov/nwis). For reference streams, the entire period of record was used for calculating hydrologic statistics, whereas for non-reference gages, only periods overlapping with contemporary human disturbance regimes were used (1980–2010) due to the temporal limitations of anthropogenic disturbance variables used to predict hydrologic alteration. The National Hydrologic Assessment Tool (www.sciencebase.gov/catalog/item/5387735ee4b0aa26cd7b5461) was used to calculate 110 hydrologic statistics summarizing the magnitude, frequency, duration, timing, and rate of change in flow for all reference and non-reference stream flow records²³.

Step 3- Calculating hydrologic alteration indices at gages

Calculating hydrologic alteration at non-reference gages first required estimating reference or natural hydrologic conditions as a baseline from which the degree of alteration could be assessed. Of the 110 hydrologic statistics above, we selected 41 indices that adequately represent the multi-dimensional nature of regional variation in hydrologic across the US²³ and have been used in previous assessments of hydrologic alteration²² (Table 1). These 41 indices include the Indicators of Hydrologic Alteration¹², a series of non-redundant metrics representing the predominant variation embodied by almost 200 hydrologic variables²⁸. Since the 41 indices are univariate summaries of hydrologic conditions, two additional indices, a hydrologic alteration index and a seasonality alteration index, were calculated to represent multivariate impacts to overall variation among hydrologic metrics and the distribution of monthly flows, respectively (more details provided in Step 3 expanded).

Table 1 Hydrologic indices used in the study and their description. Table taken directly from George et al.¹⁷.

Full size table

Generally, reference condition values of hydrologic indices were estimated for non-reference gages using random forest statistical models constructed from reference gauges or gauges with pre-dam hydrologic records²². Random forest model performance was high with a median variance explained of 91% among all hydrologic indices and median normalized RMSE of 0.513²² (normalized RMSE by range of values). In cases where statistical models were unreliable (i.e., indices depicting timing of low and high flows), non-reference stream gages were assigned to a hydrologic class representing a range of reference condition index values²². In these situations, the 90^th percent confidence interval of hydrologic index values represented by all reference gauges within a hydrologic class was used to represent the reference flow condition. Observed hydrologic indices were then compared to estimated reference conditions to calculate hydrologic alteration indices, characterizing the degree of changes in stream flow due to human influence (see next section).

Step 4 and 5 - Predicting hydrologic alteration and mapping to U.S. streams

Random forest models were constructed to predict each hydrologic alteration index at stream gages using an ensemble of human disturbance variables summarized in the upstream basins contributing to each gage. Predictor variables included landcover, dam storage, infrastructure, and water withdrawals (Supplementary Table 1). Random forest models were developed for the entire US and for each of 29 ecohydrologic regions (Fig. 2), which represent unique combinations of Freshwater Ecoregions²⁹ and two-digit hydrologic unit codes (i.e., major river basins). The same human disturbance variables were compiled in the networks upstream of all NHDplus V1 stream reaches (https://nhdplus.com/NHDPlus/NHDPlusV1_home.php) and models were then applied to predict hydrologic alteration in those reaches. Values were then extended to NHDplus V2 stream reaches (https://www.epa.gov/waterdata/get-nhdplus-national-hydrography-dataset-plus-data) using crosswalk tables.

Step 6 and 7 – Predicting and mapping ecological responses to hydrologic alteration in stream reaches

Comprehensive maps of hydrologic alteration in stream reaches provide a foundation for subsequent modeling efforts, such as evaluating ecological responses to altered streamflow. Once these flow-ecology relationships are developed for multiple regions, ecological conditions, similar to hydrologic conditions, can be extrapolated to stream reaches. Recently, George et al.¹⁷ used the same hydrologic alteration indices reported in this study to develop regionally explicit nationwide flow-alteration-ecological-response relationships. These models were used to extrapolate ecological conditions (i.e., losses in fish species richness) to the stream reach resolution based on modeled hydrologic alteration values.

Step 3 expanded: Calculating hydrologic alteration indices for stream gages

We elaborate on the above methodology, starting here with step 3. We used two approaches for calculating hydrologic alteration for non-reference stream gages²⁴. For the majority of indices (Table 1), we calculated hydrologic alteration as proportional changes of observed (O) index values (i.e., human-altered conditions), versus expected (E) index values (i.e., reference conditions) in the equation: (O – E)/E³⁰. Hence, indices ranged from −1 to values» 1. Performance, as measured by area-under-the-curve (AUC), of preliminary models using these raw values were lower than desired (AUC < 0.7). Hence, following Eng et al.³⁰, we scaled values from 0 to 1 to represent a likelihood of hydrologic alteration in the following fashion. For indices < = 1, we used the absolute value of (O – E)/E, whereas indices >1 were assigned maximum values of 1. For reference gages, hydrologic alteration values were set to 0 for each metric.

While the above measures are informative for individual flow components, indices that summarize the multi-dimensional nature of stream flow alterations provide convenient single measures of alteration. We calculated a seasonality index, analogous to Zaerpour et al.⁷, representing shifts in the monthly flow magnitudes, as cumulative differences in observed (O) and expected (E) values for all mean monthly flows using the following equation:

$$\mathop{\sum }\limits_{m=i}^{12}\left(\frac{\left({O}_{i}-{E}_{i}\right)}{{E}_{i}}\right)$$

(1)

As a second multidimensional measure, we calculated a cumulative hydrologic alteration index (HAI), which evaluates the degree of separation between the flow regime of non-reference streams’ and that of reference streams within the same hydrologic class. Hydrologic classes represent groups of streams that share similar natural hydrologic patterns. McManamay et al.²³ developed a hydrologic classification of reference streams in the US, and subsequently, non-reference gages were assigned to those hydrologic classes using models²². To calculate the HAI, all 110 hydrologic metrics (step 2 above) for reference and non-reference gages were centered, scaled from 0 to 1, and assessed in a principal components analysis (PCA)²⁴. Thirteen of the components were significant according to the broken-stick method³¹. We partitioned the 13 principal component scores by hydrologic class membership and calculated 90^th percentile confidence intervals for only reference streams. The confidence interval (a…b) for significant PC scores (S) is represented by the lower (a) and upper (b) bounds. For each non-reference gage and each significant PC, we calculated a rank (r) value using the following:

$${\rm{If}}\;{a}_{i}\le {S}_{i}\le {b}_{i}\;{\rm{is}}\;{\rm{true}}\;{\rm{then}}\;{r}_{i}=0,\;{\rm{otherwise}}\;{r}_{i}={V}_{i},$$

(2)

Where V_i is the eigenvalue for the i^th significant PC. The HAI was then calculated for each non-reference gage using:

$${\sum }_{i=1}^{n}\left|{S}_{i}-{a}_{i}\right|\ast {r}_{i}\;{\rm{for}}\;{S}_{i} < {a}_{i},{\rm{and}}\;{\,\sum }_{i=1}^{n}\left|{S}_{i}-{b}_{i}\right|\ast {r}_{i}\;{\rm{for}}\;{S}_{i} > {b}_{i}$$

(3)

The formula accounts for both the degree of alteration of the PC (i.e., S_i-a_i or S_i-b_i) as well as the importance of each PC to overall variability in hydrologic regimes (i.e., eigenvalue, V_i).

To ensure all metrics were on a similar scale, both the seasonality index and HAI were scaled from 0 to 1 for each ecohydrologic region based on:

$$\frac{{x}_{i}-{\rm{\min }}(x)}{\max \left(x\right)-{\rm{\min }}(x)}$$

(4)

Step 4 and 5 expanded: Hydrologic alteration models and mapping

Random forests³² were constructed to model all hydrologic alteration indices as binomial distributions using 50 predictor variables summarizing natural characteristics and human disturbances, such as landscape alteration and infrastructures (Supplementary Table 1, see²⁴). Random forests are a form of machine learning where large numbers of decision trees are constructed in an iterative fashion using a bootstrapped subsample of observations and subsets of variables³². The remaining observations are termed the out-of-bag (OOB) sample, which is used in cross-validation measurements of variance explained, error, and variable importance. Each tree is constructed from training data and then predictions are combined among all trees. We used the randomForest package³³ in the R programming environment to develop tree-based models for the entire US (all gauges) and separately for each ecohydrologic region. Hence, with 43 hydrologic indices and 29 regions, over 1,000 forest models were generated. Predictor variables used in models are classified into 8 groups (number of variables in parentheses): urbanization (14), agriculture (10), dams and reservoirs (6), power generation (6), dischargers and flow modifiers (5), human disturbance indices (3), basin size, stream size, and climate (3), and natural land cover (3) (Supplementary Table 1). Predictor variables were obtained from multiple sources or our own geospatial analysis and were summarized for both the local catchment surrounding each stream reach containing the stream gauge or were accumulated for the entire catchment contributing to each gauge (Supplementary Table 1). Similarly, the same predictor variables were compiled for all 2.6 million NHDPlus V1 stream reaches, both for local catchments and entire stream networks upstream of each reach. Following construction and calibration, random forest models were used to extrapolate hydrologic alteration values to all 2.6 million NHDPlus V1 stream reaches in the CONUS (Fig. 2). Using crosswalks between NHDPlus V1 and V2, we extended hydrologic alteration values to NHD Plus V2 stream reaches.

Step 6 and 7 expanded: Ecological alteration models and mapping

To develop flow-alteration-ecological response relationships, George et al.¹⁷ developed a comprehensive dataset of overlapping hydrologic and ecological data for 6,452 stream reach locations. At each location, measures of hydrologic alteration and ecological alteration were compiled, where ecological alteration was measured as the deviation in observed native fish richness from expected natural conditions¹⁷. Flow-alteration-ecological-response relationships typically adopt a “wedge-shaped” distribution well-suited for quantile regression³⁴. Hence, George et al. generated quantile regression models predicting 50^th, 75^th, and 95^th percentile alterations in fish richness from hydrologic alteration values for all hydrologic metrics, except HAI, within each 4-digit hydrologic unit code (HUC-4), except watersheds where limited sample size prohibited model development (12% of watersheds).

Quantile regression model coefficients for each hydrologic metric within each HUC4 were used to predict alterations in native fish richness at the stream reach resolution based on modeled estimates of hydrologic alteration. In situations where coefficients were unavailable for HUC-4s, average model coefficients for entire ecohydrologic regions were used. Residuals in fish richness were calculated for each hydrologic alteration metric (e.g., Fig. 3a). Flow thresholds or tipping points represent hydrologic alteration values beyond which ecological degradation is expected¹³ or can be deemed socially unacceptable¹⁵. Presuming that loss of any native fish species is unacceptable, the hydrologic alteration value at which residuals in fish richness <0 is considered the threshold or limit. Based on the quantile regression models¹⁷, thresholds were identified and applied to all stream reaches based on HUC-4 or ecohydrologic region. Modeled hydrologic alteration values for each metric were compared to each respective threshold in each stream reach to yield a binary response where losses in fish biodiversity are expected (1) or not (0), depending on if the hydrologic threshold was exceeded. The mean value among these responses for all hydrologic alteration metrics yields a probability of fish biodiversity loss (Fig. 3b), ranging from 0 to 1, based on all components of the flow regime.

Data Records

All data records available and their file structure are described in Table 2. Modeled estimates of hydrologic alteration for all 43 metrics and estimates of fish biodiversity loss in response to each metric (except HAI) are provided at the NHDPlus stream-reach resolution in .csv or .txt files. Each row entry includes the corresponding stream reach identifiers (i.e., COMIDs) for both NHDPlus V1 and V2 versions. Given that fish species richness responses were modeled for many hydrologic alteration metrics, the median and extreme fish biodiversity loss values across all metrics, in addition to binary and probability measures of fish biodiversity loss, are also provided for each stream reach. Estimates of hydrologic alteration are provided separately for different modeled outcomes, including the random forests developed for all gauges in the US and the aggregated results from all forests developed for each ecohydrologic region. Likewise, the fish species richness response data are provided in two datasets for reach set of responses, one for the entire US and one for the aggregated regional results. The data are generally split into two or four regional sub-datasets for the entire CONUS. Random forest model performance measures, including AUC values and relative importance of predictor variables are also provided in .csv files, both for the entire-US model and for each ecohydrologic region model. Data records are freely and openly accessible under a Creative Commons Attribution 4.0 International license on Zenodo³⁵.

Table 2 Description of files in the Data Record openly accessible through Zenodo³⁵.

Full size table

Technical Validation

We evaluated hydrologic alteration model performance using area-under-the-curve (AUC) measures for all models. AUC values represent cross-validation measures, as predictions from model trees are taken from the out-of-bag (OOB) sample, representing roughly 33% of observations, modeled in that forest. AUC values are appropriate for evaluating the performance of hydrologic alteration models as the indices range from 0 to 1; however, AUC evaluation requires a binary classifier as the control from which to measure performance. In this case, we used two binary control measures to calculate AUC values: a binary categorization of reference and non-reference gages (measure 1), and a categorization of high (index > 0.5) and low (index < = 0.5) hydrologic alteration values in the empirical observations (measure 2). Relative importance (RI) of variables in predicting hydrologic alteration indices was also measured from the OOB sample as the average decrease in mean squared error (MSE) rates across all permutations of a given variable among all trees. Because RI can vary with MSE values, we scaled all RI values from 0 to 1 using the Eq. 4 to support comparison among hydrologic alteration indices, groups of predictor variables, and regions. MSE values for all random forest models are provided in Supplementary Material 1.

AUC values for models developed for the entire US averaged 0.87 and ranged from 0.72 to 0.91 for measure 1, whereas for measure 2, AUC values averaged 0.82 and ranged from 0.78 to 0.88 (Fig. 4, results also available via³⁵). Performance for models developed for individual ecohydrologic regions varied considerably. For measure 1, AUC values averaged 0.87 and ranged from 0.5 to 0.998 depending on the hydrologic index (Fig. 4a) and region (Fig. 4b). For measure 2, AUC values averaged 0.81 and ranged from 0.53 to 0.995 (Fig. 4c,d). Variation in model performance seemed to be related to intrinsic properties of each region and less influenced by different hydrologic indices. Differences in model performance was not influenced by the gauges available for model construction. For instance, there was no relationship between the number of observations (number of gauges) and average variation explained in hydrologic indices for each ecohydrologic region.

The relative importance of predictor variables in explaining variation in hydrologic indices was also explored. Climate and natural basin characteristics, such as drainage area and precipitation, were the most important predictor variables, followed by variables characterizing dams (e.g., dam storage, degree of regulation). Landcover variables were of moderate importance, whereas energy infrastructure and water discharges (unrelated to dam regulation) were of minor importance (Fig. 5a). With a few exceptions, variation in importance among models within a region generally mirrored the overall importance of variables, where basin and climate variables had higher variation in importance, followed by dam-related variables (Fig. 5b).

As a second and coarse validation of hydrologic alteration, we compared our nationwide rates of hydrologic alteration from our assessment to three other nationwide assessments of hydrologic alteration, conducted at either the stream-gage or stream-reach level^8,26,36. Comparing rates of alteration among different assessments requires establishing common hydrologic alteration thresholds, which, in the case of Carlisle et al.⁸, was > = 10% change in hydrologic indices from reference conditions. Carlisle et al. found that 86% of stream gages showed signs of at least 10% deflated mean annual minima and maxima values. Based on our assessment, 93% of stream gages had alteration values of > = 0.1 for DL1 (1-day annual minimum), 66% of gages with > = 0.1 for DH1 (1-day annual maximum), and 96% of gages considering both DL1 and DH1. In contrast to Carlisle et al., our analysis considers both decreases and increases to hydrologic metrics as a measure of alteration. Extending the comparison to stream reaches, approximately 74% of U.S. stream mileage, on average (79% median), across all hydrologic indices had hydrologic alternation values equal or exceeding 10%. According to the hydrologic alteration index (HAI), 79% of streams were hydrologically altered, whereas the mean daily flow index (MA1) indicated 81% of streams were altered.

Eng et al.²⁶ conducted a thorough study of hydrologic alteration at stream gages and reported percentages of non-reference gages displaying alteration for twelve hydrologic metrics, some of which were compatible with those used in our study, although none were identical (Supplementary Material 2, Table S1). Aside from differences in the metrics used, the scale of values used in Eng et al.’s approach to measure degrees of hydrologic alteration differed from our approach and required some adjustment to be compatible for comparison (Supplementary Material 2, Table S2). Generally, the frequency of non-reference stream gages displaying degrees of alteration for various flow components in our study agreed relatively well with that of Eng et al. (Fig. 6). Carlisle et al.³⁶ used machine learning models to extend Eng et al.’s stream gage assessment to the stream reach level and reports that 1.2 million km, or roughly 38% of streams have “modified flows”. Carlisle et. al. does not report what threshold was used to generate this result; however, if we assume that a 20% presumptive standard threshold¹³ was used, then our dataset suggests 57% of streams are hydrologically altered based on HAI values > = 0.2 (note: HAI is a more comprehensive measure of 110 hydrologic metrics, compared to the 12 reported in Eng et al.). Collectively, despite inconsistencies in approaches, these comparisons suggest that our hydrologic alteration assessment is within reasonable estimates of other nationwide assessments.

Validation of the ecological alteration data required consideration of compounding error arising from both the hydrologic alteration models and quantile regression models. Hence, cumulative uncertainty in fish biodiversity loss can stem from error in the quantification of hydrologic alteration thresholds, as well as estimation of hydrologic alteration. Using mean-square error (MSE) rates for each random forest model, modeled hydrologic alteration estimates $(\hat{HA})$ were adjusted for each stream reach by subtracting or adding MSE values (from random forest models), yielding minimum $(\overline{HA})$ and maximum $(\overline{\overline{HA}})$ values, respectively, for each hydrologic metric. For quantile regression models, standard error values for slope coefficients were used to vary slopes to yield minima $(\bar{T})$ and maxima $(\overline{\overline{T}})$ threshold values for each hydrologic metric. Subsequently, fish biodiversity loss (R) was modeled as a binary outcome of surpassed thresholds for the h^th hydrologic metric at the i^th stream reach, where

$${R}_{h,i}\in \left\{0,1\right\}{\rm{for}}\;{\rm{all}}\;h\;{\rm{and}}\;i.$$

(5)

Lower limits (${\bar{R}}_{h,i}$) and upper limits $({\overline{\overline{R}}}_{h,i})$ of fish biodiversity loss were subject to the following constraint:

$${\rm{If}}\,{\bar{T}}_{h}\ge {\overline{\overline{HA}}}_{h,i},\;{\rm{then}}\;{\overline{R}}_{h,i}=1,\;{\rm{else}}\;{\overline{R}}_{h,i}=0,$$

(6)

and

$${\rm{If}}\,{\overline{\overline{T}}}_{h}\ge {\overline{HA}}_{h,i},\;{\rm{then}}\;{\overline{\overline{R}}}_{h,i}=1,\;{\rm{else}}\;{\overline{\overline{R}}}_{h,i}=0.$$

(7)

Lower and upper limits to the probability of fish biodiversity loss for each stream reach were then calculated as:

$$\frac{{\sum }_{h}^{n}{\bar{R}}_{h,i}}{n}$$

(8)

and

$$\frac{{\sum }_{h}^{n}{\overline{\overline{R}}}_{h,i}}{n}$$

(9)

Where n is the number of hydrologic metrics. Lower and upper limits of probability of fish biodiversity loss were calculated separately for 50^th, 75^th, and 95^th quantile model results, each of which had different hydrologic threshold values corresponding to each hydrologic alteration metric.

Cumulative frequency distributions were used to examine the variability and uncertainty in the probability of fish biodiversity loss in stream reaches according to different quantile models. The distribution of fish biodiversity loss probability varied widely among different quantiles (Fig. 3c–e). Uncertainty was considerable, indicating high amounts of compounding error arose from both hydrologic alteration models and quantile models predicting fish biodiversity loss (Fig. 3c–e); however, total uncertainty was lower for the 95^th percentile models (Fig. 3e).

Usage Notes

Mapping hydrologic alteration indices and associated measures of ecological responses in stream reaches provides a rich dataset for environmental flow applications, especially in data-poor regions. The dataset provides general “rules of thumb” for environmental flow guidelines, specifically which hydrologic indices are important to ecology, and hence, management. Maps at the stream-reach resolution can be used to explore hydrologic indices with the highest rates of alteration, as well as the risk of stream habitat alteration and biodiversity loss. As such, the data can be useful for exploring the frequency of alteration in the landscape according to hydrologic or biologically meaningful spatial units (e.g., species ranges), and can support aquatic species GAP analyses or prioritization for areas of flow restoration. Data are provided in accessible .csv formats and summarized at the NHDplus stream-reach scale. Hence, the data provide inherent interoperability with other NHDplus-derived products with associated COMID identifiers. The full functionality of the NHDplus framework, including topological connections and stream routing, can be leveraged with the hydrologic alteration dataset.

Users of the dataset should be aware of its limitations. The modeled hydrologic alteration and ecological alteration values are not meant to replace site-specific environmental assessments and are most appropriately used as information building blocks for regional environmental flow applications. As indicated in the validation section, random forest models of hydrologic alteration had variable error rates. While model performance is satisfactory for most hydrologic metrics, uncertainty would have likely reduced had we controlled for climate variation among reference and non-reference gages²⁶. Although 15 years of hydrologic data records is considered sufficient for calculating hydrologic statistics²⁵, a formal control of climate shifts would have likely improved model performance. Furthermore, compounded uncertainties were significant when predicting fish biodiversity responses to alteration. An important consideration is the interpretation of hydrologic alteration values, which have been standardized from 0 to 1. As such, these are not directly translatable as measures of hydrologic alteration. Although values were generally derived from ratios of alteration and intended to represent a probability of alteration, interpreting the values as measures of risk and indicators of alteration is more appropriate than in a strict probabilistic sense. Finally, random forest models developed for each ecohydrologic region were influenced by the range of hydrologic alteration values and frequency of altered streams in those regions. Additionally, values were standardized based on minima and maxima within each ecohydrologic region; therefore, hydrologic alteration values derived from regional models will display slight regional biases in values. Therefore, users desiring US-wide applications should perhaps use values from US-wide models.

Code availability

Data processing, visualization, and random forest development were conducted using pre-existing libraries, particularly the randomForest package³³ and ggplot2 package³⁷ (for density plots and stacked bar plots) in the R programming environment, whereas spatial mapping was conducted in ArcMap 10.7.1, as indicated in the methods and validation. Therefore, no custom code was used for this study.

References

Tickner, D. et al. Bending the curve of global freshwater biodiversity loss: an emergency recovery plan. BioScience 70(4), 330–342 (2020).
Article Google Scholar
Dudgeon, D. et al. Freshwater biodiversity: importance, threats, status and conservation challenges. Biological reviews 81(2), 163–182 (2006).
Article Google Scholar
Balain, E. V., Segers, H., Lévèque, C. & Martens, K. The Freshwater Animal Diversity Assessment: An overview of the results. Hydrobiologia 595, 627–637 (2008).
Article Google Scholar
McManamay, R. A. & Jager, H. I. Stream Biomes of the World in Encyclopedia of the World’s Biomes (ed. Goldstein, M. I. & DellaSala, D. A.) https://doi.org/10.1016/B978-0-12-409548-9.12047-0 (Elsevier, 2020).
Grill, G. et al. An index-based framework for assessing patterns and trends in river fragmentation and flow regulation by global dams at multiple scales. Environ. Res. Lett. 10(1), 015001 (2015).
Article ADS Google Scholar
Virkki, V. et al. Environmental flow envelopes: quantifying global, ecosystem-threatening streamflow alterations. Hydrol. Earth Syst. Sci. https://doi.org/10.5194/hess-2021-260 (2021)
Zaerpour, M., Hatami, S., Sadri, J. & Nazemi, A. A global algorithm for identifying changing streamflow regimes: application to Canadian natural streams (1966–2010). Hydrol. Earth Syst. Sci. 25(9), 5193–5217 (2021).
Article ADS Google Scholar
Carlisle, D. M., Wolock, D. M. & Meador, M. R. Alteration of streamflow magnitudes and potential ecological consequences: a multiregional assessment. Front. Ecol. Environ. 9(5), 264–270 (2011).
Article Google Scholar
Döll, P. & Zhang, J. Impact of climate change on freshwater ecosystems: a global-scale analysis of ecologically relevant river flow alterations. Hydrol. Earth Syst. Sci. 14(5), 783–799 (2010).
Article ADS Google Scholar
Asadieh, B. & Krakauer, N. Y. Global change in streamflow extremes under climate change over the 21^st century. Hydrol. Earth Syst. Sci. 21, 5863–5874 (2017).
Article ADS CAS Google Scholar
Grooten, M. & Almond, R. E. A. Living Planet Report - 2018: Aiming Higher. https://www.worldwildlife.org/pages/living-planet-report-2018 (World Wildlife Fund, 2018)
Richter, B. D., Baumgartner, J. V., Powell, J. & Braun, D. P. A method for assessing hydrologic alteration within ecosystems. Conserv. Biol. 10(4), 1163–1174 (1996).
Article Google Scholar
Richter, B. D., Davis, M., Apse, C. & Konrad, C. A presumptive standard for environmental flow protection. River Research and Applications 28(2012), 1312–1321 (2012).
Article Google Scholar
Poff, N. L. & Zimmerman, J. K. Ecological responses to altered flow regimes: a literature review to inform the science and management of environmental flows. Freshw. Biol. 55(1), 194–205 (2010).
Article Google Scholar
Poff, N. L. et al. The ecological limits of hydrologic alteration (ELOHA): a new framework for developing regional environmental flow standards. Freshwater biology 55(1), 147–170 (2010).
Article Google Scholar
TNC (The Nature Conservancy). ELOHA Case Studies. https://www.conservationgateway.org/ConservationPractices/Freshwater/EnvironmentalFlows/MethodsandTools/ELOHA/Pages/Case-Studies.aspx (2018)
George, R., McManamay, R., Perry, D., Sabo, J. & Ruddell, B. L. Indicators of hydro-ecological alteration for the rivers of the United States. Ecological Indicators 120, 106908 (2021).
Article Google Scholar
Troia, M. J. & McManamay, R. A. Completeness and coverage of open-access freshwater fish distribution data in the United States. Diversity and Distributions 23, 1482–1498, https://doi.org/10.1111/ddi.12637 (2017).
Article Google Scholar
Irving, K. et al. A high-resolution streamflow and hydrological metrics dataset for ecological modeling using a regression model. Scientific data 5(1), 1–14 (2018).
Article Google Scholar
Carlisle, D. M., Grantham, T. E., Eng, K. & Wolock, D. M. Biological relevance of streamflow metrics: regional and national perspectives. Freshwater Sci. 36(4), 927–940 (2017).
Article Google Scholar
Falcone, J. A. GAGES-II: Geospatial Attributes of Gages for Evaluating Streamflow. U.S. Geological Survey https://doi.org/10.3133/70046617 (2011).
McManamay, R. A. Quantifying and generalizing hydrologic responses to dam regulation using a statistical modeling approach. J. Hydrol. 519, 1278–1296 (2014).
Article ADS Google Scholar
McManamay, R. A., Bevelhimer, M. S. & Kao, S. C. Updating the US hydrologic classification: an approach to clustering and stratifying ecohydrologic data. Ecohydrology 7(3), 903–926 (2014).
Article Google Scholar
McManamay, R. A. et al. US cities can manage national hydrology and biodiversity using local infrastructure policy. Proceedings of the National Academy of Sciences 114(36), 9581–9586 (2017).
Article ADS CAS Google Scholar
Kennard, M. J., Mackay, S. J., Pusey, B. J., Olden, J. D. & Marsh, N. Quantifying uncertainty in estimation of hydrologic metrics for ecohydrological studies. River Res. Applic. 26, 137–156, https://doi.org/10.1002/rra.1249 (2010).
Article Google Scholar
Eng, K., Carlisle, D. M., Grantham, T. E., Wolock, D. M. & Eng, R. L. Severity and extent of alterations to natural streamflow regimes based on hydrologic metrics in the conterminous United States, 1980–2014: U.S. Geological Survey Scientific Investigations Report 2019–5001, 25 p., https://doi.org/10.3133/sir20195001 (2019)
Stoddard, J. L., Larse, D. P., Hawkins, C. P., Jonson, R. K. & Norris, R. H. Setting expectations for the ecological conditions of streams: the concept of reference condition. Ecological Applications 16, 1267–1276 (2006).
Article Google Scholar
Olden, J. D. & Poff, N. L. Redundancy and the choice of hydrologic indices for characterizing streamflow regimes. River Res. Appl. 19(2), 101–121 (2003).
Article Google Scholar
Abell, R. et al. Freshwater Ecoregions of the World: A new map of biogeographic units for freshwater biodiversity conservation. BioScience 58, 403–414, https://doi.org/10.1641/B580507 (2008).
Article Google Scholar
Eng, K., Carlisle, D. M., Wolock, D. M. & Falcone, J. A. Predicting the likelihood of altered streamflows at ungauged rivers across the conterminous United States. River Res. Appl. 29(6), 781–791 (2013).
Article Google Scholar
Jackson, D. A. Stopping rules in principal components analysis: a comparison of heuristical and statistical approaches. Ecology 74, 2204–2214 (1993).
Article Google Scholar
Breiman, L. Random forests. Mach Learn 45, 5–32 (2001).
Article Google Scholar
Liaw, A. & Wiener, M. Classification and Regression by randomForest. R News, 2(3), 18–22, https://CRAN.R-project.org/doc/Rnews/ (2002).
Knight, R. R., Murphy, J. C., Wolfe, W. J., Saylor, C. F. & Wales, A. K. Ecological limit functions relating fish community response to hydrologic departures of the ecological flow regime in the Tennessee River basin, United States. Ecohydrology 7(5), 1262–1280 (2014).
Article Google Scholar
McManamay, R. A., George, R., Morrison, R. A. & Ruddell, B. L. A dataset of modeled hydrologic alteration and ecological consequences in stream reaches of the conterminous United States, Zenodo https://doi.org/10.5281/zenodo.5839011 (2022).
Carlisle, D. M. et al. Flow modification in the Nation’s streams and rivers: U.S. Geological Survey Circular 1461, 75 p., https://doi.org/10.3133/cir1461 (2019)
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. ISBN 978-3-319-24277-4, https://ggplot2.tidyverse.org (2016)

Download references

Acknowledgements

This research was partially funded through the US Fish and Wildlife Service (USFWS), Grant TX T-234-R-1, F21AF02689-00 under the authority of the State Wildlife Grant Program (CFDA #15.634) issued by Texas Parks and Wildlife Department. We thank two anonymous reviewers for providing comments that improved previous version of this manuscript.

Author information

Authors and Affiliations

Department of Environmental Science, Baylor University, Waco, TX, 76798, USA
Ryan A. McManamay
School of Informatics, Computing and Cyber Systems, Northern Arizona University, Flagstaff, AZ, 86011, USA
Rob George & Benjamin L. Ruddell
Department of Civil and Environmental Engineering, Colorado State University, Fort Collins, CO, 80523, USA
Ryan R. Morrison

Authors

Ryan A. McManamay
View author publications
You can also search for this author in PubMed Google Scholar
Rob George
View author publications
You can also search for this author in PubMed Google Scholar
Ryan R. Morrison
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin L. Ruddell
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.A.M. conceived the idea of the data, constructed the dataset, and drafted the manuscript. R.G. built the ecological thresholds and helped with dataset integration. R.R.M. assisted in writing and editing the manuscript. B.L.R. co-conceived the data and edited the draft.

Corresponding author

Correspondence to Ryan A. McManamay.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information 1

Supplementary Information 2

Supplementary Table 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

McManamay, R.A., George, R., Morrison, R.R. et al. Mapping hydrologic alteration and ecological consequences in stream reaches of the conterminous United States. Sci Data 9, 450 (2022). https://doi.org/10.1038/s41597-022-01566-1

Download citation

Received: 08 February 2022
Accepted: 13 July 2022
Published: 28 July 2022
DOI: https://doi.org/10.1038/s41597-022-01566-1

This article is cited by

Impact of LULC changes on hydrological flow regimes and runoff coefficient in Upper Jhelum Basin, India
- Shahid Ul Islam
- Sumedha Chakma
Sustainable Water Resources Management (2024)
Degradation of floodplain integrity within the contiguous United States
- Ryan R. Morrison
- Kira Simonson
- Dan Carver
Communications Earth & Environment (2023)