A hydrological simulation dataset of the Upper Colorado River Basin from 1983 to 2019

Tran, Hoang; Zhang, Jun; O’Neill, Mary Michael; Ryken, Anna; Condon, Laura E.; Maxwell, Reed M.

doi:10.1038/s41597-022-01123-w

Download PDF

Data Descriptor
Open access
Published: 20 January 2022

A hydrological simulation dataset of the Upper Colorado River Basin from 1983 to 2019

Hoang Tran¹,
Jun Zhang²,
Mary Michael O’Neill³,
Anna Ryken⁴,
Laura E. Condon² &
…
Reed M. Maxwell^1,5

Scientific Data volume 9, Article number: 16 (2022) Cite this article

5173 Accesses
12 Citations
4 Altmetric
Metrics details

Subjects

Abstract

This article presents a hydrological reconstruction of the Upper Colorado River Basin with an hourly temporal resolution, and 1-km spatial resolution from October 1982 to September 2019. The validated dataset includes a suite of hydrologic variables including streamflow, water table depth, snow water equivalent (SWE) and evapotranspiration (ET) simulated by an integrated hydrological model, ParFlow-CLM. The dataset was validated over the period with a combination of point observations and remotely sensed products. These datasets provide a long-term, natural-flow, simulation for one of the most over-allocated basins in the world.

Measurement(s)	streamflow • groundwater • evaporation • evapotranspiration • total water storage
Technology Type(s)	integrated hydrological model
Factor Type(s)	temporal interval • geographic location
Sample Characteristic - Environment	river
Sample Characteristic - Location	Colorado River

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.16632454

Multi-model Hydroclimate Projections for the Alabama-Coosa-Tallapoosa River Basin in the Southeastern United States

Article Open access 18 February 2020

Sudershan Gangrade, Shih-Chieh Kao & Ryan A. McManamay

A comprehensive, multisource database for hydrometeorological modeling of 14,425 North American watersheds

Article Open access 20 July 2020

Richard Arsenault, François Brissette, … Annie Poulin

The floodplain inundation history of the Murray-Darling Basin through two-monthly maximum water depth maps

Article Open access 23 September 2023

David J. Penton, Jin Teng, … Carmel Pollino

Background & Summary

The Upper Colorado River Basin (UCRB) is one of the most over-allocated basins in the world. It provides water for 40 million people in Colorado and downstream states of Arizona, California, Nevada, and Utah. However, the UCRB water capacity is decreasing as a combined result of climate change^{1,2,3,4,5,6,7,8} and anthropogenic activities^1,5,9,10,11. Understanding the dynamic of the UCRB water cycle is crucial for long-term and short-term water resources management.

While understanding the overallocation of an important headwaters system is challenging and depends on many factors, data (in the form of observations and data products) can help provide insight. There are currently three data sources for studying the UCRB’s water resources, namely, point observations, remote sensing products and model output products. All three are valuable tools to quantify the quantity and flux of water in the system, but each are incomplete.

Point observations, such as the streamflow monitoring network, have existed since the early 20^th century in the UCRB. While some observations have dense networks (e.g. there are 630 and 490 monitoring locations for streamflow and air temperature, respectively), others, such as evapotranspiration (ET) and groundwater depth, are more sparse. For example, there is only one AmeriFlux station measuring ET in the entire UCRB (approximately 280,000 km²).

Remote sensing products have become a valuable source of earth systems data and are often used on their own or to complement in-situ point observations. For example, a family of the Moderate Resolution Imaging Spectroradiometer (MODIS) products has been used globally to study various water cycle components and extremes such as snow^12,13, flood^14,15, and drought^16,17. Remote sensing products’ temporal and spatial resolutions may be limited for some applications. For example, the GRACE remote sensing product have been successfully applied to understand large scale depletion of groundwater worldwide¹⁸, but its native spatial resolution, at hundreds of kilometers laterally, can be prohibitively coarse for many hydrological applications¹⁹.

Hydrologic models are also useful tools to understand the quantity and flux of water in the UCRB. A prime example of models used to understand water states and fluxes is the North American Land Data Assimilation System (NLDAS) platform. NLDAS is a collaborative project between NASA, NOAA and a group of the universities and incorporates many different hydrologic models: Mosaic^20,21, Noah-2.8^22,23, Soil moisture Accounting Model (SAC^24,25) and Variable Infiltration Capacity (VIC^26,27). While useful tools, all hydrologic models simplify some aspects of the hydrologic cycle, demonstrate some mismatch between simulated output and observations, and provide output at a discrete spatial and temporal resolution.

Here we present an integrated hydrologic simulation of UCRB spanning from October 1982 to September 2019 at high spatial (1 km) and temporal (hourly) resolutions. The model that we used, ParFlow-CLM, increases both the number of processes simulated over the UCRB (deep groundwater to the top of the canopy) and the spatial and temporal resolution at which these processes are simulated. To build confidence in the simulation results, ParFlow is exhaustively compared to available observations and data products. While still an imperfect representation of the hydrology of the UCRB, this dataset pushes our modeling capabilities forward and augments existing observations and data products to help provide a more complete understanding of this important watershed.

Methods

Site

The Upper Colorado River Basin (UCRB) is a snowmelt-dominated system that covers about 280,000 km². It extends from headwaters in the Rockies in Colorado and Wyoming to Lee’s Ferry in Northern Arizona with elevation ranges between 3300 m and 900 m. During the winter season, from October to the end of April, the snow cover area (SCA) for the UCRB ranges from 50,000 km² to 280,000 km² which plays a crucial role in energy²⁸ and hydrological cycles²⁹.

ParFlow-CLM

The hydrologic simulation of the UCRB was conducted using the integrated hydrologic model, ParFlow-CLM^30,31,32. ParFlow computes both the surface and subsurface fluxes by solving the Richards equation³³ in three spatial dimensions together with the kinematic wave equation over a terrain following grid. Furthermore, ParFlow is coupled to a land surface model (Common Land Model; CLM), ParFlow-CLM, to resolve the energy and water balances from the canopy to the ground surface.

The technical details of ParFlow-CLM are well-documented in Maxwell and Miller³⁰, Kollet and Maxwell^31,34, Kollet et al.³⁵, Maxwell et al.³⁶, Jefferson and Maxwell³⁷, Maxwell and Condon³⁸ and Kuffour et al.³⁹. ParFlow integrates groundwater and surface water systems using a free surface overland flow boundary condition³¹. In other words, the surface water and variably saturated groundwater flow equations directly exchange fluxes without a conductance layer. In ParFlow, streams are formed by either Hortonian (excess infiltration⁴⁰) or Dunne (excess saturation⁴¹) runoff without the need of a priori embedded rivers.

CLM is the land surface component of the model. CLM solves the terrestrial energy balance (e.g. net radiation, sensible, latent and ground heat fluxes) in addition to a multi-layer snow model⁴² and a complete canopy water balance. Sensible and latent heat are solved through a resistance scheme including soil, vegetation and atmospheric resistances⁴³. The ground heat is calculated based on the one-dimensional heat conduction equation³⁵. Ground and sensible heat fluxes are directly dependent on the water content in soil layers which is solved by ParFlow³⁴. Conversely, soil moisture is also dependent on infiltration and plant uptake which is passed back to ParFlow by CLM^34,37,44.

Input datasets

The main inputs in this study can be divided into two groups: dynamic atmospheric forcing and static model parameters. The first group of inputs includes a subset from the North American Land Data Assimilation System (NLDAS) project. The second group of inputs includes two types of model parameters: surface information (i.e. topographic slopes and land cover) and subsurface information (i.e. soil, geology and bedrock types and their characteristics).

The NLDAS project is a collaboration between NASA, NOAA and a group of universities to provide high accuracy and consistent datasets for a wide variety of hydrologic studies. Studies modeling streamflow^45,46, soil moisture^47,48 and snow^49,50 have been using NLDAS as inputs. Thus, we decided to use a subset of the NLDAS dataset for this simulation which includes eight variables, namely, precipitation, air temperature, short-wave radiation, long-wave radiation, east-west wind speed, south-north wind speed, atmospheric pressure and specific humidity. The NLDAS has two versions which were used in this study: NLDAS-1^51,52 which spans from 1983 to 2002 and NLDAS-2^45,53 which spans from 2003 to 2019. Major improvements from NLDAS-1 to NLDAS-2 include additional measurement sources of precipitation such as gauge (Climate Prediction Center - CPC product), radar (National Centers for Environmental Prediction-NCEP 4-km hourly Doppler radar Stage II) and satellite (CPC MORPHing technique – CMORPH)⁴⁵.

The model parameters consist of two types: surface and subsurface. The surface parameters, topographic slopes and land cover, were computed as follows. Topographic slopes were calculated using the Priority Flow toolbox⁵⁴ with an elevation input from the Hydrological data and maps based on Shuttle Elevation Derivatives at multiple Scales (HydroSHEDS). Land cover information was obtained from the National Land Cover Database (NLCD) at 30-m resolution. The obtained land cover dataset was then upscaled to model resolution at 1-km. Land cover values are based on the International Geosphere-Biosphere Program (IGBP) classifications.

The subsurface of the ParFlow domain consists of four soil layers at the top and one geology layer at the bottom. Categories for the soil units were obtained from the Soil Survey Geographic Database (SSURGO; https://websoilsurvey.sc.egov.usda.gov) and hydrogeologic categories were obtained from a global permeability map developed by Gleeson et al.⁵⁵. Parameters such as saturated hydraulic conductivity and van Genuchten relationships of those soil and hydrogeology layers were obtained from Schaap and Leij⁵⁶. More details about the subsurface parameters and configurations can be found in Condon and Maxwell⁵⁷, and Maxwell et al.³⁶.

Model spinup

A model spinup is the initialization process used to bring the system into a more realistic set of initial conditions when the true starting point of the model (for example, the pressures everywhere in the UCRB) is unknown. This starting point is particularly important for groundwater systems which take longer time to evolve than the surface systems.

In preparation for the 37-year simulation, we completed a model spinup in two steps. First, potential recharge (calculated as Precipitation Minus Evapotranspiration (PME)) was applied to the model until the change in subsurface storage was less than 3% of the total storage. The potential recharge PME was derived from the average precipitation and evapotranspiration products for the period between 1950 and 2000 by Maurer et al.⁵⁸. For the second step, the hourly atmospheric forcing for the initial water year (1983) was repeatedly applied to bring the model into quasi-equilibrium.

Simulation from 1983 to 2019

The spinup process described above provided an initial pressure model of the UCRB for the 37-year simulation. To do this, we simulated each year for a time period spanning from October to the end of September next year, often known as the Water Year (WY) which better matches with the precipitation cycle that occurs in late autumn. All simulations were executed on the Cheyenne supercomputer operated by the National Center for Atmospheric Research (NCAR). On average, one WY simulation used about 6,100 cores hours, which resulted in about a day of wall-clock time given parallel computing and batch submission processes. The entire 37-year simulation used approximately 220,000 core hours of computing time, spanning about 1.2 months of wall-clock time.

Observation datasets used for comparison

A comprehensive comparison between model simulation results, observations and remotely sensed products was conducted. A summary of each dataset is provided below.

Streamflow observations were compiled from the USGS Water Data web service. Since this was a pre-development simulation (i.e. excluding surface water management and groundwater pumping), we filtered out observations from stations that are clearly affected by anthropogenic activities. Although small drainage area basins can have water withdrawals and irrigation ditches, the effect of anthropogenic activities on these basins are much less compared to larger drainage area basins, especially in monthly or annual scales^59,60. Thus, we defined a drainage area threshold of 500 km²; stations whose drainage areas are larger than the threshold were then manually inspected. For example, we removed the station at Lee’s Ferry (drainage area: 289,560 km²) located right after the Glen Canyon Dam.

In total, there were a total of 602 UGSG stream stations in the UCRB with observations from 1983 to 2019 (shown as blue stars in Fig. 1). Eight stations situated at the outlet of watersheds that represent medium to large drainage areas were used for comparison demonstration in Fig. 2. These stations were: Green River at Green River (116,160 km²), Colorado River near Cisco (62,419 km²), San Juan River near Bluff (59,570 km²), Yampa River at Deerlodge (20,541 km²), Gunnison River near Grand Junction (20,520 km²), Colorado River below Glenwood Springs (15,576 km²), San Juan River at Four Corners (37,813 km²), and East River at Almont (749 km²).

In addition to the USGS stream observations, we also used the Bureau of Reclamation natural flow dataset^{61,62,63,64,65,66} which is available for 20 stations in the UCRB from 1906 to 2020. The natural flow was constructed by combining history gauge flow with consumptive uses and losses⁶⁴ and reservoir regulation⁶⁶. The dataset has been used in several other studies including drought analysis in the UCRB^67,68,69.

The next observational set was the USGS groundwater database (https://waterdata.usgs.gov/nwis/gw). All data from wells that have at least two months of observations during the period between 1983 and 2019 were used for comparison here. Measurements that did not pass the USGS quality control (i.e. flagged for potential measurement inconsistency or negative outlier values) were filtered out. Also, wells with water table depths (WTD) below 52 m (i.e. below the depth for the center of the bottom grid cell in the model domain) were removed. A total of 36 wells were used to compare water table levels after this filtering process (shown as blue hexagons in Fig. 1).

In addition to these temporal groundwater observations, there are a total of 3,865 well locations in the UCRB from the Fan et al.⁷⁰ water table observations. Fan et al.⁷⁰ compiled this water table observational dataset by calculating the average WTD for USGS sites between 1927 and 2009. While Fan et al.⁷⁰ noted that about 90% of the wells have only one observation at different times, they found that wells whose WTD were above 20 m aligned well with their global simulated WTD. Based on these findings, their dataset was determined to be an appropriate resource to validate model performance.

We also employed a derived snow cover extent from MODIS for comparison to simulations. The daily cloud-free snow cover dataset⁷¹ was developed via a series of mitigated cloud filters and the Variational Interpolation algorithm to the MODIS-Snow Cover Area (SCA) Daily (MOD10C1 and MYD10C1) version 6 product^12,72. The product has been proved to effectively capture the dynamic changes of snow from 2000 to 2017 with the average of Probability Of Detection and False Alarm Ratio are 0.955 and 0.179, respectively⁷¹. The cloud-free product’s spatial and temporal resolutions are 0.05° and daily, respectively.

The Snow Water Equivalent (SWE) data was obtained from the Snow Telemetry (SNOTEL) network. There was a total of 133 SNOTEL stations used for comparison. SNOTEL stations have an average elevation of nearly 2,900 m with the station in the highest elevation of more than 3,500 m at the Italian Creek, CO.

Total water storage (TWS) change measured by the Gravity Recovery and Climate Experiment (GRACE) mission was used to compare with simulated TWS. Launched in 2002, GRACE estimates monthly changes in terrestrial water storages globally based satellite location (http://www2.csr.utexas.edu/grace/RL05_mascons.html). The data used in this study, CSR Release-06 GRACE Mascon Solutions, was released from the Center for Space Research (CSR), the University of Texas at Austin. Mass fluxes (measured in terms of mass concentration—mascon) derived directly from raw GRACE data often have north-south stripes due to modeling errors, measurement noise and observability issues⁷³. To decrease the uncertainty in these mass fluxes, a series of filters were applied to GRACE gravity information in a 1° geodesic grid domain. Those filters include (1) mascon geodesic grid correction, (2) Glacial Isostatic Adjustment (GIA) correction, (3) Degree-1 coefficients (Geocenter) corrections and (4) C20 (degree 2 order 0) replacement. The final total water storage change is obtained by subtracting the mean from 2003 to 2009. Please note that GRACE measures the storage anomaly at approximately monthly intervals, but it does not measure total quantity of water stored. GRACE storage anomalies were available monthly from April 2002 to June 2017 at the time of this analysis with a spatial resolution of 1°. Given the relatively low spatial resolution, Scalon et al.¹⁹ suggested to use GRACE only for watersheds which have areas greater than 100,000 km² (The area of the UCRB is approximately 280,000 km²). Uncertainty analysis for CSR RL06 is not available yet, however, uncertainty value suggested for the RL05 version is roughly 2 cm⁷³.

Four stations from the Community Collaborative Rain, Hail and Snow Network (CoCoRaHS) provide potential ET estimates. Additionally, the AmeriFlux station at Niwot Ridge, CO (US-NR1^74,75) provides latent heat observations (which can be translated directly to ET by dividing to a unit of latent heat of evaporation of water 2256 kJ/kg). While ET stations are scarce in the UCRB, because of the diversity in their locations and temporal coverage, we feel that those observations still play a crucial role in the simulation evaluation. First is the range of elevation and land cover that those stations represent: two high elevation stations located at Niwot Ridge, CO (3050 m) and Crested Butte, CO (2912 m), one moderate elevation station located at Carbondale, CO (1887 m) and two low altitude stations located at Grand Junction, CO (1428 m) and Castle Valley, UT (1464 m). With respect to land cover, stations at Carbondale, CO and Crested Butte, CO are located in evergreen forest; stations at Niwot Ridge, CO and Grand Junction, CO are located in deciduous forest; the station at Castle Valley, UT is located in shrubland. We compared ParFlow simulations with CoCoRaHs data from June 2012 to present and AmeriFlux data from 1999 to present, respectively.

In addition to ET estimated from stations, we also used remotely sensed ET from Simplified Energy Balance (SSEBop) MODIS product to compare with simulated ET. The SSEBop model provides daily and 1-km ET estimations for the whole UCRB from 2000 to the end of the validation period and has been shown to be reliable in various regions^76,77. Senay et al.⁷⁸ simulates ET in SSEBop by using pre-defined hot and cold boundary conditions. Each pixel is assigned a hot and cold boundary values based on maximum air temperature and differential temperature. Based on land surface temperature (K) obtained from MODIS images, ET fraction is computed and then multiplied with a short grass reference (mm.d⁻¹) and a scaling coefficient to produce final ET⁷⁹.

Lastly, ground temperature data from the National Oceanic and Atmospheric Administration (NOAA) Regional Climate Center (RCC) was used for comparison. The NOAA RCC data consist of observations compiled from the Global Historical Climatology Network (GHCN⁸⁰) database, and other federal and regional agencies. There are a total of 490 stations that monitor temperature. These stations are well distributed over the UCRB (green diamonds in Fig. 1).

Two of the remotely sensed products used for comparison, namely, MODIS-SCA and GRACE, are downscaled to match with the dataset’s spatial resolution of 1-km and geographic projection (specified at Table 1). Specifically, MODIS-SCA and GRACE data were downscaled from 5-km and 100-km, respectively, to 1-km using the Nearest Neighbor algorithm.

Table 1 Descriptive characteristics of the ParFlow output dataset.

Full size table

Evaluation metrics

For timeseries data, we primarily used two metrics to evaluate model performance, Spearman’s Rho and Total Absolute Relative Bias. As explained in Maxwell and Condon³⁸, plotting these two metrics against one another produces a figure that will concisely describe a model’s ability to reproduce appropriate timing and magnitude of flows. We hereafter refer to this type of figure as a Condon Diagram. Spearman’s Rho was used to assess the differences in the simulated and observed variables timing while the relative bias measures differences in their volumes. If simulations are closed to observation, we expect high Speaman’s Rho value and low relative bias value. Spearman’s Rho is computed as:

$$srho=1-\frac{6{\sum }_{i=1}^{n}{d}_{i}^{2}}{n({n}^{2}-1)}$$

(1)

where d_i is the difference in the independent ranking for the simulated and observed values at i time step, n is the number of values in each time series. The Total Absolute Relative Bias is calculated as:

$$bias=\frac{\left|{\sum }_{i=1}^{n}{S}_{i}-{\sum }_{i=1}^{n}{O}_{i}\right|}{{\sum }_{i=1}^{n}{O}_{i}}$$

(2)

where S and O are simulated and observed timeseries, respectively, and n is the number of values in each time series.

Additionally, we used the Kling-Gupta Efficiency (KGE^81,82) to evaluate the streamflow performance. The KGE coefficient is proposed by Gupta et al.⁸¹ to achieve a more balanced evaluation of simulated mean flow, flow variability and daily correlation than the traditional Nash-Sutcliffe efficiency (NSE⁸³)^84,85.

For spatial data, we used two categorical validation indices, namely, Probability of Detection and False Alarm Ratio:

$$POD=\frac{Hit}{Hit+Miss}$$

(3)

$$FAR=\frac{False}{Hit+False}$$

(4)

where Hit is grid where both simulated and observed events occurred; Miss is grid cell where the observed event occurred but the simulated one did not; False is a grid cell where the simulated event occurred but the observed one did not.

Data Records

The dataset is available to the public through an unrestricted data repository hosted by CyVerse⁸⁶. All the inputs for the simulation are included, namely, NLDAS (Table 2), related hydrologic and land-surface variables (Table 3) and a TCL script that contains the complete list of ParFlow input keys for the simulation.

Table 2 Hourly NLDAS inputs for the simulation.

Full size table

Table 3 Files that contain the model input parameters for the simulation.

Full size table

For each WY of simulation time, there are three output files per hourly timestep, namely, pressure, saturation and CLM output.

The hourly outputs shown in Table 4 and inputs shown in Table 2 were averaged into monthly variables for comparison to observations. Some additional quantities were calculated from ParFlow outputs (e.g. water table depth). These variables are listed in Table 5.

Table 4 Hourly ParFlow-CLM output variables and units.

Full size table

Table 5 Monthly variables derived from the ParFlow output/input datasets.

Full size table

ParFlow grid data is stored in a ParFlow binary file format (.pfb) which is written as BIG ENDIAN binary bit ordering. More information about ParFlow binary file format can be found in the ParFlow manual (https://github.com/parflow/parflow/blob/master/parflow-manual.pdf). Developed modules that read the ParFlow binary files can be found at https://github.com/parflow/parflow/tree/master/pftools/prepostproc and https://github.com/hydroframe/parflowio.

Technical Validation

In this section we compared simulated water and energy fluxes to a wide range of datasets for the UCRB from 1983 to 2019, including station measurements and modeled outputs to validate the fidelity of the simulation and prioritize for future model improvement. The available station measurements were streamflow, water table depth (WTD), snowpack, water storage and evapotranspiration (Fig. 1). The available modeled outputs were Snow Cover Area, Total Water Storage Anomalies, and evapotranspiration.

Streamflow comparisons

We compared simulated and observed monthly streamflow for eight representative stations within the UCRB (Fig. 2). Streamflow varied over two orders of magnitude with peak flow from around 2000 (ft³/s) at East River at Almont station to around 40,000 (ft³/s) at Colorado River near Cisco station and at Green River at Green River station. Simulated streamflow reflected accurately wet (1983–1985, 1993–1996, 2008–2010) and dry (1989–1992, 2002–2004, 2011–2013,2018) periods. In general, simulated flows matched observed ones well with average relative bias and Spearman’s Rho of 0.15 and 0.51, respectively.

The simulated and observed streamflow were in good agreement at Colorado River near Cisco station and at Green River at Green River station with low relative bias of 0.4 and 0.35 and high Spearman’s Rho of 0.52 and 0.73, respectively. Simulated flows at stations from the San Juan River had moderate bias between 1989 and 1992, but the bias reduced later. In the drought years between 1989 and 1992, we found NLDAS indicated more snow than one measured from SNOTEL and snow melt later than from SNOTEL, this snow component in NLDAS caused the moderate bias in streamflow simulation.

We found some discrepancies between streamflow simulations and observations in the station San Juan River near Bluff which could be attributed to anthropogenic activities. Model results were also compared to the Bureau of Reclamation natural flow dataset. Figure 3 shows examples of streamflow performances for two stations that are affected by upstream dams: San Juan River near Bluff and Colorado River at Lees Ferry. Overall, the ParFlow results shown in Fig. 3 are in good agreement with the naturalized flows. In the Colorado River at Lees Ferry station, the natural flow and ParFlow showed similar peaks and low streamflow bias across the comparison period. The mean monthly hydrograph comparisons highlighted a good correlation in seasonal cycles between the ParFlow results and the naturalized flows (Fig. 3). However, the ParFlow results were lower than the naturalized flows during summer and early winter in both stations (Fig. 3).

Streamflow performance using Spearman’s Rho and relative bias is shown in Fig. 4 for all stations inside the UCRB. Stations were considered to have a good shape (i.e. matching temporal pattern) when their Spearman’s Rho values are greater than 0.5 and bad shape when their Spearman’s Rho values were smaller than 0.5. Likewise, stations were considered as low bias if their relative biases were smaller than 1 and high bias if their relative biases were greater than 1. Hence, there were four types of streamflow performance: (1) stations with good shape and low bias – green stations; (2) stations with good shape and high bias – blue stations; (3) stations with bad shape and low bias – purple stations; (4) stations with bad shape and high bias – red stations.

Despite the complex terrain of the UCRB, most of the stations (94%) had relative bias lower than 1. The majority of stations (52%) had both low bias and good shape (i.e. relative bias smaller than 1 and Spearman’s Rho greater than 0.5), most of them located in the upstream part of rivers originating from the Rocky Mountains, namely, the Gunnison, Yampa and San Juan. Simulated flows matched with observed ones along the Green River (Figs. 2 and 4). Stations with bad shape generally fall into two categories: (1) impacted by anthropogenic activities (e.g. stations along the Colorado River); (2) located in relatively small streams (e.g. western tributaries of the Green River or tributaries of the San Juan River). The results agreed well with previous ParFlow streamflow evaluations of the UCRB^38,87.

Water table depth comparisons

A representative sample of wells is taken for comparison between observed and simulated WTD (Fig. 5). ParFlow-CLM accurately simulated both timing and magnitude for most of the wells, however there were some points of discrepancy.

Overall, both streambank and deep wells had good agreements between simulations and observations with the relative bias for 14 wells out of 24 wells was smaller than 0.3. Simulated WTDs at the Lower Gunnison River Basin (LGRB) matched close to observations in mean depth and temporal variability. Similarly, simulated streambank wells (e.g. at station 09205000 (left) and at station 09209400 (left and right) showed a good agreement with observed WTD. An exception to this close agreement was the well at the right bank of station 09205000 where simulated WTD showed much greater seasonal cycles than observed.

Deep wells with WTD of around 20 m such as 19–105-10bbb01 Rock Springs (near Rock Springs, WY) and (D-36-22)22daa-1 (near Blanding, UT), showed decreasing and increasing trends, respectively, over time and simulated WTD shows similar trends. With wells that are close to the maximum WTD of the domain (e.g. (D-33-24)30dab-1), simulated WTD did not reflect the observed multi-year cycle in these locations. This bias was likely caused by the model spatial resolution. Deeper wells are often located in mountainous areas where the topography is complex. ParFlow-CLM’s 1-km resolution may not be sufficient to capture this type of behavior. Please note that all wells were mapped to the closest grid cell center with no further adjustment.

We also compared the average simulated WTD (Fig. 6b) with the average USGS WTD dataset compiled from Fan et al. 2013 (Fig. 6a). Most of the wells had WTD differences of less than 1 m compared to the simulations, and some deep wells (around 40-m depth) in the Green and San Juan sub basins were 2–4 m deeper than the simulated. The observed dataset had more shallow water tables than simulated (Fig. 6c,d) and simulated WTD had more points which reach the maximum depth of 52-m (Fig. 6d). The output WTD was on average over one square kilometer (model grid spatial resolution). Most of the “max depth” points were in the mountainous areas where the average output WTD was often dominated by WTD that is close to 52-m. This often led to overestimation of the depth (i.e. WTDs are deeper than in observation wells). However, the WTD average for most of the well was consistent with the observations with nearly 2,650 wells (70%) having absolute differences smaller than 0.5 m. Lastly, the average of absolute difference in depth for all the wells was 1.042 m.

Snow covered extent comparison

We compared the simulated SCA to one obtained from the cloud-free MODIS product (Fig. 7a). Model outputs were upscaled and averaged to match with the cloud-free product’s spatial and temporal resolutions. Simulated SCA had consistent high Probability of Detection (average 0.82) throughout the period of 17 year (Fig. 7b). False Alarm Ratio was often high (around 0.4) during accumulating and melting months (i.e. October and April, respectively). During winter, simulated SCA accurately reflected the dynamic of SCA with almost 100% snow pixel captured and below 5% overestimated snow pixels. For typically dry years such as 2001, 2002 and 2016, ParFlow-CLM tended to produce slightly more SCA.

Figure 7c shows a snapshot of comparison between simulated and cloud-free MODIS SCA in a melting month of April 2008. Both SCAs agreed well in the Green and Gunnison river basins. Cloud-free MODIS SCA indicated that snow had already melted in various places along the Colorado River and Northern Arizona (lower left of the UCRB) while it was contrasting in the simulated SCA. Lastly, the model missed some snowpack in Northern New Mexico (lower right of the UCRB). A detailed discussion regarding sources of bias in snow simulation is covered in the next section.

Snow water equivalent comparisons

All SWE stations were mapped to the closest grid cell center with no adjustments made to scale point value to grid block or to adjust for differences between the station’s elevation and the mean grid cell elevations. Comparison results for six representative stations from Green, Gunnison and San Juan basins are shown in Fig. 8. While Rock Creek and Strawberry Divide stations are in Green basin, Summit Ranch and Butte stations are in Gunnison basin, Cascade and Mineral Creek are in San Juan basin (Fig. 9). Elevations of these stations ranges from the lowest of 2403 m (Rock Creek) to the highest of 3100 m (Butte). In general, simulated and observed SWE agreed on peak timing and magnitude trend over the period. Simulated SWE accurately reflected drought periods (i.e. 1988–1990, 2002–2004, 2012–2014 and 2018) and wet periods (1983–1985, 1993–1995 and 2008–2010).

However, before 2003, simulated SWE was systematically lower than observed in all stations. This dry bias was worst in the Strawberry Divide station, where simulated peak monthly SWE was often 200 mm lower than the observed. A potential source of bias was directly linked to the NLDAS dataset. Earlier version of NLDAS used in this simulation from WY 1983 to WY 2002 has been shown to have SWE bias from: (1) dry bias in annual precipitation, (2) air temperatures are systematically lower in winter and higher in spring, (3) coarse spatial resolution of 1/8° ^49,50. For simulation from WY 2003, we used a newer version of NLDAS which was greatly enhanced in precipitation and temperature estimations⁸⁸. The dry SWE bias was much alleviated overall although the bias persisted during some drought years of 2012 and 2013.

Figure 9 shows an interesting contradiction. While simulated maximum annual SWE was lower than observed SWE for most of the stations (Fig. 9a), when doing monthly average by year, ParFlow-CLM simulated earlier snow accumulation and later snow melt than observations (Fig. 9b). In the Green basin, maximum annual SWEs were often lower by 100 m than observations. In contrast, basins in higher altitude such as Gunnison and San Juan had both underestimated (smaller by 200 mm) and overestimated (greater by 100 mm) stations.

Beside bias in forcing, there always existed an inconsistency when comparing point measurements with the model 1-km grid. The model grid resolution could smooth out the dynamic changes of snowpack in high altitude regions. Figure 9b shows a reverse pattern of fraction of station with snow between SNOTEL and ParFlow-CLM that Maxwell and Condon (2016) found in water year 1985. The reverse pattern could be a result of replacing forcing inputs in the second half of the decadal simulation.

GRACE terrestrial water storage comparison

Changes in water storage from GRACE were compared with the ParFlow-CLM simulation from 2002 and 2017 (Fig. 10). Water storage change from GRACE was obtained by subtracting the water storage mean from 2004 to 2009. We also did the same process with outputs from ParFlow-CLM to ensure the consistency in comparison. Thus, we compared between the products basin-wide.

Two products agreed well from 2003 to 2011. Drought years of 2003 and 2004 were accurately reflected as well as wet years of 2005, 2008 and 2011. During periods of data discrepancy from GRACE (i.e. 2002 and from 2011 to 2017), GRACE showed much lower water storage anomaly than ParFlow-CLM.

ET comparisons

Direct observations of ET and/or latent heat were from two sources: (1) AmeriFlux network (one station, data from 1999) and (2) COCORAHS (four stations, data from 2012). Stations from COCORAHS measure potential evaporation. Measuring potential ET rather than actual ET by COCORAHS explained the systematic underestimation of simulated ET in months of June and July for Grand Junction and Crested Butte and in months of May, June and July for Castle Valley, as potential ET would be greater than actual ET. For other summer months, ParFlow-CLM simulated accurately with observations (Fig. 11).

The Niwot Ridge Forest station from the AmeriFlux network is located at 3050 m elevation in subalpine forest. Flux magnitude and trend from both simulated and observed ET were overall matching (Fig. 11). Niwot Ridge station showed small amounts of evaporation (around 1.5 mm) during winters of 1999, 2000, 2007, 2009 and 2014 while ParFlow-CLM did not show winter ET values.

When comparing with a remotely sensed product (SSEBop) over the entire basin, the simulated ET had lower ET peak than SSEBop in the Green, Gunnison and ultimately Upper Colorado River basins (Fig. 12). In the San Juan basin, the two ET estimates agreed well over the period. The SSEBop’s estimation approach is fundamentally different from the physical based modelling of ParFlow-CLM, and thus requires detailed comparisons of model specifications in order to examine the difference in ET.

Land surface temperature comparison

Simulated and observed ground temperature matched closely throughout the study period. Out of 490 NOAA RCC stations, 263 stations had Spearman’s Rho value greater than 0.95, 486 stations had relative bias smaller than 0.05. Figure 13 shows comparison for six representative stations for the Green, Gunnison and San Juan basins (Vernal and Steamboat Springs in Green, Dillon and Rifle Garfield in Gunnison, Ignacio and Teec Nos Pos in San Juan). We can see great matches between simulations and observations in different terrains and temperature ranges.

Comparison summary

We compared seven variables output from the decadal simulation, (1) streamflow, (2) water table depth, (3) snow cover area, (4) snow water equivalent, (5) actual evapotranspiration, (6) total storage water anomalies and (7) ground temperature, to observations and data products. In general, this comparison demonstrates good agreements between model and observations and data products.

A summary of comparisons is given in Tables 6, 7. The average relative bias for streamflow, WTD, SWE, ET and temperature was 0.123. For this pre-development simulation, more than half of the stream gauges (53%) had good timing correlation (Spearman’s Rho > 0.5) between simulated and observed streamflow. This simulation accurately captured the mean WTD with 70% of the wells having absolute difference in mean WTD smaller than 0.5 m. Other land surface variables such as: SWE, ET, TWSA and ground temperature also showed a good match to observations and had an average relative bias of 0.078 and an average Spearman’s Rho of 0.838.

Table 6 Comparison summary for point observations.

Full size table

Table 7 Comparison summary for remotely sensed products.

Full size table

Sources of bias in simulated streamflow and WTD were due to four reasons: (1) lateral and vertical resolutions; (2) water management operations; (3) bias in meteorological forcing data; (4) uncertainties in subsurface properties.

Usage Notes

In the CyVerse repository, beside ParFlow output files, we also provided necessary input files to reproduce the simulation. Specifically, we included (1) NLDAS data; (2) ParFlow and CLM parameter files (e.g. vegetation parameter file, subsurface indicator file and topographic information files); (3) A tcl script for running the simulation. More information can be found in the repository readme file.

Interested users are encouraged to use the newly developed parflowio tool (https://github.com/hydroframe/parflowio) to work with.pfb format files.

While it was shown that anthropogenic activities reflected in USGS streamflow and WTD still contributed to the comparison bias, the results are encouraging with most of the gauges and wells having low relative bias and high Spearman’s Rho scores. As Maxwell and Condon³⁸ indicated that the ParFlow simulation platform is evolving with more refined surface⁸⁹ and subsurface and more anthropogenic activities coupled, the model formulation will be improved.

The dataset produced in this study is useful in hydrological studies due to its high spatial and temporal resolutions and validated accuracy. Moreover, having more consistent data about the groundwater dynamic, one can study its impact on the full water cycle of the UCRB and possibly other river basins. We hope the dataset will be used for a wide range of stakeholders from decision makers to ecologic scientists.

Code availability

These simulations were conducted using ParFlow version 3.6.0 (https://github.com/parflow/parflow/tree/v3.6.0/). The data processing step was done using Python3.5 programming language with necessary toolboxes including NumPy (https://numpy.org/), the Geospatial Data Abstraction Library (GDAL; https://gdal.org/) and the Python Data Analysis Library (PANDAS; https://pandas.pydata.org/).

References

Christensen, N. S., Wood, A. W., Voisin, N., Lettenmaier, D. P. & Palmer, R. N. The effects of climate change on the hydrology and water resources of the Colorado River basin. Clim. Change 62, 337–363 (2004).
Article Google Scholar
Christensen, N. S. & Lettenmaier, D. P. A multimodel ensemble approach to assessment of climate change impacts on the hydrology and water resources of the Colorado River Basin. Hydrol. Earth Syst. Sci. 11, 1417–1434 (2007).
Article ADS Google Scholar
McCabe, G. J. & Wolock, D. M. Warming may create substantial water supply shortages in the Colorado River basin. Geophys. Res. Lett. 34 (2007).
Milly, P. C. D. & Dunne, K. A. Colorado River flow dwindles as warming-driven loss of reflective snow energizes evaporation. Science (80-.). 367, 1252–1255 (2020).
Article ADS CAS Google Scholar
Ficklin, D. L., Stewart, I. T. & Maurer, E. P. Climate change impacts on streamflow and subbasin-scale hydrology in the Upper Colorado River Basin. PLoS One 8 (2013).
Overpeck, J. T. & Udall, B. Climate change and the aridification of North America. Proceedings of the National Academy of Sciences of the United States of America. 117, 11856–11858 (2020).
Article CAS PubMed PubMed Central Google Scholar
National Research Council. Colorado River Ecology and Dam Management: Proceedings of a Symposium, May 24–25, 1990 Santa Fe, New Mexico. (1991).
Belnap, J. & Campbell, D. Effects of climate change and land use on water resources in the Upper Colorado River Basin. https://doi.org/10.3133/fs20103123 (2011).
Andersen, D. C., Cooper, D. J. & Northcott, K. Dams, floodplain land use, and riparian forest conservation in the semiarid Upper Colorado River Basin, USA. Environ. Manage. 40, 453–475 (2007).
Article ADS PubMed Google Scholar
DeWine, J. M. & Cooper, D. J. Effects of river regulation on riparian box elder (Acer negundo) forests in canyons of the upper Colorado River Basin, USA. Wetlands 27, 278–289 (2007).
Article Google Scholar
Woodhouse, C. A., Gray, S. T. & Meko, D. M. Updated streamflow reconstructions for the Upper Colorado River Basin. Water Resour. Res. 42 (2006).
Hall, D. K., Riggs, G. A., Salomonson, V. V., DiGirolamo, N. E. & Bayr, K. J. MODIS snow-cover products. Remote Sens. Environ. 83, 181–194 (2002).
Article ADS Google Scholar
Maurer, E. P., Rhoads, J. D., Dubayah, R. O. & Lettenmaier, D. P. Evaluation of the snow-covered area data product from MODIS. Hydrol. Process. 17, 59–71 (2003).
Article ADS Google Scholar
Policelli, F. et al. The NASA Global Flood Mapping System. in 47–63, https://doi.org/10.1007/978-3-319-43744-6_3 (2017).
Tran, H. et al. Improving hydrologic modeling using cloud-free modis flood maps. J. Hydrometeorol. 20, 2203–2214 (2019).
Article ADS Google Scholar
Gu, Y. et al. Evaluation of MODIS NDVI and NDWI for vegetation drought monitoring using Oklahoma Mesonet soil moisture data. Geophys. Res. Lett. 35 (2008).
Caccamo, G., Chisholm, L. A., Bradstock, R. A. & Puotinen, M. L. Assessing the sensitivity of MODIS to monitor drought in high biomass ecosystems. Remote Sens. Environ. 115, 2626–2639 (2011).
Article ADS Google Scholar
Rodell, M. et al. Emerging trends in global freshwater availability. Nature 557, 651–659 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Scanlon, B. R. et al. Global evaluation of new GRACE mascon products for hydrologic applications. Water Resour. Res. 52, 9412–9429 (2016).
Article ADS Google Scholar
Koster, R. D. & Suarez, M. J. Modeling the land surface boundary in climate models as a composite of independent vegetation stands. J. Geophys. Res. 97, 2697–2715 (1992).
Article ADS Google Scholar
Koster, R. D. & Suarez, M. J. The components of a ‘SVAT’ scheme and their effects on a GCM’s hydrological cycle. Adv. Water Resour. 17, 61–78 (1994).
Article ADS Google Scholar
Chen, F. et al. Modeling of land surface evaporation by four schemes and comparison with FIFE observations. J. Geophys. Res. Atmos. 101, 7251–7268 (1996).
Article ADS Google Scholar
Koren, V. et al. A parameterization of snowpack and frozen ground intended for NCEP weather and climate models. J. Geophys. Res. Atmos. 104, 19569–19585 (1999).
Article ADS Google Scholar
Burnash, R., Ferral, R. & McGuire, R. A generalized streamflow simulation system - Conceptual modeling for digital computers. (U.S. Department of Commerce, 1973).
Anderson, E. A. National Weather Service River Forecast System - Snow Accumulation and Ablation Model. NOAA Technical Memorandum NWS HYDRO-17. NOAA Technical Memorandum NWS HYDRO-17, 1–87 (1973).
Liang, X., Lettenmaier, D. P., Wood, E. F. & Burges, S. J. A simple hydrologically based model of land surface water and energy fluxes for general circulation models. J. Geophys. Res. 99 (1994).
Wood, E. F., Lettenmaier, D., Liang, X., Nijssen, B. & Wetzel, S. W. Hydrological modeling of continental-scale basins. Annu. Rev. Earth Planet. Sci. 25, 279–300 (1997).
Article ADS CAS Google Scholar
Painter, T. H., Skiles, S. M. K., Deems, J. S., Bryant, A. C. & Landry, C. C. Dust radiative forcing in snow of the Upper Colorado River Basin: 1. A 6 year record of energy balance, radiation, and dust concentrations. Water Resour. Res. 48 (2012).
Liu, Y., Peters-Lidard, C. D., Kumar, S. V., Arsenault, K. R. & Mocko, D. M. Blending satellite-based snow depth products with in situ observations for streamflow predictions in the Upper Colorado River Basin. Water Resour. Res. 51, 1182–1202 (2015).
Article ADS Google Scholar
Maxwell, R. M. & Miller, N. L. Development of a coupled land surface and groundwater model. J. Hydrometeorol. 6, 233–247 (2005).
Article ADS Google Scholar
Kollet, S. J. & Maxwell, R. M. Integrated surface-groundwater flow modeling: A free-surface overland flow boundary condition in a parallel groundwater flow model. Adv. Water Resour. 29, 945–958 (2006).
Article ADS Google Scholar
Maxwell, R. M. A terrain-following grid transform and preconditioner for parallel, large-scale, integrated hydrologic modeling. Adv. Water Resour. 53, 109–117 (2013).
Article ADS Google Scholar
Richards, L. A. Capillary conduction of liquids through porous mediums. J. Appl. Phys. 1, 318–333 (1931).
ADS MATH Google Scholar
Kollet, S. J. & Maxwell, R. M. Capturing the influence of groundwater dynamics on land surface processes using an integrated, distributed watershed model. Water Resour. Res. 44 (2008).
Kollet, S. J. et al. The Influence of Rain Sensible Heat and Subsurface Energy Transport on the Energy Balance at the Land Surface. Vadose Zo. J. 8, 846–857 (2009).
Article Google Scholar
Maxwell, R. M., Condon, L. E. & Kollet, S. J. A high-resolution simulation of groundwater and surface water over most of the continental US with the integrated hydrologic model ParFlow v3. Geosci. Model Dev. 8, 923–937 (2015).
Article ADS Google Scholar
Jefferson, J. L. & Maxwell, R. M. Evaluation of simple to complex parameterizations of bare ground evaporation. J. Adv. Model. Earth Syst. 7, 1075–1092 (2015).
Article ADS Google Scholar
Maxwell, R. M. & Condon, L. E. Connections between groundwater flow and transpiration partitioning. Science (80-.). 353, 377–380 (2016).
Article ADS CAS Google Scholar
Kuffour, B. N. O. et al. Simulating coupled surface-subsurface flows with ParFlow v3.5.0: Capabilities, applications, and ongoing development of an open-source, massively parallel, integrated hydrologic model. Geosci. Model Dev. 13, 1373–1397 (2020).
Article ADS Google Scholar
Horton, R. E. The Rôle of infiltration in the hydrologic cycle. Eos, Trans. Am. Geophys. Union 14, 446–460 (1933).
Article Google Scholar
Dunne, T. Relation of field studies and modeling in the prediction of storm runoff. J. Hydrol. 65, 25–48 (1983).
Article ADS Google Scholar
Ryken, A., Bearup, L. A., Jefferson, J. L., Constantine, P. & Maxwell, R. M. Sensitivity and model reduction of simulated snow processes: Contrasting observational and parameter uncertainty to improve prediction. Adv. Water Resour. 135 (2020).
Dai, Y. et al. The common land model. Bull. Am. Meteorol. Soc. 84, 1013–1023 (2003).
Article ADS Google Scholar
Condon, L. E. & Maxwell, R. M. Systematic shifts in Budyko relationships caused by groundwater storage changes. Hydrol. Earth Syst. Sci. 21, 1117–1135 (2017).
Article ADS Google Scholar
Xia, Y. et al. Continental-scale water and energy flux analysis and validation for the North American Land Data Assimilation System project phase 2 (NLDAS-2): 1. Intercomparison and application of model products. J. Geophys. Res. Atmos. 117 (2012).
Lohmann, D. et al. Streamflow and water balance intercomparisons of four land surface models in the North American Land Data Assimilation System project. J. Geophys. Res. Atmos. 109 (2004).
Xia, Y., Ek, M. B., Wu, Y., Ford, T. & Quiring, S. M. Comparison of NLDAS-2 simulated and NASMD observed daily soil moisture. Part I: Comparison and analysis. J. Hydrometeorol. 16, 1962–1980 (2015).
Article ADS Google Scholar
Zhuo, L., Han, D., Dai, Q., Islam, T. & Srivastava, P. K. Appraisal of NLDAS-2 multi-model simulated soil moistures for hydrological modelling. Water Resour. Manag. 29, 3503–3517 (2015).
Article Google Scholar
Sheffield, J. et al. Snow process modeling in the North American Land Data Assimilation System (NLDAS): 1. Evaluation of model-simulated snow cover extent. J. Geophys. Res. Atmos. 108 (2003).
Pan, M. et al. Snow process modeling in the North American Land Data Assimilation System (NLDAS): 1. Evaluation of model simulated snow water equivalent. J. Geophys. Res. D Atmos. 108 (2003).
Cosgrove, B. A. et al. Real-time and retrospective forcing in the North American Land Data Assimilation System (NLDAS) project. J. Geophys. Res. Atmos. 108 (2003).
Mitchell, K. E. et al. The multi-institution North American Land Data Assimilation System (NLDAS): Utilizing multiple GCIP products and partners in a continental distributed hydrological modeling system. J. Geophys. Res. Atmos. 109 (2004).
Xia, Y. et al. Continental-scale water and energy flux analysis and validation for North American Land Data Assimilation System project phase 2 (NLDAS-2): 2. Validation of model-simulated streamflow. J. Geophys. Res. Atmos. 117 (2012).
Condon, L. E. & Maxwell, R. M. Modified priority flood and global slope enforcement algorithm for topographic processing in physically based hydrologic modeling applications. Comput. Geosci. 126, 73–83 (2019).
Article ADS Google Scholar
Gleeson, T. et al. Mapping permeability over the surface of the Earth. Geophys. Res. Lett. 38 (2011).
Schaap, M. G. & Leij, F. J. Database-related accuracy and uncertainty of pedotransfer functions. Soil Sci. 163, 765–779 (1998).
Article ADS CAS Google Scholar
Condon, L. E. & Maxwell, R. M. Implementation of a linear optimization water allocation algorithm into a fully integrated physical hydrology model. Adv. Water Resour. 60, 135–147 (2013).
Article ADS Google Scholar
Maurer, E. P., Wood, A. W., Adam, J. C., Lettenmaier, D. P. & Nijssen, B. A long-term hydrologically based dataset of land surface fluxes and states for the conterminous United States. J. Clim. 15, 3237–3251 (2002).
Article ADS Google Scholar
Hao, X., Chen, Y., Xu, C. & Li, W. Impacts of climate change and human activities on the surface runoff in the Tarim River Basin over the last fifty years. Water Resour. Manag. 22, 1159–1171 (2008).
Article Google Scholar
Zhang, A. et al. Assessments of Impacts of Climate Change and Human Activities on Runoff with SWAT for the Huifa River Basin, Northeast China. Water Resour. Manag. 26, 2199–2217 (2012).
Article ADS Google Scholar
BUREAU OF RECLAMATION. Colorado River Basin Consumptive Uses and Losses Report. (2004).
BUREAU OF RECLAMATION. PROVISIONAL Upper Colorado River Basin Consumptive Uses and Losses Report. (2007).
Bureau of Reclamation. Upper Colorado River Basin Consumptive Uses and Losses Report 2011–2015. https://www.usbr.gov/uc/DocLibrary/Reports/ConsumptiveUsesLosses/20191000-ProvisionalUpperColoradoRiverBasin2011–2015-CULReport-508-UCRO.pdf (2015).
Clayton, R. Upper Colorado River Consumptive Use Determination at CRSS Natural Flow Node Locations Calendar Years 1971–1995 Methodology Peer Review. (2004).
Prairie, J. & Callejo, R. Natural Flow and Salt Computation Methods, Calendar Years 1971–1995. (2005).
Prairie, J. R., Rajagopalan, B., Fulp, T. J. & Zagona, E. A. Statistical Nonparametric Model for Natural Salt Estimation. J. Environ. Eng. 131, 130–138 (2005).
Article CAS Google Scholar
Blythe, T. L. & Schmidt, J. C. Estimating the Natural Flow Regime of Rivers With Long-Standing Development: The Northern Branch of the Rio Grande. Water Resour. Res. 54, 1212–1236 (2018).
Article ADS Google Scholar
Rajagopalan, B. et al. Water supply risk on the Colorado River: Can management mitigate? Water Resour. Res. 45 (2009).
Xiao, M., Udall, B. & Lettenmaier, D. P. On the Causes of Declining Colorado River Streamflows. Water Resour. Res. 54, 6739–6756 (2018).
Article ADS Google Scholar
Fan, Y., Li, H. & Miguez-Macho, G. Global patterns of groundwater table depth. Science (80-.). 339, 940–943 (2013).
Article ADS CAS Google Scholar
Tran, H. et al. A cloud-free modis snow cover dataset for the contiguous United States from 2000 to 2017. Sci. Data 6, 1–13 (2019).
Article Google Scholar
Hall, D. K. & Riggs, G. A. MODIS/Terra Snow Cover Daily L3 Global 500m SIN Grid, Version 6 | National Snow and Ice Data Center. https://nsidc.org/data/MOD10C1/versions/6 (2016).
Save, H., Bettadpur, S. & Tapley, B. D. High-resolution CSR GRACE RL05 mascons. J. Geophys. Res. Solid Earth 121, 7547–7569 (2016).
Article ADS Google Scholar
Burns, S. P. et al. The Niwot Ridge Subalpine Forest US-NR1 AmeriFlux site – Part 1: Data acquisition and site record-keeping. Geosci. Instrumentation, Methods Data Syst. 5, 451–471 (2016).
Article ADS Google Scholar
Monson, R. K. et al. Carbon sequestration in a high-elevation, subalpine forest. Glob. Chang. Biol. 8, 459–478 (2002).
Article ADS Google Scholar
Sharma, D. N. & Tare, V. Evapotranspiration estimation using ssebop method with sentinel -2 and landsat-8 data set. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. XLII–5, 563–566 (2018).
Article Google Scholar
Dias Lopes, J., Neiva Rodrigues, L., Acioli Imbuzeiro, H. M. & Falco Pruski, F. Performance of SSEBop model for estimating wheat actual evapotranspiration in the Brazilian Savannah region. Int. J. Remote Sens. 40, 6930–6947 (2019).
Article Google Scholar
Senay, G. B. et al. Operational Evapotranspiration Mapping Using Remote Sensing and Weather Datasets: A New Parameterization for the SSEB Approach. J. Am. Water Resour. Assoc. 49, 577–591 (2013).
Article ADS Google Scholar
Senay, G. B. Satellite Psychrometric Formulation of the Operational Simplified Surface Energy Balance (SSEBop) Model for Quantifying and Mapping Evapotranspiration. Appl. Eng. Agric. 34, 555–566 (2018).
Article Google Scholar
Menne, M. J., Durre, I., Vose, R. S., Gleason, B. E. & Houston, T. G. An overview of the global historical climatology network-daily database. J. Atmos. Ocean. Technol. 29, 897–910 (2012).
Article ADS Google Scholar
Gupta, H. V., Kling, H., Yilmaz, K. K. & Martinez, G. F. Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J. Hydrol. 377, 80–91 (2009).
Article ADS Google Scholar
Kling, H., Fuchs, M. & Paulin, M. Runoff conditions in the upper Danube basin under an ensemble of climate change scenarios. J. Hydrol. 424–425, 264–277 (2012).
Article Google Scholar
Nash, J. E. E. & Sutcliffe, J. V. V. River Flow Forecasting Through Conceptual Models- Part {I}- A Discussion Of Principles. J. Hydrol. 10, 282–290 (1970).
Article ADS Google Scholar
Santos, L., Thirel, G. & Perrin, C. Technical note: Pitfalls in using log-transformed flows within the KGE criterion. Hydrol. Earth Syst. Sci. 22, 4583–4591 (2018).
Article ADS Google Scholar
Mizukami, N. et al. On the choice of calibration metrics for ‘high-flow’ estimation using hydrologic models. Hydrol. Earth Syst. Sci. 23, 2601–2614 (2019).
Article ADS Google Scholar
Tran, H. et al. A full hydrological dataset suite for the Upper Colorado River Basin from 1983 to 2019. CyVerse https://doi.org/10.25739/nv2q-ct31 (2020).
Tran, H., Zhang, J., Cohard, J. M., Condon, L. E. & Maxwell, R. M. Simulating Groundwater-Streamflow Connections in the Upper Colorado River Basin. Groundwater 58, 392–405 (2020).
Article CAS Google Scholar
Sheffield, J., Goteti, G. & Wood, E. F. Development of a 50-year high-resolution global dataset of meteorological forcings for land surface modeling. J. Clim. 19, 3088–3111 (2006).
Article ADS Google Scholar
Zhang, J., Condon, L. E., Tran, H. & Maxwell, R. M. A national topographic dataset for hydrological modeling over the contiguous United States. Earth Syst. Sci. Data 13, 3263–3279 (2021).
Article ADS Google Scholar

Download references

Acknowledgements

High-performance computing for ParFlow-CLM simulations were provided by the National Center for Atmospheric Research Computational and Information Systems Laboratory. This work was supported by the US National Science CyberInfrastructure project, HydroFrame (NSF-OAC 1835903) and the IDEAS-Watersheds (DOE Office of Science) project.

Author information

Authors and Affiliations

Department of Civil and Environmental Engineering, Princeton University, Princeton, NJ, USA
Hoang Tran & Reed M. Maxwell
Department of Hydrology and Atmospheric Sciences, The University of Arizona, Tucson, AZ, USA
Jun Zhang & Laura E. Condon
NASA/GSFC, Greenbelt, USA
Mary Michael O’Neill
Department of Geology and Geological Engineering, Colorado School of Mines, Golden, CO, USA
Anna Ryken
High Meadows Environmental Institute, Princeton University, Princeton, NJ, USA
Reed M. Maxwell

Authors

Hoang Tran
View author publications
You can also search for this author in PubMed Google Scholar
Jun Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Mary Michael O’Neill
View author publications
You can also search for this author in PubMed Google Scholar
Anna Ryken
View author publications
You can also search for this author in PubMed Google Scholar
Laura E. Condon
View author publications
You can also search for this author in PubMed Google Scholar
Reed M. Maxwell
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.T. developed the simulation and its outputs, managed the validation process and drafted the manuscript. L.E.C. and R.M.M. advised the project, reviewed the manuscript and provided valuable feedback and suggestions. J.Z., M.M.O. and A.R. audited data quality and contributed to the writing of the manuscript.

Corresponding author

Correspondence to Hoang Tran.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Tran, H., Zhang, J., O’Neill, M.M. et al. A hydrological simulation dataset of the Upper Colorado River Basin from 1983 to 2019. Sci Data 9, 16 (2022). https://doi.org/10.1038/s41597-022-01123-w

Download citation

Received: 06 January 2021
Accepted: 10 December 2021
Published: 20 January 2022
DOI: https://doi.org/10.1038/s41597-022-01123-w

Subjects

Abstract

Similar content being viewed by others

Multi-model Hydroclimate Projections for the Alabama-Coosa-Tallapoosa River Basin in the Southeastern United States

A comprehensive, multisource database for hydrometeorological modeling of 14,425 North American watersheds

The floodplain inundation history of the Murray-Darling Basin through two-monthly maximum water depth maps

Background & Summary

Methods

Site

ParFlow-CLM

Input datasets

Model spinup

Simulation from 1983 to 2019

Observation datasets used for comparison

Evaluation metrics

Data Records

Technical Validation

Streamflow comparisons

Water table depth comparisons

Snow covered extent comparison

Snow water equivalent comparisons

GRACE terrestrial water storage comparison

ET comparisons

Land surface temperature comparison

Comparison summary

Usage Notes

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links