Abstract
This paper describes a multi-site and multi-decadal dataset of artificially drained agricultural fields in seven Midwest states and North Carolina, USA. Thirty-nine research sites provided data on three conservation practices for cropland with subsurface tile drainage: saturated buffers, controlled drainage, and drainage water recycling. These practices utilize vegetation and/or infrastructure to minimize off-site nutrient losses and retain water in the landscape. A total of 219 variables are reported, including 90 field measurement variables and 129 management operations and metadata. Key measurements include subsurface drain flow (206 site-years), nitrate-N load (154 site-years) and other water quality metrics, as well as agronomic, soil, climate, farm management and metadata records. Data are published at the USDA National Agricultural Library Ag Data Commons repository and are also available through an interactive website at Iowa State University. These multi-disciplinary data have large reuse potential by the scientific community as well as for design of drainage systems and implementation in the US and globally.
Measurement(s) | subsurface drain flow • Nitrate-N concentration • water table • soil nitrate concentration • precipitation • grain yield |
Technology Type(s) | pressure transducers within drainage control structure/outlet • automated flow-proportional sampling • pressure transducers installed inside deep wells of perforated PVC pipes • dry combustion • on-site weather station • mechanical harvest with combine |
Factor Type(s) | controlled drianage • drainage water recylcing • saturated buffers |
Sample Characteristic - Organism | Zea mays • Glycine max |
Sample Characteristic - Environment | agricultural field |
Sample Characteristic - Location | United States |
Similar content being viewed by others
Background & Summary
Subsurface drainage is used to remove excess water from the landscape for agronomic fieldwork and to ensure suitable crop growth conditions. However, subsurface drainage in the US Midwest provides a pathway for nitrogen and phosphorus loss into waterways, a major concern in nutrient impacted areas such as the Gulf of Mexico. A comprehensive management approach is increasingly needed given the expected increase in artificially drainage due to climate change1.
Conservation practices for drained agricultural landscapes are being developed to mitigate nutrient losses from cropland in the Midwest and other highly drained regions. Among these conservation practices, some have the potential to store or redirect water in the landscape and are promising for increasing the resiliency of cropping systems to climate change.
-
Controlled drainage (CD), also known as drainage water management, holds water in the field when drainage is not needed using a water control structure to raise the height of water level outflow2,3.
-
Saturated buffers (SB) divert a fraction of the nitrate-laden drain flow through riparian buffers as shallow groundwater. The nitrate becomes available for plant uptake and denitrification, thus reducing nitrate loads to surface water4,5.
-
Drainage water recycling (DWR) stores subsurface drainage water in on-farm ponds or reservoirs until it can be applied through irrigation. This reduces water and nutrient outflows while providing supplemental irrigation6,7,8.
The USDA National Institute of Food and Agriculture (NIFA) funded the “Transforming Drainage” project in 2015 to advance and evaluate these drainage management practices’ productivity and environmental tradeoffs. The multi-state, multidisciplinary team conducted research at 24 sites and gathered historical data (not previously available to the public) from collections starting in 1996 from 15 drainage research sites. The resulting dataset provides a resource for understanding subsurface drainage and the effects of drainage management practices in row crop agriculture. Here we present the data, which have been published in the USDA National Agriculture Library Ag Data Commons repository at data.nal.usda.gov9 and are also available at drainagedata.org10.
The Transforming Drainage data are unique given the availability of daily, plot-level water quantity and quality measurements paired with agronomic, soil and on-site weather data for artificially drained landscapes. Data allow the study of drainage management practices on water quantity and quality through experimental designs appropriate for each practice. A control-treatment pair design was used for the controlled drainage and drainage water recycling sites (a control plot with uncontrolled drainage and treatment plot employing the drainage management practice) and an upgradient/downgradient design for the saturated buffer sites. Treatments included CD (19 sites), DWR (7 sites), SB (8 sites) and others such as a wetland and undrained reference fields (5 sites) (Fig. 1). The length of data records at each site are 2–17 years, with nine sites including more than 10 years of data.
The 39 experimental sites were located across eight states: Iowa (9), Indiana (2), Ohio (11), Minnesota (9), Missouri (5), North Carolina (1), North Dakota (1), and South Dakota (1). Of the 219 data variables available, key variables (reported in site-years) included tile drainage (206), nitrate concentration (183), water table (92), on-site precipitation (227) and crop yield (201). The data were collected using variable methodologies due to differences in original research goals that dictated the experimental design of each study. Method descriptions are available for each site to improve user understanding of the data.
The data contribute to a more comprehensive understanding of hydrologic, environmental, and agronomic processes in U.S. drained agriculture and globally. Scientists, modellers, and policymakers can gain insights into factors and key drivers (e.g., precipitation patterns, soils, drainage characteristics and cropping systems) behind the agronomic and environmental performance of these drainage practices. Data will be useful for calibrating or validating hydrologic or agroecosystem models. Furthermore, data may be coupled with projections from regional climate models to extrapolate the observed results to climate regimes not included within the study period.
Methods
Experimental sites
Experimental designs varied across the 39 research sites with plot size ranging from 0.04 ha to 80 ha. The size of the plot drainage areas varied accordingly from 0.02 to 56 ha. The number of site-years of available data ranging from 2 to 17 with a mean of 7 years. There were diverse soil types, five soil textural classes and soil organic carbon ranging from 0.1% to 3.7%. Corn (Zea mays) and soybean (Glycine max) were the predominant crops grown, but 23 site-years had popcorn (Zea mays everta), wheat (Triticum aestivum), forage, oats (Avena sativa), or sugar beets (Beta vulgaris).
CD was practised at the greatest number (19) of sites (Fig. 2) across seven states in the Midwest and North Carolina. The research sites extended from 35.8° to 46.4° N and 76.7° to 96.9° W. The majority of sites (30) were on private farm (cooperator) fields through a lease or collaborative arrangement, with the remaining 9sites on university-owned and managed research farms. The USDA soil drainage class for the dominant soil type at each site ranged from somewhat poorly drained to very poorly drained11. The subsurface drainage of all sites consisted of 102 mm-diameter perforated corrugated tubing except MN_Clay sites (76 mm diameter tubing) and included both CD and free drainage (FD) treatments. Tile depth ranged from 0.61 m to 1.22 m, and tile spacing varied from 6 m to 36 m with median 13.7 m. All sites had similar drain spacings across treatments except IA_di4 and IA_Washington. These two sites varied tile spacing and/or tile depth. IA_di4 tile spacing differed with 27 m and 36 m for FD and CD plots, respectively. While at IA_Washington, tile spacing was 12 m in the shallower drainage treatment compared to 18 m spacing in the conventional drainage treatment. Seven sites had replicated drainage treatments with an average drainage area of 1.1 ha. Sites that did not include replications were larger farm fields with an average drainage area of 10.5 ha, except one university research field with a drainage area of 1.8 ha.
DWR research was conducted at seven sites across the Midwest. Individual research site locations ranged from 39° to 46° N and 83° to 96° W. The treatments at the sites included DWR utilizing controlled drainage with sub-irrigation or controlled drainage with on-surface drip irrigation. In addition, there was a comparison treatment of FD with no irrigation. The three Ohio sites included wetland monitoring in addition to drainage water recycling as part of the Wetland Reservoir Subirrigation System (WRSIS) project12.
Eight SB sites were monitored as part of this project, seven in Iowa and one in Minnesota. One of the Iowa sites included the first SB installed in the US4. Five sites categorized as ‘Other’ included monitored drainage practices slightly different from the previously described categories. The IN_Tippecanoe site was a wetland with future drainage water recycling planned but not implemented during this period. MN_Clay1 was a conventionally drained farm, MN_Clay3 was an undrained farm with only surface drainage, MN_Redwood2 was an undrained prairie area and ND_Richland had controlled drainage and a sub-irrigated area utilizing a sump pump lift station for water management.
Data collected at each site
The data describes crop and field management, soil physical characteristics, water quality and quantity time series, drainage system design and specific practice variables for the 39 research sites. Weather data, primarily precipitation and air temperature, were also available for each site. However, other data collected varied since the measurement protocols were not coordinated before research was initiated at many sites. Cumulatively, more than 90 in-field variables were measured across all sites to characterize the performance of these alternative agricultural water management strategies. Water quality and quantity time series (drain flow, water table depth, nitrate-N concentration, and precipitation) were considered essential data for temporal robustness and accuracy regarding the hydrological response.
Precipitation (39 sites) and drain flow or discharge (36 sites) were the most reported variables, followed by nitrate-N concentration (32 sites) and load (30 sites) (Fig. 2). Other common water quality variables are summarized in Fig. 3. In addition, soil moisture time series collected at varying depths were reported for 16 sites.
In addition to the water quantity and quality variables that provide a direct measure of treatment impact to water sustainability, other variables including crop yield, crop and field management and soil characteristic data are important for evaluating inter-site variability. For example, differences in nutrient application with fertilizer and nutrient removal through crop uptake will influence the water quality impact of different treatments. Soil texture (reported for 21 sites), crop yield (29 sites), tillage (27 sites) and fertilizer application (31 sites) were considered most essential of these site characteristic variables13. Along with crop yield, sites reported additional variables that assisted in quantifying plant water, nutrient and carbon uptake, including grain moisture content (13 sites), final plant population (end of season plant density; 9 sites), grain total N (8 sites) and grain biomass (6 sites). Whole plant, vegetative and cob biomass, and whole plant, vegetative, cob and grain N and C contents, forage biomass and leaf area index were reported for five or fewer sites.
Sixteen sites reported soil organic carbon and total N, in addition to basic soil texture information. In addition, 31 other soil parameters were reported for a subset of sites; the most common are summarized in Table 1. Soil organic matter, infiltration, lime index, sodium concentration or amount, sodium absorption ratio, neutralizable acid and salinity were reported for five or fewer sites.
Summary of measurement methods
Most experiments were not coordinated when the data collection project was initiated; hence research data collected, length of experimentation, years of available data, and protocols varied. Methods for each research site are provided in the data to document differences in measurement schedule, sample size, sample collection frequency, and equipment precision. Here, we summarize methods for determining drain flow, nitrate-N concentration and load, water table, soil properties and weather data due to the variability across sites within these key metrics. Crop yield is not summarized here despite its importance as a metric due to more consistent methods typically used across sites. Inter-site sampling methods for water measurements varied more than methodology for measuring other parameters. This variability is due to differing infrastructure at each site that required different measurement methods and the financial resources available for monitoring.
Drain flow measurement and reporting
Drain flow or discharge data were reported for 36 sites, including 19 CD, eight SB, six DWR and three with other practices (e.g., wetland). For all CD, three DWR and two wetland sites, drain flow was reported in mm/day (drainage discharge normalized by the drainage area). For all other sites, volumetric drainage discharge was reported in m3/day. Two of the sites (MN_Clay3 and MN_Redwood2) were undrained control sites that did not report drain flow or discharge. A third site (MO_Shelby) focused on the agronomic impact of subsurface drainage practices and did not monitor drain flow.
Drain flow was measured hourly or sub-hourly at more than 80% of the sites, followed by aggregation to daily flow measurements. Subsurface drainage flow rates were determined as a function of the water head measured using pressure transducers installed inside drainage control structures or at the drain outlet for approximately two-thirds of the sites. The water head was measured upstream of V-notch or rectangular weirs and empirical equations that depend on the weir dimensions were used to determine drain flow, which was measured and recorded hourly or sub-hourly. For IN_Tippecanoe drain flow was estimated as a function of water head using an empirical rating curve. At three sites, drain flow was measured using inline flow meters and recorded by data loggers. The advantage of this method is that flow could be recorded in either direction, valuable for sites experiencing backflow in the drainage system due to high downstream water levels14. At ND_Richland, drainage was collected at a sump where a current sensor was used to measure pumping frequency to calculate drainage flow15. For an additional three sites, drainage discharge was measured using a depth-velocity meter installed at the outlet of the drainage pipe or a drainage ditch. The drainage discharge was calculated as the product of the flow velocity and the area of flowing water. Only one site (MN_Redwood3) had manual measurements of drain flow that were collected two to three times per week.
Measured drain flow data exhibited variable frequency and duration gaps due to instrumentation malfunctioning, particularly with the automated monitoring systems that provide near-continuous data. Missing data and their non-uniform distribution created problems in statistical analyses when comparing aggregated drain flow and loads from different locations. A systematic approach was used to infill missing drain flow data utilizing variables available at all sites (precipitation, temperature, drain flow) and replicate plots where available. The method consisted of the following three phases and completed in progression, when applicable.
Phase 1, fill in zero flow.
During most winters in the northern states, the soil is frozen to the depth of the tile, and no subsurface drain flow is expected. Such periods were identified based on expert judgment by researchers at each site, relying on soil and air temperature information and local knowledge of the drainage system’s response to these conditions. If no drainage measurements were available due to frozen soil, the corresponding gaps in the data record were infilled with zero.
Phase 2, predict using replicate plots.
Regression-based estimation was used to infill missing data at three sites which had replicated plots or adjacent fields with available data. Due to the seasonal nature of subsurface drainage from croplands, individual linear regression models were developed for each season: winter (Jan, Feb, Mar), spring (Apr, May, Jun), summer (Jul, Aug, Sep) and autumn (Oct, Nov, Dec). Regression r2 values ranged from 0.66 to 0.94 based on the site and season, although mean across-site values were similar: winter (0.80), spring (0.82), summer (0.80), and autumn (0.83).
Phase 3, populate based on precipitation and drain flow from the preceding day.
The remaining missing daily drain flow data at 11 sites were filled as described below, based on the assumption that drain flow occurs on a given day only if (a) precipitation occurred on that day or (b) the drain continued to flow from the day before.
-
a.
For days with precipitation, a two-day moving average was calculated to account for the time lag between rainfall and resulting drain flow. A linear regression model was fitted to non-zero drain flow and two-day moving average precipitation for each season, with the model’s intercept fixed to zero. We used these models to predict the missing drain flow data for days with non-zero precipitation. The predicted drain flow values were limited to the drainage system’s capacity by replacing predictions greater than the site’s drainage coefficient (depth of water the drainage system could remove within 24 hours) with the coefficient’s value.
-
b.
For days with zero precipitation, missing drain flow was calculated from the previous day’s observed flow using the following first-order recession equation
where Q is daily drain flow, k is the average recession coefficient of falling limbs calculated as a linear slope of ln(Q), and i indicates day. The recession coefficient was calculated as a linear slope between the peak and inflection point of log-transformed daily drain flow data. The coefficient was calculated for all falling limbs of drain flow data, and the average seasonal values were calculated as their arithmetic mean.
The regression model between on-site precipitation and peak flow and recession equation were only applied to the original (pre-gap-filled) drain flow data. Predictions were not made when the number of missing drainage days exceeded 152 (5 months) within a calendar year; therefore, approx. 18% of the drain flow data remain missing. Both the original and filled data are included in the published data.
Nitrate-N concentration and load measurement and reporting
Nitrate-N (NO3) concentrations were reported for 32 sites, including 15 CD, eight SB, six DWR, and three sites with other practices (e.g., wetland). The three sites not reporting drain flow (MN_Clay3 and MN_Redwood2, MO_Shelby) did not report NO3 concentrations. Two sites (MO_Knox1 and MO_Knox3) provided NO3 load along with discharge in place of reporting the concentration of individual water samples. Two sites (OH_Hardin2 and OH_Henry) did not report NO3 concentrations or load due to limited water sample collection at these sites.
Six sites collected flow-proportional samples, in which a sample is collected every time a given volume of water passes through the drainage system. The flow-proportional sampling methods at the sites varied. At NC_Washington, a portion of flow was diverted continuously into a composite sample which was collected fortnightly (or more frequently under high flows). At IA_di4, a proportional sample was collected each time the drainage system was pumped. At MN_Redwood1, flow proportional samples were collected during storm and baseflow conditions. These samples were not composited but rather kept discrete. Seven sites used automated samplers to collect time-proportional samples. Five of these sites composited samples daily, while one site (IN_Randolph) collected samples hourly, then combined samples into approximately weekly composites. One site (IN_Tippecanoe) collected weekly grab samples prior to 2016 but then switched to automated, time-proportional sampling composited weekly in March 2016. Sites that used automated samplers typically switched to manual sampling (every two days to weekly frequency) in winter to protect automated samplers from freezing. Twelve sites collected weekly grab samples, another collected samples 2–3 times per week. One site collected biweekly grab samples, and four sites collected grab samples approximately monthly. Regardless of the collection method, all samples were either frozen or refrigerated (4–5 °C) upon return to the laboratory until analysis.
The sampling strategy primarily affects the frequency and compositing strategy of the water samples. Automated samplers permit more complex sampling strategies, such as flow-proportional or sub-daily sampling. However, the disadvantages of this method are the high initial expense of sampling equipment and the propensity for equipment malfunction at below-freezing air temperatures. The potential for equipment failure prompted sites using automated samplers to switch to a manual sampling in winter while drains remained flowing. Manual sampling frequency varied among sites due to differences in site accessibility or personnel availability. Both automated and manual water samples were often composited following collection, and sample compositing frequency ranged from daily to biweekly. Although sample collection frequency and compositing strategy affect the uncertainty of loading measurements, a collection frequency between 3 to 17 days is generally sufficient to reach ± 10% accuracy for annual nitrate load estimation for tile-drained landscapes in the Midwest16.
For nitrate-N analysis, 12 sites reported a cadmium reduction followed by a sulfanilamide reaction (equivalent to EPA 353.2). However, there was a slight methodological variation depending on the equipment, either Lachat QuikChem 8000 Flow-Injection Analyzer or SEAL AQ2 Discrete Analyzer. The resulting nitrate-N concentrations calculated via cadmium reduction were directly comparable regardless of the instrument used. At one site, SD_Clay, ion chromatography (EPA 300.1) was used to measure nitrate-N in 2015 but was subsequently switched to a cadmium reduction method. Samples at the seven IA sites were analysed by second-derivative spectrophotometry17.
Daily nitrate loads were calculated by multiplying nitrate concentration by drain flow and were therefore available for 32 sites for which both values were reported. Load calculation methods differed slightly in terms of determining the volume of water associated with each concentration. Typically, linear interpolation was used to determine the daily nitrate concentration at sites which collected “grab” water samples following precipitation events or on a schedule spanning two days or more. One variation used assumed the measured concentration was representative of adjacent days (prior and post), hence no interpolation was done. One site (OH_Delaware) used a midpoint approach to determine the time interval in which measured concentrations were associated with, while another site (IA_di4) assumed measured concentrations represented all water drained before the sample was collected.
Water table measurement and reporting
The water table was measured at 16 sites including nine CD sites, three SB sites, three DWR sites and one wetland site. Documenting water table fluctuation is key to experimental and modelling research investigating crop production systems on artificially drained soils. In a tile-drained field, the water table is used as an input parameter in estimates of drain flow, evapotranspiration, and soil hydraulic conductivity14,18,19. In controlled drainage, the water table is used to determine CD effectiveness and guide water management in the field for different crop stages. For DWR practice, the water table, particularly the midpoint water table, is used to evaluate sub-irrigation performance, such as uniformity and efficiency20. In a saturated buffer field, the water table is the most important factor used to indicate a field’s saturation status21.
The field water table was typically measured at the midpoint between the subsurface drains. Some field studies also measured the water table at two locations, one near the drain tile and the other at the midpoint between two drain tiles. The water table was commonly measured and recorded hourly or sub-hourly using pressure transducers installed inside 1.5–2.5 m deep wells of perforated PVC pipes. The water table depth was calculated using the measured water pressure above the transducer, and the in-situ water temperature and barometric pressure measured in a nearby field and periodically adjusted with manually measured water tables. If there were any discrepancies, all previous water table depth data were moved up or down correspondingly.
Differences across sites spanned the type of pressure transducers used, depth of measurement (1.5 to 2.4 m), data collection frequency (0.17 hr (10 min) to 6 hr), location of the measurement, and the length of the screened section of the pipe. The selection of the transducer type was due to individual choice and cost, while differences in the water table depth measurements were affected by the soil types and drain depth. The frequency of data collection was based on data logger capacity, water table variations, and the purpose of the measurements. The length of the perforated (“screened”) section of the pipe, in which the transducer was installed, also varied. For a typical tile-drained field, the pipe was screened beginning 0.3 m below the soil surface while for a saturated buffer, the pipe was screened beginning at the soil surface16. The data collection frequency for the saturated buffer area was every 6 hr since the water table variations were minimal across time. Within the field experiments, data were collected every hour at 10 sites and every 0.17 hr (10 min) at two sites.
Soil physicochemical variable measurement and reporting
Potentially important soil physical and chemical properties that might affect or be affected by soil drainage were collected from 19 experiments across six states. Data included 17 total variables, continuous and categorical. Soil physical variables included bulk density, hydraulic conductivity (saturated), moisture (water) content, temperature, texture, and soil water retention data (used to form a water retention curve). The remaining 11 variables were chemical properties. The five most common chemical variables characterized were nitrate, total nitrogen, soil organic carbon, pH and cation exchange capacity with several sites using similar methods22.
There was large variability of soil sampling depth among the studies and within specific variables at a site, and in a few cases (<1%) sampling depth intervals were not specified. Of the soil physical properties, soil texture was consistently measured using the same quantitative method (hydrometer). Soil textural class, a categorical variable, was reported once during the study period. Although not explicitly stated, it was assumed that researchers used intact soil cores and standard methods23 for quantifying bulk density. Soil water content was measured using capacitance/frequency domain technology or dielectric impedance-based sensor. At sites where soil temperature was measured, a thermistor in thermal contact with a water content probe measured temperature. Soil water content and temperature data were saved at time intervals ranging from five to 60 minutes.
Of the soil chemical properties, total nitrogen was measured once during a study and consistently using the dry combustion method. The soil nitrate concentration measurement frequency was not specified, and nitrate was extracted using potassium chloride (2 M KCl). The nitrate analysis method, when specified, was either by cadmium reduction or nitrate specific electrode. Soil pH was analyzed using a 1:1 soil:water mixture or salt solution. Where soil organic carbon was measured, the dry combustion method was used after carbonate removal by sulfuric acid.
Weather data collection
Weather information was available from on-site weather stations and nearby weather stations operated by the Federal Aviation Administration, National Weather Service, or state agricultural weather networks. Precipitation was recorded using automated tipping bucket gauges for rainfall at 31 sites, while at one site, a manual read gauge was used. Multiple rain gauges were installed for redundancy and measurement intercomparisons at four sites. The air temperature was recorded electronically except at one site, where a manual thermometer was used. Weather information was recorded at daily intervals.
Precipitation measurements were a particular focus due to its important effects on drainage response and its higher spatial variability compared to solar radiation, air temperature, humidity, and wind speed. To help improve the quality and integrity of precipitation data in situations where the nearest weather station was located more than 1000 m from the research site or to provide more localized information, on-site precipitation was collected at 20 sites. These onsite precipitation measurements used tipping bucket gauges with 15.2 or 20.3 cm orifices to optimize measurement accuracy. Rain gauges may underestimate rainfall due to wind effects, wetting of and evaporation from the funnel, mechanical limitations, splashing out of the funnel or other factors24 by up to 20% at high rainfall intensities25. Reference evapotranspiration was estimated for short or tall reference (or both) using the standardized Penman-Monteith26,27 equation for most sites, except for three of the sites where potential evapotranspiration was estimated using the temperature-based Thornthwaite equation28.
Weather data were reviewed by the Data Management Team using visualization and statistical methods to ensure uniformity and high quality of the data across all sites. Plausible ranges were defined for each parameter based on known climatological limits for the site with guidance by research personnel. For certain parameters, absolute limits were used to identify impossible values (e.g., such as wind directions not between 0 and 360 degree or relative humidity greater than 100%). In addition, an internal consistency check was performed between daily minimum and maximum air temperature observations. All questionable records were flagged, reconciled or removed except for 6 records with internal inconsistency at IN_Randolph but retained and the decision left to data users. The percent of missing data varied among variables (1.2% relative humidity to 36.7% evapotranspiration) and research sites (0 to 38.5% at OH_Crawford). Only, 4.2% of the records are missing in total and it is left to the discretion of users to reconstruct missing data.
Data ingestion and processing
The team leveraged a previously existing Google Cloud entry and management platform developed for the USDA–NIFA funded Sustainable Corn CAP team29,30,31. The system was expanded and customized to the needs of the Transforming Drainage team. Team members were led through a workflow (Supplementary Fig. 1) utilizing Google Sheets (web browser-based spreadsheets stored in the cloud), customized Google App Scripts (web browser-based data entry forms) and uploading free-form content to the shared Google Drive. All information was stored in Google’s Cloud with associated versioning and metadata tracking with team member identification. Automation was used to provide daily alerts to Data Management Team personnel regarding content modified within the previous 24 hours for further review and quality control. Additional automation via Python scripting synced data content to a local PostgreSQL database server housed at Iowa State University for further analysis, quality control and viewing within custom interfaces on an internal website.
The team implemented a targeted data entry approach to ensure timely revision and sharing of key variables among project members. For this purpose, all variables were categorized as primary or secondary in importance based on research priorities. Primary variables were crop yield, drain flow, tile nitrate-N concentration, water table depth, on-site precipitation, and field management data. These data were critical to address research areas of interest and expected to be the most time-consuming to retrieve, given the sampling frequency and quality control needed. All other variables were secondary and incorporated into the database following the entry of primary variables.
The general workflow of the team included characterizing treatments and data collected for each site, building the data dictionary, creating site-specific spreadsheets for different subsets of variables, entering priority data, entering methods used to collect measurements, entering secondary data, and adding additional research sites from cooperating partners. Subsequently, supporting photographs and drainage system design drawings and site maps were uploaded. Quality control was performed on all data throughout the process. Data were interrogated multiple times during the life of the project, and data reviews with site personnel occurred annually to address issues or questions. Issues needing attention were listed internally for site personnel to further scrutinize data prior to publishing.
Standards were instituted for each variable to ensure uniformity and enable data synthesis across dispersed datasets. These standards specified the minimum frequency for continuous or semi-continuous variables (e.g., drain flow was required to be reported at least daily) and precision according to assigned units of measurement. There was only one site (MN_Redwood3) that did not meet the requirements in terms of periodicity for soil moisture, as measurements were collected weekly. At several locations, variables were reported at higher or irregular frequencies. At the end of the project, all high-frequency measurements were aggregated into values corresponding to the minimum frequency of the respective variable.
Nutrient concentration measurements were limited to the detection limit corresponding to the analytical methods used at each site when the information was available. Any concentration below the limit was replaced with the corresponding limit, such as < 0.3. Concentrations were not adjusted at several sites where the detection limit was not available. Occasionally, zero concentrations were reported, indicating that the analytical method could not detect the nutrient. This limitation should be considered for inter-site water quality analysis.
Data Records
Data have been published9 to the USDA National Ag Library Ag Data Commons (USDA NAL ADC) repository with an assigned Digital Object Identifier (doi) and made available at drainagedata.org. Both sites provide identical copies of the data. Data at USDA NAL ADC are stored as individual .csv files to meet their recommended standards and ensure machine-readability32. Each .csv file contains research data or metadata necessary to represent and understand these drainage systems with purposes specific to each file (Table 2).
Technical Validation
Data were scrutinized by individual site investigators and centrally by data experts with domain knowledge. Following upload by site investigators, all data underwent centralized validation programmatically and manually. Multiple steps and processes were used in this validation process, including establishing relevant and appropriate bounds based on local conditions, applying centralized standards and uniformity, and harmonization.
Data validation was first performed at the individual site level by personnel with expert knowledge of local conditions. This approach recognizes that the feasible ranges of the measured crop, soil and hydrologic parameters are limited by chemical, biological and physical processes that differ based on local soil, climate, and agronomic management conditions. For example, local precipitation intensity and depth, soil conductivity and drainage system design control the maximum expected daily drain flow rates. Based on local knowledge and conditions, data experts reset observed drain flow to a no data value if recorded values fell outside expectations based on observed precipitation or soil moisture. For example, a drain flow spike was deleted when no precipitation was observed. In addition, pronounced differences in seasonality in drain flow, soil moisture, soil temperature and related variables were used to screen for appropriate values. For example, given the low likelihood of winter drain flow in the northern and western portions of the domain, site experts set missing drain flow data to zero values, following examination of soil temperature and moisture data.
Given the large variation across sites and years in climate, soil, hydrology, and farm management practices, these measured data differed substantially and appropriately. Data distribution was assessed for all numeric measured variables (84 total) across research sites to ensure meaningful value ranges existed (Table 3). When outliers were detected, personnel reviewed past field and laboratory notes to discard or retain. In isolated cases, samples were resubmitted to laboratories for analysis. Individual site measurements were visualized relative to other experimental sites as a visual confirmation of magnitude and range. Management data, including farm operations, were also standardized with validation performed by data personnel knowledgeable about farmer practices and appropriate fieldwork methods as they pertain to crop varieties, fertilizer applications, tillage operations and seeding rates.
Usage Notes
The Transforming Drainage dataset is a unique resource for understanding drainage and evaluating the productivity and environmental tradeoffs of drainage storage practices. Although data coverage is entirely in the US, it has far reaching implications in areas with similar soil and hydrological constraints and opportunities. Its use to address questions beyond the scope of the Transforming Drainage team is encouraged such as:
-
Model developers can use data to validate algorithms and test the predictive ability of models to improve understanding of hydrologic and agroecosystem processes in drained landscapes. The simulation of conventional drainage could be assessed using the untreated plot or before treatment measurement collection from each site, and the ability of models to predict the effect of conservation practices could be tested. Models have been published with these data to-date18,33.
-
Tool developers can utilize data for necessary inputs to improve existing or build new applications. A current example is the Evaluating Drainage Water Recycling Decisions (EWDRD) tool34 which requires daily drainage data.
-
Drainage engineers or managers can use the data as baseline or reference values of typical drain flow and other drainage variables across the region.
-
Researchers can use the data in establishing data standards for variable measurement, variable reporting, collection frequency, and unit reporting.
-
Researchers can leverage these data with their own or other databases such as STEWARDS (water quality at watershed scale35), MANAGE (50-year compilation from literature at the field or replicate scale36), and Sustainable Corn CAP Research Data31 (five years overlap at 5 sites (IA_Washington, IN_Randolph; MN_Redwood1, MN_Redwood2, OH_Auglaize2).
-
Educators can use the drainagedata.org visualization interfaces for undergraduate education, outreach presentations to farmers, and other purposes.
Differences in methods and variations across the sites may present challenges to users and we encourage delving into site details through published metadata and web resources. Extensive metadata are available in .csv format from the USDA National Ag Library Ag Data Commons repository9 and the interactive website (drainagedata.org) at Iowa State University10. Technical, 2–4-page site summaries and imagery are also available on the project website (transformingdrainage.org)37 and drainagedata.org. Finally, published studies for many of the sites are available in the literature (Table 4).
Code availability
Code written by the Data Management Team in support of this project can be found on github.com under an open and permissive usage license. Gap filling of the drain flow data and visualization of the results were performed within RStudio38,39. The source code and corresponding input and output data are publicly available on GitHub40. The code is split into three pieces corresponding to individual phases of the gap-filling method. In addition, there are separate scripts for the visualization of the data. At drainagedata.org, users can visualize the data with customized tools, query based on specific sites and measurements of interest, and access site photographs, maps, summaries, and publications. The code used for the backend data services and frontend web interfaces can be found on GitHub41.
References
Castellano, M. J., Archontoulis, S. V., Helmers, M. J., Poffenbarger, H. J. & Six, J. Sustainable intensification of agricultural drainage. Nat. Sustain. 2, 914–921, https://doi.org/10.1038/s41893-019-0393-0 (2019).
Ross, J. A. et al. A synthesis and comparative evaluation of factors influencing the effectiveness of drainage water management. Agric. Water Manag. 178, 366–376, https://doi.org/10.1016/j.agwat.2016.10.011 (2016).
Helmers, M. J. et al. Impact of controlled drainage on subsurface drain flow and nitrate load: A synthesis of studies across the U.S. Midwest and Southeast. Agric. Water Manag. 259, 107265, https://doi.org/10.1016/j.agwat.2021.107265 (2022).
Jaynes, D. B. & Isenhart, T. M. Reconnecting tile drainage to riparian buffer hydrology for enhanced nitrate removal. J. Environ. Qual. 43, 631–638, https://doi.org/10.2134/jeq2013.08.0331 (2014).
Groh, T. A., Davis, M. P., Isenhart, T. M., Jaynes, D. B. & Parkin, T. B. In situ denitrification in saturated riparian buffers. J. Environ. Qual. 48, 376–384, https://doi.org/10.2134/jeq2018.03.0125 (2019).
Willison, R. S. et al. Corn yield response to subsurface drainage water recycling in the Midwestern United States. Agron. J. 113(2), 1865–1881, https://doi.org/10.1002/agj2.20579 (2020).
Hay, C. H. et al. Drainage Water Recycling in the Humid Regions of the US: Challenges and Opportunities. Trans. ASABE 64, 1095–1102, https://doi.org/10.13031/trans.14207 (2021).
Reinhart, B. D., Frankenberger, J. R., Hay, C. H. & Helmers, M. J. Simulated water quality and irrigation benefits from drainage water recycling at two tile-drained sites in the US Midwest. Agric. Water Manag. 223, 105699, https://doi.org/10.1016/j.agwat.2019.105699 (2019).
Chighladze, G. et al. Transforming Drainage Research Data (USDA-NIFA Award No. 2015-68007-23193). Ag Data Commons. https://doi.org/10.15482/USDA.ADC/1521092 (2021).
https://drainagedata.org hosted by Iowa State University.
Soil Survey Staff, Natural Resources Conservation Service, United States Department of Agriculture. Web Soil Survey. http://websoilsurvey.sc.egov.usda.gov/ (accessed Jan 2021).
Allred, B. J. et al. Field test results for nitrogen removal by the constructed wetland component of an agricultural water recycling system. Appl. Eng. Agric. 30, 163–177, https://doi.org/10.13031/aea.30.10061 (2014).
Eagle, A. J. et al. Meta-analysis constrained by data: Recommendations to improve relevance of nutrient management research. Agron. J. 109, 2441–2449, https://doi.org/10.2134/agronj2017.04.0215 (2017).
Saadat, S., Bowling, L., Frankenberger, J. & Kladivko, E. Estimating drain flow from measured water table depth in layered soils under free and controlled drainage. J. Hydrol. 556, 339–348, https://doi.org/10.1016/j.jhydrol.2017.11.001 (2018).
Scherer, T. F. & Jia, X. A simple method to measure the flow rate and volume from tile drainage pump stations. App Eng. Agric. 26, 79–83, https://doi.org/10.13031/2013.29478 (2010).
Williams, M. R. et al. Uncertainty in nutrient loads from tile-drained landscapes: Effect of sampling frequency, calculation algorithm, and compositing strategy. J. Hydrol. 530, 306–316, https://doi.org/10.1016/j.jhydrol.2015.09.060 (2015).
Crumpton, W. G., Isenhart, T. M. & Mitchell, P. D. Nitrate and organic N analyses with second‐derivative spectroscopy. Limnol. Oceanogr. 37, 907–913, https://doi.org/10.4319/lo.1992.37.4.0907 (1992).
Kolars, K., Jia, X., Steele, D. D. & Scherer, T. F. A soil water balance model for subsurface water management. Appl. Eng. Agric. 35, 633–646, https://doi.org/10.13031/aea.13038 (2019).
Niaghi, A. R. & Jia. X. New approach to improve the soil water balance method for evapotranspiration estimation. Water 11, 2478. https://doi.org/10.3390/w11122478 (2019).
Jia, X., Scherer, T. F., Steele, D. D. & DeSutter, T. M. Subirrigation system performance and evaluation in the Red River Valley of the North. Appl. Eng. Agric. 33, 811–818, https://doi.org/10.13031/aea.12286 (2017).
Jaynes, D. B. & Isenhart, T. M. Performance of Saturated Riparian Buffers in Iowa, USA. J. Environ. Qual 48, 289–296, https://doi.org/10.2134/jeq2018.03.0115 (2019).
Kladivko, E. J. et al. Standardized research protocols enable transdisciplinary research of climate variation impacts in corn production systems. J. Soil Water Conserv 69, 532–542, https://doi.org/10.2489/jswc.69.6.532 (2014).
Blake, G. R. & Hartge, K. H. Bulk density. In Methods of Soil Analysis: Part 1 Physical and mineralogical methods. 5th edn. SSSA Madison, WI (1986).
ASABE. Measurement and reporting practices for automatic agricultural weather stations. EP505.1 Standard, ASAE SW-244 Irrigation Management Subcommittee. 20 p. https://texaset.tamu.edu/Resources/Documents/ASABE%20Standard.pdf (2015).
Shedekar, V. S. et al. Measurement errors in tipping bucket rain gauges under different rainfall intensities and their implication to hydrologic models. ASABE Annual International Meeting, Reno, NV. https://doi.org/10.13031/2013.27308 (2009).
Allen, R., Pereira, L. S., Raes, D. & Smith, M. Crop evapotranspiration: Guidelines for computing crop water requirements. FAO-56, United Nations Food and Agriculture Organization, Rome (1998).
ASCE-EWRI. Evaporation, evapotranspiration and irrigation water requirements. 2nd edn. (eds. Jensen, M. E. & Allen, R.) American Society of Civil Engineers, Reston VA. https://doi.org/10.1061/9780784414057 (2016).
Thornthwaite, C. W. An approach toward a rational classification of climate. Geogr. Rev. 38, 55–94, https://doi.org/10.2307/210739 (1948).
Herzmann, D. E., Abendroth, L. J. & Bunderson, L. D. Data management approach to multidisciplinary agricultural research and syntheses. J. Soil Water Conserv. 69, 180A–185A, https://doi.org/10.2489/jswc.69.6.180a (2014).
Abendroth, L., et al. Project and Research Management: Integrating Systems, Data, and People in Multidisciplinary Work. Technical Report Series: Observations and Recommendations of the USDA-NIFA funded Climate and Corn-based Cropping Systems Coordinated Agricultural Project. Iowa State University, Ames, IA. Vol 5 of 5. Pub. No. CSCAP-0198-2017. http://lib.dr.iastate.edu/sustainablecorn_extensionoutputs/2/ (2017).
Abendroth, L. J. et al. Sustainable Corn CAP Research Data (USDA-NIFA Award No. 2011-68002-30190). https://doi.org/10.15482/USDA.ADC/1411953 National Agricultural Library - ARS - USDA (2017).
USDA NAL ADC. Ag Data Commons Data Submission Manual User Edition v1.3. USDA Agricultural Research Service. Washington, DC. https://data.nal.usda.gov/system/files/AgDataCommonsdatasubmissionmanual_v1.3_1.pdf (2016).
Saadat, S., Frankenberger, J., Bowling, L. & Ale, S. Evaluation of surface ponding and runoff generation in a seasonally frozen drained agricultural field. J. Hydrol. 588, 124985, https://doi.org/10.1016/j.jhydrol.2020.124985 (2020).
Reinhart, B. D., Frankenberger, J. R., Hay, C. H., Bowling, L. B. & Hancock, B. G. Development and sensitivity analysis of an online tool for evaluating drainage water recycling decisions. Trans. ASABE 63, 1991–2002, https://doi.org/10.13031/trans.13900 (2020).
Steiner, J. L. et al. Data management to enhance long-term watershed research capacity: Context and STEWARDS case study. Ecohydrol. 2, 391–398, https://doi.org/10.1002/eco.89 (2009).
Christianson, L. E. & Harmel, R. D. The MANAGE Drain Load database: Review and compilation of more than fifty years of North American drainage nutrient studies. Agric. Water Manag. 159, 277–289, https://doi.org/10.1016/j.agwat.2015.06.021 (2015).
Reinhart, B. D. & Frankenberger, J. F. Experimental sites. https://transformingdrainage.org/research-database/sites hosted by Purdue University. (2016).
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing. Retrieved from https://www.R-project.org/ (2018).
RStudio Team. RStudio: Integrated Development for R. R Studio Inc. Retrieved from https://www.rstudio.com/ (2016).
Chighladze, G. Gap filling method. https://github.com/TransformingDrainageProject/Drain-Flow-Gap-Filling (2021).
Herzmann, D. E., Chighladze, G. & Abendroth, L. J. ISU Data team repository. https://github.com/isudatateam/datateam (2015).
Jaynes, D. B. Changes in yield and nitrate losses from using drainage water management in central Iowa, United States. J. Soil Water Conserv. 67, 485–494, https://doi.org/10.2489/jswc.67.6.485 (2012).
Helmers, M. J., Christianson, R., Brenneman, G., Lockett, D. & Pederson, C. Water table, drainage, and yield response to drainage water management in southeast Iowa. J. Soil Water Conserv. 67, 495–501, https://doi.org/10.2489/jswc.67.6.495 (2012).
Saadat, S., Bowling, L., Frankenberger, J. & Kladivko, E. Nitrate and phosphorus transport through subsurface drains under free and controlled drainage. Water Res. 142, 196–207, https://doi.org/10.1016/j.watres.2018.05.040 (2018).
Smith, S. D. Evaluating management options: Simulating wetland processes and performance of nutrient reduction by use of a water quality algorithm. Open Access Theses. Purdue University, West Lafayette, IN. 1160. https://docs.lib.purdue.edu/open_access_theses/1160 (2015).
Sylvester, L. M. Characterization and analysis of a natural wetland receiving agricultural runoff (Order No. 1469776). Available from Dissertations & Theses @ CIC Institutions; ProQuest Dissertations & Theses Global. (304530254). Purdue University, West Lafayette, IN. Retrieved from https://www.proquest.com/dissertations-theses/characterization-analysis-natural-wetland/docview/304530254/se-2?accountid=13360 (2008).
Niaghi, A. R. Advanced evapotranspiration measurement for crop water management in the Red River Valley. North Dakota State University, Fargo, ND. https://library.ndsu.edu/ir/handle/10365/31644 (2019).
Almen, K. Impact of controlled drainage and subirrigation in the Red River Valley. North Dakota State University, Fargo, ND. https://search.proquest.com/docview/2477876026?pq-origsite=primo (2020).
Kjaersgaard, J., Janz, A. & Wagner, M. Designing In-Field Water Storage and Sub-Irrigation for an Agricultural Drainage System. World Environmental and Water Resources Congress 2016: Watershed Management, Irrigation and Drainage, and Water Resources Planning and Management Proceedings, 37–46, https://doi.org/10.1061/9780784479858.005 (2016).
Nash, P., Nelson, K. & Motavalli, P. Reducing nitrogen loss with managed drainage and polymer‐coated urea. J. Env. Qual. 44, 256–264, https://doi.org/10.2134/jeq2014.05.0238 (2015).
Nash, P. R., Nelson, K. A., Motavalli, P. P., Nathan, M. & Dudenhoeffer, C. Reducing phosphorus loss in tile water with managed drainage in a claypan soil. J. Env. Qual. 44, 585–593, https://doi.org/10.2134/jeq2014.04.0146 (2015).
Kaur, H., Nelson, K. A. & Singh, G. Subsurface drainage and subirrigation for increased corn production in riverbottom soils. Agron. J. 113, 4865–4874, https://doi.org/10.1002/agj2.20887 (2021).
Nash, P. R., Nelson, K. A. & Motavalli, P. P. Corn response to drainage and fertilizer on a poorly drained, river bottom soil. Agron. J. 107, 1801–1808, https://doi.org/10.2134/agronj14.0380 (2015).
Nash, P. R., Nelson, K. A. & Motavalli, P. P. Reducing nitrogen loss in subsurface tile drainage water with managed drainage and polymer-coated urea in a river bottom soil. Water Resour. Prot. 6, 988–997, https://doi.org/10.4236/jwarp.2014.611093 (2014).
Nash, P. R., Nelson, K. A., Motavalli, P. P. & Udawatta, R. P. Improved water management of a poorly drained floodplain soil and the impact on forage production and nitrate concentration. J. Soil Water Conserv. 72, 619–628, https://doi.org/10.2489/jswc.72.6.619 (2017).
Nash, P. R., Singh, G. & Nelson, K. A. Nutrient loss from floodplain soil with controlled subsurface drainage under forage production. J. Env. Qual. 49, 1000–1010, https://doi.org/10.1002/jeq2.20072 (2020).
Nelson, K. A. Soybean yield variability of drainage and subirrigation systems in a claypan soil. Appl. Eng. Agric. 33, 801, https://doi.org/10.13031/aea.12276 (2017).
Singh, G. & Nelson, K. A. Long-term drainage, subirrigation, and tile spacing effects on maize production. Field Crops Res. 262, 108032, https://doi.org/10.1016/j.fcr.2020.108032 (2001).
Poole, C. A., Skaggs, R. W., Chescheir, G. M., Youssef, M. A. & Crozier, C. R. Effects of drainage water management on crop yields in North Carolina. J. Soil Water Cons. 68(6), 429–437, https://doi.org/10.2489/jswc.68.6.429 (2013).
Poole, C. A., Skaggs, R. W., Youssef, M. A., Chescheir, G. M. & Crozier, C. R. Effect of drainage water management on nitrate nitrogen loss to tile drains in North Carolina. Trans. ASABE 61(1), 233–244, https://doi.org/10.13031/trans.12296 (2018).
Almen, K., Jia, X., DeSutter, T., Scherer, T. & Lin, M. Impact of controlled drainage and subirrigation on water quality in the Red River Valley. Water 13(1), 308–324, https://doi.org/10.3390/w13030308 (2021).
Gunn, K. M. et al. Subsurface drainage volume reduction with drainage water management: Case studies in Ohio, USA. Agric. Water Manag. 149, 131–142, https://doi.org/10.1016/j.agwat.2014.10.014 (2015).
Pease, L. A., Fausey, N. R., Martin, J. F. & Brown, L. C. Weather, landscape, and management effects on nitrate and soluble phosphorus concentrations in subsurface drainage in the Western Lake Erie Basin. Trans. ASABE 61, 223–232, https://doi.org/10.13031/trans.12287 (2018).
Allred, B. J. et al. Water table management to enhance crop yields in a wetland reservoir subirrigation system. Appl. Eng. Agric. 19, 407–421, https://doi.org/10.13031/2013.14923 (2003).
King, K. W., Williams, M. R. & Fausey, N. R. Effect of crop type and season on nutrient leaching to tile drainage under a corn–soybean rotation. J. Soil Water Conserv. 71, 56–68, https://doi.org/10.2489/jswc.71.1.56 (2016).
Shedekar, V. S. et al. Exploring the effectiveness of drainage water management on water budgets and nitrate loss using three evaluation approaches. Agric. Water Manag. 243, 106501, https://doi.org/10.1016/j.agwat.2020.106501 (2021).
Sahani, A. A Demonstration Study of Drainage Water Management in Eastern South Dakota. MS Thesis, South Dakota State University, Brookings, SD. https://openprairie.sdstate.edu/etd/2148/ (2017).
Acknowledgements
The Transforming Drainage team, officially referred to as the “Managing Water for Increased Resiliency of Drained Agricultural Landscapes” project, thanks the USDA-NIFA for funding (Award No. 2015-68007-23193). We thank Stephanie Bowden for data review support. We also thank the following members of the Transforming Drainage team who contributed research data to the published dataset9. The authors of this paper along with team members should be acknowledged when data are cited: Nelson, K. (five research sites in Missouri), Allred, B., King, K., Fausey, N., Brown, L. (eleven research sites in Ohio), Ahiablame, L. (research site in South Dakota), Jaynes, D. (eight research sites in Iowa) and Kladivko, E. (research site in Indiana).
Author information
Authors and Affiliations
Contributions
All authors participated in manuscript writing. L.J.A. led the writing of this manuscript and the data management team. G.C. managed and analysed data and is first author of the published data at USDA NAL ADC. D.E.H. built drainagedata.org and managed the relational database. All others contributed research data from experimental sites.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Abendroth, L.J., Chighladze, G., Frankenberger, J.R. et al. Paired field and water measurements from drainage management practices in row-crop agriculture. Sci Data 9, 257 (2022). https://doi.org/10.1038/s41597-022-01358-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-022-01358-7