Addressing rainfall data selection uncertainty using connections between rainfall and streamflow

Levy, Morgan C.; Cohn, Avery; Lopes, Alan Vaz; Thompson, Sally E.

doi:10.1038/s41598-017-00128-5

Article
Open access
Published: 16 March 2017

Scientific Reports volume 7, Article number: 219 (2017)

Abstract

Studies of the hydroclimate at regional scales rely on spatial rainfall data products, derived from remotely-sensed (RS) and in-situ (IS, rain gauge) observations. Because regional rainfall cannot be directly measured, spatial data products are biased. These biases pose a source of uncertainty in environmental analyses, attributable to the choices made by data-users in selecting a representation of rainfall. We use the rainforest-savanna transition region in Brazil to show differences in the statistics describing rainfall across nine RS and interpolated-IS daily rainfall datasets covering the period of 1998–2013. These differences propagate into estimates of temporal trends in monthly rainfall and descriptive hydroclimate indices. Rainfall trends from different datasets are inconsistent at river basin scales, and the magnitude of index differences is comparable to the estimated bias in global climate model projections. To address this uncertainty, we evaluate the correspondence of different rainfall datasets with streamflow from 89 river basins. We demonstrate that direct empirical comparisons between rainfall and streamflow provide a method for evaluating rainfall dataset performance across multiple areal (basin) units. These results highlight the need for users of rainfall datasets to quantify this “data selection uncertainty” problem, and either justify data use choices, or report the uncertainty in derived results.

Introduction

Quantifying precipitation patterns at regional scales is essential for water security^{1, 2}, but is compromised by discrepancies in rainfall datasets^{3, 4, 5}. Spatial rainfall data products have proliferated, drawing on differing information sources, using different techniques to impute that information through space, and varying in their spatial extent and spatio-temporal resolution⁶. The proliferation of such rainfall datasets facilitates applied research at regional spatial scales, but raises the risk that naïve use of an individual rainfall product may introduce bias into subsequent analyses, relative to the full range of representations of the rainfall field available⁷. Addressing this risk requires quantifying the differences between available rainfall data products, and, if possible, identifying and working with only those datasets that are most suitable for the intended analysis. Here we first show that the differences across daily rainfall datasets, for a test case in Northern Brazil, are large enough to require such uncertainty characterization. Next, we demonstrate that comparison of datasets with a mechanistically related, but independently observed environmental variable, in this case streamflow, can provide a basis for selecting among available rainfall products. Although our proximate goal is to identify and reduce the uncertainties associated with naïve selection of a rainfall data product for hydrologic purposes, the approach is generalizable to other climatic products and applications.

Regional rainfall data are collected through remote sensing (RS) and in-situ (IS) rain gauge observations. At regional scales, and in remote, rural or developing regions, the rainfall data products generally available and most applicable for hydroclimatological analyses⁴ are based on RS data, IS data, or both. IS data provide precision and accuracy at a point, but are often distributed sparsely and heterogeneously in space, and discontinuously in time^{8, 9}, and may pose quality control challenges^{10, 11}. RS data have consistent coverage and represent spatial heterogeneity, but are often biased, with uncertainties that are dependent on topography, climate, and the level of spatial and temporal aggregation^{3, 5, 12}. Differences between rainfall datasets emerge, especially at daily or sub-daily temporal resolutions⁷, mostly due to artifacts introduced during data processing. For RS data, such artifacts can include a combination of satellite data retrieval technologies and associated processing algorithms, as well as IS calibration sources and methods⁴. For IS data, artifacts may derive from gauge measurement quality, availability, and the imputation and/or interpolation methods used^{13, 14, 15}. While RS data may be a preferred alternative to IS data in settings with sparse rain gauge networks¹⁶, at regional scales, both data types, and their spatial imputations, are expected to differ from ‘true’ (and unknown) rainfall fields.

Consequently, the challenge of data selection given the uncertainty associated with datasets is not to determine the ‘most accurate’ dataset, for which there is no universal assessment^{4, 17}, but instead to quantify the uncertainty in any given analysis that derives from the different representations of reality by the available ensemble of data products. If possible, data selection should also identify the most ‘fit-for-purpose’ dataset, based on its fidelity to the features of rainfall (e.g. mean, extremes, trends, or correspondence with an independently measured and mechanistically related environmental variable) most pertinent to a given study topic.

Our case study region, the rainforest-savanna (Amazon-Cerrado) transition zone in Brazil (Fig. 1(a)), has experienced dramatic changes in land-cover, with anticipated feedbacks to regional climate^{18, 19}, and thus to the wide variety of rainfall-dependent ecosystem services provided in the region, including agricultural industries, the hydropower sector^{20, 21}, and extensive regional forest. Variability and change in the Amazon and surrounding region’s precipitation therefore affect Brazilian economic, food, and energy security, and potentially also the health of the Amazon rainforest and the global climate system^{22, 23, 24}. Rainfall in center-west and northern Brazil is monitored through a relatively sparse rain gauge data network (15 or fewer rain gauges per 10⁴ km²), comparable to inland regions of South America; sub-Saharan Africa; and central, east, and southeast-Asia²⁵. These low densities are likely to result in non-trivial differences between regional rainfall data products²⁶ (in Switzerland, rain gauge densities of >24 rain gauges per 1,000 km² were required to avoid density-dependent biases⁹).

Rainfall data in center-west and northern Brazil are therefore likely to be inaccurate at regional scales, yet remain highly relevant to a wide range of policy and planning efforts. For the purposes of this paper, we focus on the quantification of regional daily rainfall statistics needed for hydrologic analyses. Daily rainfall data, or statistical representations thereof, are needed as input to a broad range of hydrological models and empirical analyses that assess spatial or regional trends or drivers of flow variability^{27, 28}. We analyze a suite of statistical descriptors of daily rainfall, including the daily mean rainfall depth, wet-day mean rainfall depth, and percent occurrence of wet days. These all influence streamflow response²⁹, and are referred to as “rainfall characteristics” in the remainder of this paper (results for a more expansive range of rainfall statistics are also presented as Supplementary Information).
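The three rainfall characteristics can be illustrated with a minimal sketch. It assumes the wet-day threshold of ≥1 mm/day used in the Results; the function name and the two example series are hypothetical and are not taken from the study data.

```python
from statistics import mean

WET_DAY_THRESHOLD_MM = 1.0  # assumed wet-day definition: >= 1 mm/day


def rainfall_characteristics(daily_mm):
    """Return (daily mean, wet-day mean, percent wet days) for a daily series in mm."""
    wet = [p for p in daily_mm if p >= WET_DAY_THRESHOLD_MM]
    daily_mean = mean(daily_mm)
    wet_day_mean = mean(wet) if wet else 0.0  # no wet days: report 0 by convention
    pct_wet_days = 100.0 * len(wet) / len(daily_mm)
    return daily_mean, wet_day_mean, pct_wet_days


# Two hypothetical series with identical daily means but different structure.
few_heavy_days = [0.0, 0.0, 0.0, 20.0]
many_light_days = [5.0, 5.0, 5.0, 5.0]

print(rainfall_characteristics(few_heavy_days))
print(rainfall_characteristics(many_light_days))
```

Both series share the same daily mean, yet they differ sharply in wet-day mean and wet-day occurrence, which is why the daily mean alone may not distinguish between datasets.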

The rainfall datasets used in this analysis (Table 1) include four global and quasi-global gridded (RS and IS) products, and five custom interpolations of the Amazon-Cerrado rain gauge network, containing 942 gauges (Fig. 1(b)) and managed by the Brazilian government water management agency (Agência Nacional de Águas - ANA). The curated IS rainfall and streamflow data used in this analysis are provided in a data package: “Curated rain and flow data for the Brazilian rainforest-savanna transition zone”³⁰. We interpolated each day’s set of reporting rain gauges over a 16 year period, from January 1, 1998 to December 31, 2013, using five interpolation methods ranging from a naïve nearest-neighbor to more sophisticated geostatistical approaches (see Methods).

Table 1 Daily rainfall datasets.

Intercomparison of these products is not straightforward. Point (IS) estimates of rainfall are not directly comparable with gridded (RS) estimates^{31, 32}. Because streamflow responses arise at river-basin scales, we focus here on an intercomparison at spatially averaged river-basin domains, calculating the rainfall characteristics over 89 river basins in the study region (Fig. 1(c)), as well as on a 0.25° resolution grid. Given this focus, and the characteristics of the region and its rainfall, we might expect gridded RS products to be preferred. RS products are often preferred over IS products in regions where low gauge density prohibits high quality interpolation⁹, and the flat, low-altitude, and moderately wet conditions in central and northern Brazil are considered optimal for RS rainfall retrieval^{3, 5, 33, 34}.

Our approach to data selection and quantification of uncertainty involves an initial intercomparison of the rainfall characteristics, at grid and basin scales, across the nine rainfall datasets. In the absence of an independent set of empirical measurements against which to compare the datasets, the resulting range in the rainfall characteristics across datasets provides an ensemble measure of the uncertainty associated with these characteristics, which we measure using the maximum absolute deviation (MAD) and standard deviation across datasets for each statistical measure in each basin. To illustrate how such dataset differences may propagate into subsequent analyses, we compute several hydroclimatic indices or analytical results - the runoff ratio (ratio of annual runoff to rainfall), the evaporation ratio (ratio of annual evapotranspiration to rainfall), the Horton index³⁵ (ratio of evapotranspiration to available soil water), and long-term (inter-annual) trends in daily rainfall, evaluated on monthly timescales for each basin, and again compute MAD and the standard deviation for each basin. The range in these computed indices and trends provides an ensemble description of hydroclimatic uncertainty due to the propagation of data selection uncertainty into these simple analytical outputs.
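As a rough illustration of the two ensemble spread measures, the sketch below computes the MAD (taken here as the largest absolute difference between any two datasets, which equals the maximum minus the minimum) and the standard deviation of one statistic in one basin. The dataset labels and values are hypothetical, and the use of the sample (n − 1) standard deviation is an assumption, since the text does not specify the estimator.

```python
from statistics import stdev


def ensemble_spread(values_by_dataset):
    """Return (MAD, standard deviation) of one basin-level statistic across datasets."""
    values = list(values_by_dataset.values())
    # The largest absolute difference between any pair is max minus min.
    mad = max(values) - min(values)
    # Assumption: sample standard deviation (n - 1 denominator).
    sd = stdev(values)
    return mad, sd


# Hypothetical wet-day mean rainfall (mm/day) for one basin from three datasets.
basin_wet_day_mean = {"dataset_a": 4.0, "dataset_b": 5.0, "dataset_c": 6.0}
print(ensemble_spread(basin_wet_day_mean))
```

Repeating this for each basin and each statistic gives the basin-level distributions of MAD and standard deviation that describe the ensemble uncertainty.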

Having demonstrated that the differences in rainfall characteristics and their propagation into simple analyses are large enough to cause concern, we next attempt to select a rainfall dataset for use in hydrologic studies, by the approach of comparing rainfall datasets to an independently measured, but mechanistically related, environmental variable. In this case, we use streamflow records across 89 river basins to provide such an independent metric. Given the mechanistic connection between streamflow and rainfall, whereby preceding rainfall events drive subsequent streamflow increases, we use measures of time series correspondence or similarity between daily rainfall (at river basin scales) and streamflow for this intercomparison. Specifically, we treat datasets that maximize the correlation between rainfall and streamflow timeseries, and the correspondence of rainfall with streamflow peaks (see Methods), as being the most informative for hydrologic studies.

Results

Rainfall characteristics

Figure 2(a) shows mean daily rainfall over the study period (1998–2013) at individual grid cells for all nine rainfall datasets, demonstrating relative consistency in large-scale spatial patterns and magnitudes of rainfall, although the mean at individual locations can differ substantially. Figure 2(b), however, demonstrates dramatic differences in the representation of wet-day (≥1 mm/day) rainfall. This illustrates that rainfall detection, to which RS data errors are principally attributed³, and representation of extremes, which vary with the level of spatial aggregation and due to interpolation method^{13, 14}, differentiate datasets. Differences across datasets - in spatial patterns and magnitudes - persist across a suite of other statistics (see Supplementary Figures 2–5, which depict grid-cell-level median and wet-day median rainfall depths, maximum and standard deviation of rainfall depths, mean annual total rainfall, and wet-day occurrence of rainfall). Figure 2 shows point-estimates of rainfall properties at individual grid cells. However, we are primarily concerned with observations of area-integrated rainfall, and the remaining results pertain to areal spatial units (either sample areas or river basins, as noted).

Figure 3 shows variations in the same statistics as presented in Fig. 2 - the mean and wet-day mean, as well as the occurrence of wet days, over river basin units of analysis (Supplementary Figure 6 shows basin-level percentiles, mean annual total, standard deviation, and maximum). Again, there is overlap in the mean daily rainfall estimates, but significant variation exists in wet-day mean values across rainfall datasets. These results suggest that the rainfall datasets can be divided into two groups: the first (I) includes the gridded datasets GPCP (RS), CPC (IS), and TRMM (RS), and the nearest-neighbor interpolation VP (IS); and the second group (II) includes the remaining interpolations UKP (IS, RS), UK (IS), OK (IS), and IDW (IS), and the gridded PERSIANN (RS) dataset. Figure 3 shows that group II datasets report a greater number of wet days, but lower mean rainfall on those wet days, relative to group I. The lower mean wet-day rainfall of group II stems from the fact that group I data report more wet-day extremes (see Supplementary Figures 3 and 6), which upwardly bias the mean wet-day rainfall of group I, despite those data showing fewer wet days. While group I wet-day medians are also greater than group II in accordance with wet-day means, group I all-day medians are less than those of group II (see Supplementary Figures 2 and 6). This is due to the combination of greater wet-day occurrence and medium-intensity rainfall (1–10 mm/day) in group II (see Supplementary Figure 7). These differences persist across the range of rain gauge densities in the study region (see Supplementary Figure 7).

Greater wet-day occurrence in group II custom interpolations (UKP, UK, OK, and IDW) likely results from greater rates of local detection of medium intensity rainfall by rain gauges relative to satellite sources, combined with spatial smoothing of those rainfall events. In the case of PERSIANN, elevated wet-day occurrence can be attributed to a combination of the rainfall estimation algorithm and/or incorporation of multiple RS and IS rainfall products that are unique to this RS dataset compared to earlier RS products (GPCP, CPC, TRMM)³⁶. In summary, divergent features of the group I datasets (lower wet-day occurrence and medium intensity rainfall depths, greater extremes), and group II datasets (greater wet-day occurrence and medium intensity rainfall depths, lower extremes) may result in similar mean daily average values across large regions as shown in Fig. 2(a). Thus, there may be consistency across rainfall datasets in analyses relying upon regional mean daily rainfall values (only). However, different datasets will propagate significant uncertainty into analyses relying on estimation of wet-day rainfall occurrence or depths, quantiles, and extremes. These findings are consistent with previous assessments of interpolated and gridded environmental data^{13, 14, 37}.

Calculation of the maximum absolute deviation (MAD) between any two datasets’ area-average rainfall (averages over areas of the 0.25° grid) provides a simple quantification of dataset divergence and thus the range of the data ensemble. We calculated 1998–2013 MAD at daily, monthly, and annual time scales at 100 regularly-sampled locations, for areas ranging from large to small (circles with radii of 200 km and 10 km, centered at the same 100 locations). This sample design accounts for the fact that different regions, and differently-sized sample units, have different rain gauge densities. At a daily resolution, the mean (median) MAD between any two datasets’ area-average rainfall is 7–12 mm (5–8 mm); at a monthly resolution, it is 56–97 mm (46–82 mm); at an annual resolution, it is 372–576 mm (310–497 mm). The ranges are from statistics calculated for the large to small sample units, respectively. These differences are comparable to global climate model (GCM) biases: projections from the Coupled Model Intercomparison Projects Phase 5 (CMIP5)³⁸ have annual biases relative to a single rainfall data product of −25% (approximately −250 to −550 mm/year) in northern Brazil³⁹, indicating that selection of a different rainfall dataset for reference has the capacity (at an extreme) to either eliminate or double the estimated model bias.

Trends and Hydroclimate Indices

Evaluation of hydroclimatic indices and temporal rainfall trends demonstrates the propagation of rainfall data selection uncertainty into a standard analysis. Although temporal trend analysis is not especially meaningful over a 16-year time period, it demonstrates the potential for trend detection and attribution to be amplified or eliminated by data uncertainty. We calculated monotonic trend slopes (corrected for monthly correlation) and associated p-values for total rainfall by month for all 89 river basins in the study region over 1998–2013 (see Methods). Variation in the estimated trend slopes for basins where at least one rainfall dataset had a statistically significant trend is shown in Fig. 4. Trend slopes, particularly for basins in the north of the study region where rain gauges are especially sparse, do not agree across rainfall datasets. Rainfall datasets agree on the sign of the slope in only eight of the 24 basins (four basins with all positive slopes, and four basins with all negative slopes).
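The sign-agreement check can be sketched as follows. This section does not name the trend estimator, so the sketch swaps in Sen's slope (the median of all pairwise slopes), a common monotonic trend estimator, and omits both the correction for monthly correlation and the p-values. The function names and rainfall totals are hypothetical.

```python
from itertools import combinations
from statistics import median


def sen_slope(years, values):
    """Sen's slope: median of the slopes between all pairs of observations."""
    slopes = [
        (v2 - v1) / (t2 - t1)
        for (t1, v1), (t2, v2) in combinations(zip(years, values), 2)
        if t2 != t1
    ]
    return median(slopes)


def signs_agree(slopes):
    """True when every dataset gives the same (non-zero) trend direction."""
    return all(s > 0 for s in slopes) or all(s < 0 for s in slopes)


# Hypothetical January rainfall totals (mm) for one basin from two datasets.
years = [1998, 1999, 2000, 2001]
dataset_a = [100.0, 110.0, 120.0, 130.0]
dataset_b = [100.0, 95.0, 105.0, 90.0]

slopes = [sen_slope(years, dataset_a), sen_slope(years, dataset_b)]
print(slopes, signs_agree(slopes))
```

In this made-up basin the two datasets imply opposite trend directions, which is the kind of disagreement reported for most of the basins in Fig. 4.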

The propagation of rainfall data selection uncertainty is further illustrated by hydroclimatic index measurements made using the different rainfall datasets. Hydroclimatic indices provide information on the relationships between climate, land use, and hydrology, which are critical to the examination of land use and climate change^{35, 40}. They are estimated using both rainfall and streamflow at river basin scales (see Methods). The runoff ratio is the fraction of rainfall discharged from a river basin as streamflow (as opposed to evaporated or transpired at the land surface, or percolated to deep groundwater); the evaporation ratio complements the runoff ratio - it is the fraction of rainfall evapotranspired (as opposed to discharged or percolated); the Horton index compares evapotranspiration to soil water stores (as opposed to total rainfall). The mean (median) maximum absolute deviation between basin-level index values generated using any of the nine rainfall datasets (see Fig. 5(a)) is 0.05 (0.04) for the evaporation ratio and Horton index, and 0.06 (0.04) for the runoff ratio; the difference exceeds 0.25 - a quarter of the entire index range - for some basins. Similarly, the mean (median) standard deviation of basin-level index values (see Fig. 5(b)) is 0.02 (0.01) for all three indices; and can exceed 0.05 in some basins. Streamflow data is the same for all calculations within each basin, so these results demonstrate the sensitivity of basin-scale analyses to rainfall input data alone.
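To make the sensitivity concrete, the sketch below computes the runoff ratio and evaporation ratio for one basin with fixed streamflow and two different rainfall inputs. It assumes a simple long-term water balance in which evapotranspiration equals rainfall minus runoff (neglecting storage change and deep percolation), which may differ from the procedure in the Methods. The Horton index is omitted because it requires a soil water estimate not described in this section. All numbers are hypothetical.

```python
def runoff_ratio(annual_runoff_mm, annual_rain_mm):
    """Fraction of annual rainfall leaving the basin as streamflow."""
    return annual_runoff_mm / annual_rain_mm


def evaporation_ratio(annual_runoff_mm, annual_rain_mm):
    """Fraction of annual rainfall evapotranspired.

    Assumption: ET = P - Q (storage change and deep percolation neglected).
    """
    return (annual_rain_mm - annual_runoff_mm) / annual_rain_mm


# Same hypothetical streamflow, two hypothetical rainfall datasets.
runoff_mm = 500.0
for rain_mm in (1500.0, 2000.0):
    print(rain_mm, runoff_ratio(runoff_mm, rain_mm), evaporation_ratio(runoff_mm, rain_mm))
```

Because streamflow is held fixed, any difference in the printed ratios comes from the choice of rainfall input alone.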

In the absence of information on a ‘best’ rainfall data source, and knowing that data selection uncertainty will propagate into analyses as demonstrated in Figs 4 and 5, distributions of index values obtained from multiple rainfall datasets can be used to quantify data selection uncertainty. For example, the mean of the standard deviations across all basins for a given index (e.g. mean of values shown in each panel of Fig. 5(b)) may be treated as an index- and region-specific standard deviation (s) attributable to rainfall data selection uncertainty. According to our analysis, in rainforest-savanna transitional Brazil, s is approximately 0.02 for all three indices. A straightforward confidence interval for the mean of index values obtained using the nine rainfall datasets over an individual basin (\(\bar{x}\)) in our study region is: \(CI=\bar{x}\pm z^{\ast}\,SE_{\bar{x}}\), where \(SE_{\bar{x}}=s/\sqrt{89}\approx 0.002\) (see Supplementary Discussion for further details).
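The interval can be evaluated numerically as in the sketch below, which uses s = 0.02 and the divisor of 89 from the text. The 95% confidence level (z* ≈ 1.96) and the basin mean index value of 0.30 are assumptions chosen for illustration only.

```python
from math import sqrt
from statistics import NormalDist


def index_confidence_interval(x_bar, s=0.02, n=89, level=0.95):
    """Return (lower, upper, standard error) for CI = x_bar +/- z* x SE."""
    se = s / sqrt(n)  # SE = s / sqrt(89) with the defaults from the text
    z_star = NormalDist().inv_cdf(0.5 + level / 2)  # two-sided critical value
    return x_bar - z_star * se, x_bar + z_star * se, se


# Hypothetical basin-mean runoff ratio of 0.30.
lower, upper, se = index_confidence_interval(0.30)
print(round(se, 4), round(lower, 4), round(upper, 4))
```

Under these assumptions the interval half-width is roughly 0.004, i.e. about twice the standard error.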

Rainfall and Streamflow Correspondence

Figures 2, 3, 4 and 5 demonstrate the need for a procedure to guide rainfall data choice prior to conducting analyses. We build on the precedent for evaluating rainfall data quality using the correspondence between rainfall and river flow^{16, 33, 41} by measuring the empirical correspondence between rainfall and streamflow records using two performance statistics: non-parametric Spearman’s rank correlation, and peak correspondence - the rate at which distinct rainfall peaks correspond to distinct flow peaks within a basin-specific response time window (see Methods). Streamflow rises and peaks in unregulated, rain-fed rivers are caused by preceding rainfall events in the rivers’ catchment, so the correspondence between appropriately-lagged and basin-integrated rainfall, and basin streamflow, measures a rainfall dataset’s ability to capture area-integrated rainfall patterns.
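The first performance statistic can be sketched as a lagged Spearman rank correlation between basin rainfall and streamflow. Only the correlation statistic is shown; peak correspondence depends on details given in the Methods. The lag parameter, function names, and example series are hypothetical, and tied values receive average ranks.

```python
from math import sqrt
from statistics import mean


def average_ranks(xs):
    """1-based ranks, with tied values sharing the mean of their positions."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    ranks = [0.0] * len(xs)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and xs[order[j + 1]] == xs[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))


def lagged_spearman(rain, flow, lag_days):
    """Spearman correlation of rain on day t with flow on day t + lag_days."""
    rain_part = rain[: len(rain) - lag_days]
    flow_part = flow[lag_days:]
    return pearson(average_ranks(rain_part), average_ranks(flow_part))


# Hypothetical basin where flow peaks one day after rain.
rain = [0.0, 10.0, 0.0, 0.0, 5.0, 0.0, 0.0]
flow = [1.0, 1.0, 6.0, 2.0, 1.0, 4.0, 2.0]
print(lagged_spearman(rain, flow, 0), lagged_spearman(rain, flow, 1))
```

Applying the same lag to every candidate rainfall dataset in a basin and comparing the resulting correlations is one way to rank the datasets against the common streamflow record.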

In validation tests, rainfall data from seven Australian river basins were randomly perturbed using additive noise, and true and perturbed rainfall datasets were evaluated relative to streamflow using the performance statistics. Both performance statistics identify the correct rainfall dataset 100% of the time when the random noise is equivalent to or greater than basin rainfall standard deviation. In cases where random noise is less than or equal to half the basin rainfall standard deviation (when differences between datasets are small), correlation still identifies the correct rainfall dataset 100% of the time; however, peak correspondence identifies the correct dataset on average (across the seven test basins) 79% of the time or less (see Supplementary Discussion for details). Specifically, peak correspondence performs perfectly (100% correct identification) in some basins, but not others, when the signal to noise ratio is low. This is likely due to peak correspondence’s reliance on quick (storm) runoff response signatures (see Methods), which may vary in quality across different basins. In the study region, the 1998–2013 average maximum absolute deviation (MAD) between any two rainfall datasets on a daily time scale is 7–12 mm; the range of grid cell-level (temporal) standard deviations averaged across the study region for each individual rainfall dataset is 7–13 mm. Thus, in the study region, individual rainfall dataset variation is on the order of variation between datasets, indicating that peak correspondence will perform as well or nearly as well as correlation in identifying datasets with greatest correspondence to flow. This is confirmed by the similarity in results from both statistics for the Brazilian data.

Figure 6 presents distributions of the performance statistics in 89 river basins (panel a), and illustrates the sensitivity of the performance statistics to rain gauge density within the river basin (panel b). The better the performance of the dataset, the farther to the right are the masses of the distributions in (a), and the higher the curves are in (b). We found that custom interpolations of IS data using IDW and kriging (UKP, UK, and OK) out-performed the gridded datasets for both performance statistics, with IDW performing best overall. In agreement with these results, equivalent or superior performance of the IDW method relative to other interpolations including kriging and VP, specifically for hydroclimatological applications, has been observed in other regions as well^{9, 42}. The best performing gridded dataset is PERSIANN, whose statistics in the study region more closely resemble those of custom interpolated datasets than other gridded products. The differences between performance statistic distributions are statistically significant (as evaluated by non-parametric two-sample Kolmogorov-Smirnov tests, see Supplementary Table 1), consistent across gauge densities (as illustrated in Fig. 6(b)), as well as consistent across location (as indicated by latitude) and basin size (see Supplementary Figure 8), and season (see Supplementary Figure 9). The rain gauge densities in (b) are 1998–2013 averages of basin-area daily densities according to the IS data; they do not directly pertain to gridded datasets, but they are indicative of gridded dataset input gauge densities because gridded product source data (used directly, or for calibration) also comes from Brazilian government agency sources.

Discussion

The ‘data selection uncertainty’ problem identified here is similar to the ‘GIGO’ (garbage in, garbage out) problem in modeling, but applied to regional data analysis. Although the need to base analyses and interpretation on high quality data appears self-evident, the inability to directly observe the true spatial process of interest at regional scales, and thus to a priori discriminate between a wide array of available or self-generated regional data products, means that regional data selection is not trivial. Instead, it should motivate environmental scientists to consider the state of practice in the field, with respect to the use of, and confidence placed in, regional climatic data products. For example, in Northern Brazil, where we have identified significant and meaningful differences between rainfall datasets, a wide range of studies draw inference about historical climate patterns and trends⁴³, drought²³, the effects of land use change on hydrology^{44, 45}, and relationships between hydroclimate and agriculture^{46, 47}, without confronting data selection uncertainty. Our analyses suggest that the conclusions of these studies must be treated with caution, as the magnitudes of difference or trends within data products may be comparable to the magnitudes of difference between data products. Several studies in the region do explicitly address data selection uncertainty: by correlating rainfall and streamflow datasets and selecting the rainfall product with the greatest correspondence⁴⁸, and by demonstrating that multiple rainfall products would generate similar results⁴⁹. Overall, however, data selection uncertainty remains inconsistently acknowledged and unaccounted for by practitioners.

The empirical time series and signal-processing methodology used here (i.e. performance statistics) offers an approach to evaluate rainfall data quality for hydrological purposes across multiple river basins and at large spatial scales, and is arguably an improvement on the state of practice for regional hydrology. Traditional rainfall data error estimation frameworks infer rainfall data quality at points using cross-validation methods, or over river basin areas based on runoff predictions made via a model^{33, 41}. Point-scale evaluations do not address areal-scale data quality, and at regional scales - i.e. the 89 basin region in this study - a model-based approach would require 89 separate runoff model calibration/validation procedures, and would not generate results that are comparable between basins because the calibration error would be unique for each basin^{10, 16}. Furthermore, the attribution of prediction error to calibration would be confounded with rainfall data input uncertainty. Lastly, the quality and reliability of rainfall-runoff model prediction relies on input stationarity^{50, 51}, which is not guaranteed in the study region due to climate and land use change. Thus, model-free approaches are desirable. Our empirical approach capitalizes on the relationships between variables (rainfall, streamflow) rather than on their exact values to evaluate rainfall dataset quality at basin scales. This method complements standard model-based evaluation, but is scalable and generalizable over large regions that challenge the use of models.

While it was possible to identify a best performing rainfall dataset based on streamflow correlation in this region, the results are likely to be site specific and specific to applications in which comparing rainfall signals to streamflow signals offers an appropriate test of quality. Evaluations should be made separately for new study areas, and potentially by comparison to reference datasets other than streamflow for different study purposes. For example, streamflow intercomparisons would not necessarily inform the suitability of a rainfall dataset for surface soil moisture estimation purposes, as would microwave remote sensing data. Similarly, interpolation methods such as UK (which can control for elevation) would likely improve upon IDW in mountainous areas. The differences between the datasets’ performance statistics were reduced when data were aggregated or smoothed over time, consistent with previous studies that have shown RS data to correspond well to IS data with greater temporal aggregation^{52, 53}. Thus, at coarser temporal resolutions (monthly, annual), convenient gridded products remain attractive.

Critical climate change adaptation decisions are likely to derive from the understanding of emerging trends and variability in regional rainfall estimates. These results highlight the often-unacknowledged problem of ‘data-selection uncertainty’ in the detection and attribution of environmental change^{37, 54}, and demonstrate a need for increased effort in quantifying this uncertainty and justifying data choice because analysts may reach divergent understandings due to data selection alone⁵⁵. Identifying the often weak signals of change in noisy datasets is challenging, but analysts can reduce the uncertainty derived from data choice by (i) justifying dataset choices using selection methods such as the performance statistics demonstrated here, and/or (ii) including estimates of data-selection uncertainty (e.g. confidence intervals) in their findings. Evaluation of rainfall data prior to hydroclimatological analysis is both feasible (if streamflow records are available) and necessary. In contrast to the use of climate model outputs in analyses - where characterization of an ensemble of equally uncertain projections is best practice - if an individual dataset corresponds more closely with a reference of choice (e.g. streamflow) than other datasets, that dataset should be used for analysis.

Methods

Data

Gridded datasets include: Global Precipitation Climatology Project (GPCP) Version 1.2⁵⁶; Climate Prediction Center (CPC) Unified Gauge-Based Analysis of Global Daily Precipitation Version 1 and RT data^{26, 57}; Tropical Rainfall Measuring Mission (TRMM) 3B42 Version 7⁵⁸; and Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks - Climate Data Record (PERSIANN-CDR) Version 1.1^{36, 59}. GPCP, TRMM, and PERSIANN were acquired from public repositories via the IRI/LDEO Climate Data Library⁶⁰ and CPC from the raincpc R package⁶¹. IS rainfall, streamflow, and geographic information systems (GIS) data were acquired from the Agência Nacional de Águas (ANA), and reservoir locations (used to select only unregulated river basins for analysis) from the Agência Nacional de Energia Elétrica (ANEEL). Of a total of 1,171 usable rain gauges in the study region, 942 were active (for varying durations) during the study period, and were used for analysis. Daily streamflow data were obtained for basins fully contained in the study region, gauged for at least a year, and with <10% of their area impacted by reservoirs. Analysis was based on 89 basins that met data quality criteria and overlapped with the interpolated rainfall region. All IS rainfall, streamflow, GIS data, and comprehensive documentation on data acquisition and quality assurance/quality control are provided in the “Curated rain and flow data for the Brazilian rainforest-savanna transition zone” data package³⁰. Interpolated rainfall data are available upon request from the corresponding author. (See the Supplementary Discussion for further details on the rainfall data.)

We did not manipulate the spatial resolution of the daily rainfall datasets (all are obtained or generated at 0.25°, except for two sources at 0.5° and 1° resolution - see Table 1), nor did we compare the representation of rainfall based on spatial resolution. (We did, however, briefly explore areal rainfall differences across rain gauge densities with respect to the known gauge densities of interpolated IS data - see Supplementary Figure 7.) Typically, to compare rainfall datasets, one would aggregate (or disaggregate) them to a common grid using a method that conserves the total amount of rainfall in an area. Effectively, this study aggregates total daily rainfall to river basin units without modifying the original input data. This is done using a grid cell area-weighted mean of all cells located within a basin area, which provides an unmodified representation of each dataset’s area-integrated rainfall over multiple basin scales that is both conservative and representative of the practical needs of hydrologists and hydroclimatologists.
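The basin aggregation described above can be sketched as follows. This is a minimal illustration in Python, not the authors' code (their analyses were carried out in R). The function names, the `(latitude, rainfall)` input layout, and the use of spherical cell areas as weights are assumptions made for the example; the source states only that a grid cell area-weighted mean was used.

```python
import math

def cell_area_weight(lat_deg, res_deg):
    """Relative area of a lat-lon grid cell centred at lat_deg.

    On a sphere, cell area is proportional to the longitude width times the
    difference of the sines of the cell's northern and southern edges.
    The constant Earth-radius factor is dropped because it cancels in a mean.
    """
    half = res_deg / 2.0
    north = math.radians(lat_deg + half)
    south = math.radians(lat_deg - half)
    return (math.sin(north) - math.sin(south)) * math.radians(res_deg)

def basin_mean_rainfall(cells, res_deg=0.25):
    """Area-weighted mean daily rainfall (mm/day) over one basin.

    cells: iterable of (lat_deg, rain_mm_per_day) for grid cells inside the
    basin (hypothetical layout). Cells with rain=None are skipped.
    """
    num = 0.0
    den = 0.0
    for lat, rain in cells:
        if rain is None:
            continue
        w = cell_area_weight(lat, res_deg)
        num += w * rain
        den += w
    if den == 0.0:
        raise ValueError("no valid cells in basin")
    return num / den

# Hypothetical basin made of three 0.25-degree cells, one with missing data.
cells = [(-10.125, 4.0), (-10.375, 6.0), (-10.625, None)]
print(basin_mean_rainfall(cells))
```

Because each dataset is averaged on its own native grid, the same function can be applied to 0.25°, 0.5° and 1° products by changing `res_deg`, without regridding the inputs.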

Interpolation

We used four common and well-documented interpolation techniques^{8, 15, 41, 42}: Voronoi (or Thiessen) Polygons (VP)⁶²; Inverse-Distance Weighting (IDW)⁶³; and Ordinary and Universal Kriging (OK, UK)^{64, 65}. All interpolations were done on a 0.25° resolution grid. IDW and OK are local interpolations, for which we set the maximum interpolation distance (radius) to 300 km, an upper bound on the estimated mean rainfall correlation distances in this region, which ranged from 100 km for IS data to 300 km for RS data. Rainfall correlation distances were estimated by fitting a semivariogram model⁶⁵ to data from a random sample of 1,500 individual days (approximately 1/4 of the days in the full date range) and extracting the semivariogram range estimate for each day. UK and UKP are ‘universal’ interpolations for the study region; their predictions rely on relationships established between predictor variables across the entire study region. Kriging methods can produce negative values, which were set to zero. To avoid edge effects, the grid on which rainfall was interpolated is inset from the study region boundary by 100 km (the minimum mean correlation distance). For additional details on interpolation methods, see the Supplementary Discussion.
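As an illustration of a local interpolation with a maximum search radius, the following Python sketch implements a basic IDW estimator. It is not the authors' implementation: the distance exponent (`power=2`), the great-circle distance, and all function names are assumptions, since the source specifies only the 300 km radius. The `clip_negative` helper mirrors the stated rule that negative kriging predictions were set to zero.

```python
import math

EARTH_RADIUS_KM = 6371.0

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometres."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def idw(target, gauges, max_radius_km=300.0, power=2.0):
    """Inverse-distance-weighted rainfall at target = (lat, lon).

    gauges: list of (lat, lon, rain_mm) for one day (hypothetical layout).
    Only gauges within max_radius_km contribute. Returns None when no gauge
    lies inside the radius. power=2 is an assumed, commonly used exponent.
    """
    num = 0.0
    den = 0.0
    for lat, lon, rain in gauges:
        d = haversine_km(target[0], target[1], lat, lon)
        if d > max_radius_km:
            continue
        if d == 0.0:
            return rain  # target coincides with a gauge
        w = 1.0 / d ** power
        num += w * rain
        den += w
    return num / den if den > 0.0 else None

def clip_negative(value):
    """Set negative interpolated rainfall (possible with kriging) to zero."""
    return max(value, 0.0)

# Hypothetical gauges: two near the target, one far outside the 300 km radius.
gauges = [(-10.0, -51.0, 2.0), (-10.0, -49.0, 6.0), (-10.0, -40.0, 50.0)]
print(idw((-10.0, -50.0), gauges))
```

Returning `None` outside the radius leaves cells without nearby gauges unfilled rather than extrapolating, which is one reasonable way to respect the correlation-distance bound described above.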

Trends and Hydroclimate Indices

For trend analyses, we used the non-parametric Seasonal Kendall test for monotonic trends in monthly total rainfall with correction for correlation between monthly blocks, and estimated the slope of the trend using the SK slope estimator^{66, 67}; these are seasonally-adjusted modifications of the widely-used Mann-Kendall test⁶⁸ and Theil-Sen’s slope estimator^{69, 70} that are targeted to hydrological time series.
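A simplified sketch of the Seasonal Kendall statistic and slope estimator is given below, in Python. It computes the Mann-Kendall S statistic within each calendar month and sums across months, and takes the median of all within-month pairwise slopes. It deliberately omits the significance test and the correction for correlation between months that the authors applied, so it should be read as an illustration of the estimator's structure only. The input layout and function names are assumptions.

```python
from statistics import median

def sign(v):
    """Return -1, 0 or 1 according to the sign of v."""
    return (v > 0) - (v < 0)

def seasonal_kendall(monthly):
    """Seasonal Kendall S statistic and slope (units per year).

    monthly: dict mapping month number -> list of (year, value) pairs
    (hypothetical layout). Values of None are ignored. Pairs are only
    formed within the same month, which removes the seasonal cycle.
    """
    s_total = 0
    slopes = []
    for series in monthly.values():
        pts = sorted((y, v) for y, v in series if v is not None)
        for i in range(len(pts) - 1):
            for j in range(i + 1, len(pts)):
                (yi, vi), (yj, vj) = pts[i], pts[j]
                if yj == yi:
                    continue  # skip duplicate years
                s_total += sign(vj - vi)
                slopes.append((vj - vi) / (yj - yi))
    if not slopes:
        raise ValueError("need at least two years in at least one month")
    return s_total, median(slopes)

# Hypothetical monthly rainfall totals (mm) for two months over three years.
monthly = {
    1: [(1998, 100.0), (1999, 102.0), (2000, 104.0)],
    7: [(1998, 10.0), (1999, 12.0), (2000, 14.0)],
}
print(seasonal_kendall(monthly))
```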

Index values were calculated for each water year (October-September) in each basin, using river basin area average daily rainfall depths (mm/day) from all nine rainfall datasets, and streamflow depths (mm/day, which are basin-area normalized volumetric flow rates) at river basin scales. The index value recorded for an individual basin and rainfall dataset combination is the average of the annual index values for that basin-dataset combination (there are 15 complete water years between 1998–2013). The runoff ratio (RR) is the simple ratio of total annual (water year) streamflow (Q) to total annual rainfall (P): RR = Q/P. Similarly, the evaporation ratio (ER) is the simple ratio of total annual (water year) evapotranspiration (ET = P − Q) to total annual rainfall (P): ER = ET/P. (Note that in these computations we assumed no deep percolation.) Lastly, the Horton index (HI) is the ratio of evapotranspiration (ET) to available soil water (W): HI = ET/W, where soil water W = P − Q_q, and Q_q is the direct runoff (quickflow) component of total flow (Q). W is equivalent to the sum of baseflow and ET, that is, the total amount of water accessible to vegetation. Total flow was separated into baseflow and quickflow using a Lyne-Hollick recursive digital baseflow filter (three-pass, default parameter of 0.975)⁷¹. The Horton index is intended to be calculated over a growing season³⁵; however, growing seasons vary across the river basins in this analysis, and many are year-round, hence the use of annual data.
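The index definitions above can be illustrated with a short Python sketch that separates baseflow with a recursive digital filter and then computes RR, ER and HI for one water year. This is a hedged illustration rather than the authors' code: the filter follows a common formulation of the Lyne-Hollick recursion (forward, backward, forward passes), the initial condition (no quickflow on the first step) is an assumption, and the function names are hypothetical. The filter parameter of 0.975 is the value quoted in the text.

```python
def _filter_pass(flow, alpha):
    """One pass of the recursive filter; returns the baseflow series."""
    quick_prev = 0.0  # assumption: no quickflow on the first time step
    base = [flow[0]]
    for t in range(1, len(flow)):
        quick = alpha * quick_prev + 0.5 * (1.0 + alpha) * (flow[t] - flow[t - 1])
        quick = min(max(quick, 0.0), flow[t])  # keep 0 <= quickflow <= flow
        base.append(flow[t] - quick)
        quick_prev = quick
    return base

def lyne_hollick_baseflow(flow, alpha=0.975, passes=3):
    """Baseflow from daily flow depths using alternating-direction passes."""
    if not flow:
        raise ValueError("flow series is empty")
    base = list(flow)
    for p in range(passes):
        backward = (p % 2 == 1)  # passes run forward, backward, forward
        if backward:
            base.reverse()
        base = _filter_pass(base, alpha)
        if backward:
            base.reverse()
    return base

def annual_indices(rain, flow, alpha=0.975):
    """Runoff ratio, evaporation ratio and Horton index for one water year.

    rain, flow: daily depths (mm/day) over the same water year.
    Assumes no deep percolation, so ET = P - Q.
    """
    base = lyne_hollick_baseflow(flow, alpha)
    P = sum(rain)
    Q = sum(flow)
    Qq = Q - sum(base)   # quickflow (direct runoff) total
    ET = P - Q
    W = P - Qq           # available soil water = baseflow + ET
    return {"RR": Q / P, "ER": ET / P, "HI": ET / W}

# Hypothetical ten-day series, used here only to exercise the functions.
rain = [0.0, 20.0, 35.0, 5.0, 0.0, 0.0, 10.0, 0.0, 0.0, 0.0]
flow = [1.0, 1.0, 5.0, 3.0, 2.0, 1.5, 1.2, 1.0, 1.0, 1.0]
print(annual_indices(rain, flow))
```

In a full analysis these indices would be computed per water year and then averaged for each basin-dataset combination, as described above.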

Performance statistics

Volumetric streamflow records were area-normalized and separated into baseflow and quickflow (direct runoff) using a Lyne-Hollick recursive digital baseflow filter (three-pass, default parameter of 0.975)⁷¹; the quickflow component can be more directly compared to rainfall. Both rainfall and quickflow time series were normalized to between 0 and 1. We identified the lag timescale (τ) that maximized the cross-correlation of rainfall and quickflow (the basin response timescale in units of days) for each basin, and lagged rainfall by τ for analysis of correlation and peak correspondence.
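The normalization and lag-selection steps can be sketched in Python as follows. The sketch assumes min-max scaling for the normalization to [0, 1], a Pearson cross-correlation, and a hypothetical search limit `max_lag`; none of these details are specified in the text, and the function names are illustrative.

```python
import math

def minmax(x):
    """Rescale a series linearly to the range [0, 1]."""
    lo, hi = min(x), max(x)
    if hi == lo:
        return [0.0] * len(x)
    return [(v - lo) / (hi - lo) for v in x]

def pearson(a, b):
    """Pearson correlation of two equal-length series (0.0 if either is constant)."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    if va == 0.0 or vb == 0.0:
        return 0.0
    return cov / math.sqrt(va * vb)

def best_lag(rain, quick, max_lag=30):
    """Lag tau (days) maximizing the correlation of rain[t] with quick[t + tau].

    max_lag is a hypothetical search limit, not a value from the source.
    """
    best_tau, best_r = 0, float("-inf")
    for tau in range(0, max_lag + 1):
        a = rain[:len(rain) - tau]
        b = quick[tau:]
        if len(a) < 3:
            break
        r = pearson(a, b)
        if r > best_r:
            best_tau, best_r = tau, r
    return best_tau, best_r

def lag_series(rain, tau):
    """Shift rainfall forward by tau days; the first tau entries become None."""
    return [None] * tau + rain[:len(rain) - tau]

# Hypothetical example: quickflow is rainfall delayed by two days.
rain = minmax([0, 5, 0, 0, 3, 0, 0, 8, 0, 0, 0, 1, 0, 0])
quick = [0.0, 0.0] + rain[:-2]
tau, r = best_lag(rain, quick, max_lag=5)
print(tau, lag_series(rain, tau)[:4])
```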

With respect to peak correspondence: we classified peaks in the normalized and lag-aligned rainfall and quickflow data by determining the position of peak extrema (observations that are preceded and followed by lower observations), as well as probabilities associated with peaks⁷². The probability associated with a peak quantifies the distinctness of the peak: more significant peaks are those surrounded by several lower observations. Peaks with lower probabilities are those that contain more information according to Kendall’s information theory^{72, 73}. We call peaks with probabilities <0.05 ‘distinct’ (due to autocorrelation in the rainfall and flow time series, this is not a measure of statistical significance, but may nevertheless be used to distinguish more and less distinct peaks). ‘Peak correspondence’ is the rate at which distinct peaks in lagged rainfall match those in streamflow over a basin-specific response time window equivalent to 1/4 × τ (minimum = 1 day) (see Supplementary Figure 10). Correlation between the lagged rainfall and quickflow was assessed using non-parametric Spearman’s rank correlation^{74, 75}. For more details, see the Supplementary Discussion.
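The two performance statistics can be illustrated with the Python sketch below. It is a simplification under stated assumptions: peaks are detected as simple local maxima, and the probability-based filter for ‘distinct’ peaks (probability < 0.05) is not reproduced; the matching window is treated as symmetric around each rainfall peak and is rounded to whole days. Function names are hypothetical. Spearman's correlation is computed as the Pearson correlation of average ranks.

```python
import math

def find_peaks(x):
    """Indices of observations preceded and followed by lower observations."""
    return [i for i in range(1, len(x) - 1) if x[i - 1] < x[i] > x[i + 1]]

def response_window(tau):
    """Window of 1/4 x tau days, minimum 1 day (rounding is an assumption)."""
    return max(1, round(tau / 4))

def peak_correspondence(rain_peaks, flow_peaks, window):
    """Fraction of rainfall peaks with a flow peak within +/- window days.

    The symmetric window is an assumption made for this sketch.
    """
    if not rain_peaks:
        return float("nan")
    hits = sum(1 for p in rain_peaks
               if any(abs(p - q) <= window for q in flow_peaks))
    return hits / len(rain_peaks)

def ranks(x):
    """Ranks starting at 1, with tied values given their average rank."""
    order = sorted(range(len(x)), key=lambda i: x[i])
    r = [0.0] * len(x)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and x[order[j + 1]] == x[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            r[order[k]] = avg
        i = j + 1
    return r

def spearman(a, b):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    ra, rb = ranks(a), ranks(b)
    n = len(ra)
    ma, mb = sum(ra) / n, sum(rb) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(ra, rb))
    va = sum((x - ma) ** 2 for x in ra)
    vb = sum((y - mb) ** 2 for y in rb)
    return cov / math.sqrt(va * vb)

# Hypothetical lag-aligned, normalized series.
rain = [0.0, 0.6, 0.1, 0.0, 1.0, 0.2, 0.0, 0.3, 0.0]
flow = [0.0, 0.2, 0.5, 0.1, 0.4, 1.0, 0.3, 0.1, 0.0]
win = response_window(4)
print(peak_correspondence(find_peaks(rain), find_peaks(flow), win))
print(spearman(rain, flow))
```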

Code availability and computational tools

Code is available upon request from the corresponding author. We carried out all analyses and generated all figures in the R programming environment⁷⁶ (Version 3; distributed via the Comprehensive R Archive Network, CRAN) on both Apple and Windows operating systems. See the Supplementary Discussion for a list of the software packages used.

References

Milly, P. C. D. et al. Stationarity Is Dead: Whither Water Management? Science 319, 573–574 (2008).
Vörösmarty, C. J. et al. Global threats to human water security and river biodiversity. Nature 467, 555–561 (2010).
Ebert, E. E., Janowiak, J. E. & Kidd, C. Comparison of near-real-time precipitation estimates from satellite observations and numerical models. Bulletin of the American Meteorological Society 88, 47–64 (2007).
Gehne, M., Hamill, T. M., Kiladis, G. N. & Trenberth, K. E. Comparison of global precipitation estimates across a range of temporal and spatial scales. Journal of Climate 29, 7773–7795 (2016).
Maggioni, V., Meyers, P. C. & Robinson, M. D. A Review of Merged High-Resolution Satellite Precipitation Product Accuracy during the Tropical Rainfall Measuring Mission (TRMM) Era. Journal of Hydrometeorology 17, 1101–1117 (2016).
Kalognomou, E.-A. et al. A Diagnostic Evaluation of Precipitation in CORDEX Models over Southern Africa. Journal of Climate 26, 9477–9506 (2013).
Hughes, D. & Slaughter, A. Daily disaggregation of simulated monthly flows using different rainfall datasets in southern Africa. Journal of Hydrology: Regional Studies 4(Part B), 153–171 (2015).
Degre, A., Ly, S. & Charles, C. Different methods for spatial interpolation of rainfall data for operational hydrology and hydrological modeling at watershed scale: a review. Biotechnology, Agronomy, Society and Environment 17, 392–406 (2013).
Girons Lopez, M., Wennerström, H., Nordén, L.-Å. & Seibert, J. Location and Density of Rain Gauges for the Estimation of Spatial Varying Precipitation. Geografiska Annaler: Series A, Physical Geography 97, 167–179 (2015).
Kampf, S. K. & Burges, S. J. Quantifying the water balance in a planar hillslope plot: Effects of measurement errors on flow prediction. Journal of Hydrology 380, 191–202 (2010).
Sieck, L. C., Burges, S. J. & Steiner, M. Challenges in obtaining reliable measurements of point rainfall. Water Resources Research 43, doi:10.1029/2005WR004519 (2007).
Karimi, P. & Bastiaanssen, W. G. M. Spatial evapotranspiration, rainfall and land use data in water accounting – part 1: Review of the accuracy of the remote sensing data. Hydrology and Earth System Sciences 19, 507–532 (2015).
Hofstra, N., New, M. & McSweeney, C. The influence of interpolation and station network density on the distributions and trends of climate variables in gridded daily data. Climate Dynamics 35, 841–858 (2010).
Lundquist, J. D. et al. High-Elevation Precipitation Patterns: Using Snow Measurements to Assess Daily Gridded Datasets across the Sierra Nevada, California. Journal of Hydrometeorology 16, 1773–1792 (2015).
Mair, A. & Fares, A. Comparison of rainfall interpolation methods in a mountainous region of a tropical island. Journal of Hydrologic Engineering 16, 371–383 (2011).
Su, F., Hong, Y. & Lettenmaier, D. P. Evaluation of TRMM Multisatellite Precipitation Analysis (TMPA) and Its Utility in Hydrologic Prediction in the La Plata Basin. Journal of Hydrometeorology 9, 622–640 (2008).
Massonnet, F., Bellprat, O., Guemas, V. & Doblas-Reyes, F. J. Using climate models to estimate the quality of global observational data sets. Science 354, 452–455 (2016).
Lima, L. S. et al. Feedbacks between deforestation, climate, and hydrology in the southwestern Amazon: implications for the provision of ecosystem services. Landscape Ecology 29, 261–274 (2014).
Llopart, M., Coppola, E., Giorgi, F., Rocha, R. Pd & Cuadra, S. V. Climate change impact on precipitation for the Amazon and La Plata basins. Climatic Change 125, 111–125 (2014).
Oliveira, L. J. C., Costa, M. H., Soares-Filho, B. S. & Coe, M. T. Large-scale expansion of agriculture in Amazonia may be a no-win scenario. Environmental Research Letters 8, 024021 (2013).
Stickler, C. M. et al. Dependence of hydropower energy generation on forests in the Amazon basin at local and regional scales. Proceedings of the National Academy of Sciences 110, 9601–9606 (2013).
Lathuillière, M. J., Coe, M. T. & Johnson, M. S. A review of green- and blue-water resources and their trade-offs for future agricultural production in the Amazon basin: what could irrigated agriculture mean for Amazonia? Hydrology and Earth System Sciences 20, 2179–2194 (2016).
Phillips, O. L. et al. Drought Sensitivity of the Amazon Rainforest. Science 323, 1344–1347 (2009).
Zhang, K. et al. The fate of Amazonian ecosystems over the coming century arising from changes in climate, atmospheric CO₂, and land use. Global Change Biology 21, 2569–2587 (2015).
Schneider, U. et al. GPCC’s new land surface precipitation climatology based on quality-controlled in situ data and its role in quantifying the global water cycle. Theoretical and Applied Climatology 115, 15–40 (2013).
Chen, M. et al. Assessing objective techniques for gauge-based analyses of global daily precipitation. Journal of Geophysical Research: Atmospheres 113, D04110 (2008).
Botter, G., Porporato, A., Rodriguez-Iturbe, I. & Rinaldo, A. Basin-scale soil moisture dynamics and the probabilistic characterization of carrier hydrologic flows: Slow, leaching-prone components of the hydrologic response. Water Resources Research 43 (2007).
Thirel, G. et al. Hydrology under change: an evaluation protocol to investigate how hydrological models deal with changing catchments. Hydrological Sciences Journal 60, 1184–1199 (2015).
Blöschl, G. Runoff prediction in ungauged basins: synthesis across processes, places and scales (Cambridge University Press, 2013).
Levy, M. C. Curated rain and flow data for the Brazilian rainforest-savanna transition zone, URL http://www.hydroshare.org/resource/e82e66572b444fc5b6bf16f88f911f77 (Consortium of Universities for the Advancement of Hydrologic Science, Hydroshare, 2016).
Muller, M. F. & Thompson, S. E. Bias adjustment of satellite rainfall data through stochastic modeling: Methods development and application to Nepal. Advances in Water Resources 60, 121–134 (2013).
Sivapalan, M. & Blöschl, G. Transformation of point rainfall to areal rainfall: Intensity-duration-frequency curves. Journal of Hydrology 204, 150–167 (1998).
Collischonn, B., Collischonn, W. & Tucci, C. E. M. Daily hydrological modeling in the Amazon basin using TRMM rainfall estimates. Journal of Hydrology 360, 207–216 (2008).
Gebregiorgis, A. & Hossain, F. Understanding the dependence of satellite rainfall uncertainty on topography and climate for hydrologic model simulation. IEEE Transactions on Geoscience and Remote Sensing 51, 704–718 (2013).
Troch, P. A. et al. Climate and vegetation water use efficiency at catchment scales. Hydrological Processes 23, 2409–2414 (2009).
Ashouri, H. et al. PERSIANN-CDR: Daily Precipitation Climate Data Record from Multisatellite Observations for Hydrological and Climate Studies. Bulletin of the American Meteorological Society 96, 69–83 (2014).
Auffhammer, M., Hsiang, S. M., Schlenker, W. & Sobel, A. Using Weather Data and Climate Model Output in Economic Analyses of Climate Change. Review of Environmental Economics and Policy 7, 181–198 (2013).
Taylor, K. E., Stouffer, R. J. & Meehl, G. A. An overview of CMIP5 and the experiment design. Bulletin of the American Meteorological Society 93, 485–498 (2011).
Gulizia, C. & Camilloni, I. Comparative analysis of the ability of a set of CMIP3 and CMIP5 global climate models to represent precipitation in South America. International Journal of Climatology 35, 583–595 (2015).
Arora, V. K. The use of the aridity index to assess climate change effect on annual runoff. Journal of Hydrology 265, 164–177 (2002).
Shope, C. L. & Maharjan, G. R. Modeling Spatiotemporal Precipitation: Effects of Density, Interpolation, and Land Use Distribution. Advances in Meteorology 2015, 174196 (2015).
Otieno, H., Yang, J., Liu, W. & Han, D. Influence of Rain Gauge Density on Interpolation Method Selection. Journal of Hydrologic Engineering 19, 04014024 (2014).
Rao, V. B., Franchito, S. H., Santo, C. M. E. & Gan, M. A. An update on the rainfall characteristics of Brazil: seasonal variations and trends in 1979–2011. International Journal of Climatology 36, 291–302 (2016).
Oliveira, P. T. S. et al. Trends in water balance components across the Brazilian Cerrado. Water Resources Research 50, 7100–7114 (2014).
Panday, P. K., Coe, M. T., Macedo, M. N., Lefebvre, P. & Castanho, A. Dd. A. Deforestation offsets water balance changes due to climate variability in the Xingu River in eastern Amazonia. Journal of Hydrology 523, 822–829 (2015).
Cohn, A. S., VanWey, L. K., Spera, S. A. & Mustard, J. F. Cropping frequency and area response to climate variability can exceed yield response. Nature Climate Change 6, 601–604 (2016).
Llano, M. P. & Vargas, W. Climate characteristics and their relationship with soybean and maize yields in Argentina, Brazil and the United States. International Journal of Climatology 36, 1471–1483 (2016).
Coe, M. T., Latrubesse, E. M., Ferreira, M. E. & Amsler, M. L. The effects of deforestation and climate variability on the streamflow of the Araguaia River, Brazil. Biogeochemistry 105, 119–131 (2011).
Awange, J. L., Mpelasoka, F. & Goncalves, R. M. When every drop counts: Analysis of Droughts in Brazil for the 1901–2013 period. Science of The Total Environment 566–567, 1472–1488 (2016).
Beven, K. & Westerberg, I. On red herrings and real herrings: disinformation and information in hydrological inference. Hydrological Processes 25, 1676–1680 (2011).
Thirel, G., Andréassian, V. & Perrin, C. On the need to test hydrological models under changing conditions. Hydrological Sciences Journal 60, 1165–1173 (2015).
Cohen Liechti, T., Matos, J. P., Boillat, J.-L. & Schleiss, A. J. Comparison and evaluation of satellite derived precipitation products for hydrological modeling of the Zambezi River Basin. Hydrology and Earth System Sciences 16, 489–500 (2012).
Scheel, M. L. M. et al. Evaluation of TRMM Multi-satellite Precipitation Analysis (TMPA) performance in the Central Andes region and its dependency on spatial and temporal resolution. Hydrology and Earth System Sciences 15, 2649–2663 (2011).
Pascale, S., Lucarini, V., Feng, X., Porporato, A. & Hasson, S. Analysis of rainfall seasonality from observations and climate models. Climate Dynamics 44, 3281–3301 (2014).
Lundquist, J. D. et al. Diagnosis of insidious data disasters. Water Resources Research 51, 3815–3827 (2015).
Huffman, G. J. et al. Global Precipitation at One-Degree Daily Resolution from Multisatellite Observations. Journal of Hydrometeorology 2, 36–50 (2001).
Xie, P. et al. A Gauge-Based Analysis of Daily Precipitation over East Asia. Journal of Hydrometeorology 8, 607–626 (2007).
Huffman, G. J. et al. The TRMM multisatellite precipitation analysis (TMPA): Quasi-global, multiyear, combined-sensor precipitation estimates at fine scales. Journal of Hydrometeorology 8, 38–55 (2007).
Sorooshian, S. et al. Evaluation of PERSIANN System Satellite–Based Estimates of Tropical Rainfall. Bulletin of the American Meteorological Society 81, 2035–2046 (2000).
Columbia University, Earth Institute. IRI/LDEO Climate Data Library, URL http://iridl.ldeo.columbia.edu/ (2015).
Goteti, G. raincpc: Obtain and Analyze Rainfall Data from the Climate Prediction Center. URL http://CRAN.R-project.org/package=raincpc. R package version 0.4 (2014).
Thiessen, A. H. Precipitation averages for large areas. Monthly Weather Review 39, 1082–1089 (1911).
Shepard, D. A Two-dimensional Interpolation Function for Irregularly-spaced Data. In Proceedings of the 1968 23rd ACM National Conference, ACM ’68, 517–524 (ACM, New York, NY, USA, 1968).
Matheron, G. Le krigeage universel (École nationale supérieure des mines de Paris, [Paris, France], 1969).
Cressie, N. A. C. Statistics for spatial data. (Wiley: New York, 1993).
Hirsch, R. M., Slack, J. R. & Smith, R. A. Techniques of trend analysis for monthly water quality data. Water Resources Research 18, 107–121 (1982).
Hirsch, R. M. & Slack, J. R. A Nonparametric Trend Test for Seasonal Data With Serial Dependence. Water Resources Research 20, 727–732 (1984).
Mann, H. B. Nonparametric Tests Against Trend. Econometrica 13, 245–259 (1945).
Sen, P. K. Estimates of the Regression Coefficient Based on Kendall’s Tau. Journal of the American Statistical Association 63, 1379–1389 (1968).
Theil, H. A rank-invariant method of linear and polynomial regression analysis. In Henri Theil’s Contributions to Economics and Econometrics, 345–381 (Springer, 1992).
Lyne, V. & Hollick, M. Stochastic time variable rainfall-runoff modelling. In Proceedings of the Hydrology and Water Resources Symposium (Institution of Engineers National Conference Publication, No. 79/10, pp. 89-92, Perth, Australia, 1979).
Ibanez, F. Sur une nouvelle application de la théorie de l’information à la description des séries chronologiques planctoniques. Journal of Plankton Research 4, 619–632 (1982).
Kendall, M. G. Time-series. (Hafner Press: New York, 1976).
Kendall, M. G. Rank correlation methods. (Griffin: London, 1970).
Spearman, C. The Proof and Measurement of Association between Two Things. The American Journal of Psychology 15, 72–101 (1904).
R Core Team. R: A Language and Environment for Statistical Computing, URL https://www.R-project.org (R Foundation for Statistical Computing, 2015).
Olson, D. M. et al. Terrestrial Ecoregions of the World: A New Map of Life on Earth. BioScience 51, 933–938 (2001).

Acknowledgements

M.C.L. acknowledges funding support from the NSF GRFP, the UC Berkeley Philomathia Center, and NSF CNIC IIA-1427761, and assistance from Herman Wu and Ryan Avery from the UC Berkeley Cal Energy Corps program. A.C. acknowledges funding from the Dutch Ministry of Economic Affairs: Division of Agricultural and Natural Resources. A.V.L. acknowledges a doctorate fellowship from the Brazilian Coordination for Improvement of Higher Education Personnel (CAPES) and Fulbright. Publication made possible in part by support from the Berkeley Research Impact Initiative (BRII) sponsored by the UC Berkeley Library.

Author information

Authors and Affiliations

Energy and Resources Group, University of California, Berkeley, USA
Morgan C. Levy
Fletcher School, Tufts University, Medford, USA
Avery Cohn
National Water Agency (ANA), Brasilia, Brazil
Alan Vaz Lopes
Department of Civil and Environmental Engineering, University of California, Berkeley, USA
Sally E. Thompson

Contributions

M.C.L. designed the research, acquired and analyzed data, and wrote the manuscript. A.V.L. advised data acquisition and processing, and preparation of associated data package documentation. S.E.T. and A.C. advised data analysis and interpretation of the results, and assisted in writing the manuscript. All authors edited the manuscript.

Corresponding author

Correspondence to Morgan C. Levy.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Cite this article

Levy, M.C., Cohn, A., Lopes, A.V. et al. Addressing rainfall data selection uncertainty using connections between rainfall and streamflow. Sci Rep 7, 219 (2017). https://doi.org/10.1038/s41598-017-00128-5

Received: 16 May 2016
Accepted: 09 February 2017
Published: 16 March 2017
DOI: https://doi.org/10.1038/s41598-017-00128-5
