The green and blue crop water requirement WATNEEDS model and its global gridded outputs

Accurately assessing green and blue water requirements from croplands is fundamental to promote sustainable water management. In the last decade, global hydrological models have provided important insights into global patterns of water requirements for crop production. As important as these models are, they do not provide monthly crop-specific and year-specific data of green and blue water requirements. Gridded crop-specific products are therefore needed to better understand the spatial and temporal evolution of water demand. Here, we present a global gridded database of monthly crop-specific green (rain-fed) and blue (irrigated) water requirements for 23 main crops and 3 crop groups obtained using our WATNEEDS model. For the time periods in which our dataset matched, these estimates are validated against existing global products and satellite based datasets of evapotranspiration. The data are publicly available and can be used by practitioners in the water-energy-food nexus to assess the water sustainability of our food and energy systems at multiple spatial (local to global) and temporal (seasonal to multi-year) scales.


Background & Summary
Water plays a central role in supporting agriculture, with food production responsible for ~90% of humanity's consumptive water footprint 1,2 . Accurately assessing demands for freshwater resources -both in terms of timing and magnitude -is critical for understanding the water sustainability of agriculture. A better understanding of agricultural water needs could be used to identify those places where water demand and its variability could potentially compromise the reliability of food production, and for formulating solutions to promote sustainable water management. Some studies have helped to expand our knowledge of crop water use by adopting the water footprint approach 3,4 that partitions crop water requirements (i.e., the volume of water needed to support crop's evapotranspiration during its growing period without experiencing water stress) between blue water and green water (i.e., water from water bodies or aquifers and soil moisture, respectively). Some studies have provided estimates of annual trends in water demand from croplands 5,6 while others have progressed to finer temporal (i.e., monthly) resolutions that consider multiple individual crops 4,7 . The latter crop-specific studies have been typically centred on the year 2000, offering an important snapshot of the global distribution of water demand and insight into the importance of certain crops in driving water demands in different regions. More broadly, all of these products have helped to advance the science of food security and water sustainability and to identify those places where chronic or seasonal water stress is occurring because of agricultural water demand.
Unlike the spatial distribution of crop water demand, the time component of freshwater demand is less well understood. To date, most temporal studies of water use in agriculture have concentrated on national, regional, or basin scales (e.g., refs. [8][9][10] or examined the agricultural sector as whole (e.g., refs. 9,11 ) and have rarely captured the intra-national and intra-annual dynamics of water demand and availability. In addition, previous studies examining long-term temporal trends in crop water use have relied primarily on national, temporally averaged crop water requirement values (mm H 2 O yr −1 ; see e.g., ref. 12 ). These values are then typically multiplied by a crop's annual harvested area in a particular country to determine the total water demand for that crop in that year. www.nature.com/scientificdata www.nature.com/scientificdata/ While such analyses provide important information related to broad trends of water use for crop production, such an approach does not permit temporal examinations that are spatially explicit and intra-annually disaggregated or that account for different crop growing periods. As such, there is a persisting need for datasets that not only incorporate the advances of previous work (i.e., spatially explicit, crop-specific) but that also expand upon these strengths by adding the dimension of time.
Here we address this important gap by presenting our WATNEEDS model. The model assesses the vertical component of the soil water balance and introduces a spatially distributed crop specific monthly analysis of green and blue water requirement and use for available climatic data. Specifically, evapotranspiration is computed using the FAO Penmann Monteith method 13 , overland flow is modelled as soil water saturation excess mechanism, and percolation is a linear function of the maximum infiltration rate and soil moisture, as in Hoogeven et al. 14 .
As input data for WATNEEDS model, we use MIRCA crop distribution data 15 and crop parameters and growing stage data taken from Allen et al. 13 , following Siebert and Döll 7 . We provide WATNEEDS model outputs for the 1998-2002 average (i.e., circa year 2000) and for the year 2016. Moreover, we provide monthly crop-specific gridded maps for 5 main crops.
Specifically, we begin by describing the biophysical input datasets required for the calculation of crop water requirements including spatially and temporally distributed information on climate, soil, and crop characteristics. We then describe how the model (WATNEEDS) solves a daily vertical soil water balance to generate estimates of green and blue crop water requirements. Using the model WATNEEDS, we present a global gridded (5 arcminute) dataset of monthly green and blue crop water requirements for 5 major crops and annual green and blue crop water requirements for 23 crops and 3 crop groups that encompass the vast majority of global water use for food production for the average year 2000 (i.e. averaged among 1998-2002 yearly simulations) in agreement with previous datasets and the most recent available year 2016. Moreover, where temporal and spatial overlap permits, the model outputs generated by this study are also compared to other existing global model outputs and satellite-derived products Ultimately, this model will be used to develop a continuous global time series for better understanding the spatial and temporal heterogeneity and variability of freshwater demand for food production as well as how water demand has evolved through time relative to water availability. Such historical examinations can provide insights for more accurate predictions of future water demand and availability.

Methods
This section provides a detailed description of the input data sources, the model components used for calculating crop water requirements, and the resultant time series of global gridded monthly crop water requirement maps.
The crop water requirement (mm yr -1 ) is the volume of water required to compensate for a crop's evapotranspiration losses and to prevent crop water stress. This crop water requirement can be divided into two components: the green crop water requirement (met by available precipitation) and the blue crop water requirement (met by irrigation). The crop water requirement is fully satisfied only when there is enough water for the plant to take up during its growth (i.e. enough precipitation or irrigation) without undergoing water stress. In regions of the world where crop water demand cannot be met by rainwater, only part of the crop water requirement is satisfied by green water (i.e. actual evapotranspiration). Irrigation can be used to supplement the crop's water needs, thereby allowing crops to evapotranspire at the potential rate. For the years around 2000 (i.e., looking at average results for [1998][1999][2000][2001][2002] and the year 2016, we calculated yearly blue and green crop water requirements for 23 major crops -barley, cassava, citrus, cocoa, coffee, cotton, date palm, grapes/vine, groundnuts/peanuts, maize, millet, oil palm, potatoes, pulses, rape seed/canola, rice, rye, sorghum, soybeans, sugar beet, sugar cane, sunflower, and wheat -that currently account for 76% of global crop production and 95% of global harvested area 16 and 3 crop groups (fodder grasses, others annual crops, and others perennial crops). Specifically, we estimated actual green water use in rainfed areas and green and blue water use in irrigated areas. We also assessed monthly green and blue water requirements for five major crops -wheat, maize, rice, sugarcane and soybean -that currently account for almost half of global crop production. Land use, soil characteristics, crop calendars and crop growing stages are kept constant in all years using values available for the year 2000.
Data sources. Monthly data on potential reference evapotranspiration (ET o ) came from the University of East Anglia's Climate Research Unit Time Series version 4.01 dataset (CRU TS v. 4.01; 0.5° × 0.5° resolution) 17 and was calculated using the Penmann-Monteith equation, following Allen et al. 13 . Daily precipitation data between the latitudes 50° N and 50° S came from the Climate Hazards Group InfraRed Precipitation with Station version 2.0 dataset (CHIRPS; 0.05° × 0.05° resolution) 18,19 while precipitation data for the remaining latitudes was taken from the National Oceanic and Atmospheric Administration's Climate Prediction Center Global Unified Gauge-Based Analysis of Daily Precipitation dataset (CPC; 0.5° × 0.5° resolution) 20 . Soil information -maximum soil moisture storage capacity and maximum infiltration rate -were from Bajties et al. 21 (0.08333° × 0.08333° resolution). Crop coefficients (k c ) and growing stages came from Allen et al. 13 Growing stages -originally reported as a percentage of the growing period of a crop -were then scaled to the planting and harvesting dates reported for the 402 regions and sub-regions included in the MIRCA2000 dataset 15 . Crop-specific rooting depths for irrigated and rainfed crops and critical depletion factors came from Allen et al. 13 . All gridded datasets were resampled to a 5 arcminute (0.08333°) spatial resolution.

Multiple growing seasons.
For a number of regions included within the MIRCA2000 dataset 15 , more than one growing period is reported for certain crops. This is true for irrigated rice and wheat. For cases where more than two growing periods were reported, we averaged the growing periods with the harvested area reported by Portmann et al. 15 . In a limited number of cases, the harvested areas were reported as equal across all growing periods for a particular crop and region. In these instances, selection of the two dominant growing periods was complemented using the growing periods of Mekonnen and Hoekstra 4 , based on USDA 22 and FAO 23 information.
www.nature.com/scientificdata www.nature.com/scientificdata/ atmospheric demand on crops. Evapotranspiration represents the rate of water flow to the atmosphere as water vapor. Potential evapotranspiration corresponds to the crop water requirement of plants (CWR) in the absence of water-stress; it can be reached when plants can take up from the soil the amount of water they need. This water comes from precipitation (green water -GW) and, in the case of deficiency, it is supplemented by irrigation (blue water -BW).
Potential evapotranspiration ( ) can be assessed as where k c,i,t (−) is the crop coefficient of crop i, corresponding to the growing stage in which day t occurs; crop coefficients are taken from Allen et al. 13 . ETo is the reference evapotranspiration 17 .
The daily actual evapotranspiration (ET a,i,t ) ( ) mm day of crop i on day t is then calculated as: where k s,i,t (−) is the water stress coefficient calculated as a function of the soil water content in the root zone (S i,t ) and the maximum and actual water content in the root zone, as in Allen et al. 13 . For crop i on day t under water stressed conditions (i.e., when only precipitation is provided), k s,i,t was evaluated as: where S i,t (mm) is the depth-average soil moisture and RAW i (mm) is the readily available water. RAW is calculated as: where TAWi (mm) is the total available water (i.e., the amount of water that a crop can uptake from the rooting zone), p i (−) is the critical depletion factor (i.e., the fraction of TAW i that a crop can uptake from the rooting zone without experiencing crop water stress), θ θ − fc wp ( ) mm m is the maximum soil moisture storage capacity dependent on soil texture (i.e., the difference between the water content at field capacity and the water content at the wilting point) 14 , and z r (m) is the crop rooting depth 7 . For conditions of no water stress (where supplementary irrigation is available), k s,i,t was assumed to be equal to 1 (see ref. 13 ).
Vertical soil water balance. For a given crop and grid cell, soil moisture (S i,t ) was calculated by solving a daily soil water balance: ( ) is the soil moisture of the previous time step, Δt is equal to one day, P eff ( ) mm day is the effective precipitation -where we assume that 5% of precipitation is partitioned to surface runoff following Hoogeveen et al. 14  is deep percolation below the root zone (which occurs when soil moisture exceeds field capacity (i.e., the volume of water able to be retained in the soil)) and was calculated as: is the maximum infiltration rate depending on soil type 24 . In time steps where the sum of balance (i.e., S i,t-1 + P eff -Et a,i,t -D i,t ) is negative, the ET a,i,t and D i,t were scaled proportionally in order to close the balance. In time steps where the sum of the balance (i.e., S i,t-1 + P eff -Et a,i,t -D i,t ) is positive and exceeds TAW i , R i,t -the sub-surface runoff -is calculated as the difference between the sum of the balance and TAW i .
For each day, each crop, and each grid cell within a MIRCA2000 region for which data on growing period was available, we calculated a stress ET a,i,t,s -equal to the 'green' crop water requirement -and unstressed ET a,i,t,uequal to the actual evapotranspiration under no water stress ET i,t,s . 'Blue' crop water requirement was calculated as the difference between ET a,i,t,s and ET a,i,t,u and was only considered for irrigated areas. We then took a summation of the daily 'green' and 'blue' crop water requirements across each month of a crop's growing season to determine monthly 'green' (for rainfed and irrigated crops) and 'blue' (for irrigated crops only) consumptive crop water requirements (Table S1). These definitions of 'green' and 'blue' crop water requirements are consistent with standard methodologies of water footprint calculation 1,4 .
Model initial and non-growing season conditions. The model was initialized assuming an initial soil moisture condition of 50% of TAW. Following Hoogeveen et al. 14 , the model was then run for three years prior to the study start date using three randomly selected years of climate data. Because we ran multiple simulations (one www.nature.com/scientificdata www.nature.com/scientificdata/ for each crop), these three randomly selected years were held constant across simulations. For the months that fell outside of the growing season, we assumed a k c value of 0.5. We also examined the sensitivity of our results to this off-season k c value and found only limited variation.

Data Records
Each monthly time-step of the dataset has global coverage (180°E-180°W; 90°S-90°N) with a 5 arcminute resolution (~10 km at the equator) and is provided in a standard WGS84 coordinate system. The data are provided in NetCDF-4 format, where the third dimension represents the crop type and the month for yearly and monthly results respectively. For each of the 5 major crops, a separate NetCDF-4 file was created. Green crop water requirements represent the amount of ET (i.e., actual crop-specific evapotranspiration demand (mm)) met by precipitation. Blue water requirements represent the amount of irrigation required to make up the difference between ET and the green crop water requirement after accounting for vertical soil water balances. Tables 1-3 report the names of each file and the main characteristics. These data files are also accompanied by a readme text file.
Yearly and monthly crop specific blue and green water requirement data are available in the freely and publicly available repository in figshare (https://doi.org/10.6084/m9.figshare.c.4893084) 25 .

technical Validation
Our model shows that 5414 km 3 of green water and 1068 km 3 of blue water were consumed for crop production in the year 2000. Of these volumes, 922 km 3 of green water and 1068 km 3 of blue water were consumed on irrigated land covering 25% of total global harvested area, and the remaining 4491 km 3 of green water were used by rainfed area. These cumulative results were validated against those of Siebert and Döll 7 , showing good agreement (a discrepancy lower than 3%) in both rainfed and irrigated conditions. Crop-specific results are reported in Table 4. Except for rye and rapeseed, whose water requirements are low compared to other crops, we observed differences lower than 15% for both rainfed and irrigated conditions. Globally, the crops accounting for the largest volumes of water consumption are maize (12% and 26% of green and blue water respectively), wheat (13% and 18% of green and blue water respectively) and rice (12% and 7% of green and blue water respectively), which together cover more than 40% of the total global harvested area. Though cultivated less extensively, the crops with the highest blue water requirements were date palm (1174 mm), sugarcane (305 mm), and citrus (300 mm).
For maps centered on the year 2000, we also performed a crop-by-crop pixel-by-pixel comparison to the dataset of Siebert and Döll 7 for 23 main crops and 3 crop groups. The difference between Siebert and Doll and our model results in term of CWR is lower than 20% for about the 90% of the harvested area. All comparisons for rainfed and irrigated crops for the year 2000 are provided in the Supplementary Materials. For irrigated rice and  www.nature.com/scientificdata www.nature.com/scientificdata/ wheat, which can have more than one growing period in the same location, the comparison was done considering only the main harvested period (i.e., the period associated to the larger harvested area). For 23 out of 26 crops, values of rainfed crops estimated by our model differ less than 20% compared with rainfed results by Siebert and Döll 7 , while 13 and 17 out of 26 for green and blue water (respectively) in irrigated areas showed an overall good agreement for crop water requirement.
For 5 major crops -which account for almost half of total crop production -we generated results at a monthly time scale and compared with MODIS evapotranspiration estimates 26 . We observed better agreement for rainfed crops with a discrepancy lower than 20% in 69%, 51%, 46%, 41% and 27% respectively for rainfed rice, sugarcane, soybeans, maize and wheat and an average of 20% for irrigated crops. Generally, the results from our model show higher values, likely due to the fact that in practice the full amount of required water is not provided to the field in many cases (i.e., deficit irrigation).
In the year 2016, crop production consumed 5740 km 3 of green water and 1005 km 3 of blue water, with an increase of 6% in green water consumption and a reduction of 6% of blue water relative to the year 2000 (Table 5 and Fig. 1).

Sensitivity analysis.
We also performed sensitivity analyses on the initial condition and kc values during the non-growing season of the plant. Negligible changes (i.e., lower than 1%) are registered when changing the initial soil moisture condition from 0% to 100% as a result of the 3 year model spin-up to equilibrate the initial condition.
Varying kc values from 0.1 to 1 in the non-growing season produced a final variation in our estimates of less than 15% for green water and 20% for blue water (with the exceptions of rapeseed and rye) ( Table 6). An average value of 0.5 was ultimately used, in order to include an average condition between bare soils, grassland, or second crops that could potentially cover the harvested areas in the non-growing season.

Usage Notes
The primary use of this dataset is to examine spatial and temporal trends of crop water use in tandem. Combined with estimates of cropland extent through time, this information now allows for historical estimates of water demand for global crop production and the incorporation of spatio-temporal variability and uncertainty into such assessments. With that said, there are several important issues about which users should be aware when utilizing this dataset:   Table 5. Green and blue water for the year 2016. www.nature.com/scientificdata www.nature.com/scientificdata/ • This dataset utilizes the primary planting and harvesting dates for each crop in each region. However, some regions have multiple cropping seasons 27 . It is therefore important that any region-specific analyses using this data incorporate a detailed understanding of that area's cropping systems. • Changes in atmospheric CO 2 concentrations can produce competing effects on the water use efficiency of a crop. On one hand, elevated CO 2 concentrations lead to increased air temperatures and greater plant transpiration. On the other hand, elevated CO 2 concentrations permit smaller stomatal opening for carbon fixation. While these effects have been shown to largely cancel out for the time period 1981-2013 for rice, soybeans, wheat (C3 crops), and maize (a C4 crop), the water use efficiency of maize was found to be lower than estimated by a Penman-Monteith approach -as employed here -when considering longer time periods 28 . Thus while we expect that changing CO 2 concentrations do not significantly affect our estimates of crop water requirements for C4 crops (e.g., maize, millet, sorghum, sugarcane), such considerations are essential for studies seeking to apply our methodology over longer time periods. • Blue crop water requirements are not necessarily the same as the irrigation water that crops are able to receive.
In some places with access to irrigation, supporting infrastructure may be insufficient to provide the water needed to avoid crop water stress 29 . To the extent possible, studies using this dataset (or any other global gridded dataset of estimating crop water requirements) should incorporate available information on actual irrigation water withdrawals and consumption to avoid overestimation.
This dataset provides an important temporal extension of existing global gridded estimates of crop water requirements and can also be useful in hydro-economic modelling and assessments of historical climate variability and trends in water scarcity.

Code availability
Code for calculating yearly and monthly crop specific blue and green water requirements is available in the freely and publicly available repository in figshare (https://doi.org/10.6084/m9.figshare.c.4893084) 25 .   Table 6. Global crop water requirement with 3 different kc values during the non-growing season of the crop.