Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Irrigated areas drive irrigation water withdrawals

## Abstract

A sustainable management of global freshwater resources requires reliable estimates of the water demanded by irrigated agriculture. This has been attempted by the Food and Agriculture Organization (FAO) through country surveys and censuses, or through Global Models, which compute irrigation water withdrawals with sub-models on crop types and calendars, evapotranspiration, irrigation efficiencies, weather data and irrigated areas, among others. Here we demonstrate that these strategies err on the side of excess complexity, as the values reported by FAO and outputted by Global Models are largely conditioned by irrigated areas and their uncertainty. Modelling irrigation water withdrawals as a function of irrigated areas yields almost the same results in a much parsimonious way, while permitting the exploration of all model uncertainties. Our work offers a robust and more transparent approach to estimate one of the most important indicators guiding our policies on water security worldwide.

## Introduction

Irrigation agriculture is at the forefront of global food security. With the potential to provide crop yields more than two times as large as dryland agriculture1,2, irrigation agriculture currently produces ~40% of all food consumed worldwide in just 20% of the total cultivated land3. Its capacity to maximise yields per unit of terrain is conditional upon the investment of high labour inputs per surface unit and the provision of a steady freshwater supply, which relaxes the dependency of crops on rainwater seasonality4,5. This allows for year-round harvests while reducing adverse impacts on crops from dry spells. Such features make irrigation agriculture a key resource to buffer population growth in our context of climate change.

The demand of water by irrigation has been constantly rising over the last decades6. In general, it is expected to increase even further in the coming years due to changes in precipitation patterns, higher temperatures and the expansion of irrigated areas to meet the projected boost in food demands7,8,9,10,11,12. Acquiring reliable estimates of irrigation water withdrawals is thus regarded as a first step towards a more informed management of global freshwater resources13,14, ultimately endowing us with better tools to ensure food security without damaging the water system. At present, there are two main approaches to calculate global irrigation water withdrawals:

1. 1.

FAO’s/Aquastat’s approach. This is based on country surveys, questionnaires and censuses, literature reviews and coordination among relevant national and international agencies15. Its drawbacks include unreliability of data due to bureaucratic and political constraints16, national interests17,18, difficulties in homogenising water withdrawal data reported through different methods, and missing data points (not all countries provide information or the reported data does not pass Aquastat’s quality check)19.

2. 2.

Through Global Hydrological Models, Land Surface Models or Land Earth Systems Models20. Here we collectively refer to all these models as Global Models (GMs). They are spatially-distributed algorithms that simulate, among others, past, present and future hydrological processes on a global scale. Irrigation water withdrawals are generally computed at a specific time step and spatial resolution with sub-models on evapotranspiration processes, crop types, agrarian calendars, irrigation efficiencies, fertilisation, meteorological forcings and irrigated areas7,21,22,23,24. Some limitations of GMs are their high computational demands, poor calibration and a complex design that precludes a thorough assessment of output uncertainties25,26.

These drawbacks are amplified by uncertainties in crop types, growing seasons, agrarian practices, irrigated areas and local soil and climatic conditions27. Global irrigation water withdrawal estimates are therefore highly sensitive to the selection of the FAO’s or the GMs’ approach, and even the choice of a specific GM is a source of bias7. The reliance on multi-model ensembles of GMs allows for the obtainment of probabilistic estimates, yet it exacerbates the computational, opacity and uncertainty-related problems mentioned above28. Such flaws limit the utility of global irrigation water withdrawal estimates in the policy realm, where stakeholders and non-experts alike should be able to swiftly replicate the results or, at least, understand the main assumptions upon which the analysis is based29,30,31.

Here we show that global irrigation water withdrawals can simply be obtained as a function of irrigated areas. We submit eight GMs and two FAO-based datasets to uncertainty and sensitivity analysis methods and demonstrate that the variability of irrigation water withdrawals is mostly described by the extension of irrigation19,23,24,32,33,34,35,36,37. This paves the way to an easier, cheaper and more transparent estimation of global irrigation water demands and permits a systematic examination of all crucial uncertainties for water security. It also suggests that GMs can improve by better acknowledging the relevance of irrigated areas in their simulations. Our results align with recent works arguing that simple models may be more robust and of greater use than more elaborate approaches, especially when the estimation of interest is fraught with irreducible uncertainties31,38.

## Results

### Irrigated areas and irrigation water withdrawals are strongly related

Irrigated areas in GMs are parametrised with the Global Map of Irrigated Areas (FAO-GMIA)39, a gridded product that documents the extension of irrigation at a 5 arcmin resolution. A linear trend between the areas reported by the FAO-GMIA and the irrigation water withdrawals simulated by GMs is apparent at the country level from 1900 up to 2005–2010, the last period for which there is systematic data available for both variables. This pattern holds regardless of the GM used (Figs. S1–S8). Here we focus on data from 2005 as it adequately summarises this historical relationship and facilitates comparison with two different FAO-based datasets, which reflect country-based irrigation water withdrawals in 2010–201219,24.

For most combinations of continent and irrigation water withdrawal dataset (except some particular cases for Europe, see Figs. S9–S11), the trend between irrigated areas and irrigation water withdrawals is well modelled by a linear regression in which irrigated areas are the predictor x and water withdrawal is the response y (Fig. 1). Such an approach fits well with previous works connecting these variables both empirically and theoretically14,40. Other parameters or intermediate outputs of GMs used to compute irrigation water withdrawals, such as irrigation efficiencies, total evapotranspiration or potential evaporation, do not appear to have any significant influence (Figs. S12–S14).

The strength of the relationship between irrigated areas and irrigation water withdrawals can be assessed with the coefficient of determination r2, which measures how much variance in y can be predicted from x. We check how r2 changes when the main uncertainties conditioning its computation vary within reasonable bounds: for c = 1, 2,..., m countries, we vary in a Monte-Carlo setting (see Methods):

• X1: The selection of the GM or FAO-based dataset to characterise yc.

• X2: The multivariate method used to model a distribution for yc in case it is a missing value.

• X3: The final sampled value from that distribution to impute yc.

• X4: The use of a robust or non-robust regression to estimate r2, as some yc values are outliers (Fig. S11).

The coefficient of determination r2 leans towards high values for Africa (0.75 ≤ r2 ≤ 0.9, P2.5, P97.5), Asia (0.68 ≤ r2 ≤ 0.92) and the Americas (0.68 ≤ r2 ≤ 0.95) (Fig. 2a). The distribution of r2 for these continents is clearly left skewed, with the smaller mode at approximately r ≤ 0.8 produced by the simulations conducted with just one or two GMs (CLM45 for Africa, CLM45 and MPI-HM for the Americas, and CLM45 and VIC for Asia) (Fig. 2b). The goodness of fit for Europe shows the largest spread (0.5 ≤ r2 ≤ 0.89) and a three-modal distribution, with the highest r2 values produced by MPI-HM and VIC and the lowest by PCR-GLOBWB.

For all continents, the most influential factor conditioning r2 is the selection of the GM or FAO-based dataset to parametrise yc (X1) (Fig. 2c). X1 explains from 72% (Africa) to 95% (Asia) of the variance in r2 values. In the case of the Americas, the use of a robust or non-robust approach to compute r2 (X4) conveys an extra 12% of the uncertainty in the goodness of fit, with the robust option yielding slightly higher r2 values on average (Fig. S15). The rest of the variance is due to second and third-order effects. For instance, the third-order effect between (X1, X2, X3) conveys ~5%, ~10% and ~10% of the variance in r2 for the Americas, Europe and Africa, respectively. An important fraction of the ambiguity in the goodness of fit is hence largely irreducible for it emerges as the joint effect of three different structural uncertainties (Fig. S16).

### Estimating irrigation water withdrawals from irrigated areas

Such linear relation and high r2 values suggest that irrigated areas might fairly predict irrigation water withdrawals, especially for countries in Africa, Asia and the Americas. We thus combine Eq. (1) with an uncertainty analysis to estimate irrigation water withdrawals as a function of irrigated areas and compare the predictions with the ten point estimates yielded by all GMs and FAO-based datasets considered (see Methods).

The results show that GMs and FAO-based estimates fall nicely within the ranges defined by our predictions for a very large majority of countries (Figs. 3 and 4). Ninety-nine countries out of 139 (71%) present seven or more estimations bounded by the error bars of our regressions, while 26 countries (18%) have all ten point estimates fully framed. Examples of the latter are Egypt, South Africa, the United States, Mexico, Brazil, Afghanistan, India, Pakistan, Italy, Spain or France, all of them top-ranking countries in irrigation water consumption. For China, also a major consumer of irrigation water, our approach frames nine out of ten point estimates, with VIC falling outside our interval. Figures 3 and 4 thus offer a realistic impression of the uncertainty associated to these predictions.

The number of point estimates framed by our predictions increases with the number of countries in all four continents (Fig. S17). Malta is the only country for which our ranges do not embed any previous estimate. Other countries for which our predictions fit the preexisting estimates poorly are Seychelles (only one estimation framed), Cuba and Kuwait (2), or Ethiopia, Puerto Rico, Indonesia and the Philippines (3). All these countries share a large uncertainty with regard to the irrigation water withdrawal estimates produced by FAO and GMs. The datasets that show the most point estimates beyond or above our error bars are VIC in the case of Africa (~51%), Asia (~58%) and Europe (~43%), and CLM45 (~33%) in the case of the Americas (Fig. S18).

Our approach also replicates the irrigation water withdrawal estimates produced by GMs for 2050 in a context of climate change, regardless of the social context or the Representative Concentration Pathway (RCP) selected (Figs. S19 and S20, RCP2.6, RCP6, RCP8.5; see Methods). For most countries, our ranges encompass the estimates that GMs simulate for the future as nicely as those they yield for the present. This is shown in Fig. 5, with a large majority of countries clustering on the upper right side of the plot. This area includes 80% of the countries and contains the largest agricultural water consumers (e.g. China, India, Spain, Italy, Egypt or the United States, among others). In the case of Kuwait, Moldova or Angola, our approach mimics future estimates better than current estimates, while the opposite is true for Chile, Croatia or Finland.

### The relation may scale down

The trend between irrigated areas and irrigation water withdrawals detected at the country level also emerges at smaller geographical scales. To illustrate this phenomenon, we show independent data at three levels: (1) at the irrigation system level, from the Australian National Committee on Irrigation and Drainage (ANCID)41,42; (2) for every county of Colorado (a state whose farm water use exceeds USA’s national average43), from the Colorado Water Conservation Board44; and (3) for every state of the USA, from the US Department of Interior45. In all three cases, the proportion of the variability in irrigation water withdrawals that is described by irrigated areas (circa 0.7–0.9, 95% CI) is very similar to the results obtained at the national level with the FAO-based and GMs datasets (Fig. 6).

Irrigated areas drive water withdrawals even at the grid cell level, the minimum geographical unit in which GMs simulate irrigation water withdrawals. This is especially the case with CLM45 and MPI-HM, which operate like a linear model despite their computational complexity46, pp. 346–365 (Figs. S23–S54). The same can be said regardless of the GM for the cells of Egypt, Morocco, Sudan, South Africa and Zimbabwe in Africa; of most countries in Asia (including China and India); of Mexico, the USA, Colombia, Argentina or Peru in the Americas; and of Spain, France, Italy and Russia in Europe. Given that irrigated water withdrawals at the grid cell level are often aggregated to produce estimates at the river basin or at the agro-ecological level23,47, irrigated areas may also drive irrigation water withdrawals at these scales in the countries and GMs just mentioned.

### How do irrigation water withdrawals scale with irrigated areas?

The tight relation between irrigated areas and irrigation water withdrawals enables the assessment of how the latter responds to changes in the former. This is expressed by β, the slope of the linear regression of $${{{\mathrm{log}}}}\,(y)$$ against $${{{\mathrm{log}}}}\,(x)$$ (see Methods, Eq. (1)). If β < 1 (β > 1), every increase in the extension of irrigation leads to marginal (accelerated) increases in irrigation water consumption. This framework is known as scaling48,49, and allows to explore (1) whether larger irrigated areas are, on average, less water efficient than smaller ones (β > 1), and (2) whether the complexity behind irrigation water withdrawals can be further simplified to a single β value.

At the continental level, β > 1, β ≈ 1 or β < 1 depending on the GM or FAO-based dataset selected to characterise yc. This is exemplified by Africa, which shows β > 1 under DBHM, H08, LPJmL and PCR-GLBWB, VIC and WaterGap, β < 1 under MPI-HM, and β ≈ 1 under Liu et al.19 and CLM45 (Fig. 7). Such volatility originates from the uncertainty in yc and currently prevents from inferring the existence of a consistent scaling relationship between both variables. For Australian irrigation systems and Colorado counties, β is indistinguishable from 1, indicating that irrigation water withdrawals tend to become twice as large if irrigated areas at the system or the county level are doubled in size. This contrasts with the USA, whose β > 1 suggests that states with a larger extension of irrigation have a disproportionate consumption of irrigation water given the size of their irrigated areas (Fig. S55).

## Discussion

The present paper shows that irrigated areas describe a large degree of variability in irrigation water withdrawals, and that the latter can be approximated as a function of the former. These results are grounded on annual irrigation water withdrawal estimates outputted at the national level by eight Global Models (GMs) and reported by two FAO-based datasets. They are also based on independent data retrieved at the state, county and irrigation system level. Hence there is a great potential for the simplification of methods to calculate irrigation water withdrawals, especially with regard to those employed at larger geographical scales.

That a complex variable such as the volume of water withdrawn for irrigation is nicely described by just a single factor may appear surprising given the large space of relevant factors influencing its behaviour (e.g., crop type and calendars, growing seasons, irrigation efficiency, climate, crop evapotranspiration, soil texture). Yet several degrees of freedom are often determined or summarised by a small set of constraints or even by a single parameter. Size is one of such parameters: for animals, it defines their strength, metabolic rate, life span or population density50,51; for cities, its pace of innovation, number of patents or total electrical consumption48,52. The size of irrigated areas (their extension) appears to be a similar driving force for irrigation water withdrawals.

The irrigation module of GMs contains parameters or secondary outputs whose influence in the calculation of water withdrawals is very minor or inconsequential [e.g., total evapotranspiration, potential evaporation or irrigation efficiency once controlled for irrigated area, Figs. S12–S14)]. The effect of parameters such as crop coefficients or crop calendars, which vary as a function of time, or of complex sub-models such as fertilisation schemes or the climate driver33,34,35,53, is much harder to check. However, if a linear regression can nicely fit the input-output mapping of complex irrigation algorithms, other similar fast-running statistical emulators may be able to work in the computationally expensive sub-models nested within GMs. Substituting these sub-models with time-effective emulators can be an effective way to save computational resources and bridge model realism with computational efficiency. This may allow modellers to focus on answering relevant water policy questions without the extra burden of managing unneeded complexity.

The strong influence of irrigated areas makes GMs very sensitive to the FAO-GMIA39, the gridded map used by GMs to parametrise global irrigated areas. Yet the FAO-GMIA is just one of the five datasets currently available on the extension of irrigation18,54,55,56,57. Depending on the dataset selected, the irrigated area of a given country can differ by up to four orders of magnitude8, a range that reflects our limited knowledge on the current size of irrigated agriculture. The FAO-GMIA was the only map available when most GMs were initially designed, but research conducted over the last ten years has broadened the range of products available and increased the uncertainty range18,56,57. By relying exclusively on the FAO-GMIA, GMs discount this source of ambiguity and yield estimates that are critically conditioned by a structural model design.

Let us illustrate this issue with the case of China and India, for instance. According to the FAO-GMIA, their irrigated areas extend over 61 Mha. This point estimate turns into ranges that respectively span 43–74 Mha and 15–88 Mha if the other irrigated area datasets are taken into account18,56,57. These divergences are explained by the different methodological approaches mobilised to map irrigated areas, including the definition of what is considered an “irrigated area” and the degree of reliance on official statistics. Given the strong weight of the extension of irrigation, the already large variance in irrigation water withdrawals displayed in Figs. 3 and 4 will become much larger if GMs factor this uncertainty in. The same will apply to the estimates of future irrigation water withdrawals, because the uncertainty of global irrigated areas in 2050 spans half an order of magnitude (300–800 Mha, with the most extreme values reaching 1800 Mha)8.

In light of these results, the addition of conceptual depth aiming at making GMs more accurate (by modelling the human influence, increasing the spatial resolution or running multi-model ensembles) appears questionable20,58. The fastest way to acquire more precise irrigation water withdrawal estimates seems to be through a better appraisal of irrigated areas. This may also provide sharper insights into the growth rate between irrigated areas and water demands (i.e. whether β < 1, β ≈ 1, β > 1, see Fig. 7). Scaling relationships might hold crucial information for discerning the role of size in the sustainability of irrigated agriculture40,59,60,61, as well as for our design of irrigation schemes. Trivially, given 1000 ha potentially irrigable, should we promote one system extending over the whole 1000 ha or 10 systems of 100 ha each? If size largely drives some properties of irrigated agriculture, it may be that excess complexity delays—rather than accelerates—our understanding of irrigation systems, thus posing a hindrance to the design of robust water policy responses. This observation resonates with a broad literature showing that too much detail in model efforts undercuts reliable management62,63,64, as well as with recent warnings against excess complexity in mathematical modelling65.

That GMs are too complex given the quality of the data available is also suggested by the ambiguity surrounding other aspects of their irrigation module. Model inputs such as the crop coefficient or the evapotranspiration equation, among others, are also uncertain66,67,68,69,70,71. Their effect in the model output is likely to be minor given the strong weight of irrigated areas, yet we can not rule out their being influential through interactions. The computation of irrigation water withdrawals in GMs relies on multiplications, divisions and exponentials (e.g., Eq. (1) in Wada et al.32; Eqs. (1) and (2) in Döll and Siebert14, Eqs. (7)–(11), A1–A3 in Jägermeyr et al.34). Such operations promote non-additivities, whose effect on irrigation water withdrawals can only be appraised with a global sensitivity analysis (GSA), i.e. by moving all uncertainties at once. However, the literature on GMs has relegated GSA in favour of ensemble, one-at-a-time or piecewise sensitivity analysis7,72,73. These techniques are computationally affordable but severely underpowered to scrutinise the input space, and are unable to detect interactions74,75.

The much parsimonious approach adopted here has a predictive power almost identical to that of FAO’s and eight GMs combined. It has empirical support at several geographical scales, allows to save personal, financial and computational resources, and facilitates an appraisal of uncertainties and sensitivities, including those related with irrigated areas. An extension of our work is to explore why the trend between irrigated areas and water withdrawals is the weakest in European countries. Another direction is to assess whether irrigated areas drive irrigation water withdrawals in other irrigation systems and/or in different sub-regional contexts. This will help assess the extent to which this trend scales down in a robust way.

Finally, we should stress that we are not proposing to substitute approaches relying on crop, soil and climate parameters for linear regressions of irrigated areas. At small granularities, for instance at the plot or the scheme level, these methods nicely appraise physical process and facilitate monitoring of water withdrawals through time. What we argue is that so much detail may not be warranted at larger scales or under strong uncertainties. The debate between proponents of computationally-intensive methods and advocates of more simple approaches is vibrant in the climate modelling community64,76. Both parties also need to be heard in the field of global hydrology.

## Methods

### Data collection

We use irrigation water withdrawal values outputted by eight GMs [six Global Hydrological Models (PCR-GLOBWB32,53, H0833, LPJmL34, WaterGap35, MPI-HM22, DBHM23), one Land Surface Model (VIC77) and one Land Earth System Model (CLM4546)]. For PCR-GLOBWB, H08, LPJmL and WaterGap we rely on the products generated by Huang et al.21, who downscaled the data yielded by these four GMs between 1971-2010 with Aquastat water withdrawal estimates. For DBHM, MPI-HM, VIC and CLM45 we use the data produced by the Inter-Sectoral Impact Model Inter-comparison Project (ISI-MIP)78. Our analysis focuses on the values reported for 2005 in all the cases. All GMs datasets used are forced by WFDEI climate data except MPI-HM and CLM45, which are forced by MIROC5 and GFDL respectively.

We also retrieve irrigation water withdrawal data from two FAO-based datasets. We use the Aquastat country-level data produced by Frenken and Gillet24 for 2012 and the dataset elaborated by Liu et al.19, who filled out missing values in the Aquastat dataset using inverse distance weighting, nearest neighbour or linear interpolation based on associated variables.

For irrigated areas, we collect the data generated by the FAO-GMIA at the national level from Meier et al.57, which documents the extension of irrigation at c. 2005. In order to assess the influence of the FAO-GMIA at the cell level, we retrieve the data produced by the Historical, gridded land use (HYDE 3.2) product from ISI-MIP78. The HYDE 3.2 relies on the FAO-GMIA and the MIRCA 2000 to parametrise irrigated areas79, with the MIRCA 2000 being also a gridded product grounded on the FAO-GMIA80, p. 5).

To investigate whether our approach replicates the irrigation water withdrawal estimates produced by GMs for 2050, we retrieve from ISI-MIP the data produced by PCR-GLOBWBW, LPJmL, H08 and MPI-HM under five different social, climatic, and CO2 scenarios81:

• rcp26/rcp26: Water abstraction and land use (including irrigated areas) change according to the Shared Socioeconomic Pathway 2 (SSP2, “Middle of the road”). Future climate and CO2 concentration evolve as outlined by the Representative Concentration Pathway 2.6 (RCP2.6, mean temperature increase of 1°C up to 2065, CO2 emissions declining by 2020).

• rcp60/rcp60: Water abstraction and land use (including irrigated areas) change according to SSP2. Future climate and CO2 concentration evolve as outlined by the Representative Concentration Pathway 6.0 (RCP6.0, increase of 1.4 °C, CO2 declining by 2080).

• 2005soc/rcp26: Land use (including irrigated areas), nitrogen deposition and fertilizer input are fixed at 2005 values. Future climate and CO2 concentration as in RCP2.6.

• 2005soc/rcp60: Land use (including irrigated areas), nitrogen deposition and fertilizer input are fixed at 2005 values. Future climate and CO2 concentration as in RCP6.0.

• 2005soc/rcp85: Land use (including irrigated areas), nitrogen deposition and fertilizer input are fixed at 2005 values. Future climate and CO2 concentration as in RCP8.5.

### Data treatment

The GMs just mentioned have a spatial resolution of 0.5° × 0.5° and compute irrigation water withdrawals in each cell at a monthly time step. For each GM, we retrieve the data from 2005 and allocate each cell to a specific country given its geospatial information (longitude and latitude). We produce annual irrigation water withdrawal values at the national level by adding the values of all cells within the same country. We then bind all GMs datasets with the Aquastat and the Liu et al.19 datasets and pair each country with the national irrigated areas reported by the FAO-GMIA. This procedure yields missing values in water withdrawal for 28 unique countries, a total of 69 missing data points (Table S1, Fig. S56).

To calculate the relation between irrigated areas and water withdrawal at the cell level, we merge the HYDE 3.2 with all the GMs and pair only the cells that show the same coordinates in both products.

### The model

Following the linear trend between irrigated area and irrigation water withdrawals at the country level (Fig. 1), we model their relation as

$${{{\mathrm{log}}}}\,({y}_{c})=\alpha +\beta {{{\mathrm{log}}}}\,({x}_{c})\ ,$$
(1)

where yc and xc are respectively the irrigation water withdrawal and the irrigated area of country c, for c = 1, 2, ..., m countries. α is a constant and β the scaling exponent describing the growth rate between xc and yc.

### Uncertainty analysis

There are four main sources of uncertainty that condition the goodness of fit of Eq. (1), estimated with the r2 value. We treat these uncertainties as triggers (X1, X2, X3, X4), i.e. random parameters that explore the uncertainty in the model design space. They are the following:

• X1: The selection of the GM or FAO-based dataset to characterise irrigation water withdrawals at the country level (yc in Eq. (1)). There are ten different alternatives (eight GMs and two FAO-based datasets, see Fig. 1).

• X2: The multiple imputation methods used to impute missing values. After pairing the eight GMs and two FAO-based datasets with the irrigated areas reported by the FAO-GMIA, some countries showed missing irrigation water withdrawal values. To ensure that x and y have the same individual data points across all data sets, we replace missing values with substituted values using multiple imputation methods. Unlike single imputation, which treats the imputed value as the “true” value, multiple imputation accounts for the uncertainty about the prediction of the missing value by randomly drawing d values from a distribution specifically modelled for each missing entry82. This creates d different completed datasets or imputations.

Given the linear trend observed in Fig. 1, we assess how three different regression-based, multiple imputation methods affect the estimation of yc: Bayesian regression, linear regression ignoring the model error, and linear regression with bootstrap. The Bayesian regression method imputes yc by the normal model defined by Rubin83, while the linear regression with bootstrap method draws a bootstrap sample from x and y, calculates regression weights and imputes with normal residuals84.

• X3: The selection of the completed dataset to compute r2. The number of imputations d to obtain an appropriate estimation of the true missing value has long been a topic of discussion. Graham et al. 85 recommend 20 imputations for 20–30% missing data and 40 imputations for 50% missing data. The number of missing data points in our study is smaller than 10% for almost all continents and datasets, except for Aquastat in the Americas (c. 40%) (Fig. S56). In order to ensure enough statistical power, we set the number of imputations at d = 40 and create 40 different completed datasets in each iteration.

• X4: The eventual use of corrective measures to calculate the line of best fit in case yc is an outlier. Outliers can bias the estimation of r2. We document their presence for some continents depending on the irrigation water withdrawal dataset used (Fig. S11). The classic estimator of r2 when there is an intercept term in the linear model is

$${r}^{2}={\left(\frac{\mathop{\sum }\nolimits_{c = 1}^{m}({y}_{c}-\bar{y})({\hat{y}}_{c}-\bar{\hat{y}})}{\sqrt{\mathop{\sum }\nolimits_{c = 1}^{m}{\left({y}_{c}-\bar{y}\right)}^{2}\mathop{\sum }\nolimits_{c = 1}^{m}{\left({\hat{y}}_{c}-\bar{\hat{y}}\right)}^{2}}}\right)}^{2}\ ,$$
(2)

where yc is the observed irrigation water withdrawal value for the country c, $$\bar{y}$$ the mean, $$\hat{y}$$ the fitted value and $$\bar{\hat{y}}$$ the mean predicted responses. In order to account for the effect of applying corrective measures to outliers, we consider the consistency-corrected formula by Renaud and Victoria-Feser86,

$${r}^{2}=\frac{\mathop{\sum }\nolimits_{c = 1}^{m}{w}_{c}{\left({\hat{y}}_{c}-{\bar{\hat{y}}}_{w}\right)}^{2}}{\mathop{\sum }\nolimits_{c = 1}^{m}{w}_{c}{\left({\hat{y}}_{c}-{\bar{\hat{y}}}_{w}\right)}^{2}+a\mathop{\sum }\nolimits_{c = 1}^{m}{w}_{c}{\left({y}_{c}-{\hat{y}}_{c}\right)}^{2}}\ ,$$
(3)

where $${\bar{\hat{y}}}_{w}=(1/\sum {w}_{c})\sum {w}_{c}{\hat{y}}_{c}$$, a is a correction factor set at 1.2 and the weights wc and the predicted values yc are produced by the fast S-algorithm of Salibian-Barrera and Yohai87.

To assess how X1…, X4 condition the final r2 value, we conduct a Monte–Carlo-based uncertainty analysis. We design a (N, 2k)Q sample matrix using Sobol’ Quasi-Random Numbers88,89. The Sobol’ sequence is a base-2 sequence that explores the uncertainty space more effectively than random numbers, for it leaves smaller unexplored volumes. After a few experiments we decided to set the number of rows at N = 213 to handle a sample size large enough to ensure the convergence of the Sobol’ indices (see section “Sensitivity analysis” below).

We allocate the leftmost k columns of Q to an A matrix and the rightmost k columns to a B matrix. In these matrices each row is a sample point and each column a trigger described with a probability distribution according to its uncertainty (Fig. S57). Any point in either A or B can be referred to as xvi, where v and i respectively index the row (from 1 to N) and the column (from 1 to k). We also create k$${{{{\boldsymbol{A}}}}}_{B}^{(i)}$$ matrices, where all the columns come from the A matrix except the i-th, which comes from the B matrix (Fig. S58). The $${{{{\boldsymbol{A}}}}}_{B}^{(i)}$$ matrices are required to compute the Sobol’ indices of the triggers (see section “Sensitivity analysis” below)90. Overall, this design has a computational cost C of C = N(k + 2) = 213(4 + 2) = 49, 152 model runs per continent.

Our algorithm runs rowwise, as follows: for v = 1, 2, ... , C rows, it selects the irrigation water withdrawal dataset according to $${X}_{{1}_{v}}$$, fills the missing values in yc given the conditions set by $${X}_{{2}_{v}}$$ and $${X}_{{3}_{v}}$$, and finally computes Eq. (1) + Eq. (2) or Eq. (1) + Eq. (3) depending on the criteria defined by $${X}_{{4}_{v}}$$. The model output in the v-th row is therefore a specific $${r}_{v}^{2}$$ value calculated according to the conditions established by $${X}_{{1}_{v}},\ldots ,{X}_{{4}_{v}}$$. To obtain the range of predictions shown in Figs. 3 and 4, we retrieve xc from the FAO-GMIA and compute Eq. (1) with the 2,400 paired αv and βv coefficients obtained from the simulations.

### Sensitivity analysis

We conduct a global sensitivity analysis using Sobol’ indices91,92, which decompose the variance of the model output V(y) into fractions that are attributed to the model inputs, as

$$V(y)=\mathop{\sum }\limits_{i=1}^{k}{V}_{i}+\mathop{\sum}\limits_{i}\mathop{\sum}\limits_{i < j}{V}_{ij}+...+{V}_{1,2,...,k}\ ,$$
(4)

where

$${V}_{i}={V}_{{x}_{i}}\left[{E}_{{{{{\boldsymbol{x}}}}}_{ \sim i}}(y| {x}_{i})\right]\quad {V}_{ij}= \, {V}_{{x}_{i},{x}_{j}}\left[{E}_{{{{{\boldsymbol{x}}}}}_{ \sim i,j}}(y| {x}_{i},{x}_{j})\right]\\ \, -{V}_{{x}_{i}}\left[{E}_{{{{{\boldsymbol{x}}}}}_{ \sim i}}(y| {x}_{i})\right]\\ \, -{V}_{{x}_{j}}\left[{E}_{{{{{\boldsymbol{x}}}}}_{ \sim j}}(y| {x}_{j})\right]$$
(5)

and so on up to the kth order. Vi is the conditional variance of xi on V(y), Vij the conditional variance of xi and xj on V(y), etc. The notation $${E}_{{{{{\boldsymbol{x}}}}}_{ \sim i}}(y| {x}_{i})$$ means that the mean y value, represented by the E(.) operator, is taken over all inputs except xi. Sobol’ indices are then calculated as

$${S}_{i}=\frac{{V}_{i}}{V(y)}\quad{S}_{ij}=\frac{{V}_{ij}}{V(y)}\ .$$
(6)

Si represents the first-order effect of xi; Sij is the second-order effect of (xi, xj) (formed by the first-order effect of xi and xj and their interaction), etc. Si, Sij, ... can be interpreted as the reduction in variance that will be obtained in the model output if xi, (xi, xj),... are fixed to their “true value”, i.e., if they are no longer uncertain. These reductions are of course averaged over all the possible values of the unknown “true” value.

We also calculate the total-order index Ti, which assesses the first-order effect of a model input jointly with its interactions92. When Ti > Si, xi is involved in interactions. Ti is calculated as

$${T}_{i}=1-\frac{{V}_{{{{{\boldsymbol{x}}}}}_{ \sim i}}\left[{E}_{{x}_{i}}(y| {{{{\boldsymbol{x}}}}}_{ \sim i})\right]}{V(y)}=\frac{{E}_{{{{{\boldsymbol{x}}}}}_{ \sim i}}\left[{V}_{{x}_{i}}(y| {{{{\boldsymbol{x}}}}}_{ \sim i})\right]}{V(y)}\ .$$
(7)

There are several estimators available to compute Eqs. (6) and (7). Here we use the Jansen93 estimators, considered best practice in sensitivity analysis74,94. The Jansen estimators make use of the model output y produced after running the model f in the vth row of the A, B and $${{{{\boldsymbol{A}}}}}_{B}^{(i)}$$ matrices. This is indicated as f(A)v, f(B)v and $$f{({{{{\boldsymbol{A}}}}}_{B}^{(i)})}_{v}$$:

$${S}_{i}=\frac{V(y)-\frac{1}{2N}\mathop{\sum }\nolimits_{v = 1}^{N}{\left[f{({{{\boldsymbol{B}}}})}_{v}-f{({{{{\boldsymbol{A}}}}}_{B}^{(i)})}_{v}\right]}^{2}}{V(y)}\ ,$$
(8)
$${T}_{i}=\frac{\frac{1}{2N}\mathop{\sum }\nolimits_{v = 1}^{N}{\left[f{({{{\boldsymbol{A}}}})}_{v}-f{({{{{\boldsymbol{A}}}}}_{B}^{(i)})}_{v}\right]}^{2}}{V(y)}\ .$$
(9)

## Data availability

The irrigation water withdrawal data generated in this study, as well as the datasets needed to reproduce our results, are available in Puy 95 and in https://github.com/arnaldpuy/achilles_heel. The irrigation water withdrawal estimates produced by GM can be retrieved in https://www.isimip.org.

## Code availability

The R code to replicate our results is available in Puy95 and in https://github.com/arnaldpuy/achilles_heel.

## References

1. 1.

FAO. Crops and drops. making the best use of water for agriculture http://www.fao.org/3/y3918e/y3918e00.htm (2002).

2. 2.

Kukal, M. S. & Irmak, S. Irrigation-limited yield gaps: trends and variability in the United States post-1950. Environ. Res. Commun. 1, 061005 (2019).

3. 3.

United Nations. Facts and figures. Managing water under Uncertainty and risk. https://unesdoc.unesco.org/ark:/48223/pf0000215492 (2012).

4. 4.

Boserup, E. The Conditions of Agricultural Growth (George Allen 6 Unwin Ltd, 1965).

5. 5.

Netting, R. M. Smallholders, Householders. Farm Families and the Ecology of Intensive, Sustainable Agriculture (Stanford University Press, 1993).

6. 6.

Wada, Y., Van Beek, L. P. & Bierkens, M. F. Modelling global water stress of the recent past: On the relative importance of trends in water demand and climate variability. Hydrol. Earth Syst. Sci. 15, 3785–3808 (2011).

7. 7.

Wada, Y. et al. Multimodel projections and uncertainties of irrigation water demand under climate change. Geophys. Res. Lett. 40, 4626–4632 (2013).

8. 8.

Puy, A., Lo Piano, S. & Saltelli, A. Current models underestimate future irrigated areas. Geophys. Res. Lett. 47, e2020GL087360 (2020).

9. 9.

Hejazi, M. et al. Long-term global water projections using six socioeconomic scenarios in an integrated assessment modeling framework. Technol. Forecast. Soc. Change 81, 205–226 (2014).

10. 10.

Shen, Y., Oki, T., Utsumi, N., Kanae, S. & Hanasaki, N. Projection of future world water resources under SRES scenarios: water withdrawal. Hydrological Sci. 53, 11–33 (2008).

11. 11.

Fischer, G., Tubiello, F. N., van Velthuizen, H. & Wiberg, D. A. Climate change impacts on irrigation water requirements: Effects of mitigation, 1990-2080. Technol. Forecast. Soc. Change 74, 1083–1107 (2007).

12. 12.

Haddeland, I. et al. Global water resources affected by human interventions and climate change. Proc. Natl Acad. Sci. USA 111, 3251–3256 (2014).

13. 13.

Hejazi, M. I., Edmonds, J. A. & Chaturvedi, V. Global irrigation demand—a holistic approach. Irrig. Drain. Syst. Eng. 1, 2–5 (2012).

14. 14.

Döll, P. & Siebert, S. Global modeling of irrigation water requirements. Water Resour. Res. 38, 8–1–8–10 (2002).

15. 15.

FAO. AQUASTAT. FAO’s Global Information System on water and agriculture. http://www.fao.org/aquastat/en/ (2020).

16. 16.

Ajaz, A., Karimi, P., Cai, X., De Fraiture, C. & Akhter, M. S. Statistical data collection methodologies of irrigated areas and their limitations: a review. Irrig. Drain. 68, 702–713 (2019).

17. 17.

Young, A. Is there really spare land? a critique of estimates of available cultivable land in developing countries. Environ., Dev. Sustain. 1, 3–18 (1999).

18. 18.

Thenkabail, P. S. et al. Global irrigated area map (GIAM), derived from remote sensing, for the end of the last millennium. Int. J. Remote Sens. 30, 3679–3733 (2009).

19. 19.

Liu, Y. et al. Global and regional evaluation of energy for water. Environ. Sci. Technol. 50, 9736–9745 (2016).

20. 20.

Bierkens, M. F. P. Global hydrology 2015: state, trends, and directions. Water Resour. Res. 51, 4923–4947 (2015).

21. 21.

Huang, Z. et al. Reconstruction of global gridded monthly sectoral water withdrawals for 1971-2010 and analysis of their spatiotemporal patterns. Hydrol. Earth Syst. Sci. 22, 2117–2133 (2018).

22. 22.

Stacke, T. & Hagemann, S. Development and evaluation of a global dynamical wetlands extent scheme. Hydrol. Earth Syst. Sci. 16, 2915–2933 (2012).

23. 23.

Tang, Q., Oki, T., Kanae, S. & Hu, H. The influence of precipitation variability and partial irrigation within grid cells on a hydrological simulation. J. Hydrometeorol. 8, 499–512 (2007).

24. 24.

Frenken, K. & Gillet, V. Irrigation water requirement and water withdrawal by country. Aquastat report, Food and Agriculture Organization of the United Nations, Rome. http://www.fao.org/3/a-bc824e.pdf (2012).

25. 25.

Puy, A., Borgonovo, E., Lo Piano, S. & Saltelli, A. Are the results of the groundwater model robust? http://arxiv.org/abs/1912.10814 (2019).

26. 26.

Sperna Weiland, F. C., Vrugt, J. A., van Beek, R. L., Weerts, A. H. & Bierkens, M. F. Significant uncertainty in global scale hydrological modeling from precipitation data errors. J. Hydrol. 529, 1095–1115 (2015).

27. 27.

United Nations. The United Nations World Water Development Report 2018: Nature-Based Solutions for Water, UNESCO, Paris. https://unesdoc.unesco.org/ark:/48223/pf0000261424 (2018).

28. 28.

Parker, W. S. Ensemble modeling, uncertainty and robust predictions. Wiley Interdiscip. Rev.: Clim. Change 4, 213–223 (2013).

29. 29.

Saltelli, A. & Funtowicz, S. When all models are wrong. Issues Sci. Technol. 4, no.2 (2014).

30. 30.

Saltelli, A., Guimaraes Pereira, A., van der Sluijs, J. P. & Funtowicz, S. O. What do I make of your Latinorum? Sensitivity auditing of mathematical modelling. Int. J. Innov. Policy 9, 213–234 (2013).

31. 31.

Saltelli, A. et al. The technique is never neutral. How methodological choices condition the generation of narratives for sustainability. Environ. Sci. Policy 106, 87–98 (2020).

32. 32.

Wada, Y. et al. Modeling global water use for the 21st century: The Water Futures and Solutions (WFaS) initiative and its approaches. Geoscientific Model Dev. 9, 175–222 (2016).

33. 33.

Hanasaki, N., Yoshikawa, S., Pokhrel, Y. & Kanae, S. A global hydrological simulation to specify the sources of water used by humans. Hydrol. Earth Syst. Sci. 22, 789–817 (2018).

34. 34.

Jägermeyr, J. et al. Water savings potentials of irrigation systems: Global simulation of processes and linkages. Hydrol. Earth Syst. Sci. 19, 3073–3091 (2015).

35. 35.

Muller Schmied, H. et al. Variations of global and continental water balance components as impacted by climate forcing uncertainty and human water use. Hydrol. Earth Syst. Sci. 20, 2877–2898 (2016).

36. 36.

Hanasaki, N., Yoshikawa, S., Pokhrel, Y. & Kanae, S. A global hydrological simulation to specify the sources of water used by humans. Hydrol. Earth Syst. Sci. 22, 789–817 (2018).

37. 37.

Siebert, S., Henrich, V., Frenken, K. & Burke, J. Update of the digital global map of irrigation areas (GMIA) to version 5. http://www.fao.org/3/I9261EN/i9261en.pdf (2013).

38. 38.

Thompson, E. L. & Smith, L. A. Escape from model-land. Economics 13, 1–15 (2019).

39. 39.

Siebert, S. et al. Development and validation of the global map of irrigation areas. Hydrol. Earth Syst. Sci. 9, 535–547 (2005).

40. 40.

Puy, A., Muneepeerakul, R. & Balbo, A. L. Size and stochasticity in irrigated social-ecological systems. Sci. Rep. 7, 43943 (2017).

41. 41.

Malano, H. & Burton, M. Guidelines for benchmarking performance in the irrigation and drainage sector. Food and Agriculture Organization of the United Nations, Rome. https://www.icid.org/BMGuidelines.pdf (2001).

42. 42.

ANCID. Australian Irrigation Water Provider. Benchmarking data report for 2003/2004. Key irrigation industry and performance indicators (2005).

43. 43.

USDA. 2018 Irrigation and water management survey. Volume 3, Special Studies, Part 1. United States Department of Agriculture. https://www.nass.usda.gov/Publications/AgCensus/2017/Online_Resources/Farm_and_Ranch_Irrigation_Survey/fris.pdf (2019).

44. 44.

Ivahnenko, T. & Flynn, J. L. Estimated withdrawals and use of water in Colorado, 2005. US Geological Survey Scientific Investigations Report 2010–5002. http://pubs.usgs.gov/sir/2010/5002/pdf/SIR10-5002.pdf (2010).

45. 45.

Solley, W. B., Pierce, R. R. & Perlman, H. Estimated use of water in the United States in 1995. US Department of the Interior. http://www.usgs.gov/default.asp. (1998).

46. 46.

Oleson, K. W. et al. Technical description of version 4.5 of the Community Land Model (CLM) (No. NCAR/TN-503+STR). http://www.cesm.ucar.edu/models/cesm1.2/clm/CLM45_Tech_Note.pdf (2013).

47. 47.

Tatsumi, K. & Yamashiki, Y. Effect of irrigation water withdrawals on water and energy balance in the Mekong River Basin using an improved VIC land surface model with fewer calibration parameters. Agric. Water Manag. 159, 92–106 (2015).

48. 48.

West, G. Scale. The Universal Laws of Growth, Innovation, Sustainability, and the Pace of Life, in Organisms, Cities, Economies and Companies (Penguin Press, 2017).

49. 49.

Schmidt-Nielsen, K. Scaling. Why is Animal Size so Important? (Cambridge University Press, 1984).

50. 50.

Bonner, J. Why Size Matters (Princeton University Press, 2006).

51. 51.

White, E. P., Ernest, S. M., Kerkhoff, A. J. & Enquist, B. J. Relationships between body size and abundance in ecology. Trends Ecol. Evolution 22(June), 323–330 (2007).

52. 52.

Bettencourt, L. M. A., Lobo, J., Helbing, D., Kühnert, C. & West, G. Growth, innovation, scaling, and the pace of life in cities. Proc. Natl Acad. Sci. 104, 7301–7306 (2007).

53. 53.

Wada, Y., Wisser, D. & Bierkens, M. F. Global modeling of withdrawal, allocation and consumptive use of surface water and groundwater resources. Earth Syst. Dyn. 5, 15–40 (2014).

54. 54.

FAO. AQUASTAT website. http://www.fao.org/nr/water/aquastat/didyouknow/index3.stm (2016).

55. 55.

FAO. FAOSTAT database. http://www.fao.org/faostat/en/ (2017).

56. 56.

Salmon, J., Friedl, M. A., Frolking, S., Wisser, D. & Douglas, E. M. Global rain-fed, irrigated, and paddy croplands: A new high resolution map derived from remote sensing, crop inventories and climate data. Int. J. Appl. Earth Obser. Geoinf. 38, 321–334 (2015).

57. 57.

Meier, J., Zabel, F. & Mauser, W. A global approach to estimate irrigated areas. A comparison between different data and statistics. Hydrol. Earth Syst. Sci. 22, 1119–1133 (2018).

58. 58.

Pokhrel, Y. N., Hanasaki, N., Wada, Y. & Kim, H. Recent progresses in incorporating human land-water management into global land surface models toward their integration into Earth system models. Wiley Interdiscip. Rev.: Water 3, 548–574 (2016).

59. 59.

Adams, W. How beautiful is small? Scale, control and success in Kenyan irrigation. World Dev. 18, 1309–1323 (1990).

60. 60.

Attwood, D. W. Big is ugly? How large-scale institutions prevent famines in Western India. World Dev. 33, 2067–2083 (2005).

61. 61.

Lankford, B., Makin, I., Matthews, N., Mccornick, P. & Noble, A. A compact to revitalise large-scale irrigation systems using a leadership-partnership-ownership ’theory of change’. Water Alternatives 9, 1–32 (2016).

62. 62.

Ludwig, D. & Walters, C. Are age-structured models appropriate for catch-effort data? Can. J. Fish. Aquat. Sci. 42, 1066–72 (1985).

63. 63.

Levin, S. A. The problem of relevant detail. In Busenberg, S. & Martelli, M. (eds.) Differential Equations - Models in Biology, Epidemiology and Ecology, vol. 92, 9–15 (Springer-Verlag, 1991).

64. 64.

Stainforth, D. A. & Calel, R. New priorities for climate science and climate economics in the 2020s. Nat. Commun. 11, 10–12 (2020).

65. 65.

Saltelli, A. et al. Five ways to ensure that models serve society: a manifesto. Nature 582, 482–484 (2020).

66. 66.

Allen, R. G., Pereira, L. S., Raes, D. & Smith, M. Crop evapotranspiration (guidelines for computing crop water requirements). Irrig. Drain. 300, 300 (1998).

67. 67.

Jagtap, S. S. & Jones, J. W. Stability of crop coefficients under different climate and irrigation management practices. Irrig. Sci. 10, 231–244 (1989).

68. 68.

Satti, S. R., Jacobs, J. M. & Irmak, S. Agricultural water management in a humid region: sensitivity to climate, soil and crop parameters. Agric. Water Manag. 70, 51–65 (2004).

69. 69.

Lu, J., Sun, G., McNulty, S. G. & Amatya, D. M. A comparison of six potential evapotranspiration methods for regional use in the southeastern United States. J. Am. Water Resour. Assoc. 41, 621–633 (2005).

70. 70.

Kingston, D. G., Todd, M. C., Taylor, R. G., Thompson, J. R. & Arnell, N. W. Uncertainty in the estimation of potential evapotranspiration under climate change. Geophys. Res. Lett. 36, 3–8 (2009).

71. 71.

Weiß, M. & Menzel, L. A global comparison of four potential evapotranspiration equations and heir relevance to stream flow modelling in semi-arid environments. Adv. Geosci. 18, 15–23 (2008).

72. 72.

De Graaf, I. E. M., Gleeson, T., L. P. H.van Beek, Sutanudjaja, E. H. & Bierkens, M. F. P. Environmental flow limits to global groundwater pumping. Nature 574, 90–94 (2019).

73. 73.

Wisser, D. et al. Global irrigation water demand: Variability and uncertainties arising from agricultural and climate data sets. Geophys. Res. Lett. 35, 1–5 (2008).

74. 74.

Saltelli, A. et al. Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index. Computer Phys. Commun. 181(Feb.), 259–270 (2010).

75. 75.

Saltelli, A. et al. Why so many published sensitivity analyses are false: A systematic review of sensitivity analysis practices. Environ. Model. Softw. 114, 29–39 (2019).

76. 76.

Adam, D. Simulating the pandemic: What COVID forecasters can learn from climate models. Nature 587, 533–534 (2020).

77. 77.

Liang, X., Lettenmaier, D. P., Wood, E. F. & Burges, S. J. A simple hydrologically based model of land surface water and energy fluxes for general circulation models. J. Geophys. Res. 99, 14415–14428 (1994).

78. 78.

Warszawski, L. et al. The inter-sectoral impact model intercomparison project (ISI-MIP): project framework. Proc. Natl Acad. Sci. 111(Mar.), 3228–3232 (2014).

79. 79.

Goldewijk, K. K., Beusen, A., Doelman, J. & Stehfest, E. Anthropogenic land use estimates for the Holocene - HYDE 3.2. Earth Syst. Sci. Data 9, 927–953 (2017).

80. 80.

Portmann, F. T., Siebert, S. & Döll, P. MIRCA2000-Global monthly irrigated and rainfed crop areas around the year 2000: A new high-resolution data set for agricultural and hydrological modeling. Glob. Biogeochemical Cycles 24, 1–24 (2010).

81. 81.

van Vuuren, D. P. et al. The representative concentration pathways: An overview. Climatic Change 109, 5–31 (2011).

82. 82.

van Buuren, S. Flexible Imputation of Missing Data (Chapman and Hall/CRC, 2018).

83. 83.

Rubin, D. B. Multiple Imputation for Nonresponse in Surveys (John Wiley & Sons, 1987).

84. 84.

van Buuren, S. & Groothuis-Oudshoorn, K. Mice: Multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–67 (2011).

85. 85.

Graham, J. W., Olchowski, A. E. & Gilreath, T. D. How many imputations are really needed? Some practical clarifications of multiple imputation theory. Prev. Sci. 8, 206–213 (2007).

86. 86.

Renaud, O. & Victoria-Feser, M. P. A robust coefficient of determination for regression. J. Stat. Plan. Inference 140, 1852–1862 (2010).

87. 87.

Salibian-Barrera, M. & Yohai, V. J. A fast algorithm for S-regression estimates. J. Comput. Graph. Stat. 15, 414–427 (2006).

88. 88.

Sobol’, I. M. On the distribution of points in a cube and the approximate evaluation of integrals. USSR Comput. Math. Math. Phys. 7, 86–112 (1967).

89. 89.

Sobol’, I. M. Uniformly distributed sequences with an additional uniform property. USSR Comput. Math. Math. Phys. 16, 236–242 (1976).

90. 90.

Puy, A., Piano, S. L., Saltelli, A. & Levin, S. A. Sensobol: an R package to compute variance-based sensitivity indices. http://arxiv.org/abs/2101.10103 (2021).

91. 91.

Sobol’, I. M. Sensitivity analysis for nonlinear mathematical models. Math. Model. Comput. Exp. 1, 407–414 (1993).

92. 92.

Homma, T. & Saltelli, A. Importance measures in global sensitivity analysis of nonlinear models. Reliab. Eng. Syst. Saf. 52, 1–17 (1996).

93. 93.

Jansen, M. Analysis of variance designs for model output. Computer Phys. Commun. 117, 35–43 (1999).

94. 94.

Puy, A., Becker, W., Piano, S. L. & Saltelli, A. The battle of total-order sensitivity estimators. http://arxiv.org/abs/2009.01147 (2020).

95. 95.

Puy, A. R code of the paper “Irrigated areas drive irrigation water withdrawals”. (Version 2.0.1). Zenodo.https://doi.org/10.5281/zenodo.4721393 (2020).

96. 96.

Siebert, S. et al. A global data set of the extent of irrigated land from 1900 to 2005. Hydrol. Earth Syst. Sci. 19, 1521–1545 (2015).

97. 97.

Carpenter, J. & Bithell, J. Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians. Stat. Med. 19, 1141–1164 (2000).

## Acknowledgements

Thanks to Francesc C. Conesa (Institut Català d’Arqueologia Clàssica) and to Jonas Meier (German Aerospace Center) for helping with the conversion of .asc files to .csv files. This work has been funded by the European Commission (Marie Skłodowska-Curie Global Fellowship, grant number 792178 to AP) and by the National Science Foundation grant DMS 1951358.

## Author information

Authors

### Contributions

A.P. designed the research, retrieved the data and conducted the simulations. A.P., E.B., S.L.P., S.A.L. and A.S. interpreted and discussed the results. A.P. lead the writing of the manuscript, with contributions from E.B., S.L.P., S.A.L. and A.S. All authors edited, revised and approved the final version.

### Corresponding author

Correspondence to Arnald Puy.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Peer review informationNature Communications thanks Alvaro Calzadilla and other, anonymous, reviewers for their contributions to the peer review of this work. Peer review reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Puy, A., Borgonovo, E., Lo Piano, S. et al. Irrigated areas drive irrigation water withdrawals. Nat Commun 12, 4525 (2021). https://doi.org/10.1038/s41467-021-24508-8

• Accepted:

• Published: