Mapping 20 years of irrigated croplands in China using MODIS and statistics and existing irrigation products

Zhang, Chao; Dong, Jinwei; Ge, Quansheng

doi:10.1038/s41597-022-01522-z

Data Descriptor
Open access
Published: 15 July 2022

Scientific Data volume 9, Article number: 407 (2022)

Abstract

As a routine agricultural practice, irrigation is fundamental to protecting crops from water scarcity and ensuring food security in China. However, consistent and reliable maps of the spatial distribution and extent of irrigated croplands are still unavailable, impeding water resource management and agricultural planning. Here, we produced annual 500-m irrigated cropland maps across China for 2000–2019, using a two-step strategy that integrated statistics, remote sensing, and existing irrigation products into a hybrid irrigation dataset. First, we generated intermediate irrigation maps (MIrAD-GI) by fusing the MODIS-derived greenness index and statistical data. Second, we collected all available existing irrigation maps over China and integrated them with MIrAD-GI into an improved series of annual irrigation maps, using constrained statistics and a synergy mapping method. The resultant maps had moderate overall accuracies (0.732–0.819) based on nationwide reference ground samples and outperformed existing irrigation products in an inter-comparison. As the first of their kind in China, the annual maps delineate the spatiotemporal pattern of irrigated croplands and could contribute to sustainable water use and agricultural development.

Measurement(s)	irrigation area and distribution
Technology Type(s)	statistics and satellite remote sensing
Factor Type(s)	agricultural irrigation
Sample Characteristic - Organism	agricultural irrigation
Sample Characteristic - Environment	agricultural field
Sample Characteristic - Location	China

Background & Summary

With the fast-growing population in the world, more grain must be produced to meet the increasing food demand, calling for expanded or intensified agriculture^1,2. As an important practice of agricultural intensification, irrigated croplands contribute to about 40% of food production using only 18% of croplands globally³, but consume 70% of the total available fresh water on the planet^4,5, exacerbating global and regional water scarcity^6,7. Thus, explicit information of the spatial distribution and temporal dynamics of irrigated areas is of significance for water management and agricultural planning, as well as the understanding of regional water cycle⁸ and climate change^9,10.

Irrigation has a long history in China and expanded fast with more and more irrigation infrastructures (i.e., reservoirs and canals) constructed in the last two decades¹¹. In China, irrigated croplands account for half of the total cropland area and produce approximately 75% of food and more than 90% of industrial crops¹¹. But the sustainable development of irrigation is challenged by episodic weather anomalies, uneven precipitation, and frequent droughts¹². Long-term intensive irrigation has resulted in water scarcity and environmental problems in some regions, such as groundwater crisis in North China Plain¹³, soil salinization in Northwest China¹⁴, and local climatic effects (e.g., temperature changes) in Northeast China^15,16. In the foreseeable future, irrigated agriculture in China is likely to experience pressure in water demand due to unstable weather and extreme climate events, especially in arid and semi-arid areas. Therefore, routinely monitoring irrigated areas can not only facilitate water management and allocation but also evaluate its environmental and climatic effects.

So far, the annual irrigated area is documented only in the statistical yearbooks released by the National Bureau of Statistics of China, and explicit information about its location and distribution is absent. The current spatiotemporal extent and dynamics of irrigated croplands in China remain uncertain, and the available maps are usually outdated or have a coarse spatial resolution. During the last two decades, several global irrigation maps have been produced, such as the Global Map of Irrigation Areas (GMIA)^17,18, generated at a 10-km resolution using approximate irrigation areas at national and sub-national scales and multi-source ancillary datasets (i.e., raster land cover maps and vector irrigation maps). Two other global maps based on remote sensing classification methods subsequently emerged: the Global Irrigated Area Map (GIAM)¹⁹ with a 10-km resolution for the year 2000, and the Global Rainfed, Irrigated, and Paddy Croplands map (GRIPC)³ with a 500-m resolution for the year 2005. Further, several time-series irrigation maps at regional or continental scales were developed by different institutes. Examples include MIrAD-US^20,21,22 (MODIS Irrigated Agriculture Dataset for the US), which uses a geospatial modelling framework that assimilates irrigation statistics with a remote sensing vegetation index to identify irrigated lands at a 250-m resolution every five years since 2002; the 500-m irrigated dryland map for the US in 2001, produced with remotely sensed temporal and spectral signatures and a decision tree method²³; and the yearly irrigated area maps of India for 2000–2015, produced using the 250-m MODIS vegetation index, land use/cover data, and a decision tree irrigation model²⁴. In addition, several time-series 30-m irrigation datasets have been generated using Landsat imagery, environmental variables, and random forest models on the Google Earth Engine platform^{25,26,27,28,29,30}.

For China, two national irrigation maps have been produced: the map for the year 2000, generated by downscaling statistical data to individual pixels based on three irrigation indices³¹, and the map for the year 2016, developed by comparing the water index of cropland with that of adjacent forest³². However, these two maps had low accuracies (less than 70%) and each covered only a single year^31,32, which limits our understanding of the spatial and temporal extent of irrigated area and calls for a spatiotemporally consistent dataset.

Here, we present such a dataset, which resulted from a study that tracked the spatiotemporal dynamics of irrigated croplands in China from 2000 to 2019³³. The dataset consists of yearly spatial information describing the extent and distribution of irrigated croplands at a resolution of 500 m, making it possible to identify increases or decreases in irrigated area across China. Specifically, this study gives a detailed description of how the dataset was produced, including 1) generating provisional irrigation maps (MODIS irrigation dataset based on Greenness Index, MIrAD-GI) by downscaling statistics to potential cropland pixels based on the MODIS-derived greenness index; and 2) integrating MIrAD-GI with historical irrigation maps into an improved irrigation dataset, using constrained statistics and a synergy mapping method. Further, whereas the previous study validated the resultant maps in only four typical irrigation districts for three years, this study implemented more comprehensive validations based on nationwide ground samples for five years. The data generation process described here is intended to be explicit, legible, and repeatable with the assistance of accessible Python code. The final irrigation dataset can be used to predict water consumption, assist agricultural planning, and assess regional climatic effects induced by irrigation.

Methods

China has a typical monsoon climate and is divided into arid, semi-arid, and wet regions. The total precipitation ranges from 3.2 to 4854.0 mm and the mean annual temperature varies between 4.41 and 26.33 °C³⁴. The unstable climate is prone to frequent floods and droughts, which threaten agriculture. The total cropland area in China is more than 120 million hectares, accounting for 15% of the global land. Irrigated croplands in this study refer to areas that are equipped with irrigation facilities and receive a water supply at least once a year, including paddy fields and irrigated drylands. Irrigated croplands are mainly spread over flat regions such as the North China Plain, Northeast China, the Hetao Plain, and the oases of Northwest China. The irrigated area increased by approximately one quarter in the last two decades³³, mostly in Northeast and Northwest China due to the reclamation of croplands.

Greenness Index (GI) data from MODIS

Vegetation indices (VIs), including the normalized difference vegetation index (NDVI), enhanced vegetation index (EVI), and greenness index (GI), are widely used as indicators for irrigation detection^{3,29,35,36,37,38} because they represent the amount of green biomass, with index values that respond to perturbations in vegetation condition²⁰. A maximum VI derived from the annual time series can be seen as a proxy for the peak level of photosynthetic activity, the highest biomass, and possibly the densest vegetation canopy^39,40. The highest annual peak VI for any crop can be attributed to the consistently adequate soil moisture delivered by irrigation during the growing season; thus, irrigated crops commonly have higher peak VIs than non-irrigated crops^20,21,23 (Fig. 4).

Among various vegetation indices, GI is found more sensitive to the irrigation-induced status of crops and can better capture the absolute magnitude of greenness which is an indication of irrigation presence²³. Gitelson et al.⁴¹ argued that the absorption of light in the green spectrum was high enough to provide GI with a high sensitivity to chlorophyll content but much lower than in the blue and red to avoid saturation. As a result, we selected peak GI as the indicator of irrigation activity. Due to the high temporal resolution and least amount of cloud contamination, the Terra/MODIS surface spectral reflectance data product (MOD09A1) with a spatial resolution of 500 m and an 8-day temporal composite period was used to calculate GI⁴² (Eq. 1).

$$\mathrm{GI}=\frac{\rho_{\mathrm{nir}}}{\rho_{\mathrm{green}}} \tag{1}$$

where ρ_nir and ρ_green are the surface reflectances of the MODIS near-infrared and green bands, respectively. The GI time series were filtered to reduce outliers using the Savitzky-Golay filter⁴³.
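A minimal Python sketch of Eq. 1 and the peak-GI extraction is given here for illustration only. The function name `peak_gi`, the time-first array layout, and the Savitzky-Golay settings (`window_length=7`, `polyorder=2`) are assumptions, since the filter parameters are not stated in the text. In MOD09A1, the near-infrared and green reflectances correspond to bands 2 and 4; the sketch assumes cloud-screened, strictly positive reflectances.

```python
import numpy as np
from scipy.signal import savgol_filter


def peak_gi(nir, green, window_length=7, polyorder=2):
    """Annual peak greenness index per pixel.

    nir, green: reflectance arrays shaped (time, rows, cols), e.g. the
    46 eight-day MOD09A1 composites of one year (assumed layout).
    window_length and polyorder are assumed smoothing settings.
    """
    nir = np.asarray(nir, dtype=float)
    green = np.asarray(green, dtype=float)
    gi = nir / green  # Eq. 1, assumes green > 0 everywhere
    # Smooth each pixel's time series along the time axis.
    smoothed = savgol_filter(gi, window_length, polyorder, axis=0)
    # The peak of the smoothed series is the irrigation indicator.
    return smoothed.max(axis=0)


# Toy example: a constant series with GI = 4 at every time step.
nir = np.full((46, 2, 2), 0.4)
green = np.full((46, 2, 2), 0.1)
peak = peak_gi(nir, green)
```

A wider window suppresses more outliers but also flattens the seasonal peak, so the assumed settings would need to be tuned against the actual time series.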

Statistical data of irrigation and pre-processing

We collected all available irrigation statistics for 2000–2019 at the provincial, municipal, and county levels released by the National and Provincial Bureaus of Statistics (Fig. S1). The statistical data are the only reliable data covering most irrigated areas in China and have been widely used to assist the mapping of irrigated croplands in existing studies^22,23. Because the completeness of the statistics varied and administrative divisions were adjusted in different provinces over the last two decades, we adopted the following measures to produce a consistent statistical dataset. (1) When statistics for a year were absent, we filled the temporal gap by linear interpolation between the preceding and following years' data. (2) We gave priority to county-level statistics but adopted municipal or provincial statistics when the finer-level data were unavailable. (3) We unified the statistics of prefectures and counties whose boundaries or names changed during the period. (4) We integrated the multi-source statistics for some provinces, such as Xinjiang, Heilongjiang, and Hainan, based on geographic locations and administrative boundaries. Finally, we obtained consistent time-series irrigation data and converted them into shapefile format with ArcGIS.
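The gap-filling measure (1) can be sketched with pandas as follows. This is an illustrative assumption of how such a step might look, not the authors' code: the function name `fill_gaps` and the dictionary input are hypothetical, and the areas in the example are made-up numbers.

```python
import pandas as pd


def fill_gaps(area_by_year):
    """Linearly interpolate missing yearly irrigated-area statistics.

    area_by_year: dict {year: area}; missing years may be absent or None.
    Only gaps between two reported years are filled (no extrapolation).
    """
    series = pd.Series(area_by_year, dtype=float).sort_index()
    # Insert any years that are entirely absent from the record.
    years = range(int(series.index.min()), int(series.index.max()) + 1)
    series = series.reindex(years)
    return series.interpolate(method="linear", limit_area="inside")


# Hypothetical county record with gaps in 2001, 2003 and 2004.
filled = fill_gaps({2000: 100.0, 2001: None, 2002: 120.0, 2005: 150.0})
```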

Other datasets

Two other datasets, MOD16A2⁴⁴ and TerraClimate⁴⁵, were used to facilitate the selection of reference ground samples for validating the resultant maps. The 8-day evapotranspiration (ET) data from MOD16A2, with a 500-m resolution, were aggregated into monthly mean values to serve as an indicator of irrigation, since irrigated croplands usually have a higher ET than nearby non-irrigated croplands^28,29,46. In addition, we extracted precipitation data from TerraClimate, a dataset of monthly climate and climatic water balance for global terrestrial surfaces⁴⁵, to reflect the climatic background when selecting validation samples.

Existing irrigation maps and pre-processing

Existing irrigation maps include thematic irrigation products and land use/land cover (LULC) products with irrigation information (Table 1). Five thematic irrigation products at global and continental scales were adopted in this study: GMIA-m, the extended version of the Global Map of Irrigation Areas produced by Meier et al.⁴⁷, with a spatial resolution of 1 km and a strong correlation with statistical data (r = 0.84); GRIPC, generated at 500 m resolution using a supervised classification method with remote sensing, climate, and agricultural inventory data, with an overall accuracy of 69%³; GFSAD (Global Food Security-support Analysis Data), from a NASA-funded project that provides high-resolution global cropland data and their water use to sustain global food security⁴⁸; IAAA (Irrigated Area Map for Asia and Africa), developed to indicate irrigated and rainfed croplands in Asia and Africa at 250 m resolution for 2000 and 2010⁴⁹; and Xiang16, produced for mainland China in 2016 with an overall accuracy of 62%³².

Table 1 Input spatial datasets for synergizing into the improved irrigation maps.

Three LULC products were included in this study. The NLCD (National Land Cover Dataset of China) for the years 2000, 2005, 2010, 2015, and 2018 was collected from the Data Center for Resource and Environmental Sciences of the Chinese Academy of Sciences at a 100-m resolution. The maps were generated through a human-computer interactive interpretation method with Landsat imagery and had high overall accuracies of over 90%^50,51. In the NLCD, paddy fields (class code: 11), which indicate rice and other paddy crops such as lotus root, were considered irrigated croplands in this study. Since the NLCD had a five-year interval, we extended the application period of each map to five years, covering the two preceding and the two subsequent years. For example, NLCD-2005 was applied for 2003–2007 (Table S1).

The second LULC product was CCI-LC (Climate Change Initiative Land Cover dataset), produced by the European Space Agency (ESA) (yearly since 1992) and consisted of 22 level-1 classes⁵². The explicit definition of irrigated croplands (class code: 20) in CCI-LC indicated irrigated tree crops, shrub crops, and herbaceous crops, as well as post-flooding cultivation of herbaceous crops. The mean accuracy of irrigated croplands in CCI-LC was 88%⁵². The last LULC product was GLC_FCS (Global Land-Cover product with Fine Classification System), developed for the years 2015 and 2020 using a random forest model with Landsat time series and a global training dataset⁵³; GLC_FCS had the same classification scheme as CCI-LC and had a mean overall accuracy of 82.5%⁵³.

All existing irrigation maps were resampled to 500 m resolution using nearest-neighbor resampling and harmonized to the same coordinate system. We adopted the Albers equal-area projected coordinate system with the WGS84 datum as the base projection. In addition, a unified cropland mask derived from the NLCD was used to exclude non-cropland areas in the input datasets for each year (Table S1).

Generating initial irrigation maps (MIrAD-GI) using a thresholding method

We first generated an intermediate irrigation dataset (MIrAD-GI) by fusing remote sensing and statistics. A statistical downscaling, or thresholding, method^20,21 was used to allocate the statistical area to individual cropland pixels. As illustrated in Fig. 1, before thresholding, we overlaid the MODIS GI images and the NLCD cropland layer to exclude non-cropland pixels. Then, we calculated the peak annual GI value at the pixel scale based on the smoothed GI time-series images (46 values a year, corresponding to the 8-day temporal resolution). Next, within each county, all cropland pixels were ranked in descending order of their GI values. The cropland pixels with the highest GI value were counted and their total area (Area₁) was compared with the nominal statistical irrigation area (Area_c). If Area₁ was less than Area_c, the cropland pixels with GI values equal to the 2nd-highest GI were counted and their area (Area₂) was added and compared with Area_c. If (Area₁ + Area₂) was less than Area_c, the iteration moved on to the 3rd-highest GI and continued until the cumulative area was closest to Area_c. The GI value corresponding to the smallest area gap was the optimal threshold, and all cropland pixels with a GI value not lower than this threshold were labeled as irrigated. This iterative thresholding process was conducted in each county of China to obtain county-level irrigation maps. Finally, all county-level irrigation maps were mosaicked to constitute the provisional irrigation map (MIrAD-GI). MIrAD-GI was then integrated with other irrigation maps into a hybrid dataset for each year (Table S1) using a synergy mapping method, which is described in the following section.
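The county-level thresholding described above can be sketched in a vectorized form. This is a minimal illustration under stated assumptions, not the authors' released code: the function name `gi_threshold` is hypothetical, the input is assumed to hold the peak GI of cropland pixels of one county only, and a constant nominal pixel area of 25 ha (500 m × 500 m on an equal-area grid) is assumed.

```python
import numpy as np


def gi_threshold(peak_gi, target_area, pixel_area=25.0):
    """Find the GI threshold whose cumulative area is closest to the statistics.

    peak_gi: 1-D array of peak GI for the cropland pixels of one county.
    target_area: statistical irrigated area (Area_c), same unit as pixel_area.
    Returns (threshold, boolean irrigated mask).
    """
    peak_gi = np.asarray(peak_gi, dtype=float)
    # Distinct GI values and how many pixels carry each (ascending order).
    values, counts = np.unique(peak_gi, return_counts=True)
    values, counts = values[::-1], counts[::-1]  # highest GI first
    # Area_1, Area_1 + Area_2, ... as lower GI values are admitted.
    cum_area = np.cumsum(counts) * pixel_area
    best = int(np.argmin(np.abs(cum_area - target_area)))
    threshold = values[best]
    return threshold, peak_gi >= threshold


# Toy county: six cropland pixels, 80 ha reported as irrigated.
threshold, mask = gi_threshold([5.0, 4.0, 4.0, 3.0, 2.0, 1.0], target_area=80.0)
```

Here the cumulative areas are 25, 75, 100, 125, and 150 ha, so the second-highest GI value (4.0) gives the smallest gap to 80 ha. Note that this sketch always labels at least the pixels with the highest GI, even when the reported area is very small.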

Synergy mapping by fusing MIrAD-GI, multiple existing maps, and statistics

Synergistic approaches include two types: agreement-scoring method⁵⁴ and regression method⁵⁵. The former one overlays multiple input products and assigns different scores to each pixel based on the agreement level of the input datasets⁵⁴. As for the latter one, it uses geographically weighted regression and crowdsourced validation data to generate a hybrid map based on existing products^55,56. We adopted the former one due to the lack of enough validation samples to run a regression model. The key components of the synergy mapping involve (1) quantifying the weight order of input products; (2) overlaying the input products and calculating pixel-wise scores based on the weight order; (3) downscaling the statistical area into individual cropland pixels according to pixel-wise scores.

Determining a suitable weight order of the input products is the first step of the synergistic approach, and it relies on explicit accuracy information or empirical judgment from experts. Fritz et al.⁵⁷ ranked the input products at a national or regional scale based on the accuracy derived from crowdsourced samples and then integrated them into a hybrid cropland map for 2005. Lu et al.⁵⁸ evaluated the accuracies of several existing cropland datasets with a large number of ground truth points when using the synergy mapping method to produce a hybrid map. When ground samples for assessing the accuracy of input products are lacking, expert judgment can also play an important role. For instance, when generating a cropland product for sub-Saharan Africa from five land cover datasets, Fritz et al.⁵⁴ assigned a higher priority to the regional product derived from higher resolution images or to the more recent product. In this study, we combined accuracy information and expert judgment to obtain the weight order of the input datasets. We preferred accuracy information as the indicator of priority when it was available, and otherwise considered timeliness, spatial resolution, and data performance based on empirical judgment. Empirical judgment played a key role when a regional product derived from higher resolution images or reliable interpretation methods was available but lacked explicit accuracy information. Timeliness was considered because a more recent product is usually perceived as more accurate⁵⁴. For instance, the weight order of the input products in 2010 was NLCD, CCI-LC, MIrAD-GI, IAAA, and GFSAD. NLCD ranked first due to its highest accuracy (over 90%) and reliability (human-computer interactive interpretation method)^50,51, followed by CCI-LC, whose accuracy was 0.88 for irrigated croplands⁵². As an intermediate irrigation product generated by fusing MODIS GI and statistics, MIrAD-GI was closely related to the statistical data and covered the approximate irrigated areas²⁰, and therefore ranked third. Both IAAA and GFSAD lacked an explicit accuracy assessment, so IAAA, with a finer spatial resolution (250 m), ranked higher than GFSAD (1 km) according to our empirical judgment rules. Further, through visual inspection, we found that IAAA captured most irrigation hotspots in China and performed better than GFSAD. As a result, IAAA and GFSAD ranked fourth and fifth, respectively.

The second step of the synergistic approach is to assign each pixel a score based on the agreement among the input products⁵⁹. In general, a pixel with a higher consensus of irrigation among the input products is more likely to be identified as irrigated. Various combinations can be obtained based on the agreement of the input irrigation datasets and their weight order. For instance, the five input datasets adopted in this study were labeled A, B, C, D, and E in descending weight order. In other words, product A ranked first and E last according to their accuracy information or reliability. As illustrated in Table 2, we defined six agreement levels ranging from 0 to 5. Level 5 indicates that all five input datasets identify a pixel as irrigated, and Level 0 indicates that no dataset labels the pixel as irrigated. Five input products yield thirty-two combinations (2⁵), and a pixel is more likely to be labeled as irrigated when it has a higher score. The highest score is 31, corresponding to the highest agreement Level 5, and a score of 0 corresponds to the lowest agreement Level 0. The other agreement levels have multiple scores because several combinations of products are possible. For example, Level 4 has five combinations with scores ranging from 26 to 30. Based on the weight order, if products A, B, C, and D all have the value 1 (irrigated), the score is set to 30; when B, C, D, and E all have the value 1, the score is 26. Similarly, different scores are assigned to the other combinations (Table 2).
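The scoring rule can be illustrated with a short Python sketch. It ranks all 32 combinations first by agreement level and then, within a level, by the weight order of the agreeing products. The within-level ordering used here (treating A as the most significant bit) is an assumption: it reproduces the scores quoted above (31, 30, 26, and 0), but Table 2 remains the authoritative definition. The function names are hypothetical.

```python
from itertools import product

import numpy as np


def agreement_lookup(n_products=5):
    """Map each binary combination (encoded as an integer) to its score."""
    # Sort by agreement level, then by weight order (A is the first element).
    combos = sorted(product((0, 1), repeat=n_products),
                    key=lambda c: (sum(c), c))
    lookup = np.zeros(2 ** n_products, dtype=np.uint8)
    for score, combo in enumerate(combos):
        code = int("".join(str(bit) for bit in combo), 2)
        lookup[code] = score
    return lookup


def score_pixels(stack):
    """Score pixels from binary maps stacked in descending weight order.

    stack: array shaped (n_products, rows, cols) with values 0 or 1.
    """
    stack = np.asarray(stack, dtype=np.int64)
    n = stack.shape[0]
    weights = 2 ** np.arange(n - 1, -1, -1)  # A gets the largest weight
    codes = np.tensordot(weights, stack, axes=1)
    return agreement_lookup(n)[codes]


# Four pixels: ABCDE, ABCD, BCDE, and none of the products.
stack = np.array([[[1, 1, 0, 0]],   # A
                  [[1, 1, 1, 0]],   # B
                  [[1, 1, 1, 0]],   # C
                  [[1, 1, 1, 0]],   # D
                  [[1, 0, 1, 0]]])  # E
scores = score_pixels(stack)
```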

Table 2 The agreement ranking score table for five input products.

The last step is to downscale the statistical irrigation area to individual pixels based on their scores. It is an iterative process that allocates the statistical area to the pixels with the highest scores until the cumulative irrigated area is closest to the statistical area, similar to the thresholding process used to generate MIrAD-GI (Fig. 1). As illustrated in Fig. 2, within a region (e.g., a county or prefecture), the pixels with a score of 31 are counted and their total area is compared with the statistical area (Area_c). If this area is less than Area_c, the pixels with a score of 30 (then 29, 28, …) are counted and their area is added until the cumulative area is closest to Area_c. After this process has been completed in each county, all resulting county-level maps are merged into the final map.
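The score-based allocation can be sketched for several regions at once. This is an illustrative sketch rather than the released implementation: the function name `allocate_by_score`, the integer region raster, the dictionary of statistical areas, and the constant 25 ha pixel area are all assumptions. Pixels with a score of 0 (no product labels them as irrigated) are never selected in this sketch.

```python
import numpy as np


def allocate_by_score(scores, region_ids, stat_area, pixel_area=25.0,
                      max_score=31):
    """Label pixels as irrigated, highest scores first, per region.

    scores: integer raster of agreement scores (0..max_score).
    region_ids: integer raster of region (e.g. county) identifiers.
    stat_area: dict {region_id: statistical irrigated area (Area_c)}.
    """
    scores = np.asarray(scores, dtype=np.int64)
    region_ids = np.asarray(region_ids)
    irrigated = np.zeros(scores.shape, dtype=np.uint8)
    for region, target in stat_area.items():
        in_region = region_ids == region
        # Number of pixels at each score value within this region.
        counts = np.bincount(scores[in_region], minlength=max_score + 1)
        # Cumulative area when admitting scores max_score, ..., 1.
        cum_area = np.cumsum(counts[:0:-1]) * pixel_area
        k = int(np.argmin(np.abs(cum_area - target)))
        cutoff = max_score - k
        irrigated[in_region & (scores >= cutoff)] = 1
    return irrigated


# Two toy regions of four pixels each.
scores = np.array([[31, 30, 26, 5],
                   [31, 12, 0, 0]])
regions = np.array([[1, 1, 1, 1],
                    [2, 2, 2, 2]])
result = allocate_by_score(scores, regions, {1: 50.0, 2: 30.0})
```

In region 1, the scores 31 and 30 together give exactly 50 ha; in region 2, the single pixel with a score of 31 (25 ha) is closer to 30 ha than admitting the next pixel would be.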

Data Records

The annual irrigation dataset with a 500-m resolution is provided for China for 2000–2019. The dataset is available at the figshare repository in GeoTIFF format⁶⁰. The spatial reference system of this dataset is EPSG:4326 (WGS 1984). All the maps in the dataset are binary maps, with 1 indicating irrigated and 0 indicating non-irrigated pixels. The maps can be visualized and analyzed in ArcGIS or QGIS.
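Because the maps are distributed in geographic coordinates (EPSG:4326), the ground area of a pixel shrinks with latitude, so the irrigated area cannot be obtained by multiplying a pixel count by a constant. The following sketch estimates the irrigated area from a binary array under a spherical Earth approximation. The function name and the `north` and `pixel_deg` arguments are assumptions; in practice they would be read from the GeoTIFF georeferencing, and reprojecting to an equal-area system is an equally valid alternative.

```python
import numpy as np

EARTH_RADIUS_M = 6371008.8  # mean Earth radius, spherical approximation


def irrigated_area_km2(mask, north, pixel_deg):
    """Area of pixels equal to 1 on a north-up latitude/longitude grid.

    mask: 2-D binary array (1 = irrigated, 0 = non-irrigated).
    north: latitude of the upper edge of the first row, in degrees.
    pixel_deg: pixel size in degrees (assumed square pixels).
    """
    mask = np.asarray(mask)
    rows = np.arange(mask.shape[0])
    lat_top = np.radians(north - rows * pixel_deg)
    lat_bottom = np.radians(north - (rows + 1) * pixel_deg)
    # Area of one grid cell per row: R^2 * dlon * (sin(top) - sin(bottom)).
    cell_area = (EARTH_RADIUS_M ** 2 * np.radians(pixel_deg)
                 * (np.sin(lat_top) - np.sin(lat_bottom)))
    irrigated_per_row = (mask == 1).sum(axis=1)
    return float((irrigated_per_row * cell_area).sum() / 1e6)


# A single 1-degree cell centred on the equator and one centred on 60°N.
area_equator = irrigated_area_km2(np.ones((1, 1)), north=0.5, pixel_deg=1.0)
area_60n = irrigated_area_km2(np.ones((1, 1)), north=60.5, pixel_deg=1.0)
```

The cell at 60°N covers half the area of the equatorial cell, which shows how large the error from a constant pixel area could become across the latitudinal range of China.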

Technical Validation

We adopted two methods to assess the performance of the resultant maps, including pixel-wise validation with nationwide reference data and inter-comparison with existing irrigation products at national and local scales.

Accuracy assessment using ground truth samples

To implement pixel-wise validation, we randomly collected 5,648 ground truth points for five years (2000, 2005, 2010, 2015, and 2019) based on the following three rules. First, croplands close to lakes, rivers, and reservoirs are more likely to be irrigated than those far from water sources. Based on Google Earth images, green homogeneous crop fields in the vicinity of water were initially labeled as irrigated (Fig. 3b). In contrast, croplands far from water and lacking clear evidence of irrigation infrastructure (e.g., ditches and canals) were identified as non-irrigated (Fig. 3c). Second, we examined the water use of crops within each pixel by plotting monthly time series of ET and precipitation (Fig. 4a). Generally, irrigated crops have a higher ET than non-irrigated crops of the same region in the growing season³⁶, which serves as evidence to distinguish irrigated from non-irrigated croplands. For instance, most winter wheat in the North China Plain is irrigated in April, when it turns green, because precipitation is insufficient. Irrigated crops had a higher monthly ET than rain-fed crops; specifically, the ET from March to May exceeded precipitation due to irrigation (Fig. 4b). Third, since irrigated crops are usually greener and have a higher peak greenness index than non-irrigated crops²⁰ within the same extent, we further compared the peak greenness index between them. In total, 5,648 validation samples were collected for the five years (Table 3, Fig. 3a).

Table 3 Accuracy assessment of the resultant irrigation maps using validation samples.

We adopted five evaluation metrics as indicators of the performance of the resultant maps: the producer’s accuracy (PA, corresponding to omission error), user’s accuracy (UA, corresponding to commission error), overall accuracy (OA), Kappa coefficient, and F1-score. As illustrated in Table 3, the mean overall accuracy was 0.765, with a kappa coefficient of 0.680 and an F1-score of 0.773. All of them were higher than the accuracies reported by Zhu et al.³¹ and Xiang et al.³². The highest and lowest accuracies were found in 2000 (0.810) and 2010 (0.732), respectively. It is worth noting that the maps had a lower PA than UA, corresponding to a higher omission error than commission error. This may be due to the potential underestimation of irrigated croplands in the statistical data, since our synergy mapping method relied heavily on statistics. Several studies have argued that irrigation statistics in developing countries are prone to underestimation for various reasons, including biased sampling methods and political factors^47,61.
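For reference, the five metrics can be computed from a binary confusion matrix as sketched below, taking "irrigated" as the positive class. The function name and the counts in the example are illustrative assumptions and are not taken from Table 3.

```python
def binary_accuracy(tp, fp, fn, tn):
    """PA, UA, OA, Kappa and F1 for the irrigated class.

    tp: irrigated samples mapped as irrigated
    fp: non-irrigated samples mapped as irrigated (commission)
    fn: irrigated samples mapped as non-irrigated (omission)
    tn: non-irrigated samples mapped as non-irrigated
    """
    n = tp + fp + fn + tn
    pa = tp / (tp + fn)   # producer's accuracy = 1 - omission error
    ua = tp / (tp + fp)   # user's accuracy = 1 - commission error
    oa = (tp + tn) / n    # overall accuracy
    # Agreement expected by chance, from the row and column totals.
    pe = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    kappa = (oa - pe) / (1 - pe)
    f1 = 2 * pa * ua / (pa + ua)
    return {"PA": pa, "UA": ua, "OA": oa, "Kappa": kappa, "F1": f1}


# Made-up counts for 100 validation samples.
metrics = binary_accuracy(tp=40, fp=10, fn=20, tn=30)
```

With these made-up counts, PA (0.667) is lower than UA (0.8), which is the same pattern of omission exceeding commission discussed above.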

Inter-comparison with existing irrigation products

We compared the resultant map with the five input irrigation products for qualitative assessment. Taking the maps in 2010 as an example, we found both similarities and disparities in irrigated areas among these products (Fig. 5). The resultant map matched well with MIrAD-GI and IAAA in most regions, and all three captured the irrigation hotspots, such as the Sanjiang Plain and Liaohe Plain in Northeast China, the Hetao Plain and Guanzhong Plain in the middle reaches of the Yellow River, and the North China Plain. Specifically, the resultant map and MIrAD-GI had highly similar distributions of irrigated croplands, since both were related to GI values and MIrAD-GI played a key role in the synergy process that produced the improved dataset. However, IAAA suffered from considerable overestimation, especially in the west of Heilongjiang Province, where rainfed corn and soybean are widely planted⁶² (Fig. 5c). GFSAD suffered from considerable omission in South China (Fig. 5d), and CCI-LC failed to capture the irrigated croplands in the Sanjiang Plain and North China Plain (Fig. 5e). NLCD did not identify the irrigated croplands in Northern China (Fig. 5f) because its definition of irrigated areas only referred to paddy fields.

Specifically, we selected two irrigation districts for further comparison. In an important rice base in Northeast China (Fig. 6a), both IAAA and CCI-LC failed to detect the irrigated areas. The irrigated area in GFSAD was small, which was far from reality. MIrAD-GI was similar to the resultant map but omitted the important paddy fields in the bottom right corner. The south of Hetao Plain (Fig. 6b), a well-known agricultural base along the Yellow River, was not identified by IAAA; the GFSAD covered most but suffered from underestimation, followed by CCI-LC. MIrAD-GI also omitted some irrigated areas and NLCD had a similar performance to the resultant map.

Uncertainty analysis

Several sources of uncertainty may have led to limitations in the resultant maps. First, the statistical data may be inadequate in some regions, especially where the economy is poorly developed. The earlier surveying and sampling methods adopted by local statistical bureaus may introduce uncertainties into the inventory, and the varied political bias of local water managers may also have an impact on the reported area⁶¹. Furthermore, compared to the irrigated area detected from remote sensing images, statistical data usually suffer from underestimation, because the reported area focuses on irrigation districts equipped with infrastructure and neglects subsistence-level farmland managed by smallholders⁴⁷. Nevertheless, statistical data appear to be the only reliable data source covering most irrigated areas in China.

Second, the hybrid map in this study was highly dependent on the source products and their weight order. The uncertainty and bias in the source products may propagate to the resultant map. These errors can be controlled to a large extent through the overlaying and scoring method based on the weight order, but some residual errors remain. Third, our weight ranking was implemented at the national level, but the accuracy of the source products may vary at local scales, which would result in uncertainties. Thus, locally adaptive weight orders could be considered in the future if more knowledge about the spatial variation in their accuracies becomes available. Fourth, although we validated the resultant maps using more than five thousand reference samples extracted from high-resolution Google Earth images with well-defined rules, some bias may still exist due to other factors such as georeferencing error and human interpretation error. Reference samples from field surveys or irrigation maps from planning authorities may better support the validation process and increase the reliability of the final product. Last, there may be other uncertainties, such as the error introduced by irrigated fields where crops do not reach the peak GI because of pests, diseases, soil issues (e.g., salinity), and other reasons, leading to omission and commission errors.

Usage Notes

As an important agricultural practice, irrigation contributes substantially to grain production and food security. Our irrigation dataset, with a 500-m resolution for the last two decades, provides basic data for various scientific investigations and engineering applications. The dataset can be used to track the spatiotemporal extent of irrigated croplands and their inter-annual change at regional scales (Fig. 7) in China³³. Water departments can use this spatiotemporal dataset to evaluate the performance of irrigation infrastructure, since many facilities are outdated and need to be upgraded. Policymakers can make better-informed decisions on financial support for water construction according to our dataset. The time-series maps will enable water managers to investigate water consumption⁶³, predict future water requirements, and better allocate water between sectors such as agriculture and industry. Managers in agricultural sectors can use our dataset to assess planting structures and rotations, predict grain production, and explore the potential yield gap to be filled. Under a warming climate, our dataset may also be used to evaluate the climatic effects of irrigation, including effects on temperature^15,16, evapotranspiration^64,65, wind, and precipitation⁶⁶. Further, the dataset can help us understand regional hydrological cycles and climate change.

Code availability

Python code used to generate the irrigation maps is available from the figshare repository⁶⁰.

The software used in this work includes:

∙ ArcGIS 10.2

∙ Python 2.7, numpy 1.16.6, pandas 0.19.0, scipy 1.2.3, scikit-learn 0.20.3, matplotlib 1.1.1

References

Godfray, H. C. J. et al. Food Security: The Challenge of Feeding 9 Billion People. Science 327, 812–818 (2010).
Tilman, D., Balzer, C., Hill, J. & Befort, B. L. Global food demand and the sustainable intensification of agriculture. Proc Natl Acad Sci USA 108, 20260–20264 (2011).
Salmon, J. M., Friedl, M. A., Frolking, S., Wisser, D. & Douglas, E. M. Global rain-fed, irrigated, and paddy croplands: A new high resolution map derived from remote sensing, crop inventories and climate data. Int J Appl Earth Obs Geoinf 38, 321–334 (2015).
Siebert, S. & Döll, P. Quantifying blue and green virtual water contents in global crop production as well as potential production losses without irrigation. J Hydrol 384, 198–217 (2010).
Thenkabail, P., Hanjra, M., Dheeravath, V. & Gumma, M. A Holistic View of Global Croplands and Their Water Use for Ensuring Global Food Security in the 21st Century through Advanced Remote Sensing and Non-remote Sensing Approaches. Remote Sens 2, 211–261 (2010).
Deines, J. M., Kendall, A. D., Butler, J. J., Basso, B. & Hyndman, D. W. Combining Remote Sensing and Crop Models to Assess the Sustainability of Stakeholder‐Driven Groundwater Management in the US High Plains Aquifer. Water Resour Res (2021).
Zhang, K., Li, X., Zheng, D., Zhang, L. & Zhu, G. Estimation of Global Irrigation Water Use by the Integration of Multiple Satellite Observations. Water Resour Res 58, e2021WR030031 (2022).
Ozdogan, M. & Salvucci, G. D. Irrigation-induced changes in potential evapotranspiration in southeastern Turkey: Test and application of Bouchet’s complementary hypothesis. Water Resour Res 40 (2004).
Sacks, W. J., Cook, B. I., Buenning, N., Levis, S. & Helkowski, J. H. Effects of global irrigation on the near-surface climate. Clim Dyn 33, 159–175 (2009).
Zhu, G. et al. Daytime and nighttime warming has no opposite effects on vegetation phenology and productivity in the northern hemisphere. Sci Total Environ 822, 153386 (2022).
Pei, Y., Li, X. & Yang, M. Changes in Irrigated Areas and the Types of Cropland in China Since 2000 (in Chinese). J Irrig Drain 37, 1–8 (2018).
Piao, S. et al. The impacts of climate change on water resources and agriculture in China. Nature 467, 43–51 (2010).
Sun, H., Zhang, X., Wang, E., Chen, S. & Shao, L. Quantifying the impact of irrigation on groundwater reserve and crop production – A case study in the North China Plain. Eur J Agron 70, 48–56 (2015).
Wu, J. H., Li, P. Y., Qian, H. & Fang, Y. Assessment of soil salinization based on a low-cost method and its influencing factors in a semi-arid agricultural area, northwest China. Environ Earth Sci 71, 3465–3475 (2014).
Liu, W. et al. Biophysical effects of paddy rice expansion on land surface temperature in Northeastern Asia. Agric For Meteorol (2022).
Pan, T. et al. Large-scale rain-fed to paddy farmland conversion modified land-surface thermal properties in Cold China. Sci Total Environ 722, 137917 (2020).
Siebert, S. et al. Development and validation of the global map of irrigation areas. Hydrol Earth Syst Sci 9, 535–547 (2005).
Siebert, S., Henrich, V., Frenken, K. K. B. Update of the Digital Global Map of Irrigation Areas to Version 5; Rheinische Friedrich-Wilhelms-University: Bonn, Germany; Food and Agriculture Organization of the United Nations: Rome, Italy. http://www.fao.org/nr/water/aquastat/irrigationmap/index10.stm (Accessed 23 December 2021). (2013).
Thenkabail, P. S. et al. Global irrigated area map (GIAM), derived from remote sensing, for the end of the last millennium. Int J Remote Sens 30, 3679–3733 (2009).
Pervez, M. S. & Brown, J. F. Mapping Irrigated Lands at 250-m Scale by Merging MODIS Data and National Agricultural Statistics. Remote Sens 2, 2388–2412 (2010).
Brown, J. F. & Pervez, M. S. Merging remote sensing data and national agricultural statistics to model change in irrigated agriculture. Agric Syst 127, 28–40 (2014).
Shrestha, D., Brown, J. F., Benedict, T. D. & Howard, D. M. Exploring the Regional Dynamics of U.S. Irrigated Agriculture from 2002 to 2017. Land 10, 394–409 (2021).
Ozdogan, M. & Gutman, G. A new methodology to map irrigated areas using multi-temporal MODIS and ancillary data: An application example in the continental US. Remote Sens Environ 112, 3520–3537 (2008).
Ambika, A. K., Wardlow, B. & Mishra, V. Remotely sensed high resolution irrigated area mapping in India for 2000 to 2015. Sci Data 3, 118–131 (2016).
Xie, Y., Lark, T. J., Brown, J. F. & Gibbs, H. K. Mapping irrigated cropland extent across the conterminous United States at 30 m resolution using a semi-automatic training approach on Google Earth Engine. ISPRS J Photogramm Remote Sens 155, 136–149 (2019).
Xie, Y., Gibbs, H. K. & Lark, T. J. Landsat-based Irrigation Dataset (LANID): 30-m resolution maps of irrigation distribution, frequency, and change for the US, 1997–2017. Earth Syst Sci Data 13, 5689–5710 (2021).
Xie, Y. & Lark, T. J. Mapping annual irrigation from Landsat imagery and environmental variables across the conterminous United States. Remote Sens Environ 260, 112445–112461 (2021).
Deines, J. M. et al. Mapping three decades of annual irrigation across the US High Plains Aquifer using Landsat and Google Earth Engine. Remote Sens Environ 233, 111400–111417 (2019).
Deines, J. M., Kendall, A. D. & Hyndman, D. W. Annual Irrigation Dynamics in the U.S. Northern High Plains Derived from Landsat Satellite Data. Geophys Res Lett 44, 9350–9360 (2017).
Chance, E., Cobourn, K. & Thomas, V. Trend Detection for the Extent of Irrigated Agriculture in Idaho’s Snake River Plain, 1984–2016. Remote Sens 10 (2018).
Zhu, X., Zhu, W., Zhang, J. & Pan, Y. Mapping Irrigated Areas in China From Remote Sensing and Statistical Data. IEEE J Sel Top Appl Earth Obs Remote Sens 7, 4490–4504 (2014).
Xiang, K., Yuan, W., Wang, L. & Deng, Y. An LSWI-Based Method for Mapping Irrigated Areas in China Using Moderate-Resolution Satellite Data. Remote Sens 12, 4181–4195 (2020).
Zhang, C., Dong, J., Zuo, L. & Ge, Q. Tracking spatiotemporal dynamics of irrigated croplands in China from 2000 to 2019 through the synergy of remote sensing, statistics, and historical irrigation datasets. Agric Water Manage 263, 107458–107470 (2022).
Peng, S., Ding, Y., Liu, W. & Li, Z. 1 km monthly temperature and precipitation dataset for China from 1901 to 2017. Earth Syst Sci Data 11, 1931–1946 (2019).
Biggs, T. W. et al. Irrigated area mapping in heterogeneous landscapes with MODIS time series, ground truth and census data, Krishna Basin, India. Int J Remote Sens 27, 4245–4266 (2007).
Peña-Arancibia, J. L. et al. Dynamic identification of summer cropping irrigated areas in a large basin experiencing extreme climatic variability. Remote Sens Environ 154, 139–152 (2014).
Xu, T., Deines, J., Kendall, A., Basso, B. & Hyndman, D. Addressing Challenges for Mapping Irrigated Fields in Subhumid Temperate Regions by Integrating Remote Sensing and Hydroclimatic Data. Remote Sens 11 (2019).
Chen, Y. et al. Detecting irrigation extent, frequency, and timing in a heterogeneous arid agricultural region using MODIS time series, Landsat imagery, and ancillary data. Remote Sens Environ 204, 197–211 (2018).
Yan, H. et al. Modeling gross primary productivity for winter wheat–maize double cropping system using MODIS time series and CO2 eddy flux tower data. Agric Ecosyst Environ 129, 391–400 (2009).
Wilson, T. B. & Meyers, T. P. Determining vegetation indices from solar and photosynthetically active radiation fluxes. Agric For Meteorol 144, 160–179 (2007).
Gitelson, A. A., Gritz, Y. & Merzlyak, M. N. Relationships between leaf chlorophyll content and spectral reflectance and algorithms for non-destructive chlorophyll assessment in higher plant leaves. J Plant Physiol 160, 271–282 (2003).
Gitelson, A. A., Vina, A., Ciganda, V., Rundquist, D. C. & Arkebauer, T. J. Remote estimation of canopy chlorophyll content in crops. Geophys Res Lett 32, L08403 (2005).
Savitzky, A. & Golay, M. J. E. Smoothing and differentiation of data by simplified least squares procedures. Anal Chem 36, 1627–1639 (1964).
Mu, Q., Zhao, M. & Running, S. W. Improvements to a MODIS global terrestrial evapotranspiration algorithm. Remote Sens Environ 115, 1781–1800 (2011).
Abatzoglou, J. T., Dobrowski, S. Z., Parks, S. A. & Hegewisch, K. C. TerraClimate, a high-resolution global dataset of monthly climate and climatic water balance from 1958–2015. Sci Data 5, 170191–170202 (2018).
Deines, J. M., Kendall, A. D., Butler, J. J. & Hyndman, D. W. Quantifying irrigation adaptation strategies in response to stakeholder-driven groundwater management in the US High Plains Aquifer. Environ Res Lett 14 (2019).
Meier, J., Zabel, F. & Mauser, W. A global approach to estimate irrigated areas – a comparison between different data and statistics. Hydrol Earth Syst Sci 22, 1119–1133 (2018).
Teluguntla, P. et al. Global Cropland Area Database (GCAD) derived from Remote Sensing in Support of Food Security in the Twenty-first Century: Current Achievements and Future Possibilities. Chapter 7, Vol II Land Resources: Monitoring, Modelling, and Mapping, Remote Sensing Handbook edited by Prasad S Thenkabail In Press, (2014).
Siddiqui, S., Cai, X. & Chandrasekharan, K. Irrigated Area Map Asia and Africa. International Water Management Institute http://waterdata.iwmi.org/applications/irri_area/ (Accessed 24 December 2021). (2016).
Liu, J. et al. Spatiotemporal characteristics, patterns, and causes of land-use changes in China since the late 1980s. J Geog Sci 24, 195–210 (2014).
Liu, J. et al. Spatial and temporal patterns of China’s cropland during 1990–2000: An analysis based on Landsat TM data. Remote Sens Environ 98, 442–456 (2005).
Defourny, P. et al. Land Cover CCI: Product User Guide Version 2. http://maps.elie.ucl.ac.be/CCI/viewer/download.php (Accessed 5 May 2021). (2016).
Zhang, X. et al. GLC_FCS30: global land-cover product with fine classification system at 30 m using time-series Landsat imagery. Earth Syst Sci Data 13, 2753–2776 (2021).
Fritz, S. et al. Cropland for sub-Saharan Africa: A synergistic approach using five land cover data sets. Geophys Res Lett 38, L04404 (2011).
See, L. et al. Building a hybrid land cover map with crowdsourcing and geographically weighted regression. ISPRS J Photogramm Remote Sens 103, 48–56 (2015).
Schepaschenko, D. et al. Development of a global hybrid forest mask through the synergy of remote sensing, crowdsourcing and FAO statistics. Remote Sens Environ 162, 208–220 (2015).
Fritz, S. et al. Mapping global cropland and field size. Glob Chang Biol 21, 1980–1992 (2015).
Lu, M. et al. A Synergy Cropland of China by Fusing Multiple Existing Maps and Statistics. Sensors (Basel) 17, 1613–1628 (2017).
Jung, M., Henkel, K., Herold, M. & Churkina, G. Exploiting synergies of global land cover products for carbon cycle modeling. Remote Sens Environ 101, 534–553 (2006).
Zhang, C., Dong, J. & Ge, Q. The 500-m irrigated cropland maps in China during 2000–2019 based on a synergy mapping method, figshare, https://doi.org/10.6084/m9.figshare.19352501.v1 (2022).
Ozdogan, M., Yang, Y., Allez, G. & Cervantes, C. Remote Sensing of Irrigated Agriculture: Opportunities and Challenges. Remote Sens 2, 2274–2304 (2010).
You, N. et al. The 10-m crop type maps in Northeast China during 2017–2019. Sci Data 8 (2021).
Yin, L. et al. Irrigation water consumption of irrigated cropland and its dominant factor in China from 1982 to 2015. Adv Water Resour 143, 103661 (2020).
Zhang, X., Xiong, Z. & Tang, Q. Modeled effects of irrigation on surface climate in the Heihe River Basin, Northwest China. J Geophys Res: Atmos 122, 7881–7895 (2017).
Zhang, X., Ding, N., Han, S. & Tang, Q. Irrigation‐Induced Potential Evapotranspiration Decrease in the Heihe River Basin, Northwest China, as Simulated by the WRF Model. J Geophys Res: Atmos 125 (2020).
Liu, J., Jin, J. & Niu, G. Y. Effects of Irrigation on Seasonal and Annual Temperature and Precipitation over China Simulated by the WRF Model. J Geophys Res: Atmos 126 (2021).
Liu, L., Zhang, X., Chen, X., Gao, Y. & Mi, J. GLC_FCS30: Global land-cover product with fine classification system at 30m using time-series Landsat imagery. Zenodo https://doi.org/10.5281/zenodo.3986872 (2020).
Liu, L., Zhang, X., Chen, X., Gao, Y. & Mi, J. GLC_FCS30-2020: Global Land Cover with Fine Classification System at 30m in 2020. Zenodo https://doi.org/10.5281/zenodo.4280923 (2020).

Acknowledgements

This study was supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDA23100400, XDA19040301) and the National Natural Science Foundation of China (Grant No. 41871349).

Author information

Authors and Affiliations

Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, 100101, China
Chao Zhang, Jinwei Dong & Quansheng Ge
University of Chinese Academy of Sciences, Beijing, 100049, China
Chao Zhang

Contributions

Chao Zhang: Conceptualization, methodology, software, dataset production, validation, visualization, writing - original draft preparation. Jinwei Dong: Conceptualization, methodology, writing - reviewing and editing, supervision, funding acquisition. Quansheng Ge: Conceptualization, supervision, funding acquisition.

Corresponding author

Correspondence to Quansheng Ge.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Zhang, C., Dong, J. & Ge, Q. Mapping 20 years of irrigated croplands in China using MODIS and statistics and existing irrigation products. Sci Data 9, 407 (2022). https://doi.org/10.1038/s41597-022-01522-z

Received: 17 March 2022
Accepted: 22 June 2022
Published: 15 July 2022
DOI: https://doi.org/10.1038/s41597-022-01522-z
