A data set of distributed global population and water withdrawal from 1960 to 2020

Yan, Denghua; Zhang, Xin; Qin, Tianling; Li, Chenhao; Zhang, Jianyun; Wang, Hao; Weng, Baisha; Wang, Kun; Liu, Shanshan; Li, Xiangnan; Yang, Yuheng; Li, Weizhi; Lv, Zhenyu; Wang, Jianwei; Li, Meng; He, Shan; Liu, Fang; Bi, Wuxia; Xu, Ting; Shi, Xiaoqing; Man, Zihao; Sun, Congwu; Liu, Meiyu; Wang, Mengke; Huang, Yinghou; Long, Haoyu; Niu, Yongzhen; Dorjsuren, Batsuren; Gedefaw, Mohammed; Li, Yizhe; Tian, Zihao; Mu, Shizhou; Wang, Wenyu; Zhou, Xiaoxiang

doi:10.1038/s41597-022-01760-1

Download PDF

Data Descriptor
Open access
Published: 21 October 2022

A data set of distributed global population and water withdrawal from 1960 to 2020

Denghua Yan¹,
Xin Zhang¹,
Tianling Qin ORCID: orcid.org/0000-0002-6073-6744¹,
Chenhao Li ORCID: orcid.org/0000-0002-9131-9230¹,
Jianyun Zhang²,
Hao Wang¹,
Baisha Weng¹,
Kun Wang¹,
Shanshan Liu¹,
Xiangnan Li ORCID: orcid.org/0000-0001-5406-8068¹,
Yuheng Yang¹,
Weizhi Li¹,
Zhenyu Lv¹,
Jianwei Wang¹,
Meng Li ORCID: orcid.org/0000-0003-1884-1521¹,
Shan He¹,
Fang Liu¹,
Wuxia Bi ORCID: orcid.org/0000-0003-0058-6286¹,
Ting Xu¹,
Xiaoqing Shi¹,
Zihao Man¹,
Congwu Sun¹,
Meiyu Liu¹,
Mengke Wang¹,
Yinghou Huang¹,
Haoyu Long¹,
Yongzhen Niu¹,
Batsuren Dorjsuren ORCID: orcid.org/0000-0002-7864-8291¹,
Mohammed Gedefaw¹,
Yizhe Li¹,
Zihao Tian¹,
Shizhou Mu¹,
Wenyu Wang¹ &
…
Xiaoxiang Zhou¹

Scientific Data volume 9, Article number: 640 (2022) Cite this article

3623 Accesses
2 Citations
30 Altmetric
Metrics details

Subjects

Abstract

Population and water withdrawal data sets are currently faced with difficulties in collecting, processing and verifying multi-source time series, and the spatial distribution characteristics of long series are also relatively lacking. Time series is the basic guarantee for the accuracy of data sets, and the production of long series spatial distribution is a realistic requirement to expand the application scope of data sets. Through the time-consuming and laborious basic processing work, this research focuses on the population and water intake time series, and interpolates and extends them to specific land uses to ensure the accuracy of the time series and the demand of spatially distributed data sets. This research provides a set of population density and water intensity products from 1960 to 2020 distributed to the administrative units or the corresponding regions. The data set fills the gaps in the multi-year data set for the accuracy of population density and the intensity of water withdrawal.

Measurement(s)	distributed global population and water withdrawal
Technology Type(s)	mathematical statistics and analysis

Global monthly sectoral water use for 2010–2100 at 0.5° resolution across alternative futures

Article Open access 11 April 2023

Sensitivity of subregional distribution of socioeconomic conditions to the global assessment of water scarcity

Article Open access 25 June 2022

Urban water and electricity demand data for understanding climate change impacts on the water-energy nexus

Article Open access 23 January 2024

Background & Summary

The rapid increase in global water use can be attributed to factors such as population growth and economic development¹. the world’s population still continues to grow, albeit at a slowing rate, but there are significant differences in regional characteristics². The increase in population and water withdrawal demand makes it difficult to make decisions about water allocation, and the demand competition between population and water withdrawal further exacerbates the risk of regional conflicts³. Identifying, measuring and expressing the value of water is the focus of the United Nations World Water Development at this stage², but it is also inseparable from the research on the changes in world population development and water withdrawal patterns, especially the demand for relevant long basic data sets. Obtaining sector and high-precision reference data through deep mining has taken a key step in the refinement of water withdrawal^4,5,6, but long series of basic data usually come from FAO and other international organizations. Their data is often time-periodized and discrete, and there is almost no continuous long series of population and water withdrawal data sets globally⁷.

It is also an indisputable fact that the data published by international organizations do have errors (There is data duplication in different time periods⁸). The population or water withdrawal data released by the World Bank, FAO and other international organizations may deviate from the real population or water withdrawal data of a country or region due to statistical caliber, and its time series accuracy needs to be further corrected^9,10,11. We believe that the statistical data of relevant departments in various countries are more authoritative, but the collection and other work are very difficult. This data set collects, interpolates and extrapolates the data published by government agencies to achieve horizontal expansion of the basic data. It is the foundation of the follow-up research work and the work that cannot be ignored to fundamentally improve the accuracy of basic data.

Basic work of collection, induction and interpolation are carried out for individual countries, and it is difficult to establish spatial connections. Considering the need for spatialization of basic data and comparability of regional development, two variables of population density and water intensity¹¹ were introduced. Among them, population density refers to the population per unit area (person /km²), and water intensity refers to the water withdrawal per unit area (m³/km²). Refined spatial classification is still a difficult problem at this stage. This data set only realizes spatial distribution on administrative units and land use. If there is a more refined classification, spatial connections can be established independently based on the population and water withdrawal data sets in this data set.

This data set includes a set of population density products distributed to the administrative units from 1960 to 2020, a set of water intensity products distributed to the administrative units, a set of population density products distributed to an artificial surface, a set of water intensity products distributed to the artificial surface and cultivated land, an EXCEL file for the revised population of different countries, and an EXCEL file for the total amount of water withdrawal of different countries (Tables 1 and 2). Considering that the naming of low-level administrative units in different countries is different, the concept of sub-national is adopted in this set data, and the spatial boundaries of population density and water intensity are shown in Fig. 1.

Table 1 Input data sets used to produce the global population and water withdrawal products.

Full size table

Table 2 The global population and water withdrawal products.

Full size table

Data cover almost all regions of the world. Population data include 214 national units, 1805 national or sub-national units. Water data include 214 national units and 616 national or sub-national units. Because of the difficulties in obtaining regional data in some countries, sub-national data are replaced by national data. There may be some errors in the statistics and collection of the population and water withdrawal data in various countries, which may lead to deviations between the data set and the real data. Therefore, further considering the data trend of international organizations and referring to the officially released data, the accuracy of the data set of this study is sufficient to be effectively guaranteed. It can be used as the basic information for the study of global climate change, environmental resources, regional economy and political decision-making. With the improvement of the collection of relevant credible data or the accuracy of the original data acquisition in the future, the data set can be amended and supplemented.

Methods

In this chapter, we describe in detail the method of data set generation, including data collection, data modification and interpolation extension, and grid data generation (Fig. 2).

First, the collection of population and water withdrawal data. Collect as much as possible of the national and sub-national permanent population and water withdrawal data released by governments and institutions on a global scale. Here we provide the source of our data collection.

Second, establish a national and sub-national default data interpolation model. Based on the shape of the sample data scatter plot, determine the most appropriate curve model. The simulation modeling is implemented by EXCEL and provided one by one according to the national level.

Third, create spatial distribution grids. Spread the population density to the administrative unit and artificial surface, and spread the water intensity to the administrative unit, and artificial surface and cultivated land (Spatial distribution section for details).

Fourth, data verification. For population data, we compare the global population of the revised results with data of the World Bank and FAO, and calculate the correlation and deviation between the revised results and the other two sets of data. For the water withdrawal data, we divide the measured data into calibration and verification periods, re-interpolate the data using the data of the calibration period, and then verify the simulation accuracy by using the data of the verification period and the simulation.

Data collection and pretreatment

The data sources include government population data for xx nation and xx sub-nation, government water withdrawal data for xx nations and xx sub-nations, national population and water withdrawal data from the World Bank¹², and national population and water withdrawal data from FAO⁸, water withdrawal data from the United Nation¹³, national population and water withdrawal data from Eurostat¹⁴, and Globeland30¹⁵ data for 2000 and 2010. Among them, xx refers to one of many countries in the data set, and only serves as an indicator.

Globally, it is believed that the accuracy rate of census results obtained by counting the population of various administrative units in the country is the highest at present when a large amount of manpower and material resources are spent by the country itself¹⁶. In addition to the census conducted every certain year, the statistical department gets a high accuracy rate by calculating the overall figures according to the sample survey of population changes and the random sample survey of fertility rate in some areas and some units. To sum up, we believe that the data released by our country on the statistical official website is the most reliable.

When national population data are missing, it is generally believed that the data and trends of the World Bank and FAO are authoritative. When the data of the World Bank and FAO are complete, the World Bank data prevails as reference population data. When the length of World Bank data is shorter that of than FAO, the FAO data is used as reference population data¹⁷.

For water withdrawal data, FAO and UN data are generally considered authoritative when government water withdrawal data is missing. When the FAO and UN data are both complete, the FAO data is used as a reference for water withdrawal data.

Interpolation and extrapolation of national and sub-national population data

When the lack of data is obvious, the results obtained by the simplest method often have more reference value. The following four basic methods are used for the processing of population data^{9,10,11,18,19,20}.

Interpolation method assuming increasing in arithmetic series

If discontinuities exist in government data, and the number of data increases in arithmetic series according to the judgement, then the linear interpolation method can be used based on a linear model of arithmetic series growth. This method is suitable for interval data interpolation with a short interruption time and relatively uniform data growth scale. The interpolation model is as follows:

$${P}_{N,k}=\left[\frac{I\left(j\right)-I\left(i\right)}{j-i}\cdot \left(k-i\right)+I\left(i\right)\right]\cdot {P}_{W,k}$$

(1)

Where, P_N,k is the government data for the k year, i ≤ k ≤ j; P_W,k is the reference data for the k year; I(j) and I(i) are the ratios of government data to reference data for the j year and i year, respectively.

Trend extrapolation method based on general trend curve model

If there are continuous points in the government data, it is better to obtain interpolation results by assisting based on the trend of the ratio of government data to reference data. General trend line functions such as linear, conic, cubic and exponential curves can be used, and the fitting result needs to be comprehensively judged by the linear change of the reference data, and finally a more suitable interpolation result can be obtained. This method is more suitable for interval data interpolation with shorter time and faster data growth.

$${P}_{N,k}=F(k)\cdot {P}_{W,k}$$

(2)

where, P_N,k is the government data in the k year, i ≤ k ≤ j; P_W,k is the reference data for the k year; F(k) is the trend for the ratio of government data to reference data in the k year.

Scale up to the same ratio

If there is only one year of government data, then the reference data will be scaled up to the same ratio according to the ratio of government data to the reference data of the corresponding year.

$$I=\frac{{P}_{N}}{{P}_{W}},{P}_{N,o}=I\cdot {P}_{W,o}$$

(3)

Where, P_N is the government data; P_W is the reference data; I is the ratio of government data to reference data; P_N,o is the default government data; P_W,o is the reference data corresponding to the default; o is the default year.

Based entirely on government data or reference data

If there is complete government data, the government data is used as the final population result. If there is no government data, the reference data is used as the final result of the population.

Interpolation and extrapolation of national and sub-national water withdrawal data

The total amount of water withdrawal in various countries varies greatly, but the per capita water withdrawal of the country generally remains within a certain range. Therefore, we first calculate the reference data, and then interpolate and extrapolate the missing per capita water withdrawal data. The methods can also be summarized into the following five categories.

Interpolation method assuming increasing in arithmetic series

The calculation principle is the same as the interpolation method of national population data. This method is more suitable for interval data interpolation with shorter and discrete data, such as the data form before 1990 in Fig. 6(c).

Trend extrapolation method based on revised per capita water withdrawal growth rate

If there are continuous points in the data, we assume that the per capita water withdrawal versus time curve is consistent with the S curve, that is, the per capita water withdrawal shows only a slow change in the first years and the last years. We first calculate the growth rate of per capita water withdrawal in the last two years or the first two years, adjust the final growth rate proportionally to reflect the subsequent changes, and adjust the first growth rate proportionally to reflect the previous changes. Equation (4) represents a method of extrapolating the previous missing value data, and Eq. (5) represents a method of extrapolating the subsequent missing value data. This method is more suitable for the situation where continuous government data exists and the change trend of per capita water consumption is clear, such as the form of continuous data after 1990 in Fig. 6(c).

$$\left\{\begin{array}{rll}{s}_{i} & = & \frac{{w}_{i}-{w}_{i+1}}{{w}_{i+1}}\\ {s}_{i-1} & = & {s}_{i}\cdot (1-\theta )\\ {w}_{i-1} & = & {w}_{i}\cdot (1+{s}_{i-1})\end{array}\right.$$

(4)

$$\left\{\begin{array}{rll}{s}_{j} & = & \frac{{w}_{j}-{w}_{j-1}}{{w}_{j-1}}\\ {s}_{j+1} & = & {s}_{j}\cdot \left(1-\theta \right)\\ {w}_{j+1} & = & {w}_{j}\cdot \left(1+{s}_{j+1}\right)\end{array}\right.$$

(5)

Where w_i-1 is the missing per capita water withdrawal value for time step i-1; s_i-1 is the missing reverse order growth rate value for time step i-1; w_i and w_i+1 are the first two known per capita water withdrawal values for time step i and i + 1, and s_i-1 is the known reverse order growth rate value for time step i-1. For Eq. (5), w_j+1 is the missing per capita water withdrawal value for time step j + 1; s_j+1 is the missing growth rate value for time step j + 1; w_j-1 and w_j are the last two known per capita water withdrawal values for time step j and j-1, and s_j is the known growth rate value for time step j. To ensure that the per capita water withdrawal in the front of the series or in the latter part of the series does not change too fast, the equation introduces θ to represent the correction coefficient for the growth rate, which is generally in the range of 0.1 to 0.2.

Scale up to the same ratio or smoothing spline fitting

If there is only one data released, the per capita water withdrawal of that year will be used for all years. For water withdrawal data with long time spans and more data but many intervals, we use smoothing spline to provide smooth interpolation over time, taking into account the equilibrium of per capita water withdrawal fluctuations.

Proximity of adjacent region

If no national water withdrawal data is released, based on the country’s level of development and geographic location, the per capita water withdrawal of adjacent countries with similar development levels is selected as an approximate value for the country’s per capita water withdrawal value.

The treatment of sub-national water withdrawal data is similar to sub-national population data. First, the ratio of the sub-national data to the national data of the known year is calculated, and then the interpolation and extrapolation methods are used to calculate the ratio of the missing values, and finally sub-national data is obtained by the national data and the ratio.

Spatial distribution

This research further considers the indicative role of specific land use types. Spatial distribution, which means that the data is distributed to a meaningful area. It is assumed that the population and water are only used on an artificial surface and cultivated land. We mainly used the globeland30 data¹⁵ of 2000 and 2010 to process the data before and after 2000, respectively (Figs. 3 and 4).

Based on ArcGIS Desktop 10.2, convert the global land use grid into a vector format, and then extract the global artificial surface and cultivated land. The population density and water intensity on the grid are expressed as follows²¹:

$$S{D}_{ad,P}=\frac{{P}_{ad}}{{A}_{ad}},S{D}_{lu,P}=\frac{{P}_{ad}}{{A}_{lu,a}}$$

(6)

$$S{D}_{ad,W}=\frac{{W}_{ad}}{{A}_{ad}},{SD}_{lu{\rm{,}}W}=\frac{{W}_{ad}}{{A}_{lu,ac}}$$

(7)

Where, SD_{ad, P} and SD_{ad, W} are the population density and water intensity of an administrative unit, respectively; SD_{lu, P} is the population density on the artificial surface of an administrative unit; SD_{lu, W} is the water intensity on the artificial surface and cultivated land of an administrative unit; P_ad and W_ad are the population and water withdrawal of an administrative unit, respectively; A_ad, A_{lu, a} and A_{lu, ac} are the area of an administrative unit, the area of the artificial surface of an administrative unit, and the area of artificial surface and cultivated land of an administrative unit.

Data Records

The output data sets described in this article are publicly and freely available through the website²². The data set includes 4 sets of raster data and 2 sets of EXCEL spreadsheet data, which are published separately on each continent. Abbreviations for each continent are as follows: NA-North America, SA-South America, EU-Europe, AS-Asia, AF-Africa, OC-Oceania. The data includes the following:

*_dpyear_sr: Spatial distribution of population density grid data sets in national or sub-national administrative units, with a unit of person/km².

*_dpyear_1 km: Spatial distribution of population density grid data sets on 1 km resolution artificial surface grids, with a unit of person/km².

*_winyear_sr: Spatial distribution of water intensity grid data sets in national or sub-national administrative units, with a unit of m³/km².

*_widyear_1km: Spatial distribution of water intensity grid data sets on 1km resolution artificial surface and cultivated land grids, with a unit of m³/km².

Country_P_Data.xlsx: processing and result documents of national population data, including year, World Bank data, FAO data, the government published data, and revised population data, with a unit of 10,000 people.

Country_W_Data.xlsx: processing and result documents of national water withdrawal data, including year, World Bank data, FAO data, UN data, the government published data, and revised water withdrawal data, with a unit of 10,000 m3.

* indicates AF, AS, EU, NA, OC and SA.

The product set is designed to fill the blanks in the long series of population and water withdrawal, enhance the accuracy of data, and can reflect the spatial distribution changes of population and water withdrawal. The data products reveal the development of the world population and the changes in the pattern of water withdrawal. They can help to reveal the regional characteristics of population development all over the world. In particular, it is of great significance to master the scale of population water withdrawal in regions where data are difficult to access.

The product offers the population density products of the minimum administrative units and artificial surface from 1960 to 2020, the products of the minimum administrative units and the artificial surface-cultivated land products, a set of EXCEL files of the revised total population in different countries, and a set of EXCEL files of the revised total water consumption in different countries.

The trend interpolation and extrapolation in the product are conducted under the assumption that the population maintains a certain natural growth and they cannot timely reflect the sudden changes in population and water withdrawal caused by major disasters (i.e. extreme floods and earthquakes), wars and large-scale migration. Given the mobility and flexibility of human activities, there may be some errors in the above data in many countries/regions. The data can be edited to meet the needs of various users. Therefore, users are encouraged to make up for the error by using recently updated data or data from specific sources.

At present, we have not collected related data of a few countries/regions, such as Mauritania, Madeira Island, St. Helena, Christmas Island, British Indian Ocean territory, the Vatican, Svalbard Island and Jan Mayen Island, Guadeloupe, St Pierre et Miquelon, Na Varsa Island, Anguilla, Montserrat, Martinique, Clipperton Island, Midway Islands, Virgin Islands, Netherlands Antilles, United States Miscellaneous Islands, Pitcairn Islands, Norfolk Island, Heard-und McDonald- Island, Bouvet Island, South Georgia and South Sandwich Island, Cocos (Keeling) Islands, Prince Edward Island, Wake Island, French Territory in The South, Falkland Islands, etc. Most of these areas are uninhabited or sparsely populated, so there are few records of water withdrawal and population data. In the data set, they are treated as no-value areas. We intend to add more data sets to the product in the future to further improve its spatial and temporal coverage.

Technical Validation

To make the data more transparent, we have compiled detailed data sources for population and water withdrawal for each country, and the data is available as a separate EXCEL spreadsheet. These data sources, including the World Bank, FAO, the United Nations, and officially released data, are relatively accurate. The officially released data of each country is more rigorous in its own region, and we believe that the government officially publishes the highest level of accuracy. Since some of the data set was obtained by interpolation and extrapolation, we performed verification for the reliability of the data set, and the data validation graphs are provided synchronously with the data collected by each country.

In the process of population data correction, the following three situations often occur (Fig. 5). When the officially released data is relatively long, we take the officially released data as the standard and revise some contents in combination with the data of international organizations; When the data series are relatively concentrated, we can only use the data and trend changes of relevant international organizations for reference to reasonably correct the missing years and make it smoothly connect the official data; When the officially released data is discontinuous, we take the official data as the correction node and learn from the trends of relevant international organizations to correct it into smooth and continuous population data.

In the process of water withdrawal data correction, we also take the officially released data as the standard. Considering the basic assumption that the water withdrawal per capita in the country or region is constant or follows a certain trend, the water withdrawal data is interpolated and extrapolated (Fig. 6). China and India have a large proportion of water withdrawal in the world and more official data, some measured data are selected for interpolation and extrapolation, and compared with the actual data (Fig. 7). The results show that the deviation of the data is mostly within ± 10%, and the reason for the large deviation of some points is due to the large annual fluctuation of official data. Therefore, it can be considered that the data set derived from the existing water withdrawal data is accurate.

Code availability

The data set processing process and usage method can be obtained from Figshare²². We believe that in the case of serious data missing or large data differences, the effect of using the most basic mathematical method is more effective and reliable. Our basic methods can be realized by using the conventional functions in Excel without new code. Please refer to the methods section. Spatial processing content ran in ArcGIS Desktop (V10.2 or later). The interpolation and extrapolation is processed in Microsoft Excel. All software needs to be installed in the Windows 10.

References

United Nations. The United Nations World Water Development Report 2021: Valuing Water. Educational, Scientific and Cultural Organization. https://www.unwater.org/publications/un-world-water-development-report-2021/.
The twenty-sixth round of official United Nations population estimates and projections. The 2019 Revision of World Population Prospects. https://population.un.org/wpp/Publications/Files/WPP2019_Highlights.pdf.
Huggins, X. et al. Hotspots for social and ecological impacts from freshwater stress and storage loss. Nat Commun. 13(1), 1–11 (2022).
Article Google Scholar
Hanasaki, N. et al. A global water scarcity assessment under Shared Socio-economic Pathways–Part 1: Water use. Hydrol Earth Syst Sc. 17(7), 2375–2391 (2013).
Article ADS Google Scholar
Huang, Z. et al. Reconstruction of global gridded monthly sectoral water withdrawals for 1971–2010 and analysis of their spatiotemporal patterns. Hydrol Earth Syst Sc. 22(4), 2117–2133 (2018).
Article ADS Google Scholar
Wada, Y., Wisser, D. & Bierkens, M. F. P. Global modeling of withdrawal, allocation and consumptive use of surface water and groundwater resources. Earth Syst Dynam. 5(1), 15–40 (2014).
Article ADS Google Scholar
Dobson, J. E. et al. LandScan: a global population database for estimating populations at risk. Photogramm Eng Rem S. 66(7), 849–857 (2000).
Google Scholar
FAO. AQUASTAT Database. https://www.fao.org/aquastat/statistics/query/index.html.
Balk, D. L. et al. Determining global population distribution: methods, applications and data. Adv Parasit. 62, 119–156 (2006).
Article CAS Google Scholar
Doxsey-Whitfield, E. et al. Taking advantage of the improved availability of census data: A first look at the Gridded Population of the World, Version 4. Papers in Applied Geography. 1(3), 226–234 (2015).
Article Google Scholar
Nouri N. Water withdrawal and consumption reduction analysis for electrical energy generation system. Dissertations & Theses - Gradworks (2015).
World Bank, “Population, total and Annual freshwater withdrawals, total (billion cubic meters)”. The World Bank Group, https://data.worldbank.org/indicator/SP.POP.TOTL and https://data.worldbank.org/indicator/ER.H2O.FWTL.K3 accessed December 16, 2021.
UNdata, “Fresh surface water abstracted”. The Statistics Division of the Department of Economic and Social Affairs (UN DESA) of the UN Secretariat. https://data.un.org/.
Eurostat. the European Statistical System (ESS). https://ec.europa.eu/eurostat/data/database.
Chen, J. et al. Analysis and applications of GlobeLand30: a review. ISPRS Int J Geo-Inf. 6(8), 230 (2017).
Article Google Scholar
Honjo, H. et al. Statistical properties of approval ratings for governments. Physica A. 428, 266–272 (2015).
Article ADS Google Scholar
Kumar, P. S., Carolin, C. F. Water withdrawal and conservation-Global stcenario. Consumption, Footprint, and Life Cycle Assessment. https://www.sciencedirect.com/science/article/pii/B978008102633500004X.
Deichmann, U., Balk, D. & Yetman. G. Transforming Population Data for Interdisciplinary Usages: From Census to Grid. Palisades, NY: NASA Socioeconomic Data and Applications Center (SEDAC), CIESIN, Columbia University. http://sedac.ciesin.columbia.edu/downloads/docs/gpw-v3/gpwdocumentation.pdf.
United Nations, Department of Economic and Social Affairs, Population Division. World Population Prospects: The 2015 Revision, DVD Edition, http://esa.un.org/unpd/wpp/DVD/ (2015).
Tobler, W. et al. World population in a grid of spherical quadrilaterals. International Journal of Population Geography. 3, 203–225 (1997).
Article CAS Google Scholar
Salvatore, M. et al. Mapping global urban and rural population distributions. Environment and Natural Resources Working Paper 24. Food and Agri. Org. UN Corporate Document Repository, http://www.fao.org/docrep/009/a0310e/a0310e00.htm (2005).
Yan, D. H. A data set of distributed global population and water withdrawal from 1960 to 2020, figshare, https://doi.org/10.6084/m9.figshare.19387406.v2 (2022).

Download references

Acknowledgements

The researchers would like to extend their thanks to the National Science Fund Project (Grant No. 52130907), the National Key Research and Development Project of China (No. 2016YFA0601503), the National Science Fund Project for Distinguished Young Scholars (Grant No. 51725905).

Author information

Authors and Affiliations

State Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, China Institute of Water Resources and Hydropower Research, No. 1 Fuxing Road, Haidian District, Beijing, 100038, China
Denghua Yan, Xin Zhang, Tianling Qin, Chenhao Li, Hao Wang, Baisha Weng, Kun Wang, Shanshan Liu, Xiangnan Li, Yuheng Yang, Weizhi Li, Zhenyu Lv, Jianwei Wang, Meng Li, Shan He, Fang Liu, Wuxia Bi, Ting Xu, Xiaoqing Shi, Zihao Man, Congwu Sun, Meiyu Liu, Mengke Wang, Yinghou Huang, Haoyu Long, Yongzhen Niu, Batsuren Dorjsuren, Mohammed Gedefaw, Yizhe Li, Zihao Tian, Shizhou Mu, Wenyu Wang & Xiaoxiang Zhou
State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Nanjing Hydraulic Research Institute, Nanjing, 210029, China
Jianyun Zhang

Authors

Denghua Yan
View author publications
You can also search for this author in PubMed Google Scholar
Xin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Tianling Qin
View author publications
You can also search for this author in PubMed Google Scholar
Chenhao Li
View author publications
You can also search for this author in PubMed Google Scholar
Jianyun Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Baisha Weng
View author publications
You can also search for this author in PubMed Google Scholar
Kun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shanshan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiangnan Li
View author publications
You can also search for this author in PubMed Google Scholar
Yuheng Yang
View author publications
You can also search for this author in PubMed Google Scholar
Weizhi Li
View author publications
You can also search for this author in PubMed Google Scholar
Zhenyu Lv
View author publications
You can also search for this author in PubMed Google Scholar
Jianwei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Meng Li
View author publications
You can also search for this author in PubMed Google Scholar
Shan He
View author publications
You can also search for this author in PubMed Google Scholar
Fang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Wuxia Bi
View author publications
You can also search for this author in PubMed Google Scholar
Ting Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoqing Shi
View author publications
You can also search for this author in PubMed Google Scholar
Zihao Man
View author publications
You can also search for this author in PubMed Google Scholar
Congwu Sun
View author publications
You can also search for this author in PubMed Google Scholar
Meiyu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Mengke Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yinghou Huang
View author publications
You can also search for this author in PubMed Google Scholar
Haoyu Long
View author publications
You can also search for this author in PubMed Google Scholar
Yongzhen Niu
View author publications
You can also search for this author in PubMed Google Scholar
Batsuren Dorjsuren
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed Gedefaw
View author publications
You can also search for this author in PubMed Google Scholar
Yizhe Li
View author publications
You can also search for this author in PubMed Google Scholar
Zihao Tian
View author publications
You can also search for this author in PubMed Google Scholar
Shizhou Mu
View author publications
You can also search for this author in PubMed Google Scholar
Wenyu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoxiang Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.Y. and X.Z. contributed equally to this work as co-first authors. D.Y., T.Q., J.Z. and H.W. designed the study and provided guidance. X.Z., C.L. and X.L. drafted the manuscript. X.Z., C.L., X.L., Y.Y., K.W., Z.L., J.W. and M.L. undertook data processing and assembly. All members participated in data collection.

Corresponding authors

Correspondence to Tianling Qin or Chenhao Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yan, D., Zhang, X., Qin, T. et al. A data set of distributed global population and water withdrawal from 1960 to 2020. Sci Data 9, 640 (2022). https://doi.org/10.1038/s41597-022-01760-1

Download citation

Received: 18 May 2022
Accepted: 10 October 2022
Published: 21 October 2022
DOI: https://doi.org/10.1038/s41597-022-01760-1

This article is cited by

Assessment on the sustainability of water resources utilization in Central Asia based on water resources carrying capacity
- Wenhua Liu
- Yizhuo Wang
- Wenbin Zhu
Journal of Geographical Sciences (2023)
A Complementary Streamflow Attribution Framework Coupled Climate, Vegetation and Water Withdrawal
- Shanhu Jiang
- Yongwei Zhu
- Chong-Yu Xu
Water Resources Management (2023)

Subjects

Abstract

Similar content being viewed by others

Global monthly sectoral water use for 2010–2100 at 0.5° resolution across alternative futures

Sensitivity of subregional distribution of socioeconomic conditions to the global assessment of water scarcity

Urban water and electricity demand data for understanding climate change impacts on the water-energy nexus

Background & Summary

Methods

Data collection and pretreatment

Interpolation and extrapolation of national and sub-national population data

Interpolation method assuming increasing in arithmetic series

Trend extrapolation method based on general trend curve model

Scale up to the same ratio

Based entirely on government data or reference data

Interpolation and extrapolation of national and sub-national water withdrawal data

Interpolation method assuming increasing in arithmetic series

Trend extrapolation method based on revised per capita water withdrawal growth rate

Scale up to the same ratio or smoothing spline fitting

Proximity of adjacent region

Spatial distribution

Data Records

Technical Validation

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Assessment on the sustainability of water resources utilization in Central Asia based on water resources carrying capacity

A Complementary Streamflow Attribution Framework Coupled Climate, Vegetation and Water Withdrawal

Search

Quick links