Development and application of high resolution SPEI drought dataset for Central Asia

Pyarali, Karim; Peng, Jian; Disse, Markus; Tuo, Ye

doi:10.1038/s41597-022-01279-5


Data Descriptor
Open access
Published: 14 April 2022


Scientific Data volume 9, Article number: 172 (2022)


Abstract

Central Asia is a data-scarce region, which makes it difficult to monitor droughts and minimize their impacts. To address this challenge, a high-resolution (5 km) Standardized Precipitation Evapotranspiration Index (SPEI-HR) drought dataset was developed for Central Asia at different time scales for 1981–2018, using precipitation from the Climate Hazards group InfraRed Precipitation with Station data (CHIRPS) and potential evaporation (E_p) from the Global Land Evaporation Amsterdam Model (GLEAM). Over time and space, SPEI-HR generally correlated well with SPEI values estimated from the coarse-resolution Climate Research Unit (CRU) gridded time series dataset. The 6-month SPEI-HR displayed a good correlation of 0.66 with GLEAM root zone soil moisture (RSM) and a positive correlation of 0.26 with the normalized difference vegetation index (NDVI) from the Global Inventory Monitoring and Modelling System (GIMMS). After observing clear agreement between SPEI-HR and drought indicators for the 2001 and 2008 drought events, an emerging hotspot analysis was conducted to identify drought-prone districts and sub-basins.

Measurement(s)	Drought index
Technology Type(s)	Remote sensing products
Factor Type(s)	Standardized Precipitation Evapotranspiration Index
Sample Characteristic - Environment	drought
Sample Characteristic - Location	Central Asia

Background & Summary

The study area, shown in Fig. 1, comprises six countries: Tajikistan, Kazakhstan, Turkmenistan, Uzbekistan, Kyrgyzstan, and parts of China. The precipitation data used in this study are only available for regions below 50°N, so the study area had to be clipped accordingly. Central Asia covers approximately 5.65 million km² of land¹, and its topography is very diverse, ranging from mountainous terrain to low-lying basins and from deserts to grasslands². The study area lies far from any ocean, and the mountain ranges in south-east Asia further limit the amount of moisture reaching Central Asia. Therefore, mostly arid conditions prevail in the region, with a typical temperate continental climate³. The main source of water for the region is the glaciers of the Tianshan Mountains⁴. The population density of the region is low compared to neighbouring Southeast Asia. Figure 1 shows very few locations with a population of more than 50,000 people per square km⁵. According to Zhi Li et al.⁶, the temperature of Central Asia has increased sharply since 1997, and the decade from 2007 to 2017 was the warmest period in the region’s recorded history. Poor management of water resources, together with increasing temperature and varying precipitation patterns due to climate change, has increased the severity of the water resource deficit. The effects of the increase in temperature, exacerbated by human activities, became very clear when the Aral Sea shrank over a very short time period².

Background/Introduction

A drought is an environmental disaster characterised by a prolonged dry period. It is caused by a lack of precipitation and can take place anywhere on land⁷,⁸. The definition of a drought varies in the academic literature. However, there is a consensus that an anomaly in temperature or precipitation that persists for a long period of time across a region reduces the volume of soil moisture, groundwater, and surface runoff⁹. The impacts of droughts are non-structural and difficult to quantify because of their slow, creeping nature, but a lack of water can lead to crop failure, and multiple consecutive crop failures can start a famine and result in human migration⁷. In the last half century, multiple droughts have occurred around the world. For example, the drought of 1988 in the United States cost its economy $40 billion in damages¹⁰; the drought of 2005 across Spain and Portugal decreased the cereal yields of the European Union (EU) by ten percent¹¹; a multi-year drought in Central and Southwest Asia affected 60 million people during 1999–2000¹²; in Australia, the drought of 2006 caused an estimated $3.5 billion in damages to the local economy¹³; and in Africa, severe droughts in the 1960s, 1970s and 1980s were followed by famines in the region⁸. Therefore, understanding the socio-economic and ecological aspects of drought is as important as understanding the hydrological or meteorological aspects of the event. It is key to note that a drought itself is not a disaster; the lack of resilience of a community to cope with its impacts makes it one⁷.

Generally, droughts are categorized into four types: meteorological (indicated by a lack of precipitation), agricultural (indicated by low soil moisture), hydrological (indicated by low runoff or a low groundwater table) and socio-economic (indicated by social measures such as income and access to water)⁸,¹⁴,¹⁵. Furthermore, each drought can be characterized according to its severity, duration, and intensity. The quantification of the different types of droughts depends on the time scale over which the water deficit is accumulated, because different time scales respond to different sources of accessible water. Therefore, multiple drought indices have been developed to estimate the different types of droughts. One of the most widely accepted drought indices is the Standardized Precipitation Index (SPI), which is simple to apply and can characterize different types of droughts by varying the time scale. SPI is recommended by the World Meteorological Organization (WMO) and needs only precipitation data for estimation⁸,¹⁴,¹⁶. Vicente-Serrano et al.¹⁷ argued that SPI ignores the influence of temperature on water deficit, which leads to misrepresentation of actual drought conditions, specifically in arid regions. Additionally, rising average global temperature due to climate change will amplify the role of temperature in drought propagation. Thus, the Standardized Precipitation Evapotranspiration Index (SPEI) was developed, which is similar to SPI except that it uses water deficit values instead of precipitation for a better representation of drought conditions¹⁷.

There are two widely used global SPEI datasets: SPEIbase¹⁸ and the Global Precipitation Climatology Centre Drought Index (GPCC-DI)¹⁹. Their spatial resolutions are 0.5° (≈ 50 km) and 1° (≈ 110 km), respectively. SPEIbase was developed using the CRU Time Series, and GPCC-DI was developed using GPCC precipitation data and the Climate Prediction Center’s temperature data. A lack of finer-resolution data meant that most studies could only be conducted at regional level²,³,²⁰, because the resolution of the input datasets is too coarse to apply drought indices at district or sub-basin level²¹. However, recently developed high-resolution precipitation data²² and evapotranspiration data²³ have made it possible to estimate SPEI at a spatial resolution of 5 km. Taking advantage of these data, researchers prepared high-resolution SPEI data for the whole continent of Africa¹⁵, which performed reliably in characterizing drought events. Similar studies are missing but could be valuable in the data-scarce continental region of Central Asia. How well these high-resolution data perform for drought research in another domain with different climatic and topographic characteristics is an open question.

In this study, a high-resolution (5 km) SPEI drought index was prepared for the entirety of Central Asia. The region is currently the warmest it has been in recorded history²⁴, it is prone to droughts²⁵, and studies have found a significant drying pattern between 2003 and 2015². Furthermore, the available observed meteorological data for the region are very scarce and not continuous¹. As a result, more research is required to understand how sensitive the region is to climate change. The high-resolution SPEI dataset produced in this study will help water managers, policy makers and local stakeholders to improve their risk analysis and plan their response accordingly. Lastly, to enhance the available knowledge for this region, an emerging hotspot analysis is conducted for SPEI values at the 6-month and 48-month time scales, where a time scale indicates the period of water deficit accumulation.

Summary

Central Asia is a data-scarce region; therefore, a high-resolution SPEI drought dataset was developed for it in this study. The input data were the CHIRPS precipitation dataset and the GLEAM potential evaporation dataset, both remote sensing products. SPEI datasets for forty-eight different time scales were produced for the period 1981–2018, a total of 38 years. The results were validated using CRU’s SPEI data, GLEAM’s root zone soil moisture and GIMMS’s NDVI data. Overall, the high-resolution SPEI dataset displayed high spatial and temporal correlation with CRU’s SPEI values, high to moderate correlation with GLEAM’s root zone soil moisture, and satisfactory to low correlation with GIMMS’s NDVI. The evaluation of the dataset indicated that SPEI-03 (three-month time scale), SPEI-06 and SPEI-09 performed best in capturing RSM values, while SPEI-06, SPEI-09 and SPEI-12 gave the highest correlations with NDVI. The overall performance of the dataset is considered good; therefore, an emerging hotspot analysis was conducted on the dataset to observe drought conditions at district or basin scale. The emerging hotspot analysis identified regions with oscillating hot and cold spot patterns for the time scales of 1, 3, 6, 9, 12, 24, 36, and 48 months, but it failed to provide any other conclusive pattern due to the periodic nature of the drought indices. Lastly, to our knowledge, the 5 km resolution of the SPEI-HR dataset produced in this study is the finest available for a drought index in Central Asia.

Methods

High-Resolution SPEI calculation

In this study, following the work of Peng et al.¹⁵, a high-resolution drought index dataset containing SPEI values for Central Asia was prepared for 48 different time scales using the method proposed by Vicente-Serrano et al.¹⁷. The input data were the CHIRPS precipitation dataset, which has a monthly temporal resolution and a 5 km spatial resolution, and the GLEAM potential evaporation data, which have a monthly temporal resolution and were downscaled from 25 km to 5 km resolution using bilinear interpolation.
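The downscaling step can be illustrated with a minimal sketch of bilinear interpolation on a regular grid. This is not the authors' processing code: the function name `bilinear_refine`, the toy values, and the treatment of grid values as node samples (rather than cell centres) are assumptions made for illustration only.

```python
import numpy as np

def bilinear_refine(coarse, factor):
    """Refine a 2-D grid by an integer factor using bilinear interpolation.

    Grid values are treated as samples at node positions (a simplification),
    so the output has (n - 1) * factor + 1 points along each axis.
    """
    coarse = np.asarray(coarse, dtype=float)
    ny, nx = coarse.shape
    y_new = np.linspace(0, ny - 1, (ny - 1) * factor + 1)
    x_new = np.linspace(0, nx - 1, (nx - 1) * factor + 1)
    # Interpolate linearly along x for every row, then along y for every
    # column; the two 1-D passes together are a bilinear interpolation.
    rows = np.array([np.interp(x_new, np.arange(nx), r) for r in coarse])
    return np.array([np.interp(y_new, np.arange(ny), c) for c in rows.T]).T

# Toy potential evaporation values on a 2 x 2 coarse grid (not real data).
ep_25km = np.array([[0.0, 1.0], [2.0, 3.0]])
ep_5km = bilinear_refine(ep_25km, factor=5)  # 25 km -> 5 km spacing
```

In practice a geospatial resampling tool that handles cell centres, map projections and missing values would be used; the sketch only shows the interpolation itself.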

To compute SPEI we require water deficit values (D), which are calculated by subtracting E_p from precipitation (P) values using the following equation:

$${D}_{i}={P}_{i}-{E}_{p,i}$$

(1)

Please note that different E_p methods could result in different SPEI estimates²⁶,²⁷,²⁸. To improve regional drought assessment, such impacts should be addressed by further studies when regional data become available for verification. This requires standalone research and is beyond the scope of this Data Descriptor.
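As an illustration of Eq. 1 and of the accumulation of the water deficit over a k-month time scale, the following sketch works on the monthly series of a single pixel. The function names and the toy numbers are hypothetical, and the running sum is one straightforward way to accumulate the deficit, not necessarily the authors' implementation.

```python
import numpy as np

def water_deficit(precip, ep):
    """Monthly water deficit D_i = P_i - E_p,i (same units, e.g. mm/month)."""
    return np.asarray(precip, dtype=float) - np.asarray(ep, dtype=float)

def accumulate(deficit, k):
    """k-month running sum of the deficit; the first k - 1 months are NaN."""
    deficit = np.asarray(deficit, dtype=float)
    out = np.full(deficit.shape, np.nan)
    csum = np.concatenate(([0.0], np.cumsum(deficit)))
    out[k - 1:] = csum[k:] - csum[:-k]
    return out

# Toy monthly series for one pixel (not real data).
p = [10.0, 20.0, 30.0, 0.0]
ep = [15.0, 15.0, 15.0, 15.0]
d = water_deficit(p, ep)   # deficits of -5, 5, 15 and -15 mm
d3 = accumulate(d, k=3)    # 3-month accumulation, as used for SPEI-03
```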

The water deficit values are aggregated depending on the time scale before being standardized using the log-logistic distribution with the following probability density function:

$$f\left(x\right)=\frac{\beta }{\alpha }{\left(\frac{x-\gamma }{\alpha }\right)}^{\beta -1}{\left[1+{\left(\frac{x-\gamma }{\alpha }\right)}^{\beta }\right]}^{-2}$$

(2)

where β, γ and α are the shape, origin and scale parameters, respectively. The cumulative distribution function of the log-logistic distribution is given by

$$F\left(x\right)={\left[1+{\left(\frac{\alpha }{x-\gamma }\right)}^{\beta }\right]}^{-1}$$

(3)
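A minimal sketch of Eq. 3 is given below. The parameter values are arbitrary illustrations; in practice α, β and γ are fitted to the accumulated deficit series of each pixel and calendar month, a step that is not shown here.

```python
import numpy as np

def loglogistic_cdf(x, alpha, beta, gamma):
    """Three-parameter log-logistic CDF of Eq. 3, valid for x > gamma.

    alpha: scale, beta: shape, gamma: origin.
    """
    x = np.asarray(x, dtype=float)
    return 1.0 / (1.0 + (alpha / (x - gamma)) ** beta)

# Hypothetical parameters, for illustration only.
alpha, beta, gamma = 50.0, 3.0, -100.0
deficits = np.array([-80.0, -50.0, 0.0, 60.0])
F = loglogistic_cdf(deficits, alpha, beta, gamma)
```

A quick sanity check is that F equals 0.5 at x = γ + α, the median of the distribution, and that F increases monotonically with x.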

Then the SPEI values can be estimated by standardizing the $F\left(x\right)$ values using the following equation

$$SPEI=W-\frac{{C}_{0}+{C}_{1}W+{C}_{2}{W}^{2}}{1+{d}_{1}W+{d}_{2}{W}^{2}+{d}_{3}{W}^{3}},$$

(4)

where C₀ = 2.515517, C₁ = 0.802853, C₂ = 0.010328, d₁ = 1.432788, d₂ = 0.189269, d₃ = 0.001308. The value of W depends on the probability of exceedance P as shown below

$$W=\sqrt{-2ln\left(P\right)},$$

(5)

where $P=1-F(x)$. This form applies when $P\le 0.5$; when $P > 0.5$, P is replaced by $1-P$ and the sign of the resulting SPEI is reversed¹⁷.
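The standardization in Eqs. 4 and 5 can be sketched as follows. The constants are those listed in the text; the function name `spei_from_cdf` is hypothetical, and the input is assumed to lie strictly between 0 and 1.

```python
import numpy as np

C0, C1, C2 = 2.515517, 0.802853, 0.010328
D1, D2, D3 = 1.432788, 0.189269, 0.001308

def spei_from_cdf(F):
    """Convert log-logistic CDF values F(x) in (0, 1) to SPEI (Eqs. 4 and 5)."""
    F = np.asarray(F, dtype=float)
    P = 1.0 - F                      # probability of exceedance
    flip = P > 0.5                   # dry side: use 1 - P and reverse the sign
    P = np.where(flip, 1.0 - P, P)
    W = np.sqrt(-2.0 * np.log(P))
    z = W - (C0 + C1 * W + C2 * W**2) / (1.0 + D1 * W + D2 * W**2 + D3 * W**3)
    return np.where(flip, -z, z)

spei = spei_from_cdf([0.1, 0.5, 0.9])  # dry, median and wet cases
```

Because the rational expression approximates the inverse of the standard normal distribution, F = 0.5 maps to an SPEI close to zero and the output is antisymmetric around the median.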

The drought index values within the dataset vary from extremely wet to extremely dry. Two sets of SPEI class thresholds were found in the literature, as shown in Table 1.

Table 1 Classification of SPEI values based on two different thresholds found in the literature.


Because of low hydroclimatic variability, SPEI results are not reliable for sparsely vegetated and barren areas. Therefore, during evaluation the SPEI values were masked for these two land cover types¹⁸,²⁹ using the Moderate Resolution Imaging Spectroradiometer (MODIS) land cover type product (MCD12Q1)³⁰.

Evaluation Criteria

The results were evaluated by comparing the high-resolution SPEI with CRU SPEI for a subset of the time scales (i.e. 1, 3, 6, 9, 12, 24, 36, and 48 months). The high-resolution results were upscaled to 50 km to ensure consistent data for comparison. The correlation between high-resolution SPEI and CRU SPEI was evaluated both temporally and spatially. Furthermore, to assess the performance of the high-resolution SPEI dataset, SPEI-06 was compared with NDVI and root zone soil moisture (RSM). Then the spatial mean (area mean) of both the high- and coarse-resolution SPEI was compared with the area mean of NDVI and RSM for the eight time scales mentioned above. SPEI-06 was chosen following the findings of Törnros and Menzel (2014)³¹, who observed that the 6-month SPEI (SPEI-06) has the highest correlation with soil moisture and best captures the variations of NDVI.
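The text does not state which resampling method was used for the upscaling to 50 km. As one plausible option, the sketch below aggregates a fine grid by averaging non-overlapping blocks of 10 × 10 cells; block averaging, the function name `block_mean` and the toy grid are all assumptions.

```python
import numpy as np

def block_mean(fine, factor):
    """Upscale a 2-D grid by averaging non-overlapping factor x factor blocks.

    NaN cells (e.g. masked land cover) are ignored within each block; rows and
    columns that do not fill a complete block are dropped.
    """
    fine = np.asarray(fine, dtype=float)
    ny_c, nx_c = fine.shape[0] // factor, fine.shape[1] // factor
    trimmed = fine[:ny_c * factor, :nx_c * factor]
    blocks = trimmed.reshape(ny_c, factor, nx_c, factor)
    return np.nanmean(blocks, axis=(1, 3))

# Toy 20 x 20 grid at 5 km spacing (not real data), aggregated to 50 km.
spei_5km = np.arange(400, dtype=float).reshape(20, 20)
spei_50km = block_mean(spei_5km, factor=10)
```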

NDVI is a measure of vegetation health, and RSM is an indicator of agricultural drought. The NDVI values were obtained from the GIMMS dataset (1981–2015) and the RSM data from GLEAM (1981–2018). The high-resolution SPEI results were resampled to the grid of the product (NDVI or RSM) they were compared with. The NDVI and RSM products were standardized before being compared with the resampled SPEI. For standardization, the time series of each pixel was grouped by calendar month, and the series for each month was then standardized using that month’s mean and standard deviation across all years, as shown in the following equation, suggested by Meng Zhao et al.²⁹, where i is the month and j is the year.

$$standardized\;{X}_{\left(i,j\right)}=\frac{{X}_{\left(i,j\right)}-mean\left({X}_{i}\right)}{standard\;deviation\left({X}_{i}\right)}$$

(6)
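A minimal sketch of the month-wise standardization of Eq. 6 for a single pixel is shown below. It assumes the series starts in January and covers whole years, and it uses the population standard deviation; the text does not specify which estimator was used, so that choice is an assumption.

```python
import numpy as np

def standardize_by_month(series):
    """Standardize a monthly series separately for each calendar month.

    series: 1-D array starting in January whose length is a multiple of 12.
    """
    x = np.asarray(series, dtype=float).reshape(-1, 12)  # rows: years j, columns: months i
    mean = x.mean(axis=0)   # one mean per calendar month
    std = x.std(axis=0)     # population standard deviation (assumption)
    return ((x - mean) / std).ravel()

# Synthetic 10-year series with a seasonal cycle (not real NDVI or RSM data).
rng = np.random.default_rng(0)
raw = rng.normal(size=120) + np.tile(np.arange(12.0), 10)
standardized = standardize_by_month(raw)
```

After standardization every calendar month has zero mean and unit variance, so the seasonal cycle no longer dominates the comparison with SPEI.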

In the following sections, the high-resolution SPEI is referred to as SPEI-HR and the coarse-resolution SPEI derived from Climate Research Unit data is termed SPEI-CRU. Pearson’s correlation was used to analyse the correlation between the different variables, and only statistically significant results were accepted; the rest were set to “Not Available”. Table 2 lists all the correlations carried out.

Table 2 SPEI-HR correlation analysed with different products to evaluate its performance.
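The significance screening of the correlations can be sketched as follows. The sketch computes Pearson's r and a two-sided p-value from the usual t statistic, but approximates the t distribution by the standard normal, which is only reasonable for long series such as the monthly records used here. This approximation, the function name and the default threshold of 0.05 are choices made for this illustration; an exact p-value would come from a statistics library.

```python
import math
import numpy as np

def significant_pearson(x, y, alpha=0.05):
    """Pearson's r between x and y, or NaN if it is not significant.

    The two-sided p-value uses a normal approximation to the t distribution.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    valid = ~(np.isnan(x) | np.isnan(y))   # drop months missing in either series
    x, y = x[valid], y[valid]
    n = x.size
    if n < 4:
        return float("nan")
    r = float(np.corrcoef(x, y)[0, 1])
    if abs(r) >= 1.0:
        return r
    t = r * math.sqrt((n - 2) / (1.0 - r * r))
    p = math.erfc(abs(t) / math.sqrt(2.0))  # two-sided p-value
    return r if p < alpha else float("nan")

# Synthetic series, not real SPEI or RSM data.
t_axis = np.arange(100.0)
r_strong = significant_pearson(t_axis, t_axis + np.sin(t_axis))
r_none = significant_pearson([1, 2, 3, 4, 5, 6, 7, 8], [1, -1, -1, 1, 1, -1, -1, 1])
```

Applied pixel by pixel, a function of this kind returns a correlation map in which non-significant pixels are left empty.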


High-Resolution Emerging Hot Spot Analysis

As a potential application, an emerging hot spot analysis was conducted on the high-resolution SPEI drought dataset prepared in this study. The analysis was performed using the Getis-Ord Gi* statistic, which is available in the “Spatial Statistics Toolbox” of ArcGIS Pro. The analysis identifies a range of statistically significant patterns depending on the value of the Gi* statistic³². The Gi* statistic proportionally compares the local sum of a feature and its neighbouring features to the sum of all the features in the study, using:

$${G}_{\left(i\right)}^{* }=z-score=\frac{{\sum }_{j=1}^{n}{w}_{\left(i,j\right)}{x}_{j}-mean\left(X\right){\sum }_{j=1}^{n}{w}_{\left(i,j\right)}}{S\sqrt{\frac{n{\sum }_{j=1}^{n}{w}_{\left(i,j\right)}^{2}-{\left({\sum }_{j=1}^{n}{w}_{\left(i,j\right)}\right)}^{2}}{n-1}}},$$

(7)

where ${x}_{j}$ is the attribute value of the feature j, ${w}_{\left(i,j\right)}$ is the spatial weight between feature i and j, and n is the total number of features.

$$mean\left(X\right)=\frac{{\sum }_{j=1}^{n}\;{x}_{j}}{n},$$

(8)

$$S=\sqrt{\frac{{\sum }_{j=1}^{n}\;{x}_{j}^{2}}{n}-{\left(mean\left(X\right)\right)}^{2}}.$$

(9)
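Equations 7 to 9 can be written compactly with NumPy. The sketch below is not the ArcGIS Pro implementation: it uses a purely spatial, one-dimensional toy example with binary weights (each feature and its immediate neighbours), whereas the emerging hot spot analysis defines neighbourhoods in both space and time.

```python
import numpy as np

def getis_ord_gi_star(x, w):
    """Gi* z-scores for attribute values x and spatial weight matrix w (n x n)."""
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    n = x.size
    mean = x.mean()                                   # Eq. 8
    s = np.sqrt((x ** 2).sum() / n - mean ** 2)       # Eq. 9
    w_sum = w.sum(axis=1)
    w_sq_sum = (w ** 2).sum(axis=1)
    numerator = w @ x - mean * w_sum
    denominator = s * np.sqrt((n * w_sq_sum - w_sum ** 2) / (n - 1))
    return numerator / denominator                    # Eq. 7

# Toy transect of 10 features with a cluster of high values at one end.
values = np.array([0, 0, 0, 0, 0, 0, 0, 5, 5, 5], dtype=float)
idx = np.arange(values.size)
weights = (np.abs(idx[:, None] - idx[None, :]) <= 1).astype(float)  # self + adjacent
z_scores = getis_ord_gi_star(values, weights)
```

Features inside the cluster of high values receive positive z-scores and features surrounded by low values receive negative ones, which corresponds to the hot and cold spots discussed in the text.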

The Gi* statistic is evaluated for each feature in the study, yielding a z-score and a p-value. The z-score is statistically significant only when the difference between the observed local sum and the expected local sum is too large to be attributed to randomness. A hotspot has a high positive z-score and a small p-value, which indicates a significant cluster of high values around the local feature, while a cold spot has a low negative z-score and a small p-value, which represents a significant cluster of low values. In other words, hotspots occur where the drought index is high both in the pixel and in its neighbourhood. In this study, a hotspot indicates an accumulation of high SPEI values (wet conditions) in time and space around a pixel, while a cold spot represents low SPEI values (dry conditions) around a pixel in time and space³³,³⁴.

Data Records

The high resolution (5 km) SPEI drought dataset produced in this study for Central Asia is archived at the Centre for Environmental Data Analysis (CEDA)³⁵. The dataset is publicly available and can be accessed as follows: Pyarali, K.; Peng, J.; Disse, M.; Tuo, Y. High resolution Standardized Precipitation Evapotranspiration Index (SPEI) dataset for Central Asia. NERC EDS Centre for Environmental Data Analysis (2022).

CHIRPS

CHIRPS is a state-of-the-science high-resolution precipitation dataset. It has quasi-global coverage: the data span all longitudes, but latitudes only from 50°S to 50°N. The development of the CHIRPS dataset can be divided into three main components: 1) the Climate Hazards group Precipitation climatology (CHP_clim), 2) the satellite-only Climate Hazards group Infrared Precipitation (CHIRP), and 3) blending with station data to produce CHIRPS²².

CHIRPS is the final gridded precipitation dataset that was used in this study. The blending of CHIRP with observed station data is carried out by applying a modified inverse distance weighting interpolation, where the interpolation for any pixel depends on the weighted average of the ratios between the CHIRP value of the pixel and the five closest stations. The value of the pixel is further adjusted depending on the correlation between it and the nearest station and on the correlation between the CHIRP value and the true precipitation value. The final CHIRPS value for the pixel is a combination of unadjusted and bias-adjusted CHIRP data²².

CHIRPS was specifically developed to observe conditions that indicate the emergence of agricultural drought and global environmental change on land. The dataset provides context for recent extreme climate events relative to historical observations, with unprecedentedly high spatial and temporal resolution among global terrestrial products. Validation and application studies suggest that CHIRPS performs well across Turkey, with monthly and decadal correlations of 0.81 and 0.78, respectively³⁶; over southern river basins in China, with correlations between 0.44 and 0.46³⁷; throughout the Indian subcontinent, with a correlation of 0.80³⁸; and across Africa¹⁵. Poor performance, with correlations between 0.21 and 0.34, was recorded over north-western and northern river basins in China³⁷.

In this study CHIRPS precipitation data were used to prepare a drought index dataset with a 5 km resolution and a time period of thirty-eight years from 1981 to 2018. More details regarding the product can be found in the paper authored by Chris Funk et al.²².

GLEAM

GLEAM is a model that estimates Global E_p and RSM using remote sensing products. The dataset this model provides has a spatial resolution of 0.25° (≈ 25 km), a monthly temporal resolution and spans from 1980 to 2018. The aim of developing GLEAM was to provide a consistent and long term observed dataset for hydrological variables, which are sparsely available for most regions of the world²³. The model is made up of four modules: 1) Potential evaporation, 2) Rainfall interception, 3) Soil module and 4) Stress module.

GLEAM estimates E_p (mm day⁻¹) using the Priestley and Taylor (1972)³⁹ equation, as presented below. This method provides an E_p value, which is based on land cover, net radiation (R_n), and air temperature of the region.

$$\lambda {E}_{p}=\alpha \frac{\Delta }{\Delta +\psi }\left({R}_{n}-G\right),$$

(10)

where λ is the latent heat of vaporization (MJ kg⁻¹), Δ is the slope of the saturation vapour pressure–temperature curve (kPa K⁻¹), ψ is the psychrometric constant (kPa K⁻¹), α is the dimensionless Priestley and Taylor coefficient (= 1.26), R_n is the net radiation and G is the ground heat flux (W m⁻²)²³.
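To show how Eq. 10 turns radiation and temperature into a potential evaporation in mm day⁻¹, the sketch below evaluates it for a single point. It is not GLEAM code. Several ingredients are assumptions that do not come from the text: the FAO-56 expression for the slope Δ, a constant λ of 2.45 MJ kg⁻¹, a constant ψ of 0.066 kPa K⁻¹ (roughly its sea-level value), and radiation inputs in MJ m⁻² day⁻¹ (1 W m⁻² is 0.0864 MJ m⁻² day⁻¹).

```python
import math

ALPHA_PT = 1.26   # Priestley and Taylor coefficient, as given in the text
LAMBDA = 2.45     # latent heat of vaporization, MJ kg^-1 (assumed constant)
PSI = 0.066       # psychrometric constant, kPa K^-1 (assumed, near sea level)

def slope_saturation_vapour_pressure(t_air_c):
    """Slope of the saturation vapour pressure curve (kPa K^-1), FAO-56 form."""
    es = 0.6108 * math.exp(17.27 * t_air_c / (t_air_c + 237.3))
    return 4098.0 * es / (t_air_c + 237.3) ** 2

def priestley_taylor_ep(rn, g, t_air_c):
    """Potential evaporation in mm day^-1 from Eq. 10.

    rn, g: net radiation and ground heat flux in MJ m^-2 day^-1.
    t_air_c: air temperature in degrees Celsius.
    """
    delta = slope_saturation_vapour_pressure(t_air_c)
    # Dividing the energy term by lambda gives kg m^-2 day^-1, i.e. mm day^-1.
    return ALPHA_PT * delta / (delta + PSI) * (rn - g) / LAMBDA

ep_example = priestley_taylor_ep(rn=15.0, g=0.0, t_air_c=20.0)  # toy inputs
```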

Furthermore, the root zone soil moisture, RSM, is estimated by GLEAM using a multi-layer water balance approach, where the inputs are net precipitation (precipitation minus interception loss) and snow melt, while the outputs are evaporation and drainage. The depth of the root zone is a function of the land cover type: for tall vegetation the model divides the depth into three layers (0–10, 10–100, and 100–250 cm), for low vegetation into two layers (0–10 and 10–100 cm), and for bare soil only one layer is used (0–10 cm). If the land cover is forest, then the interception model developed by Gash (1979)⁴⁰ and improved by Valente et al.⁴¹ is used²³.

Both E_p and RSM are well-validated products that have been used in multiple studies¹⁵. RSM is validated using in-situ soil moisture measurements from the International Soil Moisture Network (ISMN)⁴²; the validation gives an average correlation ranging from 0.49 to 0.64 across multiple datasets. Unlike RSM, E_p is not validated directly; rather, the actual terrestrial evaporation E_a, which is based on E_p, is validated. The in-situ measurements for validating terrestrial evaporation are collected from fluxnet.org, and the results show that the GLEAM-estimated evapotranspiration and the in-situ measurements are highly correlated, with correlations ranging from 0.78 to 0.81 across all the datasets²³.

In this study, GLEAM’s potential evaporation (or “terrestrial evaporation” or “evapotranspiration”) was used to estimate the water deficit from 1981–2018, while root zone soil moisture was used to evaluate the performance of the SPEI estimation.

CRU

CRU provides gridded observations of the whole Earth (except Antarctica) on a monthly temporal resolution and a 0.5° (≈ 50 km) spatial resolution. The observations range from 1901 to 2018. The data consist of primary variables (i.e. mean temperature at 2 m, diurnal temperature range at 2 m and monthly precipitation), secondary variables (i.e. vapor pressure, wet days and cloud cover percentage) and derived variables (i.e. frost days, minimum temperature at 2 m, maximum temperature at 2 m and potential evapotranspiration). CRU data use angular distance weighting (ADW) to interpolate observed data over the gridded land surface. The application of CRU data is very diverse from the sphere of climate research to the global financial and insurance sector⁴³.

The precipitation data from stations around the globe are collected and converted into anomalies using the mean of each individual station. These anomalies are then interpolated over the 0.5° × 0.5° grid using the ADW method and converted back into actual precipitation using climatologies. Potential evaporation is one of the derived variables in the CRU data; it is estimated using the Penman-Monteith (PM) equation⁴⁴, a method approved by the Food and Agriculture Organization (FAO). The method uses the gridded vapor pressure, mean temperature, cloud cover and static average wind field observations. The application of PM in the context of CRU data is explained in the paper by M. Ekström et al.⁴⁵.

The precipitation values are validated using precipitation data from the Deutscher Wetterdienst (DWD) Global Precipitation Climatology Centre (GPCC). The correlation, R, between CRU and GPCC precipitation at the global scale is 0.92; the correlation decreases in the Southern Hemisphere and increases in the Northern Hemisphere, which could be due to the distribution of available observation data⁴³. In this study, we used the precipitation (mm/month) and potential evaporation (mm day⁻¹) data to evaluate the performance of the 0.05° SPEI estimated over Central Asia.

GIMMS NDVI

GIMMS was used to prepare a third-generation NDVI dataset, which covers the whole Earth except Antarctica and is available for the period 1981 to 2015. The data have a spatial resolution of 8 km and a monthly temporal resolution. The GIMMS model estimates NDVI using data from Advanced Very High-Resolution Radiometer (AVHRR) sensors. A Bayesian method was used to derive calibration parameters from the AVHRR NDVI values. The uncertainty evaluation gave an error of ± 0.005 for the entire NDVI dataset⁴⁶.

The dataset is widely accepted and has been used for multiple purposes, for example to evaluate land degradation in the Sahel-Sudanian zone of Africa⁴⁷, monitor biomass production in an ecosystem⁴⁸, assess the vulnerability of agriculture in India to variation in rainfall as a consequence of climate change⁴⁹, and validate a drought index in Africa¹⁵.

Since NDVI values can be used as a proxy for vegetation growth, we used them in this study to validate the performance of our SPEI dataset by investigating the effects of drought on vegetation.

Technical Validation

Inter-comparison between SPEI-HR and SPEI-CRU

A direct comparison between high-resolution and coarse-resolution SPEI for two time scales, 3-month and 12-month, is presented in Fig. 2. The spatial patterns in Fig. 2 are for June 1990. The emerging SPEI patterns in the high- and coarse-resolution data were similar for both time scales. However, the level of detail in the high-resolution dataset provided a better understanding of the drought and its corresponding climate features at a finer scale. In SPEI 12 CRU a large portion of the pattern fell within the range of −0.99 to 0.99, which according to Table 1 can be defined as near normal conditions. However, the same portion, when viewed in SPEI 12 HR, showed that drought conditions over some pixels deviated from the near normal range, indicating possibly moderate, severe, or extreme conditions. This highlights the advantage of developing a high-resolution drought dataset. Furthermore, the different drought patterns between 3-month and 12-month SPEI were due to the different accumulation periods of the water deficit. SPEI results at different time scales help in differentiating between meteorological, agricultural, hydrological, groundwater, or any other type of drought. In Fig. 2 the northern and north-western region around the Aral Sea (60°E, 45°N) shows near normal conditions for the 3-month time scale, but moderate, severe, and extreme drought conditions for the 12-month time scale. Local stakeholders in this region should prepare for a long-lasting drought and should not misjudge the conditions based on short-term near normal patterns.

To assess the difference between SPEI-HR and SPEI-CRU, spatial and temporal correlations were evaluated using Pearson’s coefficient for time scales of 1, 3, 6, 9, 12, 24, 36, and 48 months. Figure 3 presents the results of the temporal analysis, in which the SPEI-HR and SPEI-CRU time series of each pixel for a given time scale were correlated and the results plotted on a map. The temporal analysis indicates that SPEI-HR and SPEI-CRU were highly correlated, with a median correlation of 0.6 or greater for all time scales, as shown in the box and whiskers plot for each time scale in Fig. 3. Even though the correlation values were generally high, they diminished as the time scale increased. For example, the region around 65°E and 50°N had a high correlation for SPEI 01, but the correlation decreased for longer time scales, and negative correlation values were eventually observed for time scales greater than 12 months. Additionally, the spatial correlation analysis, presented in Fig. 4, was carried out by correlating SPEI-HR and SPEI-CRU values across pixels for each month; the monthly correlation values over the entire period were then used to prepare a box and whiskers plot. The spatial analysis indicates that, overall, SPEI-HR and SPEI-CRU agreed well with each other, with positive correlations (R > 0.3), but the median correlation decreased as the time scale increased. Furthermore, except for SPEI 01 and SPEI 03, similar correlation values were observed between different months for each time scale. The relatively low correlations in August and September for SPEI 01 and SPEI 03 could be attributed to the short accumulation periods of the water deficit for these time scales.

It needs to be highlighted that only statistically significant correlation values, which have p-values less than 0.05, were used in evaluating the new SPEI-HR dataset.

SPEI-HR and SPEI-CRU comparison against root zone soil moisture (RSM)

To assess the performance of the new dataset, it was compared with standardized root zone soil moisture from GLEAM. As shown in Fig. 5, the 6-month SPEI was evaluated against RSM. Although both SPEI-HR and SPEI-CRU gave positive correlation values with RSM, the performance of SPEI-CRU was slightly better, specifically in the centre of the map (70°E, 40°N to 45°N). Furthermore, the time series of the area mean, calculated as the spatial average for each month, showed that SPEI-HR and SPEI-CRU gave similar results, whereas RSM followed the general trend but differed slightly from both sets of SPEI. The area mean correlation was 0.66 between SPEI-HR and RSM and 0.71 between SPEI-CRU and RSM, a relative difference of 7.04 percent ((0.71 − 0.66)/0.71). The results in Fig. 5 are only for the 6-month time scale; therefore, the area mean correlation between both sets of SPEI and RSM was also estimated for seven other time scales, as presented in Table 3. The results in Table 3 show that SPEI-HR agrees best with RSM at the 6-month time scale. Since a deficiency in RSM is an indicator of agricultural drought¹⁴, SPEI-HR with a 6-month time scale could be a statistically reasonable choice for monitoring agricultural drought in Central Asia.

Table 3 Statistically significant (p < 0.05) correlation within area mean SPEI, at different time scales, and area mean of RSM.


SPEI-HR and SPEI-CRU comparison against NDVI

To further assess the performance of SPEI-HR, it was compared with the standardized GIMMS NDVI product. NDVI values indicate the health of vegetation and have previously been used for monitoring droughts¹⁴. Therefore, they were correlated with SPEI values. Figure 6 presents the correlation of the 6-month SPEI-HR and SPEI-CRU against NDVI. The emerging pattern shows that, although largely similar, the performance of SPEI-CRU was slightly better than that of SPEI-HR. Interestingly, there were some negative correlations in both datasets, but they were more dominant in SPEI-HR. In general, there were very few high correlations and most correlation values lay below 0.5. The low correlation values could be attributed to the complex seasonal processes that vegetation goes through annually. Furthermore, the growth of vegetation depends on multiple variables and not just water availability, which is the only variable considered in SPEI estimation. Slightly better values were observed for the African continent by Jian Peng et al.¹⁵. The time series in Fig. 6 also show low correlation values between the area means of both SPEI products and NDVI, supporting the results of the spatial correlation plots. The correlations of 6-month SPEI-HR and SPEI-CRU with NDVI were 0.26 and 0.30, respectively. According to Table 4, the time scales with the highest correlation were 6, 9, and 12 months.

Table 4 Statistically significant (p < 0.05) correlations between area mean SPEI, at different time scales, and area mean NDVI.
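A spatial correlation map of the kind discussed above can be obtained by correlating the two products along the time axis at every grid cell. The sketch below is an assumption-laden Python illustration with synthetic data. The function name and array layout (time, lat, lon) are hypothetical, and it does not include the significance testing used for the tables.

```python
import numpy as np

def pixelwise_correlation(a, b):
    """Pearson r along the time axis for two (time, lat, lon) arrays.
    Months where either input is NaN are dropped for that pixel only."""
    valid = np.isfinite(a) & np.isfinite(b)
    a = np.where(valid, a, np.nan)
    b = np.where(valid, b, np.nan)
    a_anom = a - np.nanmean(a, axis=0)
    b_anom = b - np.nanmean(b, axis=0)
    cov = np.nansum(a_anom * b_anom, axis=0)
    denom = np.sqrt(np.nansum(a_anom**2, axis=0) * np.nansum(b_anom**2, axis=0))
    with np.errstate(invalid="ignore", divide="ignore"):
        return np.where(denom > 0, cov / denom, np.nan)

# Synthetic stand-ins: NDVI only weakly tied to SPEI (hypothetical data).
rng = np.random.default_rng(1)
spei = rng.normal(size=(240, 3, 4))
ndvi = 0.3 * spei + rng.normal(size=(240, 3, 4))
ndvi[5, 0, 0] = np.nan  # one missing month at one pixel

r_map = pixelwise_correlation(spei, ndvi)
share_negative = float(np.mean(r_map < 0))      # fraction of pixels with r < 0
share_below_half = float(np.mean(r_map < 0.5))  # fraction of pixels with r < 0.5
```

Summaries such as `share_negative` and `share_below_half` give a compact way to state how common negative or low correlations are in each product.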

Pattern characteristics of SPEI, RSM, and NDVI for certain drought events

Guo et al.² found that Central Asia suffered three periods of severe drought in the last fifty years: 1973–1979, 1983–1988, and 1997–2003. In the same study, the authors identified noticeable drought events in some clustered regions during 2001 and 2008. In addition, FAO and the World Food Programme (WFP) reported that during the severe drought of 2001 the regional agricultural industry incurred damages worth US$800 million, and precipitation and river discharge were below average by 40–60% and 35–40%, respectively. During the drought of 2008, six percent of the population of Kyrgyzstan fell below the poverty line and the wheat harvest in Tajikistan was down by 20–35%⁵⁰,⁵¹.

To evaluate the performance of the new high-resolution SPEI dataset, 6-month SPEI-HR, SPEI-CRU, RSM, and NDVI were compared directly for 2001 (May–September) and 2008 (March–July). As shown in Figs. 7 and 8, SPEI-HR captured drought patterns over time and space similar to those in the low-resolution SPEI-CRU. The slight differences, specifically for the 2001 event, could be attributed to the different precipitation data and E_p estimation methods used by the two datasets. Further comparison revealed that the evolution of 6-month SPEI-HR in both years was very well reflected in RSM. The correspondence between 6-month SPEI-HR and NDVI was very strong for the 2008 drought event but somewhat weaker for the 2001 event. Overall, the four variables successfully demonstrated the progressive drying-out of Central Asia during both events. During the 2001 event, the central and southern regions experienced the most severe drought, and its intensity started to decline in September 2001. During the 2008 event, almost all of Central Asia experienced either severe or extreme drought, except for a small part of the north-western region. This event seemed less severe than that of 2001 but was more spatially widespread and did not ease off during the whole period of observation. Another set of plots was produced using the SPEI threshold provided by Danandeh Mehr et al.⁵²; these plots are available in the Appendix as Figure A3 and Figure A4. The overall drought patterns were similar under both thresholds; therefore, we opted for the more commonly used SPEI threshold. In future, a site-specific stand-alone study might be required to choose the most suitable SPEI threshold for Central Asia.
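To make the role of the threshold concrete, the sketch below maps SPEI values to drought classes and computes the share of an area in severe or extreme drought. This section does not list the numerical class boundaries, so the values used here (-1.0, -1.5, -2.0) are an assumption based on the widely used SPI/SPEI convention. They may differ from the thresholds applied in Figs. 7 and 8 or in Figures A3 and A4. All names are hypothetical.

```python
import numpy as np

# Assumed class boundaries (value <= bound), ordered from most to least severe.
# These follow a commonly used convention and are NOT taken from this section.
THRESHOLDS = [
    (-2.0, "extreme drought"),
    (-1.5, "severe drought"),
    (-1.0, "moderate drought"),
]

def classify_spei(value):
    """Return the drought class label for a single SPEI value."""
    if np.isnan(value):
        return "no data"
    for bound, label in THRESHOLDS:
        if value <= bound:
            return label
    return "no drought"

def drought_area_fraction(spei_map, threshold=-1.5):
    """Fraction of valid grid cells at or below `threshold` (severe or worse by default)."""
    valid = np.isfinite(spei_map)
    return float(np.sum(spei_map[valid] <= threshold) / valid.sum())

# Tiny hypothetical SPEI map for one month.
example = np.array([[-2.1, -1.6], [0.2, np.nan]])
fraction = drought_area_fraction(example)
labels = [classify_spei(v) for v in example.ravel()]
```

Swapping `THRESHOLDS` for another set of boundaries is enough to repeat the comparison between threshold choices mentioned above.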

The results from Figs. 7 and 8 indicate that the SPEI-HR dataset is able to capture drought events. Therefore, the dataset could be used to assess different impacts of droughts and to study atmospheric processes on a finer scale.

Usage Notes

Emerging hot spot analysis

Figure 9 and Figures A1 and A2 (in the Appendix) present the results of the emerging hot spot analysis for multiple time scales of the SPEI-HR dataset. The analysis was conducted for the whole of Central Asia, and barren and non-vegetated lands were not masked. The purpose of this analysis was to identify regions with varying patterns. As shown in Fig. 9, the 6-month time scale was used to observe short-term hot and cold spots, which provide information on agricultural droughts, their trends, and intensities, while the long-term 48-month time scale was used to understand how the water deficit of the region develops over a four-year period at basin and district scale. A positive or high SPEI value indicates a wet region; therefore, in this context a hot spot indicates a region with sufficient water, while a cold spot indicates a dry region.

In this study, only three patterns emerged across the eight time scales: 1) Oscillating hot spot, shaded in blue, which represents wet conditions in the last time step and indicates that the region has been a hot spot for less than 90% of the time and has a history of also being a cold spot. 2) Oscillating cold spot, shaded in red, which represents dry conditions in the last time step and indicates that the region has been a cold spot for less than 90% of the time and has a history of also being a hot spot. 3) No detectable pattern, shaded in grey. Other patterns, such as persistent, new, intensifying, diminishing, consecutive, and sporadic hot or cold spots, did not emerge, possibly because of the seasonal nature of the SPEI values and the very fine resolution of our dataset, which may lead to very high local variation.
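The two oscillating categories defined above can be written as a simple rule on a location's history of hot and cold spot results. The Python sketch below encodes only what the paragraph states. The input encoding (+1 for a significant hot spot, -1 for a significant cold spot, 0 for not significant) and the function name are assumptions, and the remaining categories of a full emerging hot spot analysis are deliberately left out.

```python
def classify_pattern(bins):
    """Classify one location from its time-ordered hot/cold spot results.

    bins: sequence of +1 (hot spot, wet), -1 (cold spot, dry), 0 (not significant).
    Only the two oscillating categories described in the text are detected.
    """
    n = len(bins)
    if n == 0:
        return "other or no detectable pattern"
    hot_share = sum(b == 1 for b in bins) / n
    cold_share = sum(b == -1 for b in bins) / n
    last = bins[-1]
    # Hot in the final step, hot less than 90% of the time, cold at least once.
    if last == 1 and hot_share < 0.9 and cold_share > 0:
        return "oscillating hot spot"
    # Cold in the final step, cold less than 90% of the time, hot at least once.
    if last == -1 and cold_share < 0.9 and hot_share > 0:
        return "oscillating cold spot"
    return "other or no detectable pattern"

wet_now = classify_pattern([-1, 0, 1, -1, 1])  # ends wet after dry spells
dry_now = classify_pattern([1, 1, 0, -1])      # ends dry after wet spells
```

Because SPEI is the input, a final value of +1 corresponds to the blue (wet) shading and -1 to the red (dry) shading described above.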

As presented in Fig. 9, at the 6-month time scale, the north-western region and parts of the central and south-eastern regions were classified as oscillating cold spots and were therefore going through dry conditions, while the rest of Central Asia had wet conditions. The results for the 48-month time scale showed that, unlike SPEI-6, the droughts in the central and western regions affected a smaller area and most of Central Asia seemed to have wet conditions.

Code availability

Climate Data Operators (CDO) from the Max Planck Institute for Meteorology were used to pre-process the data. Functions from the SPEI package in the R programming language were then used to prepare the final code. The code files are provided to the journal as Supplementary Information.
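For readers who want a feel for what a multi-scale SPEI computation involves, the following Python sketch accumulates a monthly water balance (precipitation minus E_p) over a chosen time scale and standardizes it per calendar month. It is not the authors' code and does not reproduce the R SPEI package. In particular, it replaces the fitted log-logistic distribution used by that package with empirical plotting positions, a simplification assumed here to keep the example dependency-free. All function names and the synthetic inputs are hypothetical.

```python
from statistics import NormalDist

import numpy as np

def accumulate(balance, scale):
    """Trailing sum over `scale` months; NaN until enough months are available."""
    out = np.full(balance.shape, np.nan)
    csum = np.concatenate(([0.0], np.cumsum(balance)))
    out[scale - 1:] = csum[scale:] - csum[:-scale]
    return out

def standardize_by_month(acc, start_month=1):
    """Convert accumulated values to standard-normal quantiles per calendar month.
    Simplification: empirical (Gringorten) plotting positions, no distribution fit."""
    z = np.full(acc.shape, np.nan)
    months = (np.arange(acc.size) + start_month - 1) % 12
    inv_cdf = NormalDist().inv_cdf
    for m in range(12):
        idx = np.where((months == m) & np.isfinite(acc))[0]
        if idx.size < 2:
            continue
        ranks = acc[idx].argsort().argsort() + 1
        prob = (ranks - 0.44) / (idx.size + 0.12)
        z[idx] = [inv_cdf(p) for p in prob]
    return z

def spei_sketch(precip, pet, scale):
    """Standardized index of the accumulated climatic water balance."""
    return standardize_by_month(accumulate(precip - pet, scale))

# 38 years of synthetic monthly data for one grid cell (hypothetical values, in mm).
rng = np.random.default_rng(2)
precip = rng.gamma(2.0, 15.0, size=456)
pet = rng.gamma(4.0, 10.0, size=456)
spei6 = spei_sketch(precip, pet, 6)
```

Calling `spei_sketch` with `scale=48` instead of 6 gives the long time scale variant; the first `scale - 1` months are always undefined because the accumulation window is incomplete.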

References

Zhang, M., Chen, Y., Shen, Y. & Li, B. Tracking climate change in Central Asia through temperature and precipitation extremes. Journal of Geographical Sciences 29, 3–28 (2019).
Guo, H. et al. Spatial and temporal characteristics of droughts in Central Asia during 1966–2015. Science of the Total Environment 624, 1523–1538 (2018).
Hu, Z. et al. “Dry gets drier, wet gets wetter”: A case study over the arid regions of central Asia. International Journal of Climatology 39, 1072–1091 (2019).
Pritchard, H. D. Asia’s glaciers are a regionally important buffer against drought. Nature 545, 169–174 (2017).
Center for International Earth Science Information Network - CIESIN - Columbia University. Gridded Population of the World, Version 4 (GPWv4): Population Density, Revision 11. (2018).
Li, Z., Chen, Y., Fang, G. & Li, Y. Multivariate assessment and attribution of droughts in Central Asia. Scientific Reports 7, 1–12 (2017).
WMO. Drought Monitoring and Early Warning: Concepts, Progress, and Future Challenges. https://public.wmo.int/en/resources/library/drought-monitoring-and-early-warning-concepts-progress-and-future-challenges (2006).
Mishra, A. K. & Singh, V. P. A review of drought concepts. Journal of Hydrology 391, 202–216 (2010).
Diaz, V., Corzo Perez, G. A., van Lanen, H. A. J., Solomatine, D. & Varouchakis, E. A. An approach to characterise spatio-temporal drought dynamics. Advances in Water Resources 137, 103512 (2020).
Riebsame, W., Changnon, S. & Karl, T. Drought and Natural Resources Management in the United States: Impacts and Implications of the 1987-89 Drought. (Kluwer Academic Publishers, 1991).
United Nations Environment Program & Harrison, P. GEO Year Book 2006: An Overview of Our Changing Environment. https://digital.library.unt.edu/ark:/67531/metadc28575/ (2006).
Agrawala, S., Barlow, M., Cullen, H. & Lyon, B. The Drought and Humanitarian Crisis in Central and Southwest Asia: A Climate Perspective. https://doi.org/10.7916/D8NZ8FHQ (2001).
Wong, G., Lambert, M. F., Leonard, M. & Metcalfe, A. V. Drought Analysis Using Trivariate Copulas Conditional on Climatic States. Journal of Hydrologic Engineering 15, 129–141 (2010).
Zargar, A., Sadiq, R., Naser, B. & Khan, F. I. A review of drought indices. Environmental Reviews 19, 333–349 (2011).
Peng, J. et al. A pan-African high-resolution drought index dataset. Earth System Science Data 12, 753–769 (2020).
McKee, T. B., Doesken, N. J. & Kleist, J. The relationship of drought frequency and duration to time scales. Preprints, Eighth Conf. on Applied Climatology, Amer. Meteor. Soc. (1993).
Vicente-Serrano, S. M., Beguería, S. & López-Moreno, J. I. A multiscalar drought index sensitive to global warming: The standardized precipitation evapotranspiration index. Journal of Climate 23, 1696–1718 (2010).
Beguería, S., Vicente-Serrano, S. M. & Angulo-Martínez, M. A multiscalar global drought dataset: The SPEI base: A new gridded product for the analysis of drought variability and impacts. Bulletin of the American Meteorological Society 91, 1351–1356 (2010).
Ziese, M. et al. GPCC Drought Index Product (GPCC_DI) at 1.0°. Global Precipitation Climatology Centre at Deutscher Wetterdienst (DWD) (2013).
Meque, A. & Abiodun, B. J. Simulating the link between ENSO and summer drought in Southern Africa using regional climate models. Climate Dynamics 44, 1881–1900 (2015).
Vicente-Serrano, S. M. et al. A high resolution dataset of drought indices for Spain. Data 2, 22, https://doi.org/10.3390/data2030022 (2017).
Funk, C. et al. The climate hazards infrared precipitation with stations - A new environmental record for monitoring extremes. Scientific Data 2, 1–21 (2015).
Martens, B. et al. GLEAM v3: Satellite-based land evaporation and root-zone soil moisture. Geoscientific Model Development 10, 1903–1925 (2017).
Feng, R., Yu, R., Zheng, H. & Gan, M. Spatial and temporal variations in extreme temperature in Central Asia. International Journal of Climatology 38, e388–e400 (2018).
World Bank. Drought: Management and Mitigation Assessment for Central Asia and the Caucasus. https://openknowledge.worldbank.org/handle/10986/8724 (2005).
Stagge, J. H., Tallaksen, L. M., Gudmundsson, L., van Loon, A. F. & Stahl, K. Candidate Distributions for Climatological Drought Indices (SPI and SPEI). International Journal of Climatology 35, 4027–4040 (2015).
Beguería, S., Vicente-Serrano, S. M., Reig, F. & Latorre, B. Standardized precipitation evapotranspiration index (SPEI) revisited: parameter fitting, evapotranspiration models, tools, datasets and drought monitoring. International Journal of Climatology 34, 3001–3023 (2014).
Stagge, J. H., Tallaksen, L. M., Xu, C. Y. & van Lanen, H. A. J. Standardized precipitation-evapotranspiration index (SPEI): Sensitivity to potential evapotranspiration model and parameters. In Hydrology in a Changing World: Environmental and Human Dimensions (Proceedings of FRIEND-Water 2014), IAHS Publ. 363, 367–373 (2014).
Zhao, M., Geruo, A., Velicogna, I. & Kimball, J. S. A global gridded dataset of GRACE drought severity index for 2002-14: Comparison with PDSI and SPEI and a case study of the Australia millennium drought. Journal of Hydrometeorology 18, 2117–2129 (2017).
Friedl, M. A. et al. MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sensing of Environment 114, 168–182 (2010).
Törnros, T. & Menzel, L. Addressing drought conditions under current and future climates in the Jordan River region. Hydrology and Earth System Sciences 18, 305–318 (2014).
Getis, A. & Ord, J. K. The Analysis of Spatial Association by Use of Distance Statistics. Geographical Analysis 24, 189–206 (1992).
Khajehei, S., Ahmadalipour, A., Shao, W. & Moradkhani, H. A Place-based Assessment of Flash Flood Hazard and Vulnerability in the Contiguous United States. Scientific Reports 10, (2020).
Kaiser, M., Günnemann, S. & Disse, M. Spatiotemporal analysis of heavy rain-induced flood occurrences in Germany using a novel event database approach. Journal of Hydrology 595, 125985 (2021).
Pyarali, K., Peng, J., Disse, M. & Tuo, Y. High resolution Standardized Precipitation Evapotranspiration Index (SPEI) dataset for Central Asia, NERC EDS Centre for Environmental Data Analysis. https://doi.org/10.5285/feb1e0b5426d4f5c80f791909a3a2d37 (2022).
Aksu, H. & Akgül, M. A. Performance evaluation of CHIRPS satellite precipitation estimates over Turkey. Theoretical and Applied Climatology 142, 71–84 (2020).
Bai, L., Shi, C., Li, L., Yang, Y. & Wu, J. Accuracy of CHIRPS satellite-rainfall products over mainland China. Remote Sensing 10, 362 (2018).
Prakash, S. Performance assessment of CHIRPS, MSWEP, SM2RAIN-CCI, and TMPA precipitation products across India. Journal of Hydrology 571, 50–59 (2019).
Priestley, C. H. B. & Taylor, R. J. On the Assessment of Surface Heat Flux and Evaporation Using Large-Scale Parameters. Monthly Weather Review 100 (1972).
Gash, J. H. C. An analytical model of rainfall interception by forests. Quarterly Journal of the Royal Meteorological Society 105, 43–55 (1979).
Valente, F., David, J. S. & Gash, J. H. C. Modelling interception loss for two sparse eucalypt and pine forests in central Portugal using reformulated Rutter and Gash analytical models. Journal of Hydrology 190, 141–162 (1997).
Dorigo, W. A. et al. The International Soil Moisture Network: A data hosting facility for global in situ soil moisture measurements. Hydrology and Earth System Sciences 15, 1675–1698 (2011).
Harris, I., Osborn, T. J., Jones, P. & Lister, D. Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset. Scientific Data 7, 1–18 (2020).
Allen, R., Pereira, L., Raes, D. & Smith, M. Crop evapotranspiration - Guidelines for computing crop water requirements. (FAO - Food and Agriculture Organization of the United Nations, 1998).
Ekström, M. et al. Regional climate model data used within the SWURVE project projected changes in seasonal patterns and estimation of PET. Hydrology and Earth System Sciences 11, 1069–1083 (2007).
Pinzon, J. E. & Tucker, C. J. A non-stationary 1981-2012 AVHRR NDVI3g time series. Remote Sensing 6, 6929–6960 (2014).
Fensholt, R. & Rasmussen, K. Analysis of trends in the Sahelian “rain-use efficiency” using GIMMS NDVI, RFE and GPCP rainfall data. Remote Sensing of Environment 115, 438–451 (2011).
Wu, D. et al. Evaluation of spatiotemporal variations of global fractional vegetation cover based on GIMMS NDVI data from 1982 to 2011. Remote Sensing 6, 4217–4239 (2014).
Ramachandran, K. et al. Assessment of vulnerability of Indian agriculture to rainfall variability – Use of NOAA-AVHRR (8 km) and MODIS (250 m) time-series NDVI data products. Climate Change and Environmental Sustainability 1, 37 (2013).
WFP. Climate Risk and Food Security in the Kyrgyz Republic: An Overview on Climate Trends and the Impact on Food Security. https://www.wfp.org/publications/climate-risk-and-food-security-kyrgyz-republic-overview-climate-trends-and-impact-food-security-and-livelihoods (2014).
FAO. Drought characteristics and management in Central Asia and Turkey (2017).
Danandeh Mehr, A., Sorman, A. U., Kahya, E. & Hesami Afshar, M. Climate change impacts on meteorological drought using SPI and SPEI: case study of Ankara, Turkey. Hydrological Sciences Journal 65, 254–268 (2020).
Liu, C., Yang, C., Yang, Q. & Wang, J. Spatiotemporal drought analysis by the standardized precipitation index (SPI) and standardized precipitation evapotranspiration index (SPEI) in Sichuan Province, China. Scientific Reports 11, (2021).
Wang, Q. et al. A multi-scale daily SPEI dataset for drought characterization at observation stations over mainland China from 1961 to 2018. Earth System Science Data 13, 331–341 (2021).

Acknowledgements

We would like to acknowledge the researchers behind CRU, CHIRPS, GLEAM, GIMMS and CIESIN for providing meteorological, environmental and socio-economic data required to successfully conclude this study.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Chair of Hydrology and River Basin Management, Technical University of Munich, Arcisstrasse 21, 80333, Munich, Germany
Karim Pyarali, Markus Disse & Ye Tuo
Department of Remote Sensing, Helmholtz Centre for Environmental Research−UFZ, Permoserstrasse 15, 04318, Leipzig, Germany
Jian Peng
Remote Sensing Centre for Earth System Research, Leipzig University, 04103, Leipzig, Germany
Jian Peng

Contributions

K.P., J.P. and Y.T. designed the approach. K.P. collected and processed the data, prepared the code, prepared figures and tables, conducted literature review and wrote the manuscript. Y.T. worked closely with the first author, provided guidance throughout the process, reviewed and edited the manuscript and supervised the entire study. J.P. and M.D. reviewed and edited the manuscript.

Corresponding author

Correspondence to Ye Tuo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Pyarali, K., Peng, J., Disse, M. et al. Development and application of high resolution SPEI drought dataset for Central Asia. Sci Data 9, 172 (2022). https://doi.org/10.1038/s41597-022-01279-5

Received: 05 August 2021
Accepted: 07 March 2022
Published: 14 April 2022
DOI: https://doi.org/10.1038/s41597-022-01279-5
