Climatologies at high resolution for the earth’s land surface areas

Karger, Dirk Nikolaus; Conrad, Olaf; Böhner, Jürgen; Kawohl, Tobias; Kreft, Holger; Soria-Auza, Rodrigo Wilber; Zimmermann, Niklaus E.; Linder, H. Peter; Kessler, Michael

doi:10.1038/sdata.2017.122

Download PDF

Data Descriptor
Open access
Published: 05 September 2017

Climatologies at high resolution for the earth’s land surface areas

Dirk Nikolaus Karger^1,2,
Olaf Conrad³,
Jürgen Böhner³,
Tobias Kawohl³,
Holger Kreft ORCID: orcid.org/0000-0003-4471-8236⁴,
Rodrigo Wilber Soria-Auza^4,5,
Niklaus E. Zimmermann²,
H. Peter Linder¹ &
…
Michael Kessler¹

Scientific Data volume 4, Article number: 170122 (2017) Cite this article

65k Accesses
2322 Citations
92 Altmetric
Metrics details

Subjects

Abstract

High-resolution information on climatic conditions is essential to many applications in environmental and ecological sciences. Here we present the CHELSA (Climatologies at high resolution for the earth’s land surface areas) data of downscaled model output temperature and precipitation estimates of the ERA-Interim climatic reanalysis to a high resolution of 30 arc sec. The temperature algorithm is based on statistical downscaling of atmospheric temperatures. The precipitation algorithm incorporates orographic predictors including wind fields, valley exposition, and boundary layer height, with a subsequent bias correction. The resulting data consist of a monthly temperature and precipitation climatology for the years 1979–2013. We compare the data derived from the CHELSA algorithm with other standard gridded products and station data from the Global Historical Climate Network. We compare the performance of the new climatologies in species distribution modelling and show that we can increase the accuracy of species range predictions. We further show that CHELSA climatological data has a similar accuracy as other products for temperature, but that its predictions of precipitation patterns are better.

Design Type(s)	data integration objective • modeling and simulation objective
Measurement Type(s)	temperature of air • hydrological precipitation process
Technology Type(s)	data acquisition system
Factor Type(s)
Sample Characteristic(s)	Earth • planetary atmosphere

Machine-accessible metadata file describing the reported data (ISA-Tab format)

Global daily 1 km land surface precipitation based on cloud cover-informed downscaling

Article Open access 26 November 2021

High-resolution terrestrial climate, bioclimate and vegetation for the last 120,000 years

Article Open access 14 July 2020

High-resolution monthly precipitation and temperature time series from 2006 to 2100

Article Open access 23 July 2020

Background & Summary

High-resolution climate data are essential to many applications in environmental and ecological sciences. Whereas many studies in these fields are conducted at a resolution of ~1 km², state-of-the-art global climate reanalyses often only represent climatic variation at spatial resolutions of 0.25°–1° (ca. 25–100 km at the equator). The gap between these spatial scales may be bridged using satellite data (CHIRPS¹, TRMM^2,3) and statistical downscaling^4–7 for a specific region of interest and/or interpolation methods applied to meteorological station data (WorldClim⁸, CRU⁹, GPCC¹⁰, PRISM¹¹). Climatologies based on satellites or statistical downscaling, considered superior to interpolated data¹² for ecological applications, are currently either not available on a global scale or are still too coarse to reflect the small scale patterns needed in ecological studies. While interpolated datasets⁸ often perform well in matching precipitation or temperature of the stations from which they are produced, they often fail to accurately predict patterns between stations. This is particularly problematic in highly variable terrain with low station density¹³. Whereas some interpolated datasets use elevation as a predictor (e.g., WorldClim⁸) and observations such as the Global Historical Climate Network (GHCN)^14,15 to achieve a high-resolution prediction, it is also possible to use predictors from global circulation models (e.g., from the National Center for Atmospheric Research (NCEP)¹⁶, or the European Centre for Medium-Range Weather Forecast (ECMWF) climatic reanalysis interim (ERA-Interim)¹⁷).

Although interpolation and statistical downscaling approaches may also integrate land-surface predictors such as elevation, slope or aspect, satisfactory results still require a more or less regular distribution of meteorological stations and a proper representation of topo-climatic settings¹³. However, the global distribution of meteorological stations is highly biased by funding and accessibility, leading to a poor representation of climatic variability in mountainous regions or areas with intact lowland rainforest, such as the Amazon or Congo basin^8,13. On a global scale, it is also difficult to find generally valid transfer functions between predictors and the climatic variable of interests, especially for highly non-linear phenomena such as precipitation. Statistical downscaling is problematic¹⁸, especially on a global scale, due to temporal variation in the spatial distribution of weather stations. Although measurements for a given predictor might be available in a given month, they might be absent in another, leading to a generally high heterogeneity of the underlying climate records when time series of precipitation need to be calculated. While this does not affect static predictors such as elevation, slope, or aspect, statistical downscaling becomes especially problematic when highly dynamic predictors such as wind fields need to be integrated. The heterogeneity in the temporal and spatial distribution of such dynamic factors can also lead to spurious correlations in specific months or in specific regions, which can severely influence regression model parameters. When specific predictors, such as windward or leeward mountain sides^19,20 change over the course of the year, the location of the climatic records does not change accordingly. Therefore, regression-based downscaling might, for example, detect a significant negative relationship between a station on the windward site of a mountain for one month, and a positive relationship for another, although atmospheric physics would always predict a positive relationship. Due to this problem, statistical downscaling and interpolation methods have often been applied to single regions¹¹, while a global model is lacking.

For ecological applications, the representation of the temporal and spatial variability of temperature and precipitation is, however, extremely important to infer ecological niches, growing seasons, species migrations, or small scale species distribution. Errors in the underlying climatic dataset at this small spatial scale can easily inflate in such studies¹³, which calls for an improvement of climatic information available for such analyses.

To overcome the problem of heterogeneous spatial and temporal distribution of meteorological station data, we use a Model Output Statistics algorithm for data provided from the ERA-Interim reanalysis¹⁷ which we correct using gauge-derived products from the GPCC¹⁰ and the GHCN^14,15 datasets. The results are improved climatologies for precipitation and temperature at high spatial resolution for environmental and ecological studies, which might prove valuable in varied scientific applications that rely on a good representation of small scale precipitation and temperature patterns.

Methods

Calculation of monthly temperature and precipitation values

ERA-Interim (developed at the European Centre for Medium-Range Weather Forecast, ECMWF), simulates six-hourly large-scale atmospheric fields for 60 pressure levels between 1,000 and 1 hPa globally with a horizontal resolution of 0.75° lat/long (T255)^17,21,22. Since the ERA-Interim reanalysis combines modelling results with ground and radiosonde observations as well as remote sensing data using a data assimilation system, the free-atmospheric and surface fields can be considered as the best approximation of the current large-scale atmospheric situation for every time step. Several studies show that ERA-Interim adequately captures the variability of relevant free-air meteorological parameters, even over complex terrain^23–25.

Temperature

Spatial variation in temperature is to a large degree determined by the vertical state of the troposphere and thus, if not affected by inversion layers, temperature decreases with increasing altitude^26,27. The long term mean hypsometric temperature gradient covered in the ERA-Interim data accurately reflects the vertical distribution of moist- or dry-adiabatic lapse rates²⁰. Typical temperature lapse rates are in the order of −0.4 to −0.8 K/100 m with a characteristic seasonality. The corresponding temperature distribution pattern in the free atmosphere²⁸ can be assumed to be directly related to surface elevation¹⁹.

For our downscaling approach of mean monthly temperatures, we used the monthly means of daily mean temperature derived from six-hourly synoptic data from ERA-Interim. Temperature lapse rates were calculated from the ERA-Interim for pressure levels from 1,000 to 300 hPa, using linear regression for each ERA-Interim grid cell. We then interpolated temperature to sea level using the derived lapse rates. Sea level temperatures were then interpolated between grid cells using B-spline interpolation, and then projected back on the elevational surface of the digital elevation model using the equation:

\begin{matrix} (1) & t = Γ_{d} * e l e v + t_{0} \end{matrix}

where t equals the temperature at a given elevation, Γ_d equals the lapse rate, elev equals elevation at 30 arc sec. from the Global Multi-resolution Terrain Elevation Data 2010 (GMTED2010)²⁹ of the United States Geological Survey (USGS) and the National Geospatial-Intelligence Agency (NGA), and t₀ equals the interpolated temperature at sea level.

Maximum and minimum temperatures

Maximum (t_max) and minimum (t_min) temperatures were calculated using climatological aided interpolation³⁰. For that we used the mean monthly temperature values (t) and added, or subtracted the maximum or minimum daily temperature derived from the three-hourly data of minimum or maximum temperature since previous post processing data fields in ERA-Interim:

\begin{matrix} (2) & t_{m a x} = t + Δ t_{e r a \_m a x} \end{matrix}

\begin{matrix} (3) & t_{m i n} = t - Δ t_{e r a \_m i n} \end{matrix}

where Δt_{era_max} and Δ t_{era_min} are the respective differences between maximum and minimum temperatures interpolated to 30 arc sec resolution using B-spline interpolation from the mean monthly temperatures (t).

Precipitation

Elevation is one of the main topo-climatic drivers of vertical precipitation gradients, but the relation between elevation and precipitation can be idiosyncratic^19,31–36. In the convective regimes of the tropics, precipitation amounts commonly increase up to the condensation level at about 1,000–1,500 m above the ground surface, and the exponentially decreasing air moisture content in the mid- to upper troposphere results in a corresponding drying above the condensation level of tropical convection cluster systems (non-linear precipitation lapse rates)³⁷. Likewise, negative lapse rates typically occur in the extremely dry polar climates. At mid-latitudes and in the subtropics, the frequent or even prevalent advection of moisture bearing air to high altitudes leads to increasing precipitation with increasing elevation. Consequently, the summits of high mountain ranges such as the Alps³⁸ may have high rainfall, and this leads to linear precipitation lapse rates³⁹. The reduced precipitation at lower elevations is due, firstly, to the evaporation of rain drops when falling through non-saturated, lower-air levels. Secondly, the vertical precipitation gradient in high mountain ranges is often increased due to the diurnal formation of autochthonous upslope breezes. This upward flow of air intensifies cloud and precipitation formation in upper slope positions whilst the subsiding branch of these autochthonous local circulation systems along the valley axis leads to cloud dissolution and a corresponding reduction of precipitation rates in the valley bottoms. We approximated such orographic precipitation effects and used them as a parameter for the CHELSA precipitation downscaling algorithm (Fig. 1) as explained below.

Wind effect correction

Orographic precipitation patterns⁴⁰ caused by the uplift of moist air currents at the windward side of a mountain range and the intimately related rain shadow effect on leeward sides induced by the blockage of moisture-bearing air are most common effects influencing small-scale precipitation patterns^38,40–43. Based on the assumption that the windward impact on the precipitation intensity depends on the prevailing wind direction at any given elevation of an orographic barrier, we used a wind index^19,20 to account for the expected higher precipitation at the windward sites of an orographic barrier.

We used u-wind and v-wind components at the 10-m level of ERA-Interim as underlying wind components. These two wind components were interpolated to the CHELSA grid resolution using a B-spline interpolation. As the calculation of a windward leeward index (hereafter: wind effect) requires a projected coordinate system, both wind components were projected to a world Mercator projection and then combined to a directional grid. The wind effect H with windward component H_W and the leeward component H_L were then calculated using:

\begin{matrix} (4) & H_{W} = \frac{\sum_{i = 1}^{n} \frac{1}{d_{WH i}} t a n^{- 1} (\frac{d_{WZ i}}{d_{WH i}})}{\sum_{i = 1}^{n} \frac{1}{d_{LH i}}} + \frac{\sum_{i = 1}^{n} \frac{1}{d_{LH i}} t a n^{- 1} (\frac{d_{LZ i}}{d_{LH i}})}{\sum_{i = 1}^{n} \frac{1}{d_{LH i}}} \end{matrix}

\begin{matrix} (5) & H_{L} = \frac{\sum_{i = 1}^{n} \frac{1}{d_{WH i}} t a n^{- 1} (\frac{d_{LZ i}}{d_{WH i}})}{\sum_{i = 1}^{n} \frac{1}{d_{LH i}}} \end{matrix}

where d_WHi and d_LHi refer to the horizontal distances in windward and leeward direction and d_WZi and d_LZi are the corresponding vertical distances compared with the considered raster cell. The second summand in equation (4) accounts for the leeward impact of previously traversed mountain chains. The horizontal distances in equation (5) lead to a longer-distance impact of leeward rain shadow. The final wind-effect parameter, which is assumed to be related to the interaction of the large-scale wind field and the local-scale precipitation characteristics, is calculated as H=H_L×H_W and generally takes values between 0.7 for leeward and 1.3 for windward positions¹⁹. Equation (3) and equation (4) were applied to each grid cell at the CHELSA resolution in a world mercator projection.

Valley exposition correction

Although the wind effect algorithm can distinguish between the windward and leeward sites of an orographic barrier, it cannot distinguish extremely isolated valleys in high mountain areas. Such dry valleys are situated in areas where the wet air masses flow over an orographic barrier and are prevented from flowing into deep valleys. To account for these effects, we used a variant of equation (4) and equation (5) with a linear search distance of 300 km in steps of 5° from 0° to 355° circular for each grid cell. The calculated leeward index was then scaled towards higher elevations using:

\begin{matrix} (6) & E = H_{L}^{\frac{e l e v}{c}} \end{matrix}

which rescales the strength of the exposition index relative to elevation (elev) from GMTED2010, and gives valleys at high elevations larger wind isolations (E) than valleys located at low elevations. The correction constant c was set to 9,000 m to include all possible elevations of the DEM. The constant has been set to 9,000 m as values of elev >c could lead to a reverse relationship between elev and H_L. Additionally, a prior sensitivity analysis indicated that downscaled precipitation with c=9,000 m has a better fit with precipitation measured at the stations (GHCN stations) than values of c >9,000 m. We therefore choose to set c conservatively to 9,000 m.

Boundary layer correction

Orographic precipitation effects are less pronounced just above the surface, as well as in the free atmosphere above the planetary boundary layer^11,44,45. The highest impact of orography is considered just at the boundary layer height where the airflow interacts with the terrain. While former studies used single ERA pressure levels, known to represent the main wind field patterns in a specific area²⁰, the pressure level representing the prevailing wind directions at the boundary layer is usually not known a priory on a global basis. We therefore used the boundary layer height B from ERA-Interim as indicator of the pressure level that has the highest contribution to the wind effect. The boundary layer height has been interpolated to the CHELSA resolution using a B-spline interpolation. The wind effect grid H containing the windward (H_W) and leeward (H_L) index values was then proportionally distributed to all grid cells falling within a respective 0.75° grid cell using:

\begin{matrix} (7) & H_{W B} = \frac{H_{W}}{1 - (\frac{| d | - d_{m a x}}{c})} \end{matrix}

\begin{matrix} (8) & H_{L B} = \frac{H_{L}}{1 - (\frac{| d | - d_{m a x}}{c})} \end{matrix}

with:

\begin{matrix} (9) & d = e l e v - B \end{matrix}

With d being the distance between a grid cell and the boundary layer height B, d_max being the maximum distance between the boundary layer height B and all grid cells at the CHELSA resolution falling within a respective 0.75° grid cell, c being a constant of 9,000 m, and elev being the respective elevation from GMTED2010.

with:

\begin{matrix} (10) & B = B_{E R A} + e l e v_{E R A} + f \end{matrix}

Bbeing the height of the monthly means of daily mean boundary layer from ERA-Interim, elev_ERA being the elevation of the ERA-Interim grid cell, and f being a constant of 500 m which takes into account that the level of highest precipitation is not necessarily at the lower bound of the boundary layer, but slightly higher^44,45. Similar to the c value in equations (equation (6),equation (7),8) we used a prior sensitivity analysis that varied f in steps of 50 m to determine the impact of f on the modelled precipitation values. Values of 500 >f<500 showed to a lower fit between modelled precipitation and precipitation measured at stations.

Precipitation data from ERA-Interim

For accumulated parameters (total monthly precipitation), we used the monthly means of daily forecast accumulations of total precipitation initialized at the synoptic hours 0:00 and 12:00. To calculate monthly precipitation sums, we added the synoptic monthly means at time 0:00, step 12 and time 12:00, step 12 and multiplied it by the number of days in the respective month.

Bias correction of ERA-Interim data using GPCC and GHCN data

Model-generated estimates of the surface precipitation are extracted from short range forecasts, which vary with forecast length. This drift in the short-range forecasts can be a problem for monthly and climatic means⁴⁶. One very common approach is to calculate the difference between baseline precipitation from the GCM and the observed precipitation and apply this ‘factor of change’ to historically observed time series to generate a synthetic time series^47–49. We therefore performed three steps of bias correction.

Monthly bias correction

We applied the monthly bias correction before the downscaling of the precipitation data on the ERA-interim precipitation values p_ERA directly⁴⁹. To this end, we used the monthly values p_GPCC of the gridded GPCC dataset¹⁰ to calculate the monthly bias R_m caused by the ERA-Interim parametrization, and the excessive or insufficient precipitation of the forecast algorithm⁴⁶ for each month from Jan. 1979–Dec. 2013 using:

\begin{matrix} (11) & R_{m} = \frac{p_{G P C C}}{p_{E R A}} \end{matrix}

We only used grid cells with meterological stations present for R_m. The forecast algorithm used to produce the precipitation amounts for ERA-Interim exhibits a considerable spin up—spin down effect (too much or too less precipitation), that has a coherent spatial structure, with a larger bias over high elevation terrain, or specific land forms such as tropical rainforests⁴⁶. Based on this observation, we assumed that grid cells without stations share a similar bias as their neighbouring stations. To achieve a gap-free grid surface, we interpolated the gaps in the R_m grid using a multilevel B-spline interpolation with 14 error levels to a 0.75° resolution. The gap-free bias correction surface R_m was then multiplied with the ERA-Interim precipitation p_ERA to get the bias corrected monthly precipitation sums p_m at 0.75° resolution:

\begin{matrix} (12) & p_{m} = p_{E R A} * R_{m} \end{matrix}

Monthly precipitation including orographic effects

To achieve the distribution of monthly precipitation sums p including orographic effects, we used a linear relationship between the monthly bias corrected precipitation grids at the ERA resolution p_m and the boundary layer corrected wind effect surface H:

\begin{matrix} (13) & p = \frac{H}{\bar{H}} * p_{m} \end{matrix}

where $\bar{H}$ is the mean wind effect at ERA resolution. By using a linear relationship we archive that the data are to scale, e.g., the precipitation at 0.75° resolution exactly matches the mean precipitation at all 30°sec cells within the range of a 0.75° cell.

Station bias correction

We used precipitation from a set of meteorological stations from GHCN, MeteoSwiss, and DWD to correct the remaining error between p and p_Station. We calculated the bias ratio between p and p_Station and interpolated the bias ratio using a multilevel B-spline interpolation with 14 error levels to a 0.1° grid which matches the spatial accuracy of many GHCN stations. The resulting bias surface was then multiplied with p to achieve the final monthly precipitation estimates.

Climatologies

We calculated the climatologies as the mean monthly sum of precipitation in the years 1979–2013 for each month. As slight errors in the precipitation sums can, however, accumulate over time, we applied an additional bias correction step using the GPCC Climatology Version 2015⁵⁰. We used the cells at which stations are present, calculated the bias between the annual accumulations and the GPCC climatology, and used a multilevel B-spline interpolation of the biases to a 0.25 grid to create the bias surface. This bias surface was then multiplied with the mean annual precipitation sums to create the final climatologies.

Bioclimatic parameters

From the monthly temperature and precipitation values, we additionally calculated a set of derived parameters often used in ecological applications. These bioclimatic variables are derived variables from the monthly mean, min, max, mean temperature, and mean precipitation values. These variables are specifically developed for species distribution modelling and related ecological applications. They represent annual averages (e.g., mean annual temperature, annual precipitation), seasonality (e.g., annual range in temperature and precipitation), and extreme or limiting environmental factors (e.g., temperature of the coldest and warmest month, and precipitation of the wet and dry quarters). A quarter is defined as the period of three months (1/4 of the year). The procedure strictly followed that of WorldClim⁸ and ANUCLIM⁵¹. The equations used to calculate the bioclimatic variables (where applicable) are:

t=monthly temperature [°C]

p=monthly precipitation [mm]

\begin{matrix} (14) & bio1 : (\sum_{i = 1}^{12} t_{i}) / 12 \end{matrix}

\begin{matrix} (15) & bio2 : (\sum_{i = 1}^{12} (t_{m a x} - t_{m i n})) / 12 \end{matrix}

\begin{matrix} (16) & bio3 : (t * (t_{m a x} - t_{m i n}) / t_{m a x} - t_{m i n}) * 100 \end{matrix}

\begin{matrix} (17) & bio4 : (\sqrt{{\frac{1}{11} \sum_{i = 1}^{12} (t_{i} - (\sum_{i = 1}^{12} t_{i} / 12))}^{2}}) * 100 \end{matrix}

\begin{matrix} (18) & bio7 : t_{m a x} - t_{m i n} \end{matrix}

\begin{matrix} (19) & bio12 : \sum_{i = 1}^{12} t_{i} \end{matrix}

\begin{matrix} (20) & bio15 : (\sqrt{\frac{1}{11} \sum_{i = 1}^{12} (p_{i} -} (\sum_{i = 1}^{12} p_{i} / 12))^{2}) / (\sum_{i = 1}^{12} p_{i}) / 12) \end{matrix}

Not listed here are the variables which are based on quarters (3 consecutive months) or specific (wettest, driest, warmest, coldest) months.

Code availability

The codes used to calculate CHELSA climatologies are written in C++ and are included in SAGA Version 2.2.7, freely available at www.saga-gis.org under the GNU public license including the necessary source codes. Calculations were done in SAGA Version 2.2.7 on the ‘Science Cloud’ cloud computing facility of the University of Zurich www.s3it.uzh.ch/infrastructure/sciencecloud/.

Data Records

The CHELSA data contains records for monthly mean temperature in °C and precipitation values in mm/month, and derived bioclimatic variables for the reference period 1979–2013 in form of GeoTIFF files. The climatologies available for download have been derived from monthly values of precipitation and temperature. The files are freely available at www.chelsa-climate.org as well as Dryad (Data Citation 1).

The file format is GeoTIFF.

Naming convention:

CHELSA_<variable><z-scale>_<month>_<Version>_land.tif

variable: prec=precipitation [mm/month]

temp=monthly mean of daily mean temperature [°C*10]

tmax=monthly mean of daily maximum temperature [°C*10]

tmin=monthly mean of daily minimum temperature [°C*10]

CHELSA_bio<z-scale>_<bioclim-variable>_<Version>_land.tif

bioclim-variable: 1=Annual Mean Temperature [°C*10]

2=Mean Diurnal Range [°C]

3=Isothermality

4=Temperature Seasonality [standard deviation]

5=Max Temperature of Warmest Month [°C*10]

6=Min Temperature of Coldest Month [°C*10]

7=Temperature Annual Range [°C*10]

8=Mean Temperature of Wettest Quarter [°C*10]

9=Mean Temperature of Driest Quarter [°C*10]

10=Mean Temperature of Warmest Quarter [°C*10]

11=Mean Temperature of Coldest Quarter [°C*10]

12=Annual Precipitation [mm/year]

13=Precipitation of Wettest Month [mm/month]

14=Precipitation of Driest Month [mm/month]

15=Precipitation Seasonality [coefficient of variation]

16=Precipitation of Wettest Quarter [mm/quarter]

17=Precipitation of Driest Quarter [mm/quarter]

18=Precipitation of Warmest Quarter [mm/quarter]

19=Precipitation of Coldest Quarter [mm/quarter]

Technical Validation

To validate the results of the CHELSA algorithm and the different bias correction steps applied, we use a statistical cross-validation, compared the results with several comparable products that are available at comparable spatial and temporal resolution, and independent meteorological station data. The climatologies are validated in two steps. First, as the effects of orographic winds are a non-stationary phenomenon, we show a validation of the time series (hereafter: reanalysis) from which the climatologies are created. Second, we show a validation of the final climatological products which are available for download.

Cross-validation of the bias correction method using monthly stations

To validate the results of the bias correction method using meteorological station data, we employed a cross-validation approach based on repeated split-resampling. We randomly omitted 20% of the stations for validation and used the remaining 80% for the bias correction by multilevel B-spline interpolation. We repeated the randomization 20 times per month for each respective year from 1979–2013. The mean R² values after cross-validation ranged between 0.53 and 0.90 with a mean of R²=0.77 and a root mean squared error (RMSE) ranging from 30.06–189.12 mm (mean=54.69 mm) globally throughout the years. A small increase in variance throughout the years can be observed, which might be due to the decrease in the number of stations throughout the years from 13,680 (Jan. 1979) to 1951 (Dec. 2013).

Validation of the orographic precipitation patterns

The CHELSA algorithm distributes the mean precipitation measured in a grid cell onto the expected precipitation pattern which in turn is calculated based on the wind effect and valley exposition indices. To test whether the inclusion of wind effect and valley exposition patterns produce a higher accuracy, we compared the fit between station data and precipitation in cells of 0.75° spatial resolution (correction step 1) and compared it with the fit between station data and corrected precipitation at 30 arc sec. We performed this comparison in five topographically complex regions (Table 1).

Table 1 Difference in R² values before and after the downscaling from 0.75° resolution to 30 arc sec resolution by means of orographic wind effect correction globally and for five topographic complex regions.

Full size table

The inclusion of the small-scale orographic effects generally leads to a better fit to the station data in all complex areas in CHELSA compared to WorldClim⁸. The improvement can range from 19.67 % variance explained during the Himalayan wet season (Table 1) to even a decrease in variance of −3.67 % variance explained during August in the Alps. In the majority of cases, however, the variance explained between measured station data and orographically corrected precipitation generally increases during this step of the algorithm. The remaining variance between stations and orographic predictors is finally removed using the subsequent bias correction steps.

Small-scale fit between stations and final climatology

To compare the fit of GHCN stations¹⁴ with the final climatologies, we calculated a linear regression model for each station separately. For each station, the surrounding stations in 2° distance where included (with a minimum of 16 stations). We then regressed the mean annual precipitation measured at the station with the mean annual precipitation derived from six different models that either use GHCN data in their algorithms (CHELSA, CRU⁹, WorldClim⁸, GPCC⁵⁰) or do not use GHCN directly (ERA-Interim¹⁷).

From this linear regression approach two inferences can be drawn. First, the comparison between ERA-Interim, GPCC, and CHELSA shows the increase in fit with the specific corrections which are applied in the CHELSA algorithm, as ERA-Interim and GPCC contribute data to CHELSA. Second, the results show how well the stations correspond to the respective models at small spatial scale.

Among the models which use GHCN in their algorithm, CHELSA shows the highest fit between stations and predicted precipitation, with WorldClim, GPCC and CRU showing smaller, but still high fits with the station data (Fig. 2).

**Figure 2: Small scale comparison of model fit with station data from GHCN^14,15 for annual precipitation sums derived from six different models.**

Validation using independent precipitation station data

A statistical comparison with different datasets is complicated by the fact that most gridded temperature and precipitation datasets are parameterized using similar observational data, leading to generally high correlations between climatic reanalyses. To validate the results of the CHELSA algorithm, we identified several independent datasets of various size and temporal extents. None of these have been used in the final bias correction within the algorithm, and we have additionally screened them for duplicates in the GHCN¹⁵, MeteoSwiss, and DWD datasets. As the station data is of different spatial extent, it allows us to validate the accuracy of CHELSA on the global scale, as well as on the very small target scale of 30 arc°. We split the validation in two parts. One part examines the temporal performance of the reanalysis dataset (Table 2) and the other the climatological performance of the dataset (Table 3). For comparison with other reanalysis products, we also calculated a similar validation for the CRU⁹ and ERA-Interim¹⁷ datasets. For comparison with other climatologies, we included the CHPclim⁵², CRU, ERA-Interim, and WorldClim⁸ datasets.

Table 2 Reanalysis validation using independent station data.

Full size table

Table 3 Climatological validation using independent climate station data.

Full size table

Precipitation validation data:

1
FAO—data: 2,316 stations
2
Mexico—data: 2,950 stations
3
Austria—Ehyd data: 877 stations
4
South Africa—SAEON data: 14 stations
5
Scandinavia—Nordklim data: 11 stations
6
China—CMA data: 241 stations

FAO data validation results

The FAO data obtained from the Agromet Group of the Food and Agriculture Organization of the United Nations (FAO) is a collection of 2,316 stations with a good representation in many typically data-sparse regions, but many stations only have a short measuring period. The data is global and duplicates in the GHCN data report have been removed.

CHELSA shows high correspondence of R²=0.83 throughout the years with the FAO dataset, while CRU and ERA-interim show considerably lower values (R²=0.73 & R²=0.51, respectively) (Table 2). The root mean square error (RMSE) and mean absolute error between CHELSA and the FAO data is not significantly different from those of the other validation datasets (with the exception of the SAEON dataset) (Table 2).

For the climatological validation, CHELSA performs similar to CHPclim, and WorldClim (Table 3). All three climatologies, however, already include FAO data in some way, which explains the close fit among the data. CHPclim and WorldClim use them in their station interpolation, and in the case of CHELSA, FAO data are only included in the GPCC data that have been used for the bias correction at the large scale, but not in the GHCN data that have been used in the monthly bias interpolation step. Both CRU and ERA-Interim perform considerably mediocre when compared to FAO data (Table 3). However, this comparison is only partly valid and only shows the increase of fit when station data is included into a precipitation downscaling or interpolation algorithm.

Mexico data validation results

The Mexico data consist of 2,950 stations with a dense spatial distribution, but with only a short measuring period for many stations.

None of the reanalysis datasets are able to capture the temporal variation in the station dataset well (Table 2). The average R² of CHELSA only reaches 0.39, which is still slightly better than the performance of CRU and ERA-Interim. The RMSE and MAE values are also lower in their mean, as well as in their standard deviation. The poor performance of all products might be due to the fact that many meteorological stations in this dataset have missing values.

The climatological performance of all models with the Mexico dataset is slightly better than that of the reanalysis dataset (Table 3). WorldClim shows the highest fit with stations, which is not surprising, as WorldClim already includes most of the stations for its original calibration and the Mexico data set is therefore not independent from the WorldClim climatologies. CHELSA shows the second highest fit with the Mexico data and is slightly better than CHPclim in all three metrics (R², RMSE, MAE). Era-Interim and CRU do not capture the climatological precipitation in this area well, in comparison to the other three datasets.

Austria Ehyd data validation results

The Ehyd data from the Federal Ministry of Agriculture, Forestry, Environment and Water Management of Austria comprises of a dense net of 877 precipitation stations in Austria.

The overall performance of all precipitation products is low when compared to the Ehyd stations (Table 2). From all models however, CHELSA performs best, with the highest R², and lowest errors. For the climatologies CHELSA performs second best after CHPclim, but with all climatologies having a comparably low fit with the station data (Table 3). WorldClim shows the lowest fit with station data. In general the overall performance of the climatologies is comparable to that of the reanalysis.

Skandinavia—Nordklim data

The Nordklim data 1.0 includes observations of twelve climate variables from more than 119 stations in the Nordic region including precipitation and air temperature, in a time span of over 100 years. The data are provided by NORDKLIM/NORDMET on behalf of the National meteorological services in Denmark (DMI), Finland (FMI), Iceland (VI), Norway (DNMI) and Sweden (SMHI). We screened these stations for duplicates in the GHCN dataset and remained with a set of 11 independent stations which we used for the validation.

All reanalysis products track the temporal signal in the data reasonably well, with CRU slightly outperforming CHELSA, and ERA-Interim performing the worst (Table 2). All models, however, show relatively small errors in this region, and are only slightly different in their temporal signal.

CHELSA, WorldClim, and CRU climatologies fit the Nordklim data well, with CHELSA performing better than the other models, despite the fact that Nordklim data are included in WorldClim but not in CHELSA (Table 3). ERA-Interim and CHPclim do not perform well in this region, which is probably due to the larger errors of remote sensing data in arctic regions, on which both models depend.

China—CMA data

The precipitation data from China comes from the Chinese Meteorological Administration and consists of 241 stations with daily records that are not included in the GHCN dataset. CHELSA and CRU are able to track the temporal signal in precipitation rather well when compared to the CMA data, with ERA-Interim performing less well (Table 2).

For the climatological means, CRU slightly outperforms CHELSA, WorldClim shows a slightly worse performance compared to the former two, and CHPclim and ERA-Interim show the lowest performance in this region, with R² values slightly below 0.5 (Table 3).

South Africa—SAEON data validation results

One of the main purposes of CHELSA is the better representation of precipitation gradients at small spatial scales. The SAEON precipitation stations are located in the Jonkershoek valley in South Africa with a strong elevational gradient, from the entrance to the valley to the watershed at the top of the valley. They additionally have a very high interannual variability and have not been included in any global precipitation product. Although the timespan of the dataset is too low to validate the climatological performance, the dataset can be used to track the performance of models in a very complex terrain and a strong seasonality.

For all stations of the SAEON network, CHELSA predicts the temporal variation and the actual precipitation values best compared to the ERA-Interim, CRU and CHIRPS¹ reanalysis products (Fig. 3). All reanalysis products predict the temporal variation in precipitation well, but they differ in the respective errors. All of them underestimate the extremes of precipitation in the region covered by the SAEON data.

**Figure 3: Temporal precision of the CHELSA reanalysis which forms the basis for the climatologies in a small region of South Africa.**

Large-scale spatial comparison of precipitation patterns

To compare our precipitation data with those of other products, we first compared the spatial patterns of precipitation with those of the Tropical Rainfall Measuring Mission (TRMM)^2,3 combined multisatellite product TRMM/TMPA (3B43)⁵³, CRU⁹, WorldClim⁸, CHPclim⁵², GPCC⁵⁰, and ERA-Interim¹⁷. Figure 4 shows the bias of all mentioned products with the CRU dataset. We used the CRU dataset as a comparison, as it is not included in the other datasets, and therefore the most independent. TRMM/TMPA (3B43), WorldClim, GPCC are all using similar stations, and ERA-Interim is known to exhibit large biases in precipitation. All products, with the exception of ERA-Interim show similar amounts and patterns of biases when compared to CRU data. The bias of CHELSA is lower than that of GPCC and ERA-Interim, the two datasets which have been used in the correction algorithm. The large scale comparison however, only serves as a guideline for the deviation in the above mentioned products in general and cannot be seen as independent validation. For regions in which all models exhibit large biases, we would urge caution in the use of a single precipitation product and would suggest the use of multiple models from various sources.

**Figure 4: Bias ratio comparison of annual precipitation sums for six different climatologies with the CRU⁹ climatology at the global scale.**

Small-scale comparison of precipitation patterns

To highlight small scale performance of CHELSA, we compared precipitation patterns of three different models in the topographically and climatically highly complex terrain of Bhutan (Fig. 5). A comparison of the annual precipitation totals between TRMM/TMPA (3B43)⁵³, WorldClim⁸, CHELSA, and the statistical downscaling approach of Böhner³¹ shows similar patterns between all models at the mesoscale. The differences at the microscale are, however, severe between CHELSA and Böhner³¹ compared to WorldClim⁸. There are only few climate stations in the region of Bhutan, which creates spurious correlations between elevation and precipitation in the ANUSPLIN algorithm of WorldClim. CHELSA and Böhner show a more consistent relation between the terrain features and the resulting precipitation patterns. A comparison with the patterns of cloud formations in this region⁵⁴ shows similarities in the patterns where clouds form and where higher precipitation amounts are predicted by CHELSA and Böhner (Fig. 5). Although the formation of clouds does not necessarily coincide with rainfall, there is generally a high correlation between the formation of clouds and the patterns of rainfall especially in topographically complex terrain⁵⁵. We therefore assume that our model is able to capture the topographic heterogeneity of precipitation at the small spatial scale well.

Figure 5: Comparison of precipitation patterns in the complex terrain of Bhutan (country boundaries in black) between TRMM/TMPA (3B43)⁵³, WorldClim⁸, CHELSA, the statistical downscaling approach of Böhner³¹, the topography from GMTED2010²⁹, and the cloud cover climatology from Wilson & Jetz⁵⁴.

Validation of temperature using independent meteorological stations

We compared CHELSA temperature data to that of MODIS (MOD11C3)⁵⁶ and several independent station datasets. Other high resolution products for temperature such as WorldClim do not have the same validation period as CHELSA. A comparison is therefore problematic due to the increase of global temperatures in the last decades⁵⁷. PRISM¹¹ is geographically restricted to the United States and therefore also not available for global comparisons. As climate station data not directly used by the CHELSA algorithm for temperature, a comparison with station data is possible. We used a set of four different station networks with different temporal and spatial extent for the validation.

Temperature validation data:

1
FAO—data: 400 stations
2
Mexico—data: 2,915 stations
3
GHCN—data: 6,093 stations
4
Scandinavia—Nordklim data: 32 stations

The downscaled CHELSA data tracks the temperature data well in the GHCN, and FAO datasets, but larger deviations in the Mexico and Nordklim datasets with regard to the R² values (Table 4). However, the RMSE and MAE of the Nordklim dataset are comparable to those of the two global datasets GHCN and FAO. Only the comparison with the Mexico dataset shows a high RMSE of 1.95, and MAE of 1.43 (Table 4).

Table 4 Reanalysis validation using independent station data for temperature.

Full size table

The climatologies show a lower RMSE and MAE than the time series when compared to all station data (Table 5). This indicates that although an error exists in the monthly CHELSA reanalysis temperatures, the error does not inflate when these values are averaged. R² values are also higher for the climatologies, than for the reanalysis values.

Table 5 Climatological validation using independent station data.

Full size table

CHELSA—MODIS comparison

Coefficients of determination between MODIS (MOD11C3)⁵⁶ and CHELSA temperatures range from 0.95 to 0.99 globally, between GHCN Version 3 and CHELSA temperatures range 0.96 to 0.99 globally, and between MODIS (MOD11C3) and GHCN Version 2 range from 0.83–0.97 (Fig. 6). Both CHELSA and MODIS (MOD11C3) show systematically lower correlations during the northern summer months for the GHCN dataset which might indicate erroneous temperature values in the GHCN dataset. The deviations might, however, also come from the overestimation of temperatures in the arctic by remote sensing data observed as well in the MODIS (MOD11C3) and the ERA-Interim data^17,58. As MODIS (MOD11C3) and ERA-Interim data are showing a similar bias, we can assume that the deviations either stem from the GHCN dataset or the remote sensing input to ERA-Interim and not the downscaling algorithm we use.

**Figure 6: Temporal comparison of the CHELSA algorithm with GHCN Version 3 (temperature)¹⁴, and MODIS (MOD11C3)⁵⁶.**

The high spatial correlation between CHELSA and MODIS (MOD11C3)⁵⁶ shows that CHELSA is able to predict spatial patterns of temperature distributions well, and additionally accurately predicts the observed values of temperature on a small scale.

Application example: Performance for species distribution modelling

As we are generally interested in the use of CHELSA climatologies for ecological studies we compare the performance of CHELSA in a species distribution modelling (SDM) approach^59,60 to the most commonly used climate dataset for this purpose: WorldClim⁸. We calculated SDMs for 67 species from Switzerland. We used species from Switzerland as this allows a comparison of performances in areas where climate station density is high in CHELSA and WorldClim, and tests whether the performance improvement of CHELSA is also found in areas with detailed climate data. We modelled species using a generalized linear model with mean annual precipitation and mean annual temperature as predictors. We randomly sampled six times as many pseudo-absence points as presence points and used an inverse weighting approach on the resulting presences and absences. We evaluated the models in a 10-fold cross-validation using the area under the receiver operating characteristic curve (AUC), Kappa statistics⁶¹ and true skill statistic⁶² (TSS). AUC and Kappa are traditional test measures between predicted and observed data and usually ranges from 0.5 (AUC) or 0 (Kappa), indicating random fit, to 1, indicating perfect fit. TSS assesses model specificity and sensitivity and ranges from zero (both the specificity and the sensitivity of the model are zero) to 1 (both specificity and sensitivity are 1). Additionally, we calculated the adjusted D² value which represents the percentage deviance explained by (goodness-of-fit of) the model. Model performance for all 67 species was then compared using a paired t-test of all species distribution models.

The result shows an improvement of the models when using CHELSA over WorldClim data (Fig. 7). All measures show a higher performance of the CHELSA data, although the difference in mean is not significant for the TSS. This shows that even in areas with comparably high station density and good climatic information the CHELSA algorithm improves the spatial prediction of climatic variables and subsequently the modelled distribution of species (Fig. 8).

Figure 7: Comparison of the performance of 67 species distribution models using generalized linear models with WorldClim (blue) and CHELSA (red) precipitation and temperatures in Switzerland as predictors.

**Figure 8: Comparison of species distribution models based and climate data from WorldClim⁸ and CHELSA of *Astragalus monspessulanus* for Switzerland.**

Validation results—Conclusions

The validation results in general show that including orographic effects can improve existing climatologies and reanalysis to a degree that the derived analysis (here the SDMs) show increasing accuracies. While CHELSA is an improvement over existing very high-resolution climatologies, it still exhibits errors which we quantified in several ways. The validation of main correction step in the algorithm that includes the orographic wind effects and boundary layer shows that the precipitation at the stations is better captured after the correction than before the downscaling to 30 arc sec resolution. The improvement varies by region and month with the majority of months showing an improvement. Most importantly, the better prediction with regard to SDMs in which precipitation and temperature data are used already indicates that CHELSA might be a substantial improvement over existing products which are currently being employed for such purposes.

Usage Notes

All CHELSA products are in a geographic coordinate system referenced to the WGS 84 horizontal datum, with the horizontal coordinates expressed in decimal degrees. The CHELSA layer extents (minimum and maximum latitude and longitude) are a result of the coordinate system inherited from the 1-arc-second GMTED2010 data which itself inherited the grid extent from the 1-arc-second SRTM data.

Note that because of the pixel center referencing of the input GMTED2010 data the full extent of each CHELSA grid as defined by the outside edges of the pixels differs from an integer value of latitude or longitude by 0.000138888888 degree (or 1/2 arc-second). Users of products based on the legacy GTOPO30 product should note that the coordinate referencing of CHELSA (and GMTED2010) and GTOPO30 are not the same. In GTOPO30, the integer lines of latitude and longitude fall directly on the edges of a 30-arc-second pixel. Thus, when overlaying CHELSA with products based on GTOPO30 a slight shift of 1/2 arc-second will be observed between the edges of corresponding 30-arc-second pixels.

The dataset is in GEOtiff format. GEOtiff can be viewed using standard GIS software such as:

SAGA GIS—(free) http://www.saga-gis.org/

ArcGIS—https://www.arcgis.com/

QGIS—(free) http://www.qgis.org

DIVA—GIS—(free) http://www.diva-gis.org/

GRASS—GIS—(free) https://grass.osgeo.org/

Grid extent:

Resolution (decimal degrees): 0.0083333333

West extent (minimum X-coordinate, longitude): −180.0001388888

South extent (minimum Y-coordinate, latitude): −90.0001388888

East extent (maximum X-coordinate, longitude): 179.9998611111

North extent (maximum Y-coordinate, latitude): 83.9998611111

Rows: 20,800

Columns: 43,200

The data are feely available under the Creative Commons Licence: CC 0.

Additional Information

How to cite this article: Karger, D. N. et al. Climatologies at high resolution for the earth’s land surface areas. Sci. Data 4:170122 doi: 10.1038/sdata.2017.122 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Funk, C. et al. The climate hazards infrared precipitation with stations—a new environmental record for monitoring extremes. Sci. Data 2, 150066 (2015).
Article Google Scholar
Biasutti, M., Yuter, S. E., Burleyson, C. D. & Sobel, A. H. Very high resolution rainfall patterns measured by TRMM precipitation radar: seasonal and diurnal cycles. Clim. Dyn 39, 239–258 (2011).
Article Google Scholar
Huffman, G. J. et al. The TRMM Multisatellite Precipitation Analysis (TMPA): Quasi-Global, Multiyear, Combined-Sensor Precipitation Estimates at Fine Scales. J. Hydrometeorol. 8, 38–55 (2007).
Article ADS Google Scholar
Maraun, D. et al. Precipitation downscaling under climate change: Recent developments to bridge the gap between dynamical models and the end user. Rev. Geophys. 48, RG3003 (2010).
Article ADS Google Scholar
Wood, A. W., Leung, L. R., Sridhar, V. & Lettenmaier, D. P. Hydrologic Implications of Dynamical and Statistical Approaches to Downscaling Climate Model Outputs. Clim. Change 62, 189–216 (2004).
Article Google Scholar
Wilby, R. L. et al. Statistical downscaling of general circulation model output: A comparison of methods. Water Resour. Res. 34, 2995–3008 (1998).
Article ADS Google Scholar
Schmidli, J., Frei, C. & Vidale, P. L. Downscaling from GCM precipitation: a benchmark for dynamical and statistical downscaling methods. Int. J. Climatol. 26, 679–689 (2006).
Article Google Scholar
Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G. & Jarvis, A. Very high resolution interpolated climate surfaces for global land areas. Int. J. Climatol. 25, 1965–1978 (2005).
Article Google Scholar
Harris, I., Jones, P. d., Osborn, T. J. & Lister, D. H. Updated high-resolution grids of monthly climatic observations—the CRU TS3.10 Dataset. Int. J. Climatol. 34, 623–642 (2014).
Article Google Scholar
Schneider, U. et al. GPCC’s new land surface precipitation climatology based on quality-controlled in situ data and its role in quantifying the global water cycle. Theor. Appl. Climatol. 115, 15–40 (2013).
Article ADS Google Scholar
Daly, C., Taylor, G. H. & Gibson, W. P. The PRISM approach to mapping precipitation and temperature. in Proc., 10th AMS Conf. on Applied Climatology 20–23 (1997).
Deblauwe, V. et al. Remotely sensed temperature and precipitation data improve species distribution modelling in the tropics. Glob. Ecol. Biogeogr 25, 443–454 (2016).
Article Google Scholar
Soria-Auza, R. W. et al. Impact of the quality of climate models for modelling species occurrences in countries with poor climatic documentation: a case study from Bolivia. Ecol. Model. 221, 1221–1229 (2010).
Article Google Scholar
Lawrimore, J. H. et al. An overview of the Global Historical Climatology Network monthly mean temperature data set, version 3. J. Geophys. Res. Atmospheres 116, 1–18 (2011).
Article Google Scholar
Peterson, T. C. & Vose, R. S. An overview of the Global Historical Climatology Network temperature database. Bull. Am. Meteorol. Soc 78, 2837–2849 (1997).
Article Google Scholar
Kalnay, E. et al. The NCEP/NCAR 40-Year Reanalysis Project. Bull. Am. Meteorol. Soc 77, 437–471 (1996).
Article Google Scholar
Dee, D. P. et al. The ERA-Interim reanalysis: configuration and performance of the data assimilation system. Q. J. R. Meteorol. Soc 137, 553–597 (2011).
Article ADS Google Scholar
Wilby, R. L. & Wigley, T. M. L. Downscaling general circulation model output: a review of methods and limitations. Prog. Phys. Geogr. 21, 530–548 (1997).
Article Google Scholar
Böhner, J., Antonic, O., Böhner, J. & Antonic, O. in Geomorphometry: Concepts, Software, Applications (eds Hengl T. & Reuter H. I. ) 195–226 (Elsevier Science, 2009).
Book Google Scholar
Gerlitz, L., Conrad, O. & Böhner, J. Large-scale atmospheric forcing and topographic modification of precipitation rates over High Asia—a neural-network-based approach. Earth Syst Dynam 6, 61–81 (2015).
Article ADS Google Scholar
Berrisford, P. et al. The ERA-interim archive. ERA Rep. Ser 1–16 (2009).
Berrisford, P. et al. Atmospheric conservation properties in ERA-Interim. Q. J. R. Meteorol. Soc 137, 1381–1399 (2011).
Article ADS Google Scholar
Gao, L. et al. Statistical Downscaling of ERA-Interim Forecast Precipitation Data in Complex Terrain Using LASSO Algorithm, Statistical Downscaling of ERA-Interim Forecast Precipitation Data in Complex Terrain Using LASSO Algorithm. Adv. Meteorol. Adv. Meteorol. e472741 (2014).
Bao, X. & Zhang, F. Evaluation of NCEP-CFSR, NCEP-NCAR, ERA-Interim, and ERA-40 Reanalysis Datasets against Independent Sounding Observations over the Tibetan Plateau. J. Clim 26, 206–214 (2012).
Article ADS Google Scholar
Betts, A. K., Köhler, M. & Zhang, Y. Comparison of river basin hydrometeorology in ERA-Interim and ERA-40 reanalyses with observations. J. Geophys. Res. Atmospheres 114, D02101 (2009).
Article ADS Google Scholar
Hansen, J., Sato, M. & Ruedy, R. Radiative forcing and climate response. J. Geophys. Res. Atmospheres 102, 6831–6864 (1997).
Article CAS ADS Google Scholar
Rolland, C. Spatial and Seasonal Variations of Air Temperature Lapse Rates in Alpine Regions. J. Clim 16, 1032–1046 (2003).
Article ADS Google Scholar
Minder, J. R., Mote, P. W. & Lundquist, J. D. Surface temperature lapse rates over complex terrain: Lessons from the Cascade Mountains. J. Geophys. Res. Atmospheres 115, D14122 (2010).
Article ADS Google Scholar
Danielson, J. J. & Gesch, D. B. Global multi-resolution terrain elevation data 2010 (GMTED2010). (US Geological Survey, 2011).
Book Google Scholar
Hunter, R. D. & Meentemeyer, R. K. Climatologically Aided Mapping of Daily Precipitation and Temperature. J. Appl. Meteorol. 44, 1501–1510 (2005).
Article ADS Google Scholar
Böhner, J. General climatic controls and topoclimatic variations in Central and High Asia. Boreas 35, 279–295 (2006).
Article Google Scholar
Spreen, W. C. A determination of the effect of topography upon precipitation. Eos Trans. Am. Geophys. Union 28, 285–290 (1947).
Article Google Scholar
Gao, X., Xu, Y., Zhao, Z., Pal, J. S. & Giorgi, F. On the role of resolution and topography in the simulation of East Asia precipitation. Theor. Appl. Climatol. 86, 173–185 (2006).
Article ADS Google Scholar
Basist, A., Bell, G. D. & Meentemeyer, V. Statistical Relationships between Topography and Precipitation Patterns. J. Clim 7, 1305–1315 (1994).
Article ADS Google Scholar
Daly, C., Neilson, R. P. & Phillips, D. L. A Statistical-Topographic Model for Mapping Climatological Precipitation over Mountainous Terrain. J. Appl. Meteorol. 33, 140–158 (1994).
Article ADS Google Scholar
Sevruk, B. Regional Dependency of Precipitation-Altitude Relationship in the Swiss Alpsin Climatic Change at High Elevation Sites (eds Diaz, H. F., Beniston, M. & Bradley, R. S. ) 123–137 (Springer Netherlands, 1997).
Chapter Google Scholar
Körner, C. The use of ‘altitude’ in ecological research. Trends Ecol. Evol. 22, 569–574 (2007).
Article Google Scholar
Rotunno, R. & Houze, R. A. Lessons on orographic precipitation from the Mesoscale Alpine Programme. Q. J. R. Meteorol. Soc 133, 811–830 (2007).
Article ADS Google Scholar
Weischet, W. & Endlicher, W. Einführung in die allgemeine Klimatologie (2008).
Roe, G. H. Orographic Precipitation. Annu. Rev. Earth Planet. Sci. 33, 645–671 (2005).
Article CAS ADS Google Scholar
Colle, B. A. Sensitivity of Orographic Precipitation to Changing Ambient Conditions and Terrain Geometries: An Idealized Modeling Perspective. J. Atmospheric Sci 61, 588–606 (2004).
Article ADS Google Scholar
Sinclair, M. R. A Diagnostic Model for Estimating Orographic Precipitation. J. Appl. Meteorol. 33, 1163–1175 (1994).
Article ADS Google Scholar
Smith, R. B. & Barstad, I. A Linear Theory of Orographic Precipitation. J. Atmospheric Sci 61, 1377–1391 (2004).
Article ADS Google Scholar
Oke, T. R. . Boundary layer climates. Routledge, (2002).
Book Google Scholar
Stull, R. B. An introduction to boundary layer meteorology 13 (Springer Science & Business Media, 2012).
MATH Google Scholar
Kållberg, P. Forecast drift in ERA-Interim (European Centre for Medium Range Weather Forecasts, 2011).
Google Scholar
Lafon, T., Dadson, S., Buys, G. & Prudhomme, C. Bias correction of daily precipitation simulated by a regional climate model: a comparison of methods. Int. J. Climatol. 33, 1367–1381 (2013).
Article Google Scholar
Arnell, N. W., Hudson, D. A. & Jones, R. G. Climate change scenarios from a regional climate model: Estimating change in runoff in southern Africa. J. Geophys. Res. Atmospheres 108, 4519 (2003).
Article ADS Google Scholar
Molteni, F. A. ‘historical’ approach to the rescaling of ERA-Interim precipitation, internal technical note (European Centre for Medium Range Weather Forecasts, 2013).
Google Scholar
Meyer-Christoffer, A. et al. GPCC Climatology Version 2015 at 0.25°: Monthly Land-Surface Precipitation Climatology for Every Month and the Total Year from Rain-Gauges built on GTS-based and Historic Data. Global Precipitation Climatology Centre at Deutscher Wetterdienst doi: 10.5676/DWD_GPCC/CLIM_M_V2015_025 (2015).
Xu, T. & Hutchinson, M. F. New Developments and Applications in the ANUCLIM Spatial Climatic and Bioclimatic Modelling Package. Env. Model Softw 40, 267–279 (2013).
Article Google Scholar
Funk, C. et al. A global satellite-assisted precipitation climatology. Earth Syst Sci Data 7, 275–287 (2015).
Article ADS Google Scholar
Goddard Space Flight Center Distributed Active Archive Center (GSFC DAAC). TRMM/TMPA 3B43 TRMM and Other Sources Monthly Rainfall Product V7 (2011).
Wilson, A. M. & Jetz, W. Remotely Sensed High-Resolution Global Cloud Dynamics for Predicting Ecosystem and Biodiversity Distributions. PLOS Biol 14, e1002415 (2016).
Article Google Scholar
Pruppacher, H. R., Klett, J. D. & Wang, P. K. Microphysics of Clouds and Precipitation. Aerosol Science and Technology 28, 381–382 (1998).
Article ADS Google Scholar
NASA LP DAAC. MODIS/Terra Land Surface Temperature and Emissivity Monthly L3 Global 0.05Deg CMG. NASA EOSDIS Land Processes DAAC, USGS Earth Resources Observation and Science (EROS) Center, (2015).
Stocker, T. F. et al. IPCC, 2013: climate change 2013: the physical science basis. Contribution of working group I to the fifth assessment report of the intergovernmental panel on climate change (2013).
Wan, Z., Zhang, Y., Zhang, Q. & Li, Z.-L. Quality assessment and validation of the MODIS global land surface temperature. Int. J. Remote Sens. 25, 261–274 (2004).
Article ADS Google Scholar
Guisan, A. & Zimmermann, N. E. Predictive habitat distribution models in ecology. Ecol. Model. 135, 147–186 (2000).
Article Google Scholar
Guisan, A. & Thuiller, W. Predicting species distribution: offering more than simple habitat models. Ecol. Lett. 8, 993–1009 (2005).
Article Google Scholar
Warren, D. L., Glor, R. E. & Turelli, M. Environmental Niche Equivalency Versus Conservatism: Quantitative Approaches to Niche Evolution. Evolution 62, 2868–2883 (2008).
Article Google Scholar
Allouche, O., Tsoar, A. & Kadmon, R. Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS). J. Appl. Ecol. 43, 1223–1232 (2006).
Article Google Scholar

Data Citations

Karger, D. N. Dryad Digital Repository https://doi.org/10.5061/dryad.kd1d4 (2017)

Download references

Acknowledgements

M.K. & D.N.K. would like to acknowledge funding from the Swiss National Funds (SNF 147630, SNF 146906). We thank Sergio Maffioletti for implementing the CHELSA algorithm on the science cloud grid computing facility of the University of Zurich. We further thank Stefan Eggenberg and InfoFlora for access to 67 plant species for test modelling.

Author information

Authors and Affiliations

Department of Systematic and Evolutionary Botany, University of Zurich, Zollikerstrasse 107, Zurich, 8008, Switzerland
Dirk Nikolaus Karger, H. Peter Linder & Michael Kessler
Swiss Federal Research Institute WSL, Zürcherstr 111, Birmensdorf, 8903, Switzerland
Dirk Nikolaus Karger & Niklaus E. Zimmermann
Institute of Geography, University of Hamburg, Bundesstrasse 55, Hamburg, 20146, Germany
Olaf Conrad, Jürgen Böhner & Tobias Kawohl
Biodiversity, Macroecology & Conservation Biogeography Group, University of Göttingen, Göttingen, 37077, Germany
Holger Kreft & Rodrigo Wilber Soria-Auza
Asociación Armonía, Av. Lomas de Arena # 400, Zona Palmasola, 10260, Santa Cruz de la Sierra, Bolivia
Rodrigo Wilber Soria-Auza

Authors

Dirk Nikolaus Karger
View author publications
You can also search for this author in PubMed Google Scholar
Olaf Conrad
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Böhner
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Kawohl
View author publications
You can also search for this author in PubMed Google Scholar
Holger Kreft
View author publications
You can also search for this author in PubMed Google Scholar
Rodrigo Wilber Soria-Auza
View author publications
You can also search for this author in PubMed Google Scholar
Niklaus E. Zimmermann
View author publications
You can also search for this author in PubMed Google Scholar
H. Peter Linder
View author publications
You can also search for this author in PubMed Google Scholar
Michael Kessler
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.K. initiated the project. D.N.K., O.K., and T.K. developed the algorithms in close communication with J.B. R.W.S. compiled the GHCN data and removed the errors. M.K., H.K., P.L., and N.Z. provided the funding for the project. D.N.K. wrote the first draft of the manuscript and all authors contributed significantly to the revisions.

Corresponding author

Correspondence to Dirk Nikolaus Karger.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

ISA-Tab metadata

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.

Reprints and permissions

About this article

Cite this article

Karger, D., Conrad, O., Böhner, J. et al. Climatologies at high resolution for the earth’s land surface areas. Sci Data 4, 170122 (2017). https://doi.org/10.1038/sdata.2017.122

Download citation

Received: 11 October 2016
Accepted: 21 July 2017
Published: 05 September 2017
DOI: https://doi.org/10.1038/sdata.2017.122

This article is cited by

The potential range of west Asian apple species Malus orientalis Uglitzk. under climate change
- Łukasz Walas
- Shirin Alipour
- Saud Alamri
BMC Plant Biology (2024)
Ecology and geography of Cache Valley virus assessed using ecological niche modeling
- John A. Muller
- Krisangel López
- Albert J. Auguste
Parasites & Vectors (2024)
Exploring the potential effects of forest urbanization on the interplay between small mammal communities and their gut microbiota
- Marie Bouilloud
- Maxime Galan
- Nathalie Charbonnel
Animal Microbiome (2024)
Risk factors for tick attachment in companion animals in Great Britain: a spatiotemporal analysis covering 2014–2021
- Elena Arsevska
- Tomislav Hengl
- Alan D. Radford
Parasites & Vectors (2024)
Elevation affects both the occurrence of ungulate browsing and its effect on tree seedling growth for four major tree species in European mountain forests
- Marianne Bernard
- Julien Barrere
- Georges Kunstler
Annals of Forest Science (2024)

Subjects

Abstract

Similar content being viewed by others

Background & Summary

Methods

Calculation of monthly temperature and precipitation values

Temperature

Maximum and minimum temperatures

Precipitation

Wind effect correction

Valley exposition correction

Boundary layer correction

Precipitation data from ERA-Interim

Bias correction of ERA-Interim data using GPCC and GHCN data

Monthly bias correction

Monthly precipitation including orographic effects

Station bias correction

Climatologies

Bioclimatic parameters

Code availability

Data Records

Technical Validation

Cross-validation of the bias correction method using monthly stations

Validation of the orographic precipitation patterns

Small-scale fit between stations and final climatology

Validation using independent precipitation station data

FAO data validation results

Mexico data validation results

Austria Ehyd data validation results

Skandinavia—Nordklim data

China—CMA data

South Africa—SAEON data validation results

Large-scale spatial comparison of precipitation patterns

Small-scale comparison of precipitation patterns

Validation of temperature using independent meteorological stations

Temperature validation data:

CHELSA—MODIS comparison

Application example: Performance for species distribution modelling

Validation results—Conclusions

Usage Notes

Additional Information

References

References

Data Citations

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

ISA-Tab metadata

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links