Background & Summary

Human activities have transformed most of the world’s terrestrial landscapes1, resulting in accelerated resource exploitation, environmental deterioration, biodiversity loss, and climate change2,3,4. Growing human populations5 and increasing wealth in many regions6 will inevitably propel further development to meet the rising demands for food, water, energy, and other land-based resources. Predicting and managing forthcoming development is essential to minimize the impacts of large-scale expansion on natural habitats and their services and to promote ecological and socioeconomic sustainability7. Anticipating where future development may occur requires mapping of land that is suitable to human activities8 (e.g., growing crops, expanding housing development, establishing a mine), often taking into consideration the area’s biophysical criteria (e.g., prevailing climate, soil, topography), land use or administrative constraints (e.g., compatible land types, protected areas), and/or socio-economic factors (e.g., accessibility to markets or infrastructure) that are associated with the target development9,10.

Global land suitability mapping has aided our understanding of the expansion of many development sectors, including cropland or biofuel expansion, renewable energy sources (i.e., solar, wind, and hydropower), fossil fuels, and mining. However, previous global assessments for food and biofuel suitability are largely binary maps for “croplands” (e.g.11,12,13) or focus on marginal and abandoned land potential for biofuel production (e.g.14,15). A few cropland assessments account for social or policy constraints (e.g.13,16), but none globally map feasibility of land conversion based on factors of yield potential and access to infrastructure to distinguish relative conversion pressure8. Global mapping of renewable energy potential maps have incorporated only simple land constraints17,18,19 or select few spatial development feasibility factors (e.g., market accessibility that considers distance to urban areas, load centers, and transmission lines20,21,22,23,24, or site construction and operational costs21,24,25), at times doing so only post-hoc to categorize potential energy production26,27 or to compare implementation costs23,24. Global fossil fuels and mining sectors maps have been limited to one fuel or mineral type28,29,30,31, do not include spatial siting factors20,32, or rely on proprietary industry data that limits public distribution33. The inconsistency in mapping across different sectors and the lack of publicly available maps at resolutions finer than large-scale aggregate summaries (e.g., countries) severely limits the ability to plan for cumulative development pressures across geographic regions.

Here, we generated spatially-explicit, global land suitability maps at a fine resolution (1-km) for renewable energy (concentrated solar power – CSP, photovoltaic solar power – PV, wind power – Wind, and hydropower – Hydro); fossil fuels (coal mining – Coal, conventional oil – CO, conventional gas – CG, unconventional oil – UO, and unconventional gas – CG); mining (metallic minerals – MM and non-metallic minerals – NMM); and agriculture (crop expansion – Crop and biofuels expansion – Bio) development using publicly available datasets that account for both resource potential and development feasibility34. For each of the 13 sectors, we produced a land suitability index, referred to as a “development potential index” (DPI), that relatively ranks each 1-km area of land for its likelihood to be modified in the future by that sector and then classified each sector consistently on a high-low scale based on its DPI values. The sector-specific DPI datasets are made freely and publicly available to the scientific community and to policy decision-makers to facilitate broad-scale spatial assessments of potential individual sector and cumulative development patterns, and can be used to identify high-risk areas where near-future expansion may conflict with biodiversity, climate, or environmental assets.

We created DPIs using three mains steps of spatial multi-criteria decision analysis (MCDA) techniques in geographic information systems (GIS) (Fig. 1)9. First, we mapped sector-specific land constraints expected to restrict development (e.g., suitable land cover, slope). Second, we produced spatially-explicit, independent criteria that were continuously-scaled factors that enhanced the suitability of sector development35, and captured both resource availability (sector-specific yields) and development feasibility (e.g., distance to major roads, railroads, ports, power plants, electrical grid, and demand centers). We removed areas with constraints from the mapped resource yield and development feasibility criteria and standardized values from 0–1. Third, we weighted the importance of spatial criteria using Analytic Hierarchy Process (AHP)36,37,38, and combined them into DPI maps using Weighted Linear Combination (WLC) in GIS9. We analyzed both output uncertainty and input sensitivity following refs39,40 and validated our DPIs using over 6,000 points and 200,000 km2 of mapped locations identifying recent or planned development sectors. While GIS within MCDA procedures (GIS-MCDA), specifically AHP in combination with WLC, have been widely used to map land suitability at local and regional scales10,41 (e.g., siting of renewable energy facilities42,43,44,45,46, fossil fuel development47,48,49, mineral extraction50, and agriculture development51,52,53), to our knowledge, this is the first study to apply these procedures consistently across multiple energy, extractive, and agricultural sectors on a global scale.

Fig. 1
figure 1

Procedures used to produce all development potential index (DPI) maps. Analysis steps were applied for the 13 sectors related to renewable energy, fossil fuels, mining, and agriculture.

Methods

To inform parameters and criteria selection for each step of the analysis, we conducted a literature review of studies on the mapping of land suitability, yield potential, and/or economic feasibility associated with all development sectors (Online-only Table 1). Because technologies in all development sectors are rapidly changing, we limited our literature search to papers published within the last 10 years focusing mainly on global17,18,19,20,21,22,23,24,25,27,32,54 and regional16,55,56,57,58,59,60 analyses but also using state/local analyses44,45,46,47,48,49,50,51,53,61,62,63,64,65,66 to fully capture the variety and weights of criteria used in all analyses. We relied on the most commonly cited development constraints and criteria that could be mapped from publicly available, open access global data to produce our DPI maps and thus facilitate public distribution of derived datasets.

Analyses were performed at a 1-km resolution for terrestrial lands defined as cells containing one or more 300-m pixels of terrestrial land cover types based on ESA CCI dataset67. We projected input data to the equal-area Mollweide projection and applied bilinear resampling method for continuous raster data and nearest-neighbor method for discrete data (vector data were first projected and then converted to raster datasets). Unless otherwise specified, analyses were conducted using ArcGIS 10.5 (www.esri.com) with the Spatial Analyst 10.5 extension.

Mapping development constraints (step 1)

Constraints were tied to resource thresholds (e.g., solar irradiance, wind speed), land use characteristics (e.g., urban areas), and biophysical characteristics (e.g., slope, elevation) that limit the ability of the sector to economically produce its associated commodity or to be constructed. Addition of these spatial constraints improved each sectors’ DPI by limiting the extent of the analysis to only viable locations16. Given the wide-range of constraints and their values often reported in the literature, we selected the least-restrictive constraint values reported by global or regional studies (see Online-only Table 1) to avoid the exclusion of areas that may become accessible to future development with improved technology. We did not include any administrative constraints (e.g., excluding protected areas) because these assignments can be modified or removed based on policy changes and land use pressure68. Online-only Table 2 provides sector-specific breakdown of the constraints applied with appropriate citations with their justifications, which we summarize briefly below.

For solar and wind renewable energy sectors (CSP, PV, Wind), we excluded areas below sector-specific resource (solar irradiance or wind speed) and above slope or elevation thresholds, and areas categorized as snow, ice, or urban. For CSP, we also excluded areas with already operating CSP power plants, and for wind, we excluded areas with ≥3 wind turbines per km2 (we did not exclude existing PV power plants given lack of global data). For Hydro, we excluded urban areas, and locations with existing hydroelectric dams or estimated to produce <1 MW of power.

We excluded lands classified as urban for all fossil fuel and mining sectors. For coal mining, we also excluded existing coal mines based on our mapping from global sources, and for mineral and non-mineral mining, we excluded lands with former or current active mines. We did not exclude current oil and gas wells due to the lack of publicly available, globally comprehensive data on well locations. Lastly, for agriculture sectors, we removed lands classified as urban or currently cropped, arid lands without irrigation, and areas too steep for cultivation.

Mapping development criteria (step 2)

Mapping resource yield criteria

For each sector, we spatially mapped a resource yield criterion based on available resources converted to yield estimates using common production values (e.g., annual megawatt hours, barrels of oil, tons of coal) for each 1-km2 cell. See Online-only Table 2 for detailed methods and data applied to derive these resource yield maps with brief descriptions below. We limited the resultant global maps to suitable locations based our constraint maps (step 1), and applied the following steps on the yield map values to produce approximately normally distributed values ranging from 0–1 across all sectors: (1) reassigned values of cells that were within the top one percentile outliers to the 99th percentile value of the distribution, (2) applied transformation based on the skewness (s) of the yield value distribution as follows: no transformation if s < 0.5, square-root transformation for 0.5 ≤ s ≤ 1.0, and log-transformation if s > 169,70, and (3) scaled data into a 0–1 range using min-max normalization. This approach addressed the right skewed distribution of most yield cell values and maintained all cells but treated the top 1% of outlying yield values as a constant value given their expected lack of differentiation in development potential.

Renewable energy: We estimated yield for four renewable energy sectors: CSP, PV, Wind, and Hydro (Online-only Table 2). For solar (CSP and PV) and wind, we used the general equation: PD ∙ CFi ∙ 8760 to estimate annual yield (MWh/km2), where PD is the sector-specific power density in MW/km2, CFi is the sector-specific and spatially explicit (for the ith cell) capacity factor derived from the corresponding resource estimate (e.g., wind speed for wind) and defined as the ratio of expected to potential power output, and 8760 is the number of hours in a year26,57,59. For wind, we multiplied this equation by a spatially-explicit air density factor (ADi), because differences in elevation have known effects on wind power production17,27. For hydropower, we used a publicly available, 1-km resolution hydropower potential dataset which derived potential from a global digital elevation model and river runoff data using a fixed CF value of 0.518.

Fossil fuels: We estimated yield for five fossil fuel energy sectors: Coal, CO, CG, UO, and UG (Online-only Table 2). Yields were derived from global and national level assessments of technically recoverable resources per basin (coal), assessment units (CO, CG, UO, and UG), or prospective areas (UO and UG), which are collectively referred to as assessment units (AUs). For each AU, we divided the total recoverable resource by the AU area, thereby producing an average yield/km2 across the AU. Where AUs overlapped, we summed resource-specific (e.g., conventional oil) yield values before producing the final yield map.

Mining: Due to the variety of different minerals mined and the lack of global, basin-level estimates of technically recovered minerals for each, we relied on proxy yield values based on deposit locations for two collective categories of mining: metallic and non-metallic (Online-only Table 2). Mineral deposit data were publicly available for 167 minerals along with categorical size estimates of deposit amounts that were determined from attribute size descriptions (e.g., very large, large, medium, etc.). Given the availability of only categorical estimates and because deposit densities are widely used to estimate undiscovered deposits and potential recoverable amounts71,72, we relied on mining density as a surrogate for yield. We implemented kernel density (KD) methods73 using Kernel Density tool in ArcGIS, and incorporated categorical deposit amounts as weights following ref.74. Kernels were centered on each deposit location and generated based on deposit size weights and on radii distances specific to metallic or non-metallic minerals. We then selected only cells with KD > 0.001 deposits/km2, a minimum value used by ref.75 and https://www.openstreetmap.org/. Because 82% of mineral deposits were located within the U.S., we created and standardized KD maps separately for the U.S. and non-U.S. regions before combining these two regions to produce each sector’s final resource yield map.

Agriculture: We estimated agriculture yield for ten food crops, which captured 83% of total calorie production on croplands76, and for a subset of five first-generation biofuel crops, which comprised the majority of commercial biofuel production77 and have the most growth potential based on market maturity and technological capacity78 (Online-only Table 2). Using 2012 yield data summarized at national or sub-national jurisdictional units, and following methods in ref.79, we modeled the crop-specific relationships between area-weighted yield (ton/km2) and biophysical covariates (e.g., growing degree-day, precipitation, fraction irrigated, slope, etc.) using a 95th percentile quantile regression to predict attainable yields (quantreg package version 5.33 in R version 3.4.0; model coefficient results in Online-only Table 3). We then combined spatially-explicit covariate maps with the resulting model coefficients to produce global, crop-specific, predicted yield maps. For crop expansion, we min-max normalized these yield values for each crop. For biofuel crops, we converted predicted yield in ton/km2 to gallons of gasoline equivalents (GGE) per km2 based on conversion rates from ref.20. We generated final cropland and biofuel resource potential maps by calculating the mean standardized yield value (i.e., 0–1 for crops and GGE/km2 for biofuels) across the ten subsistence crops and five biofuel crops respectively, and then ensured normal distribution of these two final yield maps following the methods discussed previously.

Mapping feasibility criteria

We produced 13 development feasibility criteria at 1-km resolution, which were factors that increase site development potential or decrease operational costs (details in Online-only Table 4). These criteria related to: (1) ability to transport resources and/or construction materials (distance to major roads, railways, and ports); (2) access to resource demand centers (market accessibility, distance to the electrical grid, urban areas, coal-fired power plants, and aggregate demand centers); (3) locations of existing development (distance to producing oil and gas fields and active coal mining density); and (4) other economic costs associated with resource siting (inverse population density), development (landcover feasibility and land supply elasticity), and/or production (access to electricity). Criteria values ranged from 0 to 1, with 1 indicating the most preferred location for a particular sector development associated with the criteria, and 0 implying the criteria no longer provided any advantage for this development. For each sectors’ MCDA, we selected criteria used in previous studies on land suitability or recognized as an economic factor influencing siting (Online-only Table 1), and that could be mapped globally from existing, publicly available data.

We generated distance criteria values (c) based on a Gaussian distance decay function, c = exp(−d2/2∙(h/2)2), where d is Euclidian distance between a focal cell and the closest feature of influence (e.g., transmission line, roads, railway, etc.), and h/2 is the inflection point, or the point beyond which the criteria score starts to rapidly decline towards zero. Resulting standard values of c ranged from 0–1, where 1 indicated closest proximity to the feature of importance. For renewable energy sectors, we set h to 100 km similar to ref.56, and for fossil fuels and mining sectors, we set h to 50 km, or the average distance at which infrastructure costs associated with mineral extraction approximately doubles80.

Combining yield and feasibility criteria to create DPIs (step 3)

We used Analytic Hierarchy Process (AHP)36,81 and Weighted Linear Combination (WLC)9 methods to combine resource yield and feasibility criteria maps into a final DPI map for each sector. AHP calculates criteria weights from a pairwise matrix of importance values (judgement matrix) formulated based on scaled rankings36,37. Pairwise comparison values are assigned based on Saaty’s nine-point importance scale36,37, where a score of 9 indicates criterion A is nine times more important than criterion B, and where scores are reciprocal (therefore criterion B is 1/9 times as important as criterion A). We selected criteria, none of which were highly correlated (Pearson’s correlation > 0.6) and assigned importance values based on our literature review of sector suitability (Online-only Table 1), current literature on resource transportation and development costs80,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113, and authors’ expertise. Criteria weights were calculated as the normalized eigenvector associated with the judgement matrix’s largest eigenvalue (λmax) (see refs36,37 for details). We evaluated the consistency of the judgement matrices using a consistency ratio (CR) calculated as CR = CI/RI, where CI is the consistency index calculated based on number of criteria (n) as CI = (λmaxn)/(n − 1), and RI is an established random inconsistency index based on n (see Table 1.2 in ref.81).

We found all judgment matrices to be within acceptable consistency (i.e., CRs < 0.10). For all judgement matrices, the resource yield criterion was identified as the most important, which produced weight ranging from 0.336 (coal) to 0.552 (metallic mining). Judgement matrices of fossil fuel sectors included feasibility criteria related to current development, which were prioritized as next highest, or equal to, resource yield criteria. For all other sectors, the typical second highest prioritized criteria were related to transporting the resource to its respective demand centers, except for hydropower, for which population avoidance (i.e., inverse population density criterion) ranked as second highest. Supplementary Table S1 provides judgement matrices for each sector including justifications for importance ranking, and derived weights used to calculate DPIs.

Once we derived the resource yield and feasibility criteria maps with their associated weights, we implemented WLC in ArcGIS using the Weighted Overlay tool to create sector-specific DPI maps using: ∑wncn, which produces a composite DPI map based on AHP derived weights (wn) for n criteria (cn)9. Values of all input criteria maps and resulting DPI maps ranged from >0 (low) to 1 (high). To reduce local variations as a result of global data inaccuracies or resolution artifacts, we spatially averaged DPI values using a 3 × 3-km moving window analysis10. If any cell previously excluded based on constraints or water was assigned a value by the smoothing technique, we reassigned its value to null. A final continuous DPI was created by max normalizing the remaining spatially-averaged values34 (e.g., Fig. 1-Step 3iii).

DPI classification

To facilitate the comparison of spatial patterns across the different sectors, we grouped the continuous DPI values into six relativized development potential classes: very high, high, medium high, medium low, low and very low. Given that each sector’s DPI values were approximately normally distributed but varied in their mean values, we calculated the standard global z-score per pixel and then binned each DPI based on five z-score breakpoints that corresponded to the percentiles of the distribution (Table 1). This method classified each DPI equally based on its mean and standard deviation of values using consistent estimated percentage breakpoints of 10% (Very Low, Very High), 15% (Low, High), and 25% (Medium-high, Medium-low). These breakpoints are offered as one way to classify the continuous DPI values and are included in each sector-level DPI data bundle34 (e.g., Supplementary Table S1). We note that these six classes have been applied in previous global threat analyses (e.g., by ref.114 when classifying cumulative threats to global marine environments).

Table 1 Development potential index (DPI) classes.

Uncertainty and sensitivity analyses

For each DPI sector, we quantified the variability associated with each output based on model input (i.e., uncertainty analysis), and then identified which DPI criteria were responsible for the most variability (i.e., sensitivity analysis).

Uncertainty analysis

The two main sources of uncertainty for any MCDA arise from the criteria maps and their weights9,115. For the criteria maps, uncertainty can stem from three aspects in the analysis; (1) choice of criteria in the decision model, (2) errors of measurement in the original source spatial data, and (3) the value scaling (or standardization) of the criterion maps115. To reduce these sources of uncertainty, we respectively (1) relied on supported literature to guide criteria selection and value scaling of categorical data (Online-only Table 1), (2) avoided arbitrary classifications of continuous input data, and (3) applied fuzzy measures when appropriate. We focused our uncertainty analysis on the weighting values, a common approach when addressing uncertainty regarding GIS-MCDA39,116,117,118. To do so, we relied on a Monte Carlo (MC) approach, which assesses both qualitative and quantitative uncertainty within an MCDA based on repeated random sampling from a range of criteria weights to produce several iterations of the model results118. These iterations are then used to calculate standard deviation (SD) and/or coefficient of variation (CV) to map and analyze uncertainty39,116,117,119.

To define our weight ranges for each DPI criterion, we modified the original AHP by first increasing the importance value of a selected criterion by two points across all other criteria and then used this new matrix to calculate the maximum weight for this criterion weight range. We then decreased the importance value for this same selected criterion by two points across all criteria to derive the minimum weight for the range. This process emulated selecting the next highest or lowest odd number comparison value (i.e., 1, 3, 5, 7, 9 values most commonly used when assigning values using the Saaty’s nine-point importance scale36), while also maintaining the necessary consistency ratio (i.e., CR < 0.1) to use these derived weights (see the Supplementary Information for an example). Once the weight ranges for all criteria were defined (Online-only Table 5), we followed standard MC methodology and applied a four step process similar to refs39,120 for each DPI. For 300 iterations, we: (1) randomly selected a criterion, (2) randomly assigned the criterion weight from its bracketed range of generated weights (Online-only Table 5), (3) proportionally modified the remaining criteria weights such that all weights sum to the value of one, and (4) applied these weights to reproduce the modified DPI. Our chosen number of 300 iterations falls within the recommended range of 100–10000121,122 and predominantly produced a <1% change of the mean and SD associated with each iteration as the iterations neared 300 which suggested this number was sufficient117.

Based on mean CV across all DPI maps, the biofuel and crop DPIs exhibited the highest relative uncertainty, largely attributed to these DPIs having the lowest number of criteria (Table 2). In contrast, the wind DPI had the lowest uncertainty, likely driven by this sector having the greatest number of criteria. This inverse relationship between uncertainty and the number of criteria was upheld by all sectors except coal and unconventional oil (Table 2). Coal had higher than expected uncertainty presumably because of the larger undeveloped basins located in remote regions of the globe. Unconventional oil basins on the other hand are found in more accessible locations and consistently much smaller in size in comparison to other fossil fuel sectors; thus, limiting the variability caused by development feasibility criteria and thus also reducing uncertainty.

Table 2 Mean coefficient value (CV) for 13 development sectors with number of criteria used for each development potential index (DPI).

Similar to refs39,120, we classified all 13 CV datasets values into “Very Low” to “Very High” classes using the same DPI classification method based on z-score breakpoints34 (6 classes; Table 1). For each sector, we determined the DPI class(es) that exhibited the greatest uncertainty by calculating the percentages of DPI classes within each uncertainty class (for 36 different combinations; e.g., Fig. 2). We also averaged percentages of the 36 combinations across all 13 DPIs for a comprehensive uncertainty measure (Fig. 3). None of the DPI cells classified as “Very High” or “High” were found to fall within “Very High” or “High” uncertainty classes (Fig. 3). Rather, >99% “Very High” DPI cells and >75% of “High” DPI cells fell within the “Low” or “Very Low” uncertainty classes, respectively. These results indicate less uncertainty exists in high DPI areas. In contrast, the areas with highest uncertainty (i.e., “Very High” and “High” classes) fell within the “Low” and “Very Low” DPI classes for most sectors. These high uncertainty areas tended to occur in remote regions that lacked supporting infrastructure or market access but had higher than average resource potential (i.e., yield criteria in all DPIs), especially for the renewable and fossil fuel sectors. However, this same spatial pattern was not as prevalent for mining and agriculture sectors, where the highest uncertainty values largely occurred in regions that had low yield but were farther from markets or infrastructure. Readers can further explore the spatial distributions of uncertainty across each DPI by downloading data from figshare34.

Fig. 2
figure 2

Example spatial uncertainty analysis for wind development potential index (DPI). Spatial datasets used for wind DPI uncertainty analyses: (a) classified wind uncertainty map, (b) classified wind DPI map, and (c) resulting map produced by intersection of two maps. In the legend for map (c), arrows indicate the direction of classes going from “Very Low” (VL) to “Very High” (VH). For example, purple areas classified as VL for DPI and VH for uncertainty, whereas dark brown areas classified as VH for DPI and VL for uncertainty. Non-classified areas are identified in grey and were excluded based on a lack of available future resources or by constraints applied during the DPI analysis.

Fig. 3
figure 3

Cross tabular average percentages of development potential index (DPI) classes in each corresponding uncertainty class. Data for each DPI and uncertainty class were averaged across all 13 sectors and total percentages are summarized at the bottom (total percentage in DPI class) and right (total percentage in uncertainty class) of the table. Six colors classify percentages from lowest to highest (i.e., light-blue [0%], light-green [0–1%], yellow [1–3%], light-red [3–5%], red [5–7%], and dark-red [>10%]).

Sensitivity analysis

For each sector DPI map, we evaluated the sensitivity of the multi-criteria weights using a one-at-a-time (OAT) method: where we incrementally modified weights within a range of values and then compared the modified and original (DPIOrig) outputs9,41. Following ref.115, we varied weights −20% to +20% of original weight value in increments of ±2% (n = 21 simulation runs), and determined the change in cell counts within five DPI value bins (i.e., >0.0–0.2, >0.2–0.4, >0.4–0.6, >0.6–0.8, >0.8–1.0). These five, equal interval bins were used to evaluate the sensitivity across the spectrum of continuous DPI values to distinguish changes in consistent value ranges across sensitivity runs while also reducing computing resources needed for a per pixel measurement. For each criterion and its bins, we calculated the average percent change in cell counts relative to the counts from DPIOrig (Online-only Table 6). We additionally evaluated the spatial differences in outputs by calculating the cell-based correlations (Pearson’s r) between the modified DPI outputs and DPIOrig. Detailed sensitivity reports that included the percentage change per bin per simulation run for all sector criteria and the corresponding correlations are packaged with individual DPI data bundles34.

Unsurprisingly, the most sensitive criteria were the highest weighted ones and the least sensitive criteria were the lowest weighted ones across all DPIs. The most sensitive criterion was resource yield for all sectors, except for coal, for which mining density (equally weighted as coal resource yield) was the most sensitive. The least sensitive criteria were distance to railways or ports (CSP, PV, CG, UO, UG, NMM), distance to major roads (CO), distance to urban areas (Wind, Hydro), or distance to coal power plants (Coal). Overall, biofuels DPI exhibited the greatest sensitivity due to high shifts in the lowest bin (>0–0.2) (Online-only Table 6; see additional details below).

For most criteria and sectors, the lowest bin (>0.0–0.2) exhibited the greatest sensitivity, likely because it had the smallest overall frequency of cells within the DPIOrig, and thus the greatest percent changes. To focus on high development potential areas that are most influential to predicting land expansion areas, we examined the sensitivity exhibited within the top bins. For the second highest bins (>0.6–0.8), the most sensitive criterion within any sector-MCDA was coal mining density followed by hydropower resource yield. For the top bin (>0.8–1), the resource yield criterion for wind exhibited the greatest shifts. To further examine the sensitivity of these three criteria, we calculated the maximum and average absolute cell value change on a cell by cell basis associated with the sensitivity run having the greatest degree of change for all cell values and for cells with DPIOrig values > 0.5. Even for the sensitivity runs with the greatest weight change (i.e., +20% or −20%), the maximum absolute cell change was less than 0.1 and the average absolute change was less than 0.05 across all cells and for DPIOrig cells that were greater than 0.50 (Table 3, see Supplementary Fig. 1 for mapped example with Coal DPI).

Table 3 Sensitivity analysis results summarized as change in DPI values.

Overall across all sectors, the binned value of most cells remained the same, and there were no cells that either increased or decreased more than one bin level from that of the original run. Furthermore, spatial correlations were r ≥ 0.971 for all sensitivity runs across all sectors, indicating low spatial variance in the DPI outputs due to modified weights. An overall low sensitivity was further reinforced by an only slight change (i.e., ~0.05) detected in cell values for the three most sensitive sector criteria when applying the maximum weight change.

Data Records

For each development sector, three spatial datasets (i.e., the continuous DPI, the classified DPIs, and the classified uncertainty map) are accessible via figshare as GeoTIFF raster datasets at 1-km resolution using the Mollweide projection34. Due to large file sizes and for ease of access, all sector DPI data are bundled together within a correspondingly named zip file (e.g., Wind.zip). Each zip file contains: the continuous DPI raster dataset, the classified DPI raster dataset, the classified uncertainty raster dataset, all parameter descriptions and values (i.e., constraints, criteria correlations and weights, and AHP comparison values and matrix consistency ratio) used to produce the DPI, along with the full DPI sensitivity report. Additionally, four zip files (i.e., DPI_Inputs_and_Scripts_Part01-04.zip) are provided and contain all input data and Python scripts necessary to reproduce any DPI34. All three raster datasets per sector can also be viewed and examined interactively at http://s3.amazonaws.com/DevByDesign-Web/Maps/DPI_viewer/index.html.

Technical Validation

To validate the DPIs, we used spatial point locations of planned or recently developed renewable energy power plants123,124, recent lease and claim boundaries identifying where fossil fuels and mining development is permitted125,126, and recent areas of crop expansion127. We compared mapped DPI classes to recent or planned development locations and determined the percentage of overlap and non-overlap (“none” class; Table 4).

Table 4 Spatial validation of DPI maps.

For solar power plants and wind farms, we used the most comprehensive database on recently developed or planned development locations123; data were only available for North America. We included records with available x/y coordinates (i.e., not city or county) for facilities constructed no earlier than 2016 or that were currently planned (see Table 4 for sample sizes). We assigned a DPI class value for each point based the closest DPI classified cell that fell within a maximum distance representing the square-root of the mean facility-area reported for the sector. For example, CSP plants average 6.64 km2 in size128, so we assigned the DPI class of the nearest cell within 2.58 km (i.e., square-root of 6.64), otherwise the location was classified as not having a class (i.e., None). We used a distance threshold of 1.77 km for PV based on ref.128 and 7.35 km for wind based on ref.129. We found that all but one of the CSP plants fell within the very high DPI class (Table 4). The vast majority of utility-scale PV power plants (92%) and all but 13 of the large PV plants (i.e., >=20 MW, cutoff identified by ref.128) fell within very high, high, or medium-high DPI classes (Table 4). Similarly, 85% of all wind farms fell within our mapped high and very high DPI areas and only twenty-five sites (6%) fell outside of any DPI class. For hydropower, we relied on the a dataset that identified proposed and currently constructed hydropower dams globally124. Because there were no feature-level location error assessments (i.e., how accurately each dam location was mapped), we only used future dam sites which were within 1 km of any DPI category cells. We found that 72% of all dam locations and 76% of large hydropower dams (i.e., >30 MW, cutoff identified by ref.130) fell within medium-high to very high DPI classes.

For coal, we used a U.S. coal mining permit database126 (n = 4,650 permits), that maps boundaries where companies have the right to disturb land for the mining and will be required by law to reclaim the site. Based on intersecting these lease boundaries with mapped DPI classes, we found that 87% of permit areas not mined fell within the highest DPI (only 4% were outside of any DPI cell). For oil and gas and mineral extraction sectors, we used a U.S. lease databases for oil and gas removal and mining claims125. Given the lack of publicly available data for these sectors and the high costs associated with more expansive proprietary data, we were limited to data within 10 western U.S. states: California, Oregon, Nevada, Idaho, Utah, New Mexico, Colorado, Wyoming, Montana, North Dakota, and South Dakota. Because oil and gas lease data did not distinguish the resource (i.e., oil or gas) or the method used (i.e., conventional or unconventional), we combined the DPI classes for CO, CG, UO, and UG and maintained the highest class per cell. We similarly combined metallic and non-metallic mining claims because these were not distinguished in the dataset. We overlapped oil and gas leases and mining claims with their associated combined DPI maps and found that >67% of oil and gas leases were located within very high or high DPI scores (only 5% were outside of any DPI cell), and 71% of mining claims not already mined overlapped with the two highest DPI classes (only 1% fell outside of DPI cells).

For cropland, we relied on spatial maps of annual cropland percentages per 1-km2 within the conterminous U.S. over the past 150 years127 and calculated the percentage of expansion for the most recent year of 2016 (i.e., we subtracted 2015 from 2016 percentages and selected only those cells with positive values). This identified over two million pixels (totaling 83,685.87 km2) with cropland expansion, which we overlapped with Crop DPI maps and calculated the total expansion per DPI class. We found that 73% of cropland expansion occurred in medium-high to very high DPI classes and over 46,000 km2 (56%) occurred in the top two classes. We were unable to find an analogous biofuels expansion dataset but assert that these results offer indirect support given that the (1) the cropland expansion dataset includes all biofuel crops, and (2) our biofuels DPI was created from the yield potential of a subset of crops and the same feasibility criteria (i.e., market accessibility and land supply elasticity) as the cropland DPI.

Usage Notes

The DPI maps generated here provide some of the first globally consistent land suitability maps at a fine resolution (1-km) that depict the potential expansion for 13 major development sectors related to renewable energy, fossil fuels, mining, and agriculture. Our approach offers an advancement to other products by factoring in resource yield potential alongside multiple spatial factors that influence development siting using a spatial MCDA approach. It also advances the global mapping of multiple energy and extractive sectors that increasingly play a role in land use change131, but have been overlooked relative to agriculture or urban expansion132,133. Further, we examined the uncertainty and the sensitivity of each DPI and validated results with the best available known locations of recent or planned development: efforts rarely performed even for site-based or regional land suitability analyses41.

We acknowledge that our DPIs, like all global data, are inherently prone to inaccuracies, omissions, and inconsistencies in both their spatial features and attributes. While we used the best publicly available and current data for our analyses, input datasets were not always comprehensive in regional coverage (an issue that plagues all global analyses); however, with the provided Python code, each DPI can be easily updated as new data becomes accessible. For example, because only proprietary, global pipeline spatial data were available, our oil and gas DPIs (i.e., CO, CG, UO, UG) lacked this important criterion in the analysis and instead we relied on distance to existing oil and gas fields as a proxy that identifies where pipelines exist. Additionally, our DPIs do not consider governmental actions (e.g., environmental regulations, incentives, tax breaks) that often influence development siting, and may not capture land expansion under varying market changes and technological advancements. Given the frequency of policy and market changes, variations across administrative units, and the effort required to maintain such a database, incorporating the above was beyond the scope of this study, but future work, especially if focused on a smaller focal area, should seek to capture these criteria and/or conditions to update DPIs. We also do not account for climate change, which has been shown to redefine future crop yields13, and has the potential to inundate areas of current suitable land and/or supporting infrastructure, relocate population/demand centers of resources, modify precipitation or cloud cover patterns that can alter hydropower and solar resources134,135.

Finally, we recognize there are multiple uncertainties throughout any MCDA process (e.g., setting constraints, calculating spatial criteria values, and selecting criteria weights), and we only evaluated the uncertainty and sensitivity of one primary source (criteria weights). Nevertheless, our DPI maps offer more detailed and consistent global products on the relative (rather than the precise) suitability of lands for future development expansion. Although we produced DPI maps at a 1-km resolution, we do not recommend the use of these data at this resolution for local land-use planning or siting of development. We provide the DPI maps at this resolution (1) given its consistency with the input spatial feasibility metrics; (2) because it allows for the aggregation of data into comparable zones of analysis (e.g., countries, states/provinces, ecoregions, watersheds, etc.) that circumvent the modifiable areal unit problem often introduced with coarse resolutions136; and (3) because it maximizes the potential discernment of spatial heterogeneity in global development patterns137. While our validation results produced favorable support of our products, we emphasize that more localized or detailed spatial MCDA analysis should be performed using much finer resolution and higher accuracy data to fully resolve land suitability at 1-km pixel level. In addition, feedback should be solicited from local decision makers, industry representatives, and sector specialists to select regionally-tailored criteria factors and assigning their influence (weights) on development in the analysis.

Despite these limitations, the timeliness and substantial need for these types of data are demonstrated by an increasing number of online portals hosting spatial data on human development pressure along with environmental features: e.g., the World Resource Institute’s (WRI) Resource Watch (http://resourcewatch.org), the World Wildlife Foundation’s (WWF) Sight (http://wwf-sight.org/explore), the European Commission Joint Research Center’s Digital Observatory for Protected Areas (DOPA) (http://dopa-explorer.jrc.ec.europa.eu/), the Global Forest Watch (http://www.globalforestwatch.org), the United Nation’s MapX (http://www.mapx.org/), the UN Biodiversity Lab (https://www.unbiodiversitylab.org/about.html), and the World Bank’s Spatial Agent (https://olc.worldbank.org/content/spatial-agent-tutorial), among others. We note that the development pressure datasets hosted on these online portals predominately focus on current development patterns, thus, they are limited to retrospective or current planning efforts. Of the select datasets that capture potential future expansion areas, they tend to map only areas of unexploited resources (e.g., resource yield proxies) without integrating spatial feasibility factors. Figure 4 displays some of the representative resource datasets from the above sources in comparison with the most analogous DPIs from this study. Previous existing maps identify locations of resources without resource value attribution (Fig. 4e) or captures resource yield potential but without spatial details on siting constraints (i.e., Fig. 4a vs Fig. 4b) and/or siting feasibility (i.e., Fig. 4c vs Fig. 4d). In addition, these maps often only capture one segment of a sector thereby neglecting the overall sector development pressures (i.e., Fig. 4g vs. Fig. 4h). While current development maps delineate regions susceptible to single-sector expansion over the long-term and can be used in basic binary overlay assessments32, they cannot be used in a gradient spatial analysis that differentiates among areas likely to undergo varying levels of development growth by multiple sectors in the near term.

Fig. 4
figure 4

Comparison of DPIs with publicly available resource data. Data on resources were obtained from WRI Resource Watch and partners (left side panel) with the most analogous DPI maps produced by this study (right side panel). Color ramp for all maps are the same, with highest values in dark orange and lowest values in blue and null value in grey. Legend in first DPI map (b) can be applied to all other DPI maps (d,f,h). Map of only potential resource locations are displayed in a uniform orange color, i.e., large mineral deposit locations (e). Legend abbreviations for resource maps (a,c,g) are as follows: watts per square meter (W/m2), billion barrels of oil equivalent (BBOE), and tons per hectare (t/ha).

In addition to elucidating potential individual sector expansion patterns, these DPIs can be combined to produce a cumulative development pressure metric at regional or global scales. While each DPI is sector specific, the relative index value is a measurement of development suitability based on multiple criteria on resource yield potential and development feasibility. Combing multiple DPIs provides a method for illuminating those lands that are suitable for multiple development sectors and thus have more pressures for being used. Although all DPIs have been scaled from 0–1, the distribution of these values may vary across sectors. Thus, to ensure equitability across all DPI values in a cumulative map, we recommend standardizing or classifying each DPI relative to an area of interest (e.g., global, regional, country) prior to this process.

Similar to a cumulative map of marine threats produced by ref.138, an additive approach is a simple and an effective way to create a cumulative DPI map. To provide an additive cumulative DPI map in line with the standardization guidelines we recommend above, one can apply globally, standardized z-score values for all continuous DPIs, which are equally-weighted and summed together (Fig. 5a). Such a map allows for countries, states, biomes, and/or ecoregions across the globe to be compared based on cumulative scores (i.e., prior to classification), thereby, identifying areas of varying levels of future development pressure. If the focus is at a more regional or country scale, each continuous DPI can be standardized to that spatial extent before summation. This approach helps to discern high development pressures that are less apparent when DPI values are globally standardized (Fig. 5b,c).

Fig. 5
figure 5

Global and regional-level cumulative development maps produced from standardized DPIs. Maps that display (a) global cumulative development potential map based on summing standardized global DPIs, and two regional-level cumulative development potential based on standardizing DPIs at the scale of the (b) United States (US) and (c) Democratic Republic of Congo (DRC). All maps use previously described z-score binning with legend in map (a) also applicable to maps (b,c).

The DPI datasets provide relative measures of development suitability across the most comprehensive set of sectors, thus, serve as important tools that can help anticipate future development patterns at broad spatial scales from multiple sectors. The DPI maps can be combined with existing maps on current land use and land cover (e.g., global Human Modification map139, ESA CCI dataset67) to help assess opportunity costs and the potential for additional land conversion in a given region. In addition, estimates of production and consumption demand, which influence the likelihood of sector expansion, can also be considered for when quantifying how much potential land may be converted by multiple sectors within a given region (as done by ref.140). Together these data with the DPI maps can be used to proactively prioritize regions at a global scale to better plan for near-term tradeoffs among economic development, population growth, and the environment.