Built structures influence patterns of energy demand and CO2 emissions across countries

Haberl, Helmut; Löw, Markus; Perez-Laborda, Alejandro; Matej, Sarah; Plank, Barbara; Wiedenhofer, Dominik; Creutzig, Felix; Erb, Karl-Heinz; Duro, Juan Antonio

doi:10.1038/s41467-023-39728-3

Download PDF

Article
Open access
Published: 03 July 2023

Built structures influence patterns of energy demand and CO₂ emissions across countries

Nature Communications volume 14, Article number: 3898 (2023) Cite this article

6665 Accesses
6 Citations
92 Altmetric
Metrics details

Subjects

Abstract

Built structures, i.e. the patterns of settlements and transport infrastructures, are known to influence per-capita energy demand and CO₂ emissions at the urban level. At the national level, the role of built structures is seldom considered due to poor data availability. Instead, other potential determinants of energy demand and CO₂ emissions, primarily GDP, are more frequently assessed. We present a set of national-level indicators to characterize patterns of built structures. We quantify these indicators for 113 countries and statistically analyze the results along with final energy use and territorial CO₂ emissions, as well as factors commonly included in national-level analyses of determinants of energy use and emissions. We find that these indicators are about equally important for predicting energy demand and CO₂ emissions as GDP and other conventional factors. The area of built-up land per capita is the most important predictor, second only to the effect of GDP.

Decoupling density from tallness in analysing the life cycle greenhouse gas emissions of cities

Article Open access 05 July 2021

Unveiling patterns in human dominated landscapes through mapping the mass of US built structures

Article Open access 04 December 2023

Impacts of climate change, population growth, and power sector decarbonization on urban building energy use

Article Open access 18 October 2023

Introduction

Increasing global temperature driven by growing greenhouse gas (GHG) emissions is a major global concern¹. GHGs mainly result from the energy-related combustion of fossil fuels². The question emerges, which factors drive energy demand and emissions, and to what extent they are malleable. This question is relevant at many levels, from products to individuals, households, and cities up to continental and even global totals. We here focus on countries, because many decision-making processes occur at the national level. A widespread approach for cross-country analyses classifies the factors influencing resource use and emissions into population, affluence, and technology^3,4,5. These factors are captured in the STIRPAT (Stochastic Impacts by Regression on Population, Affluence and Technology) framework³, derived from the classical IPAT (impact = population × affluence × technology) approach^5,6.

Economic activity (called affluence in IPAT-style analyses, usually measured as Gross Domestic Product or GDP) is acknowledged as a major determinant of energy use and greenhouse gas (GHG) emissions^7,8,9. The discussion mainly focuses on whether GDP can be decoupled from emissions, i.e., whether energy use and emissions can be reduced while GDP is growing. This may be possible e.g., through more efficient technologies, but the debate is so far inconclusive^10,11. While recent studies revealed examples of growing national economies where policies implemented in the last decade achieved reductions in energy demand and CO₂ emissions^12,13,14, neither those studies nor a meta-analysis¹⁵ yielded evidence for reductions in energy use and GHG emissions consistent with ambitious climate targets.

Other potential determinants of energy demand and emissions receive less attention than GDP, even though additional entry points to accelerate decarbonization are urgently required. Population density has been studied with varied outcomes. An econometric study of OECD countries 1980–2011 suggested an inverse relation between population density and CO₂ emissions¹⁶; a regression analysis of materials used in >100 countries in 2000 found a similar effect¹⁷. A global regression analysis¹⁸ found no effect of population density on energy demand, whereas a panel analysis of 11 Asian countries from 1960–2004¹⁹ to conditions that influence energy demand for heating or cooling of buildings²⁰, fuel prices affecting demand for transport energy²¹, and the urbanization rate (urban population as percent of the total population). A suitable proxy for the climate dependency of heating energy demand are heating-degree days (abbreviated as HDD)²⁰. The price of gasoline (abbreviated as PGAS) is an energy price indicator that is strongly related to settlement patterns²¹. Some studies used urban population as a percent of the total population (abbreviated as UPOP) as a development indicator²².

At the urban scale, the influence of population density and the spatial layout of urban areas on cities’ resource demand has been widely studied^{20,23,24,25,26,27,28}. There are several reasons why the extent and spatial layout (density and form) of built structures—henceforth denoted as material stock patterns²⁹, i.e., the spatial patterns of societies’ material stocks in infrastructures and buildings—could affect energy demand and CO₂ emissions. The accumulation of material stocks requires massive amounts of resources such as steel-reinforced concrete, mortar, bricks, timber, plastics, glass, gravel, or sand^30,31,32,33, which are associated with high GHG emissions³⁴. Heating, cooling, and lighting of buildings and production processes in industrial plants require much energy^20,23,26,35, as does the mobility of goods and people on roads and railways^33,36.

Despite these insights from urban studies, material stock patterns are seldom considered in debates on national-level analyses of factors determining levels of energy demand and emissions, as well as their possible decoupling from GDP^11,15. A systematic review of the empirical literature on these questions^15,37 revealed only one study³⁸ considering material stock patterns in analyzing transport-related emissions. Hence little is known about the effects of material stock patterns on energy demand and CO₂ emissions beyond the city level. This results from a scale mismatch: maps of material stock patterns provide fine-grained spatial detail³⁰, often focused on specific regions, that cannot be included in national-level analyses of factors driving energy demand and emissions. For cross-country analyses, consistent indicators need to be developed from spatial data and then aggregated at the national level in a manner which preserves key information on patterns and supports comparative analyses across countries and world regions.

In this work, we develop national-level indicators of characteristics of built structures that, based on urban studies, can be potential determinants of resource use and emissions. We quantify them for 113 countries comprising 91.2% of the world population and 97.3% of global GDP. We analyze two independent variables: (1) yearly total per-capita final energy consumption (abbreviated as TFC) and (2) yearly per-capita CO₂ emissions (abbreviated as CO₂). We test material stock pattern indicators against other variables that have been widely used in national-level analyses of determinants of energy use and emissions, here denoted as conventional factors. As conventional factors, we use GDP/cap/yr (abbreviated as GDP), population density (DENS), UPOP as a development indicator, HDD as a proxy of climate dependency of energy demand, and PGAS as an energy price indicator. Extensive variables are expressed as per-capita values to facilitate country comparisons and remove countries’ population numbers from the analysis.

Results

National-level indicators of material stock patterns

We test three hypotheses based on aggregated indicators of material stock patterns (Fig. 1). Material stock patterns are represented by three types of indicators: (1) The area of built-up land is represented by two indicators, one as a fraction of a nation’s inhabited land, the other per inhabitant. Other indicators describe patterns of built-up areas, including their spatial clustering, form, and distribution, which reflect geomorphological factors as well as historical contingencies. (2) Road indicators that describe the density (length per unit area) of roads in urban and rural regions and the relations between urban and rural road lengths and densities. (3) Railway indicators are defined in the same manner as those for roads (Table 1).

Table 1 Indicators of the extent and patterns of built-up land and transport infrastructures

Full size table

Bivariate and semi-partial correlations

In terms of their Pearson coefficients in bivariate correlations, several indicators of material stock patterns are as strongly correlated with TFC and CO₂ as the conventional factors (Fig. 2a). GDP is positively correlated with both TFC and CO₂. HDD and the fraction of the urban population also show the expected pattern, while PGAS and DENS are largely uncorrelated. Almost all material stock pattern indicators are correlated with both TFC and CO₂. The extent of built-up land (BL_cap and BL_fract) is positively correlated with both TFC and CO₂, as are total and rural road density and the dispersion of built-up land and most railway-related indicators. The correlation coefficients of BL_cap with TFC and CO₂ are both ~0.7; BL_cap is the second-best predictor of both TFC and CO₂ after GDP. As expected from the urban literature, urban population density (UP_dens) is inversely correlated with CO₂ and energy, whereas the share of urban population (UPOP) is strongly positively correlated with CO₂ and TFC. Inverse relations prevail for BL_mono, BL_comp, and the urban-to-rural relations of infrastructure density.

**Fig. 2: Correlation analyses of total final energy demand per capita (TFC) and per-capita CO₂ emissions (CO₂) with conventional factors and material stock pattern indicators.**

Semi-partial correlations of the material stock pattern indicators controlled for GDP and population density (DENS) are shown in Fig. 2b. The part of each material stock pattern indicator correlated with GDP and DENS is removed, revealing the strength of the linear correlation between TFC or CO₂ and the remaining part of the respective variable. The distance from the vertical axis either to the right (positive correlation) or to the left (inverse correlation) depicts the additional explanatory power of the respective indicator over a model considering only GDP and DENS. Several indicators provide additional explanations over GDP and DENS alone. The area of built-up land (both BL_cap and BL_fract) is positively correlated with TFC and CO₂, as are the density of rail and road infrastructures, especially in rural areas. UPOP is inversely correlated with TFC and CO₂, as is urban road density (not significant) and BL_mono. PGAS, which had not been significant in the bivariate correlations, emerges as an important factor, which is also observed for RD_urban and other indicators. UP_dens lose importance, most likely due to its high correlation with GDP.

Multivariate lasso analysis

The capability of the material stock pattern indicators to add insights beyond conventional factors is further analyzed in Table 2. We use the least absolute shrinkage and selection operator (lasso) approach to select variables for multivariate statistical models capable of predicting cross-country patterns of TFC and CO₂. Lasso is a widely-used procedure for automatically performing variable selection in linear regression models^39,40. It overcomes the drawbacks of overfitting and multicollinearity associated with ordinary least square (OLS) methods⁴¹. The lasso method penalizes complexity to derive the best parsimonious model for any predefined value of λ (the factor determining how strongly model complexity is penalized; for detail, see Methods section), thereby allowing to identify the factors that most strongly affect TFC and CO₂. The standard procedure to select λ is cross-validation, where randomly chosen samples of countries are used to develop a model used for the prediction of patterns in out-of-sample countries. The procedure gradually reduces λ, which generally results in more indicators being selected⁴¹. Alternative Lasso schemes and other variable selection techniques, such as stepwise regression, yield very similar results (SI).

Table 2 Multivariate analysis using cross-validation lasso

Full size table

The leftmost three columns of Table 2 show the knots in the lasso paths, i.e., the λ values at which indicators are added (or removed due to collinearity). Knots are arranged in decreasing order of λ, with indicators being ranked in order of selection. In the first column, the λ value marked with an asterisk (*) indicates the optimal model identified by the cross-validation. GDP is always selected as the first indicator, but various material stock pattern indicators appear very early on (i.e., at high λ values), and remain active in the optimal model. BL_cap is the second-chosen indicator for both CO₂ and TFC, and several other material stock pattern indicators are selected much earlier than widely-used conventional factors such as population density (DENS). Heating-degree days (HDD) and the price of gas (PGAS) are important for both CO₂ and TFC, whereas UPOP is important for predicting CO₂ but not selected by lasso for TFC.

The rightmost three columns of Table 2 report the estimated coefficients of models selected by cross-validation as well as measures of in-sample and out-of-sample fit (r²). Models A comprises only variables selected by lasso (path shown in the left part of the table) among all indicators. For the benchmark Models B, only conventional factors are selected. Including material stock pattern indicators in Model A yields better prediction than Models B for both CO₂ and TFC in terms of in-sample and out-of-sample goodness of fit, and improves the Bayesian Information Criterion (BIC) of model selection. Other criteria to develop optimal models, and alternative variable selection procedures yielded similar results (section 5 in the SI). Even models that include only material stock pattern indicators and exclude all conventional factors (including GDP) achieve good predictions (out-of-sample r² of 0.65 for TFC and 0.62 for CO₂; see results for Model C shown in section 5.2 of the SI).

Discussion

The analysis shows that extent and spatial patterns of built structures, here denoted as material stock patterns, play an important role in co-determining and predicting the level of resource use, here TFC and CO₂, in a cross-country analysis. This implies that insights from urban studies^{23,24,25,26,42} generally hold at the national scale. Despite the unavoidable loss of information resulting from the aggregation of maps to the national scale, the indicators in Table 1 maintain key information representing important characteristics of material stock patterns that strongly affect cross-country patterns of TFC and CO₂. The indicators have substantial additional explanatory and predictive power over conventional factors. They can help develop much stronger models of national-level TFC and CO₂ than usual IPAT-type approaches, and will enable researchers to broaden their analysis and scenario modeling capabilities by including material stock patterns as crucial factors for analyzing the possible decoupling of energy use and emissions from GDP.

Population density plays a smaller role than widely assumed, while many material stock patterns strongly influence the cross-country differences in energy demand and CO₂ emissions. The material stock pattern indicator with the most consistent predictive power across all analyses is the area of built-up land per capita (BL_cap), which emerges as the second-most important variable (after GDP) in most of our statistical analyses, even in analyses considering the GDP effect. This is plausible because infrastructures and buildings require energy for being built and used, which results in CO₂ emissions in current fossil fuel-dominated energy systems^34,43. Higher BL_cap also means larger floor size and longer distances between destinations, which all raises energy demand in buildings and transport. These findings corroborate and expand on previous analyses (that used entirely different models, did not capture spatial patterns, and mostly referred to temporal trends), suggesting that challenges for climate-change mitigation strongly depend on the past and future accumulation of material stocks in buildings and infrastructures^43,44,45,46. This is worrying because material stocks are growing globally largely in unison with GDP^32,46.

Our indicators and results create options for analyzing which of the characteristics of material stock patterns are most important in determining and predicting energy demand and emissions in cross-country analyses. Analyses presented in Fig. 2, Table 2 as well as the SI clearly show that many specific aspects of material stock patterns play a role in determining cross-country differences in energy demand and emissions. The multivariate analysis also shows that these patterns interact in many ways that are difficult to disentangle due to the collinearities of the material stock pattern indicators. Future research could employ refined study designs addressing how changes over time and space affect these relationships and elucidate the different causal pathways involved.

Our results have implications for countries pursuing ambitious climate targets^2,43. They demonstrate that insights from urban-level studies translate to the national scale, which could so far not be investigated at that level due to lacking data. This suggests that the indicators presented above offer opportunities to investigate the importance of built structures also for other aspects of national economies’ resource use than those analyzed here. Empirical urban studies have consistently shown that urban form and infrastructure affect travel demand and, therefore, GHG emissions⁴⁷. The relationship between built structures and GHG emissions has also been predicted by theoretical analyses in urban economics^21,48. This suggests that our national-level observations—e.g., that large, sprawling material stocks in built structures represent an important determinant of a nation’s per-capita level of energy demand and emissions—are very likely underpinned by similar causal relationships. Of course, more research, in particular on how these relationships change over time, would be desirable. Because high-resolution maps of decadal trends in built structures are gradually becoming available^49,50, such analyses may soon be feasible. We conclude that the area and patterns of built-up land emerge as an important entry point for efforts at reducing energy demand and CO₂ emissions at national levels⁵¹, suggesting that limiting built-up area per capita could be a worthwhile policy goal, not only from a land-use perspective, but also to limit future GHG emissions.

Methods

The material stock pattern indicators were derived from crowd-sourced data on infrastructures (roads and railways)⁵² as well as high-resolution land-cover data⁵³. The indicators refer to the years 2015 (built-up land) and 2020 (roads and railroads). To reduce random fluctuations for the energy and emission data as well as the conventional factors, we calculated averages for as many years in the period 2015-2020 as were available in the statistical sources. Extensive variables were represented by per-capita values to facilitate comparisons between different-sized countries, following the convention to regard population as a scaling factor with an elasticity of one in STIRPAT analyses⁵.

Data used to derive spatial indicators

The spatial indicators presented in this study rely on three spatially explicit datasets; (1) built-up features and urban agglomerations, (2) main infrastructure features (road and railway), and (3) an inhabited land area layer used for reference. All variables that express quantities as fractions of a country’s area, relate to inhabited land. We used the LC100 grid of the Copernicus Global Land Cover Service⁵³ to derive relevant land cover information. The choice of input data and data quality checks are discussed in the SI; see also Supplementary Fig. 1 (SI). In contrast to other built-up land datasets^54,55,56, the LC100 not only provides globally consistent information on built-up areas, it also includes all other complementary land cover types at a global scale. For data preparation, we vectorized the LC100 grid to allow further spatial intersection procedures (e.g., to clip to national borders). The NOAA-DEM grid was used as a global digital elevation model. We utilized the Geofabrik-download hub⁵⁷ to obtain the entire global OSM data. The GISCO archive from EUROSTAT⁵⁸ provides the applied country borders. Details on the data sources are given in Supplementary Table 1 (SI); datasets⁵⁹ and software code⁶⁰ are available online.

Sources of energy and emission data as well as conventional factors

Gross domestic product (GDP) in constant 2015 US$ was sourced from UN Statistics Division National Accounts⁶¹. Population data used to calculate population density (DENS) and urban population rates (UPOP) were taken from official census data 2017 of the World Bank^62,63. The World Bank database was also used to source data on pump prices for gasoline (PGAS)⁶⁴. Heating-degree days (HDD) were calculated as a population-weighted average of °C days above 18 °C and sourced from the International Energy Agency (IEA) Weather for Energy Tracker⁶⁵. Territorial CO₂ emissions from fossil fuel combustion and cement production (CO₂) were sourced from the Global Carbon Project database^66,67 and total final energy consumption (TFC) from the IEA energy balances, which could only be accessed for the latest year 2017⁶⁸. The record date for the download of these indicators was 25 March 2021. We always used the latest available year for our cross-sectional analysis, but also conducted robustness checks using arithmetic averages over the latest five years for all indicators to exclude potential bias from outstanding annual values. Details on the data are available in Supplementary Table 2 in the SI.

Deriving the built-up land layer

The built-up land vector (BL) for every country is one primary data product derived by the vectorized LC100 grid. This standard BL layer comprised all national built-up features and was used to derive the BL-related indicators shown in part (1) of Table 1. To map each country’s urban agglomerations, and thereby distinguish urban from rural land respectively infrastructures (see Supplementary Figs. 2–4, SI), we clustered features of the BL vector layer using an empirical growing-neighborhood approach: We started with the country’s largest BL feature and created convex hulls, which were buffered with the fifth of the area-equal radius. We then identified intersecting BL features within this buffered hull and created a new hull and buffer. These two steps were repeated as long as the BL area of all intersecting features reached 33.3% of the area of the buffered hull (used to check the intersections), and the BL area increased at least 0.5%, compared to the BL area of the previous iteration. If these criteria were not given, the growing procedure terminated and the collected BL features dissolved to one BL agglomeration. The algorithm subsequently went on with the next-biggest BL feature, which had not been assigned to an already-created agglomeration, again starting the growing-neighborhood procedure. The whole process terminated when the next BL feature (to restart the growing-neighborhood procedure again) represented less than 0.1% of the total BL area of the respective country. The maps in Supplementary Fig. 5 (SI) show the built-up (BL) land indicators. Fractions of land areas refer to inhabited land.

The infrastructure layers

The preprocessed OSM database comprises globally consistent road (R) and railway vector data (RW), but the availability of OSM data varies strongly between countries. While the OSM data in countries of the global North (industrial or even postindustrial countries) also include minor road and train track categories (e.g., cycleways, steps, or private gauges), OSM data in countries of the global South only comprise the main road and railway network. To reduce this inconsistency, we excluded minor OSM classes in order to derive a more comparable global database (Supplementary Table 3). The maps showing the road network indicators are in Supplementary Fig. 6 (SI). Fractions of the area refer to an inhabited land. The railway indicators were derived from OSM in the same manner as those for roads and are shown in Supplementary Fig. 7 (SI). The railway types considered are defined in Supplementary Table 4. Fractions refer to an inhabited land. The planar extent of road and railway networks was calculated using width data reported in Supplementary Table 5. The distinction between urban and rural infrastructures was based on a spatial intersection of OSM road and railway data with the BL features, which resulted in layers of urban and rural networks (example shown in Supplementary Fig. 4, SI).

Reference layer for inhabited land

The definition of some material stock pattern indicators requires a reference area (A_REF). The area of the total national territory (A_NT) may not be suitable, given that in some countries (almost), the entire area is inhabitable, whereas other countries contain large tracts of land unsuitable for human habitation and hence largely uninhabited. We, therefore, developed the proxy layer inhabited land (IH) as a more suitable area reference (A_IH). In contrast to existing similar datasets⁶⁹, this IH mainly uses land cover information of the LC100 grid. This guarantees thematic consistency in spatial intersections with BL data that were derived using the same dataset. The IH includes not only area that is covered with settlements or infrastructures, but also cropland areas and areas with ambiguous land cover that fall within a zone of influence around existing built-up areas, which is approximated by a buffer that depends on the area of a built-up land feature (Eq. 1). The IH is based on the current settlement and cropland extent, as well as the spatial distribution, density, and elevation occurrence of built-up areas according to the LC100 grid.

To calculate IH, we first cut out high-altitude regions from the country’s total territory. We calculated the elevational distribution of BL features and excluded all areas above the area-weighted 99th percentile of BL-elevations. In a second step, we excluded the LC100 land cover types bare/sparse vegetation (deserts and rocks), moss & lichen, snow & ice, and permanent water bodies. Thirdly, we spatially intersected the map resulting from the previous two steps with a synthetic layer that represents the gapless inhabited land area. To derive this layer, we applied an area-dependent buffer for all BL features. See Eq. (1) for the dynamic BL buffer width (w_BL); A_{BL feature} denotes the area of an exemplary built-up land feature.

$${{w}_{{BL}}={A}_{{BL}\;{feature}}}^{\frac{2}{3}}$$

(1)

$$\max \{{w}_{{BL}}:100{km}\}$$

Finally, we added cropland areas (from LC100) and the original BL areas to re-include those BL areas excluded by the elevation threshold. Please note that for spatial indicators that depend on ${A}_{{REF}}$, the specific spatial pattern (shape) of IH is not relevant: we just use the national total area of IH instead of the area of the national territory as reference value. The potential usefulness of this IH layer for other research questions, in particular where its spatial accuracy is of high importance, needs to be tested and is not in focus in this study.

Definitions of spatial pattern indicators

The spatial pattern indicators were derived from the built-up, road, and railway layers. Details on the definitions of the indicators (Table 1) are given in the SI in section 3, Supplementary Tables 6–8.

Pearson correlations

The Pearson correlation is a measure of linear association between two variables. The coefficient can be obtained from bivariate data $\left\{\left({X}_{1},{Y}_{1}\right), . . . ,\left({X}_{n},{Y}_{n}\right)\right\}$ as ${r}_{{XY}}={S}_{{XY}}/{S}_{X}{S}_{Y}$, where ${S}_{{XY}}$ and ${S}_{i}$ denote the sample covariance and standard deviation. The correlation coefficient is between −1 and 1. Correlations equal to 1 (or −1) indicate a perfect linear association, with data points lying exactly on a positive (negative) line. A value equal to zero indicates the absence of any linear association. The squared correlation coefficient ${r}_{{xy}}^{2}$ is the coefficient of determination (R-squared) of the linear regression of variable x on variable y; it measures the fraction of the variance explained by the regression line.

Semi-partial correlations

Suppose that $Y$ is determined by ${{{\bf{X}}}}=\left\{{X}_{1},....,{X}_{k}\right\}$. Then, the semi-partial correlation between $Y$ and ${X}_{i}$, controlled for the other predictors ${{{{\bf{X}}}}}_{-i}$, attempts to measure the correlation between $Y$ and ${X}_{i}$ that would be observed if the effect of ${{{{\bf{X}}}}}_{-i}$ would be removed from ${X}_{i}$ but not from$Y$. This means that the semi-partial regression measures the correlation between $Y$ and the part of ${X}_{i}$ that is orthogonal to the other variables ${{{{\bf{X}}}}}_{-i}$. It is calculated by constructing a new variable ${X}_{i}^{{\prime} }$ that is orthogonal (i.e., entirely uncorrelated) to all previously considered variables (i.e., those controlled for).

The squared semi-partial correlation coefficient measures the fraction of the variance of the dependent variable $Y$ that is uniquely explained by ${X}_{i}$. Thus, it can also be interpreted as the increase (decrease) in the model R-squared value that results from including (removing) ${X}_{i}$ from the set of predictors ${{{\bf{X}}}}$.

The semi-partial correlation is calculated by first fitting a linear regression of $Y$ on ${{{\bf{X}}}}$ and computing the coefficient as:

$${r}_{Y\left({X}_{i} | {{{{\bf{X}}}}}_{-i}\right)}={{{\rm{sign}}}}\left(t\right)\sqrt{\frac{{t}^{2}\left(1-{R}^{2}\right)}{n-k}}$$

(2)

In (2) $t$ is the t-statistic of variable ${X}_{i}$ in the previous regression, ${R}^{2}$ is the R-squared, $k$ is the number of independent variables plus the constant, and $n$ is the sample size. Finally, the significance level is given by $2/{{{\rm{Pr }}}}\left({t}_{n-k} \, > \, \left|t\right|\right)$, where ${{{\rm{Pr }}}}$ is the probability, $t$ is as described above and ${t}_{n-k}$ follows a Student’s $t$ distribution with $n-k$ degrees of freedom. Further details on the correlation techniques used are available e.g. here⁷⁰.

Lasso analysis

The least absolute shrinkage and selection operator (lasso) approach is used to select variables for multivariate statistical models for predicting cross-country patterns of TFC and CO₂^41,71. Lasso is a standard regularization technique for model selection and prediction that overcomes the disadvantages of other regularization techniques such as Ridge regression⁷². It can select a parsimonious set of variables from many potential covariates, even if covariates are collinear. Lasso minimizes the sum of squared residuals, but unlike standard least square fit approaches, it penalizes complexity in the objective function to derive the best parsimonious model for any predefined value of $\lambda $, i.e., the factor penalizing model complexity. If $\lambda $ is set to zero, lasso delivers standard least squares estimates, which corresponds to a model with the maximum complexity. In general, the larger the $\lambda $, the smaller the number of non-zero coefficients.

Consider a linear specification $Y={B}_{0}+{B}_{1}{X}_{1}+\ldots+{B}_{P}{X}_{P}+\epsilon $, where variables have been previously standardized to account for differences in scales. Lasso finds estimates for model coefficient B keeping the model sparse by minimizing the following term:

$$\frac{1}{2N}{\left(Y-{{{\bf{XB}}}}\right)}^{{\prime} }\left(Y-{{{\bf{XB}}}}\right)+\lambda {\sum }_{j=1}^{P}\left|{B}_{j}\right|$$

(3)

The first part of the term (3) is the in-sample squared error minimized by a classical least-squares approach. Lasso also includes the absolute sum of coefficients in the objective function, which penalizes complexity driving some of the estimated coefficients to zero.

The value of λ is typically chosen so that the estimated model satisfies a predetermined condition. Several criteria can be employed, the most common of which is cross-validation. Cross-validation selects the $\lambda $ to minimize the out-of-sample prediction error. First, sample observations are split into K random folds (validation sets). For each validation set, the model is fitted using data from the other folds, and the out-of-sample deviance for the observations in the validation set is computed (i.e., using data not employed for estimation). Finally, the overall out-of-sample performance of the model in all the validation sets is assessed by the mean-square prediction error (MSPE), a statistical parameter in squared (log) units required for model selection. Cross-validation selects the $\lambda $ over a grid of possible values such that the corresponding model has the minimum MSPE⁷¹. In Table 2, MSPE is transformed into r², i.e., the goodness of fit within the sample of countries, and oSr², i.e., the cross-validated out-of-sample goodness of fit for the optimal models. If no variable is added or removed at λ*, this is reported in the left columns of Table 2 as Unchanged. Note that beyond ${\lambda }^{*}$, more variables could be added but would not further improve the out-of-sample prediction.

Data availability

Datasets on spatial data on patterns of global infrastructure and settlements, the inhabited land layer, as well as the indicator values of dependent and independent variables used in the statistical analyses is freely available here: https://doi.org/10.5281/zenodo.5876941. An interim result that was too large to be uploaded as zenodo archive is available from the authors for non-commercial research purposes upon reasonable request (for detail, see ref. ⁵⁹).

Code availability

The code used for calculations of maps is freely available here: https://doi.org/10.5281/zenodo.5883652.

References

IPCC. Climate Change 2021, The Physical Science Basis. (Working Group I contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, 2021).
IPCC. Global Warming of 1.5 °C. An IPCC special report on the impacts of global warming of 1.5 °C above pre-industrial levels and related global greenhouse gas emission pathways, in the context of strengthening the global response to the threat of climate change, sustainable development, and efforts to eradicate poverty. (WMO, UNEP, 2018).
Dietz, T., Shwom, R. L. & Whitley, C. T. Climate change and society. Annu. Rev. Sociol. 46, 135–158 (2020).
Article Google Scholar
Jorgenson, A. K. et al. Social science perspectives on drivers of and responses to global climate change. WIREs Clim. Change 10, e554 (2019).
Article Google Scholar
Liddle, B. Impact of population, age structure, and urbanization on carbon emissions/energy consumption: evidence from macro-level, cross-country analyses. Popul. Environ. 35, 286–304 (2014).
Article Google Scholar
Ehrlich, P. R. & Holdren, J. P. Impact of population growth. Science 171, 1212–1217 (1971).
Article ADS CAS PubMed Google Scholar
Ayres, R. U. & Warr, B. The Economic Growth Engine: How Energy And Work Drive Material Prosperity (Edward Elgar, 2009).
Hall, C. A. S. & Klitgaard, K. A. Energy and the Wealth of Nations. An Introduction to Biophysical Economics (Springer, 2017).
Stern, D. I. The role of energy in economic growth. Ann. N. Y. Acad. Sci. 1219, 26–51 (2011).
Article ADS PubMed Google Scholar
Jackson, T. & Victor, P. A. Unraveling the claims for (and against) green growth. Science 366, 950–951 (2019).
Article ADS CAS PubMed Google Scholar
Vadén, T. et al. Decoupling for ecological sustainability: A categorisation and review of research literature. Environ. Sci. Policy 112, 236–244 (2020).
Article PubMed PubMed Central Google Scholar
Quéré, C. L. et al. Drivers of declining CO₂ emissions in 18 developed economies. Nat. Clim. Chang. 9, 213–217 (2019).
Article ADS Google Scholar
Hubacek, K., Chen, X., Feng, K., Wiedmann, T. & Shan, Y. Evidence of decoupling consumption-based CO2 emissions from economic growth. Adv. Appl. Energy 4, 100074 (2021).
Article CAS Google Scholar
Lamb, W. F., Grubb, M., Diluiso, F. & Minx, J. C. Countries with sustained greenhouse gas emissions reductions: an analysis of trends and progress by sector. Clim. Policy 22, 1–17 (2022).
Article Google Scholar
Haberl, H. et al. A systematic review of the evidence on decoupling of GDP, resource use and GHG emissions, part II: synthesizing the insights. Environ. Res. Lett. 15, 065003 (2020).
Article ADS Google Scholar
Shafiei, S. & Salim, R. A. Non-renewable and renewable energy consumption and CO2 emissions in OECD countries: a comparative analysis. Energy Policy 66, 547–556 (2014).
Article CAS Google Scholar
Steinberger, J. K., Krausmann, F. & Eisenmenger, N. Global patterns of materials use: a socioeconomic and geophysical analysis. Ecol. Econ. 69, 1148–1158 (2010).
Article Google Scholar
Parikh, J. & Shukla, V. Urbanization, energy use and greenhouse effects in economic development: Results from a cross-national study of developing countries. Glob. Environ. Change 5, 87–103 (1995).
Article Google Scholar
Rahman, M. M. Do population density, economic growth, energy use and exports adversely affect environmental quality in Asian populous countries? Renew. Sustain. Energy Rev. 77, 506–514 (2017).
Article Google Scholar
Creutzig, F., Baiocchi, G., Bierkandt, R., Pichler, P.-P. & Seto, K. C. Global typology of urban energy use and potentials for an urbanization mitigation wedge. Proc. Natl Acad. Sci. USA 112, 6283–6288 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Creutzig, F. How fuel prices determine public transport infrastructure, modal shares and urban form. Urban Clim. 10, 63–76 (2014).
Article Google Scholar
Krausmann, F., Weisz, H. & Eisenmenger, N. in Social Ecology: Society-Nature Relations Across Time And Space (eds Haberl, H., Fischer-Kowalski, M., Krausmann, F. & Winiwarter, V.) Ch. 3 (Springer, 2016).
Güneralp, B. et al. Global scenarios of urban density and its impacts on building energy use through 2050. Proc. Natl Acad. Sci. USA 114, 8945–8950 (2017).
Article ADS PubMed PubMed Central Google Scholar
Kenworthy, J. R. Passenger transport energy use in ten Swedish cities: understanding the differences through a comparative review. Energies 13, 3719 (2020).
Article CAS Google Scholar
Newman, P. W. G. Sustainability and cities: extending the metabolism model. Landsc. Urban Plan. 44, 219–226 (1999).
Article Google Scholar
Seto, K. C. et al. Human settlements, infrastructure and spatial planning. In Climate Change 2014: Mitigation of Climate Change. Working Group III contribution to the IPCC Fifth Assessment Report (AR5) of the Intergovernmental Panel for Climate Change (eds Edenhofer, O. et al.) 923–1000 (Cambridge Univ. Press, 2014).
Newman, P. W. G. & Kenworthy, J. R. Transport and urban form in thirty‐two of the world’s principal cities. Transp. Rev. 11, 249–272 (1991).
Article Google Scholar
Pomponi, F., Saint, R., Arehart, J. H., Gharavi, N. & D’Amico, B. Decoupling density from tallness in analysing the life cycle greenhouse gas emissions of cities. npj Urban Sustain 1, 33 (2021).
Article Google Scholar
Haberl, H. et al. Contributions of sociometabolic research to sustainability science. Nat. Sustain. 2, 173–184 (2019).
Article Google Scholar
Haberl, H. et al. High-resolution maps of material stocks in buildings and infrastructures in Austria and Germany. Environ. Sci. Technol. 55, 3368–3379 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Kennedy, C. The energy embodied in the first and second industrial revolutions. J. Ind. Ecol. 24, 887–898 (2020).
Article Google Scholar
Krausmann, F. et al. Global socioeconomic material stocks rise 23-fold over the 20th century and require half of annual resource use. Proc. Natl Acad. Sci. USA 114, 1880–1885 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Lamb, W. F. et al. A review of trends and drivers of greenhouse gas emissions by sector from 1990 to 2018. Environ. Res. Lett. 16, 073005 (2021).
Article ADS CAS Google Scholar
Hertwich, E. G. Increased carbon footprint of materials production driven by rise in investments. Nat. Geosci. 14, 151–155 (2021).
Article ADS CAS Google Scholar
Grubler, A. et al. A low energy demand scenario for meeting the 1.5 °C target and sustainable development goals without negative emission technologies. Nat. Energy 3, 515 (2018).
Article ADS Google Scholar
Sims, R. et al. Transport. In Climate Change 2014, Mitigation of Climate Change. Contribution of Working Group III to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change (eds Edenhofer, O., Pichs-Madruga, R. & Sokona, Y.) 599–670 (Cambridge Univ. Press, 2014).
Wiedenhofer, D. et al. A systematic review of the evidence on decoupling of GDP, resource use and GHG emissions, part I: bibliometric and conceptual mapping. Environ. Res. Lett. 15, 063002 (2020).
Article ADS Google Scholar
Lim, J., Kang, M. & Jung, C. Effect of national-level spatial distribution of cities on national transport CO2 emissions. Environ. Impact Assess. Rev. 77, 162–173 (2019).
Article Google Scholar
Voorhies, W. I., Miller, J. A., Yao, J. K., Bunge, S. A. & Weiner, K. S. Cognitive insights from tertiary sulci in prefrontal cortex. Nat. Commun. 12, 5122 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Shi, X., Wang, K., Cheong, T. S. & Zhang, H. Prioritizing driving factors of household carbon emissions: An application of the LASSO model with survey data. Energy Econ. 92, 104942 (2020).
Article Google Scholar
Tibshirani, R. J. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B 58, 267–288 (1996).
MathSciNet MATH Google Scholar
Creutzig, F. et al. in Climate Change 2022: Mitigation of Climate Change (Intergovernmental Panel on Climate Change, 2022).
IPCC. Climate Change 2022. Mitigation of Climate Change (Intergovernmental Panel on Climate Change, 2022).
Cao, Z. et al. The sponge effect and carbon emission mitigation potentials of the global cement cycle. Nat. Commun. 11, 3777 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Krausmann, F., Wiedenhofer, D. & Haberl, H. Growing stocks of buildings, infrastructures and machinery as key challenge for compliance with climate targets. Glob. Environ. Change 61, 102034 (2020).
Article Google Scholar
Watari, T. et al. Global metal use targets in line with climate goals. Environ. Sci. Technol. 54, 12476–12483 (2020).
Article ADS CAS PubMed Google Scholar
Ewing, R. & Cervero, R. Travel and the built environment. J. Am. Plan. Assoc. 76, 265–294 (2010).
Article Google Scholar
Borck, R. & Brueckner, J. K. Optimal energy taxation in cities. J. Assoc. Environ. Resour. Economists 5, 481–516 (2018).
Google Scholar
Schug, F. et al. High-resolution mapping of 33 years of material stock and population growth in Germany using Earth Observation data. J. Ind. Ecol. 27, 110–124 (2023).
Article Google Scholar
Schug, F. et al. High-resolution data and maps of material stock, population, and employment in Austria from 1985 to 2018. Data Brief. 47, 108997 (2023).
Article CAS PubMed PubMed Central Google Scholar
Vogel, J., Steinberger, J. K., O’Neill, D. W., Lamb, W. F. & Krishnakumar, J. Socio-economic conditions for satisfying human needs at low energy use: An international analysis of social provisioning. Glob. Environ. Change 69, 102287 (2021).
Article Google Scholar
Geofabrik. Open Street Map Data Extracts. (Geofabrik Download Server, last download date 13.5.2020, 2020).
Buchhorn, M. et al. Copernicus global land service: land cover 100m: collection 2: epoch 2015: globe. Zenodo https://doi.org/10.5281/zenodo.3243509 (2019).
Florczyk, A. J. et al. GHSL Data Package 2019 (Publication Office of the European Union, 2019).
Esch, T. et al. Where we live—A summary of the achievements and planned evolution of the global urban footprint. Remote Sens. 10, 895 (2018).
Article ADS Google Scholar
Gong, P. et al. Annual maps of global artificial impervious area (GAIA) between 1985 and 2018. Remote Sens. Environ. 236, 111510 (2020).
Article ADS Google Scholar
Geofabrik GmbH. OpenStreetMap data extracts - Geofabrik Download Server. https://download.geofabrik.de/.
Eurostat. Countries - GISCO archive. https://ec.europa.eu/eurostat/web/gisco/geodata/reference-data/administrative-units-statistical-units/countries.
Löw, M. et al. Datasets on global patterns of settlements and infrastructures. https://doi.org/10.5281/zenodo.5876941 (2023).
Löw, M. & Matej, S. Software code to calculate datasets on global patterns of settlements and infrastructures. https://doi.org/10.5281/zenodo.5883652 (2023).
UN. National Accounts - Analysis of Main Aggregates (AMA). United Nations Statistics Division https://unstats.un.org/unsd/snaama/ (2021).
World Bank Group. Population. World Bank Data https://data.worldbank.org/indicator/SP.POP.TOTL (2021).
World Bank Group. Urban Population Rate. World Bank Data https://data.worldbank.org/indicator/SP.URB.TOTL.IN.ZS (2021).
World Bank Group. Pump price for gasoline. World Bank Data https://data.worldbank.org/indicator/EP.PMP.SGAS.CD (2021).
IEA. Weather for Energy Tracker. International Energy Agency https://www.iea.org/articles/weather-for-energy-tracker (2021).
Global Carbon Project. Supplementary data of Global Carbon Budget 2020 (Version 1.0) [Data set]. Global Carbon Project https://doi.org/10.18160/gcp-2020 (2020).
Friedlingstein, P. et al. Global Carbon Budget 2020. Earth Syst. Sci. Data 12, 3269–3340 (2020).
Article ADS Google Scholar
IEA. World Energy Balances 2018. https://www.iea.org/statistics/balances/ (2018).
Riggio, J. et al. Global human influence maps reveal clear opportunities in conserving Earth’s remaining intact terrestrial ecosystems. Glob. Change Biol. 26, 4344–4356 (2020).
Article ADS Google Scholar
Cohen, J., Cohen, P., West, G. & Aiken, L. S. Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences (Erlbaum, 2003).
Hastie, T. J., Tibshirani, R. J. & Wainwright, M. Statistical Learning with Sparsity: The Lasso and Generalizations (CRC Press, 2015).
Hoerl, A. & Kennard, R. Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12, 55–67 (1970).
Article MATH Google Scholar

Download references

Acknowledgements

This research has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program, MAT_STOCKS, grant agreement No 741950, awarded to H.H., and from Grant PID2020-119152GB-I00, funded by MCIN/AEI/ 10.13039/501100011033, awarded to J.A.D.

Author information

These authors contributed equally: Helmut Haberl, Markus Löw, Alejandro Perez-Laborda.

Authors and Affiliations

Institute of Social Ecology, University of Natural Resources and Life Sciences, Vienna, Austria
Helmut Haberl, Markus Löw, Sarah Matej, Barbara Plank, Dominik Wiedenhofer & Karl-Heinz Erb
Economics Department and Eco-SOS, Universitat Rovira i Virgili, Tarragona, Spain
Alejandro Perez-Laborda & Juan Antonio Duro
Mercator Research Institute on Global Commons and Climate Change, EUREF 19, 10829, Berlin, Germany
Felix Creutzig
Technical University Berlin, Straße des 17 Junis 135, 10623, Berlin, Germany
Felix Creutzig

Authors

Helmut Haberl
View author publications
You can also search for this author in PubMed Google Scholar
Markus Löw
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Perez-Laborda
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Matej
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Plank
View author publications
You can also search for this author in PubMed Google Scholar
Dominik Wiedenhofer
View author publications
You can also search for this author in PubMed Google Scholar
Felix Creutzig
View author publications
You can also search for this author in PubMed Google Scholar
Karl-Heinz Erb
View author publications
You can also search for this author in PubMed Google Scholar
Juan Antonio Duro
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.H. conception of research design, data analysis, interpretation of results, article drafting, project management, and funding acquisition. M.L. development of spatial indicators, spatial analyses, data analysis, quantification of indicators, designing figures, and contribution to writing. A.P.-L. conceptualization and implementation of statistical analyses, data analysis, designing figures, and contribution to article writing, S.M. spatial and statistical analyses, data analysis and interpretation, and contribution to writing, B.P. collection of STIRPAT indicators, data analysis and interpretation, and contribution to writing, D.W., F.C., and K.-H.E. contribution to research design, data analysis and interpretation, and contribution to writing, J.A.D. conception and supervision of statistical analyses, research design, data analysis and interpretation, and contribution to writing.

Corresponding author

Correspondence to Helmut Haberl.

Ethics declarations

Competing interests

The authors declare no competing interests

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Haberl, H., Löw, M., Perez-Laborda, A. et al. Built structures influence patterns of energy demand and CO₂ emissions across countries. Nat Commun 14, 3898 (2023). https://doi.org/10.1038/s41467-023-39728-3

Download citation

Received: 12 January 2023
Accepted: 23 June 2023
Published: 03 July 2023
DOI: https://doi.org/10.1038/s41467-023-39728-3

This article is cited by

Analysis of influencing factors of energy consumption in Beijing: based on the IPAT model
- Zheng Zhang
- Xianzhong Mu
- Guangwen Hu
Environment, Development and Sustainability (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.