Main

The production of waste as a fundamental aspect of living systems has characterized the history of the biosphere (including the oxygenation of the atmosphere as a photosynthetic by-product1), inspired evolutionary transitions2,3, and constrained the ecological dynamics of all temporal and spatial scales. The balance and cycling of several key greenhouse gases such as carbon dioxide are largely defined as the waste products of biological and industrial metabolism4,5,6. For humans, the management of waste is a central consideration for health, well-being, quality of life, impact on the environment, efficient economies, and climate change7,8,9,10,11. Furthermore, these considerations have motivated everything from sewage systems to environmental regulations and the handling of medical and nuclear waste.

Human society is currently characterized by rapid population growth and urbanization, and thus the ability to quantify and forecast the mechanisms behind urban waste production and reduce waste has tremendous benefits for policy and planning, strategic technological developments, and ecological modeling. However, the systematic connections between shifts in waste production and urbanization have not yet been described. The challenge is that frameworks for waste need to account for the nonlinear effects associated with city size to understand the fundamental mechanisms of waste production.

Scaling theory is an effective way to illuminate systematic behavior across diverse systems and to reveal novel mechanisms, specifically considering how features of a system change with its size12. Scaling theory has been successful in a variety of biological applications ranging from organism physiology to the structure of forests and mammalian ecosystems12,13,14,15,16. In many cases, the scaling exponents between various features and size can be derived from fundamental physical, physiological, and structural limitations, revealing the fundamental mechanisms underlying the systematic behavior.

More recently, various studies have applied scaling theory to quantify mechanistic processes operating in urban environments12,17,18,19,20,21, describing everything from the production of new knowledge to shifts in inequality with city size19,22,23. It is essential to note that similar to the biological scaling theories, these urban scaling theories can be derived from fundamental mechanisms of geometry, transport, and the requirements of individual components12,17,18,19,20,21.

Cities typically exhibit sublinear, linear, or superlinear scaling as characterized by the exponent β (Methods). Sublinear scaling (scaling exponents <1) often results from the efficiencies of scaling up (economy of scale) and is typically associated with physical infrastructure such as roads12,17,18,19,20,21. Linear scaling is often driven by individual needs that are density-independent such as total housing and household electricity consumption19. Superlinear scaling (scaling exponents >1) is often the consequence of densifying social interactions, and recent findings typically associate it with intellectual or virtualized features such as patent production or wealth creation12,17,18,19.

For waste production, distinct types of waste may have fundamentally different relationships with urbanization. For example, coal-fired power plants achieve efficiencies of scale in terms of CO2 per kilowatt hour (ref. 24), and thus if larger cities employ larger power plants, we might expect sublinear scaling with city size. In contrast, increased social connections and intellectual activity could increase the output of certain types of waste, but of which ones (and why) remains unclear. For example, previous work has shown that food waste, CO2 emissions, and water consumption all have complicated relationships that are dependent on population, infrastructure, gross domestic product (GDP), development level, local climate, and regional geographic characteristics20,25,26,27,28,29,30,31,32,33,34,35,36,37.

In this study, we synthesized three major forms of human waste in more than 1,000 cities across 171 countries. The cities span populations of 50,000 to 24 million people (Fig. 1a and Extended Data Fig. 1). We examined Chinese cities as a detailed case study of rapid urbanization where detailed data are available for diverse cities within a single nation.

Fig. 1: Scaling law of waste production across cities worldwide.
figure 1

a, Geolocation of the cities included in this study from three distinct data sources (see Data compiling in Methods). MoHURD, Ministry of Housing and Urban Rural Development (China). The map was generated using R with the ‘ggplot2’ package. b, Wastewater production scales superlinearly with the size of cities (β = 1.15 ± 0.04, n = 675). We highlight two example cities (black circles) that stand out with a large deviation from the scaling law. Dongguan, an industrial city of southern China that features high personal wealth and high annual precipitation, generates disproportionately more wastewater than expected given its size. In contrast, the northwestern city of Tianshui, which features much lower personal wealth and rainfall, generates much less wastewater than expected given its size. c, MSW production scales linearly with city size (β = 1.04 ± 0.05, n = 292). We highlight Seattle (United States) and Lilongwe (Malawi) as two cities that deviate from the general scaling relationship. The much richer Seattle produces eight times more municipal waste than Lilongwe, even though it has a smaller population. d, The emission of GHGs displays sublinear scaling across cities worldwide (β = 0.85 ± 0.1, n = 296). We highlight Rotterdam (the Netherlands) and Bandung (Indonesia) as two cities that deviate from the general scaling relationship, with Rotterdam producing disproportionately more GHGs. The purpose of highlighting certain high- and low-residual cities is to give concrete examples so that readers can relate to the abstract data points presented here (no subjective judgements are made here). The dark gray error bands in bd represent the CIs of each scaling relationship.

Scaling laws of waste production differ across waste types

We found universal scaling laws of waste production globally across diverse urban systems spanning all three major forms of waste that we considered: wastewater (Fig. 1b), municipal solid waste (MSW; Fig. 1c), and greenhouse gas (GHG) emissions (Fig. 1d). According to these scaling laws, the overall magnitude of waste production in a city can be reliably predicted based on city size as measured by urban population.

What is particularly intriguing, however, is that these scaling laws are functionally distinct across different waste types, probably due to inherently different mechanisms of waste generation. For example, the production of MSW (Fig. 1c) seems to scale linearly with city size (β ≈ 1, 95% confidence interval (CI) = (0.99, 1.09), r2 = 0.83, P < 0.001), that is, doubling a city’s population will double its MSW generation. This linear relationship suggests that MSW generation is driven by individual needs, independent of the density or size of the city where people reside.

In contrast, both wastewater generation (Fig. 1b) and GHG emissions (Fig. 1d) scale nonlinearly with city size (β ≠ 1), that is, doubling a city’s size will not double its generation of wastewater or GHGs. Perhaps most intriguingly, the production of wastewater seems to scale superlinearly with city size (β = 1.15, 95% CI = (1.11, 1.20), r2 = 0.79, P < 0.001) such that doubling the urban population will more than double waste production. The theory of urban scaling suggests that superlinear scaling in cities is driven by the increased rates of interpersonal interactions as cities densify with increasing population size38. This type of scaling leads to greater rates of economic activity such as the creation of wealth18,39,40,41,42. This co-occurrence of superlinear wealth creation and wastewater production points to the possibility that wealth creation and water consumption might be tightly coupled; the full set of interconnected mechanisms needs to be uncovered in the future.

In contrast to wastewater production, GHG emissions scale sublinearly with city size (β = 0.85, 95% CI = (0.75, 0.95), r2 = 0.47, P < 0.001), pointing to a relative economy of scale for the processes that generate GHGs43,44. For example, the energy efficiency of urban transportation systems increases with city size and population density due to the increasing use of public transportation45. In addition, our findings remain robust regardless of whether we used only emissions generated within the city boundary (scope-1 emissions) or emissions generated outside the city boundary such as imported grid power (scope-2 emissions; Extended Data Fig. 2a). However, we need to acknowledge that scope-3 emission data (emissions not accounted for by scope-1 and scope-2, that occur outside of the city boundary as a result of activities taking place within the city boundary) were not included in our analysis, even though scope-3 data can account for a sizable portion of the total emissions (median 9% based on the C40 dataset, see Methods). With increasing city participation and increasing coverage of scope-3 activities, future efforts could shed more light on the impacts of scope-3 emissions.

Given the challenges of acquiring GHG data, these data are historically limited to bigger cities from more developed regions46. Putting our findings into context, what makes our analysis unique is that it includes ~300 global cities that vary in size by more than two orders of magnitude, which makes our findings less sensitive to particular choices of city groups or regions. We would also like to note that the application of scaling theory is most useful in revealing macroscale patterns that are agnostic of the particular underlying processes, in this case, the coarse-grained relationship between GHG emissions and city size. The purpose of scaling analysis is to provide us with a point of departure for further in-depth and city-specific analyses (discussed in the next section). For this particular topic, fruitful explorations trying to unpack this relationship have already been reported29,47.

Overall, the observation that various types of waste are characterized by significant scaling relationships is important and implies, on average, that waste generation is determined by a common set of organizing principles related to city size globally. While the central relationship accounts for most of the variation, it is also useful to consider the cities that deviate significantly from the power law. These deviations might be the result of an ensemble of factors: climatic conditions, economic development, convention, governance policies, and reporting practices, to name a few. In our quantitative framework, deviations are most productively considered in terms of the normalized distance from the scaling relationship, which takes into account any underlying nonlinear effects (for example, see ref. 22). The scale-adjusted deviations in turn become a powerful tool to spot anomalies, unpack hidden variance, and inform city-level policymaking. For example, in China there is a gradient in both wealth and aridity as one moves inland from the coast, and one should expect such gradients to alter water use and wastewater production. Indeed, Fig. 1b shows that Dongguan (high precipitation and high per capita GDP) uses more water than expected and Tianshui (low precipitation and low per capita GDP) uses less water than expected. Similarly, Seattle and Malawi are very similar in population but differ by a factor of roughly 40 in per capita GDP, leading to roughly an order of magnitude difference in MSW production (Fig. 1c). In the next section we evaluate such deviations and unify the variation under a common framework that combines wealth and the natural environment.

Scale-adjusted deviations reveal city-specific efficiency

A central idea in evolution and ecology is that local species are adapted to local environments. Traits found in an environment will typically be relatively well matched to that environment, such that environments with scarce water will have species that use water more efficiently. This idea extends to the lifestyles and strategies of hunter–gatherer groups and early human settlements48,49,50. For cities, then, to what extent are patterns of waste production influenced by local environmental constraints, or do increasingly global supply chains decouple local constraints from local activities?

We tested this idea by examining the residuals of scaling relationships as a way to define relative over- and underperformance after accounting for the inherent economies or diseconomies of scale22,51. Consistent with intuition, the amount of rainfall that cities receive helps to explain the deviations of wastewater production from the scaling law: cities with higher rainfall produce more wastewater than expected for their size, and cities with lower rainfall use water more efficiently. In addition to this local constraint (that is, natural precipitation), the production of wastewater is impacted by socio-economic factors such as per capita GDP. Richer cities generate more wastewater than expected for their size and poorer cities generate less than expected. The combined result is a planar function (Fig. 2a) that can explain a considerable fraction of the city-specific deviations from the wastewater scaling law observed in Fig. 1b (adjusted r2 = 0.24, n = 282).

Fig. 2: Deviation from scaling laws explained by environmental and economic factors.
figure 2

a, The residuals of the wastewater scaling relationship can be explained by the combination of annual precipitation (Precip) and per capita GDP (perGDP) using a planar function (Residual = 0.35log10(perGDP) + 0.0001Precip − 1.6; r2 = 0.24, n = 282). b, The residual of the MSW scaling law can be explained by per capita GDP in a nonlinear relationship (Residual = 0.28log10(perGDP) − 1.14; r2 = 0.26, n = 165). c, The residual of the GHG scaling law can be explained by per capita GDP in a nonlinear relationship (Residual = 0.6log10(perGDP) − 2.56; r2 = 0.36, n = 148). Note that we removed three outliers from this regression. Keeping the three outliers would not alter the slope of the regression (slope = 0.6) but would shift downwards the intercept (intercept = −2.65). See Extended Data Fig. 3 for a version of the regression in which we retained the outliers. The error bands (gray shading) in b and c represent the CIs of each regression fit around the mean. Note that in b and c, the x axes are logarithmic scales, while the residual values on the y axes are in log10 units.

For the production of MSW and GHGs, the city-specific deviations from the scaling laws are not explained by natural constraints such as rainfall or temperature. Instead, we found that these deviations are best explained by per capita wealth (Fig. 2b, adjusted r2 = 0.26; Fig. 2c, adjusted r2 = 0.36; Extended Data Fig. 3). In both cases, increasing wealth will lead to a positive deviation from the scaling law and thus create cities that are more wasteful. This finding implies that certain types of waste are emergent phenomena related to the internal characteristics of cities rather than being constrained by the natural attributes of the immediate surroundings.

We next examined cities with relatively high residuals (>75% quantile, hereafter wasteful cities) and low residuals (<25% quantile, more efficient cities). A detailed knowledge of these cities might inform policymaking and future planning. Consistent with our residual analysis, we found that for wastewater generation, more wasteful cities on average receive 34% more annual rainfall than relatively efficient cities (1,030 mm versus 767 mm, P < 0.001; Extended Data Fig. 4a). We also found that, for wastewater generation, these wasteful cities have more than double the per capita GDP of these efficient cities (US$27,164 versus US$13,312, P < 0.001; Extended Data Fig. 4b). For the production of MSW and GHGs, we found no significant differences in any of the environmental variables that we analyzed, but strong (three- to fourfold) differences in per capita GDP between wasteful and efficient cities (Extended Data Fig. 4b,c).

We note that there are many other factors (not included in our analysis) that can cause deviations from the scaling law, including topography, culture, city-level policies, and path dependence on historical events, to name a few. However, our results suggest that, at a coarse-grained bigger-picture level, the waste that cities produce can be understood in terms of both the city size (Fig. 1) and the influence of financial and natural resources (Fig. 2 and Extended Data Fig. 4). These results are important for forecasting waste production into the future, as we discuss next.

Waste scaling evolves with time

Another important consideration for cities is how waste generation is changing in time (Fig. 3). This is especially relevant as nations continue to increase total wealth, urbanize, and implement new technologies that may either lower or increase consumption and waste generation. From a scaling perspective, this can be represented as a change in scaling exponents through time52 (Fig. 3a–c). For example, an exponent that rises in time indicates that larger cities are increasing waste production faster than smaller cities.

Fig. 3: Temporal evolution of the scaling relationship.
figure 3

a,d, The scaling exponent of wastewater production has hovered at around 1.15 over the past two decades for over 600 Chinese cities (the wastewater data were taken from the MoHURD database (denoted MoHURD_wastewater)) (a). The black circle represents the scaling slope from the cross-sectional scaling relationship shown in Fig. 1b. The color gradient of diamonds from blue to red indicates the scaling slope of the wastewater production versus city size data for each year from d. The error bars in a are based on the following sample sizes n from 2006 to 2020: 637, 642, 644, 646, 648, 646, 645, 645, 642, 638, 637, 638, 648, 654, and 658. b,e, The scaling exponent of MSW started as sublinear but over time gradually approached linear scaling (MoHURD_MSW dataset), tending towards the global average (black circle) shown in Fig. 1c (b). The color gradient of diamonds from blue to red indicates the scaling slope of the MSW production versus city size data for each year from e. Due to varying data availability, the error bars in b are based on the following sample sizes n from 2006 to 2020: 639, 643, 645, 645, 648, 646, 645, 645, 642, 638, 637, 638, 648, 654, and 658. c,f, A temporal analysis of the C40 dataset displays higher variation, but the estimated exponents consistently hover around 0.85 (c). Due to the small size of the temporal dataset, the uncertainty in the scaling exponents is large, yet the central tendency corroborates the sublinear scaling of the global analyses shown in Fig. 1d (black circle). The color gradient of diamonds from blue to red indicates the scaling slope of the GHG emissions versus city size data for each year from f. The scaling analyses in c and f were restricted to years when data were available from more than 15 cities (that is, 2012–2019, see Methods and Extended Data Fig. 5). Due to varying data availability, the error bars in c are based on the following sample sizes n from 2012 to 2019: 17, 22, 30, 36, 42, 27, 39, and 22. The error bars in ac and the error bands in df represent 95% CIs around the mean. Shaded regions in ac separate the static scaling exponents of Fig. 1b–d from the exponent time series in this figure, with horizontal dashed lines as reference lines.

We found that wastewater generation (Fig. 3a,d) maintains a consistent superlinear scaling. This diseconomy of scale seems to have stabilized, implying that continuous urbanization is expected to radically increase the amount of urban wastewater produced. GHG emission (Fig. 3c,f) maintains a consistent sublinear scaling in time. Despite the uncertainty associated with a smaller number of cities in the temporal dataset (Fig. 3c, Extended Data Fig. 5, and Supplementary Table 1), cities maintain a relative economy of scale over time for GHG emissions, consistent with our global analysis in Fig. 1d. In contrast, MSW shifts from strongly sublinear scaling to linear scaling (Fig. 3b). In China, where we have temporal data, the economy of scale in solid waste production is lost by cities over time. On the upside, cities of various sizes seem to be equilibrating to near linear scaling in time, which implies that urbanization matters less as people in cities of all sizes produce comparable amounts of solid waste per capita. On the downside, the economies of scale that have been historically achieved for urban solid waste production no longer exist. This trend suggests that cities are drifting away from a future that would realize considerable solid waste reduction with increasing city size.

Breaking the vicious circle of waste production

Our work provides a framework for understanding current and future human waste production. If cities simply change in population size, they should follow the current scaling relationship defining the system that they are a part of. If the exponents are sublinear overall, waste per capita should decrease, although total waste will increase given the increasing total population. If the exponents are superlinear, waste will dramatically increase due to both a larger population and larger per capita production. This perspective highlights the startling challenges associated with superlinear waste production, such as wastewater, which will increase dramatically as cities increase in size. Our results provide a very moderate amount of hope for waste products with economies of scale, such as GHGs, where per capita generation is decreasing with city size. However, for net-zero emission goals to be achieved, all cities need to adjust their overall magnitude of emissions in combination with an economy of scale: decreasing both the exponent and overall magnitude of urban scaling relationships is required.

Ongoing urbanization in sub-Saharan Africa, India, and other areas of the developing world should lead to an increase in both city sizes and personal wealth. In particular, the world’s population is expected to increase by 2 billion people in the next 30 years53, a 26% increase in population. In contrast, the urban population is projected to increase by ~60% (ref. 54). The population increase alone will mean more waste production even for products that grow sublinearly, but for superlinear waste production, population growth combined with increasing urbanization and increasing wealth implies a rapid growth in per capita waste. More concretely, if a current city of 1 million people doubles in size, then wastewater production will rise by 122%, equivalent to an increase from 100 to 221 Mt yr−1.

Figure 4a illustrates the types of growth trajectory that cities may take and how these will change waste production. Here, increases in GDP or population will increase waste production, and a combined increase will change waste production most rapidly, which is the scenario most likely to occur globally. Indeed, such trends are observed as cities are mostly increasing in size in time and MSW production is rising in various size categories of Chinese cities (Fig. 4b,c).

Fig. 4: Understanding the forces that drive waste generation across cities and through time.
figure 4

a, A city that is currently situated at the lower left of the worldwide waste scaling line can change in two orthogonal directions: (1) a change in the urban population (that is, city size), shrinking or growing, parallel to the scaling line, and (2) a change in efficiency orthogonal to the scaling line. In the example we give, the underdeveloped city (denoted by the gray circle) increases in population and increases their per capita waste generation (that is, decreasing efficiency driven by economic growth) and consequently shifts to the upper right of the scaling line (denoted by the black circle). b, When summing up the total MSW generation of Chinese cities according to their size classification, our analysis reveals that the biggest cities have the highest growth rate of waste production, while the smallest cities have the lowest growth rate. The size classification of cities is based on the urban population in 2020: urban populations of 50,000–180,000, 180,000–300,000, 300,000–700,000, and >700,000. c, The rapid growth of waste generation in large cities is driven by both China’s rapid urbanization in the past 15 years and a steady increase in the per capita waste generation rate in the largest cities (indicated by the size of circle). Note that the per capita waste has decreased in small cities, but the total population in small cities is a magnitude smaller than that of bigger cities.

Increases in GDP often improve the quality of life for individuals in cities, so we need ways to decouple the link between increasing economic prosperity and per capita waste production55,56. Perhaps we can look to San Francisco or Japanese cities as examples of high GDP and low waste cities. For roughly the same per capita GDP, Japan generates one-third of the MSW per capita compared with the United States, and San Francisco generates less MSW per capita than any other major city in the United States57. The structural features, cultural dynamics, and policies that allow these cities to reduce waste need to be more systematically understood in connection with scaling and deployed in most global cities.

The hidden side of human productive economies

Most economic and social theories of human civilization, including urban scaling theory, focus on the concepts of production, growth, innovation, and the forces that ultimately limit these productive processes. In contrast, on the other half of the equation, the by-products of these productive processes (that is, the consequences of waste) have largely been overlooked. This work fills this knowledge gap, illuminating the scaling of waste production in urban systems.

This perspective relates to classic ecology and the history of life, where the accumulation of waste is seen as a fundamental limit for all species58,59. Complete theories of economies must implement similar considerations for human society. Moving forward, a complete science of waste would couple by-products to products, and an ultimate science of the economy would consider the feedbacks between production and waste, including eventual hindrances produced by waste. Notably, this is already considered in many studies that seek to price the eventual economic costs of global warming due to anthropogenic CO2 emissions60. These considerations should be expanded to all types of waste through the lens of urban systems and how waste scales with city size as we have presented here.

Quantifying the numerous dimensions of waste will be important for a complete theory of production and waste generation. For example, our quantitative framework should be extended to all types of waste to understand the complete dynamics of waste generation. According to our country-level analysis, for each gigatonne of MSW generated, a country on average generates 2 Gt of construction waste, 600 Mt of agricultural waste, 300 Mt of industrial waste, 100 Mt of hazardous waste, 10 Mt of electronic waste, and 8 Mt of medical waste (Fig. 5).

Fig. 5: Global accounting of country-level waste generation.
figure 5

Country-level waste generation ranked according to the median value across nine major types of waste: wastewater, GHGs, construction waste, MSW, agricultural waste, industrial waste, hazardous waste, electronic waste, and medical waste. Each data point represents a total value for a specific country. The numbers shown at the bottom of the plot represent the ratio of the medians of the various waste types to that of MSW. The median value of country-level MSW production is ~1.8 Mt yr−1 (red dashed line). The letters a–e and bc indicate significant pairwise differences between waste types (α = 0.05). Wastewater, GHG emissions, and MSW (shaded gray) were the focus of our city-level analyses and are highlighted with colors consistent with Figs. 13. Open circles indicate waste types for which we did not have city-level data. In each box plot, the lower and upper bounds of the whiskers denote minima and maxima, the center line denotes the median, and the lower and upper bounds of the boxes represent the 25% and 75% quantiles, respectively.

These wastes impose a heavy burden on our society and natural ecosystems by their sheer volume, but more importantly, threaten public health and safety by polluting air and drinking water61, clogging sewage systems and creating flooding62, and polluting soil and consequently the entire food web through cascading interactions63. Our findings highlight the urgent need for a holistic approach to account for all of these waste types, and illustrate the lack of city-level waste data that are much needed for future analysis. In addition, future analyses of economic performance must incorporate the costs of waste, for example, via natural capital accounting64, in connection with normalizations that consider city size and local natural resources.

Taken together, our work shows the need for a general theory that incorporates the integrated science of economic production, waste generation, and, eventually, the full cycle of material flow. Realizing this goal should enhance humanity’s pursuit of a more sustainable future.

Methods

Scaling theory

Urban scaling theory relates city size (N) to a given feature of interest (Y) through power law relationships of the form \(Y={Y}_{0}{N}^{\beta }\), where Y0 is the normalization constant and β is the scaling exponent12,19. The power of this perspective is that the exponent β is often not equal to one, demonstrating nonlinearities that contradict simple per capita perspectives together with an economy or diseconomy of scale; typically, the exponent reveals something about the underlying mechanisms. For example, superlinear exponents are the consequence of densifying social interactions with urban environments and are typically related to intellectual or virtualized features such as knowledge creation or wealth production18,19,39,41,42. At the most basic level, there are more people per unit area in larger cities, such that interactions are easier with the total interaction rate between individuals increasing with city size38. Sublinear exponents are typically related to infrastructural economies of scale such as the total area of roads or the total number of gas stations18,65,66.

A powerful aspect of this theory is that it allows us to transcend the individual and consider the non-trivial effects of larger collections of humans across cities, regions, and countries12. However, we need to caution that scaling analysis across national borders inevitably introduces noise arising from country-specific variations in economic development, cultural habits, policy, and geographic and climatic factors. Indeed, early application of scaling theory has largely focused on city groups within the same country19,67. However, the benefit of scaling across cities of various design and conditions from different countries lies in its ability to reveal underlying mechanisms that shape all urban systems regardless of their home country. An analogy can be found in the application of scaling theory in biology: scaling analysis across species within the same genus elucidates taxon-level adaptations68, while scaling analysis across all species in the same class or even kingdom14,69 reveals biophysical boundaries and shared designs.

Precisely because of its strength in uncovering fundamental shared principles, scaling theory has now been used to investigate city groups beyond national boundaries, from cities in the European Union19,39 to more diverse metropolitans worldwide47. Ultimately, the scaling relationships provide a non-trivial baseline—country-specific, region-specific, or global—against which we can measure deviations and variance from expectation22,51. These deviations reveal the character of individual cities and elucidate higher-order mechanisms beyond the scale of a city, and in many cases they can contribute to the design of better scale-adjusted metrics and policies.

Data compiling

We compiled five city-level waste datasets from four different sources (Supplementary Table 1) that encompass three major waste types: wastewater, MSW (colloquially known as ‘city trash’), and GHG emissions (CO2 equivalent). Each dataset contains urban population data consistent with the definition of city within each dataset. We used a minimum threshold of 50,000 urban residents to define a city70. Note that we did not combine all these datasets into a single homogenized dataset, primarily because of the distinct nature of gaseous, liquid, and solid waste. But another practical reason is that the definition of a city or urban system is only consistent within each source dataset (our scaling analysis is not sensitive to the potential inconsistency of city definitions as it was only performed within each dataset). These datasets, detailed below, together enabled us to perform cross-sectional analyses (Figs. 1 and 2) and temporal analyses (Figs. 3 and 4) for these different waste types.

MoHURD_wastewater and MoHURD_MSW datasets

We acquired a centralized database curated by MoHURD, China (Fig. 1b and Supplementary Table 1). This database contains urban population census and city-level waste production data from 2002 to 2020. From this centralized database we extracted and homogenized wastewater and MSW datasets that cover city-specific waste production time series from 2006–2020.

Worldbank_city dataset

We accessed city-level MSW data (Fig. 1c and Supplementary Table 1) from the World Bank (https://datacatalog.worldbank.org/search/dataset/0039597).

Nangini2019 dataset

We acquired the Nangini2019 global dataset on city-level GHG emissions46 (Extended Data Fig. 1c). This dataset contains only cross-sectional data, with no temporal series. We used a territory-based metric of GHG emissions that includes transport, industrial, and local power plant emissions within the city boundary (commonly referred to as scope-1 emissions71). A consumption-based emission metric will also include emissions embedded in traded goods as well as grid-supplied energy consumption produced by power plants outside the city boundary (commonly referred to as scope-2 emissions). At the city scale, data on scope-2 GHG emissions are scarce and less reliable as they are much harder to derive and often involve a range of assumptions46. We examined our finding of sublinear scaling of GHG emissions using the total emissions data (scope-1 + scope-2) included in this dataset. Despite a much smaller sample size (n = 130) and only covering cities from developed countries (United States, Canada, Australia, New Zealand, and Europe), the scaling relationship of total emissions is consistent with our main findings (Extended Data Fig. 2).

C40 dataset

We acquired a temporal dataset of GHG emissions from the cities of the C40 Cities Climate Leadership Group programme (C40; Extended Data Fig. 1d). Consistent with the cross-sectional analysis presented in Fig. 1d, we used scope-1 emissions (that is, territorial) for the temporal analysis (Fig. 3). The number of cities reporting emissions data varied over time: less than 3 cities during 1990–2004, 4–7 cities during 2005–2011, 17–42 cities during 2012–2019, and 5 cities for 2020. The scaling analysis was thus constrained to 2012–2019 due to the paucity of data in other years (Fig. 3 and Extended Data Fig. 5).

For the residual analysis of scaling laws (Fig. 2), we compiled city-level GDP and climatic variables for each city in our dataset compiled above. We acquired data on city-level GDP from two sources: (1) the Oxford economies dataset (proprietary), which spans 902 cities worldwide from 2000 to 2020, and (2) a GDP dataset from Zhou’s lab that features 292 Chinese cities spanning 1988–2018. All GDP data were converted by purchasing power parity before being used in our analyses. Combined, we were able to integrate these city-level GDP data with our existing waste data. For environmental factors, we acquired the following abiotic conditions for each city based on their geographic coordinates: (1) temperature, (2) annual temperature range, (3) annual precipitation, (4) dry season precipitation, (5) precipitation seasonality, and (6) aridity. We derived elevation from the Google Maps Elevation Application Programming Interface. All climatic variables were derived from the 30-year average (1970–2000) at 1 km resolution from WorldClim (version 2)72.

We also compiled three country-level waste datasets from three different sources (Supplementary Table 1) to generate country-level accounting of waste generation (Fig. 5): (1) Jones2021, a recent global analysis of country-level wastewater generation73, (2) Ritchie2020, a global analysis of country-level GHG emissions74, and (3) Worldbank_country, a country-level dataset curated by the World Bank that covers seven waste types: MSW, construction, agricultural, industrial, hazardous, electronic, and medical waste. The GHG dataset (Ritchie2020) contains time series data, whereas Jones2021 and Worldbank_country are cross-sectional. To facilitate comparison between datasets (Fig. 5), we randomly sampled the Ritchie2020 dataset between 2009 and 2018 such that the resulting country-level data have an ensemble of years that are similar to the time distribution of the other datasets. The combined dataset is summarized in Fig. 5 and Supplementary Table 2, and is available on figshare (https://doi.org/10.6084/m9.figshare.19361675).

Statistical analyses

We used linear regression to analyze the scaling relationships in Figs. 1 and 3. Both waste production rate (tonnes per year) and population size were log10-transformed, consistent with the logarithmic scale representation in these figures. There are many more small cities represented in Fig. 1b than large cities. As a result, both city size distribution (measured by urban population) and wastewater production are moderately positively skewed even after log10 transformation. We tested whether our results are sensitive to this inhomogeneity of city sizes (overpresentation of small cities in log space) by performing regression on binned city sizes. We first binned cities according to their size, and then calculated the mean wastewater generation and mean city size across all cities that fall within a certain bin, thus removing the effect of overpresentation of small cities. Our main conclusion that wastewater scales superlinearly with city size is not sensitive to the particular bin size. For instance, when we used a bin size of 100.05, the scaling exponent was 1.14 with a 95% CI of (1.10, 1.18), consistent with the analysis that we presented in the main results. Typically, scaling analyses are robust to oversampling at certain scales if variance consistently occurs around a well-defined mean that follows a power law. We then fed the residuals (in log units) of the scaling relationships derived in Fig. 1 into Fig. 2 as the dependent variables. Residuals (for each city) that had no matching environmental or socio-economic data were dropped from the analysis. For the planar function of wastewater production in Fig. 2a, we tested a no interaction effect between per capita GDP and other environmental variables. We analyzed the magnitude difference across nine waste types (Fig. 5) using linear regression (lm, R package ‘stats’ v.4.2.0) followed by pairwise contrast analysis (emmeans, R package ‘emmeans’ v.1.7.5). We log10-transformed the country-level waste generation rate, consistent with the logarithmic scale of the y axis in Fig. 5. To test feature differences between wasteful versus efficient cities (Extended Data Fig. 4), we classified cities by their scale-adjusted residuals for each waste type. For example, a city that is below the 25% quantile of the residual for wastewater scaling would be considered as efficient in its use of water (well below the scaling curve), while a city with a residual that is above the 75% quantile of wastewater scaling would be classified as wasteful. We also consistently log10-transformed per capita GDP to conform with normality. We tested equal variance using the F-test before applying a t-test: we applied Welch’s t-test in the case of unequal variance (Extended Data Fig. 4a,b) and Student’s t-test in the case of equal variance (Extended Data Fig. 4c,d). All analyses were conducted in R (v.4.2.0).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.