Abstract
The recent availability of data for cities has allowed scientists to exhibit scalings which present themselves in the form of a powerlaw dependence on population of various socioeconomical and structural indicators. We propose here a stochastic theory of urban growth which accounts for some of the observed scalings and we confirm these predictions on US and OECD empirical data. In particular, we show that the dependence on population size of the total number of miles driven daily, the total length of the road network, the total traffic delay, the total consumption of gasoline, the quantity of CO_{2} emitted and the relation between area and population of cities, are all governed by a single parameter which characterizes the sensitivity to congestion. Our results suggest that diseconomies associated with congestion scale superlinearly with population size, implying that –despite polycentrism– cities whose transportation infrastructure rely heavily on traffic sensitive modes are unsustainable.
Introduction
The recent availability of an unprecedented amount of data has made possible quantitative studies of urban systems^{1,2,3}, opening the way to a new Science of Cities. In particular, the discovery of allometric scaling relationships in cities has driven the quantitative research on urban systems in the past years. Indeed, there is a great amount of evidence that different socioeconomic indicators in cities, such as the GDP, the crime rate, the number of patents as well as different structural indicators such as the total length of the road network, the urbanized land area, etc., exhibit robust scaling relationships with respect to population^{4,5,6,7,8,9,10}. The existence of these simple scaling relationship hints at the existence of universal processes shared by urban systems and thus at the possibility of modeling cities.
A common trait shared by all complex systems –including cities– is the existence of a large variety of processes occuring over a wide range of time and spatial scales. The main obstacle to the understanding of these systems therefore resides in uncovering the hierarchy of processes and in singling out the few ones which govern their dynamics. Albeit difficult, the hierarchisation of processes is of prime importance. A failure to do so leads to models which are either too complex to give any real insight into the phenomenon, or too simple and abstract to have any resemblance with reality. As a matter of fact, despite numerous attempts^{5,11,12,13,14,15}, a theoretical understanding of many observed empirical regularities in cities is still missing.
In the present study, we show that the spatial structure of the mobility pattern controls the behaviour of many quantities in urban systems. Indeed, cities are not only defined by the spatial organisation of places fulfilling different functions –shops, places of residence, workplaces, etc.– but also by the way individuals move among them. Understanding where people live, where and how they travel within the city thus appears as a necessary step towards a scientific theory of cities.
Although an increasing amount of data about mobility is now available^{16}, we still lack a simple model explaining the dominant mechanisms governing the formation and evolution of mobility patterns. Many factors such as geographical constraints, facilities location and available transportation –to name a few– can impact the mobility and it thus appears as an intricate issue. Here, we tackle the problem of mobility by making simplifying –yet not simple– assumptions, trying to grasp the most important parameters which define the problem. We thus build upon a simple outofequilibrium model previously developed^{17}. This model, among other things, accounts for the polycentric transition of cities and gives a prediction for the number of centers as a function of population. We show that this framework allows us to predict the behaviour of many quantities related to mobility and the structure of cities: the scaling with population of the total time wasted in congestion, transport related CO_{2} emissions, total travelled distance, total lane miles and surface area.
Our results allow us give a quantitative insight into two important debates around urban systems. First, we are able to discuss the benefits of polycentricity and quantify some of its aspects. Then, maybe more importantly, we are able to put into perspective the sustainability of urban systems.
Results
Naive scalings
We start by presenting some naive arguments to estimate the scaling exponents for the area A, the total daily distance driven L_{tot} and the total lane miles L_{N}. Although these predictions turn out to be wrong, naive scalings are useful as a first approach to the problem as they allow us understand how the different quantities relate to each other.
Surface area
First, we estimate the dependence of the area A of a city on its population P –a long standing problem in the field^{5}. A first crude approach would be to assume that cities evolve in such a way that their population density ρ = P/A remains constant. This assumption straighforwardly implies that the area should scale linearly with population
where λ^{2} is the average surface occupied by each individual (the assumption of a constant density is then equivalent to the one of a constant average surface per capita).
Total length of roads
We now estimate the total length L_{N} of all the roads within a city. If we consider that the network formed by streets is such that all the nodes (intersections) are connected to their closest neighbour, the typical length of a road segment is given by
where N is the number of intersections. Previous studies of road networks in different regions and over extended time periods^{18,19}, have shown that the number of intersections is proportional to the population size. Therefore, the typical length of a road segment (between two intersections) varies with the population size P as
and the total length of the network L_{N} ~ Pℓ_{R} should then scale as
Using the naive scaling for the dependence of A on population size given previously in Eq. 1 we finally get
Total daily commuting distance
Individual constraint. We also estimate the total commuting distance L_{tot}. The first constraint on this distance comes from individuals's limitations and behaviour. We make here the simple assumption that individuals choose their residence and work place such that their total commuting distance is fixed (or at least smaller than a certain value) and equal on average to ℓ_{C}. In that case, we simply have
(by constant, we mean independent from the population size of the city).
The city structure constraint. An additional contraint on L_{tot} is given by the structure of the city^{8,25}. Indeed, the individual commuting distance is also related to the total suface area of the city and the location of activity centers.
If we first assume that the city is monocentric, individuals are all commuting to the same center and the typical commuting distance is controlled by the typical size of the city of order
On the other hand, if we assume that the city is completely decentralized, the typical commuting distance is of order the nearest neighbour distance and we obtain
Comparison of naive scalings with empirical results
The comparison of the naive exponents with the exponents measured on US data is shown in Table 1 (see the Methods section for details about the data). There are important discrepancies, which we discuss in the following.
First, we note that the naive scaling for the surface area A predicts a value of the exponent that is quantitatively –and worse, qualitatively– different from that observed. Indeed, we find that for real cities
with a = 0.85. While the naive argument implies a linear dependence of the surface area A with population, we find a sublinear scaling in the data, which is a qualitatively different behavior (Table 1). This disagreement on this basic quantity will naturally impact the scaling of the other quantities.
The data also show that L_{tot}/P can be considered reasonably independent from P (with a value of approximately 23 miles for the US, see Fig. 1), in agreement with the individual constraint assumption (Eq. 6). This finding is also in agreement with the results drawn from census data in Germany by^{20}. Although this assumption of a constant distance is simple and verified on the US data, we think that it deserves to be systematically tested on other datasets for other countries and cities.
Finally, the scaling of given in the extreme cases of a monocentric city structure and a totally decentralized city structure disagree with the value measured on data (see Table 1). This suggests that most cities have a structure that is neither completely centralized, nor totally decentralized. In particular, this result cast some doubts about the study^{15} which assumes implicitely that cities are always monocentric. Any situation between the two previous extreme cases would give a scaling of the form where b ∈ [1/2, 1]. One can easily see that this expression is consistent with that of A/λ^{2} and L_{tot}/P if
which is indeed what we observe empirically (up to error bars). This preliminary analysis thus leads us to the conclusion that, in order to compute the various exponents, we need to better describe the structure of commuting patterns. In other words, we need to find a description of cities that goes beyond the naive monocentric or totally decentralized views and which accounts for the observed sublinear scaling of the surface area A.
Beyond naive scaling: modeling mobility patterns
We begin with the assumption that mobility patterns are mostly driven by the daily commuting and we would like to understand how an individual, given his household location, will choose his job location. We assume that this choice will be determined by two dominant factors: the expected wage at a given job and the commuting time to this job's location. Indeed, places with high average salaries are attractive, but having to spend a sensible amount of time commuting every day is less desirable. We assume there are N_{c} potential activity centers in the city, each characterized by an average wage w(j) at location j. This wage is endogenously determined and depends a priori on many factors such as agglomeration effects^{21}, the type of industry, etc. Although it is in principle possible to write down equations to determine the wage (as attempted in^{11} for instance), not only is it impossible to solve them, but also not necessarily useful. A similar situation arises in physics when one studies the behaviour of atoms made of a large number of electrons. Physicists found out^{22} that, in fact, a statistical description of these systems relying on random matrices could lead to predictions which agree with experimental results. We would like to import this idea of replacing a complex quantity such as wages –which depends on so many factors and interactions– by a random one in spatial economics. So, we treat the wage as if it was exogenous and random^{17}, that is we write w(j) = s η_{j} where s represents the typical income in this city and η is a random number chosen uniformly in [0, 1]. Furthermore, we assume that the commuting time does not only depend on the distance between the two places, but also on the traffic T_{ij} between those two locations. An individual living at i will thus commute to the center j which corresponds to the best tradeoff between income and commuting time, thus to the center j such that the quantity
is maximum^{17}. The quantity d_{ij} is the euclidean distance between i and j (both supposed to be scattered randomly across the city), T(j) the total incoming traffic at j, c the capacity of the underlying transportation network and μ is an exponent describing the sensitivity of the network to congestion. The quantity ℓ is the maximum distance that people can financially travel daily, defined as the ratio between the typical individual income and the transportation costs per unit of distance.
This simple model displays a surprisingly rich behaviour^{17}. In particular, it accounts for the monocentric to polycentric transition observed in most cities. It has been a wellknown fact for quite some time that as cities grow, they evolve from a monocentric organisation where all the activities are concentrated in the same geographical area – usually the central business district– to a more distributed, polycentric organisation^{11,23}. Several theories in spatial economics exist^{1}, but are not satisfactory for many reasons. Among other things, they do not take congestion into account and have no predictive, testable content^{24}. Within this framework, congestion is actually responsible for the transition and the number of activity centers in a city of population size P is on average given by
with
Using data of employment per Zip Code Area in the US^{17}, we showed that
where we measure α = 0.64 ± 0.12 (95% confidence interval (CI)). In other words, the number of centers scales sublinearly with population size.
Computing the exponents
Area
At this stage, the number of centers is a function of population and the area
and we need an additional equation in order to get a closed system. Here we focus on the area and its evolution with the population size, which reflects the growth process of the city. In the following, we will investigate two different approaches. It is worth noting that both approaches give results in qualitative agreement, showing that some stylized facts –such as super or sublinearity– are very robust.
Fitting procedure. In the absence of knowledge of the processes responsible for urban sprawl, we can assume that the area behaves as
where a is the exponent to be determined, through fits on data. The empirical value for the exponent for the US data is . Once this exponent is given we can then compute the various exponent for the quantities of interest (see the following and table 2). We get for the number of centers k
which is sublinear as long as a < 2, in agreement with the empirical results for US cities. As we will see, this approach yields the same qualitative behaviours as those predicted with the method of the next section. In other words, even if the main mechanism behind urban sprawl is not congestion, the conclusions of this paper are not affected as long as the area scales sublinearly with population.
Coherent growth. Let us now assume that the scaling of A with population is determined by the number of activity centers and the constant commuting length of individuals. This means that the growth of the area is controlled by the appearance of new activity centers. if we assume that a city is organized around k activity centers and that the attraction basin of each of these centers are spatially separated^{17}, we then have A ~ kA_{1} where A_{1} is the area of each subcenter's attraction basin. This area A_{1} is related to the average individual commuting distance by and we obtain
This leads to expression for the number of centers
which is always smaller than 1, also in agreement with the empirical results for US cities. We can now also compute the scaling of the surface area
We further assume that L_{tot}/P is a fraction of the longest possible journey ℓ individuals can afford, that is to say
It is important to note that if ℓ_{c} is independent from ℓ, the quantitative predictions of our model would still hold. The final expression for the area is then here given by
where . The exponent δ is smaller than 1/2 whatever μ ≥ 0, which implies that the density of cities increases sublinearly with population. In other words, the density of cities increases with population. We verify this prediction in Table 2, with data about land area of urbanized areas in the US (Figure 2). We find 2δ_{emp} = 0.85 ± 0.01 (95% CI) which is not too far from the theoretical value 2δ_{th} = 0.64 ± 0.12 (95% CI), equal to α in this case.
Because the area of an urban system results from centuries of evolution, we do not a priori expect our model–where individual vehicles are assumed to be the only vector of mobility– to give a prediction valid for all countries and all times. Nevertheless, these results give us reasons to believe that the spatial structure of the journeytowork commuting should still be the dominant factor in the dependence of land area on population.
Total commuting distance
Using Eq. 6 and Eq. 22 we are now able to compute
We plot for urbanized areas in the US on Figure 2 and one can check in Table 2 that the exponent predicted from the previously measured value of α agrees well with the exponent measured on the data. In the fitting case, the exponent would simply be given by 1 − a/2.
Total length of roads
If we use the previously derived expression for the area A, we find
The quantity δ is less than 1/2, which implies that L_{N} scales sublinearly with the city's population size. In other words, larger cities need less roads per capita than smaller ones: we recover the fact that agglomeration of people in urban centers involves economies of scale for infrastructures. Within the fitting assumption (Eq.16), we would obtain (1 + a)/2.
Total delay due to congestion
Unfortunately, agglomeration in cities does not only generate economies. Congestion, for instance, is a major diseconomy associated with the concentration of people in a given area. A simple way to quantify the impairement caused by traffic congestion is through the total delay it generates. If we make the first order approximation that the average freeflow speed v is the same for everyone, the total delay due to congestion is given –according to our model–by
If we assume that all the centers share the same number of commuters –a reasonable assumption within our model^{17}–we obtain
which, using the expressions for L_{tot} and A given in Eq. 23 and Eq. 22 respectively, gives
The total commuting time corresponding to the same distance but without congestion scales as τ_{0} ~ L_{tot} and thus less rapidly than the total delay which scales superlinearly with population (even when polycentricity is taken into account). This means that, for the largest cities, delays due to congestion actually dominate the time spent in traffic and that economical losses per capita due to the time lost in congestion –and the corresponding strain on people's life– increase with the size of the city.
In the fitting assumption Eq. 16 and using the same arguments for the calculation of δτ, we easily obtain for the exponent the value .
Transport related CO_{2} emission. Gasoline consumption
Another diseconomy associated with congestion is the quantity of CO_{2} emitted by cars and the gasoline consumed by motor vehicles. This amount not only depends on the distance that has been driven, but also on the traffic during the journey. It indeed turns out that for the same length driven, a car burns more oil when the traffic is heavy than when the road is clear. Within our model, the presence of traffic is seen in the time spent to cover a given distance and we write that the quantity of CO_{2} emitted by a vehicle is proportional to the total time spent in traffic, leading to
where q is the average quantity of CO_{2} produced per unit time. In the polycentric case with k = k(P) subcenters, the typical trip length is given by and we obtain
The first term in brackets is a constant and the quantity of CO_{2} is thus dominated by congestion effects at large populations
and the total daily transportrelated CO_{2} emission per capita thus scales as
The quantity of CO_{2} emitted per capita in cities thus increases with the size of the city, a consequence of congestion. This prediction agrees with the exponent we measure (Figure 3) on data gathered for US and OECD cities (Data about the area and population of urbanised areas can be found on the Census Bureau website^{38}, data about congestions in urban areas can be found in the Urban Mobility Report^{39} and data about the total lane miles and the daily total miles driven in urbanized areas can be found on on the Federal Highway administration website^{40}). We are aware that the scaling of CO_{2} with population size is controversial, with results varying from one study to another. Although a systematic metaanalysis of these results is beyond the scope of this paper, we note that the authors of^{27} are concerned with the total emissions of CO_{2}, while this paper is only concerned with emissions due to transportations. Moreover, our prediction agrees well with the exponent of 1.33 measured by the authors of^{26} on the same dataset, but with a different definition of cities. Finally, our prediction also agrees with measurements made in^{28} for developing countries.
Another important related quantity is the the consumption of gasoline which in principle is proportional to the emission of CO_{2} and the time spent driving. The total daily gasoline consumption is then given by
where q is the average quantity of gasoline needed per unit time. From this expression, we see that the total daily gasoline consumption per capita scales as
and is therefore not a simple function of the city density, in contrast with what was suggested by the seminal paper of Newman and Kenworthy^{4}. At this stage however, more data about gasoline consumption is needed to test this prediction and draw definitive conclusions.
Discussion
Monocentric versus polycentric
Although polycentricity emerges naturally from our model as a result of congestion, many circumstances can prevent or foster the appearance of new activity centers in a city. There are many debates as to whether policies should favour polycentric or monocentric developpement of cities. Most of them are based on ideologies and opinions about how cities should be, very few are based on a quantitative understanding of the city as a complex system. Although this only represents a small part of the debate, our model allows to quantify the effect of polycentricity on the total delay due to congestion.
We can indeed compute the total delay due to congestion in the case of a monocentric configuration. In this situation, all the population commutes to a single destination 1 and we have
It follows, using the expression given above for L_{tot}
From the fact that , we indeed find that the total delay due to congestion is worse for monocentric cities than it is for polycentric cities with the same population, which agrees with the usual intuition. More precisely the ratio of delays is given by
where the exponent is of order β ≈ 0.57. Therefore, even though diseconomies associated with polycentric cities scale superlinearly with population, it would be even worse if we did not let cities evolve from the monocentric case. The same reasoning applies to the consumption of gasoline and the CO_{2} emissions. This suggests that, everything else being equal, polycentricity should be favoured for quality of life and environmental reasons.
Megapolis versus urban villages
Also, given the superlinear behaviour of the diseconomies associated with living in cities, it is clear that we would be better off living in several smaller cities rather than a single huge city. However, due to the economies of scale realised in large cities, we can wonder whether this is also economically reasonable. If we assume that the total cost of a city of population P is the sum of its infrastructure cost and the economical losses due to congestion we have
where is the average cost of a kilometer of roads, the average hourly wage and Δt the planning horizon in years (this expression is not exhaustive, as the costs dues to CO_{2} emissions and gasoline consumptions are not included). The infrastructure needs maintenance and its cost depends on the planning horizon as well and can be written where is the construction cost in $/km and the maintenance cost in $/km/year.
We assume that the population P is distributed among n cities of the same size P/n (see Figure 4). The total lane miles for the n cities reads where is the total lane for one city. The total congestion delay for n cities is δτ_{n} = nδτ(P/n) and we thus obtain the total cost C_{T}(P, n) for n cities
The number of cities n_{min} which minimises the total cost is obtained when , leading to (for )
(the actual number of cities is of course an integer and can be taken as the nearest integer from n_{min} for instance). It is then economically advantageous to divide the population in several cities if n_{min} > 2. To illustrate this point, we compute the number of cities which would minimise the cost for a world population P ≈ 10^{9}. The World Bank estimates the maintenance cost of roads to be of the order of and the average hourly wage to be of the order of , the value of δ is taken from the measures on US data, δ ≈ 0.27 and τ/ℓ ≈ 10 km/h. We then obtain
which gives an average city size of P/n ≈ 5, 500, 000. This result is to put in perspective with the fact that the world hosts 40 or so cities with over 5, 500, 000 inhabitants and that this number is still increasing.
The most economical population distribution
The previous results assume that we split a large city into many cities of the same size. The cities are however organized in various sizes distributed according to something that can be approximated by a Pareto distribution, as known since Zipf's work^{29}. It is still unclear why we observe such a convergence^{30,31}. We propose here a new perspective to this debate by asking: Assuming cities are distributed according to a Pareto distribution, what value of the exponent minimises the overall cost? Indeed from above the total cost for a population size x is given at large times by
We assume that the population is distributed according to
with γ > 1 and a cutoff population (which is at most equal to the world's population). The average cost is then given by
leading to
The only consistent solution is obtained for γ < δ + 2. The dominant term for is given by
The optimal power law distribution minimizes the average cost and is such that . We obtain the following equation
and in the limit we obtain the optimal value for γ
Numerically, δ ≈ 0.32 and Λ ≈ 10^{9}, leading to γ* ≈ 2.27. It is interesting to note that this value would lead to a rankplot exponent (≈0.78) not far from those measured on different countries around the world^{32}. Although we do not pretend that the above reasoning provides a definitive answer to the Zipf puzzle, it nevertheless suggests that the broad diversity of population might derive from economical considerations and that there may be a connection between the Zipf law exponent and optimality considerations.
Outlook
The superlinear increase of congestion delay with population and thereby of gasoline consumption and of CO_{2} emissions, has terrible consequences on the economy, the environment, health and wellbeing. The outlook is nothing short of grim in our everurbanising world. As the proportion of human beings living in cities dramatically increases –the UN expects the world population to be 67% urban in 2050^{33}– wages are likely to increase^{7} but not enough to compensate for the negative effects of congestion. As a result, if the individual car stays the dominant transportation mode, cities will put more strain on people's life, while acting as catalysts for the production of CO_{2} greenhouse gas, responsible for an overall increase of the planet's temperature^{34}. It is currently believed that advantages associated with living in a large city outweigh the costs. Our results reveal however the existence of very rapidly growing problems such as congestion and CO_{2} emissions, which inevitably begs the question of the sustainability of large cities. It might be time to cut down considerably the use of individual vehicles, or to consider the possibility of living in smaller or medium sized cities: the infrastructure costs (L_{N}) may be larger, but the impact on the environment (CO_{2} emissions) and on the wellbeing of people (delays in congestion) would be beneficial (see Figure 3).
The most striking fact about the above results is that despite the apparence of complexity that is conveyed by cities, most of their structure can be explained by the very simple and universal desire for the best achievable balance between income and commuting costs. Our model unifies mobility patterns, spatial structure of cities and allometric scalings in a framework that can be built upon. More work is needed in order to integrate information about firm locations, the influence of public transportation on mobility patterns^{35}, the effect of the integration of cities into urban systems^{36}, to understand the fluctuations around the average trends and to test the validity of the model on different sets of data. We believe however that the results presented here represent a crucial step towards a scientific understanding of cities.
Methods
Data
As recently stressed in^{37}, when trying to identify patterns accross cities, one must be careful and consistent in the definition of city boundaries. These authors indeed found out that the scaling exponents measures for several quantities are usually sensitive to the definition chosen for the city. In order to make our results reproducible, we detail in the following the data source and the corresponding city definition.
Total distance driven and lane miles
The daily commuting vehiclemiles as well as the total lane miles data were obtained for the year 2011 from the Federal Highway Administration for 441 Urban Areas (as defined by the Census Bureau) in the US.
Area
The surface area data were obtained for the year 2010 from the Census Bureau for 3540 Urban Areas (as defined by the Census Bureau) in the US. It is interesting to note that the dependence of the surface area of Metropolitan Statistical Areas with population is a lot less clearcut, implying that, with respect to surface area, the definition of UAs delineates more coherent systems than the definition of MSAs.
Values of ℓ
In order to compute a value for , we use for s the average wage at the county level, provided by the Bureau of Labor Statistics. For t, the transportation cost per unit distance, we use the average gas price per state as given by the U.S. Energy Information Administration and assume that all vehicles burn the same quantity of gas per unit distance on average. Interestingly, while we have assumed a constant ℓ throughout this paper, we have noticed that its effect on the different scalings was not negligible (Compare the results for L_{N} and A between Table 1 and Table 2 for instance), implying that ℓ has a small, yet nonzero dependence on the population. This probably comes from the dependence of the average wage on population^{7}. We leave the investigation of this dependence for further studies.
Total delay and CO_{2} emissions
The excess CO_{2} and the total delay due to traffic congestion were obtained for the year 2012 from the Urban Mobility Report for 97 Urban Areas in the US. Also, the quantity of CO_{2} emissions due to transportation was obtained from the OECD for 268 metropolitan areas accross 28 countries for the year 2008. It is worth noting here that the US definition of Urban Area and the OECD definition of Metropolitan Area are qualitatively different, added to the fact that OECD data cover many different countries. Yet, the measured values of the exponent are compatible with each other.
As far as the United States are concerned, we present results for Urban Areas only. Indeed, when data were available for both MSA and Urban Area, we found out that the MSA data did not exhibit as clearcut regularities as the Urban Area data did. We believe that this effect is due to the lack of a unique, quantitative definition of a city which makes In this work, we assumed that Urban Areas designate areas which are coherent with respect to the quantities we are measuring and leave the crucial issue of city definition for further studies.
References
Fujita, M., Krugman, P. R. & Venables, A. J. The spatial economy: cities, regions and international trade (The MIT press, Cambridge, USA, 1999).
Batty, M. Cities and complexity: understanding cities with cellular automata, agentbased models and fractals (The MIT press, Cambridge, USA 2007).
Marshall, S. Streets and patterns (Routledge, Abingdon, 2004).
Newman, P. W. & Kenworthy, J. R. Gasoline consumption and cities: a comparison of US cities with a global survey. Journal of the American Planning Association 55, 2437 (1989).
Makse, H. A., Havlin, S. & Stanley, H. E. Modelling urban growth. Nature 377, 608–612 (1995).
Pumain, D., Paulus, F., VacchianiMarcuzzo, C. & Lobo, J. An evolutionary theory for interpreting urban scaling laws. Cybergeo: European Journal of Geography (2006).
Bettencourt, L. M. A., Lobo, J., Helbing, D., Kühnert, C. & West, G. B. Growth, innovation, scaling and the pace of life in cities. Proc. Natl. Acad. Sci. U.S.A. 104, 7301–7306 (2007).
Samaniego, H. & Moses, M. E. Cities as organisms: Allometric scaling of urban road networks. Journal of Transport and Land Use 1, 21–39 (2008).
Rozenfeld, H. D. et al. Laws of population growth. Proc. Natl Acad. Sci. USA 105, 1870218707 (2008).
Pan, W., Ghoshal, G., Krumme, C., Cebrian, M. & Pentland, A. Urban characteristics attributable to densitydriven tie formation. Nature communications 4, 1961 (2013).
Fujita, M. & Ogawa, H. Multiple equilibria and structural transition of nonmonocentric urban configurations. Regional science and urban economics 12, 161–196 (1982).
Batty, M. The Size, Scale and Shape of Cities. Science 319, 769771 (2008).
Frasco, G. F., Sun, J., Rozenfeld, H. D. & benAvraham, D. Spatially distributed social complex networks. ArXiv:1306.0257 (2013).
Bettencourt, L. & West, G. A unified theory of urban living. Nature 467, 912913 (2010).
Bettencourt, L. M. A. The Origins of Scaling in Cities. Science 340, 14381441 (2013).
Gonzalez, M., Hidalgo, C. A. & Barabasi, A.L. Understanding individual human mobility patterns. Nature 453, 779–782 (2008).
Louf, R. & Barthelemy, M. Modeling the polycentric transition of cities. Physical Review Letters 111, 198702 (2013).
Strano, E., Nicosia, V., Latora, V., Porta, S. & Barthelemy, M. Elementary processes governing the evolution of road networks. Sci. Rep. 2, 296 (2012).
Barthelemy, M., Bordin, P., Berestycki, H. & Gribaudi, M. Selforganization versus topdown planning in the evolution of a city. Sci. Rep. 3, 2153 (2013).
Wilkerson, G., Khalili, R. & Schmid, S. Urban Mobility Scaling: Lessons from Little Data. arXiv:1401.0207 (2013).
Glaeser, E. L. & David, C. M. Cities and skills. Journal of Labor Economics 19, 316342 (2001).
Dyson, F. J. Journal of Mathematical Physics 3, 140–156 (1962).
McMillen, D. P. & Smith, S. C. The number of subcenters in large urban areas. Journal of Urban Economics 53, 321338 (2003).
Bouchaud, J.P. Economics needs a scientific revolution. Nature 455, 1181–1181 (2008).
Barthelemy, M. Spatial networks. Physics Reports 499, 1101 (2011).
Oliveira, E. A., Andrade, J. S., Jr. & Makse, H. A. Large cities are less green. arXiv:1401.7720 (2014).
Fragkias, M., Lobo, J., Strumsky, D. & Seto, K. C. Does Size Matter? Scaling of CO2 Emissions and U.S. Urban Areas. PLoS ONE 8, e64727 (2013).
Rybski, D., Sterzel, T., Reusser, D. E., Winz, A.L., Fichtner, C. & Kropp, J. P. Cities as nuclei of sustainability. arXiv:1304.4406 (2013).
Zipf, G. K. Human behavior and the principle of least effort (AddisonWesley Press, Cambridge, 1949).
Batty, M. Rank clocks. Nature 444, 592596 (2006).
Cristelli, M., Batty, M. & Pietronero, L. There is More than a Power Law in Zipf. Sci. Rep. 2, 812 (2012).
Soo, K. T. Zipfs Law for cities: a crosscountry investigation. Regional science and urban Economics 35, 239263 (2005).
United Nations. The world urban propects: The 2011 revision. (2011).
Oreskes, N. The scientific consensus on climate change. Science 306, 1686 (2004).
Roth, C., Kang, S. M., Batty, M. & Barthelemy, M. Structure of Urban Movements: Polycentric Activity and Entangled Hierarchical Flows. PLoS ONE 6, e15923 (2011).
Rozenblat, C. & Pumain, D. “Firm linkages, innovation and the evolution of urban systems”. in Cities in globalization: practices, policies and theories, [130–156] (Routledge, London, 2007).
Arcaute, E. et al. City boundaries and the universality of scaling laws. arXiv:1301.1674 (2013).
Census bureau: https://www.census.gov/ (date of access:01/04/2014).
Urban mobility report: http://mobility.tamu.edu/ums/ (date of access: 01/04/2014).
Federal Highway administration: https://www.fhwa.dot.gov/ (date of access: 01/04/2014).
http://measuringurban.oecd.org (Date of access: 01/04/2014).
Acknowledgements
We thank Giulia Carra, Riccardo Gallotti and Thomas Louail for stimulating discussions. MB acknowledges funding from the EU commission through project EUNOIA (FP7DG.Connect318367).
Author information
Authors and Affiliations
Contributions
R.L. and M.B. designed, performed research and wrote the paper.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Rights and permissions
This work is licensed under a Creative Commons AttributionNonCommercialNoDerivs 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/byncnd/4.0/
About this article
Cite this article
Louf, R., Barthelemy, M. How congestion shapes cities: from mobility patterns to scaling. Sci Rep 4, 5561 (2014). https://doi.org/10.1038/srep05561
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep05561
Further reading

Understanding the mesoscopic scaling patterns within cities
Scientific Reports (2020)

Deep multitask learning for individuals origin–destination matrices estimation from census data
Data Mining and Knowledge Discovery (2020)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.