From the betweenness centrality in street networks to structural invariants in random planar graphs

Kirkley, Alec; Barbosa, Hugo; Barthelemy, Marc; Ghoshal, Gourab

doi:10.1038/s41467-018-04978-z

Download PDF

Article
Open access
Published: 27 June 2018

From the betweenness centrality in street networks to structural invariants in random planar graphs

Nature Communications volume 9, Article number: 2501 (2018) Cite this article

11k Accesses
111 Citations
45 Altmetric
Metrics details

Subjects

Abstract

The betweenness centrality, a path-based global measure of flow, is a static predictor of congestion and load on networks. Here we demonstrate that its statistical distribution is invariant for planar networks, that are used to model many infrastructural and biological systems. Empirical analysis of street networks from 97 cities worldwide, along with simulations of random planar graph models, indicates the observed invariance to be a consequence of a bimodal regime consisting of an underlying tree structure for high betweenness nodes, and a low betweenness regime corresponding to loops providing local path alternatives. Furthermore, the high betweenness nodes display a non-trivial spatial clustering with increasing spatial correlation as a function of the edge-density. Our results suggest that the spatial distribution of betweenness is a more accurate discriminator than its statistics for comparing static congestion patterns and its evolution across cities as demonstrated by analyzing 200 years of street data for Paris.

Linking the network centrality measures closeness and degree

Article Open access 02 July 2022

Three-dimensional shape and connectivity of physical networks

Article Open access 23 July 2024

Network geometry

Article 29 January 2021

Introduction

Recent years have witnessed unprecedented progress in our understanding of spatial networks that are pervasive in biological, technological and infrastructural systems^1,2. These networks are quite relevant in the context of urban systems^3,4,5,6,7, where analysis of their structural properties has uncovered unique characteristics of individual cities, as well as surprising statistical commonalities across different urban contexts^8,9,10. Patterns of streets and roads are particularly important, allowing residents to navigate the different functional components of a city. Different street structures result in varying levels of efficiency, accessibility, and usage of transportation infrastructure;^{11,12,13,14,15,16,17} consequently structural characteristics of roads have been of great interest in the literature^{18,19,20,21,22,23,24}.

Street networks fall into the category of planar graphs²⁵ and their edges constitute a physical connection, as opposed to relational connections found in many complex networks²⁶. The geographical embedding leads to strong effects on network topology with limitations on the number of long-range connections and the number of edges incident on a single node (its degree k)^27,28. Degree-based network measures, while well-studied on such systems, lead to rather uninteresting results; the degree distribution is strongly peaked, and related metrics such as clustering and assortativity are high². Instead, more information can be gleaned from non-local higher-level metrics such as those based on network centralities, which while strongly correlated with degree in non-spatial networks²⁹, display non-trivial behavior in planar networks³⁰. Among the more studied and illuminating of such metrics is the betweenness centrality (BC), a path-based measure of the importance of a node in terms of the amount of flow passing through it³¹. More precisely, the BC for node i is defined as

$$g_B(i) = \frac{1}{{\cal N}}\mathop {\sum}\limits_{s \ne t \in V} \frac{{\sigma _{st}(i)}}{{\sigma _{st}}},$$

(1)

where σ_st is the number of shortest paths going from nodes s to t and σ_st(i) is the number of these paths that go through i³¹. Here ${\cal N}$ is a normalization constant, typically of order N² where N is the number of nodes, although for reasons that will be apparent later in the manuscript, we will use here the unnormalized version ${\cal N} = 1$.

In principle one can define a variety of different shortest paths: the number of hops in the purely topological case, the shortest distance between two points if the edges are weighted according to Euclidean distances, taking into account route preferences if edges are weighted according to a cost function such as capacity or speed-limits, or indeed some combination of the above. Incorporating this structural information into the edge-weights, the BC can be used as a proxy for predicted traffic flow^32,33,34. In such a setting the paths can be considered as the optimal routes between locations, and thus nodes with high BC should expect to receive more traffic.

A number of studies have been conducted on the BC in planar graphs^35,36,37 finding among other things, a complicated spatial behavior of the high BC nodes^19,38, and in the case of street networks, connections to the organization and evolution of cities^39,40,41. For non-planar graphs the average BC scales with the degree k in a power law fashion thus $g_B(k) = \mathop {\sum}\nolimits_{i|k_i = k} \frac{{g_B(i)}}{{N(k)}} \propto k^\eta$, where N(k) is the number of nodes of degree k, and η is an exponent depending on the graph⁴². In planar graphs, however, the BC behaves in a more complex manner, as now both topological and spatial effects are at play.

Given their practical relevance as well as the relative abundance of data, street networks have proven to be an excellent platform on which to study the properties of planar graphs including the BC. Existing analyses, however, suffer from limitations of scale (unlike other structural properties, see ref. ⁴³ for a recent global description), and most comparative studies of the BC across cities are typically restricted to a few square-kilometers, while studies on more extensive street-maps have been examined for at most tens of cities limited to those in Europe or North America^{12,38,39,40,41}. Furthermore, there have been limited studies of the BC distribution in its entirety, with the majority of analyses instead focusing on the average BC (proportional to the average shortest path⁴⁴) or on its maximum value^45,46.

To fill this gap in our understanding of this important class of networks, we conduct here a large-scale empirical study of the BC across 97 of the world’s largest cities as measured by population (details on dataset in Methods). The cities are sampled from all six inhabited continents and the analysis is conducted at scales on the order of three thousand square-kilometers. We demonstrate that the BC distribution is an invariant quantity for most planar graphs and that it is robust to major alterations in the network, including significant changes to its topology and edge weight structure, with the relevant factors shaping the distribution being the number of nodes and edges as well as the constraint of planarity. Through simulations of random planar graph models and analytical calculations on Cayley trees, we demonstrate this to be a consequence of a bimodal regime consisting of an underlying tree structure for high BC nodes, and a low BC regime corresponding to loops providing local path alternatives. The high BC nodes display increasing spatial correlation as a function of the number of edges, leading them to cluster around the barycenter at large edge densities. The observed invariance and spatial dependence has practical implications for infrastructural and biological networks. For the case of street networks, as long as planarity is conserved, bottlenecks continue to persist, and the effect of planned interventions to alleviate structural congestion will be limited primarily to load redistribution, a feature confirmed by analyzing 200 years of data for central Paris.

Results

Betweenness at different scales and rescaling

We group cities into three categories according to the number of nodes, from small (N ∼ 10³), medium (N ∼ 10⁴) to large road networks (N ∼ 10⁵) as shown in Fig. 1 (further details in Supplementary Note 1 and Supplementary Table 1). In Supplementary Fig. 1a, we show the betweenness probability distribution for a selection of the three categories of cities at the resolution of two and a half square-kilometers. One sees significant variability between cities, within and across categories, with mostly exponential tails (Supplementary Fig. 2) as also seen for similar samples in^39,40. This is somewhat expected given the small sample size, and that topology of cities are different due to geographic and spatial constraints^47,48. Indeed, variations may show up within the same city where multiple samples of a similar resolution within a city display fluctuations (Supplementary Fig. 1b). In all cases, we observe a range of behavior in the tails of the BC ranging from peaked to broad distributions, reflecting local variation in the street network structure and fluctuations in the data. One sees a dramatic difference at the scale of three thousand square-kilometers (Supplementary Figs. 1c, d) where we observe that the BC distribution for cities within each category is virtually identical, and bimodal, with two regimes separated by a bump roughly at g_B∼N. For larger values of the BC we observe a slow decay signaling a broad distribution.

These trends are apparent across all 97 cities with the two regimes being separated by bumps spread across an interval of 10³ ≤ g_B ≤ 10⁵ corresponding to the range of N in our data (Fig. 2a). Indeed rescaling the betweenness of each node by the number of vertices in the network $g_B \to \tilde g_B = g_B/N$, we see the distributions collapse on a single curve with a unique bump separating two clear regimes (Fig. 2b). Fitting the distribution of $\tilde g_B$ with the function

$$p(\tilde g_B) \sim \tilde g_B^{ - \alpha }{e}^{ - \tilde g_B/\beta },$$

(2)

results in a tightly bound range for α≈1 and a broad size-dependent distribution for β (Supplementary Fig. 4). Rescaling the tail with respect to β results in a collapse of the curves for all cities (Fig. 2c). (See Supplementary Note 2, Supplementary Fig. 3 and Supplementary Table 2 for details of the rescaling and fitting procedures).

Determinants of the betweenness centrality distribution

Given that cities are ostensibly quite different in terms of geography or space, as well as their levels of infrastructure and socioeconomic development, the observed invariance is quite striking. To investigate the factors behind this behavior, we next systematically probe the effect of the main features that may be influencing the betweenness distribution. Examining Eq. (1), apart from its dependence on the number of nodes N and the number of edges e, the other primary factors are the local connectivity patterns of a street intersection as governed by its degree distribution; distribution of edge weights that can correspond either to euclidean distances or some scalar quantity such as speed-limits; and planarity, the effect of space. We select the BC distribution of a number of cities as baseline and generate multiple variants of random graphs to compare with the original. In Fig. 2d we show Phoenix (blue circles) as a representative example of a city on which we perform this analysis.

To investigate the effect of varying the local neighborhood of a given street intersection, we fix the spatial position of nodes on the 2D plane and generate a Delaunay Triangulation (DT)⁴⁹ of the street network. The DT corresponds to the maximum number of edges that can be laid down between a fixed number of nodes distributed within a fixed space, without any edge-crossings. Edges are then randomly eliminated until their number corresponds exactly to our baseline example of Phoenix. A hundred realizations of this procedure was conducted, having the effect of rewiring the local neighborhood of intersections—by changing a node’s degree and its neighbors—while still maintaining planarity. In Fig. 2d we plot the average of these realizations (orange triangles), showing differences with the original street network in the lower range of the distribution, yet showing minimal change in both the location of the peak as well as the tail of the distribution. Similar random graphs were generated using a number of other cities showing the same behavior (Supplementary Fig. 5).

Next we investigate the effect of Euclidean distances on the BC distribution. We fix the number of nodes N and instead of fixing their positions according to the empirical pattern, we now distribute them uniformly in the 2D plane with a scale determined by the spatial extent of the considered city. Then we generate the DT of the street network and randomly remove edges until we match the number of roads in the data. A hundred different realizations of this procedure has the effect of either dispersing high density areas or compressing very long road segments, and generating a distribution of distances that are markedly different from the original (Supplementary Fig. 8). Figure 2d (red triangles) suggests that while this has a marginally stronger effect than edge rewiring, the tails of the original and perturbed distributions are quite similar within the bounds of the error-bars. Furthermore, the positions of the peaks remain unchanged. Varying the area (and therefore density of nodes) and conducting the same procedure over multiple cities yielded identical results (Supplementary Fig. 9), suggesting that the distribution of (spatial) edge-weights has negligible effect on the BC distribution.

While the procedure outlined above does not preserve the local topology it is possible to change the edge-weights while preserving the degree sequence of nodes. This can be done by taking the original street network and randomly sampling from its associated distribution of distances, assigning each edge a number from this distribution—the edge-weights now do not correspond to physical distances but can be interpreted instead as a cost function such as speed-limits, travel demand, or road capacity. In Fig. 2d we show the average of this process over a hundred realizations (green triangles) where each realization corresponds to a reshuffling of the edge weights over the network. While there are some changes in the distribution with a minor shift in the position of the peaks and a moderately heavier tail, no drastic modifications are apparent. Strikingly, sampling from a whole statistical family of distributions for the edge weights produced identical results (Supplementary Fig. 10), indicating little-to-no dependence on the specific nature of the weights.

Finally, we probe the effects of relaxing the constraint of planarity. Fixing N, the degree-sequence, and assigning weights sampled from the distance distribution of Phoenix, we use the configuration model⁵⁰ to generate one hundred non-spatial versions of the street network resulting in the markedly different curve in Fig. 2d (purple triangles). The shape of the curve is in line with the known dependence of g_B on the degree for non-spatial networks, with a distribution of degrees peaked around k = 3 (Supplementary Figs. 11 and 12). The markedly different shape of the curve as compared to the actual street network shows that planarity appears to be the dominant factor specifying the BC distribution, with topological effects and edge-weights playing only a negligible role. While this provides an explanation for the observed similarity across cities, it does not by itself provide an explanation for the form of the distribution, its scaling with N, nor its bimodality, and we will provide in the following some theoretical arguments.

Modeling the betweenness centrality distribution

A clue for the bimodal behavior comes from the peak at N, a feature reminiscent of nodes adjacent to the leaves of a minimum spanning tree (MST). The MST consists of the subset of edges connecting all nodes with the minimum sum of edge-weights⁵¹ and whose betweenness value is of O(N). An examination of the BC distribution of trees therefore, may provide an explanation for the observed scaling behavior. While an exact analytical expression for the BC distribution of generalized MST’s is elusive, progress can be made by approximating it as a k-ary tree (where each node has a branching ratio bounded by k). Given that the degree distribution of streets is tightly peaked (Supplementary Fig. 11), we assume a fixed branching ratio, in which case the k-ary tree reduces to a Cayley tree where all non-leaf nodes have degree k. Assuming all leaf nodes are at the same depth L and adopting the convention l = L for the leaf level and l = 0 for the root, a simple calculation reveals that for a node v at level l, the betweenness scales as $g_B(v|k,l) \sim O(Nk^{L - l})$. After a sequence of manipulations (Methods), it can be shown that

$$P(g_B) \propto g_B^{ - 1},$$

(3)

indicating that the node betweenness of a Cayley tree scales with exponent α= −1, consistent with previous calculations of the link betweenness⁵². This provides a possible explanation for the scaling with N as well as the form of the tail found in the empirical measurements (Eq. (2)), indicating an underlying tree structure on which the high BC nodes of all cities lie, with the majority of flow concentrated around a spanning tree⁵³. While a similar feature is seen for the BC of weighted (non-planar) random graphs, this is only true for specific families of weight distributions⁵⁴, a factor that has little-to-no effect in planar graphs.

Of course, street networks are not pure trees and contain loops given by the cyclomatic number Γ = e−N + 1 (for a connected component) where N is the number of nodes and e is the number of edges. In the absence of loops, N = e + 1, and for fixed N, the addition of further edges will necessarily produce loops leading to alternate local paths for navigation. With increasing number of edges, a large fraction of the (previously) high betweenness nodes lying on the MST are bypassed, decreasing their contribution to the number of shortest paths. This induces the emergence of a low betweenness regime as well as increasingly sharp cutoffs in the tail, in line with empirical observations (Fig. 2).

To investigate the effect of increasing edges on the betweenness, we study a simple model of random planar graphs. Given that e ∼ O(N) and that N varies over three orders of magnitude in our dataset, we define a control parameter which we call the edge density,

$$\rho _e = \frac{e}{{e_{\rm DT}}},$$

(4)

defined as the fraction of extant edges e compared to the maximal number of possible edges e_DT (determined by the Delaunay triangulation). The parameter varies between ρ_e≈1/3 for the MST to ρ_e≈1 for the DT, and given that e_DT ≈ 3N, this is equivalent to the ratio of edges to nodes, or in the context of street networks, the average degree 〈k〉 of street intersections⁴⁹.

Next, we distribute N nodes uniformly in the 2D plane and first study the MST. To vary the density, we generate the DT on the set of nodes and remove edges until we reach the desired value for ρ_e. Figure 3a–d shows the betweenness distribution resulting from a hundred realizations of this procedure for N = 10⁴ and for increasing values of ρ_e from the MST to the DT. The distribution for the MST seen in Fig. 3a is peaked at N and is bounded by N² which gives here a range of order [10⁴, 10⁸]. In this interval the distribution follows a form close to our calculation for the Cayley tree (Eq. (3)). As one increases ρ_e and creates loops in the graph, we see the emergence of a bimodal form, with a low betweenness regime resulting from the bypassing of some of the high betweenness nodes due to the presence of alternate paths (Fig. 3b). As ρ_e is further increased, the distribution gets progressively homogeneous, yet remains peaked around N even as we approach the limiting case of the DT (Fig. 3d). As a guide to the eye, we shade the “tree-like” region from the “loop-like” region separated by the peak at N.

The simulations indicate the observed bimodality to be a combination of a high betweenness backbone belonging to the MST, and a low betweenness region generated by loops. The transition between the two regimes is determined by the minimum non-zero betweenness value for the MST, which is O(N) and the tail may have different peaks, determined by the distribution of branches emanating from the tree. Progressively decorating the tree with loops leads to arbitrarily low betweenness values due to the creation of multiple alternate paths, thus smoothing out the distribution, as the betweenness transitions from an interval [N,N²] for the MST to a continuous distribution over [1,N²] for the DT.

Spatial distribution of high betweenness centrality nodes

Figure 3e–h shows a single instance of the actual network generated by our procedure for each corresponding edge-density. Highlighted in red are nodes lying in the 90th percentile of betweenness. There is a distinct change in spatial pattern with increasing ρ_e; for the MST, they span the network and are tree-like with no apparent spatial correlation; as the network gets more dense, the nodes cluster together and move closer to the barycenter, suggesting a transition between a “topological regime” and a “spatial regime”.

To quantify these observed changes, we investigate the behavior of the high BC nodes at and above percentile θ through a set of metrics: the clustering C_θ which measures the spread of high betweenness nodes around their center of mass, the anisotropy factor A_θ which characterizes the spatial anisotropy of this set of nodes, and finally, the detour factor D which measures the average extent to which paths between two locations deviate from their geodesic distance. (Details on metrics shown in Methods)

In Fig. 4a we plot 〈C_θ〉 for θ = 90, 95, and 97 finding a clear asymptotic decrease with increasing ρ_e. In Fig. 4b the plot of 〈A_θ〉 in function of ρ_e, for the same set of thresholds as before, indicates a growing isotropic layout with a transition from a quasi one-dimensional to a two-dimensional spatial regime. This is confirmed by the corresponding decrease in the detour factor shown in Fig. 4c, where there is a rapid drop around ρ_e≈0.4 (or equivalently 〈k〉≈2) corresponding to the transition from the tree-like to the loop-like region.

Plotting the rescaled average betweenness of nodes as a function of the distance r from the barycenter (Methods), demonstrates a monotonic decrease with distance in the high density regime (Fig. 4d). For low values of ρ_e there appears no distance dependence of the nodes, whereas for ρ_e > 0.4, a clear dependence emerges with the curves converging to the form seen for maximally dense random geometric graphs as calculated in⁵⁵. (Note that while both planar and geometric graphs are embedded in space, the latter allows for edge-crossings and therefore broader degree distributions and larger number of edges for the same N. In light of this difference, the similarity between the two ostensibly different classes of graphs is notable.) In combination, the structural metrics suggest that while the spatial position of a node is decoupled from its BC value in sparse networks, a strong correlation emerges for increasingly dense networks.

We next investigate the spatial behavior of the high betweenness nodes in the empirical data. The distribution of ρ_e in Fig. 5a lies in a tight range (0.4 ≤ ρ_e≤ 0.6) with the majority of cities peaked at ρ_e ≈ 0.5. The observed range is notable, as for one it corresponds to a range of edge densities where a clear bimodal regime exists as seen in Fig. 3, while the peaked nature of ρ_e provide a further explanation for the observed similarity in BC distributions, given that it is the key controlling parameter. On the other hand, this provides a limited window for checking the spatial trends; indeed the curves for 〈C_θ〉, 〈A_θ〉 and D shown in Fig. 5b–d are noisy. Yet, within the extent of fluctuations, the trend is reasonably consistent with that seen in Fig. 4 for the same range of ρ_e. A clearer picture emerges when looking at individual cities; in Fig. 5e–h we show the geospatial layout of the BC distribution for the full street network in four representative cities arranged in increasing order of ρ_e. Santiago, being a city with relatively sparse number of streets, shows a tree-like anisotropic pattern for the high BC nodes that are spread mostly along a single axis of the city. Paris and Tokyo, being in the intermediate range, show a complicated lattice-like structure with loops spanning the spatial extent of the cities. Finally, Shenyang, being a city from the upper range of densities, shows a clear (relatively symmetric) clustering of the high BC nodes around the city center.

Temporal evolution of betweenness centrality in cities

The changes in the structure of the random graph, shown in Fig. 3, serves as a proxy for the evolution of a city as it experiences refinements in infrastructure with increased connectivity. While historical data of complete street networks in cities is limited, progress can be made by examining smaller subsets. To this effect, we make use of five historical snapshots of a portion of central Paris spanning 200 years (1790–1999), previously gathered to study the effects of central planning by city authorities⁴¹. The selected portion of Paris is around thirty square kilometers with about 10³ intersections and road-segments, and represents the essential part of the city around 1790. This particular period was chosen to examine the effects of the so-called “Hausmann transformation”, a major historical example of central planning in a city that happened in the middle of the 19th century in an effort to transform Paris and to improve traffic flow, navigability and hygiene (see refs. ⁴¹ and ⁵⁶ for historical details).

In Fig. 6a we show five instances of the street network (1790, 1836, 1849, 1888, 1999), corresponding to the region clipped to 1790. Highlighted in red are nodes at and above the 90^th percentile of betweenness. The spatial pattern of the nodes remains virtually identical (with a radial, spoke-like appearance) until 1849, and experiences an abrupt change to a ring-like pattern in 1888 which persists to modern times. This change corresponds to the period after the Haussmann transformation, involving the creation of new roads, broader avenues, city squares among other things. Yet, relative to the spatial extent of the region the high betweenness nodes are located near the city center. Also of note is the relative stability of the edge-density (ρ_e≈0.5) across the temporal period, reflecting the fact that both nodes and edges are growing at the same rate (Supplementary Fig. 13).

The rescaled BC distribution, $\tilde g_B$, is identical for all 5 snapshots as seen in Fig. 6b despite the significant structural changes. Figure 6c, d shows the clustering 〈C₉₀〉and anisotropy metrics 〈A₉₀〉 for the different eras, capturing the transition from the radial to the ring pattern, but are nevertheless relatively flat in correspondence with the trend in the planar random graph for fixed ρ_e. For purposes of comparison, we plot the averaged metrics for hundred random realizations (using the same procedure as in Fig. 3) for each of the five networks showing a remarkable similarity between the original and randomized cities. To track the evolution of the BC at the local level, we identify those intersections that are present throughout the temporal interval (within a resolution of fifty meters) and compute their betweenness in each instance of the network normalizing by N² to provide a consistent comparison, given the historical increase in intersections and roads. In Fig. 6e we plot the temporal evolution of g_B/N² for these intersections, coloring the points according to their corresponding relative rank. While one observes significant fluctuations in the BC at the local level (as expected), the high BC nodes are relatively stable from 1790 to 1849.

After the Haussmann intervention, one observes a dramatic drop in rank of the high BC nodes-corresponding to the “decongesting” spatial transition from a radial to a circular pattern-after which once again the high BC nodes are relatively stable till 1999. It is important to note that the load is simply redistributed to a different part of the network, as can be seen by the transition of the middle-ranked nodes to the top positions in the same periods. Furthermore, as indicated by the spatial layout of these “new” high BC nodes, they continue to be relatively close to the center (few or none are near the periphery), a pattern that is consistent with what one would expect to find for the corresponding random graphs.

Discussion

Taken together our results shed new light on the understanding of structural flow in spatial networks. The observed invariance in the BC distribution appears to be a function of the strong constraint imposed by planarity, leaving only the number of nodes N and the number of edges e as tunable parameters—a markedly different phenomenon than seen for non-planar networks, where betweenness is strongly correlated with degree. Empirical studies on street networks, analytical calculations on Cayley trees, coupled with simulations of random planar graph models, suggest this to be a consequence of a bimodal regime consisting of a tree-like structure with a tightly peaked branching ratio comprising the high betweenness “backbone” of the network, and a low betweenness regime dominated by the presence of loops. The transition of nodes between regimes is driven by increasing the density of edges in the network, which has the additional effect of introducing a spatial correlation in the high BC nodes—from being dominated by topology in the low-density regime to being strongly dependent on spatial location in the high-density regime. Given that the number of roads and intersection in our sampled cities vary over three orders of magnitude, the similarity in the BC distribution can be explained as a function of the observed narrow range of ρ_e. Indeed, it appears that the characteristics of flow across cities are better characterized by the spatial distribution of the high BC set, as well as the specific location of nodes that lie on this set, rather than global-level statistics.

On the other hand, the relative lack of sensitivity of the BC distribution to changes in the spatial layout, including distances and local topological variations, has interesting implications for urban planning. While the random graph models are closer in spirit to so-called self-organized cities that grow organically, the observed evolution of Paris suggests that central planning may also have its limitations. The invariance of the BC distribution suggests that congestion (in the structural sense) cannot be alleviated, but only redirected to different parts of the city. Indeed, the Haussmann transformation succeeded in doing precisely that by improving the navigability of Paris and decongesting the center. However, the high BC backbone continued to be closer to the center than the city periphery, a consequence of the spatial distribution being a function of ρ_e. For cities with a higher ratio of roads to intersections, the “decongestion-space” as it were, is expected to be even more limited.

It must be noted that the BC does have limitations in terms of predicting real-time traffic behavior. In particular, weighting edges based only on Euclidean distance artificially places more demand on shorter streets, although in reality, these streets may have lower speed limits and thus receive less travel demand⁵⁷. There is also the issue of spatially irregular travel demand which is overlooked in the betweenness formulation, as all pairs of nodes are given equal weight in the calculation of the global metric⁵⁸. Various solutions to this route-sampling issue⁴⁷ have been proposed; in particular, there have been studies using alternative versions of betweenness that weight each node pair proportional to its perceived travel demand, obtained via both real dynamic data and/or heuristics depending on the study^59,60. The planarity constraint is also alleviated in many cases with multilevel underpasses, public transportation, etc, although the majority of the network still remains planar. We argue that despite these concerns, the results of this study are flexible enough to suggest that load redistribution will be the primary result of planned traffic intervention given static network structure. In particular, we can absorb travel preference, distance, speed limits, and other spatially heterogeneous factors into our edge weights, and the invariance of the BC distribution to edge weight adjustment can be used as evidence for these factors not affecting the global load distribution (Cf. Supplementary Fig. 10). In addition, the construction of detours and alternative paths can be absorbed into factors affecting local topology, which also leaves the global BC distribution invariant (see Supplementary Note 3, Supplementary Fig. 14, Supplementary Table 3 for an analysis of the temporally fastest routes in a city).

Generally speaking, the study of high BC nodes is an important endeavor as they correspond to bottlenecks in networked systems. In some sense, they represent a generalization of studying the maximum BC node, that governs the behavior of the system in saturation cases where the traffic exceeds the node-capacity. Our analysis suggests, however, that for planar graphs, one needs to take into account the entire high BC set, since the maximum BC node can easily change due to local variations, yet is guaranteed to lie somewhere along the spanning tree that constitutes the backbone of the network. In this respect, further study of the mechanisms governing the spatial distribution of BC is important. Planar graphs are an important class of networks that include infrastructural systems such as power grids and communication networks, as well as transport networks found in biology and ecology¹. In particular, leaf venation networks, arterial networks, and neural cortical networks rely on tree-like structures for optimal function⁶¹. The lessons from this analysis may well be gainfully employed in these other sectors.

Methods

Construction of street networks

The street networks used in our analysis were constructed from the OpenStreetMaps (OSM) database⁶². For each city we extracted the geospatial data of streets connecting origin-destination pairs within a 30 km radius from the city center (referenced from https://www.latlong.net), corresponding to a rectangular area of ~60 × 60 km² with some variability due to road densities, latitude and topographical variations. The 30 km radius was chosen to encapsulate both high density urban regions and more suburban regions with fewer, longer streets. Furthermore, the choice of scale negates any (minimal) boundary effects on the calculated distribution of the BC^38,63. The locations of the street-intersections were found using an Rtree data structure for expedited spatial search⁶⁴. Lattitude and longitude coordinates were projected onto global distances using the Mercator projection, and adjacent intersections lying along the same roads were adjoined by edges with weights equal to the Euclidean distance between the intersections. The resulting street networks are weighted, undirected planar graphs with intersections as nodes, and edges between these nodes approximating the contour of the street network. Aggregate statistics are shown in Table 1.

Table 1 Aggregate statistics for the 97 street networks

Full size table

BC of Cayley trees

Let us consider a perfect Cayley tree of size N with fixed branching ratio k and all leaf nodes at the same depth. Adopting the convention l = L for the leaf level and l = 0 for the root, a node on the l-th level has k−1 branches directly below it at the (l+1)-th level, each with M_l+1 children such that the set of branches {n_i} stemming from this node will have sizes $\{ n_i\} = \{ M_{l + 1},...,M_{l + 1},N - M_l\}$. For fixed k there are k−1 copies of the term M_l+1 which is of the form

$$M_\lambda = \mathop {\sum}\limits_{l^\prime = 0}^{L - \lambda } k^{l^\prime } = \frac{{1 - k^{L - \lambda + 1}}}{{1 - k}}.$$

(5)

The betweenness value of a vertex v in any tree is given by $g_B(v) = \mathop {\sum}\nolimits_{i < j} n_in_j$ where i, j are indices running over the branches coming off of v (excluding v), and n_i, n_j are the number of nodes in each branch⁶⁵. Combining this with Eq. (5) gives us the betweenness of v at level l thus

$$g_B(v|k,l) = \left(\begin{array}{*{20}{c}} {} K-1\\ 2 \end{array}\right)M_{l + 1}^2 + (k - 1)M_{l + 1}\left( {N - M_l} \right),$$

(6)

from which it is easy to see that for any level l, the betweenness scales as $g_B(v|k,l) \sim O(Nk^{L - l})$. Thus, absorbing k^L into the leading constant A, and letting $g_B(v|k,l) \approx ANk^{ - l}$, we have that since g_B is completely determined by the level l in which it lies in the tree,

$$P(g_B) = \mathop {\sum}\limits_l P(g_B|l)P(l).$$

(7)

Now, using the fact that $P(l) = \frac{{k^l}}{N}$ and $P(g_B|l) = \delta _{g_B,ANk^{ - l}}$, we have that

$$P(g_B) = Ag_B^{ - 1}.$$

(8)

Spatial metrics for high BC nodes

To measure the clustering, we specify a threshold θ, i.e., we isolate nodes with a BC above the θ-th percentile-and then compute their spread about their center of mass, normalizing for comparison across networks of different sizes, thus,

$$C_\theta = \frac{1}{{N_\theta \langle X\rangle }}\mathop {\sum}\limits_{i = 1}^{N_\theta } ||x_i - x_{cm}||.$$

(9)

Here $x_{cm} = \frac{1}{N_{\theta}}\mathop {\sum}\nolimits_{i = 1}^{N_\theta } x_i$, N_θ is the number of high betweenness nodes isolated, {x_i} specify their coordinates, and 〈X〉 is the average distance of all nodes in the network to the center of mass of the high BC cluster,

$$\langle X\rangle = \frac{1}{N}\mathop {\sum}\limits_{i = 1}^N ||x_i - x_{cm}||.$$

(10)

Equation 9 quantifies the extent of clustering of the high BC nodes relative to the rest of the nodes in the network, with increased clustering resulting in low values of C_θ.

In order to more precisely quantify the transition between the topological and spatial regimes, a clue is provided by the increasingly isotropic layout of the high BC nodes with increasing edge-density. To measure the extent of this observed (an)isotropy, we define the ratio,

$$A_\theta = \frac{{\lambda _1}}{{\lambda _2}},$$

(11)

where λ₁ ≤ λ₂ are the (positive) eigenvalues of the covariance matrix of the spatial positions of the nodes with BC above threshold θ. The metric is unitless and measures the widths of the spread of points about their principal axes, analogous to the principal moments of inertia. Low values of A_θ correspond to a quasi one-dimensional structure with large anisotropy, whereas the system becomes increasingly isotropic for larger values until it is roughly two-dimensional as A_θ → 1.

The detour factor measures the average extent to which paths between two locations deviate from their geodesic distance and is given by

$$D = \frac{1}{{N(N - 1)}}\mathop {\sum}\limits_{i \ne j} \frac{{d_G(i,j)}}{{d_E(i,j)}}.$$

(12)

Here d_E(i,j) is the euclidean distance between nodes i,j, and d_G(i, j) is their distance-weighted shortest path in the network G.

Distance dependence of BC

In our simulations, nodes were located on a 100×100 grid with coordinates in ${\Bbb R}^2 \in [ - 50,50]$. The center of the grid was chosen as the origin (0, 0) and the average betweenness $\langle g_B(r)\rangle$ is computed over all nodes that are located at a distance r from the origin, advancing in units of r = 1, until we reach the grid boundary r = 50. In order to restrict $\langle g_B(r)\rangle$ to the interval [0, 1] we measure the rescaled quantity

$$\langle g_{\it{b}}^ \ast (r)\rangle = \frac{{\langle g_{\it{B}}(r)\rangle - {\rm min}\langle g_{\it{B}}(r)\rangle }}{{{\rm max}\langle g_{\it{B}}(r)\rangle - {\rm min}\langle g_{\it{B}}(r)\rangle }},$$

(13)

for different values of ρ_e. This was done to compare our results to the corresponding expression in random geometric graphs, which was analytically calculated for (the somewhat artificial) limit of an infinitely dense disk of radius R⁵⁵.

Data availability

All data needed to evaluate the conclusions are present in the paper and/or the Supplementary Information. The street networks were constructed from open access data. Any additional data related to this paper are available from the authors on reasonable request.

References

Mileyko, Y., Edelsbrunner, H., Price, C. A. & Weitz, J. S. Hierarchical ordering of reticular networks. PLoS ONE 7, e36715 (2012).
Article PubMed PubMed Central ADS CAS Google Scholar
Barthelemy, M. Spatial networks. Phys. Rep. 499, 1–101 (2011).
Article MathSciNet ADS CAS Google Scholar
Bretagnolle, A., Daudé, E. & Pumain, D. From theory to modelling: urban systems as complex systems. CyberGeo: Eur. J. Geogr. 335, 1–17 (2006).
Google Scholar
Bettencourt, L. & West, G. A unified theory of urban living. Nature 467, 912–913 (2010).
Article PubMed ADS CAS Google Scholar
Pan, W., Ghoshal, G., Krumme, C., Cebrian, M. & Pentland, A. Urban characteristics attributable to density-driven tie formation. Nat. Commun. 4, 1961 (2013).
Article PubMed ADS CAS Google Scholar
Batty, M. Building a science of cities. Cities 29, S9–S16 (2012).
Article Google Scholar
Barthelemy, M. The Structure and Dynamics of Cities (Cambridge University Press, Cambridge, UK, 2016).
Goh, S., Choi, M. Y., Lee, K. & Kim, K.-M. How complexity emerges in urban systems: theory of urban morphology. Phys. Rev. E 93, 052309 (2016).
Article PubMed ADS CAS Google Scholar
Bettencourt, L. The origins of scaling in cities. Science 340, 1438–1441 (2013).
Article MathSciNet PubMed MATH ADS CAS Google Scholar
Kalapala, V., Sanwalani, V., Clauset, A. & Moore, C. Scale invariance in road networks. Phys. Rev. E 73, 026130 (2006).
Article ADS CAS Google Scholar
Youn, H., Gastner, M. T. & Jeong, H. Price of anarchy in transportation networks: efficiency and optimality control. Phys. Rev. Lett. 101, 128701 (2008).
Article PubMed ADS CAS Google Scholar
Cardillo, A., Scellato, S., Latora, V. & Porta, S. Structural properties of planar graphs of urban street patterns. Phys. Rev. E 73, 066107 (2006).
Article ADS CAS Google Scholar
Justen, A., Martnez, F. J. & Cortés, C. E. The use of space-time constraints for the selection of discretionary activity locations. J. Transp. Geogr. 33, 146–152 (2013).
Article Google Scholar
Witlox, F. Evaluating the reliability of reported distance data in urban travel behaviour analysis. J. Transp. Geogr. 15, 172–183 (2007).
Article Google Scholar
da F. Costa, L., Travençolo, B. A. N., Viana, M. P. & Strano, E. On the efficiency of transportation systems in large cities. Europhys. Lett. 91, 18003 (2010).
Article ADS CAS Google Scholar
Wang, P., Hunter, T., Bayen, A. M., Schechtner, K. & González, M. C. Understanding road usage patterns in urban areas. Sci. Rep. 2, 1001 (2012).
Article PubMed PubMed Central ADS CAS Google Scholar
Kang, C., Ma, X., Tong, D. & Liu, Y. Intra-urban human mobility patterns: an urban morphology perspective. Phys. A 391, 1702–1717 (2012).
Article Google Scholar
Haggett, P. & Chorley, R. J. Network Analysis in Geography (St. Martins Press, New York, 1969).
Google Scholar
Lammer, S., Gehlsen, B. & Helbing, D. Scaling laws in the spatial structure of urban road networks. Physica. A 369, 853866 (2006).
Google Scholar
Wang, F., Antipova, A. & Porta, S. Street centrality and land use intensity in Baton Rouge, Louisiana. J. Transp. Geogr. 19, 285–293 (2011).
Article Google Scholar
Rui, Y., Ban, Y., Wang, J. & Haas, J. Exploring the patterns and evolution of self-organized urban street networks through modeling. Eur. Phys. J. B 86, 74 (2013).
Article ADS Google Scholar
Louf, R. & Barthlemy, M. A typology of street patterns. J. R. Soc. Interface 11, 20140924 (2014).
Article PubMed PubMed Central Google Scholar
Strano, E. et al. Urban Street Networks: a comparative analysis of ten European Cities. Environ. Plann. B. Plann. Des. 40, 1071–1086 (2013).
Article Google Scholar
Masucci, A. P., Smith, D., Crooks, A. & Batty, M. Random planar graphs and the London street network. Eur. Phys. J. B 71, 259–271 (2009).
Article MathSciNet MATH ADS CAS Google Scholar
Clark, J. & Holton, D. A. A First Look at Graph Theory (World Scientific, Teaneck, NJ, 1991).
Book MATH Google Scholar
Newman, M. E. J. Networks: An Introduction (Oxford University Press, Oxford, 2010).
Book MATH Google Scholar
Aldous, D. & Ganesan, K. True scale-invariant random spatial networks. Proc. Natl Acad. Sci. USA 110, 8782–8785 (2013).
Article MathSciNet PubMed PubMed Central MATH ADS Google Scholar
Aldous, D. Routed planar networks. Electron. J. Graph Theory Appl. 4, 42–59 (2016).
Article MathSciNet Google Scholar
Ghoshal, G. & Barabási, A.-L. Ranking stability and super stable nodes in complex networks. Nat. Commun. 2, 394 (2011).
Article PubMed CAS Google Scholar
Barthelemy, M. Crossover from scale-free to spatial networks. Europhys. Lett. 63, 915 (2003).
Article ADS CAS Google Scholar
Freeman, L. C. A set of measures of centrality based on betweenness. Sociometry 40, 35–41 (1977).
Article Google Scholar
Holme, P. Congestion and centrality in traffic flow on complex networks. Adv. Complex Syst. 6, 163–176 (2003).
Article MATH Google Scholar
Ashton, D. J., Jarrett, T. C. & Johnson, N. F. Effect of congestion costs on shortest paths through complex networks. Phys. Rev. Lett. 94, 058701 (2005).
Article PubMed ADS CAS Google Scholar
Jarrett, T. C., Ashton, D. J., Fricker, M. & Johnson, N. F. Interplay between function and structure in complex networks. Phys. Rev. E 74, 026116–026118 (2006).
Article MathSciNet ADS CAS Google Scholar
Roswall, M., Trusina, A., Minnhagen, P. & Sneppen, K. Networks and cities: an information perspective. Phys. Rev. Lett. 94, 028701 (2005).
Article ADS CAS Google Scholar
Jiang, B. A topological pattern of urban street networks: universality and peculiarity. Phys. A 384, 647–655 (2007).
Article MathSciNet Google Scholar
Chan, S. H. Y., Donner, R. V. & Lämmer, S. Urban road networks—spatial networks with universal geometric features? Eur. Phys. J. B 84, 563–577 (2011).
Article ADS CAS Google Scholar
Lion, B. & Barthelemy, M. Central loops in random planar graphs. Phys. Rev. E 95, 042310 (2017).
Article PubMed ADS MathSciNet Google Scholar
Crucitti, P., Latora, V. & Porta, S. Centrality measures in spatial networks of urban streets. Phys. Rev. E 73, 036125 (2006).
Article MATH ADS CAS Google Scholar
Porta, S., Crucitti, P. & Latora, V. The network analysis of urban streets: a primal approach. Environ. Plann. B. Plann. Des. 33, 705–725 (2006).
Article MATH Google Scholar
Barthelemy, M., Bordin, P., Berestycki, H. & Gribaudi, M. Self-organization versus top-down planning in the evolution of a city. Sci. Rep. 3, 2153 (2013).
Article PubMed PubMed Central ADS Google Scholar
Barthelemy, M. Betweenness centrality in large complex networks. Eur. Phys. J. B 38, 163–168 (2004).
Article ADS CAS Google Scholar
Strano, E. et al. The scaling structure of the global road network. J. R. Soc. Interface 4, 170590 (2017).
Google Scholar
Gago, S., Hurajová, J. & Madaras, T. Notes on the betweenness centrality of a graph. Math. Slov. 62, 1–12 (2012).
Article MathSciNet MATH Google Scholar
Narayan, O. & Saniee, I. Large-scale curvature of networks. Phys. Rev. E 84, 066108 (2011).
Article ADS CAS Google Scholar
Jonckheere, E., Lou, M., Bonahon, F. & Baryshnikov, Y. Euclidean versus hyperbolic congestion in idealized versus experimental networks. Internet Math. 7, 1–27 (2011).
Article MathSciNet MATH Google Scholar
Lee, M., Barbosa, H., Youn, H., Holme, P. & Ghoshal, G. Morphology of travel routes and the organization of cities. Nat. Commun. 8, 2229 (2017).
Article PubMed PubMed Central ADS CAS Google Scholar
Clark, C. Urban population densities. J. R. Stat. Soc. Ser. A. 114, 490–496 (1951).
Article Google Scholar
Lee, D.-T. & Schachter, B. J. Two algorithms for constructing a delaunay triangulation. Int. J. Comput. & Inf. Sci. 9, 219–242 (1980).
Article MathSciNet MATH Google Scholar
Newman, M. E. J., Watts, D. J. & Strogatz, S. Random graphs with arbitrary degree distributions and their applications. Phys. Rev. E 64, 026118 (2001).
Article ADS CAS Google Scholar
Graham, R. L. & Hell, P. On the history of the minimum spanning tree problem. Ann. Hist. Comput. 7, 43–57 (1985).
Article MathSciNet MATH Google Scholar
Szabó, G., Alava, M. & Kertész, J. Shortest paths and load scaling in scale-free trees. Phys. Rev. E 66, 026101 (2002).
Article ADS CAS Google Scholar
Wu, Z., Braunstein, L. A., Havlin, S. & Stanley, H. E. Transport in weighted networks: partition into superhighways and roads. Phys. Rev. Lett. 96, 148702 (2006).
Article PubMed ADS CAS Google Scholar
Wang, H., Hernandez, J. M. & Van Mieghem, P. Betweenness centrality in a weighted network. Phys. Rev. E 77, 046105 (2008).
Article ADS CAS Google Scholar
Giles, A. P., Georgiou, O. & Dettmann, C. P. Betweenness centrality in dense random geometric networks. In 2015 IEEE International Conference on Communications (ICC) 6450–6455 (IEEE, 2015).
Jordan, D. Transforming Paris: The Life and Labors of Baron Haussmann (University of Chicago Press, Chicago, USA, 1995).
Google Scholar
Leung, I. X., Chan, S.-Y., Hui, P. & Lio, P. Intra-city urban network and traffic flow analysis from gps mobility trace. Preprint at https://arxiv.org/abs/1105.5839 (2011).
Kazerani, A. & Winter, S. Can betweenness centrality explain traffic flow? In 12th AGILE International Conference on Geographic Information Science 1–9 (European Commission, 2009).
Gao, S., Wang, Y., Gao, Y. & Liu, Y. Understanding urban traffic-flow characteristics: a rethinking of betweenness centrality. Environ. Plan B Urban Anal. City Sci. 40, 135–153 (2013).
Google Scholar
Chen, S., Huang, W., Cattani, C. & Altieri, G. Traffic dynamics on complex networks: a survey. Math. Probl. Eng. 2012, 732698 (2012).
MathSciNet MATH Google Scholar
Tekin, E., Hunt, D., Newberry, M. G. & Savage, V. M. Do vascular networks branch optimally or randomly across spatial scales? PLoS Comput. Biol. 12, e1005223 (2016).
Article PubMed PubMed Central ADS CAS Google Scholar
OpenStreetMap Working Data Group. OpenStreetMap. Planet OSM, http://planet.openstreetmap.org (2015).
Gil, J. Street network analysis “edge effects”: examining the sensitivity of centrality measures to boundary conditions. Environ. Plan B Urban Anal. City Sci. 44, 819–836 (2016).
Article Google Scholar
Guttman, A. R-trees: a dynamic index structure for spatial searching. In Proc. of the 1984 ACM SIGMOD International Conference on Management of Data Vol. 14, 47–57 (ACM, New York, 1984).
Unnithan, S. K. R., Balakrishnan, K. & Jathavedan, M. Betweenness centrality in some classes of graphs. Int. J. Combinatorics 2014, 241723 (2014).
MathSciNet MATH Google Scholar

Download references

Acknowledgements

This work was partially supported by the US Army Research Office under Agreement Number W911NF-17-1-0127. M.B. thanks the city of Paris (Paris 2030) for funding and the geohistoricaldata group for discussions and data. Map data copyrighted by OpenStreetMap contributors and available from https://www.openstreetmap.org.

Author information

Authors and Affiliations

Department of Physics & Astronomy, University of Rochester, Rochester, NY, 14627, USA
Alec Kirkley, Hugo Barbosa & Gourab Ghoshal
Institut de Physique Théorique, CEA, CNRS-URA 2306, Gif-sur-Yvette, F-91191, France
Marc Barthelemy
Centre d’Analyse et de Mathématique Sociales (CNRS/EHESS), 54 Boulevard Raspail, Paris, 75006, France
Marc Barthelemy
Goergen Institute for Data Science, University of Rochester, Rochester, NY, 14627, USA
Gourab Ghoshal

Authors

Alec Kirkley
View author publications
You can also search for this author in PubMed Google Scholar
Hugo Barbosa
View author publications
You can also search for this author in PubMed Google Scholar
Marc Barthelemy
View author publications
You can also search for this author in PubMed Google Scholar
Gourab Ghoshal
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.K., H.B., M.B., and G.G. designed the study. A.K. and H.B. implemented the method. A.K., H.B., M.B., and G.G. analyzed the results and wrote the manuscript.

Corresponding author

Correspondence to Gourab Ghoshal.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kirkley, A., Barbosa, H., Barthelemy, M. et al. From the betweenness centrality in street networks to structural invariants in random planar graphs. Nat Commun 9, 2501 (2018). https://doi.org/10.1038/s41467-018-04978-z

Download citation

Received: 02 December 2017
Accepted: 05 June 2018
Published: 27 June 2018
DOI: https://doi.org/10.1038/s41467-018-04978-z

This article is cited by

Behavior-encoded models reveal differentiated access to public cooling environment by race and income
- Chao Li
- Xing Su
- Haoying Han
npj Urban Sustainability (2024)
A Global Feature-Rich Network Dataset of Cities and Dashboard for Comprehensive Urban Analyses
- Winston Yap
- Filip Biljecki
Scientific Data (2023)
Future urban transport management
- Ziyou Gao
- Hai-jun Huang
- Jianjun Wu
Frontiers of Engineering Management (2023)
15-min Cities: the Potential of a Medium-Sized Polycentric Latin American City
- Jorge Urrutia-Mosquera
- Luz Flórez-Calderón
- Dusan Paredes
Journal of Urban Health (2023)
Influence maximization based on SATS scheme in social networks
- Xinxin Zhang
- Min Gao
- Zhaobin Zhou
Computing (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.