Field theory for recurrent mobility

Mazzoli, Mattia; Molas, Alex; Bassolas, Aleix; Lenormand, Maxime; Colet, Pere; Ramasco, José J.

doi:10.1038/s41467-019-11841-2

Download PDF

Article
Open access
Published: 29 August 2019

Field theory for recurrent mobility

Nature Communications volume 10, Article number: 3895 (2019) Cite this article

11k Accesses
49 Citations
72 Altmetric
Metrics details

Subjects

Abstract

Understanding human mobility is crucial for applications such as forecasting epidemic spreading, planning transport infrastructure and urbanism in general. While, traditionally, mobility information has been collected via surveys, the pervasive adoption of mobile technologies has brought a wealth of (real time) data. The easy access to this information opens the door to study theoretical questions so far unexplored. In this work, we show for a series of worldwide cities that commuting daily flows can be mapped into a well behaved vector field, fulfilling the divergence theorem and which is, besides, irrotational. This property allows us to define a potential for the field that can become a major instrument to determine separate mobility basins and discern contiguous urban areas. We also show that empirical fluxes and potentials can be well reproduced and analytically characterized using the so-called gravity model, while other models based on intervening opportunities have serious difficulties.

A generalized vector-field framework for mobility

Article Open access 13 June 2024

The universal visitation law of human mobility

Article 26 May 2021

The scales of human mobility

Article 18 November 2020

Introduction

Human mobility has been studied for decades due to the relevant role it plays in a wide spectrum of applications including economic questions and living conditions^1,2,3, city structure^4,5, forecasting epidemic spreading^6,7,8,9, traffic demand and design of new infrastructure¹⁰, or urban pollution and air quality¹¹. Data on people migrations dates back at least to 1871 when the United Kingdom registered the difference in inhabitants during a decade¹². More recently, in the last decades, census surveys in countries around the world have included a question on the tract of residence and that of work (see for instance the Supporting Information of⁷ to find a list). Aggregating the home-work trips of the single individuals, one can define the so-called Origin-Destination (OD) matrices that for every pair (i, j) collect the flow of people traveling from census tract i to j, T_ij. These matrices are absolutely essential for transport planning since they encode trip demand. Census and specially dedicated surveys have dominated the area in terms of mobility data collection until a few years ago^13,14. With the advent of the big data era, the availability of large-scale quick-updated data has notably increased. Passive sources such as mobile phone records or GPS-located messages in online social networks (Twitter, Foursquare, etc) have been employed to study mobility^{15,16,17,18,19,20} and, in particular, to extract OD matrices (see also the recent reviews^14,21). It is worth noticing that the quality of the OD matrices obtained from these new information and communication technologies (ICT) data sources have been confronted against the information provided by surveys with satisfactory results in urban areas at geographical scales larger than one square kilometer¹⁸. The wealth of new data opens the door to tackle and revisit relevant theoretical aspects concerning mobility flows that could not been boarded before.

From a theoretical perspective, two competing frameworks have been used for almost 80 years to characterize mobility flows: the gravity^22,23 and the intervening opportunity^24,25 models. Their main difference lies in the way in which the geographical distance affects the flows. While in the gravity model the flows decay with a certain deterrence function (usually, with an exponential or power law-like forms^26,27,28,29), the intervening opportunity models depend on the “opportunities” or jobs enclosed within a given area. Since the opportunity distribution can be highly heterogeneous in space, the distance plays an indirect role on the final assignment of the trip destinations and, in turn, on the decay of the total flows^14,30. A few years ago, it has been introduced the so-called radiation model as an evolution of the intervening opportunity concept in which the opportunity selected is supposed to be the best possible choice simplifying the statistic treatment, and the density of opportunities is related to the population^30,31. This allows to write a closed formula for the probability of a trip to finish at a given geographical unit. Regarding the gravity model, its functional shape was proposed ad hoc, essentially inspired by Newton’s law in which the populations act as masses^32,33, although it can be also recovered from maximal entropy arguments³⁴. Moreover, the model can be developed further by taking into account the distinguishability of the trips^35,36,37. Early after the gravity model introduction, the possibility of defining a potential was discussed³⁸ but the lack of reliable data prevented ulterior research in this direction.

Several works have focused on the comparison between the two families of models and their performance when compared with empirical data^{39,40,41,42,43,44,45,46,47}. It is worth mentioning that a fair comparison requires to be carried out over the same type of mobility data (daily or sporadic trips behave differently) and with the same constraints. The constraints here refer to the amount of information provided to the model. The basic unconstrained models only include the population in the geographical units, while in the constrained versions the total in- or/and out-flows are also supplied⁴⁶.

In this work, we propose a method to define a mesoscopic vector field out of daily commuting data. This field turns out to be well-behaved, fulfilling Gauss’s divergence theorem and being irrotational. Given that we are analyzing empirical information, these results are far from trivial and they reveal intrinsic features of aggregated daily human mobility. The existence of a well-behaved mesoscopic field is confirmed with both data from Twitter and census for large urban areas. By taking into account the irrotational nature of the field, we also define a potential for the mobility flows. This potential is a tool that will crucially contribute to controversial issues such as the functional definition of city limits⁴⁸ and the presence of polycenters⁵. After these first empirical results, we focus on which properties of the mesoscopic field can be reproduced by the models. In the case of the gravity, the fluxes over surfaces, rotational and potential empirical observations are well reproduced with an exponentially decaying deterrence function and they can be analytically obtained or approximated. The radiation model has, however, stronger difficulties to reproduce the empirical values of the fluxes.

Results

Definition of the vector field

We obtain OD matrices between cells of 1 × 1 km² from Twitter and, where available, also from census data in several worldwide cities (see Supplementary Table 1 for a list of cities and Methods, below, for a description of the data cleaning procedure). We call T_ij to the daily flow of commuters from cell i, home, to j, work. There can be flows between any pair of cells in the city. As defined, the OD matrix T_ij contains only information on trips origin and final destination, not about trajectories or middle points visited. We then define a vector centered in i, $T_{ij}{\kern 1pt} \overrightarrow {\mathbf{u}} _{ij}$, where $\overrightarrow {\mathbf{u}} _{ij}$ is the unit vector from i to j. The vectors pointing to all destinations j are then vectorially summed to obtain a resultant vector $\overrightarrow {\mathbf{T}} _i = \mathop {\sum}\nolimits_j {T_{ij}{\kern 1pt} \overrightarrow {\mathbf{u}} _{ij}}$ in every cell i (see Fig. 1a). These vectors define a field in the space and they identify the mean outgoing mobility direction in every point. If the mobility is balanced in opposite directions, the vector $\overrightarrow {\mathbf{T}} _i$ can vanish. These equilibrium (Lagrange) points play an important role in the field theoretical framework. As an illustration, empirical fields for London and Paris are displayed in Fig. 1b, c, respectively. Further examples for other cities are shown in the Supplementary Figs. 27–42.

Drawing a parallel with classical field theories, $\overrightarrow {\mathbf{T}} _i$ can be divided by the “mass” of the origin cell i (home-place) to define the vector field

$$\overrightarrow {\mathbf{W}} _i = \frac{{\overrightarrow {\mathbf{T}} _i}}{{m_i}} = \mathop {\sum}\limits_{j \ne i} {\frac{{T_{ij}}}{{m_i}}{\kern 1pt} \overrightarrow {\mathbf{u}} _{ij}} ,$$

(1)

where the mass m_i corresponds to the cell population considered in the analysis. The vector $\overrightarrow {\mathbf{W}} _i$, defined at the mesoscopic cell-size scale, is the main object of study in this work and it represents an average mobility per capita. Our data refers to commuters, either those calculated from Twitter or collected by the census. For practical reasons, we define the local mass m_i as the total number of commuters residing in cell i. This means that $m_i = \mathop {\sum}\nolimits_j {T_{ij}}$, with the sum including the term j = i. This definition allows us to apply a coherent treatment to all our databases and it is an approximation for the total workforce living in every cell. As shown in⁴⁶, the mass defined in this way yields better flow estimates than the actual cell population for both gravity and radiation models.

If instead of home to work, we consider the returning trip from work to home the picture does not change significantly. If the vectors $\overrightarrow {\mathbf{T}} _i$ are still defined at the residence cell, their sense reverses but the modulus remains unchanged. The spatial organization of the field is, therefore, invariant and it does not affect the results shown below (except for one sign). On the other hand, if instead of calculating the resultant vector at the residence place we define it at the working cell: $\overrightarrow {\mathbf{T}} _j^\prime = \mathop {\sum}\nolimits_i {T_{ij}} {\kern 1pt} \overrightarrow {\mathbf{u}} _{ji}$ and $\overrightarrow {\mathbf{W}} _j^\prime = \overrightarrow {\mathbf{T}} _j^\prime /m_j$, the values of the vectors themselves modify at every location but the mesoscopic field behavior and the main properties studied below are robust (see Supplementary Note 13 and Supplementary Fig. 51).

Empirical results

Once the field is defined, we can calculate directly from empirical data the flux across any closed perimeter from the surface integral ${\it{\Phi }}_W^S = {\oint} \, {d\ell } {\kern 1pt} \overrightarrow {\mathbf{n}} {\kern 1pt} \overrightarrow {\mathbf{W}}$, where $\overrightarrow {\mathbf{n}}$ is the unit vector normal to the perimeter in each point and $d\ell$ the infinitesimal of length, and compare it with the volume integral of the divergence of $\overrightarrow {\mathbf{W}}$, ${\it{\Phi }}_W^V = {\int} {dS} {\kern 1pt} \nabla \overrightarrow {\mathbf{W}}$, in the area enclosed inside the perimeter (where dS is the infinitesimal of area). This allows us to assess whether the empirical vector field $\overrightarrow {\mathbf{W}}$ fulfils Gauss’s Theorem of the Divergence or not. Gauss’s theorem states that

$${\it{\Phi }}_W^S = {\oint}\, {d\ell } {\kern 1pt} \overrightarrow {\mathbf{n}} {\kern 1pt} \overrightarrow {\mathbf{W}} = {\int} \, {dS} {\kern 1pt} \nabla \overrightarrow {\mathbf{W}} = {\it{\Phi }}_W^V,$$

(2)

and it implies that the field is generated by a source and that the fluxes through surfaces must respect conservation laws. The numerical estimations of the flux Φ_W as a function of the scale using both integrals are shown in Fig. 2 for London and Paris with two perimeter shapes: a circle and a square. As it can be seen, the agreement between both approaches is rather good with $R_{\mathrm{P}}^2 = 0.96$ (circle) and $R_{\mathrm{P}}^2 = 0.89$ (square) for London and $R_{\mathrm{P}}^2 = 0.97$ (circle) and $R_{\mathrm{P}}^2 = 0.80$ (square) for Paris. $R_{\mathrm{P}}^2$ is obtained as the square of the Pearson correlation coefficient of both curves. We have run the same test in several cities with Twitter data. Supplementary Table 1 shows the list of coordinates of the central points of the perimeters in each city and Supplementary Table 2 the results of the comparisons. In most of the cases the values of $R_{\mathrm{P}}^2$ are in the range 0.8–0.97 with only two exceptions that are, in any case, over 0.66. For completeness, the same operation has been performed with census data in London ($R_{\mathrm{P}}^2 = 0.98$ both for the circle and the square) and in Paris ($R_{\mathrm{P}}^2 = 1$ for the circle and $R_{\mathrm{P}}^2 = 0.98$ for the square) as can be seen in Supplementary Fig. 1. This implies that the field does indeed fulfil Gauss’s theorem.

Similarly, we can compute the curl of the vector field directly out of the data (see Methods). The field $\overrightarrow {\mathbf{W}}$ is embedded in a x–y plane and, therefore, ∇ × $\overrightarrow {\mathbf{W}}$ has only a component on the z-direction. The outcome of ||∇ × $\overrightarrow {\mathbf{W}}$|| using a colormap is depicted in Fig. 3a. The values of the curl modulus is of the order of 10⁻¹ in km⁻¹. To evaluate whether this is small or large, we have defined a null model by randomly redirecting the angles of the vectors of each cell. The curl of the random model is of the same scale as the empirical field (Fig. 3b). For instance, calculating the dimensionless numbers ${\int} {dS{\kern 1pt} } \, ||\nabla \times \overrightarrow {\mathbf{W}} ||^2$ we obtain 21 for the empirical field and 45 for null model. Furthermore, the distribution of the original ∇ × $\overrightarrow {\mathbf{W}}$ is similar to the random one, with a mixed between a delta distribution at zero and a symmetric exponential decay in the tails (Supplementary Fig. 43). This means that the values that we observe in the empirical curl are compatible with random fluctuations and the possibility of having a developed rotational structure in the field is rejected. The comparison with the modulus of the original field shows as well that the curl is 4 orders of magnitude smaller (Fig. 1b). All these evidences support the irrotational character of $\overrightarrow {\mathbf{W}}$ and allow us to define a potential for it. These results are further supported by the vectors $\overrightarrow {\mathbf{W}}$ angle analysis performed in Supplementary Note 11 (Supplementary Figs. 45–50).

Circular infrastructures are not so uncommon in cities, besides circular metro lines many highways are organized as concentric rings when there is no major geographical impediment as in Paris or London. One may, thus, wonder why typically we do not observe rotational components in the cities vector field. To have such components, it would be necessary to have an unbalanced flow of people living in an area and working in another over the ring following one of the rotation senses. At the scale that we are using, this is not seen anywhere in the cities under study. The main factor that could favor the emergence of rotational components is thus the segregation of land use. However, land use mixing is strong enough in large cities⁴⁹ to prevent this sort of loops in the mobility flows at mesoscopic scales, leading to hierarchical configurations of the mobility with a few clear attraction centers.

Models

There are two main modeling frameworks in the literature to characterize mobility flows: those based on intervening opportunities and those based on gravity-like approaches. Here we have considered different variations of these models. In the case of the gravity model, the deterrence function can show either an exponential or a power-law decay with the distance. For the intervening opportunities, we have focused on the radiation model³¹ and its nonlinear version⁴⁵. Models can be classified as unconstrained if only require the masses in every cell m_i as inputs and production-constrained if additionally need the empirical outflow from each cell in order to estimate flows to other cells. The results discussed in this main paper refer to the unconstrained gravity with an exponential deterrence function and to the radiation model that is production-constrained. For the gravity model, the unconstrained version is considered because of its simplicity and amenability to analytical treatment (see Supplementary Note 4, Supplementary Figs. 14–19). The model parameters (for the gravity k and d₀) have been adjusted to best reproduce the curve of the flux as a function of distance from the city center in terms of $R_{\mathrm{P}}^2$. For the results of other models and details on the parameter calibration see Supplementary Note 3, Supplementary Tables 3 and 4, Supplementary Fig. 13, and Supplementary Note 6 along with Supplementary Figs. 23–26.

We consider a set of circles centered at the center of London with radius R from 0 to 40 km (Supplementary Table 1). The flux of $\overrightarrow {\mathbf{W}}$ across the circles with different R is computed for both models and compared with the empirical value (Fig. 4). While the gravity model with an exponential deterrence function works well at reproducing the entering fluxes of the vector field $\overrightarrow {\mathbf{T}}$ and $\overrightarrow {\mathbf{W}}$ in the Greater London Area, the radiation model does not capture the level of fluxes observed empirically, despite receiving more detailed input information given that it is a production-constrained model. This is due to the fact that the local individual mobility predicted by the radiation model is more isotropic than the empirical one and the mobility predicted by the gravity. The results for other cities are consistent (Supplementary Note 7, Supplementary Figs. 27–42). The nonlinear radiation model improves a little the situation but it still underestimates the fluxes (Supplementary Note 6, Supplementary Figs. 23–26). The gravity with a power-law decaying deterrence function is neither able to reproduce well Φ_W(R) or Φ_T(R) (Supplementary Note 5, Supplementary Figs. 19–22). The unconstrained gravity framework provides the important advantage of allowing an analytical treatment for the fluxes, which is based on a scaling approach that is exact for the power-law deterrence function and approximated for the exponential (Supplementary Note 5 and Supplementary Fig. 22).

A recent brute-force comparison between models (gravity, radiation and intervening opportunities with different levels of contraint) and empirical commuting flows was carried out in⁴⁶. The performance indicators at single flow level were favoring the exponential gravity model but the metrics were not able to capture big differences across models. For completeness, a similar analysis based on trip distance distribution has been included in Supplementary Note 10 and Supplementary Fig. 44. As with the direct flows, the results are not conclusive regarding model performance. However, the behavior of the fluxes as a function of the radius clearly discern between models performance. One may wonder what the origin is of these differences. The answer reveals the real potential of the vectorial framework. Besides the modulus, the empirical vectors $\overrightarrow {\mathbf{W}} _i$ also have a direction that must be reproduced by the models. Measuring the angle of the vector over the horizontal positive axis Θ_emp and comparing it with the models predictions Θ_mod, we obtain the scatter plots of Fig. 5 for London and Paris (results for other cities are in Supplementary Fig. 47). The domain of Θ_mod has been adjusted to minimize the difference. As seen in Fig. 5, the gravity model reproduces much better the direction of the vectors. Since the calculation of the fluxes involves a scalar product between $\overrightarrow {\mathbf{W}}$ and the perimeter normal vector, the directionality (besides the modulus) is essential to obtain a good result. An analysis performed with direct trip flows would never be able to detect these differences.

City potential

Since we have empirically found that the field $\overrightarrow {\mathbf{W}}$ can be considered irrotational, we can define a scalar potential using the formula $\overrightarrow {\mathbf{W}}$ = −∇V. Numerically, this means to find V_i in every cell i given the vector field $\overrightarrow {\mathbf{W}} _i$. The procedure to do this is detailed in the Methods Section. Figure 6a shows the empirical potential for London obtained with Eqs. (12) and (13) compared with the one computed by the gravity model with exponential deterrence function using the same treatment as in Fig. 6b. The same results for Paris are displayed in Fig. 6d, e. Even though the empirical potential is noisier than the one obtained with the gravity model, they agree well. As shown in Fig. 6c, the level of correlation is $R_{\mathrm{P}}^2 = 0.98$ for London and $R_{\mathrm{P}}^2 = 0.93$ for Paris (Fig. 6f). The potential has a clear marked minimum in the center of the city, which is a clue of the commuting monocentricity at these scales. As depicted in Fig. 7, other cities or conurbations have a different configuration with as many local minima as mobility centers. Note that this is an appropriate method to define and visualize areas of attraction of each city and their geographic limits. The equipotential contour plots for other cities are shown in the Supplementary Note 8 and Supplementary Fig. 42.

Discussion

In summary, we have introduced a vectorial field framework to characterize human mobility flows. When considering recurrent home-work mobility in cities, we find that the mesoscopic field representing the flows is well-behaved in the sense of satisfying Gauss’s theorem and, besides, it is irrotational. As a consequence of this last point, it is possible to define a scalar potential, which reducing the dimensionality of the system encodes all the information on the commuting at a mesoscopic scale. The results are corroborated using two independent data sources for the commuting. Twitter data is used in the main text, and the results are reproduced for census data in the Supplementary Note 2 and Supplementary Figs. 2–10 for London, Manchester and Paris. Our focus here has been on commuting, which in most cities corresponds to over 60% of the total mobility. However, we cannot discard that other types of mobility at larger or shorter ranges may display similar behaviors. This remains as an open question for further exploration.

Our results have important consequences both from theoretical and applied perspectives. From a theoretical point of view, there are no a-priori reasons to assume that individual mobility at microscopic scale could induce a well-behaved mesoscopic field amenable to continuous treatment. Finding it from the empirical data implies that recurrent mobility in cities obeys deep symmetries that can be fully understood and described only within the framework of field theory. In particular, Gauss’s and the rotational are the most basic theorems in the theory. They are the blocks upon which more involved results (metrics, theorems, etc) are built and this is why it is so important to prove that the vectors obtained from empirical data satisfy both. Gauss’s theorem means that the field is generated by a source and that the fluxes through surfaces must respect conservation laws. These constraints affect the flows and also the directions as shown in Figs. 4 and 5. The irrotational nature of the field implies that one can derive the field from a potential and vice versa, the field is univocally determined by the potential. The symmetries of the potential are also present in the field and, among other things, the dimensionality of the problem can be reduced: from a vector in every location to a scalar. Differences in the potential between points decide the direction and intensity of the mobility flows. Out of the symmetries usually it is possible to define invariant (conservative) quantities that play a central role in the vector field. Our work opens thus the door to use the heavy mathematical machinery developed during centuries to cope with vector fields.

Concentrating in the data, this framework allows to better distinguish between models performance. Any model trying to reproduce daily mobility flows should generate a field with the properties observed here in the empirical data. Otherwise, the model does not adjust to reality. These models have been used for decades to calculate trip demand in the planning of transport infrastructure. This is, therefore, a very relevant applied question. Recent brute-force comparisons between models and empirical commuting flows throw no clear conclusion on which model reproduces best the data. The metrics used were based on the analysis of raw mobility flows, hence a different approach is needed to reach a final conclusion. This is the role that the field theoretical perspective covers. Beyond the raw flows, the vector field has also a direction in each point and we can compare directions between model predictions and empirical data. This analysis shows that the gravity model with an exponential decay best reproduces both flows and directions. This result is further confirmed with the study of the fluxes across surfaces where the directionality plays a central role. We observe a better fit to the empirical curves as a function of the distance from the city center by the gravity model. Furthermore, the unconstrained gravity model admits an analytical treatment capable of producing expressions for the flux and the potential. This example is a proof of the potential of the vector representation.

In the gravity model framework, the existence of a potential has been postulated decades ago but these hypotheses were not systematically validated against data. We perform such validation and confirm that the gravity model with an exponential deterrence function generates a potential compatible with the empirical one. The potential is a fundamental tool to tackle hard open problems such as the definition of centers in cities, polycentricity and borders in conurbation systems. The shape of the potential sheds new light on the spatial organization of mobility in cities as we can picture city centers as the strongest gravitational attractors of the metropolitan area and redefine city boundaries. For example, borders could be defined as the locations where the potential falls below a fixed percentage from the highest peak of the city, separating thus the basins of attraction of the different centers. This can have an important practical relevance when planning infrastructures and public services.

Methods

Twitter data

We use geolocated Twitter data in big cities and conurbations to extract information on commuters mobility. Even if the number of users is smaller than the local population, it has been shown that this data is valid to study aggregated urban mobility at scales larger than 1 km² with a global coverage^18,20. Details on the procedure to download geolocated Twitter data are included in Supplementary Note 12. Our database is composed of tweets with coordinates in the area of Manchester–Liverpool, London, Los Angeles, Paris, Rio de Janeiro and Tokyo from March 2015 to October 2017. The information is then mapped into a regular square grid of 1 km². Tweets on Saturdays and Sundays, people moving faster than 200 km/h, users tweeting more than once per second, people tweeting <10 times in the whole time window and for less than one month have been filtered out. We consider the interval from 8 AM to 8 PM in local time as working hours, tweets in this interval are supposedly posted from the work place. Similarly, the rest of tweets are assumed to be posted from home. We assign to every user a home and a work cell as the most common cells during the corresponding hours. With this information, we can assume a daily trip from home to work for every user and another one back. Aggregating trips we can generate an OD matrix for the whole city, where each element T_ij contains the number of people commuting from cell i to j. The OD matrices represent generic levels of daily mobility and are used to determine trip demand for urban planning. The trips are not assigned to a particular moment in the data time window. To avoid noise due to poor statistics, we filtered out cells with <5 people as residents or workers.

A minor issue can raise with the misclassification of night-shift workers. A possible solution tested in⁵⁰ is to assume that the place with largest activity corresponds to work. However, this procedure was designed for more exhaustive data such as mobile phone records and it may introduce new biases with Twitter data. Still, the fraction of night-workers is only 10% of the total workforce in London, and less than 11% in the whole UK (see [https://www.tuc.org.uk/news/260000-more-people-working-night-past-five-years-finds-tuc] for more details). The night workers mobility, even if misclassified, is part of the general daily mobility flow of the city. Finally, the census data is free from this issue since the questionnaire explicitly asks for residence and working places and the results are consistent for both data sources.

Census data

In addition to the Twitter data, the same study is repeated with census data from France and the United Kingdom. This data is publicly available on governmental web sites (FR, https://www.insee.fr and UK, https://www.ons.gov.uk/census/2011census). Census output areas have heterogeneous shapes different for every country and they do not compose a regular grid. A further treatment has to be carried out to adapt the population distribution and the home-work OD matrix to the grid. This introduces uncertainty that is not present in the Twitter data. Detailed information on how to divide and rearrange heterogeneous census areas into a square grid is provided in the Supplementary Note 5 and Supplementary Fig. 12. Thresholds on number of inhabitants and workers have been applied as well to avoid considering non statistically relevant zones. A method to assign a threshold to each city is provided in the Supplementary Note 2 and Supplementary Fig 11.

Numerical calculation of the curl

Given a vector field evaluated in the cells of a grid, it is possible to calculate the curl using the central finite differences⁵¹ discretization method. The curl of $\overrightarrow {\mathbf{W}}$ in the cell i, whose indices in the x- and y-directions are (α, β), is determined as:

$$\begin{array}{c}\nabla \times \overrightarrow {\mathbf{W}} _i = \frac{{Wy_{(\alpha + 1,\beta )} - Wy_{(\alpha - 1,\beta )}}}{{2{\kern 1pt} \Delta x}}\\ - \frac{{Wx_{(\alpha ,\beta + 1)} - Wx_{(\alpha ,\beta - 1)}}}{{2{\kern 1pt} \Delta y}},\end{array}$$

(3)

where Δx and Δy are side sizes of the cells in the x- and y-directions, and Wx and Wy are the x and y components of the vector $\overrightarrow {\mathbf{W}}$, respectively, evaluated in i and its nearest neighbors in the grid. The curl only has component in the z-direction since the vector $\overrightarrow {\mathbf{W}}$ lays on the x–y plane.

Numerical calculation of the flux

The definition of the flux as a perimeter (surface) integral is

$${\it{\Phi }}_W^S = \oint_S {\overrightarrow {\mathbf{W}} } {\kern 1pt} \overrightarrow {\mathbf{n}} {\kern 1pt} d\ell$$

(4)

for the vector $\overrightarrow {\mathbf{W}}$ and

$${\it{\Phi }}_T^S = \oint_S \overrightarrow {\mathbf{T}} {\kern 1pt} \overrightarrow {\mathbf{n}} {\kern 1pt} d\ell$$

(5)

for $\overrightarrow {\mathbf{T}} {\kern 1pt}$. In both cases, the integral is performed over the perimeter S, $d\ell$ is the infinitesimal element of length and $\overrightarrow {\mathbf{n}}$ is the unit vector normal to the perimeter in each point.

From a numerical perspective, the integrals are calculated as

$${\it{\Phi }}_W^S = \mathop {\sum}\limits_{i \in S} {\overrightarrow {\mathbf{W}} _i} {\kern 1pt} \overrightarrow {\mathbf{n}} _i{\kern 1pt} d\ell ,$$

(6)

$${\it{\Phi }}_T^S = \mathop {\sum}\limits_{i \in S} {\overrightarrow {\mathbf{T}} _i} {\kern 1pt} \overrightarrow {\mathbf{n}} _i{\kern 1pt} d\ell ,$$

(7)

where the index i runs over all the cells intersecting the perimeter S, $\overrightarrow {\mathbf{n}} _i$ is the unit vector normal to the surface in i and $d\ell$ is approximated by the total perimeter of S divided by the number of intersecting cells. The flux as a volume integral of the divergence is calculated as

$${\it{\Phi }}_W^V = \mathop {\sum}\limits_{i \in V} {\left( {\frac{{Wx_{(\alpha + 1,\beta )} - Wx_{(\alpha ,\beta )}}}{{\Delta x}} + \frac{{Wy_{(\alpha ,\beta + 1)} - Wy_{(\alpha ,\beta )}}}{{\Delta y}}} \right){\kern 1pt} dV}$$

(8)

with the location of cell i in (α, β), as above, the index i runs over the cells in the volume V and dV is the area of the unit cell. The cells without resident commuters, m = 0, do not exhibit outflows and, to avoid inconsistencies, the field is defined as null in them. This implies that they do not contribute to the calculation of the flux or other results. Note that this is different from the classical continuous approaches of field theory in physics (e.g., electric or gravitational fields) where the field is defined everywhere and always contributes to the net flux.

Gravity model

The equation for the flow of commuters between two areas i and j with an exponential deterrence function is

$$T_{ij} = k{\kern 1pt} m_i{\kern 1pt} m_j{\kern 1pt} e^{ - d_{ij}/d_0},$$

(9)

where k is a constant, m_i,j are the populations of origin and destination areas i (j), d_ij is the distance between them and d₀ is a characteristic distance. This is the linear version of the Gravity Model, where the output and input flows are proportional to the number of people in the area. The model has only two parameters to fit (k and d₀). The vector field is obtained by summing over the possible destinations and dividing by m_i. If $\overrightarrow {\mathbf{u}} _{ij}$ is the unit vector pointing from i to j, the vector field can be written as

$$\overrightarrow {\mathbf{W}} _i = \mathop {\sum}\limits_j {\frac{{T_{ij}}}{{m_i}}} {\kern 1pt} \overrightarrow {\mathbf{u}} _{ij} = k\;\mathop {\sum}\limits_j {m_j} {\kern 1pt} e^{ - d_{ij}/d_0}{\kern 1pt} \overrightarrow {\mathbf{u}} _{ij}.$$

(10)

Radiation model

The Radiation Model is inspired by radiation and absorption of particles³¹: for every worker residing in and leaving cell i, the destination (work) cell j is obtained using the probability expression

$$P(i,j) = \frac{{m_i{\kern 1pt} m_j}}{{(m_i + s_{ij}){\kern 1pt} (m_i + m_j + s_{ij})}},$$

(11)

where s_ij is the population residing in a circle centered in i, with radius d_ij and excluding the populations of i and j. The average flows can be calculated as 〈T_ij〉 = T_iP(i, j), where T_i is the empirical total outflow of cell i.

Numerical calculation of the potential

The potential is calculated by numerically solving the equations −∇V_i = $\overrightarrow {\mathbf{W}} _i$ taking into account that ∇ × $\overrightarrow {\mathbf{W}}$ = 0. For the computation of the empirical potential, we used conditions V = 0 in all the boundary regions of the grid and then use the forward centered discretization formula for the gradient operator⁵¹ starting from the city bounding box corner. In a cell i with indices (α, β), this operation becomes:

$$\frac{{dV_i}}{{dx}} = \frac{{V_{\alpha + 1,\beta } - V_{\alpha ,\beta }}}{{\Delta x}} = W_{(x),\alpha ,\beta },$$

(12)

$$\frac{{dV_i}}{{dy}} = \frac{{V_{\alpha ,\beta + 1} - V_{\alpha ,\beta }}}{{\Delta y}} = W_{(y),\alpha ,\beta },$$

(13)

The procedure is iterated until all cells have been assigned a potential. We average then the resulting potentials after starting from every corner of the bounding box to decrease the noise.

Data availability

In this work, we use two data sources: Geolocated Twitter and census in the UK and France. All the data are available online, although in all cases the access conditions require the user to obtain the data directly from the provider sites. For the census data, the 2011 UK commuting information can be found at output area level in the link [https://wicid.ukdataservice.ac.uk/cider/about/data_int.php?type=2] and 2011 French data at municipal level is available at [https://www.insee.fr/en/statistiques?categorie=1]. For Twitter, the data is downloaded using the streaming API [https://developer.twitter.com/en/docs/tweets/filter-realtime/overview]. An example of the script employed to obtain geolocated data in a geographical area is provided in the Supplementary Note 12. The aggregated information necessary to reproduce our results has been uploaded at the repository Figshare with doi: [https://doi.org/10.6084/m9.figshare.8158958]⁵².

Code availability

An example of the code used to collect Twitter data is provided in the Supplementary Note 12. The code for the analysis was programmed using Python and the equations employed are described in the Methods Section.

References

Bergstrand, J. H. The gravity equation in international trade: some microeconomic foundations and empirical evidence. Rev. Econ. Stat. 67, 474–481 (1985).
Article Google Scholar
Rouwendal, J. & Nijkamp, P. Living in two worlds: a review of home-to-work decisions. Growth Change 35, 287–303 (2004).
Article Google Scholar
Carra, G., Mulalic, I., Fosgerau, M. & Barthelemy, M. Modelling the relation between income and commuting distance. J. R. Soc. Interface 13, 20160306 (2016).
Article Google Scholar
Batty, M. The new science of cities. (MIT Press, Cambridge, 2013).
Barthelemy, M. The Structure and Dynamics of Cities: Urban Data Analysis and Theoretical Modeling. (Cambridge Univ. Press, Cambridge, 2017).
Viboud, C. et al. Synchrony, waves, and spatial hierarchies in the spread of influenza. Science 312, 447–451 (2006).
Article ADS CAS Google Scholar
Balcan, D. et al. Multiscale mobility networks and the spatial spreading of infectious diseases. Proc. Natl Acad. Sci. USA 106, 21484–21489 (2009).
Article ADS CAS Google Scholar
Balcan, D. & Vespignani, A. Phase transitions in contagion processes mediated by recurrent mobility patterns. Nat. Phys. 7, 581 (2011).
Article CAS Google Scholar
Tizzoni, M. et al. On the use of human mobility proxies for modeling epidemics. PLoS Comput. Biol. 10, e1003716 (2014).
Article Google Scholar
Ortúzar, J. & Willumsen, L. Modeling Transport. (John Wiley and Sons Ltd., New York, 2010).
Ewing, R. & Hamidi, S. Compactness versus sprawl: a review of recent evidence from the united states. J. Plan. Lit. 30, 413–432 (2015).
Article Google Scholar
Ravenstein, E. G. The laws of migration. J. Stat. Soc. Lond. 48, 167–235 (1885).
Article Google Scholar
Boyce, D. E. & Williams, H. C. Forecasting Urban Travel: Past, Present and Future. (Edward Elgar Publishing, Cheltenham, 2015).
Barbosa-Filho, H. et al. Human mobility: Models and applications. Phys. Rep. 734, 1–74 (2018).
Article ADS MathSciNet Google Scholar
González, M. C., Hidalgo, C. A. & Barabasi, A.-L. Understanding individual human mobility patterns. Nature 453, 779 (2008).
Article ADS Google Scholar
Bagrow, J. P. & Lin, Y.-R. Mesoscopic structure and social aspects of human mobility. PLoS ONE 7, e37676 (2012).
Article ADS CAS Google Scholar
Noulas, A., Scellato, S., Lambiotte, R., Pontil, M. & Mascolo, C. A tale of many cities: universal patterns in human urban mobility. PLoS ONE 7, e37027 (2012).
Article ADS CAS Google Scholar
Lenormand, M. et al. Cross-checking different sources of mobility information. PLoS ONE 9, e105184 (2014).
Article ADS Google Scholar
Hawelka, B. et al. Geo-located twitter as proxy for global mobility patterns. Cartogr. Geogr. Inf. Sci. 41, 260–271 (2014).
Article Google Scholar
Lenormand, M., Gonçalves, B., Tugores, A. & Ramasco, J. J. Human diffusion and city influence. J. R. Soc. Interface 12, 20150473 (2015).
Article Google Scholar
Blondel, D. V., Decuyper, A. & Krings, G. A survey of results on mobile phone datasets analysis. EPJ Data Sci. 4, 10 (2015).
Article Google Scholar
Carey, H. C. Principles of Social Science ume 3 (JB Lippincott & Company, Philadelphia, 1867) .
Zipf, G. K. The p1 p2/d hypothesis: on the intercity movement of persons. Am. Sociol. Rev. 11, 677–686 (1946).
Article Google Scholar
Stouffer, S. A. Intervening opportunities: a theory relating mobility and distance. Am. Sociol. Rev. 5, 845–867 (1940).
Article Google Scholar
Ruiter, E. R. Toward a better understanding of the intervening opportunities model. Transp. Res. 1, 47–56 (1967).
Article ADS Google Scholar
de Vries, J., Nijkamp, P. & Rietveld, P. Exponential or power distance-decay for commuting? an alternative specification. Environ. Plan. A 41, 461–480 (2009).
Article Google Scholar
Lenormand, M., Huet, S., Gargiulo, F. & Deffuant, G. A universal model of commuting networks. PLoS ONE 7, e45985 (2012).
Article ADS CAS Google Scholar
Liang, X., Zhao, J., Dong, L. & Xu, K. Unraveling the origin of exponential law in intra-urban human mobility. Sci. Rep. 3, 2983 (2013).
Article ADS Google Scholar
Chen, Y. The distance-decay function of geographical gravity model: Power law or exponential law? Chaos, Solitons Fractals 77, 174–189 (2015).
Article ADS MathSciNet Google Scholar
Ren, Y., Ercsey-Ravasz, M., Wang, P., González, M. C. & Toroczkai, Z. Predicting commuter flows in spatial networks using a radiation model based on temporal ranges. Nat. Commun. 5, 5347 (2014).
Article ADS CAS Google Scholar
Simini, F., González, M. C., Maritan, A. & Barabási, A.-L. A universal model for mobility and migration patterns. Nature 484, 96–100 (2012).
Article ADS CAS Google Scholar
Anderson, J. E. A theoretical foundation for the gravity equation. Am. Econ. Rev. 69, 106–116 (1979).
Google Scholar
Erlander, S. & Stewart, N. F. The gravity model in transportation analysis: theory and extensions. (VSP, Utrecht, 1990) .
Wilson, A. Entropy in Urban and Regional Modelling. (Pion, London, 1970) .
Sagarra, O., Pérez Vicente, C. J. & Díaz-Guilera, A. Statistical mechanics of multiedge networks. Phys. Rev. E 88, 062806 (2013).
Article ADS CAS Google Scholar
Sagarra, O., Pérez Vicente, C. J. & Díaz-Guilera, A. Role of adjacency-matrix degeneracy in maximum-entropy-weighted network models. Phys. Rev. E 92, 052816 (2015).
Article ADS CAS Google Scholar
Sagarra, O., Szell, M., Santi, P., Díaz-Guilera, A. & Ratti, C. Supersampling and network reconstruction of urban mobility. PLoS ONE 10, e0134508 (2015).
Article Google Scholar
Steward, J. Q. Empirical mathematical rules concerning the distribution and equilibrium of population. Am. Geogr. Soc. 37, 461–485 (1947).
Google Scholar
Heanus, K. & Pyers, C. A comparative evaluation of trip distribution procedures. Public Roads 34, 43–51 (1966).
Google Scholar
Pyers, C. Evaluation of intervening opportunities trip distribution models. Highw. Res. Rec. 114, 71–88 (1966).
Google Scholar
Lawson, M. & Dearinger, J. A comparison of four work trip distribution models. Proc. Am. Soc. Civ. Eng. 93, 1–25 (1967).
Google Scholar
Haynes, K. E., Poston, D. L. J. & Schnirring, P. Intermetropolitan migration in high and low opportunity areas: indirect tests of the distance and intervening opportunities hypotheses. Econ. Geogr. 49, 66–73 (1973).
Article Google Scholar
Okabe, A. A theoretical comparison of the opportunity and gravity models. Reg. Sci. Urban Econ. 6, 381–397 (1976).
Article Google Scholar
Masucci, A. P., Serras, J., Johansson, A. & Batty, M. Gravity versus radiation models: On the importance of scale and heterogeneity in commuting flows. Phys. Rev. E 88, 022812 (2013).
Article ADS Google Scholar
Yang, Y., Herrera, C., Eagle, N. & González, M. C. Limits of predictability in commuting flows in the absence of data for calibration. Sci. Rep. 4, 5662 (2014).
Article ADS CAS Google Scholar
Lenormand, M., Bassolas, A. & Ramasco, J. J. Systematic comparison of trip distribution laws and models. J. Transp. Geogr. 51, 158–169 (2016).
Article Google Scholar
Piovani, D., Arcaute, E., Uchoa, G., Wilson, A. & Batty, M. Measuring accessibility using gravity and radiation models. R. Soc. Open Sci. 5, 171668 (2018).
Article ADS MathSciNet Google Scholar
Arcaute, E. et al. Constructing cities, deconstructing scaling laws. J. R. Soc. Interface 12, 20140745 (2015).
Article Google Scholar
Lenormand, M. et al. Comparing and modelling land use organization in cities. R. Soc. Open Sci. 2, 150449 (2015).
Article ADS MathSciNet Google Scholar
Bassolas, A., Ramasco, J. J., Herranz, R. & Cantú-Ros, O. G. Mobile phone records to feed activity-based travel demand models: matsim for studying a cordon toll policy in barcelona. Transp. Res. Part A 121, 56–74 (2019).
Article Google Scholar
Hyman, J. M. & Shashkov, M. Natural discretizations for the divergence, gradient, and curl on logically rectangular grids. Comput. Math. Appl. 33, 81–104 (1997).
Article MathSciNet Google Scholar
Mazzoli, M. et al. Aggregated mobility data uploaded at Figshare repository. https://doi.org/10.6084/m9.figshare.8158958. (2019)

Download references

Acknowledgements

M.M. and A.B. are funded by the Conselleria d’Innovació, Recerca i Turisme of the Government of the Balearic Islands and the European Social Fund. M.M., A.B., P.C., and J.J.R. also acknowledge partial funding from the Spanish Ministry of Science, Innovation and Universities, the National Agency for Research Funding AEI and FEDER (EU) under the grants ESOTECOS (FIS2015-63628-C2-1-R and FIS2015-63628-C2-2-R) and PACSS (RTI2018-093732-B-C22) and the Maria de Maeztu program for Units of Excellence in R&D (MDM-2017-0711). M.L. received financial support from a grant of the French National Research Agency (project NetCost, ANR-17-CE03-0003).

Author information

Authors and Affiliations

Instituto de Física Interdisciplinar y Sistemas Complejos IFISC (CSIC-UIB), Campus UIB, 07122, Palma de Mallorca, Spain
Mattia Mazzoli, Alex Molas, Aleix Bassolas, Pere Colet & José J. Ramasco
Irstea, UMR TETIS, 500 rue JF Breton, 34093, Montpellier, France
Maxime Lenormand

Authors

Mattia Mazzoli
View author publications
You can also search for this author in PubMed Google Scholar
Alex Molas
View author publications
You can also search for this author in PubMed Google Scholar
Aleix Bassolas
View author publications
You can also search for this author in PubMed Google Scholar
Maxime Lenormand
View author publications
You can also search for this author in PubMed Google Scholar
Pere Colet
View author publications
You can also search for this author in PubMed Google Scholar
José J. Ramasco
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.M., A.M. and J.J.R. designed the study and contributed new conceptual tools. M.M., A.B. and M.L. cleaned and processed the data. M.M. and A.M. performed the numerical analyses. M.M. and J.J.R. developed the analytical treatment. P.C. and J.J.R. coordinated the study. All authors contributed to the discussion, to the writing and approved the paper.

Corresponding authors

Correspondence to Mattia Mazzoli or José J. Ramasco.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information: Nature Communications would like to thank Bernat Corominas-Murtra,Tim Evans and Pu Wang for their contributions to the peer review of this work. Peer review reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mazzoli, M., Molas, A., Bassolas, A. et al. Field theory for recurrent mobility. Nat Commun 10, 3895 (2019). https://doi.org/10.1038/s41467-019-11841-2

Download citation

Received: 18 January 2019
Accepted: 30 July 2019
Published: 29 August 2019
DOI: https://doi.org/10.1038/s41467-019-11841-2

This article is cited by

Symmetry breaking in optimal transport networks
- Siddharth Patwardhan
- Marc Barthelemy
- Filippo Radicchi
Nature Communications (2024)
Enhancing global maritime traffic network forecasting with gravity-inspired deep learning models
- Ruixin Song
- Gabriel Spadon
- Amilcar Soares
Scientific Reports (2024)
Inferring language dispersal patterns with velocity field estimation
- Sizhe Yang
- Xiaoru Sun
- Menghan Zhang
Nature Communications (2024)
Unravelling the spatial directionality of urban mobility
- Pengjun Zhao
- Hao Wang
- Jingzhong Li
Nature Communications (2024)
Human mobility description by physical analogy of electric circuit network based on GPS data
- Zhihua Zhong
- Hideki Takayasu
- Misako Takayasu
Scientific Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.