Introduction

Among non-pharmaceutical interventions (NPIs), limitations to mobility of various degree have proven to be a successful strategy towards mitigating the spread of COVID-19 in populations across the world1,2,3. In particular, starting with the first epidemic cluster in Wuhan, China, lockdown restrictions, i.e. full limitations of mobility on an entire territory, have been widely and effectively adopted4,5.

Lockdowns are particularly effective since they incorporate a wide range of more targeted interventions, among which two are worth mentioning here - the limitation on large gatherings, and limitation on long range travel3. After the initial widespread adoption of lockdown restrictions, several studies2,6,7,8 have focused on disentangling the specific effect of each NPI, fostering the debate on the appropriateness and effectiveness of lockdowns as policy interventions.

In fact, in addition to the direct effect of aiding in the containment of COVID-19, one major side effect of the limitations was a severe economic impact induced by the lockdown that has been felt globally. While online commerce bloomed9, traditional brick-and-mortar businesses took a severe hit10. As a consequence, governments have been criticized both for implementing lockdown too fast11,12 and too slow13, with restrictions too harsh and too lax.

Early studies on the economic effect of lockdown restrictions have highlighted their impact on consumption14,15,16, on the supply chain17, on inequality18,19 and on the overall economy20. The totality of the effect of lockdown restrictions on the economy, however, is still difficult to ascertain for several reasons. For one, the extent of the economic impact on various layers of the population is unknown at this point as the crisis is still ongoing. For another, different countries implement various economic stimuli to offset and ease the economic downturn, both on the personal and business levels. Lastly, with systems being as interconnected as they are, efficient trade-off evaluation of all possible effects (businesses closing, impact on education of all levels from primary school through higher education and research, strain on healthcare systems and the higher-order impact due to lack of doctors or equipment) is almost impossible in the short term.

Network Science21,22,23 had frequently been employed over the past two decades to investigate the behaviour of complex systems in multiple domains, ranging from ecosystems24,25,26 and biology27,28, through engineering29, urban traffic30 and computers31 to economics and finance32,33,34,35. Network tools have also proven useful in analyzing the spreading of the current pandemic in different contexts: from the direct estimation of the epidemiological evolution of the pandemic36,37, to the development of new modeling approaches38,39, to the analysis of mobility evolution1,40 and finally in relation to economic conditions and backlashes18,19,41.

Following this approach, in this work we develop a framework based on methodologies from network science and percolation theory22,42 to explore the effects of the COVID-19 pandemic on the network structure of human mobility in Italy before and during the epidemic of 2020, from its very beginning to the second wave. We employ rich mobility data of Italy provided through Facebook’s Data for Good program (available at https://dataforgood.fb.com/tools/disease-prevention-maps/) capturing movements of more than 3 millions of Italian individuals at a sub-municipal level and on a daily basis.

We use mobility data to construct the network of movements of individuals across territories, measuring how lockdown has affected the number and the intensity of connections among different areas of the country. In so doing, we aim to investigate the evolution of the Italian mobility network, measure the extent, beyond the immediately apparent one, of the mobility reduction after lockdown and draw conclusions regarding the existence and significance of an impact on the economy of lockdown policies.

Our hypothesis is that lockdown restrictions have increased the fragility of the Italian mobility network. What we want to further investigate is if this process has affected not only the network in general, which is plausible, but, more importantly, also the core of the network (i.e. the Giant Connected Component (GCC)). Furthermore we want to assess how long the disruptive effects of lockdown on the Italian mobility network have lasted even after restrictions were lifted. With respect to these objectives, the percolation methodology represents the ideal tool because it allows us to focus on the resilience profile of the network distinguishing between low impact and high impact effects.

Finally, we investigate the economic consequences of mobility restrictions. Using official statistics about the geographical distribution of income per capita, we are able to observe a significant correlation between mobility and levels of income, suggesting that one can be used as a proxy of the other. Moreover, the percolation analysis allows us to investigate if the most resilient part of the network, after lockdown restrictions, is also the richest one. By observing that the average income per capita in the GCC has increased after lockdown, we are able to uncover that restrictions have heightened the economic segregation of the country, leaving poorer territories disconnected from the mobility network.

While this work is not designed to make causal claims regarding the actual economic effect of lockdown restrictions, which, as we said, need to be disentangled from several concurrent phenomena, we believe that our analysis represents a useful tool to assess the potential economic impact of disruptive events such as the recent COVID-19 pandemic.

Network structure before and under lockdown

A network consists of elements, or nodes, connected to each other via links. In our case, the fundamental units (nodes) are rectangular tiles of areas in Italy (see “Materials and Methods”). We set an edge between two tiles if a sufficient number of people travel from one tile to another in a given period of time. The smallest window of time in our data is 8 hours, however we aggregate them at the daily, weekly and monthly levels when necessary. Because tiles typically do not represent any particular entities in the real world, we apply a grouping procedure to them, aggregating multiple tiles to match them with their corresponding administrative units. In the case of Italy three such groupings exist: Municipality-, Province- and Region-scale, where the former is a small geographic area (of the order of a township), the second is a larger area aggregating several Municipalities and the latter is a relatively large one. Thus, Italy consists of about 8000 Municipalities, over 100 Provinces and 20 Regions. Since the number of tiles in our data set is of the order of the number of Municipalities, we do not analyze that administrative division and the most granular analysis is conducted directly on the tiles. The left panels of Fig. 1 (map figures created using GeoPandas43) show the connectivity of Italy’s Regions (the coarsest level of aggregation) before (Fig. 1a) and during (Fig. 1b) the lockdown, while the right panel shows tile-level network characteristics. Throughout the analysis we identify four distinct periods, corresponding to different behavior of the metrics under analysis, as will be detailed below. Those are:

  1. 1.

    Pre-lockdown, baseline period, starting end of February, 2020 until March 8th.

  2. 2.

    Full lockdown, starting March 9th and lasting through May 18th

  3. 3.

    Partial restriction, whereby some limitations remained in place but others were lifted, May 19th till June 3rd

  4. 4.

    Removal of all limitations, June 4th until Mid July.

One fundamental characterization of a network is its degree distribution, P(k), where k, the degree of a node, is the number of other nodes it is connected with. The degree distribution shows the probability P to have a node of degree k. For random graphs, known also as Erdos–Renyi (ER) graphs, in which edges between nodes exist with some fixed probability p, this distribution follows the Poisson one, \(P(K)=\frac{(np)^ke^{-np}}{k!}\) where n is the number of nodes in the network which is assumed large44. In that case, the product np is the average degree of the nodes in the network. Importantly, the distribution of degree in ER graphs is relatively narrow around the mean and most nodes have a degree close to the mean, i.e., large deviations are exceedingly rare. Many real-world networks exhibit instead much broader degree distributions, often characterized by a power law, \(P(k)\propto k^{-\beta }\), with or without exponential truncation to accommodate cutoffs, finite size effects and other features which separate them from random graphs45. In such a distribution, the average degree \(\langle k \rangle\) is not representative of the entire network degree distribution, due to the presence of relatively few hubs that are connected to many more nodes than the typical node.

Edges connecting pairs of nodes need not be identical and may carry information with them. This information is typically regarded as an edge “weight” and may be useful in determining shortest paths, estimating travel time and assess the fragility of the network. In our case, edges carry two types of information. One is geographical, namely the geometric distance between two connected nodes, and the other is demographic, specifically, the number of people traveling along the edge. When edges weights are available, the simple degree may be generalized and augmented by a weighted degree, where the sum of edges weights incident on a node is taken, rather than the number of edges. Below, when discussing weighted degrees we refer to aggregating all people traveling to and from a certain node. As discussed in “Materials and Methods”, our edges are undirected.

We explore the global connectivity (i.e., the possibility to get from one node to another via existing edges) through the functional and the pure network perspectives (Fig. 1). From the functional perspective, we note the connectivity of the largest-level aggregation of our data, namely, Italian Regions. This view allows us to appreciate the significant drop in the connectivity between large parts of the country. The simple unweighted degree of the Regions (as seen in the colored maps in panels (a) and (b) of Fig. 1) can be viewed as a qualitative effect of the mobility restrictions. During lockdown connectivity dropped across the country such that interactions between different Regions (the coarsest aggregation level) almost disappeared. Advancing to more quantitative characteristics of the mobility network, we note the drop in the average daily weighted degree, shown by the color of the links, which measures the daily number of people moving between locations. We define levels of mobility as the total number of people moving in or out of an entity (a node, Province or Region), that is, the aggregate weighted degree incident on an entity.

Another important feature of a network is the extent to which nodes are connected globally to each other, which can be investigated by measuring the size of the network components. A component of a network is a set of nodes reachable from one another. Among components it is particularly informative to analyze the largest one, typically called the Giant Connected Component (GCC)23. This is because in complex networks above a certain degree of connectivity (an average degree of 1 in the case of ER networks, known as the percolation threshold) most nodes of the network are reachable from any other node, that is, most of the network belongs to the GCC. When the network becomes fragmented (in our case via the removal of edges due to mobility restrictions), less nodes tend to belong to the GCC and the number of smaller components increases, as may be observed in Fig. 1c and d. Interestingly, comparing the behavior of the mean weighted degree with that of the GCC of the network, we find a striking similarity, which can also be seen in the component sizes (SI) and will surface in other aspects of our analysis (resilience). There is no particular mechanism driving that - and yet the drop in average weighted degree traces very closely that of the component size (Fig. 1c). As the GCC decreases, the network becomes more fragmented, leading to an increase in number of components. We report values for the numbers of non-isolated components and components larger than five nodes (Fig. 1d), as well as their sizes (Fig. 1e). Finally, we measure the reachability within the changing GCC. Interestingly, as the GCC shrinks during lockdown, the different nodes effectively become more distant as the average path length and diameter of the component increase (Fig. 1f).

Figure 1
figure 1

Network characteristics of traffic before and during lockdown for the coarsest (left) and finest (right) resolutions. (a) Region-level mobility before lockdown. Region color shows the daily average simple degree for business days before lockdown measure were implemented. The edge colors show the log of the weighted link, i.e. the daily number of people traversing each edge. (b) Mobility during lockdown, color scales matching (a). Many of the existing edges have failed, and those that remain are much weaker. (c) Average weighted degree (teal) and size of the Giant Connected Component (GCC, red) averaged over a week, error bars showing the standard deviation for the period. (d) Number of non-isolated components (teal) and number of components with more than 5 nodes in them (red). (e) Average size of: non-isolated components (teal), and components larger than 5 nodes (red). (f) Average shortest path length (teal) and the diameter of the GCC (red).

Scaling

Figure 2
figure 2

Comparison between pre-lockdown and lockdown of (a) weighted degrees, (b) Unweighted degree (for the degree distribution we also show a stretched exponential fit in the SI), (c) edge weights and (d) distances. Teal and dark red lines show the Complimentary Cumulative Distribution function (CCDF) of the weighted degrees, weights and distances before and during the lockdown, while the dashed lines show the stretched exponential (top) and power law fits (bottom) to the data.

Next, we examine how some of our tile-level network’s characteristics behave before vs during lockdown. We analyze the PDFs of some network characteristics and find that they can be approximated by power-laws46 (see section S2 in the SI for additional possible fits). The probability distribution of the data, together with the fitted power-law functions is shown in Fig. 2. Starting with the degree distributions, we note the larger exponents during lockdown (Fig. 2a and b). Both the weighted and simple degree distribution contract which means less people (weighted degree) travel to fewer places. That is backed up by examining the distributions of the edge weights (Fig. 2c) and distances traveled (Fig. 2d).

Weight distribution tells probably the most important story of the positive and negative aspects of the lockdown. The reduction of people traveling along edges assists in reducing contagion1,47,48,49 but also induces a significant economic impact10,18,20,41. We note here the entire distribution shrinks, both the typical and extreme values. The relative ubiquity of the impact is elaborated in section "Local impact and recovery". The extent to which reduction in mobility affects the economy as a whole is discussed in section "Economic impact".

To conclude our exploration of scaling behavior we revisit an often-measured characteristic of human mobility, namely, laws governing distances traveled50,51,52,53. Two points are worth noting here. The exponents observed here are noticeably larger than the findings on human traveling in Ref.51. Several sources of the difference may be identified. One is the specific geometry of Italy, characterized mostly by relatively short distances. The typical width of the country is about 300km, while the distance between Naples and Milan, two economic centers relatively far from one another, is close to 900km. Due to that, we do not expect to find enough distances much greater than 300km. Indeed, we can see in Fig. 2d that before the lockdown the slope changes and become larger at about 200km. Another important factor comes from our data limitations relating to our sampling period being 8 hours and the lower bound on number of at least 10 people to establish an edge between two tiles (see “Materials and Methods” for details). Taking those limitations into account, however, the scale-free behavior of human mobility is approximately reproduced in our analysis. More importantly, we find that under the lockdown restrictions, the scaling laws for weights and distances still hold, while the exponents increase further, shrinking the covered distances. For the distances, as opposed to weights, where the entire distribution shrank, the range of the distribution does not change much, with the farthest points observed before the lockdown being reachable during lockdown as well. Moreover, the most prominent change takes place in the medium distances, between roughly twenty and two hundred kilometers, the distances that cover (with some excess) what would typically be the daily commute. Workers related to essential activities, however, are probably still present and travel to distances comparable to those before the lockdown41,54.

Local impact and recovery

Figure 3
figure 3

Clockwise from top-left: Baseline map of Italian Provinces; Reduction in mobility during Lockdown; Partial lift of limitations; All limitations lifted. The color bar indicates the reduction in weighted degree for each Province, with darker blue meaning stronger reduction. The lower panels show the gradual “return to normal”, with the majority of Provinces behaving similarly, but a few noticeable outliers as described in the main text and Fig. 4.

Previous section analyzes mobility between individual tiles, whereas here, in order to take a step toward assessing potential economic implications, we aggregate mobility across Provinces—a more granular perspective than the Regions in Fig. 1 but containing economic meaning as compared to the individual tiles. The lockdown (top-right) panel of Fig. 3 shows the full impact of the restrictions. For non-zero distances (i.e., excluding people who stay within the same tile for the measurement period, as explained in “Materials and Methods”), mobility decreased dramatically, with Provinces dropping between 50 and 90%. The lower two panels, perhaps, hold the more interesting and unexpected effects of the lockdown. While some Provinces (as will be detailed in Fig. 4) rebound relatively quickly, even when only some of the limitations are lifted, other Provinces remain far below their initial levels throughout the time frame shown.

Indeed, Fig. 4a highlights the relative similarity of the impact suffered by the most mobile Provinces, those with the highest weighted degree. Those Provinces show a consistent, close to 60%, reduction of mobility during lockdown and an equally consistent recovery when restrictions are gradually lifted. This is contrasted by the higher variation seen in the lower panels of Fig 3 and detailed by Fig. 4 for Provinces with lower initial mobility (weighted degree). A noticeable difference exists in the level of recovery, with some of the most impacted Provinces, such as the southern Province of Cosenza and the northern Aosta Valley, remaining well below initial levels after restrictions are lifted, while others, such as the coastal Provinces of Genoa and Olbia-Tempio recovering close to, and above, their initial levels.

Figure 4
figure 4

(Top) Most mobile-Provinces with the highest weighted degree, and (Bottom) most impacted Provinces and their recovery through partial restrictions to complete lift of limitations.

To generalize those statements we compare mobility levels at the three stages, lockdown, partial restrictions, and lifting of restrictions to the initial levels (Fig. 5). The level to which the relations are linear over the various phases of mobility limitations is very surprising. This evidence will be exploited in section "Economic impact" to assess the economic impact of mobility restrictions. Moreover, we note that overall mobility had not returned to pre-lockdown levels in the post-lockdown phase, but roughly to 87% of those levels, which may thus suggest the presence of long-lasting negative effects in the structure of the Italian mobility network persisting the removal of mobility restrictions.

Figure 5
figure 5

Uniformity in damage. (a) A linear relation between the lockdown levels of mobility and the initial ones shows a 60% drop, as seen via the regression coefficient, consistent with  Fig. 4a. (b), (c) show the relations remain very closely linear and uniform in the following stages of partial (b) and full (c) lift of limitations. We note that following the removal of all limitations, overall mobility is still about 15% below its initial levels. Zooming into the smaller Provinces we see the variation mentioned in Fig. 4b while still largely following the overall trend (d) traces the slope of impact vs. the averages pre-lockdown levels over time. We see again the shape encountered in Fig. 1c and f. Insert - the distributions of impact magnitude, i.e. weighted degree drop per time frame.

A resilience perspective

Figures 1c and 3 may lead us to assume things were, to some extent, almost back to normal when restrictions were lifted. We wish, however, to employ a methodology taken from traffic jam and service quality domain42 to show this is not, in fact, the case. For that we return briefly to the tile perspective taken in section "Scaling". For every edge between two tiles we calculate the distribution of its weight across time. We then determine a maximal value for that weight, \(W_{max}\), as each edge’s 95th percentile. Now we introduce a parameter q, the fraction of that maximal weight \(q=W_q/W_{max}\) and we state that for a given level of q, an edge whose current weight is above \(W_q\) is considered functional and below it - the edge does not exist. This may be motivated from several perspectives. The obvious approach is to say that if an edge exists but it is only a small fraction of what that edge had been previously - we cannot state that connectivity between the two nodes the edge joins had really been restored. An argument closer to our economic analysis in the previous and following sections may say that there exists a non-linear relation between the number of people traveling along an edge and the value the edge carries. That relation may come from a skewed distribution of people’s income, the non-uniformity of economic activity of the travelers or other similar notions. Because of that relation we want to see not only which edges remain, but which edges carry sufficient weight to carry economic value.

Armed with this approach we now modify the parameter q and analyze the changes in network connectivity. We start by asking whether or not the edges that fail under our q-perturbation differ during changing conditions. The result is shown in Fig. 6a. We find that initially (before the lockdown) the network is insensitive to the removal of low-value edges. That is, only at high levels of q do significant edges fail. During lockdown, however, the picture changes. Now we see that many edges are at 20%-60% of their maximal levels and the weight they carry is very substantial. Thus the network is much more brittle than it was initially and edges carry only a small fraction of their maximum historic capacity. We can ask, then, how is this under-the-surface fragmentation reflect on the GCC, which is often considered the main functioning component of the network. The standard presentation42 relates the level at which the GCC fragments, termed \(q_c\), to network strength and resilience, showing that high \(q_c\) corresponds to a robust network, while low levels of \(q_c\) characterize a weak, fragile network. Figure 6b shows at which value of \(q=q_c\) the GCC breaks down, i.e. split into noticeably smaller components. We note, again, that before the lockdown it happens only close to \(q=1\), that is, when the edges are removed only when they are marginally below their maximal value. What this means, simply, is that before the lockdown, the edge weights are relatively stable and close to their typical peaks. During lockdown the network becomes much more fragile and breaks down at much lower levels, at \(q_c\) close to 0.2. That means the edges that are present - are nearly five times weaker than in normal times. On one hand, that is to be expected. We claimed throughout the analysis mobility had been highly impacted. On the other hand, we saw in Fig. 5a that node weighs dropped by about 60% on average - but the surprising detail here is that the global mobility is actually more fragile, there are many critical edges that break under very light pressure. But perhaps most interesting is the situation post lockdown, when everything, presumably, goes back to normal. Looking at nominal edges, and even to some extent node degrees and distributions, it may be assumed that this is indeed the case. However, under the magnifying glass of q-analysis we see that picture is not so clear. Below we explore the extent to which the network remains damaged.

In order to further understand the lasting impact of the lockdown, we compare the nodes that form the GCC at a given q-level over time. To this end we calculate what is known as the Jaccard coefficient, which is a straight-forward metric to measure node similarity. It is defined as the size of intersection of two sets divided by its union:

$$\begin{aligned} J(q)=\frac{GCC_1(q)\cap GCC_2(q)}{GCC_1(q)\cup GCC_2(q)} \end{aligned}$$
(1)

where in general \(GCC_1, GCC_2\) are nodes belonging to the GCC in two different time points and in the case of Fig. 6c \(GCC_1\) is taken at a random business day before the lockdown and \(GCC_2\) is calculated for every business day in our data set. We perform this calculation for a range of q values and discover (Fig. 6c) that the GCC, as constructed at various levels of q, 40% and up, retains only about 50%-70% of the initial nodes. Moreover, during the lockdown itself, when the simple intersection shows a decrease of 60%-70% as seen by other metrics in Figs. 1, 2, 3, 4 and 5, even for very low levels of q, the GCC is almost entirely gone, bearing only about 10% similarity to the initial structure. We now want to use this understanding to tie together the levels of mobility and observed decline of network functionality to the actual levels of economic activity.

Figure 6
figure 6

Network under q-Resilience. (a) The distinct characteristics of failing weights during lockdown vs the other periods. We calculate and average the weights of newly failed edges at every level of q throughout the various periods. In particular, during the lockdown the network is particularly brittle, and moreover, the edges lost at relatively low q’s carry comparatively high weights, reflecting the stronger impact, (b) Fracturing of the GCC at various q-levels over time. As can be seen, during, but more importantly, after the lockdown the network remains susceptible to failure for only very slight perturbations—the links that are present are roughly 20% of the original weights. The GCC is in effect very weakly connected. (c) Jaccard similarity for different levels of Q. Here the fragility of the network is even more prominent - if we ask when the edges regain their original strength - the answer is they do not, and many of them remain at around 60% of their initial levels.

Economic impact

Reliable and accurate data on economic activity from official sources takes a long time to measure and evaluate. There are many reasons for that, some include multiple sources of information that need to be gathered and unified. Others just due to the fact that economic and financial information is reported long after the occurrence of the phenomenon. For instance, a company would file its fiscal year reports only towards the end of the following year’s first quarter, and similarly national accounts are typically released with some months of delay.

However, mobility patterns can be readily observed in near-real-time via various methods. One such method is employed in this manuscript, relying on cellphone data. Other providers of such data exist and often share or sell this data so it can be observed in very high spatio-temporal resolution. Other measures include public transportation usage, sensors that count numbers of vehicles on the roads and more. In short, mobility is relatively easy to observe.

Here we propose the use of mobility data as a proxy for the economic performance. As an example, Fig. 7a shows the correlation between Italian Provinces’ Gross Domestic Product (GDP) and their mobility-based weighted degree. The official economic data used in this figure is only available until 2017. Newer data is available only for the regional level, for 2018, while if we are interested in quarterly data of 2019 and onward, we can only look at the country level. This creates a difficult situation for policy makers and stakeholders when economic impact or distribution of financial aid need to be determined at a granular spatial level of intervention.

Thus, we come to the concluding section of our analysis, bringing together the various network insights gathered so far into an effective fine-grained decision-aiding tool (see “Materials and Methods” for further details). This tool is made available due to two interesting facts. One, as seen in Fig. 7a, there is a close relation between a Province’s mobility performance and its GDP. In fact, the levels of mobility correlate with the GDP more than 60%: \(corr(GDP, \langle K_w \rangle ) = 0.64,\) Pearson’s correlation coefficient, with 95% CI [0.53,0.74] (we present additional metrics tying together GDP per capita to average degree, as well as “normalized” average degree, and details on correlation methodology in the SI). This means that mobility is a reasonable proxy for the economy. While the average degree employed here is not a good proxy for the typical degree, as is expected for a fat tailed distribution, it does correlate well with the economic indicators. In fact, it is quite reasonable that the highest degrees are more representative of economic activity than are the lower, or typical ones. More on that in Sup. section S3, we next compare OECD data which calculates economic activity based on multiple attainable sources on the national level55 to produce an estimated GDP (EGDP) with our mobility metrics. While this is an approximate metric, it analyzes various Internet search patterns related to economic activity and shows a close prediction (to within its margin of error) to the officially published quarterly national GDP56. Figure 7b shows how closely the average mobility weighted degree (teal, left axis) follows the proposed economic metric (red, right axis). The correlation here is even higher, \(\rho (EGDP, \langle K_{w,p} \rangle ) = 0.89\), and the weekly updates \(\Delta EGDP = EGDP(i) - EGDP(i-1), \Delta \langle K_w\rangle =\langle K_w(i)\rangle - \langle K_w(i-i)\rangle\), are also very closely tied, \(\rho (\Delta EGDP, \Delta \langle K_w \rangle ) = 0.58\).

While the causal relations between affected mobility and economic impact may be far from trivial (although the intuition that people, when traveling, conduct various business activities, is straightforward), we now have a means to offer both a national and a regional or local estimate of impact to the economy. Specifically, the remarkably linear relations of Fig. 5 can help us give a rough estimate of a uniform impact to Provinces’ GDPs. However, the deviations from that linear relationship, as seen from the mobility patterns of each individual Province, as shown above (section "Local impact and recovery"), together with the close relations between GDP and mobility as shown here, can allow for a finer-grained, Province-specific analysis, and give in-depth expectations about heterogeneous local economic impact from the observed dynamics of mobility. Figure 7c–f show a sample of such forecasts. The process to obtain the estimates is detailed in “Materials and Methods”.

Finally, in order to uncover heterogeneous responses of Italian territories to mobility restrictions which may signal competing economic trajectories of local economic systems, we show in Fig. 8 how the network of Italian mobility relates to average income levels. This perspective complements the previous analysis by looking solely on the constituents of the GCC component, i.e. the economic units that stay connected even when restrictions are in place, and their relative economic strength. We consider the network of mobility constructed at the Municipality level and income per capita referred to 2019, hence before the outbreak of the pandemic. The red line in Fig. 8 shows the dynamics of GCC, while in green we report its number of nodes. The Italian mobility network is composed by a main connected group of Municipalities which are on average associated with higher levels of income per capita than those more peripheral in the network. Interestingly, after the deployment of lockdown restrictions we note a remarkable reduction in the size of GCC, while the economic well-being of the remaining population, as seen through the average income per capita, increases. Hence, a relevant effect of mobility restrictions put in place to contain the spread of the virus is the emergence of an enforced economic segregation, where Municipalities with high-income population tend to be connected while Municipalities with poorer economic conditions of their population are excluded from the main component of the Italian mobility network. This effect vanishes once mobility restrictions are lifted, with a rapid reversal to the pre-lockdown configuration. In addition, the percolation process is informative to identify those Municipalities which may exit GCC when the network starts deteriorating. As shown in Fig. 6, lockdown restrictions determine a sharp decrease of the parameter q. As a consequence, Fig. 8 highlights how during the lockdown on average the configuration reached before the dissipation of GCC is very similar to the one at \(q=0\). Nevertheless, during business as usual periods (pre-lockdown phase) or after the removal of lockdown restrictions, we notice that progressively nodes that are removed have lower average income per capita than those that remain in the GCC. Once again, the core of the GCC seems to point to the presence of highly connected nodes that have on average also better economic conditions of their population.

Figure 7
figure 7

Inferring the economy from mobility. (a) The correlation between the GDP levels of individual Provinces for 2017 and the weighted Province degree. Log of values is shown to highlight the relation holds regardless the strongest Province economies. Inset-shifted correlation to validate the significance of the relationship (b) The Average weighted degree, teal, left axis; and the EGDP estimates, red, right axis. Here, as in the right panel of Fig.  1 we see the similarity in dynamics. (c)–(f) The Province-based GDP forecast based on mobility data for Turin, Padua, Como and Pisa Provinces. The thick line is the average forecast, with the shaded area showing the 25–75 percentile range.

Figure 8
figure 8

Average income per capita and total number of nodes in the GCC for every week of the period of observation. The GCC is either calculated at the start of the percolation process, i. e. without removing any nodes (GCC at \(q=0\), red and green lines), or it is obtained by removing edges incrementally by weight until the optimal q is reached but before the GCC is dissolved (GCC at \(q_c\), blue and yellow lines). The lines represent the mean value of income per capita of the nodes or the total number of nodes in the GGC, averaged over one week of observations.

Conclusion

We analyze in detail the changes in mobility patterns in Italy induced by the lockdown measures. We show the fragmentation and disconnection they brought. While it is without a doubt the intention of those measures, we show also, using concepts borrowed from percolation theory, that the impact goes much deeper than is seen superficially. In fact, the network after the removal of limitations is still far more fragile than it was initially. We estimate the overall impact and the deviations from that uniformity for various Regions. Combining those deviations with close relations between mobility and economic conditions, we present a novel approach to estimate hard-to-observe economic activity from easily observable mobility patterns. These perspectives on the network resilience and local economic impact of lockdown measures may be used by policy makers to estimate more accurately and promptly the costs and benefits of potential additional restrictive measures.

Materials and methods

Data

The data used for the analysis is provided under academic license agreement with Facebook through the “Data for Good” program and is available at https://dataforgood.fb.com/tools/disease-prevention-maps/. Economic data is taken from the Istituto Nazionale di Statistica (ISTAT) http://dati.istat.it/ and the Ministero dell’Economia e delle Finanze (MEF) https://www.mef.gov.it/documenti-pubblicazioni/open-data/index.html.

Network structure

As described in section "Network structure before and under lockdown", we construct our network by observing people move between various locations. For ease of processing, continuous space is divided by Facebook into tiles, approximately 7x7km each, covering the entire country. When calculating overall statistics, such as degree distributions, component sizes and shortest paths we analyze the network using those elements. However, in order to make our research more meaningful from a geographic and economic perspective, we also map those tiles to Italy’s Regions and Provinces based on the the location of a tile’s center. For that we aggregate all relevant node information into a higher-level node (Province or Region) by summing all the edge weights incident on the appropriate nodes.

Edges represent the number of people traveling within an eight-hour period between two nodes. That is, if over a period of eight hours, \(N > 10\) people were located at nodes u and v then and edge will exist between the two nodes and its weight will be N. The reason for this limitation lies in privacy restrictions, where a smaller number of people may make it possible to identify specific individuals, so information is only exposed where such deanonymization is unlikely.

Those two limitations carry noteworthy meaning to our analysis. For one, the relatively short time period of eight hours is shorter than the typical time frames in human mobility research. That leads directly to shorter typical distances traveled. Secondly, the limitation on quantity can also directly lead to the rarer but longer edges being omitted from our data set.

Self-loops reflect a situation where, throughout the measurement period, a person is not recorded outside their initial tile. Those events are frequent in our analysis, more so at the large aggregation levels. Those loops effectively mean people stay in the initial location, whether a tile or a Region. Since we are interested in connectivity between various locations, those loops are removed from our analysis.

Information is only available about the presence of people in two locations, without time ordering, so our edges are undirected. This precludes us from performing Origin-Destination analysis, or to follow explicitly the direction of flow of people

Percolation and resilience analysis

Percolation takes its origins in the analysis of disordered systems, where matter is modeled as sites that may or may not be occupied with probability \(p_n\) (node percolation) or sites between which edges are present or absent with probability \(p_e\). The question is then asked whether or not a Giant Connected Component exists, that is, whether the largest component is extensive, i.e., proportional to the original system size. Originally developed around regular systems, such as two-dimensional or three-dimensional lattices, network theory expanded the notion to random networks, where each “site” has potentially a random degree taken from a given degree distribution. Specifically, important differences appear when the degrees are drawn from a Poisson distribution (Erdos–Renyi, (ER) networks, where every two nodes are connected by an edge with probability p) or a scale free (SF) degree distribution, where node degrees are distributed according to a power law, \(P(x) \propto x^{-\beta }\) with usually \(2< \beta < 3\). In the former case the network experiences a percolation transition when the average degreee, \(\langle k \rangle\) crosses the threshold of 1, where below that value the GCC increases slowly with network size, N, and its relative fraction of the entire network goes to zero as N goes to infinity21,23. SF networks, on the other hand, do not experience such a transition, and rather have a non-zero GCC for any nonzero average degree. It is often assumed for real-world networks that in order for a node to be functional, whether the node is a financial institution, a geographical location or an infrastructure component such as a power plant29, it needs to be part of the GCC. While we do not suggest Regions, Provinces and Municipalities disconnected from the GCC cease to function, we do suggest their participation in economic activity is impaired. Since there can only be one GCC in a network21,23, we assume only nodes connected to it operate at their fullest capacity. Thus, as described in section "Local impact and recovery", we define a critical level of q at which the reduction of the GCC is largest, thereby reflecting the most significant impact to the economic activity.

Province-based GDP forecast

To generate GDP forecasts per Province we combine the close relations between mobility and GDP, both statically (Fig. 7a) and dynamically (Fig. 7b). For that we rewrite the GDP over time for a Region as

$$\begin{aligned} GDP_t = GDP_0 + \sum _{i=1}^{t} \Delta GDP_i \end{aligned}$$
(2)

where \(GDP_0\) is the initial level (here taken as the value at 2017 as the latest published data point for Provinces, but may be replaced with a more recent estimate), and the weekly updates \(GDP_i\) are estimated using the dynamic relations. We may also rewrite the process as a multiplication of percentage changes, more in keeping with economic literature, but no additional insight is gained by it and the equations become more cumbersome. We now regress the weekly GDP changes against the mobility updates as

$$\begin{aligned} \Delta GDP_i = \alpha \cdot \Delta \langle K_i \rangle + \beta \end{aligned}$$
(3)

for the initial estimate. Then, based on the correlation between \(GDP_i\) and \(\Delta \langle K_i \rangle\) we generate correlated random variables from the latter and substitute 3 into 2. The last part is done so that statistics can be generated around the average levels, that may be derived directly by plugging the relevant Province’s \(\langle K_i \rangle\) into 3. We now have all the components to calculate both the expected \(GDP_t\) and the range it may take.