The use of mobile phone data to inform analysis of COVID-19 pandemic epidemiology

Abstract

The ongoing coronavirus disease 2019 (COVID-19) pandemic has heightened discussion of the use of mobile phone data in outbreak response. Mobile phone data have been proposed to monitor effectiveness of non-pharmaceutical interventions, to assess potential drivers of spatiotemporal spread, and to support contact tracing efforts. While these data may be an important part of COVID-19 response, their use must be considered alongside a careful understanding of the behaviors and populations they capture. Here, we review the different applications for mobile phone data in guiding and evaluating COVID-19 response, the relevance of these applications for infectious disease transmission and control, and potential sources and implications of selection bias in mobile phone data. We also discuss best practices and potential pitfalls for directly integrating the collection, analysis, and interpretation of these data into public health decision making.

Introduction

The coronavirus disease 2019 (COVID-19) pandemic caused by the novel coronavirus (SARS-CoV-2) has created an unprecedented challenge for governments, public health agencies, medical officials, and populations globally1,2. The public health response is seeking to effectively mitigate and contain the pandemic while balancing social and economic costs3,4,5. Control strategies thus far have primarily consisted of non-pharmaceutical interventions (NPIs), which have slowed down the epidemic in many settings. Most NPIs rely on reducing contact between infected and susceptible individuals through mass social distancing, including restrictions on social gatherings, closures of schools and businesses, shelter-in-place or stay-at-home orders or lockdowns, travel restrictions, active monitoring, and increased testing, contact tracing, and isolation measures6,7,8,9. These interventions are effective when they result in large-scale human behavioral changes that reduce the close contacts and mobility patterns that facilitate disease transmission, but are challenging to maintain10. Quantifying these patterns to assess NPI effectiveness, particularly on the spatial, temporal, and population scales necessary to fully inform public health response, is an important challenge for this pandemic response.

As a result of the rapid spread and grievous toll exacted by the COVID-19 pandemic, there has been increasing interest in developing innovative methods and tools to inform public health response through digital data, including mobile phone data both passively collected by mobile phone operators and actively collected via recently developed applications11. Mobile phone data remain one of the best sources of information on large-scale population behaviors12. These data can be collected in high- and low-income settings and can capture, in near real-time, changes in mobility and clustering patterns for large swaths of the population. We and others have previously used aggregated and anonymized geolocation information from passively collected mobile phone data to successfully inform and model the spatial and temporal dynamics of endemic and emerging infectious diseases, including malaria13,14,15,16, cholera17, measles18,19,20,21,22,23, dengue24,25, and Ebola26,27. Through these prior applications, an understanding of privacy-conscious ways to utilize these data and inform public health policy while forming productive collaborations with operators, public health officials, and academic partners has been developed.

Mobility analysis, quantifying clustering of social contacts, symptom tracking, surveying, and contact tracing applications have all been proposed and employed to some degree to inform the response to COVID-19 (see Fig. 1a). These applications, metrics developed to analyze these data, and proposed best practices have recently been reviewed by an interdisciplinary team of experts28. To build on this work, we examine the applicability of mobile phone data for public health response by reviewing the common applications of mobile phone data relevant to outbreak response; the kinds of behaviors captured within these data and proposed applications; the validity of these data for public health response and epidemiologic research, including sources and implications of selection bias; and potential concerns and best practices for direct integration of these data with public health response.

Fig. 1: The uses of mobile phone data to inform COVID-19 public health response and their possible biases.
figure1

a Over the course of the epidemic, mobile phone data and applications may be relevant to help answer a number of important epidemiological questions needed to guide the implementation and evaluation of various interventions. b However, these data should be considered in light of ownership and use biases that may or may not limit generalizability to the overall population. Mobile phone owners and users only represent a subset of the population and may have additional age (shown here for a synthetic population for illustrative purposes), socio-demographic, or geographic biases. Applications that require the use of a smartphone or application may further limit the generalizability of these data since they represent smaller subsets of the user population.

Utilizing mobile phone data to inform COVID-19 response

Mobile phone data can be used to inform different aspects of COVID-19 response (Table 1). At the population level, quantifying changes in human mobility or clustering can help evaluate the impact of an NPI and identify hotspots where additional or different interventions may need to be applied. At the individual level, mobile phone data may be used to understand patterns of individual contacts and enhance contact tracing.

Table 1 Summary of types, metrics, and proposed applications of mobile phone data.

Evaluating current interventions and monitoring their release

The most widely used application of mobile phone data in public health to date is the use of telecom geolocation data to track population movements11,12. Mobile phone operators routinely collect Call Detail Records (CDRs) that contain a timestamp and GPS location with a unique identifier for all subscribers. These data thus are typically readily available and offer high coverage to estimate mobility patterns of individuals using their mobile devices. We note that similar time-resolved GPS location data may be passively collected through certain applications, though typically for only a subset of subscribers that may introduce further bias.

CDRs can be used to generate a number of metrics for characterizing large, population-level mobility patterns. Origin-Destination (OD) matrices reflect the number of times a trip is made between two locations (of varying spatial resolution) in a certain period. These matrices can be analyzed over time to detect temporal trends (i.e., holidays, seasonality, weekday vs weekend) and regular hotspots of attraction. These spatial and temporal flows of individuals between locations, including the magnitude and frequency of these movements, can be used to understand the risk of importation from areas with ongoing outbreaks to areas without sustained transmission where there is a risk of reintroduction and resurgence. Aggregate flows can also be used to retrace the likely introduction and spread of an outbreak in new areas and to inform future projections of disease risk or burden across space and decision making around the design and implementation of travel restrictions or increased surveillance.

Aggregate mobility patterns may also be critical pieces of evidence when evaluating the effectiveness of various NPIs. Most NPIs are reliant on modifying physical behavior. Monitoring the volume, frequency, and average distance of flow during interventions can be used to directly quantify the adoption and effect of these interventions, and identify areas of high potential risk to target with different interventions. There are already identified associations between reductions in population-level mobility within and between different locations and COVID-19 incidence6,10,29, though further exploration of which population-level metrics are most closely related to changes in disease risk and whether these associations are sustained throughout an outbreak is needed30. These associations would ideally be interrogated to identify individual behaviors associated with mobility measures that are also associated with individual risk of COVID-19.

The effect on NPIs can also be monitored through subscriber density metrics that combine the recorded GPS location and timestamp of CDRs to capture the real-time population density and identify potential hotspots. When using finer-scale GPS location data, these density metrics may quantify the likelihood or frequency that users came into proximal contact. A third metric derived from CDR or GPS location data, the radius of gyration, quantifies the range over which a single person may travel in a specified time period. Importantly, the data required for these applications are non-identifiable; they cannot be used to identify any given individual’s interactions, but provide population-level insight into the average clustering and movement of individuals. These metrics, along with traditional OD matrix flows, were recently employed in Italy as a way to evaluate the impact of its national lockdown31. Traffic flow between provinces and probability of colocation were reduced initially in the northern provinces, where the COVID-19 outbreak was first observed, a clear signal of reactive social distancing. As the epidemic progressed, and especially once the national lockdown was enforced, the entire country saw a reduction in traffic between provinces; however, the probability of colocation remained highly dependent on province and was likely attributed to the number of cases reported in each province. Interestingly, the average distance traveled by individuals was significantly reduced across all provinces after the initial outbreak was confirmed.

The use of Bluetooth data (records of proximal interactions between Bluetooth-enabled devices) to quantify physical clustering or real-time density of subscribers at small spatial scales (e.g., zip codes) and fine temporal resolution has been explored for the purposes of contact tracing (see below). The use of these data has been considered less for population-level analyses, though it offers another source of information on behavioral changes under different NPIs. When activated, mobile phones will emit a Bluetooth beacon that is detected by other activated phones. When two Bluetooth-enabled devices are within range, the date, time, distance and duration of interaction can be recorded. The frequency or number of these interactions (analyzed anonymously to form, broadly, measures of clustering or proximal interaction rates over time) may be important given the role of sustained interaction or overcrowding of individuals32,33,34 and contact structure in SARS-CoV-2 transmission35. Furthermore, Bluetooth data in combination with GPS data or a network of Bluetooth sensors can be used to quantify the amount of time people spend at home or other identified locations when lockdown measures are in place to determine if policies are effective.

These data and measures of population-level mobility or clustering patterns would be exceedingly difficult to collect on a similar scale without mobile phone data. These data are often continuously collected, in near real-time, allowing for continued analysis as an outbreak unfolds. Importantly, though, a baseline understanding of contact or clustering patterns prior to any interventions is necessary to inform estimates of intervention impact.

Facilitating contact tracing

Opt-in applications (apps)36,37,38,39,40,41,42 that rely on digital approaches to enumerate and contact individuals who may have been in proximity with someone infected with COVID-19 have been proposed to increase efficiency and decrease the very large burden of manual contact tracing programs43,44,45. By enabling rapid tracing of perhaps higher proportions of affected individuals, these apps can reduce the amount of time that a potentially infected person would have to infect others, particularly in asymptomatic or pre-symptomatic phases of infection46. Most contact tracing apps collect Bluetooth and/or GPS location data to create trails of contacts over a moving time window (14-28 days). Unlike the data needed to understand population-level, aggregated behaviors described above, these data must be linked to single individuals and capture pairwise interactions with other identifiable individuals. Once a case has been identified, they are added to a list of infected users that is queried by the other phones in the network. If the infected user is detected in the trail of contacts, then the user and their contacts are alerted, either by the app or by a public health official, to initiate isolation and quarantine.

This contact tracing process occurs either in a centralized manner, where user information is sent to a remote computer where matching occurs, or in a decentralized manner, where the matching process occurs on the user’s phone. In order for these approaches to feed directly into public health decision making, a direct line between the developers, public health response teams, and users needs to be put in place. This will also be key to mitigating any privacy concerns, which should be dealt with in a transparent and direct manner. Although there has been little discussion to date, routinely collected, individually-identifiable Bluetooth or fine-scale GPS location data may also be used to infer and quantify high-resolution proximity network structures which may further inform contact tracing efforts, but will also raise additional privacy concerns47,48.

Frameworks to process and analyze mobile phone data

Luckily, computing resources and methods to analyze and extract these data will not likely be the limiting factor in these instances. Groups such as Flowminder and Telenor Research Group have worked for multiple years to develop more streamlined processes to analyze these data, particularly aggregate mobility data, that are able to directly interface with mobile phone operators. Flowminder has produced a suite of CDR aggregates, such as counts of active subscribers per region or counts of travelers, that can then be used to calculate indicators of mobility, such as crowdedness, population mixing, locations of interest, and intra-/inter-regional travel49. The code to extract these metrics is publicly available at50. Telenor Research Group works directly with mobile phone operators to provide researchers with spatially aggregated CDR/mobility data51. Facebook’s Data For Good program provides aggregated mobility data to researchers that come from their subscribers, and companies like Cuebiq provided mobility data for a number of COVID-19 studies that summarize the distance users travel or the proportion of users that stay at home52. These existing frameworks - not only the analyses, but also the privacy considerations and data sharing agreements - will provide standardized methods that facilitate integrating mobility data into intervention assessments.

Data privacy

Various forms of identifiable personal information are generated when using mobile phones, including names, identification numbers, fine spatial and temporal data on where the device was used, other users’ identification numbers who may have been detected by Bluetooth, and personal details that might be entered into an app. In light of the growing number of digital privacy concerns and regulations, one must carefully consider the exact form and use of mobile phone data being collected against the legal and ethical need to protect users’ data security and confidentiality. While maintaining user confidentiality is often seen as a hindrance to the use of mobile phone data, in that it limits the use of individual-level data and typically requires aggregation to coarse spatial and temporal resolutions, there are a number of existing frameworks that can help provide guidance for the effective, privacy-conscious use of mobile phone data53.

Exactly which model of data privacy will best suit the use of mobile phone data for COVID-19 response will depend on the exact form and proposed use of the data. As discussed above, there already exist many data processing and analysis frameworks to provide anonymized indicators of population mobility. These standard procedures, though, could result in aggregated data with insufficient spatial and temporal resolution to be effective for monitoring the spread of SARS-CoV-2. Privacy regulations, such as the European Union’s General Data Protection Regulation (GDPR)54, offer exceptions for the use of non-anonymous data that may be needed for other response efforts. For example, opt-in applications for contact tracing may seek consent of the data subject to collect and analyze identifiable data, though the ability to scale opt-in approaches to a wide enough population and to maintain user compliance and participation remains unclear. GDPR and other regulations also provide an exception for anonymization of data to be used in public service, but the regulatory hurdles to gain this exception can be substantial and would require clear use policies and applications for these data. The use of mobile phone data, particularly forms such as those proposed through contact tracing applications, must be weighed against the possible infringements of privacy and civil liberties versus the potential public health benefit.

Capturing epidemiologically-relevant behaviors with mobile phone data

Both the ability to capture behavioral patterns in a large proportion of the population and the potential scalability of these approaches are some of the most promising aspects of mobile phone data11,12. The potential for broad expansion in the collection and availability of mobile phone data requires an understanding of exactly which behaviors are captured and whether these behaviors are valid measurements of interactions relevant for infectious disease transmission (Table 2). The validity of these behavioral metrics needs to be evaluated in the specific context of their application, including the spatial and temporal scales of the data and the proposed public health actions or policies informed by these data.

Table 2 Epidemiologically-relevant behaviors captured in mobile phone data.

Many natural experiments are now occurring as various NPIs are implemented and lifted, which could be systematically and passively measured via mobile phone data to guide decision makers monitoring the effectiveness and implementation of various NPIs in real time. CDR data may offer one of the best assessments of changes to population-level mobility and clustering behaviors in response to NPIs at potentially fine spatial and temporal scales. These data, though, are only relevant to disease transmission if we can assume that these aggregate behaviors capture the movement of infected, and potentially infectious, individuals. Individual behavior may change in response to real or perceived illness in ways that are not easily captured in aggregate metrics. These aggregate measures are also unable to distinguish movement with high risk of transmission (e.g., shared public transit without appropriate protective equipment) from movement with low risk (e.g., travel by personal vehicle with appropriate social distancing), and therefore will not fully capture the spectrum of behavioral changes that may reduce disease risk.

All forms of mobile phone-based tracking are only able to capture proxies of movement, in that they track a device rather than an individual. Compared to other measures of human mobility (surveys, direct observation), mobile phone data tend to more completely capture the movement of individuals within the study population. However, differences in how individuals use their phone may introduce important biases, particularly when attempting to assess changes in behavior across time or across populations. For example, mobile phone data may be unable to capture an increasing proportion of individuals staying at home or at work following restrictions on non-essential travel, where they may be more likely to use Internet-based communication that does not generate CDRs. Similarly, there may be differences in how individuals of different ages or in different regions use their phones, which will affect the validity of mobile phone metrics in these populations. Mobile phone data cannot distinguish between multiple people using a single phone or SIM card (either of which may be used as a unique identifier), nor does it account for users with multiple phones or SIM cards, limiting the ability to make any inferences about the behavior of individuals from CDR data.

The spatial and temporal scales over which these aggregate data are collected also have important implications for their application. Mobility flows derived from CDRs are commonly used in metapopulation transmission models to parameterize the rates at which individuals move between various locations. This application requires that the origin and destination locations be spatial areas within which the exact social contact patterns of individuals can be estimated (e.g., through mass action assumptions or age-specific contact matrices) and which directly relate to relevant public health decisions (e.g., administrative units with a common public health authority). Data availability and privacy regulations often limit the spatial scales on which these locations can be defined, nor is it always clear which locations are most relevant to disease transmission processes. While mobility flows are useful for understanding potential transmission links between these locations, when they can be defined, the spatial aggregation naturally limits their utility in understanding or modeling transmission chains within these locations. Respiratory viruses like SARS-CoV-2 are diseases of close contact; the spatial scale (several meters) over which transmission occurs is many times smaller than what can be explored through aggregate mobility flows (typically aggregated to areas of at least 500 m2).

The temporal scale of aggregation is also important. For reasons of privacy and computational efficiency, several hour time steps or daily movements are often calculated, and multi-step journeys over several days are not measured. Similar privacy and efficiency concerns mean that individual trajectories (e.g., moving from A to B to C) are often impossible to measure. As such, aggregate data are used and, though the relative connections between places are typically robust (e.g., flows between A and B), the exact magnitude of travel occurring multiple times per time step or along specific routes (e.g., transit in and out of a capital city) are difficult to capture in aggregate data.

Contact tracing applications require the use of identifiable, fine-scale clustering and contact data to understand the proximal interactions individuals have that may result in disease transmission. The tolerance for missing data in these applications is low; any missed interaction might be a missed transmission event. Unlike aggregated metrics, contact tracing applications allow for measured behavior to be linked to an individual’s infection status, though it is still unclear how to translate the physical proximity of mobile devices to transmission-relevant interactions between individuals. Particularly in dense areas (e.g., apartment buildings), these applications may capture many proximal interactions which have very low risk of transmission, leading to high rates of unnecessary quarantine and associated social and economic costs. It further remains unclear whether SARS-CoV-2 transmission through non-proximal interactions (aerosols, fomites, fecal-oral) plays an important role, compounding the difficulty of defining transmission-relevant interactions that can be captured in mobile phone data.

Evaluating the ability of mobile phone data to represent populations at risk for COVID-19

A key advantage of mobile phone data is the possibility to quickly collect bespoke data in many areas; particularly in a pandemic, where tailored responses to specific epidemiologic and social contexts are required, use of mobile phone data might require fewer assumptions about the transportability of derived mobility and contact metrics across different populations. Critically, though, data derived from mobile phones only directly captures mobility and contact among those who own and regularly use a mobile device (see Fig. 1b). While there is evidence these measures do reflect movement of individuals without mobile phones on a population level, the exact representativeness of mobile phone data to populations without phones in varied settings for both population-level flows and individual-level clustering patterns remains unclear55. Any applications that require smartphone versus general phone use may further limit their generalizability and applicability. Prior work has shown clear sociodemographic and age biases of mobile phone ownership within populations in multiple settings56,57. Children and the elderly are frequently under-represented in mobile phone data, and inferences derived from mobile phone users may not be generalizable to these populations. Even in areas where mobile phone ownership is nearly-ubiquitous, data may not be fully representative if collected only from a single operator, which may target a specific population, or from an area with unequal service coverage, which is necessary for CDR and GPS location capture. Issues of service coverage may be particularly problematic in low- and middle-income countries, where there are generally fewer events recorded in CDR data, leading to greater uncertainty and potential for bias in estimates of mobility. Mobile phone owners may not be the primary or sole users of their device, too, further complicating the transportability of these data to particular populations.

Applications with opt-in or opt-out features will have an additional level of selection in the population for whom data is available; this selection could be particularly dangerous if there is clustering in the populations which do not participate in app-based responses or uses that require smartphone ownership. Modeling work has indicated that upwards of 60-80% of individuals must be enrolled in contact-tracing applications to achieve substantial reductions in transmission, but these estimates may be optimistic if chains of transmission are sustained and uninterrupted among clusters of non-participants46,58.

There are clear age-specific and social disparities in COVID-19 risk that may dovetail with the potential selection biases in mobile phone data, leading to a critical need to understand where and to what extent these biases may exist. Identifying and quantifying these biases is particularly challenging, though, when there is no clear gold standard against which to validate mobile phone data. Transparent usage and ownership information will allow for exploration and characterization of the populations under study and potential generalizability of results59. For example, areas with limited coverage may be excluded from certain analyses, or population weighting may be used to derive appropriate, adjusted estimates of population mobility. Targeted studies to understand how aggregate mobility or contact patterns differ by important demographic groups (e.g., the elderly, essential care workers) may be needed to capture the most epidemiologically-relevant and high-risk patterns of behavior relevant to COVID-19 response.

Determining the appropriate resolution for evaluating behavior in a pandemic

Most mobile phone data is used to understand aggregate, population-level behaviors. Though commonly used to describe possible mechanistic drivers of transmission, these aggregate patterns may not be the most relevant to public health responses, particularly if the focus of pandemic response shifts from containment and mitigation to sustained surveillance and perhaps even local elimination that require highly targeted responses. Moreover, it is increasingly clear that the COVID-19 burden is not equally borne throughout the population. Using aggregate mobility flows to estimate population-level reductions in travel will fail to capture increased risk among essential workers unable to stay home. However, there may be promise in aggregating individual-level data (such as contact tracing data) that may provide additional epidemiologically relevant information.

The COVID-19 pandemic has therefore led to an increased push to utilize mobile phone data to quantify fine-scale, individual-level contact and clustering patterns. These kinds of data have historically been key features of outbreak control, but have involved labor-intensive, manual contact tracing, surveillance, and outbreak investigations. The ability to standardize and streamline the collection, analysis, and application of individual-level data for outbreak response could reduce the current burden on public health officials and is being actively explored by multiple teams.

Developing this ability is not without its challenges, though. Collecting potentially sensitive identifiable data, perhaps passively or without opt-in consent, requires a deep, careful understanding of the legal, ethical, and privacy concerns surrounding the collection and use of these data relative to the potential public health applications and benefits. In all cases, transparent data policies must be used to ensure community engagement and appropriate use and dissemination of collected data. Opt-in features may be necessary to adequately maintain privacy and respect for persons, but will likely impact adoption of data collection and may result in severe selection biases.

In fact, the pitfalls of non-representative study populations and demographic biases may become more pronounced with individual data at finer spatial and temporal scales than aggregate population-level mobility patterns. For opt-in approaches, complementary research into who adopts, uses, and complies with platforms collecting and using individual-level data will be key to assessing their effectiveness. There must additionally be strategies to ensure engagement and representation of high-risk groups in the design and roll-out of these platforms to mitigate the most severe consequences of the pandemic.

Integrating mobile phone data into decision making

Although mobile phone data show great promise for characterizing population- and individual-level behaviors, there remains considerable uncertainty around how to appropriately account for the patterns and potential biases within these data. Policy decisions informed by mobile phone data must consider which populations and which behaviors are or would be excluded from data and which approaches in the design, collection, analysis, or interpretation of mobile phone data can help confront these biases. Care must also be taken to collect and analyze mobile phone data at the appropriate scale for the decision at hand; mobile phone data can potentially provide highly detailed data at various levels of spatiotemporal resolution and aggregation, and the exact data and inferences required will change with the specific decisions and challenges being confronted.

Many important, practical questions must also be addressed before widespread use of mobile phone data is possible, including: How will the data be used? Who will have access to these data? How long will the data be used and stored? Developing solutions that maintain the privacy of subscribers and follow appropriate regulations while protecting data sovereignty for operators and ensuring the most complete, informative data is made available to researchers and stakeholders will be key to the widespread use and adoption of mobile phone data. Collaborative groups such as the COVID-19 Mobility Network and European Commission are avenues that can standardize and expedite the dissemination and use of mobile phone data for public health response60. However, careful, context-specific application of standard methods is necessary to translate them into policy, especially when considering regulatory differences and changing public health priorities across countries. This points to a broader challenge in the use of mobile phone data to explore mobility and contact patterns across international borders. These data typically only include information on movement or clustering within a given country, but cross-border travel will likely be an increasingly important consideration in COVID-19 response as travel restrictions are eased.

Effective use of mobile phone data will require direct, iterative collaboration between mobile phone operators, researchers, and public health officials. This form of reciprocal collaboration will facilitate rapid processing and bespoke analysis of these data; this can also help ensure the continued participation of mobile phone operators, who typically participate in such public health initiatives as a form of social responsibility. The near real-time nature of mobile phone data is a key feature that must be appropriately leveraged to inform response in rapidly-changing situations. At the same time, it is necessary to contextualize the analysis and interpretation of these data as they are integrated into public health decision making. Without appropriate communication, mobile phone data at best will be of limited use and at worst could lead to misguided decisions informed by bad data.

It must be acknowledged, though, that while mobile phone data contain challenges, the perfect data are rarely available in emergency situations. An informed response must consider all available evidence, of which mobile phone data can be a key component at every stage. Mobile phone data are not a panacea, but they may serve as an important complementary approach that can facilitate rapid, early assessments of intervention impacts along with existing surveillance systems tracking disease burden. Mobile phone apps can serve as an important technological resource in contact tracing efforts, and continued monitoring of clustering and contact patterns through mobile phone data will complement active surveillance efforts as the current pandemic wanes.

References

  1. 1.

    Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med 382, 727–733 (2020).

    CAS  Article  Google Scholar 

  2. 2.

    Zhou, P. et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 579, 270–273 (2020).

    ADS  CAS  Article  Google Scholar 

  3. 3.

    Gates, B. Responding to Covid-19 — a once-in-a-century pandemic? N. Engl. J. Med. 382, 1677–1679 (2020).

    CAS  Article  Google Scholar 

  4. 4.

    Anderson, R. M., Heesterbeek, H., Klinkenberg, D. & Hollingsworth, T. D. How will country-based mitigation measures influence the course of the COVID-19 epidemic? Lancet 395, 931–934 (2020).

    CAS  Article  Google Scholar 

  5. 5.

    Fisher, D. & Wilder-Smith, A. The global community needs to swiftly ramp up the response to contain COVID-19. Lancet 395, 1109–1110 (2020).

    CAS  Article  Google Scholar 

  6. 6.

    Chinazzi, M. et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science 368, 395–400 (2020).

    ADS  CAS  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Lewnard, J. A. & Lo, N. C. Scientific and ethical basis for social-distancing interventions against COVID-19. Lancet Infect. Dis. 20, 631–633 (2020).

  8. 8.

    Ferguson, N. M. et al. Impact of non-pharmaceutical interventions (NPIs) to reduce COVID- 19 mortality and healthcare demand. https://dsprdpub.cc.ic.ac.uk:8443/handle/10044/1/77482 (2020).

  9. 9.

    Lai, S. et al. Effect of non-pharmaceutical interventions to contain COVID-19 in China. Nature. In press https://doi.org/10.1038/s41586-020-2293-x (2020).

  10. 10.

    Kraemer, M. U. G. et al. The effect of human mobility and control measures on the COVID-19 epidemic in China. Science 368, 493–497 (2020).

    ADS  CAS  Article  Google Scholar 

  11. 11.

    Buckee, C. O. et al. Aggregated mobility data could help fight COVID-19. Science 368, 145–146 (2020).

    ADS  PubMed  Google Scholar 

  12. 12.

    Wesolowski, A., Buckee, C. O., Engø-Monsen, K. & Metcalf, C. J. E. Connecting mobility to infectious diseases: the promise and limits of mobile phone data. J. Infect. Dis. 214, S414–S420 (2016).

    Article  Google Scholar 

  13. 13.

    Wesolowski, A. et al. Quantifying the impact of human mobility on malaria. Science 338, 267–270 (2012).

    ADS  CAS  Article  Google Scholar 

  14. 14.

    Ruktanonchai, N. W. et al. Identifying malaria transmission foci for elimination using human mobility data. PLoS Comput. Biol. 12, e1004846 (2016).

    Article  Google Scholar 

  15. 15.

    Pindolia, D. K. et al. Human movement data for malaria control and elimination strategic planning. Malar. J. 11, 205 (2012).

    Article  Google Scholar 

  16. 16.

    Tatem, A. J. et al. Integrating rapid risk mapping and mobile phone call record data for strategic malaria elimination planning. Malar. J. 13, 52 (2014).

    Article  Google Scholar 

  17. 17.

    Peak, C. M., Reilly, A. L., Azman, A. S. & Buckee, C. O. Prolonging herd immunity to cholera via vaccination: accounting for human mobility and waning vaccine effects. PLoS Negl. Trop. Dis. 12, e0006257 (2018).

    Article  Google Scholar 

  18. 18.

    Xia, Y., Bjørnstad, O. N. & Grenfell, B. T. Measles metapopulation dynamics: a gravity model for epidemiological coupling and dynamics. Am. Nat. 164, 267–281 (2004).

    Article  Google Scholar 

  19. 19.

    Grenfell, B. T., Bjørnstad, O. N. & Kappey, J. Travelling waves and spatial hierarchies in measles epidemics. Nature 414, 716–723 (2001).

    ADS  CAS  Article  Google Scholar 

  20. 20.

    Bharti, N. et al. Explaining seasonal fluctuations of measles in niger using nighttime lights imagery. Science 334, 1424–1427 (2011).

    ADS  CAS  Article  Google Scholar 

  21. 21.

    Bjornstad, O. N., Finkenstadt, B. F. & Grenfell, B. T. Dynamics of measles epidemics: estimating scaling of transmission rates using a time series SIR model. Ecol. Monogr. 72, 17 (2002).

    Article  Google Scholar 

  22. 22.

    Finkenstädt, B. F., Bjørnstad, O. N. & Grenfell, B. T. A stochastic model for extinction and recurrence of epidemics: estimation and inference for measles outbreaks. Biostatistics 3, 493–510 (2002).

    Article  Google Scholar 

  23. 23.

    Wesolowski, A. et al. Measles outbreak risk in Pakistan: exploring the potential of combining vaccination coverage and incidence data with novel data-streams to strengthen control. Epidemiol. Infect. 146, 1575–1583 (2018).

    Article  Google Scholar 

  24. 24.

    Cummings, D. A. T. et al. Travelling waves in the occurrence of dengue haemorrhagic fever in Thailand. Nature 427, 344–347 (2004).

    ADS  CAS  Article  Google Scholar 

  25. 25.

    Wesolowski, A. et al. Impact of human mobility on the emergence of dengue epidemics in Pakistan. Proc. Natl Acad. Sci. 112, 11887–11892 (2015).

    ADS  CAS  Article  Google Scholar 

  26. 26.

    Peak, C. M. et al. Population mobility reductions associated with travel restrictions during the Ebola epidemic in Sierra Leone: use of mobile phone data. Int. J. Epidemiol. 47, 1562–1570 (2018).

    Article  Google Scholar 

  27. 27.

    Wesolowski, A. et al. Commentary: containing the Ebola outbreak - the potential and challenge of mobile network data. PLoS Curr. 6, https://doi.org/10.1371/currents.outbreaks.0177e7fcf52217b8b634376e2f3efc5e (2014).

  28. 28.

    Oliver, N. et al. Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle. Sci. Adv. 6, eabc0764 (2020).

  29. 29.

    Gao, S., Rao, J., Kang, Y., Liang, Y. & Kruse, J. Mapping county-level mobility pattern changes in the United States in response to COVID-19. https://papers.ssrn.com/abstract=3570145 (2020).

  30. 30.

    Ainslie, K. E. C. et al. Evidence of initial success for China exiting COVID-19 social distancing policy after achieving containment. Wellcome Open Res. 5, 81 (2020).

    Article  Google Scholar 

  31. 31.

    Pepe, E. et al. COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown. Sci. Data 7, 1–7 (2020). Using mobile phone data, Pepe et al. used a number of mobility metrics at the individual and population level to evaluate the impact of non-pharmaceutical interventions across Italy at the beginning of the COVID-19 pandemic.

  32. 32.

    Ghinai, I. Community transmission of SARS-CoV-2 at two family gatherings — Chicago, Illinois, February–March 2020. MMWR Morb. Mortal. Wkly. Rep. 69, 446–450 (2020).

  33. 33.

    Hamner, L. High SARS-CoV-2 attack rate following exposure at a choir practice — Skagit County, Washington, March 2020. MMWR Morb. Mortal. Wkly. Rep. 69, 606–610 (2020).

  34. 34.

    Cai, J. et al. Indirect virus transmission in cluster of COVID-19 cases, Wenzhou, China, 2020. Emerg. Infect. Dis. 26, 1343–1345 (2020).

  35. 35.

    Rader, B. et al. Crowding and the epidemic intensity of COVID-19 transmission. Preprint at https://doi.org/10.1101/2020.04.15.20064980 (2020).

  36. 36.

    NHS COVID-19 App. https://www.nhsx.nhs.uk/covid-19-response/nhs-covid-19-app/ (2020).

  37. 37.

    Safe Paths — MIT Media Lab. https://www.media.mit.edu/projects/safepaths/overview/ (2020).

  38. 38.

    Australian Government Department of Health. COVIDSafe app. https://www.health.gov.au/resources/apps-and-tools/covidsafe-app (2020).

  39. 39.

    Help speed up contact tracing with TraceTogether. http://www.gov.sg/article/help-speed-up-contact-tracing-with-tracetogether (2020).

  40. 40.

    Care19. https://ndresponse.gov/covid-19-resources/care19 (2020).

  41. 41.

    Aarogya Setu Mobile App. https://mygov.in/aarogya-setu-app/ (2020).

  42. 42.

    Rakning C-19 App. https://www.covid.is/app/en (2020).

  43. 43.

    Park, S., Choi, G. J. & Ko, H. Information technology–based tracing strategy in response to COVID-19 in South Korea—privacy controversies. JAMA 323, 2129–2130 (2020).

  44. 44.

    Romm, T., Dwoskin, E. & Timberg, C. U. S. government, tech industry discussing ways to use smartphone location data to combat coronavirus. Washington Post (2020).

  45. 45.

    Pollina, E. & Busvine, D. European mobile operators share data for coronavirus fight. Reuters (2020).

  46. 46.

    Ferretti, L. et al. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science 368, 1–7 (2020). Through mathematical models and case studies, Ferretti et al. detail how a mobile phone app could bring a quickly spreading epidemic under control by facilitating rapid and wide-spread contact tracing and quarantining.

  47. 47.

    Raskar, R. et al. Apps gone rogue: maintaining personal privacy in an epidemic. Preprint at https://arxiv.org/abs/2003.08567 (2020).

  48. 48.

    Berke, A., Bakker, M., Vepakomma, P., Larson, K. & Pentland, A.’sandy’. Assessing disease exposure risk with location data: a proposal for cryptographic preservation of privacy. Preprint at https://arxiv.org/abs/2003.14412 (2020).

  49. 49.

    FlowKit. https://flowkit.xyz/ (2020).

  50. 50.

    Gray, J. et al. Flowminder/FlowKit: 1.9.4. https://zenodo.org/record/3820249#.Xr2Cj8YpC7Y (2020).

  51. 51.

    Buckee, C. O. & Engø-Monsen, K. Mobile phone data for public health: towards data-sharing solutions that protect individual privacy and national security. Preprint at https://arxiv.org/abs/1606.00864 (2016). Buckee & Engø- Monsen describe a standardized framework for data sharing that avoids privacy and security concerns.

  52. 52.

    Cuebiq Marketing Team. Data for good. https://www.cuebiq.com/about/data-for-good/ (2017).

  53. 53.

    de Montjoye, Y.-A. et al. On the privacy-conscientious use of mobile phone data. Sci. Data 5, 180286 (2018).

    Article  Google Scholar 

  54. 54.

    General Data Protection Regulation (GDPR) – Official Legal Text. General Data Protection Regulation (GDPR) https://gdpr-info.eu/.

  55. 55.

    Bengtsson, L., Lu, X., Thorson, A., Garfield, R. & von Schreeb, J. Improved response to disasters and outbreaks by tracking population movements with mobile phone network data: a post-earthquake geospatial study in Haiti. PLoS Med. 8, e1001083 (2011).

    Article  Google Scholar 

  56. 56.

    Wesolowski, A., Eagle, N., Noor, A. M., Snow, R. W. & Buckee, C. O. The impact of biases in mobile phone ownership on estimates of human mobility. J. R. Soc. Interface 10, 20120986 (2013).

    Article  Google Scholar 

  57. 57.

    Wesolowski, A., Eagle, N., Noor, A. M., Snow, R. W. & Buckee, C. O. Heterogeneous mobile phone ownership and usage patterns in Kenya. PLoS One 7, e35319 (2012).

    ADS  CAS  Article  Google Scholar 

  58. 58.

    Kucharski, A. J. et al. Effectiveness of isolation, testing, contact tracing, and physical distancing on reducing transmission of SARS-CoV-2 in different settings: a mathematical modelling study. Lancet Infect. Dis. https://doi.org/10.1016/S1473-3099(20)30457-6 (2020).

  59. 59.

    Balsari, S., Buckee, C. & Khanna, T. Which Covid-19 data can you trust? Harvard Bus. Rev. (2020).

  60. 60.

    European Commission. Coronavirus: commission adopts recommendation to support exit strategies through mobile data and apps. https://ec.europa.eu/commission/presscorner/detail/en/IP_20_626 (2020).

Download references

Acknowledgements

A.W., K.H.G., H.R.M., and J.R.G. are funded by a Career Award at the Scientific Interface from the Burroughs Wellcome Fund and the National Library of Medicine of the National Institutes of Health under Award Number DP2LM013102. C.O.B. is funded by the National Institute of General Medical Sciences (R35GM124715-02). S.S. is funded by DP2DA040244. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Affiliations

Authors

Contributions

A.W., H.R.M., and K.H.G. conceived and took the lead writing the manuscript. D.A.T.C., C.J.E.M., B.T.G., J.R.G., S.M., S.S., A.L., N.K., C.O.B., A.W., H.R.M., K.H.G. all provided critical feedback and helped shape the manuscript.

Corresponding author

Correspondence to Amy Wesolowski.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Linus Bengtsson and Manolis Terrovitis for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Grantz, K.H., Meredith, H.R., Cummings, D.A.T. et al. The use of mobile phone data to inform analysis of COVID-19 pandemic epidemiology. Nat Commun 11, 4961 (2020). https://doi.org/10.1038/s41467-020-18190-5

Download citation

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Search

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing