State-level tracking of COVID-19 in the United States

Unwin, H. Juliette T.; Mishra, Swapnil; Bradley, Valerie C.; Gandy, Axel; Mellan, Thomas A.; Coupland, Helen; Ish-Horowicz, Jonathan; Vollmer, Michaela A. C.; Whittaker, Charles; Filippi, Sarah L.; Xi, Xiaoyue; Monod, Mélodie; Ratmann, Oliver; Hutchinson, Michael; Valka, Fabian; Zhu, Harrison; Hawryluk, Iwona; Milton, Philip; Ainslie, Kylie E. C.; Baguelin, Marc; Boonyasiri, Adhiratha; Brazeau, Nick F.; Cattarino, Lorenzo; Cucunuba, Zulma; Cuomo-Dannenburg, Gina; Dorigatti, Ilaria; Eales, Oliver D.; Eaton, Jeffrey W.; van Elsland, Sabine L.; FitzJohn, Richard G.; Gaythorpe, Katy A. M.; Green, William; Hinsley, Wes; Jeffrey, Benjamin; Knock, Edward; Laydon, Daniel J.; Lees, John; Nedjati-Gilani, Gemma; Nouvellet, Pierre; Okell, Lucy; Parag, Kris V.; Siveroni, Igor; Thompson, Hayley A.; Walker, Patrick; Walters, Caroline E.; Watson, Oliver J.; Whittles, Lilith K.; Ghani, Azra C.; Ferguson, Neil M.; Riley, Steven; Donnelly, Christl A.; Bhatt, Samir; Flaxman, Seth

doi:10.1038/s41467-020-19652-6

Download PDF

Article
Open access
Published: 03 December 2020

State-level tracking of COVID-19 in the United States

Nature Communications volume 11, Article number: 6189 (2020) Cite this article

8640 Accesses
78 Citations
35 Altmetric
Metrics details

Subjects

Abstract

As of 1st June 2020, the US Centres for Disease Control and Prevention reported 104,232 confirmed or probable COVID-19-related deaths in the US. This was more than twice the number of deaths reported in the next most severely impacted country. We jointly model the US epidemic at the state-level, using publicly available death data within a Bayesian hierarchical semi-mechanistic framework. For each state, we estimate the number of individuals that have been infected, the number of individuals that are currently infectious and the time-varying reproduction number (the average number of secondary infections caused by an infected person). We use changes in mobility to capture the impact that non-pharmaceutical interventions and other behaviour changes have on the rate of transmission of SARS-CoV-2. We estimate that R_t was only below one in 23 states on 1st June. We also estimate that 3.7% [3.4%–4.0%] of the total population of the US had been infected, with wide variation between states, and approximately 0.01% of the population was infectious. We demonstrate good 3 week model forecasts of deaths with low error and good coverage of our credible intervals.

Infectious disease in an era of global change

Article 13 October 2021

Rachel E. Baker, Ayesha S. Mahmud, … C. Jessica E. Metcalf

The evolutionary drivers and correlates of viral host jumps

Article Open access 25 March 2024

Cedric C. S. Tan, Lucy van Dorp & Francois Balloux

Risk of death following COVID-19 vaccination or positive SARS-CoV-2 test in young people in England

Article Open access 27 March 2023

Vahé Nafilyan, Charlotte R. Bermingham, … James C. Doidge

Introduction

The first death caused by COVID-19 in the United States is currently believed to have occurred in Santa Clara County, California on the 6th February¹. Throughout March 2020, US state governments implemented a variety of non-pharmaceutical interventions (NPIs), such as school closures and stay-at-home orders, to limit the spread of SARS-CoV-2 and ensure the number of severe COVID-19 cases did not exceed the capacity of the health system. In April 2020, the number of deaths attributed to COVID-19 in the United States (US) surpassed that of Italy². Courtemanche et al.³ used an event-study model to determine that such NPIs were successful in reducing the growth rate of COVID-19 cases across US counties. We similarly seek to estimate the impact of NPIs on COVID-19 transmission, but through a semi-mechanistic Bayesian model that reflects the underlying process of disease transmission and relies on mobility data released by companies such as Google⁴.

Mobility measures revealed stark changes in behaviour following the large-scale government interventions in the first stage of the epidemic, with individuals spending more time at home and correspondingly less time at work, at leisure centres, shopping, and on public transit^4,5. As states continued to ease the stringency of their NPIs in the end of June, policy decisions relied on the interaction between mobility and NPIs and their subsequent impact on transmission, alongside other measures to track and curtail SARS-CoV-2 transmission.

We introduced a new Bayesian statistical framework for estimating the rate of transmission and attack rates for COVID-19 in Flaxman et al.⁶. In that paper, we inferred the time-varying reproduction number, R_t, or the average number of people an infected person will infect over time. We calculated the number of new infections through combining previous infections with the generation interval (the distribution of times between infections) and chose the number of deaths to be a function of the number of infections and the infection fatality ratio (IFR). We estimated the posterior probability of our parameters given the observed data, while incorporating prior uncertainty. This made our approach empirically driven, whilst incorporating uncertainty. This approach has also been implemented for Italy⁷ and Brazil⁸.

In this paper, we extend the Flaxman et al.⁶ framework to model transmission in the US at the state-level and include reported cases in our model. We parameterise R_t as a function of several mobility types and include an autoregressive term to capture changes in transmission that are decoupled from mobility, for example hand-washing, social distancing and changes in transmission that are decoupled from mobility. We utilise partial pooling of parameters, where information is shared across all states to leverage as much signal as possible, but individual effects are also included for state and region-specific idiosyncrasies. In this paper, we infer plausible upper and lower bounds (Bayesian credible interval summaries of our posterior distribution) of the total population that had been infected by COVID-19 on 01 June 2020 (also called the cumulative attack rate or attack rate) and estimate the effective number of individuals currently infectious given our generation distribution. We also present effect sizes of the mobility covariates and make short-term forecasts, which we compare with reality throughout June. Details of the data sources and a technical description of our model are found in sections “Methods” and “Data”, respectively. General limitations of our approach are presented in the conclusions.

Results

Infections

The percentage of the total population across the US infected by COVID-19 was 3.7% [3.4%–4.0%] on 01 June 2020. However, this low national average masked a stark heterogeneity across the states (Table 1). New York and New Jersey had the highest estimated cumulative attack rates, of 15.9% [12.4%–19.9%] and 14.8% [11.2%–18.2%] respectively, and Connecticut and Massachusetts both had cumulative attack rates over 10%. Conversely, other states that have drawn attention for early outbreaks, such as California, Washington, and Florida, only had cumulative attack rates of around 2% and states that were only in the early stages of their epidemics, like Maine, had estimated cumulative attack rates of <1%.

Table 1 Posterior model estimates of percentage of total population ever infected, mean new infections per day over week ending 01 June 2020, and infection ascertainment ratio as of 01 June 2020. We present the mean and the 95% credible intervals in square brackets.

Full size table

Figure 1 shows the effective number of infectious individuals and the number of newly infected individuals on any given day up until 01 June 2020 for each of the 8 regions in our model, which are based on US census regions (see Supplementary Note 1 for further descriptions of our groupings). The effective number of infectious individuals is calculated using the generation time distribution, where individuals are weighted by how infectious they are over time, see section “Generated quantities” for more information. The fully infectious average includes asymptomatic and symptomatic individuals. On 01 June 2020, we estimate that there were 41,100 [34,500–46,800] infectious individuals across the US, which corresponds to 0.01% of the population. Table 1 shows estimates of the number of new infections across each states on 01 June 2020. By this date, the estimated number infections were beginning to increase in the Pacific (Alaska, California, Hawaii, Oregon and Washington) and Mountain (Arizona, Colorado, Idaho, Montana, Nevada, New Mexico, Utah and Wyoming) regions.

**Fig. 1: Daily estimates of the number of infectious (those still able to transmit) individuals and newly infected individuals.**

Our model includes a state-level parameter for the infection ascertainment ratio, IAR, which we define as the number of reported cases divided by the true number of infections (including asymptomatic infections). We only estimate this parameter from 11 May 2020 when more than 375,000 tests are done each day, see Supplementary Note 2 for further information. Column 3 of Table 1 shows the value of the infection ascertainment ratio in our model (see section “Methods”) and varies significantly between state. We would not expect the infection ascertainment to be 100% because our model includes asymptomatic individuals who may not know they have COVID-19. The mean value of this ratio varies between 43% (Missouri) to 74% (Kansas and Tennessee), which suggests that states are doing very different levels of testing.

Reproduction number

The mean estimate for R_t was below one in 23 states on 01 June 2020 and the 95% credible intervals did not exclude one in any state (see Supplementary Note 3 for R_ts by state). Figure 2 depicts the geographical variation in the posterior probability that R_t was <1 using a shape file from the US Census Bureau⁹. The closer a value is to 100%, the more certain we were that the reproduction number was below 1, indicating that new infections were not increasing. The probability was <40% that R_t < 1 in 20 states. There was substantial geographical clustering; most states in the Midwest and the South had reproduction numbers that suggested that the epidemic was not under control. We include figures of R_t, infections and deaths over time for each state in Supplementary Note 4.

Model effect sizes

We find that decreases in the overall average number of visits to different places had a significant effect on reducing transmission. If mobility stopped entirely (100% reduction in average mobility) then R_t would be reduced by 55.1% [26.5%–77.0%]. The country effect size estimates are given in Fig. 3, with regional and state-level effects given in Supplementary Note 5. However, in the US, the average mobility covariate never approached a 100% reduction, and only about half the states had reductions below 50% of the baseline. We define the baseline as the pre-epidemic mobility for each state⁴. As an example, consider the largest reduction observed, −62% of the baseline (Minnesota on 12 April 2020). The effect on R_t was a reduction of 37% [16%–56%] from the country level effect.

Fig. 3: Country level covariate effect sizes assuming mobility stopped entirely (100% reduction in average mobility) and residential mobility was increased fully (100% increase in residential mobility).

Increased time spent in residences also reduced transmission; if time spent in residences increased to 100% of the baseline, R_t would be reduced by 15.3% [−27.5% to 54.6%]. Time spent in residences increased by 20% or more from the baseline in 36 states. As an example, consider the largest reduction observed, a 33% increase from the baseline (New Jersey on 10 April 2020). The effect on R_t from this was a reduction of 5% [−10% to 20%] in New Jersey from the country level effect.

Average mobility and residential mobility are no doubt correlated—when people spend less time in public spaces, captured by our average mobility metric, they conversely spend more time at home. Owing to this collinearity, our model is unable to distinguish between the independent contributions of these covariates, with most of the effect assigned to the average mobility coefficient, due to its greater explanatory power. As a check that our overall findings were not biased by this collinearity, we verified that the posterior estimates of these coefficients were not correlated.

Short-term forecasts

We used our model to produce short-term death forecasts. Figure 4 compares our forecasts for the 3 weeks after 01 June 2020 (blue line with shaded uncertainty intervals) with the recorded daily number of deaths during this period (coral bars). As expected from our R_t values, deaths were noticeably declining in the Northeastern Corridor, where R_t > 1, with particularly low error between our forecasts and reality in New York and Connecticut. In the South, we forecast a flattening or slight increase of deaths, especially in Arkansas, Texas and Florida.

**Fig. 4: Three-week death forecasts for model fitted up until 01 June 2020.**

We investigated the numerical accuracy of our forecast using three metrics: mean absolute error, continuous ranked probability score (CRPS) and coverage of credible intervals. We fitted our model to three end points: 1 May, 15 May and 1 June and performed 3-week forecasts from each end point. We compared the metric scores with a log-linear “null” model fit to 31 days of data prior to the three specified end points (see Supplementary Note 6 for further information). We find our model performs similarly to the null model (1 June) or better (15 May), however, our model fit to 1 May is worse than the null model because we only include cases after 11 May in our models. This suggests that including cases improves the forecasting ability of our model and further justifies our inclusion of them. The coverage of our credible intervals is good for all models, in particular our model and the null model fit to 1 June.

Model selection and sensitivity

Mobility data provided a proxy for the behavioural changes that occur in response to non-pharmaceutical interventions. Supplementary Note 7 shows the mobility trends for the 50 states and the District of Columbia up until 01 June 2020 (see section “Data” for a description of the mobility dimensions). The median correlation between the observed average mobility and the timing of the introduction of major NPIs (represented as step functions) was ~86% (see Supplementary Note 8). We make no explicit causal link between NPIs and mobility because this relationship is plausibly causally linked by other factors. The mobility trends data suggests that substantial early outbreak in New York state may have led to substantial changes in mobility in nearby states, like Connecticut, prior to any mandated interventions in those states, which supports including regions in our model. Including both mobility trends and the timing of imposition and lifting of “stay-at-home” orders did not affect the estimated cumulative attack rates (see Supplementary Note 9).

Mobility alone cannot fully capture how transmission evolves over time. In particular, it cannot capture the impact of case-based interventions (such as testing and tracing) or behaviour changes (such as mask wearing or hand-washing). We use a second-order, weekly, autoregressive process to allow our changes in transmission to be decoupled from mobility. This autoregressive process is an additional term in our parametric equation for R_t and accounts for residual effects by capturing a correlation structure where current R_t is correlated with previous weeks R_t. This means that our forecasts were equally good whatever combination of mobility covariates were used because this term could capture the unexplained behaviour. The learnt random effects from this process are shown in Supplementary Note 10 for all states along with the contributions to R_t from the mobility and autoregressive terms for three example states. The autoregressive term increases R_t before lockdown in New York, which could be explained by behaviour such as panic buying. In contrast, the autoregressive term reduces R_t in Montana and could reflect behavioural changes such as hand-washing and self isolation, which can reduce transmission with maintained mobility levels. The autoregressive term remains mostly constant in Washington and suggests that mobility is sufficient to capture the behaviour there.

Discussion

We developed a Bayesian semi-mechanistic modelling approach to investigate the impact of NPIs on the spread of SARS-CoV-2 in the United States through changes in mobility. Our model relies on death data from the start of the epidemic and recently reported case data to inform our predictions. This enabled us to estimate a realistic infection ascertainment ratio for the 3 weeks before 01 June 2020 for each state, which could help inform policy as to where testing may be lacking. The mean value of this ratio varies between 43% (Missouri) to 74% (Kansas and Tennessee). Our epidemiological grounded mechanistic model links unobserved infections to reported cases and deaths, all within a principled Bayesian statistical framework. This is a significant advancement over curve-fitting models fit directly to reported cases.

Our model suggests that although initial reductions in the daily infections had plateaued in most states by 01 June 2020, the reservoir of infectious individuals still remained large with approximately 0.01% of the population being infectious on that date. Despite this, the cumulative attack rate across the US still remained low. We found our attack rate for New York was in line with those from recent serological studies¹⁰. There is now evidence that mild infection is able to lead to robust immunity (via T cells) but potentially not induce antibody production, which are detected in serosurveys¹¹. Therefore, serosurveys might underestimate exposure, particularly in mild cases, and our model may provide an alternative way to measure population exposure. Our cumulative attack rates are, however, sensitive to the assumed values of infection fatality rate (IFR). We account for each individual state’s age structure, and further adjust for contact mixing patterns¹², but age-specific modelling may be necessary to capture potential changes in the demographics of cases in states such as Texas, Florida and South Carolina where there is evidence that younger people than were infected at the start of the epidemic are being infected^13,14.

We estimated that 23 states had a posterior mean reproduction number R_t below one on 01 June 2020 and in no states were we more than 95% confident that R_t was below one. We compared our estimates with predictions made by rt.live¹⁵ who use a method that fits the most likely R_t curve to the daily new daily cases (see Supplementary Note 11). Overall, our estimates were weakly correlated (ρ = 0.42) with both of us estimating R_t > 1 in 23 states (red points), including Montana and Alaska. However, the rt.live estimates are slightly more pessimistic because they predict R_t > 1 in ten states where we predict R_t < 1 (blue points). In contrast we predict R_t > 1 in five states where they predict R_t < 1 (green points). Both sets of reproduction numbers strongly implied that the US epidemic was not under control in many states, and that in the presence of continued migration and the loosening of interventions seen in June, increased infections were to be expected with high probability. We found that state with high reproduction numbers on 01 June 2020 were geographically clustered in the west and south US, whilst the states that had suffered high COVID-19 mortality (such as the Northeast Corridor) in the early phase of the epidemic had lower reproduction numbers. After the period covered by this study, reported cases began to increase in the US, and seven states (Arizona, Arkansas, California, North Carolina, South Carolina, Tennessee and Texas) had recorded higher levels of hospitalisations in early July than before^16,17. This suggests our estimates that R_t was not less than one were accurate. More recent estimates of R_t, the number of infections, and the number of people currently infectious are presented on our website https://mrc-ide.github.io/covid19usa/.

Our 3-week forecasts of daily deaths were highly accurate, confirming the predictive validity of our modelling approach, despite our having kept mobility constant during our forecasts. These forecasts, alongside our R_t values, show that the epidemic was not under control at the end of May. The accuracy of our forecasts varied during the epidemic and could be due to our assumption that mobility is kept constant over these 3 weeks. Our forecast would perform worse in weeks where mobility was significantly different to the last week of our model fit. When we include cases in our model, we are able to get similar results to a simple “null” model whilst also being about to estimate effect sizes of different mobility trends. We also compared our cumulative death forecasts with those presented by Friedman et al.¹⁸. Friedman et al. compared the median absolute percentage error (MAPE) for SEIR and dynamic growth rate types of models for models fit to some point in June. Unlike those models, we find the MAPE of our cumulative death forecasts did not increase significantly over time and our 3-week median cumulative death MAPE across all states (9.9%) was similar to the US estimate from Friedman et al. (4.1–8.6), see Supplementary Note 12 for more information.

Our model uses mobility to predict SARS-COV2 transmission. We find that the timings that non-pharmaceutical interventions were implemented was strongly correlated to changes in mobility. This is similar to findings in Abouk and Heydari⁵ who find that statewide stay-at-home orders had the strongest causal impact on reducing social interaction and that these orders significantly increase the presence of individuals at home by about six fold (our “residential mobility trend”). This supports our choice of using mobility instead of the timings of NPIs in this study instead of the times of interventions as in Flaxman et al.⁶. We find that magnitude of the reductions in average mobility, and the resulting increases in residential mobility, are important in determining the size of reduction in R_t. This agrees with Wang et al.¹⁹ who use a stochastic age- and risk-structured susceptible-exposed-asymptomatic-symptomatic-hospitalised-recovered (SEAYHR) model to considered the effect of various levels of social distancing. They found that social distancing measures, which reduced non-household contacts by <50%, would not prevent a healthcare crisis and that only their 75% and 90% contact reduction scenarios were projected to enable metropolitan areas to remain within healthcare levels.

While mobility, or social distancing measures, will explain a large amount of the trend in R_t, there is likely to be substantial residual variation from other behavioural changes such as mask wearing and hand-washing. We accounted for this residual variation through a second-order, weekly, autoregressive process. This stochastic process captures changes in R_t reflected in the data, but is unable to attribute these changes to other determinants of transmission or interventions. We pool parameters in our model to leverage as much signal in our data as possible and to reflect the conjoined nature of some states, in particular in the Northeastern Corridor. While this sharing can potentially lead to over or under estimation of effect sizes, it also means that a consistent signal for all states can be estimated before that signal is presented in an individual state with little data, such as Alaska and Hawaii. Pooling also increases the robustness of our models to under reporting and time lags^6,7,8.

Methods

Flaxman et al.⁶ introduced a Bayesian model for estimating the transmission intensity and attack rate (percentage of the population that has been infected) from COVID-19 from the reported number of deaths. This framework used the time-varying reproduction number R_t to inform a latent function for infections, and then these infections, together with probabilistic lags, were calibrated against observed deaths. Observed deaths, while still susceptible to under reporting and delays, comprise a more consistent time series than the reported number of confirmed cases, which are susceptible to changes in the probability of ascertainment over the course of the epidemic as testing strategies changed. Our model code is available on GitHub. Analysis was done using RStan²⁰ version 2.19.3 within R version 3.6.3.

We adapted the original Bayesian semi-mechanistic model of the infection cycle to all the states in the US and the District of Columbia to infer the reproduction number over time (R_t), plausible upper and lower bounds (95% Bayesian credible intervals) of the total populations infected (attack rates) and the number of people currently infected on 01 June 2020. In this paper, we also include the reported number of cases after 11 May 2020, see Supplementary Note 13. This reflects the point in time when over 375,000 tests were being done each day across the US. We include this in our likelihood but do not use them to calculate transmission directly. We parametrise R_t as a function of Google mobility data and include an autoregressive term to capture non-mobility driven behaviour. We fit our model jointly to COVID-19 data from all states to assess the attack rates and number of people who were currently infected. Finally, we use our model to forecast for 3 weeks from 01 June 2020 and compare our estimates of deaths to those recorded in the US for each state. We assume mobility remains constant at the previous value of mobility on the same day the previous week in our forecasts and the autoregressive term remains constant.

Data

Our model uses daily real-time state-level aggregated data published by New York Times (NYT)²¹ for New York State and John Hopkins University (JHU)² for the remaining states. We include 105,006 deaths in our model up until 1 June and 479,422 cases from 11 May to 1 June. Age-specific population counts were drawn from the U.S. Census Bureau in 2018²² to estimate state-specific infection fatality ratio reflective of the population age structure. The timing of NPIs were collated by the University of Washington²³. We used Google’s COVID-19 Community Mobility Report⁴, which provides data on movement in the US by states and highlights the percent change in visits to:

Grocery & pharmacy: mobility trends for places like grocery markets, food warehouses, farmers markets, speciality food shops, drug stores, and pharmacies.
Parks: mobility trends for places like local parks, national parks, public beaches, marinas, dog parks, plazas, and public gardens.
Transit stations: mobility trends for places like public transport hubs such as subway, bus, and train stations.
Retail & recreation: mobility trends for places like restaurants, cafes, shopping centres, theme parks, museums, libraries, and movie theatres.
Residential: mobility trends for places of residence.
Workplaces: mobility trends for places of work.

The residential data includes length of stay at different places compared to a baseline, whereas the other mobility trends are based on number of visits to a certain place. These trends are, therefore, relative, i.e., mobility of −20% means that, compared to normal circumstances individuals are engaging in a given activity 20% less.

Model specifics

The true number of infected individuals, i, is modelled using a discrete renewal process. We specify a generation distribution g with density g(τ) as:

$$g \sim {\rm{Gamma}}(6.5,0.62).$$

(1)

Given the generation distribution, the number of infections i_t,m on a given day t, and state m, is given by the following discrete convolution function:

$$\begin{array}{rcl}{i}_{t,m}&=&{S}_{t,m}{R}_{t,m}\mathop{\sum }\nolimits_{\tau = 0}^{t-1}{i}_{\tau ,m}{g}_{t-\tau },\\ {S}_{t,m}&=&1-\frac{\mathop{\sum }\nolimits_{j = 0}^{t-1}{i}_{j,m}}{{N}_{m}},\hfill\end{array}$$

(2)

where the generation distribution is discretised by ${g}_{s}=\mathop{\int}\nolimits_{s-0.5}^{s+0.5}g(\tau )d\tau$ for s = 2, 3, . . . , and ${g}_{1}=\mathop{\int}\nolimits_{0}^{1.5}g(\tau )d\tau$. The population of state m is denoted by N_m. We include the adjustment factor S_t,m to account for the number of susceptible individuals left in the population.

Both deaths and cases are observed in our model. We define daily deaths, D_t,m, for days t ∈ {1, …, n} and states m ∈ {1, …, M}. These daily deaths are modelled using a positive real-valued function ${d}_{t,m}={\mathbb{E}}[{D}_{t,m}]$ that represents the expected number of deaths attributed to COVID-19. The daily deaths D_t,m are assumed to follow a negative binomial distribution with mean d_t,m and variance ${d}_{t,m}+\frac{{d}_{t,m}^{2}}{{\psi }_{1}}$, where ψ₁ follows a positive half normal distribution, i.e.,

$${D}_{t,m} \sim \,{\text{Negative}} \;{\text{binomial}}\,\left({d}_{t,m},{d}_{t,m}+\frac{{d}_{t,m}^{2}}{{\psi }_{1}}\right),\quad t=1,\ldots ,n$$

(3)

$${\psi }_{1} \sim {{\mathcal{N}}}^{+}(0,5).$$

(4)

Here, ${\mathcal{N}}(\mu ,\sigma )$ denotes a normal distribution with mean μ and standard deviation σ. We say that X follows a positive half normal distribution ${{\mathcal{N}}}^{+}(0,\sigma )$ if X ~ ∣Y∣, where $Y \sim {\mathcal{N}}(0,\sigma )$.

We link our observed deaths mechanistically to transmission as in Flaxman et al.⁶. We use a previously estimated COVID-19 infection fatality ratio (IFR, probability of death given infection) together with a distribution of times from infection to death π. Details of this calculation can be found in^24,25. From the above, every region has a specific mean infection fatality ratio ifr_m (see Supplementary Note 13). To incorporate the uncertainty inherent in this estimate we allow the ifr_m for every state to have additional noise around the mean. Specifically we assume

$$if{r}_{m}^{* } \sim if{r}_{m}\cdot N(1,0.1).$$

(5)

We believe a large-scale contact survey similar to polymod¹² has not been collated for the USA, so we assume the contact patterns are similar to those in the UK. We conducted a sensitivity analysis, shown in Supplementary Note 13, and found that the IFR calculated using the contact matrices of other European countries lay within the posterior of $if{r}_{m}^{* }$.

Using estimated epidemiological information from previous studies, we assume the distribution of times from infection to death π (infection-to-death) to be the convolution of an infection-to-onset distribution ($\pi ^{\prime}$)²⁵ and an onset-to-death distribution²⁴:

$$\pi \sim {\rm{Gamma}}(5.1,0.86)+{\rm{Gamma}}(17.8,0.45).$$

(6)

The expected number of deaths d_t,m, on a given day t, for state m is given by the following discrete sum:

$${d}_{t,m}=if{r}_{m}^{* }\mathop{\sum }\nolimits_{\tau = 0}^{t-1}{i}_{\tau ,m}{\pi }_{t-\tau },$$

(7)

where i_τ,m is the number of new infections on day τ in state m and where, similar to the generation distribution, π is discretized via ${\pi }_{s}=\mathop{\int}\nolimits_{s-0.5}^{s+0.5}\pi (\tau )d\tau$ for s = 2, 3, . . . , and ${\pi }_{1}=\mathop{\int}\nolimits_{0}^{1.5}\pi (\tau )d\tau$, where π(τ) is the density of π.

For every state m, we also observe daily cases C_t,m after t_c = 11 May 2020. Similar to daily deaths, daily cases are modelled using a positive real-valued function ${\bar{c}}_{t,m}={\mathbb{E}}[{C}_{t,m}]$ that represents the expected number of symptomatic and asymptomatic cases. Again, the daily cases C_t,m are assumed to follow a negative binomial distribution but with mean c_t,m and variance ${c}_{t,m}+\frac{{c}_{t,m}^{2}}{{\psi }_{2}}$, where ψ₂ follows a positive half normal distribution, i.e.,

$${C}_{t,m} \sim \,{\text{Negative}} \;{\text{binomial}}\,\left({c}_{t,m},{c}_{t,m}+\frac{{c}_{t,m}^{2}}{{\psi }_{2}}\right),\quad t={t}_{c},\ldots ,n,$$

(8)

$${\psi }_{2} \sim {{\mathcal{N}}}^{+}(0,5).$$

(9)

As before, we assume the distribution of times from infection to becoming a case $\pi ^{\prime}$ (infection-to-onset) to be

$${\pi }^{\prime} \sim {\rm{Gamma}}(5.1,0.86).$$

(10)

We add in a new link between our observed daily cases and our estimated daily infections. We use our model to estimate an infection ascertainment ratio (iar_m) for each state m, which is defined as the number of reported cases divided by the true number of infections (including both symtomatic and asymptomatic infections). This follows a Beta distribution, specifically u_m ~ Beta(12, 5).

The expected number of cases c_t,m, on a given day t, for state m is given by the following discrete sum:

$${c}_{t,m}=ia{r}_{m}\mathop{\sum }\nolimits_{\tau = 0}^{t-1}{i}_{\tau ,m}{\pi }_{t-\tau }^{\prime},$$

(11)

where, again, c_τ,m is the number of new infections on day τ in state m and where $\pi ^{\prime}$ is discretized via ${\pi }_{s}^{\prime}=\mathop{\int}\nolimits_{s-0.5}^{s+0.5}\pi ^{\prime} (\tau )d\tau$ for s = 2, 3, . . . , and ${\pi }_{1}^{\prime}=\mathop{\int}\nolimits_{0}^{1.5}\pi ^{\prime} (\tau )d\tau$, where $\pi ^{\prime} (\tau )$ is the density of $\pi ^{\prime}$.

We parametrise R_t,m as a linear function of the relative change in time spent and number of visits (from a baseline)

$${R}_{t,m}={R}_{0,m}\cdot f\left(-\left(\mathop{\sum }\nolimits_{k = 1}^{2}{X}_{t,m,k}{\alpha }_{k}\right)-\mathop{\sum }\nolimits_{l = 1}^{2}{Y}_{t,m,l}{\alpha }_{r(m),l}^{{\rm{region}}}-{Z}_{t,m}{\alpha }_{m}^{{\rm{state}}}-{\epsilon }_{m,{w}_{m}(t)}\right),$$

(12)

where $f(x)=2\exp (x)/(1+\exp (x))$ is twice the inverse logit function. X_t,m,k are covariates that have the same effect for all states, Y_t,m,l is a covariate that has region-specific effects, r(m) ∈ {1, …, R} is the region a state is in (see Supplementary Note 7), Z_t,m is a covariate that has a state-specific effect and ${\epsilon }_{m,{w}_{m}(t)}$ is a weekly AR(2) process, centred around 0, that captures variation between states that is not explained by the covariates.

The prior distribution for R_0,m²⁶ was chosen to be

$${R}_{0,m} \sim {\mathcal{N}}(3.28,\kappa )\,{\rm{with}}\,\kappa \sim {{\mathcal{N}}}^{+}(0,0.5),$$

(13)

where κ is the same among all states.

In the analysis of this paper we chose the following covariates: ${X}_{t,m,1}={M}_{t,m}^{{\rm{average}}}$, ${X}_{t,m,2}={M}_{t,m}^{{\rm{residential}}}$, Y_t,m,1 = 1 (an intercept), ${Y}_{t,m,2}={M}_{t,m}^{{\rm{average}}}$ and ${Z}_{t,m}={M}_{t,m}^{{\rm{average}}}$, where the mobility variables are from⁴ and defined as follows (all are encoded so that 0 is the baseline and 1 is a full reduction of the mobility in this dimension):

${M}_{t,m}^{{\rm{average}}}$ is an average of retail and recreation, groceries and pharmacies, and workplaces. An average is taken as these dimensions are strongly collinear.
${M}_{t,m}^{{\rm{residential}}}$ are the mobility trends for places of residences.

We include regional, as well as state-level parameters, in our model to encapsulate the connected nature of states. This was particularly important in the Northeasten corridor where residents in New Jersey and Connecticut regularly commuted into New York, the early epicentre of the US epidemic (see Supplementary Note 1 for a map of the regions). Regions are based on US Census Divisions, modified to account for coordination between groups of state governments²⁷.

We assume that seeding of new infections begins 30 days before the day after a state has cumulatively observed 10 deaths. From this date, we seed our model with 6 sequential days of an equal number of infections: ${i}_{1,m}=\ldots ={i}_{6,m} \sim {\rm{Exponential}}\left(\frac{1}{\tau }\right)$, where τ ~ Exponential(0.03). These seed infections are inferred in our Bayesian posterior distribution.

The weekly, state-specific effect is modelled as a weekly AR(2) process, centred around 0 with stationary standard deviation σ_w that, in every state, starts on the first day of its seeding of infections, i.e., 30 days before a total of 10 cumulative deaths have been observed in this state. The AR(2) process starts with ${\epsilon }_{1,m} \sim {\mathcal{N}}(0,{\sigma }_{w}^{* })$,

$${\epsilon }_{w,m} \sim {\mathcal{N}}({\rho }_{1}{\epsilon }_{w-1,m}+{\rho }_{2}{\epsilon }_{w-2,m},{\sigma }_{w}^{* })\,{\rm{for}}\,m=2,3,4,\ldots$$

(14)

with independent priors on ρ₁ and ρ₂ that are normal distributions conditioned to be in [0, 1]; the prior for ρ₁ is a ${\mathcal{N}}(0.8,0.05)$ distribution conditioned to be in [0, 1] and the prior for ρ₂ is a ${\mathcal{N}}(0.1,0.05)$ distribution, conditioned to be in [0, 1]. The prior for σ_w, the standard deviation of the stationary distribution of ϵ_w is chosen as ${\sigma }_{w} \sim {{\mathcal{N}}}^{+}(0,0.2)$. The standard deviation of the weekly updates to achieve this standard deviation of the stationary distribution is ${\sigma }_{w}^{* }={\sigma }_{w}\sqrt{1-{\rho }_{1}^{2}-{\rho }_{2}^{2}-2{\rho }_{1}^{2}{\rho }_{2}/(1-{\rho }_{2})}$. The conversion from days to weeks is encoded in w_m(t). Every 7 days, w_m is incremented, i.e., we set ${w}_{m}(t)=\lfloor (t-{t}_{m}^{{\rm{start}}})/7\rfloor +1$, where ${t}_{m}^{{\rm{start}}}$ is the first day of seeding. We keep the AR(2) process constant over the last 7 days of observations since this is less informed by data due to the lags and also over the forecast period.

The prior distribution for the shared coefficients were chosen to be

$${\alpha }_{k} \sim {\mathcal{N}}(0,0.5),k=1,\ldots ,3,$$

(15)

and the prior distribution for the pooled coefficients were chosen to be

$${\alpha }_{r,l}^{{\rm{region}}} \sim {\mathcal{N}}(0,{\gamma }_{r}),r=1,\ldots ,R,l=1,2,\,{\rm{with}}\,{\gamma }_{r} \sim {{\mathcal{N}}}^{+}(0,0.5),$$

(16)

$${\alpha }_{m}^{{\rm{state}}} \sim {\mathcal{N}}(0,{\gamma }_{s}),m=1,\ldots ,M\,{\rm{with}}\,{\gamma }_{s} \sim {{\mathcal{N}}}^{+}(0,0.5).$$

(17)

We estimated parameters jointly for all states in a single hierarchical model. Fitting was done in the probabilistic programming language Stan²⁰ using an adaptive Hamiltonian Monte Carlo (HMC) sampler.

Generated quantities

The effective number of infectious individuals, i^*, on a given day considers how infectious a previously infected individual is on a given day and includes both asymptotic and symptomatic individuals. It is calculated by first re-scaling the generation distribution by its maximum, i.e., ${g}_{\tau }^{* }=\frac{{g}_{\tau }}{\mathop{\max }\limits_{t}{g}_{t}}$. Based on (2), the number of infectious individuals is then calculated from the number of previously infected individuals, c, using the following:

$${i}_{t,m}^{* }=\mathop{\sum }\nolimits_{\tau = 0}^{t-1}{i}_{\tau ,m}{g}_{t-\tau }^{* },$$

(18)

where i_t,m is the number of new infections on day t in state m.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data necessary for the replication of our results is collated in https://github.com/ImperialCollegeLondon/covid19model. The death data originated from John Hopkins University https://github.com/CSSEGISandData/COVID-19 and the New York Times https://github.com/nytimes/covid-19-data.

Code availability

All code necessary for the replication of our results is collated in https://github.com/ImperialCollegeLondon/covid19model release 10.

References

Santa Clara County Public Health. County of Santa Clara Identifies Three Additional Early Covid-19 Deaths (2020). https://www.sccgov.org/sites/covid19/Pages/press-release-04-21-20-early.aspx.
Dong, E., Du, H. & Gardner, L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 20, 1473–3099 (2020).
Google Scholar
Courtemanche, C., Garuccio, J., Le, A., Pinkston, J. & Yelowitz, A. Strong social distancing measures in the United States reduced the COVID-19 growth rate. Health Aff. 39, 7 (2020).
Aktay, A. et al. Google COVID-19 Community Mobility Reports: Anonymization Process Description (version 1.0) (Google, 2020).
Abouk, R. & Heydari, B. The Immediate Effect of Covid-19 Policies on Social Distancing Behavior in the United States. (SSRN, 2020).
Flaxman, S. et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature 584, 257–261 (2020).
Vollmer, M. et al. Report 20: Using Mobility to Estimate the Transmission Intensity of COVID-19 in Italy: A Subnational Analysis with Future Scenarios (Imperial College COVID-19 Response Team, 2020).
Mellan, T. A. et al. Report 21-Estimating COVID-19 cases and reproduction number in Brazil (Imperial College COVID-19 Response Team, 2020).
U.S. Census Bureau. Cartographic Boundary Files-Shapefile. https://www.census.gov/geographies/mapping-files/time-series/geo/carto-boundary-file.html (U.S. Census Bureau, 2020).
Kissler, S. M. et al. Reductions in commuting mobility predict geographic differences in sars-cov-2 prevalence in new york city (2020). http://nrs.harvard.edu/urn-3:HUL.InstRepos:42665370.
Long, Q.-X. et al. Clinical and immunological assessment of asymptomatic sars-cov-2 infections. Nat. Med. 26, 1200–1204 (2020).
Mossong, J. et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLOS Med. 5, 1–1 (2008).
Article Google Scholar
Champagne, S. R. & Oxner, R. Surge in Coronavirus Cases Linked to More Texans in Their 20s Getting Sick, Officials Say (2020). https://www.texastribune.org/2020/06/16/texas-coronavirus-spike-young-adults/. Accessed 9 July 2020.
NY Times. Florida and South Carolina Again Set Records as U.s. Coronavirus Cases Surge (2020). https://www.nytimes.com/2020/06/20/world/coronavirus-updates.html#link-2204ff25. Accessed on 9 July 2020.
Systrom, K., Vladek, T. & Krieger, M. Model powering rt.live (2020).
Buffa, P. Coronavirus in the U.S.: Where Cases Are Growing and Declining. https://www.nationalgeographic.com/science/2020/05/graphic-tracking-coronavirus-infections-us/. (2020) Accessed on 26 June 2020.
Knowles, H. et al. Seven States Report Highest Coronavirus Hospitalizations since Pandemic Began https://www.washingtonpost.com/nation/2020/06/23/coronavirus-live-updates-us/. (2020). Accessed on 26 June 2020.
Friedman, J., Liu, P., Gakidou, E. & IHME COVID-19 Model Comparison Team. Predictive performance of international COVID-19 mortality forecasting models. medRxiv https://doi.org/10.1101/2020.07.13.20151233 (2020).
Wang, X. et al. Impact of social distancing measures on coronavirus disease healthcare demand, central Texas, USA. Emer. Infect. Dis. 26, 10 (2020).
Carpenter, B. et al. Stan: a probabilistic programming language. J. Stat. Softw. 76, 1–32 (2017).
Article Google Scholar
Smith, M. et al. Coronavirus (covid-19) Data in the United States (2020). https://github.com/nytimes/covid-19-data.
Reporter, C. Census Reporter (Census reporter, 2020). https://censusreporter.org.
Fullman, N. et al. State-level Social Distancing Policies in Response to Covid-19 in the US (2020). http://www.covid19statepolicy.org.
Verity, R. et al. Estimates of the severity of COVID-19 disease. Lancet Infect. Dis. 20, 669–677 (2020).
Walker, P. et al. The impact of covid-19 and strategies for mitigation and suppression in low- and middle-income countries. Science 369, 413–422 (2020). https://www.imperial.ac.uk/mrc-global-infectious-disease-analysis/news-wuhan-coronavirus/.
Liu, Y., Gayle, A., Wilder-Smith, A. & Rocklöv, J. The reproductive number of COVID-19 is higher compared to SARS coronavirus. J. Travel Med. 27, taaa021 (2020).
Reston, M., Sgueglia, K. & Mossburg, C. Governors on East and West Coasts Form Pacts to Decide When to Reopen Economies (2020). https://edition.cnn.com/2020/04/13/politics/states-band-together-reopening-plans/index.html.

Download references

Acknowledgements

We would like to thank Amazon AWS and Microsoft AI for health for computational credits and we would like to thank the Stan development team for their ongoing assistance. We would also like to thank David Joerg and Jacob Steinhardt for their comments through Open Review. This research was partly funded by the Imperial College COVID-19 Research Fund and was supported by Centre funding from the UK Medical Research Council under a concordat with the UK Department for International Development, the NIHR Health Protection Research Unit in Modelling Methodology and Community Jameel. H.J.T.U. is funded by Imperial College London through an Imperial College Research Fellowship grant. S.B. acknowledges the NIHR BRC Imperial College NHS Trust Infection and COVID themes, the Academy of Medical Sciences Springboard award and the Bill and Melinda Gates Foundation.

Author information

These authors contributed equally: H. Juliette T. Unwin, Swapnil Mishra, Valerie C. Bradley, Axel Gandy.
Unaffiliated: Fabian Valka.

Authors and Affiliations

MRC Centre for Global Infectious Disease Analysis, Abdul Latif Jameel Institute for Disease and Emergency Analytics (J-IDEA), Imperial College, London, UK
H. Juliette T. Unwin, Swapnil Mishra, Thomas A. Mellan, Helen Coupland, Michaela A. C. Vollmer, Charles Whittaker, Iwona Hawryluk, Philip Milton, Kylie E. C. Ainslie, Marc Baguelin, Nick F. Brazeau, Lorenzo Cattarino, Zulma Cucunuba, Gina Cuomo-Dannenburg, Ilaria Dorigatti, Oliver D. Eales, Sabine L. van Elsland, Richard G. FitzJohn, Katy A. M. Gaythorpe, William Green, Wes Hinsley, Benjamin Jeffrey, Edward Knock, Daniel J. Laydon, John Lees, Gemma Nedjati-Gilani, Pierre Nouvellet, Lucy Okell, Kris V. Parag, Igor Siveroni, Hayley A. Thompson, Patrick Walker, Caroline E. Walters, Oliver J. Watson, Lilith K. Whittles, Azra C. Ghani, Neil M. Ferguson, Steven Riley, Christl A. Donnelly & Samir Bhatt
Department of Statistics, University of Oxford, Oxford, UK
Valerie C. Bradley, Michael Hutchinson & Christl A. Donnelly
Department of Mathematics, Imperial College, London, UK
Axel Gandy, Jonathan Ish-Horowicz, Sarah L. Filippi, Xiaoyue Xi, Mélodie Monod, Oliver Ratmann, Harrison Zhu & Seth Flaxman
Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK
Marc Baguelin
NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, Imperial College London, London, UK
Adhiratha Boonyasiri
MRC Centre for Global Infectious Disease Analysis, Imperial College, London, UK
Jeffrey W. Eaton
School of Life Sciences, University of Sussex, Brighton, UK
Pierre Nouvellet
Department of Laboratory Medicine and Pathology, Brown University, Providence, RI, USA
Oliver J. Watson

Authors

H. Juliette T. Unwin
View author publications
You can also search for this author in PubMed Google Scholar
Swapnil Mishra
View author publications
You can also search for this author in PubMed Google Scholar
Valerie C. Bradley
View author publications
You can also search for this author in PubMed Google Scholar
Axel Gandy
View author publications
You can also search for this author in PubMed Google Scholar
Thomas A. Mellan
View author publications
You can also search for this author in PubMed Google Scholar
Helen Coupland
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Ish-Horowicz
View author publications
You can also search for this author in PubMed Google Scholar
Michaela A. C. Vollmer
View author publications
You can also search for this author in PubMed Google Scholar
Charles Whittaker
View author publications
You can also search for this author in PubMed Google Scholar
Sarah L. Filippi
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyue Xi
View author publications
You can also search for this author in PubMed Google Scholar
Mélodie Monod
View author publications
You can also search for this author in PubMed Google Scholar
Oliver Ratmann
View author publications
You can also search for this author in PubMed Google Scholar
Michael Hutchinson
View author publications
You can also search for this author in PubMed Google Scholar
Fabian Valka
View author publications
You can also search for this author in PubMed Google Scholar
Harrison Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Iwona Hawryluk
View author publications
You can also search for this author in PubMed Google Scholar
Philip Milton
View author publications
You can also search for this author in PubMed Google Scholar
Kylie E. C. Ainslie
View author publications
You can also search for this author in PubMed Google Scholar
Marc Baguelin
View author publications
You can also search for this author in PubMed Google Scholar
Adhiratha Boonyasiri
View author publications
You can also search for this author in PubMed Google Scholar
Nick F. Brazeau
View author publications
You can also search for this author in PubMed Google Scholar
Lorenzo Cattarino
View author publications
You can also search for this author in PubMed Google Scholar
Zulma Cucunuba
View author publications
You can also search for this author in PubMed Google Scholar
Gina Cuomo-Dannenburg
View author publications
You can also search for this author in PubMed Google Scholar
Ilaria Dorigatti
View author publications
You can also search for this author in PubMed Google Scholar
Oliver D. Eales
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey W. Eaton
View author publications
You can also search for this author in PubMed Google Scholar
Sabine L. van Elsland
View author publications
You can also search for this author in PubMed Google Scholar
Richard G. FitzJohn
View author publications
You can also search for this author in PubMed Google Scholar
Katy A. M. Gaythorpe
View author publications
You can also search for this author in PubMed Google Scholar
William Green
View author publications
You can also search for this author in PubMed Google Scholar
Wes Hinsley
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Jeffrey
View author publications
You can also search for this author in PubMed Google Scholar
Edward Knock
View author publications
You can also search for this author in PubMed Google Scholar
Daniel J. Laydon
View author publications
You can also search for this author in PubMed Google Scholar
John Lees
View author publications
You can also search for this author in PubMed Google Scholar
Gemma Nedjati-Gilani
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Nouvellet
View author publications
You can also search for this author in PubMed Google Scholar
Lucy Okell
View author publications
You can also search for this author in PubMed Google Scholar
Kris V. Parag
View author publications
You can also search for this author in PubMed Google Scholar
Igor Siveroni
View author publications
You can also search for this author in PubMed Google Scholar
Hayley A. Thompson
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Walker
View author publications
You can also search for this author in PubMed Google Scholar
Caroline E. Walters
View author publications
You can also search for this author in PubMed Google Scholar
Oliver J. Watson
View author publications
You can also search for this author in PubMed Google Scholar
Lilith K. Whittles
View author publications
You can also search for this author in PubMed Google Scholar
Azra C. Ghani
View author publications
You can also search for this author in PubMed Google Scholar
Neil M. Ferguson
View author publications
You can also search for this author in PubMed Google Scholar
Steven Riley
View author publications
You can also search for this author in PubMed Google Scholar
Christl A. Donnelly
View author publications
You can also search for this author in PubMed Google Scholar
Samir Bhatt
View author publications
You can also search for this author in PubMed Google Scholar
Seth Flaxman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.J.T.U., S.M., V.C.B., A.G., S.B. and S.F. conceived and designed the study. H.J.T.U., S.M., T.A.M., M.A.C.V., H.Z. and P.M. performed mobility analysis. J.I.-H., S.L.F. and X.X. contributed to statistical analysis and M.H. and I.H. did other analysis. H.J.T.U., S.M., A.G., T.A.M., H.C.,. M.A.C.V., V.W., M.M., O.R., S.B. and S.F. contributed to code development. H.J.T.U., M.A.C.V. and S.F. did the plotting. S.M. and F.V. created the website. H.J.T.U., V.C.B., S.B. and S.F. wrote the first draft of the paper. All authors (H.J.T.U, S.M., V.C.B., A.G., T.A.M., H.C., J.I.-H., M.A.C.V., C.W., S.L.F., X.X., M.M., O.R., M.H., F.V., H.Z., I.H., P.M., K.E.C.A., M.B., A.B., N.F.B., L.C., Z.C., G.C.-D., I.D., O.D.E., J.W.E., S.L.E., R.G.F., K.A.M.G., W.G., W.H., B.J., E.K., D.J.L., J.L., G.N.-G., P.N., L.O., K.V.P., I.S., H.A.T., P.W., C.E.W., O.J.W., L.K.W., A.C.G., N.M.F., S.R., C.A.D., S.B. and S.F.) discussed the results and contributed to the revision of the final manuscript.

Corresponding authors

Correspondence to H. Juliette T. Unwin, Samir Bhatt or Seth Flaxman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thank Joel Hellewell and the other, anonymous reviewer(s) for their contribution to the peer review of this work. Peer review reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Unwin, H.J.T., Mishra, S., Bradley, V.C. et al. State-level tracking of COVID-19 in the United States. Nat Commun 11, 6189 (2020). https://doi.org/10.1038/s41467-020-19652-6

Download citation

Received: 15 July 2020
Accepted: 15 October 2020
Published: 03 December 2020
DOI: https://doi.org/10.1038/s41467-020-19652-6

This article is cited by

Quality assessment and community detection methods for anonymized mobility data in the Italian Covid context
- Jules Morand
- Shoichi Yip
- Luca Tubiana
Scientific Reports (2024)
Artificial intelligence-based framework to identify the abnormalities in the COVID-19 disease and other common respiratory diseases from digital stethoscope data using deep CNN
- Kranthi Kumar Lella
- M. S. Jagadeesh
- P. J. A. Alphonse
Health Information Science and Systems (2024)
Reproduction number projection for the COVID-19 pandemic
- Ryan Benjamin
Advances in Continuous and Discrete Models (2023)
Mass gatherings for political expression had no discernible association with the local course of the COVID-19 pandemic in the USA in 2020 and 2021
- Eric Feltham
- Laura Forastiere
- Nicholas A. Christakis
Nature Human Behaviour (2023)
Epidemic modelling of monitoring public behavior using surveys during pandemic-induced lockdowns
- Andreas Koher
- Frederik Jørgensen
- Sune Lehmann
Communications Medicine (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.