Burden and characteristics of COVID-19 in the United States during 2020

Pei, Sen; Yamana, Teresa K.; Kandula, Sasikiran; Galanti, Marta; Shaman, Jeffrey

doi:10.1038/s41586-021-03914-4

Download PDF

Article
Published: 26 August 2021

Burden and characteristics of COVID-19 in the United States during 2020

Nature volume 598, pages 338–341 (2021)Cite this article

44k Accesses
102 Citations
1033 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 17 December 2021

This article has been updated

Abstract

The COVID-19 pandemic disrupted health systems and economies throughout the world during 2020 and was particularly devastating for the United States, which experienced the highest numbers of reported cases and deaths during 2020^1,2,3. Many of the epidemiological features responsible for observed rates of morbidity and mortality have been reported^4,5,6,7,8; however, the overall burden and characteristics of COVID-19 in the United States have not been comprehensively quantified. Here we use a data-driven model-inference approach to simulate the pandemic at county-scale in the United States during 2020 and estimate critical, time-varying epidemiological properties underpinning the dynamics of the virus. The pandemic in the United States during 2020 was characterized by national ascertainment rates that increased from 11.3% (95% credible interval (CI): 8.3–15.9%) in March to 24.5% (18.6–32.3%) during December. Population susceptibility at the end of the year was 69.0% (63.6–75.4%), indicating that about one third of the US population had been infected. Community infectious rates, the percentage of people harbouring a contagious infection, increased above 0.8% (0.6–1.0%) before the end of the year, and were as high as 2.4% in some major metropolitan areas. By contrast, the infection fatality rate fell to 0.3% by year’s end.

Long COVID: major findings, mechanisms and recommendations

Article 13 January 2023

Infectious disease in an era of global change

Article 13 October 2021

Persistence in risk and effect of COVID-19 vaccination on long-term health consequences after SARS-CoV-2 infection

Article Open access 26 February 2024

Main

During 2020, the United States documented more COVID-19 cases and deaths than any other country in the world¹. The first US COVID-19 case was identified in Washington state on 20 January 2020². Over the course of the year, three pandemic waves took place: (1) a spring outbreak in select, mostly urban areas following the introduction of the virus to the United States; (2) a summer wave that predominantly affected the southern half of the country; and (3) an autumn–winter wave that remained pervasive until the spring of 2021. To understand the transmission of the virus and better control its progression in the future, it is vital that the epidemiological features that have supported these outbreaks are quantified and analysed in both space and time.

Here we use a county-resolved metapopulation model to simulate the transmission of SARS-CoV-2 within and between the 3,142 counties of the United States. The model depicts both documented and undocumented infections and is coupled with an iterative Bayesian inference algorithm—the ensemble adjustment Kalman filter—which assimilates observations of daily cases in each county, as well as population movement between counties^9,10 (Supplementary Information). The Bayesian inference supports a fitting of the model to case observations and estimation of unobserved state variables (for example, population susceptibility within a county) and system parameters (for example, the ascertainment rate in each county). Synthetic tests indicate that the inference approach can recover key time-varying parameters across a diversity of simulation scenarios (Extended Data Fig. 1). The model fitting to observed case data captures the three waves of the outbreak as manifest at national scales (Fig. 1a), as well as in major metropolitan areas and at county scales (Extended Data Fig. 2). These inference results are robust to parameter settings and model configurations (Extended Data Figs. 3, 4, Supplementary Information).

**Fig. 1: Model calibration and ascertainment rate.**

To further validate the fitting, we compared model estimates of cumulative infections to findings from US Centers for Disease Control and Prevention (CDC) seroprevalence surveys conducted at site and state levels³. The seroprevalence data, which provide an out-of-sample corroboration of the model fitting, were adjusted for the waning of antibody levels following adaptive immune response^11,12 (Extended Data Fig. 5, Supplementary Information). Model estimates of cumulative infected percentages are well aligned with adjusted seroprevalence estimates from the CDC 10-site survey across sites and through time (Pearson’s r = 0.97, mean absolute error (MAE) = 1.31%) (Fig. 1b) and are similarly well matched to adjusted estimates at the state level (Extended Data Fig. 6). In addition, the seroprevalence generated using the estimated daily infections adjusted for seroreversion also matches the observed seroprevalence, and the results are robust to assumed use of a lower-sensitivity seroassay (Extended Data Fig. 6).

A critical feature of SARS-CoV-2 is its ability to infect and transmit largely from individuals who have not been diagnosed with the virus⁴. The model structure and fitting enable estimation of the ascertainment rate, the percentage of infections confirmed diagnostically, at county scales. The national population-weighted ascertainment rate averaged for all of 2020 was 21.8% (95% CI: 15.9–30.3%), similar to an estimate derived from surveys on healthcare-seeking behaviours¹³. This national ascertainment rate increased from 11.3% (8.3–15.9%) during March 2020 to 24.5% (18.6–32.3%) during December 2020 (Fig. 1c). The increase through time is a likely by-product of increasing testing capacity, a relaxation of initial restrictions on test usage, and increasing recognition, concern and care-seeking among the public. We additionally focus on five metropolitan areas in the United States. Small differences in the ascertainment rate manifest across these areas—in particular, ascertainment rates for Phoenix and Miami were higher than the national average for much of the year, whereas those for New York City, Chicago and Los Angeles were consistently below the national average.

At the national level, three pandemic waves were evident during spring, summer and autumn–winter (Fig. 1a); however, the structure differs among the five focus metropolitan areas, with New York and Chicago experiencing strong spring and autumn–winter waves but little activity during summer, Los Angeles and Phoenix undergoing summer and autumn–winter waves, and Miami experiencing all three waves (Extended Data Fig. 2). Los Angeles County, the largest county in the United States, with a population of more than 10 million people, was particularly severely affected during autumn–winter. The differences in virus activity produced different cumulative infection numbers through time (Fig. 2a). Population susceptibility at the end of the year was 69.0% (63.6–75.4%) for the United States, and among the focal metropolitan areas it ranged from 47.6% (37.2–54.8%) in Los Angeles to 73.2% (68.3–77.8%) in Phoenix. Although there is variability among counties, a substantial portion of the US population (69.0%) had not been infected by the end of 2020; however, pockets of lower population susceptibility, which are evident in the southwest and southeast on 1 August 2020 (Fig. 2b), expanded considerably by 31 December 2020 (Fig. 2c). In particular, areas of the upper Midwest and Mississippi valley, including the Dakotas, Minnesota, Wisconsin and Iowa, are estimated to have population susceptibility below 40% as of 31 December 2020.

**Fig. 2: Estimates of population susceptibility.**

The structure of the outbreak is evident in both incidence and prevalence estimates (Fig. 3, Extended Data Fig. 7). Incidence indicates the daily number of newly infectious individuals—both confirmed cases of COVID-19 and those whose infections remain undocumented. The majority of infections each month are undocumented (Fig. 3a), as indicated by the low ascertainment rates (Fig. 1c). For all of 2020, an estimated 78.2% of infections in the United States were undocumented. Estimates of daily prevalence provide a measure of the community infectious rate (CIR), the fraction of the population currently harbouring a contagious infection. The national SARS-CoV-2 CIR was 0.77% (0.60–0.98%) on 31 December 2020, indicating that roughly 1 in 130 people was contagious (a similar percentage, 0.83% (0.52–1.26%), was estimated to be latently infected—that is, infected but not yet contagious) (Fig. 3b). Among the 5 focal metropolitan areas, the CIR varied considerably: in mid-November, Chicago reached a CIR of 1.51% (1.27–1.82%); whereas in Miami CIR increased to 1.25% (1.03–1.53%) during July. Los Angeles was even more burdened at the end of 2020, with a CIR of 2.42% (2.05–2.86%) as of 31 December 2020 (Extended Data Fig. 7).

**Fig. 3: Estimated transmission and characteristics of COVID-19 in the United States.**

The model fitting enables estimation of the case fatality rate (CFR) and the infection fatality rate (IFR). Using public line-list data from the CDC¹⁴, we estimated the distribution of time lag from case confirmation to death for each county and, using these estimates, deconvolved observed deaths to their date of case reporting¹⁵ (Extended Data Figs. 8, 9, Supplementary Information). CFR and IFR were then generated using these deconvolved death data. Both rates were highest nationally at the beginning of the spring wave: the CFR was 7.1% (4.8–9.8%) and the IFR was 0.77% (0.51–1.25%) during April (Fig. 3c). The national cumulative IFR up to 1 June was 0.69% (0.47–1.04%), in line with previous studies^5,6,7 (Extended Data Fig. 2, Supplementary Information). Over the course of the year, with earlier diagnosis and treatment, improved patient care^16,17,18 and—in the case of CFR—increased reporting of mild infections, the CFR and IFR dropped to 1.29% (0.98–1.68%) and 0.31% (0.22–0.44%) by December 2020, respectively. Both rates varied by location and over time; for instance, intermediate drops of CFR and IFR began for Chicago, Phoenix and Miami during the summer wave, in association with a decrease of the average age of hospitalized patients (Extended Data Fig. 8). During the winter of 2020, the CFR and IFR in most metropolitan areas increased slightly, possibly driven by greater hospitalization rates among older individuals (Extended Data Fig. 8) and strained healthcare resources¹⁹. Overall, these findings delineate the mortality risk associated with infection broadly. The national IFR during the latter half of 2020 hovers around 0.30%, well above estimates for both seasonal influenza²⁰ (<0.08%) and the 2009 influenza pandemic²¹ (0.0076%). As COVID-19 deaths are likely to be under-reported, our estimate of IFR could be biased low.

We further examined the change of the reproduction number R_t, in response to changing local, reported COVID-19 case numbers in five US regions (Northeast, Southeast, Midwest, Southwest and West) during the spring, summer and autumn–winter (Supplementary Information). Results indicate that communities with increasing cases showed greater reductions of R_t (Extended Data Fig. 10). However, the rate of reduction in R_t decreased over successive waves. These findings are potentially driven by a number of factors modulating the reproduction number, including changing compliance with non-pharmaceutical interventions²² and seasonal modulation of virus transmissibility²³. A more thorough analysis of this preliminary finding is needed.

The United States experienced the highest numbers of confirmed COVID-19 cases and deaths in the world during 2020¹. Our findings provide quantification of the time-evolving epidemiological characteristics associated with successive pandemic waves in the United States, as well as conditions at the end of the year and prospects for 2021. Critically, despite more than 19.6 million reported cases by the end of 2020, an estimated 69% of the population remained susceptible to viral infection. Several factors will considerably alter population susceptibility in the coming months. First, ongoing transmission will infect naive hosts and continue to deplete the susceptible pool. Second, as more vaccine is distributed and administered, more individuals will be protected against symptomatic infection and the IFR will decrease. Finally, our model does not represent reinfection, either through waning immunity or immune escape; however, reinfection has been documented^24,25, evidence of waning antibody levels exists^26,27, and new variants of concern have emerged^28,29 and will probably continue to do so. All these processes will affect population susceptibility over time and help to determine when society enters a post-pandemic phase, the pattern of endemicity the virus ultimately assumes and its long-term public health burden³⁰.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.

Data availability

The human mobility and COVID-19 surveillance data that support the findings of this study are available at GitHub (https://github.com/SenPei-CU/COVID_US_2020). The county-level COVID-19 surveillance data for the United States are available at Johns Hopkins University coronavirus resource center (https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series). County-to-county commuting data were downloaded from the US Census Bureau (https://www.census.gov/data/tables/2015/demo/metro-micro/commuting-flows-2015.html). Human mobility data in 2020 were provided by SafeGraph (https://safegraph.com/), which aggregates anonymized location data from numerous applications to provide insights about physical places, via the SafeGraph Community. To enhance privacy, SafeGraph excludes census block group information if fewer than five devices visited an establishment in a month from a given census block group. We aggregated the mobility data to county level to estimate change of inter-county mobility in 2020. Aggregated and derived data are allowed to be shared publicly by SafeGraph. Seroprevalence data were published by the CDC (https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/commercial-lab-surveys.html). The line-list datasets are available at the CDC website (https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data/vbim-akqf and https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data-with-Ge/n8mc-b4w4). Source data are provided with this paper.

Code availability

Custom code supporting this study is available at GitHub (https://github.com/SenPei-CU/COVID_US_2020).

Change history

17 December 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41586-021-04172-0

References

WHO Coronavirus (COVID-19) Dashboard. World Health Organization https://covid19.who.int (2021).
Holshue, M. L. et al. First case of 2019 novel coronavirus in the United States. N. Engl. J. Med. 382, 929–936 (2020).
Article CAS Google Scholar
COVID Data Tracker. US Centers for Disease Control and Prevention https://covid.cdc.gov/covid-data-tracker (2021).
Li, R. et al. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2). Science 368, 489–493 (2020).
Article CAS ADS Google Scholar
Brazeau, N. et al. Report 34: COVID-19 Infection Fatality Ratio: Estimates From Seroprevalence, http://spiral.imperial.ac.uk/handle/10044/1/83545 (2020).
O’Driscoll, M. et al. Age-specific mortality and immunity patterns of SARS-CoV-2. Nature 590, 140–145 (2021).
Article ADS Google Scholar
Meyerowitz-Katz, G. & Merone, L. A systematic review and meta-analysis of published research data on COVID-19 infection fatality rates. Int. J. Infect. Dis. 101, 138–148 (2020).
Article CAS Google Scholar
Kalish, H. et al. Undiagnosed SARS-CoV-2 seropositivity during the first six months of the COVID-19 pandemic in the United States. Sci. Transl. Med. 13, abh3826 (2021).
Article Google Scholar
Pei, S., Kandula, S. & Shaman, J. Differential effects of intervention timing on COVID-19 spread in the United States. Sci. Adv. 6, eabd6370 (2020).
Article CAS ADS Google Scholar
Yamana, T., Pei, S., Kandula, S. & Shaman, J. Projection of COVID-19 cases and deaths in the US as individual states re-open May 4, 2020. Preprint at https://doi.org/10.1101/2020.05.04.20090670 (2020).
Buss, L. F. et al. Three-quarters attack rate of SARS-CoV-2 in the Brazilian Amazon during a largely unmitigated epidemic. Science 371, 288–292 (2021).
Article CAS ADS Google Scholar
Shioda, K. et al. Estimating the cumulative incidence of SARS-CoV-2 infection and the infection fatality ratio in light of waning antibodies. Epidemiology 32, 518–524 (2021).
Article Google Scholar
Estimated Disease Burden of COVID-19. Centers for Disease Control and Prevention https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/burden.html (2021).
COVID-19 Case Surveillance Public Use Data with Geography. Centers for Disease Control and Prevention https://data.cdc.gov/Case-Surveillance/COVID-19-Case-Surveillance-Public-Use-Data-with-Ge/n8mc-b4w4 (2021).
Goldstein, E. et al. Reconstructing influenza incidence by deconvolution of daily mortality time series. Proc. Natl Acad. Sci. USA 106, 21825–21829 (2009).
Article CAS ADS Google Scholar
RECOVERY Collaborative Group Dexamethasone in hospitalized patients with Covid-19. N. Engl. J. Med. 384, 693–704 (2021).
Article Google Scholar
Horwitz, L. I. et al. Trends in COVID-19 risk-adjusted mortality rates. J. Hosp. Med. 16, 90–92 (2021).
Article Google Scholar
Beigel, J. H. et al. Remdesivir for the treatment of Covid-19—final report. N. Engl. J. Med. 383, 1813–1826 (2020).
Article CAS Google Scholar
Lefrancq, N. et al. Evolution of outcomes for patients hospitalised during the first 9 months of the SARS-CoV-2 pandemic in France: a retrospective national surveillance data analysis. Lancet Reg. Health Eur. 5, 100087 (2021).
Article Google Scholar
Burden of influenza. Centers for Disease Control and Prevention https://www.cdc.gov/flu/about/burden/index.html (2020).
Riley, S. et al. Epidemiological characteristics of 2009 (H1N1) pandemic influenza based on paired sera from a longitudinal community cohort study. PLOS Med. 8, e1000442 (2011).
Article Google Scholar
Du, Z. et al. Pandemic fatigue impedes mitigation of COVID-19 in Hong Kong. Res. Sq. https://doi.org/10.21203/rs.3.rs-591241/v1 (2021).
Ma, Y., Pei, S., Shaman, J., Dubrow, R. & Chen, K. Role of meteorological factors in the transmission of SARS-CoV-2 in the United States. Nat. Commun. 12, 3602 (2021).
Article CAS ADS Google Scholar
To, K. K.-W. et al. Coronavirus disease 2019 (COVID-19) re-infection by a phylogenetically distinct severe acute respiratory syndrome coronavirus 2 strain confirmed by whole genome sequencing. Clin. Infect. Dis. ciaa1275 (2020).
Tillett, R. L. et al. Genomic evidence for reinfection with SARS-CoV-2: a case study. Lancet Infect. Dis. 21, 52–58 (2021).
Article CAS Google Scholar
Self, W. H. Decline in SARS-CoV-2 antibodies after mild infection among frontline health care personnel in a multistate hospital network—12 states, April–August 2020. Morb. Mortal. Wkly. Rep. 69, 1762–1766 (2020).
Article CAS Google Scholar
Choe, P. G. et al. Waning antibody responses in asymptomatic and symptomatic SARS-CoV-2 infection. 27, 327–329 (2020).
Google Scholar
Fiorentini, S. et al. First detection of SARS-CoV-2 spike protein N501 mutation in Italy in August, 2020. Lancet Infect. Dis. 21, s1473-3099 (2021).
Article Google Scholar
Rambaut, A. et al. Preliminary genomic characterization of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations. Virological https://virological.org/t/preliminary-genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-the-uk-defined-by-a-novel-set-of-spike-mutations/563 (2020).
Shaman, J. & Galanti, M. Will SARS-CoV-2 become endemic? Science 370, 527–529 (2020).
Article CAS Google Scholar

Download references

Acknowledgements

This study was supported by funding from the National Science Foundation (DMS-2027369) and a gift from the Morris-Singer Foundation. We thank SafeGraph for providing human mobility data and Columbia University Mailman School of Public Health for high-performance computing resources

Author information

Authors and Affiliations

Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, USA
Sen Pei, Teresa K. Yamana, Sasikiran Kandula, Marta Galanti & Jeffrey Shaman

Authors

Sen Pei
View author publications
You can also search for this author in PubMed Google Scholar
Teresa K. Yamana
View author publications
You can also search for this author in PubMed Google Scholar
Sasikiran Kandula
View author publications
You can also search for this author in PubMed Google Scholar
Marta Galanti
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Shaman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.P. and J.S. conceived the study; S.P., T.K.Y., S.K. and M.G. performed the analysis; and S.P. and J.S. drafted the manuscript. All authors revised and reviewed the manuscript.

Corresponding authors

Correspondence to Sen Pei or Jeffrey Shaman.

Ethics declarations

Competing interests

J.S. and Columbia University disclose partial ownership of SK Analytics. J.S. discloses consulting for BNI. All other authors declare no competing interests.

Additional information

Peer review information Nature thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Parameter inference for simulated outbreaks.

Results are shown for three major metropolitan areas – New York, Chicago, and Los Angeles. Outbreaks were generated for 60 days using four prescribed scenarios. National daily cases are shown in the top row. Parameter estimates for the last 10 days are not displayed as there is not enough data at the end of the time series to constrain the model. Solid and dashed lines show the median estimate and 95% CIs respectively.

Extended Data Fig. 2 Model fitting and inference results.

(a) Model fitting to daily case numbers (blue dots) in the US and five metropolitan areas. Solid and dashed lines show the median estimate and 95% CIs respectively. (b) Estimated daily ascertainment rates (left column) and transmission rates (right column) for five metropolitan areas. Solid and dashed lines show the median estimate and 95% CIs respectively. (c) Reliability plot for model calibration. Data points show the coverage of the 25%, 50%, 75% and 95% CIs of the posterior fitting at county and national levels. (d) The estimated national cumulative IFR in 2020. The cumulative IFR is computed using the estimated cumulative numbers of death (deconvolved) and infections prior to a given date.

Extended Data Fig. 3 Sensitivity analyses on inference results.

(a) Inference results using fixed parameters (Z, D, μ, θ) estimated from case data prior to April 2 2020. (b) Inference results from a modified version of the transmission model in which the relative infectiousness of undocumented infections, \(\mu \), is allowed to vary over time. Fitting to case data (top two rows), estimated monthly ascertainment rate (middle two rows) and population susceptibility (bottom two rows) are shown. Distributions are obtained from n = 100 ensemble members. In the top two and bottom two rows, the solid line represents the median, and the dash lines show 95% CIs. In the middle two rows, centre and box bounds represent the median, 25th, and 75th percentiles, and whiskers show 2.5th and 97.5th percentiles.

Extended Data Fig. 4 Inference results from a modified version of the transmission model permitting movement of documented infections among counties.

(a) 25% of documented infections are allowed to move among counties. (b) 50% of documented infections are allowed to move among counties. Fitting to case data (top two rows), estimated monthly ascertainment rate (middle two rows) and population susceptibility (bottom two rows) are shown. Distributions are obtained from n = 100 ensemble members. In the top two and bottom two rows, the solid line represents the median, and the dash lines show 95% CIs. In the middle two rows, the centre and box bounds represent the median, 25th, and 75th percentiles, and the whiskers show 2.5th and 97.5th percentiles.

Extended Data Fig. 5 The reported (black) and adjusted (red) seroprevalence.

(a) Results for the 10-site study. (b) and (c) show the results for state-level serological surveys obtained using a maximum monthly attenuation rate of 17.5% and 15%, respectively. Dots and whiskers show the median and 95% CIs respectively. Distributions are obtained from n = 1,000 simulated seroprevalence samples.

Extended Data Fig. 6 Validation of inference using seroprevalence data.

(a) – (b) Comparison between the inferred percentage of cumulative infections and seroprevalence at the state level adjusted for antibody waning. Seroprevalence data adjusted using a maximum monthly attenuation rate of 17.5% (a) and 15% (b) are included in the analysis. (c) – (d) Comparison between the model-generated seroprevalence and observed seroprevalence in 10 locations (c) and at the state level (d). (e) – (f) Comparison between the inferred percentage of cumulative infections and seroprevalence in 10 locations (e) and at the state level (f) adjusted for antibody waning using lower sensitivity and specificity. Distributions are obtained from n = 100 ensemble members. Centre and whiskers show median and 95% CIs. Color indicates the sample collection date for each location.

Extended Data Fig. 7 Inference results in the US and five metropolitan areas.

(a) Estimated monthly total infections (blue bars) and confirmed cases (orange bars) in the US and five metropolitan areas. Distributions are obtained from n = 100 ensemble members. The blue bars show medians and whiskers show 95% CIs. (b) Daily confirmed cases (blue line, 7-day moving average) and estimated prevalence of contagious infections (red line, median and 95% CIs) for the US and five metropolitan areas.

Extended Data Fig. 8 Key statistics obtained from line-list data for the US and five metropolitan areas.

(a) – (b) The crude monthly CFR (a) and HFR (b) obtained from line-list data for the US and five metropolitan areas. Note that due to incomplete reporting of deaths in the line-list data, these estimates are likely low. (c) – (d) The proportion of confirmed cases (c) and hospitalizations (d) in four age groups (0-17, 18-49, 50-64, 65+) in the line-list data. Data are shown monthly for the US and five metropolitan areas.

Extended Data Fig. 9 Estimation of the time-to-event distribution from case confirmation to death for Maricopa County AZ (a) and Miami-Dade County FL (c).

Deconvolution of daily deaths using the estimated delay distributions for Maricopa County AZ (b) and Miami-Dade County FL (d).

Extended Data Fig. 10 Weekly change of R_t in response to the change of weekly cases per 100,000 people at county level.

The analysis was performed for five US regions (Northeast, Southeast, Midwest, Southwest, West) during the spring (Feb 21 – May 31), summer (Jun 1 – Sep 15), and fall/winter (Sep 16 – Dec 31) waves. In the five US regions, 116, 162, 126, 45 and 54 counties that reported cumulative cases over 100 per 100,000 people during all three waves and had a population over 100,000 were included in the analysis. A positive/negative change of weekly cases in the x-axis indicates increasing/decreasing community prevalence of COVID-19. The dash lines are the linear fits. The statistical significance of the slope is indicated by asterisks (two-sided t-test. ***: p < 10⁻⁵, **: p < 0.001, *: p < 0.05; NS: not significant. P-values are reported in the legends).

Supplementary information

Supplementary Information

This file contains Supplementary Sections 1–8 including Supplementary Text and Data and Table 1.

Reporting Summary

Peer Review File

Source data

Source Data Fig. 1

Source Data Fig. 2

Source Data Fig. 3

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pei, S., Yamana, T.K., Kandula, S. et al. Burden and characteristics of COVID-19 in the United States during 2020. Nature 598, 338–341 (2021). https://doi.org/10.1038/s41586-021-03914-4

Download citation

Received: 15 February 2021
Accepted: 13 August 2021
Published: 26 August 2021
Issue Date: 14 October 2021
DOI: https://doi.org/10.1038/s41586-021-03914-4

This article is cited by

A response playbook for early detection and population surveillance of new SARS-CoV-2 variants in a regional public health laboratory
- Hannah J. Barbian
- Alyse Kittner
- Mary K. Hayden
BMC Public Health (2024)
Community transmission of SARS-CoV-2 during the Delta wave in New York City
- Katherine Dai
- Steffen Foerster
- Sen Pei
BMC Infectious Diseases (2023)
COVID-19 severity scale for claims data research
- Trudy Millard Krause
- Raymond Greenberg
- Caroline Schaefer
BMC Health Services Research (2023)
Quantifying the spatial spillover effects of non-pharmaceutical interventions on pandemic risk
- Keli Wang
- Xiaoyi Han
- Yu Liu
International Journal of Health Geographics (2023)
Modeling community COVID-19 transmission risk associated with U.S. universities
- J. A. Uelmen
- H. Kopsco
- R. L. Smith
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.