Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain
the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in
Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles
and JavaScript.
Mathematicians and public-health experts watched through their fingers in May as British Prime Minister Boris Johnson unveiled a series of charts to explain how the government would guide Britain out of coronavirus lockdown. Perhaps most prominent was a colourful dial with a needle hovering near a single digit: 1.
The dial indicated R, a now-totemic figure in the COVID-19 pandemic. The nation, said Johnson, would set a COVID-19 alert level, to be "primarily determined" by the number of coronavirus cases, and by R, the reproduction number.
To infectious-disease experts, Johnson’s focus on the reproduction number as a guiding light for policy was worryingly myopic. They worry about placing too much weight on R, the average number of people each person with a disease goes on to infect.
In this pandemic, R has leapt from the pages of academic journals into regular discussions by politicians and newspapers, framed as a number that will shape everyone’s lives. As Germany’s chancellor, Angela Merkel, explained in a widely viewed video this April, an R above one means an outbreak is growing, and below one means that it is shrinking. In many countries, it is publicly reported every week. In June, epidemiologists at the Harvard T.H. Chan School of Public Health in Boston, Massachusetts, announced a website where anyone can look up the value for any country — and for many smaller regions — in the world.
But fascination might have turned into unhealthy political and media fixation, say disease experts. R is an imprecise estimate that rests on assumptions, says Jeremy Rossman, a virologist at the University of Kent, UK. It doesn’t capture the current status of an epidemic and can spike up and down when case numbers are low. It is also an average for a population and therefore can hide local variation. Too much attention to it could obscure the importance of other measures, such as trends in numbers of new infections, deaths and hospital admissions, and cohort surveys to see how many people in a population currently have the disease, or have already had it.
“Epidemiologists are quite keen on downplaying R, but the politicians seem to have embraced it with enthusiasm,” says Mark Woolhouse, an infectious-diseases expert at the University of Edinburgh in the United Kingdom, who is a member of a modelling group that advises the British government on the pandemic. “We’re concerned that we’ve created a monster. R does not tell us what we need to know to manage this.”
Many policymakers understand this: no one else has linked R so tightly and explicitly to public policy as Johnson did, Rossman says. And despite the coloured-dial chart, it’s not clear how much R is actually driving UK policy. In the weeks after Johnson’s announcement, the government didn’t reference R when it took measures to ease restrictions or lowered the national alert level. (It did not respond to requests for comment for this article.)
But researchers remain concerned that R is looming too large, and is being used for purposes for which it was never intended. “It’s not yet clear what actions they are or are not taking on the back of R. But we are concerned because they’re giving it such prominence,” says Woolhouse.
The origins ofR
First used almost a century ago in demography, R originally measured the reproduction of people — whether a population was growing or not. In epidemiology, the same principle applies, but it measures the spread of infection in a population. If R is two, two infected people will, on average, infect four others, who will infect eight others, and so on. The measure allows modellers to work out the extent of the spread, but not the speed at which the infection grows.
Unless they regularly test an entire country’s population, epidemiologists can’t measure R directly. So it is usually estimated retrospectively: disease modellers look at current and previous numbers of cases and deaths, make some assumptions to find infection numbers that could have explained the trend and then derive R from these.
One variant of R, R0, assumes that everybody in a population is susceptible to infection. That usually isn’t true, but might be when a new virus, such as SARS-CoV-2, emerges. At the start of the epidemic, assessing R0 (and other metrics) was crucial for epidemiologists building models of how the disease might spread. But when politicians and scientists talk about R, they usually mean another variant called Rt (sometimes called Re, or ‘effective R’), which is calculated over time as an outbreak progresses and considers how some people might have gained immunity, perhaps because they have survived infection or been vaccinated.
Rt and R0 both vary with the social dynamics of a population: even an easily transmitted virus will have trouble spreading in a region where people rarely meet. In January, the COVID-19 R0 in Wuhan, China, was calculated to be between two and three; after lockdown, estimates put the Rt there at just over one1.
A lagging indicator
Working out Rt involves trade-offs and compromise. Confirmed cases and mortality figures can be used to infer the total number of infections, but both come with a significant lag — which scientists estimate could be anything from a week to three weeks or more. “If you have your Rt estimate lagging by at least ten days, possibly two weeks, then it’s not going to be that useful as a real-time decision-making tool,” says Gabriel Leung, a public-health scientist at the University of Hong Kong.
With a mathematical trick called nowcasting, researchers can use the observed statistical distribution of reporting delays to predict how much higher the number of fresh infections will be in, for example, two weeks. Some estimates of Rt already rely on nowcasting infection data in this way: it is "the method with the least guesses", says Lars Schaade, vice-president of the Robert Koch Institute in Berlin, Germany’s main public-health agency, which reports a daily and seven-day Rt value based on infections reported by state health authorities.
Nowcasting infections on the basis of trends in past COVID-19 cases is tricky enough, but mortality data typically come with a longer lag, because of the extra time someone has the disease before they succumb to it and because of the paperwork involved in registering deaths, which can take weeks or months to file. A group led by Sheila Bird at the University of Cambridge, UK, publishes nowcast data of COVID-19 deaths in English hospitals. But they cannot yet do the same with a separate data set of deaths compiled by the Office of National Statistics (ONS) because the researchers don’t have access to the necessary data on registration delays: the time difference between when a death occurred and when the ONS reported it.
Extra uncertainty
An issue with nowcasting is that it swaps one problem for another, says Sebastian Funk, a disease modeller at the London School of Hygiene and Tropical Medicine, who is also advising the British government on this pandemic. “You can try to do that, but for obvious reasons it always comes with uncertainty. There’s no way that you can know how many cases would still be observed that have already been infected,” he says.
Other data on the pandemic’s progress can feed into estimates of Rt by serving as proxies for infections and social behaviour. One is hospital and intensive-care admissions. Another is results from random testing of a population to see how many people currently have COVID-19, or have had it. Researchers also conduct contact surveys, which ask people who they mix with, and can be used to infer changes in R on the basis of estimates of how many others an infected person could meet, although these are time-consuming and could cover only small groups of people. Contact surveys in China showed daily contacts were reduced by seven- to eightfold during the COVID-19 social-distancing period, when most interactions were restricted to the household2. Another way to observe trends in people’s movements is to use location data based on the signals from mobile phones, published by Facebook and Google.
“There’s a bit of a trade-off here,” says Funk. “There are some methods that are more immediate but not epidemiological, and there are others that are more directly epidemiological but at the same time more out of date.”
Groups of epidemiologists, Funk says, each have their own approach to combining and using these disparate sources of data to work out Rt, relying on their own statistical models to look at trends in presumed infections. To calculate the official Rt of the United Kingdom, about ten groups present the results of their models to a dedicated government committee, which reaches consensus on a possible range. The figures are presented in that range (currently 0.7—0.9), showing how uncertain the estimates are, but the individual models are not released.
Unofficial estimates
Those ‘official’ Rt numbers are not the only versions available. Academic researchers have taken advantage of infection and mortality figures collated by the World Health Organization and independent groups such as the Coronavirus Resource Centre at Johns Hopkins University in Baltimore, Maryland, to publish Rt figures for numerous countries and states. In late April, for example, public-health researchers in Colombia claimed that the Rt for the first ten days of the pandemic was above two in seven Latin American countries3. The Harvard researchers’ website currently estimates that Rt is above one in more than 30 US states (see ‘Fall and rise: Rt in the United States’).