Introduction

Accurate real-time surveillance of disease spread is essential for effective pandemic response and for the allocation of scarce healthcare resources1,2. During the current COVID-19 pandemic, population-level surveillance has relied primarily on aggregated results from individual laboratory testing3. While laboratory testing remains the primary method for diagnosing individuals, using laboratory testing for population-level surveillance has its limitations—it is difficult to achieve population-level coverage due to delays in development and scaling of de novo laboratory testing methodologies during the crucial early stages of an emergent pandemic4,5,6. These limitations present an opportunity for exploring additional approaches to population-level surveillance that could serve as complementary information sources during the critical early stages of a pandemic7.

Aggregated data on Internet search volumes are freely available in near-real-time and at population-scale in areas with sufficient Internet penetration8. These Internet search patterns have been used to track a wide range of health phenomena, including influenza9, MERS10, measles11, abortion12 and immunization compliance13, and are a potential complementary source of information for population-level surveillance of pandemic spread. Previous studies of these data have yielded valuable lessons in their appropriate use, including avoiding non-specific search terms, ensuring that analyses are straightforward and transparent, and avoiding complex models that may increase the risks of overfitting. A number of recent studies have examined whether Internet search data could be useful for tracking the spread of COVID-19 at a population level. These studies have found correlations between specific Internet search terms and reported COVID-19 cases14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29. A detailed review of these studies is provided in the Discussion section below.

Unlike these recent studies which have focused on using Internet search data for tracking pandemic spread, in this study we examine whether Internet search data can be used for another important purpose: characterizing the clinical course of symptoms in affected individuals, especially during the early stages of an emergent pandemic. Characterizing the expected clinical course of a disease can help clinicians understand where an individual patient is in his or her expected disease course, and what can be expected for that patient in the future. For example, in the case of COVID-19, it is important to know when dyspnea (shortness of breath) is expected to occur for patients, so that appropriate planning of clinical care and resources can take place. Understanding the expected sequence of symptoms at a population level could also help public health officials track the different stages of the onset of a pandemic as it takes hold in a new region. It would therefore be beneficial for pandemic tracking and clinical planning if disease progression of novel pandemics could be studied in a timely manner and at population scales.

A number of recent clinical case studies have analyzed the day-by-day clinical course for COVID-19 patients, as detailed in the Discussion section below. These studies have reported a time lag between the onset of initial symptoms and the appearance of dyspnea (shortness of breath) of around 5 days30,31,32,33,34. These studies relied on a limited number of patients in a hospital setting, and were published weeks and months after the initial spread of the pandemic.

Unlike hospital-based clinical studies, analyses of Internet search data could potentially provide information on disease progression across a much larger population of patients, not only those seen within hospital settings. In this study, we conducted a systematic study across 32 countries on six continents to determine whether Internet search patterns can be useful as a complementary data source for understanding the clinical progression of COVID-19, especially during the critical early stages of the pandemic.

Results

Internet search volume and COVID-19 cases and deaths

We examined a few example countries in which the pandemic peaked at different times. Figure 1 shows search volumes for the terms “fever” and “dry cough”, alongside reported COVID-19 cases and deaths for China, Iran, Italy, United States and India. Even though outbreaks occurred at different times in each country, the temporal relationships between the search terms and reported COVID-19 cases and deaths remained similar across countries.

Fig. 1: Search volumes in different countries.
figure 1

Search volumes (purple) for the terms “fever” (left) and “dry cough” (right), alongside reported COVID-19 cases (cyan) and deaths (orange) for China, Iran, Italy, US and India. Even though outbreaks occur at different times in different countries, the relationships between the search terms and reported COVID-19 cases and deaths remain similar across countries. To highlight the temporal relationships between the curves, the magnitude of each curve was independently normalized to fit the vertical dimensions of the plot.

Expanding this analysis to all 32 countries, Fig. 2a shows the lags between search volumes for the term “fever” and COVID-19-related deaths for each of the 32 countries, along with a histogram showing the distribution of these lags. Figure 2b shows the lags between search volumes for the term “dry cough” and COVID-19-related deaths, along with the distribution of the lags.

Fig. 2: Search lags in different countries.
figure 2

a Lags between searches for “fever” and reported COVID-19 deaths across 32 countries, with a histogram showing the distribution of these lags. Each country is labeled with its ISO Alpha-2 country code. b The same plots shown for searches for “dry cough”.

Table 1 shows summary statistics over all 32 countries for the lags between each of the ten search terms and COVID-19 cases and deaths, including mean and 95% confidence intervals. Searches for “fever” preceded increases in reported COVID-19 cases and deaths by an average of 18.53 days [95% CI 15.98–21.08] and 22.16 days [95% CI 20.33–23.99], respectively. For “cough” these times were 18.34 days [95% CI 16.05–20.64] and 21.88 days [95% CI 20.19–23.56], respectively. The complete list of lags between each search term and COVID-19 deaths for each individual country is provided in Supplementary Table 1.

Table 1 Mean lags between search volumes for specific search terms and reported COVID-19 cases and deaths across 32 countries.

The average lags between searches and reported cases were shorter than those between searches and reported deaths, as cases are typically diagnosed and reported before deaths. We found that the inter-country variability of the average lags between searches and reported cases was greater (larger confidence intervals) than that for reported deaths, likely because case reporting is more dependent on local testing capacity and rates. We also found that the general term “coronavirus” has the greatest variability in its lags to reported cases and deaths (largest confidence intervals), compared to other symptom-specific terms, as would be expected for a more general search term.

Internet search volume and COVID-19 clinical course of illness

We examined examples from individual countries, this time looking at the progression of symptom-related search terms over time. Figure 3 shows search volumes for a range of symptom-related search terms plotted over time for France (Fig. 3a) and Mexico (Fig. 3b). For each country, searches for “shortness of breath” appear a few days after searches for other symptoms such as “fever”, “cough”, and “dry cough”.

Fig. 3: Clinical course of symptoms in individual countries.
figure 3

Clinical Course of COVID-19 symptoms in (a) France and (b) Mexico. Search volumes for the terms “coronavirus symptoms”, “fever”, “cough”, “dry cough”, and “shortness of breath” (black) are shown alongside COVID-19 cases (dashed cyan line) and deaths (dashed orange line). Initial symptoms appear clustered together in time, with searches for shortness of breath appearing a few days later. To highlight the temporal relationships between curves, the magnitude of each curve was independently normalized to fit the vertical dimensions of the plot.

Expanding this analysis to all 32 countries, we calculated ensemble average curves for each search term across all 32 countries. Figure 4a shows the ensemble average search volumes for “fever”, “cough”, “dry cough”, and “shortness of breath”, indexed by searches for “coronavirus symptoms”, alongside reported COVID-19 cases and deaths. Figure 4b shows this same analysis for additional search terms “sore throat”, “runny nose”, “chills”, and “coronavirus test”, also indexed by searches for “coronavirus symptoms”.

Fig. 4: Average clinical course of symptoms across 32 countries.
figure 4

Ensemble average clinical course of COVID-19 symptoms across 32 countries, as seen through Internet search volumes of search terms related to COVID-19 symptoms. a Search volumes for the terms “fever”, “cough”, “dry cough”, “shortness of breath” (black), indexed by searches for “coronavirus symptoms”, shown alongside COVID-19 cases (dashed line cyan) and deaths (dashed line orange). b Search volumes for the terms “sore throat”, “runny nose”, “chills”, and “coronavirus test”, alongside “shortness of breath” (black), indexed by searches for “coronavirus symptoms”, shown alongside COVID-19 cases (dashed line cyan) and deaths (dashed line orange).

The clinical progression that emerges from these data presents the following picture: As the pandemic begins to take hold in a country, people search for “coronavirus symptoms” and “coronavirus test”, followed by initial symptoms “fever”, “cough”, “runny nose”, “sore throat” and “chills”, followed by searches for “shortness of breath” approximately 5 days after the search for initial symptoms.

Table 2 shows the lags between different definitions for initial symptom onset and searches for “shortness of breath”. The average lag between the searches for “fever” and “shortness of breath” was 5.22 days [95% CI 3.30–7.14]. For “cough” it was 5.16 days [95% CI 3.13–7.18]. These lags, as well as the lags deriving from other symptom onset definitions, are all around 5 days, matching the clinical course of the disease reported in the literature30,31,32,33,34.

Table 2 Different search-term-based definitions for symptom onset were examined by looking at different combinations of early-peaking search terms.

The clinical progressions above are based on a 32-country ensemble average. While there is greater variability at the individual country level, the general temporal order of COVID-19 symptom progression remains fairly consistent across individual countries, with few exceptions. For example, only two of the 32 countries had searches for “coronavirus symptoms” peak after searches for “shortness of breath”, only 3 out of 32 countries had searches for “cough” peak after searches for “shortness of breath”, and only 4 out of 32 countries had searches for “fever” peak after searches for “shortness of breath” (Supplementary Table 2).

Discussion

In this systematic analysis of Internet search data from 32 countries across six continents, we found that increases in symptom-related Internet searches preceded increases in reported COVID-19 cases and deaths by approximately 2–3 weeks. Furthermore, we found that by analyzing data from all 32 countries, the temporal relationships between the different symptom-related search terms reflected the clinical progression of the symptoms of COVID-19 reported in the medical literature.

To the best of the authors’ knowledge, this study is the first to conduct a detailed investigation of multiple COVID-19 symptom-related search terms across such a large number of countries and languages. It is also the first to conduct a detailed analysis of the temporal relationships between different symptom-related searches in order to determine whether these data could be useful in understanding the clinical course of illness for a disease. During emergent pandemics, this detailed level of information can support health officials in tracking pandemic spread and planning clinical care and resources.

A number of clinical case studies have analyzed the day-by-day clinical course for COVID-19 patients: A study of 138 patients in Wuhan, China found that common initial symptoms include fever and dry cough, followed by dyspnea a median of 5.0 days later30. A meta-review found that the average length from illness onset to dyspnea was 4.99 days, based on evidence from 179 COVID-19 patients31. A study in the US found that dyspnea typically set in between the fourth and eighth day following initial symptoms32. A study of 41 patients in Wuhan, China found that the median duration from illness onset to dyspnea was 8.0 days33. The CDC’s Interim Clinical Guidance for Management of Patients with Confirmed Coronavirus Disease (COVID-19) states “Among patients who developed severe disease, the medium time to dyspnea ranged from 5 to 8 days”34.

A number of recent studies have analyzed Internet search data related to the COVID-19 pandemic. Most of these studies examined data from a single country or from a small number of countries: Some studies examined only the search term “coronavirus”35,36,37,38. In this study we found that general non-symptom-specific search terms such as “coronavirus” have a greater variability in their relation to reported cases and deaths, likely due to the fact that individuals seeking general information on the pandemic may search for “coronavirus” even if they are not experiencing specific symptoms themselves at that time. Other studies looked at additional search terms such as “handwashing”, “face masks”39,40, “quarantine”, “hand disinfection”14, “SARS”, “MERS”15, “antiseptic”, and “sanitizer”16, but did not include specific symptom-related search terms.

Another group of studies did examine symptom-related search terms, including: “COVID pneumonia” and “COVID heart” in the US17; “COVID” and “pneumonia” in China18; “worry”, “hysteria” and other mental health symptoms in the US19; “diarrhea”, “nausea”, “vomiting”, “abdominal pain”, “fever” and “cough” in the US20; “can’t smell” “ear pain”, “sinus pain”, “voice pain”, “ears ringing”. Among other terms in the US21; “ageusia”, “abdominal pain”, “loss of appetite”, “anorexia”, “diarrhea”, and “vomiting” in the US22; “chest pain,” “myocardial infarction,” “cough,” and “fever” in four countries23; “Loss of sense of smell”, “Sense of smell”, “Loss of sense of taste”, and “Sense of taste” across multiple regions in five countries24; and “smell”, “loss of smell”, “anosmia”, “hyposmia”, “olfaction”, “taste”, “loss of taste”, and “dysgeusia” in eight countries25. While these studies investigated a range of important questions, including the effects of COVID-19 on mental health19, gastrointestinal health20,22, otolaryngological health21, and coronary-related conditions23, they did not study whether the temporal relationships between these search terms could be useful in understanding the clinical progression of the disease.

Some previous studies have attempted to analyze information on temporal progression in online data sources. For example, Liu et al. analyzed Twitter data to resolve different stages of behavior relating to alcohol consumption (alcohol seeking, alcohol consuming, post-consumption reflection)26. Wu et al. studied seasonality in global public interest in psoriasis by analyzing seasonality in Internet search trends27. To the best of the authors’ knowledge, no previous studies have attempted to reconstruct the clinical progression of a disease based on Internet search data.

The use of Internet search data is subject to a number of important limitations9,28,29. Internet infrastructure and digital access levels differ across countries and communities. Even though digital access rates are generally increasing worldwide, many developing countries currently lack sufficient search volumes to support search-based tracking. Search data may be subject to demographic, socio-economic, geographic, or other biases inherent in the local digital divide8,41,42. In each country, the population of individuals who perform Internet searches may have different characteristics than those who do not, and the results inferred from Internet searching behaviors may not generalize to other populations.

Furthermore, the motivation behind each individual user’s search is not known: Changes in search volumes for symptom-related terms such as “fever” can result not only from increases in COVID-19 cases, but also from general curiosity about the pandemic, the occurrence of other diseases (e.g., influenza, Lassa fever43), news coverage, or other factors. Another important limitation of this study is that there is no definitive gold standard with which to compare the data, as individualized laboratory testing was very limited in the early phases of the pandemic. In addition, while we attempted to work with native speakers to help us with the translations, ultimately there were a few countries for which we had to rely on automated translations, which may not have yielded the optimal search terms.

In this study, we used specific symptom-related search terms and examined data from a geographically diverse set of 32 counties across six continents and multiple languages. In recognition of the inherent variability of the data at the individual country level, we performed a global analysis combining data from all 32 countries. Despite the inherent variability of country-specific search data, our results show that the temporal relationships between the symptom-specific Internet search terms and COVID-19-related cases and deaths remained generally consistent between countries.

The goal of this study is to examine whether Internet search patterns can be useful as a complementary data source for understanding the clinical course of a disease in the early stages of a novel pandemic, when the clinical course of the disease is not yet fully characterized. In such an operational real-time setting, when a novel pandemic emerges, public health authorities could analyze Internet search patterns relating to the reported symptoms of the pandemic in affected countries. Since not all symptoms of an emerging disease may be known in the early stages of a novel pandemic, a systematic scan over a range of possible symptoms could help identify candidate symptoms whose search volumes are trending upwards. If multiple countries display a similar progression in search trends for the same symptoms, this progression of symptoms can help to inform public health officials’ understanding of the clinical course of the disease.

An understanding of the clinical progression of symptoms in the early stages of a pandemic could help hospitals and healthcare workers deliver better care to patients. For example, clinicians would be better able to anticipate future symptoms that a patient might experience, as well as when such symptoms are expected to occur. This would also help with capacity and resource planning for hospital beds and necessary equipment such as ventilators.

The results of this study show that Internet search trends are able to reflect the clinical course of symptoms during the early stages of a pandemic. However, the relationships between search terms and illness may not hold during later waves of the pandemic, as the information-seeking behaviors of members of the public may change as they become better educated about the pandemic over time. The study of these patterns during later phases of a pandemic are a topic for future research.

Future work also includes applying these approaches to other diseases, as well as combining Internet search data with additional data sources such as local testing rates, public health measures, news reports, climatological and air quality variables, among others.

The ability of search data to not only help track pandemic spread, but also to reveal the clinical course of symptoms in emergent pandemics is significant. Given the limitations of laboratory testing for population-level surveillance, search data can be a valuable complementary source for population-scale tracking of pandemics in real time, particularly during the early stages of a pandemic when local testing is not yet available at scale, and can help guide public health response.

Methods

Data acquisition

We selected a diverse set of 32 countries from six continents (Table 3), in which sufficient search data volumes were available for the search terms of interest. We obtained data on reported COVID-19 cases and deaths for each of these countries from a publicly available dataset maintained by the Center for Systems Science and Engineering at Johns Hopkins University44.

Table 3 Countries included in the study, categorized by continent.

We collected daily relative search volume (RSV) data on a per-country basis for the period of January 1, 2020 through April 20, 2020, from Google Trends (Website. https://trends.google.com/trends/explore) using the pytrends API45. We used pytrends’ “Interest Over Time” API method, which returns aggregated and normalized RSV data from Google Trends’ “Interest Over Time” section. Google Trends normalizes the data by dividing each data point by the total searches in the geography and time range it represents in order to allow for comparisons of relative search-term popularity. These numbers are then scaled on a range of 0 to 100. Since Google Trends normalizes the search data and provides only the relative popularity of a search term over time, it was not necessary to include additional control search-terms in the study. As Google has limited availability in China, we also accessed search trend data from the search engine Weibo (Weibo Search Trends. https://data.weibo.com/index).

We accessed data for the following common symptoms of COVID-19: “fever”, “cough”, “dry cough”, “chills”, “sore throat”, “runny nose”, and “shortness of breath”, as well as the more general terms “coronavirus”, “coronavirus symptoms” and “coronavirus test”. Each search term was searched as an “exact phrase”. We also considered other less common symptoms such as loss of smell and loss of taste, but the search volumes on those terms were too sparse for many countries. As daily search data are inherently noisy, all search data were smoothed with a 7-day moving average24.

For translations of the English search terms into local languages, we recruited native speakers to translate each term into the following languages: Arabic, Mandarin Chinese, Dutch, French, German, Italian, Persian, Polish, Portuguese, and Spanish. These languages covered almost all of the countries included in the study. For languages in which we were unable to recruit native speakers for translations, we used Google Translate (https://translate.google.com/) and confirmed that sufficient data were available for these translated search terms on Google Trends (Website. https://trends.google.com/trends/explore). A complete table of search terms and translations for each country is provided in Supplementary Table 2.

While both “COVID” and “coronavirus” are commonly used in the United States, detailed consultations with native speakers in different countries revealed that in the majority of countries surveyed, members of the public did not commonly use the terms “COVID” or “COVID-19”, but rather only terms relating to “coronavirus”. Therefore, for reasons of standardization in this international study, we used terms relating to the word “coronavirus” in all countries.

Data analysis: pandemic spread

We conducted temporal correlation studies to study the relationships between Internet search data and reported COVID-19 cases and deaths. For each country and search term, we calculated the Pearson correlation coefficient between the time series of search volumes for that search term and the time series of COVID-19 cases. We then shifted the search term data by a variable lag, and identified the lag that yielded the highest correlation. We computed the mean of these optimal lags for each search term across all countries. We then repeated these analyses substituting reported COVID-19 deaths for reported COVID-19 cases.

Data analysis: clinical course of illness

We investigated whether Internet search data could be used to characterize the clinical course of COVID-19 symptoms over time. “Coronavirus symptoms” was selected as the index search term since it peaked first relative to the other search terms in 22 out of 32 countries. For each country, the date of peak search volume for the index search term was defined as the index date, and the dates for all other search data for that country were defined in relation to that date (Day 0, Day 1, Day 2, etc.). With the data from all countries aligned by the index date, cross-country ensemble average curves were calculated for each search term. (e.g., Day 1 values for “fever” searches in each of the 32 countries were averaged together to calculate the Day 1 value of the ensemble average “fever” curve. The same for Day 2, etc.) The ensemble average curves for all search terms were then overlaid on one plot, providing a search-data-based view of the clinical course of illness.

Since studies in the medical literature report the number of days between initial symptom onset and shortness of breath (dyspnea)30,31,32,33,34, we examined a range of possible search-term-based definitions for initial symptom onset, based on various combinations of the earliest-peaking search terms “fever”, “cough”, “coronavirus symptoms”, and “coronavirus test” (Table 2).

Reporting summary

Further information on experimental design is available in the Nature Research Reporting Summary linked to this paper.