To the Editor — In December 2019, a novel coronavirus was isolated, after a cluster of patients in China were diagnosed with pneumonia of unknown cause1. This new isolate was named ‘SARS-CoV-2’ and is the cause of the disease COVID-19. The virus has led to an ongoing outbreak and an unprecedented international health crisis. The number of infected people is rapidly increasing globally2 and most probably is a vast underestimation of the real number of patients worldwide, as infected people are contagious even when minimally symptomatic or asymptomatic3. The spread of the disease has presented an extreme challenge to the international community, and policy-makers from different countries have each chosen different strategies, depending on the local spread of the virus, healthcare-system resources, economic and political factors, public adherence, and their perception of the situation.
Coronavirus infection spreads in clusters, and early identification of these clusters is critical for slowing down the spread of the virus. Here we propose that daily population-wide surveys that assess the development of symptoms caused by the virus could serve as a strategic and valuable tool for identifying such clusters and informing epidemiologists, public-health officials and policymakers. We show preliminary results from an Israeli survey of a cumulative number of over 74,000 responses and call for additional countries to join an international consortium to extend this concept in order to develop predictive models. We expect such data will allow the following: faster detection of spreading zones and patients; acquisition of a current snapshot of the number of people in each area who have developed symptoms; prediction of future spreading zones several days before an outbreak occurs; and evaluation of the effectiveness of the various social-distancing measures taken and their contribution to reducing the number of symptomatic people. This information could provide a valuable tool for decision-makers in those areas in which strengthening of social-distancing measures is needed and those in which such measures can be relieved. Preliminary analysis shows that in neighborhoods with a confirmed patient history of COVID-19, more people report experiencing COVID-19-associated symptoms, which demonstrates the potential utility of our approach for the detection of outbreaks. Researchers from other countries, including the USA, UK, Canada, Switzerland, Germany, and several others are working on similar projects, such as the COVID Symptom Tracker in the UK4. We call with urgency for other countries to join our international consortium5, and to share methods and data collected from these daily, simple, one-minute surveys.
In Israel, the first infection of COVID-19 was confirmed on 21 February 2020, and in response, the Israeli Ministry of Health (MOH) instructed people who returned to Israel from specific countries in which COVID-19 was spreading to go into a 14-day home isolation. Since then, Israel has gradually imposed several additional measures (Extended Data Fig. 1): on 9 March, the 14-day home isolation was extended to people arriving from anywhere of international origin, and those who were in close contact with a patient with confirmed COVID-19 were instructed similarly. Symptomatic people were instructed to stay home for 2 days after symptom resolution6. On 11 March, gatherings were limited to a maximum of 100 people; this was further restricted to 10 people on 15 March. On 19 March, a national state of emergency was declared in the country. On 20 March, the first death of an Israeli citizen from COVID-19 occurred.
One of the main challenges of the current pandemic so far has been disease detection and diagnosis. Although the gold standard for the diagnosis of COVID-19 is detection of the virus by a real-time PCR testing7, current resource and policy limitations in many countries restrict the amount of testing that can be performed. The number of tests per day is increasing; however, not enough tests are being performed to provide a nationwide view of the spread of the virus, particularly as the Israeli MOH guidelines are to test only people who were in close contact with a person with confirmed COVID-19.
To obtain a real-time nationwide view of symptoms across the entire population, and since testing the entire population is not feasible, we developed a simple one-minute online questionnaire aimed at early and temporal detection of geographic clusters in which the virus is spreading. The survey was posted online (https://coronaisrael.org/) on 14 March, and participants were asked to fill it out on a daily basis and separately for each family member, including members who are unable to fill it out independently (e.g., children and older people). So that potential privacy issues that might occur can be avoided, our survey is filled out anonymously, and access to the data is restricted to only study investigators.
The survey contains questions on age, sex, geographic location (city and street), isolation status and smoking habits. Participants also report whether they are experiencing symptoms commonly described in patients with COVID-19 by healthcare professionals, on the basis of the existing literature8. Several other symptoms that are less common in patients with COVID-19 but are more common in other infectious diseases are also included to better identify possible patients with COVID-19. The initial symptoms included cough, fatigue, myalgia (muscle pain), shortness of breath, rhinorrhea or nasal congestion, diarrhea and nausea or vomiting. Additional symptoms, including type of cough (with or without sputum), sore throat, headache, chills, confusion and loss of taste and/or smell sensation, were added in a later version. Participants also report about existing chronic health conditions and are asked to report their daily body temperature (Extended Data Fig. 2 presents the most recent version of the survey).
Given that reports on the clinical characteristics of patients with COVID-19 are only starting to emerge, we defined an initial basic measure we called the ‘symptoms ratio’ using symptoms that were predefined by the Israeli MOH and are commonly reported by patients with COVID-198. Symptoms assessed were shortness of breath, fatigue, cough, muscle pains and fever (body temperature above 38 °C). For participants younger than 18 years of age, nausea and/or vomiting was also included in the ratio calculation. For each participant, the symptoms ratio was calculated as the number of reported symptoms divided by the number of symptoms in our predefined list (number of reported symptoms / 6, for participants 18 years of age or less; number of reported symptoms / 5, for participants over 18 years of age). We plan to refine this list of symptoms as more clinical information is accrued. By associating participants with an area corresponding to their address, we created a color map of Israel by the aggregated symptoms ratio in each neighborhood (Fig. 1).
The questionnaire was first distributed online on 14 March 2020, at 14:43 Israel Standard Time (Greenwich Mean Time + 2 hours), and was disseminated through social media and traditional press media. As of 23 March, 18:00 Israel Standard Time, a cumulative number of 74,256 responses had been received from 69,386 adults (93.44%) and 4,870 children (6.56%) (participant characteristics, Table 1). Of these, 3,007 respondents (4.05%) reported that they were currently in isolation, of which 1,458 (48.49%) were in isolation due to a recent international travel and 1,549 (51.51%) were in isolation due to a contact with a person with COVID-19 or a person who recently returned from abroad. A new version of the questionnaire was established on 21 March, driven by new policies implemented by the Israeli MOH (Extended Data Fig. 1) and accumulating data on patients’ symptoms8. The updated version includes several more questions (Extended Data Fig. 2) and has not been distributed yet.
We attempted to reach all sectors of the Israeli population in distributing the survey―first, by translating and distributing it in five languages (Hebrew, Arabic, English, Russian and Amharic) that reflect the most common languages spoken in Israel. Second, we are devoting efforts to reach underrepresented populations through several channels, including call centers, media appearance and promotion of the survey through Arabic—speaking television stations to gain interest and compliance in all sectors of the population.
We analyzed the symptoms ratio of participants by geographical location in Israel (Fig. 1). This analysis revealed differences in the proportion of reported symptoms in participants from different cities and different neighborhoods that are geographically close to each other, which might suggest the ability to detect changes at high geographical resolution.
We also analyzed the association between the prevalence of symptoms reported in the survey and the prevelence of the same symptoms in patients with COVID-198. We then integrated data from the Israeli MOH on the locations of known COVID-19 cases and divided the responses into two groups depending on whether they were living in neighborhoods in which confirmed cases were present or not. Notably, in neighborhoods in which people with confirmed COVID-19 were present, we detected a higher prevalence of symptoms that were highly prevalent in patients with confirmed COVID-19 (e.g., cough) and lower rates of symptoms that were less prevalent (e.g., rhinorrhea), which demonstrates the potential of our method for detecting disease clusters at high geographical resolution (Fig. 2).
In conclusion, we have developed a short survey based on symptoms associated with COVID-19 with the primary goal of early detection of clusters of COVID-19 outbreak. At the time of this writing, only 10 days after the survey was first distributed, 74,256 responses had been received. As expected, we also detected a higher percentage of symptoms among people who were in home isolation than among those who were not (0.06 and 0.05, respectively; P = 5 × 10–14 (two-sample t-test)).
Although the spread of COVID-19 is exponential9, and the number of patients with confirmed COVID-19 in Israel has increased from 193 on 14 March to 1,238 on 23 March10, it has yet to reach the vast majority of Israel’s population. Thus, it is possible that our measured symptoms could be reflective of other conditions (such as influenza) that were prevalent in Israel during this period, as many diseases share common symptoms11.
Our tool has several potential applications. Although it does not have the ability to diagnose individual cases of COVID-19, it might help to predict future spreading zones a few days before an outbreak occurs, with a high level of accuracy, given a sufficient sample size. Here we have provided a color map of Israel by regions of symptoms ratio (Fig. 1), but as the daily response rate increases, we expect to derive predictive models. We anticipate that these would be leveraged by policymakers to make informed decisions through the utilization of efficient regional prevention strategies rather than a uniform approach. Our survey might also be used to evaluate the effectiveness of prevention strategies implemented by public-health organizations, such as the various social-distancing measures that are currently being implemented in many countries, including Israel12. This can be done by measuring the effect of different strategies on reducing the number of symptomatic people. Finally, it might help in elucidating the clinical course of COVID-19 by tracking the dynamics of symptoms in the population over time.
Addressing the ongoing needs of the medical and scientific community, as well as feedback from policymakers, will drive the direction and focus of our future work. To improve ease of use by participants and streamline the data-collection process, we are also building a designated mobile application that will be finalized and rolled out as soon as it is available. We also plan to resolve privacy issues around location sharing in the future application, data for which will be used only at an aggregated level and can substantially improve our models, and provide valuable insights on population interactions, adherence and disease-spread dynamics.
Our approach has many possible clinical implications; however, we have also encountered several challenges. Given that participants will be asked for personal medical information, there are concerns about identification and potential misuse of information. As mentioned above, we ask participants to fill out the survey anonymously, but we do ask for address details. These data are accessible only by the study investigators, and we are investing resources in properly securing the data to ensure that the privacy rights of our participants will be protected. Since our survey is anonymous, we cannot link the same participant’s daily questionnaires, which could provide individual trends as we proceed. Another major challenge with the type of data we collect is that it is prone to selection bias. We observe that regions with relatively high response rate are regions associated with higher socioeconomic status. Some bias may decrease as these surveys become more widely used and thus better reflect the true population; we intend to model and adjust for different factors such as age and location, and to implement national socioeconomic indices, in future analyses.
We urge other countries to adopt this tool and encourage their populations to use these daily, simple, one-minute surveys. We call for an international collaboration that will allow the sharing of methods and collected data. We also call for the large technology and social-media companies already collecting elements of personal data to collaborate in this international effort by sharing regional information to help us improve our models.
The study protocol was approved by the institutional review board (IRB). Informed consent was waived by the IRB, as all identifying details of the participants were removed before the computational analysis. Participants were made fully aware of the way in which the data will be stored, handled and shared, which was provided to them and is in accord with the privacy and data-protection policy of the Weizmann Institute of Science (https://weizmann.ac.il/pages/privacy-policy).
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Guan, W.-J. et al. N. Engl. J. Med. https://doi.org/10.1056/NEJMoa2002032 (2020).
World Health Organization. https://www.who.int/emergencies/diseases/novel-coronavirus-2019 (accessed 24 March 2020).
Klompas, M. Ann. Intern. Med. https://doi.org/10.7326/M20-0751 (2020).
Davis, N. The Guardian https://www.theguardian.com/science/2020/mar/24/uk-app-aims-to-help-researchers-track-spread-of-coronavirus (2020).
Segal, E. et al. medRxiv https://doi.org/10.1101/2020.04.02.20051284 (2020).
State of Israel Ministry of Health. https://www.health.gov.il/English/Topics/Diseases/corona/Pages/default.aspx (accessed 24 March 2020).
Corman, V. M. et al. Euro Surveill. 25, 2000045 (2020).
Zhao, X. et al. medRxiv https://doi.org/10.1101/2020.03.17.20037572 (2020).
Li, Y. et al. medRxiv https://doi.org/10.1101/2020.03.01.20029819 (2020).
State of Israel Ministry of Health. https://www.health.gov.il/English/Topics/Diseases/corona/Pages/press-release.aspx (accessed 24 March 2020).
Zhang, H. et al. Preprints https://doi.org/10.20944/preprints202003.0160.v1 (2020).
Buckee, C. O. et al. Science https://doi.org/10.1126/science.abb8021 (2020).
We thank U. Feinstein for assisting us in defining the surveys, symptoms and medical conditions. We thank F. Zhang, O. Shalem, W. Allen, B. Silbermann, R. Probasco and D. Cheng for insightful discussions and look forward to jointly creating an international consortium with them. We thank the following for their contributions to our efforts: T. Meir, I. Kalka, A. Lavon, T. Karady, A. Godneva, D. Kolobkov, S. Shoer, O. Bartal and the people at Israel Corona Map; and T. Ben-Ami, M. Hashkes, H. Ben-Shushan, R. Miara, T. Eldar, S. Kasem, T. Bria, S. Avraham, B. Kirel, A. Terkeltaub, D. Hizi, A. Kariv, M. Zer-Aviv, N. Kastel, R. Folkman, G. Barabash and the Public Knowledge Workshop (‘Hasadna’).
The authors declare no competing interests.
Project timeline describing all major events in its development including national events which affected and drove its process, from time survey online publication (March 14th, 14:44) to March 23rd, March 21, 18:00 IST.
The most updated version is presented #Questions that were added in the new version of the questionnaire and are therefore not analyzed in this paper *Questions that the participant is required to answer, &Questions that should be filled only once.
About this article
Cite this article
Rossman, H., Keshet, A., Shilo, S. et al. A framework for identifying regional outbreak and spread of COVID-19 from one-minute population-wide surveys. Nat Med 26, 634–638 (2020). https://doi.org/10.1038/s41591-020-0857-9
International Journal of Medical Informatics (2021)
Data-Driven Epidemic Intelligence Strategies Based on Digital Proximity Tracing Technologies in the Fight against COVID-19 in Cities
A Prediction Model to Prioritize Individuals for a SARS-CoV-2 Test Built from National Symptom Surveys
Detecting COVID-19 and other respiratory infections in obstructive sleep apnoea patients through CPAP device telemonitoring
DIGITAL HEALTH (2021)
Estimating seroprevalence of SARS-CoV-2 antibodies using three self-reported symptoms: development of a prediction model based on data from Ischgl, Austria
Epidemiology and Infection (2021)