A structured open dataset of government interventions in response to COVID-19

Abstract

In response to the COVID-19 pandemic, governments have implemented a wide range of non-pharmaceutical interventions (NPIs). Monitoring and documenting government strategies during the COVID-19 crisis is crucial to understand the progression of the epidemic. Following a content analysis strategy of existing public information sources, we developed a specific hierarchical coding scheme for NPIs. We generated a comprehensive structured dataset of government interventions and their respective timelines of implementation. To improve transparency and motivate collaborative validation process, information sources are shared via an open library. We also provide codes that enable users to visualise the dataset. Standardization and structure of the dataset facilitate inter-country comparison and the assessment of the impacts of different NPI categories on the epidemic parameters, population health indicators, the economy, and human rights, among others. This dataset provides an in-depth insight of the government strategies and can be a valuable tool for developing relevant preparedness plans for pandemic. We intend to further develop and update this dataset until the end of December 2020.

Measurement(s) time at medical intervention • medical intervention
Technology Type(s) digital curation • content analysis strategy of existing information sources
Factor Type(s) non-pharmaceutical intervention • date
Sample Characteristic - Location global

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.12668792

Background & Summary

Non-pharmaceutical interventions (NPIs), also known as public health and social measures (PHSM)1, aim to prevent the introduction of infectious diseases (preparedness and readiness measures), control their spread and reduce their burden on the health system (control measures). The general concept of containing the initial (exponential) spread of a disease is called “flattening the (epi-)curve”2. By reducing the growth rate of an epidemic, NPIs reduce the stress on the healthcare system and help gaining time to develop and produce vaccines and specific medications, which is of utmost importance in the case of emerging infectious diseases.

During the COVID-19 pandemic, governments have enforced a broad spectrum of interventions, under rapidly changing, unprecedented circumstances. Government responses to COVID-19 included the laissez-faire strategy, which implies doing little to nothing, the herd immunity strategy, which implies a few measures only or measures relying on voluntary compliance, and more aggressive approaches based on the implementation of a wide range of stringent NPIs, sometimes even limiting civil rights and liberty3,4. Government control policies have shown divergences in particular in the timeline of implementation and in the prioritization of the NPIs. In China for example, quarantine, social distancing, cordon sanitaire, and isolation of cases have been associated with improvements in the key epidemiological markers, including the number of infections and COVID-19-related deaths5. In Hong Kong and Taiwan, which experienced severe acute respiratory syndrome (SARS) epidemics in 2002–20036,7, early government actions, strict social distancing measures, contact tracing, extensive and proactive testing, and high compliance of the population, have, to date, successfully mitigated the COVID-19 epidemic8,9. Following a herd immunity approach, similar to the one initially adopted by the UK government, the Swedish government did not introduce strict bans but formulated non-binding recommendations only (https://www.folkhalsomyndigheten.se/nyheter-och-press/). Predictive models, however, suggest that such a strategy might ultimately overwhelm the healthcare system10.

Poor control policies have potentially dramatic repercussions on public health. Although the need for data on country-based responses to COVID-19 was urgent and is still crucial, there is a limited opportunity to capture this information. Started in mid-March 2020, our project aims to generate a comprehensive structured dataset on government responses to COVID-19, including the respective time schedules of their implementation.

During the COVID-19 crisis, several data collection efforts related to NPIs have emerged (https://lukaslehner.github.io/covid19policytrackers/). Some of them focus on a specific type of interventions, e.g. the closure of educational institutions (https://en.unesco.org/covid19/educationresponse), travel restrictions (https://www.iata.org/en/programs/safety/health/diseases/), trade-related measures (https://www.wto.org/english/tratop_e/covid19_e/trade_related_goods_measure_e.htm), or measures to ensure continuity of supply of personal protective equipment and critical medical products (http://www.wcoomd.org/en/topics/facilitation/activities-and-programmes/natural-disaster/list-of-countries-coronavirus.aspx), whereas others encompass a larger range of NPIs. In this paper, we also show how we distinguish our work from other concomitant initiatives.

In the context of the current COVID-19 health crisis, open knowledge11 and data sharing are crucial to understand and help to mitigate the pandemic. In this article, we document and share the methodologies, tools and approaches used to produce the Complexity Science Hub COVID-19 Control Strategies List (CCCSL) dataset following the principles of open science. We provide a detailed description of the dataset and present examples of how it can provide insights into the global government response to COVID-19.

The dataset is readily usable for modelling and machine learning analyses and exhibits a great analytical flexibility12. In particular, researchers have leveraged on the hierarchical structure and the granularity of the data to disentangle the individual impacts of the NPIs on the reduction of the effective reproduction number through a top-down approach (from theme to code). Results show that social distancing measures, travel restrictions, but also active risk communication, play a major role in containing the epidemic. The study further distinguishes the impact of different levels of implementation of some specific measures, e.g. those related to face covering12.

Considering the imperative necessity for data on government interventions, we released version 1 of the dataset on 2 April 2020. Version 2, displaying a consolidated coding scheme, is available since 7 May 2020. We also provide user-friendly documentation and materials (codes, visualisation interface, and library of sources) along with the dataset, which allow a maximum understanding of the data and promote its use among non-experts. The dataset is not complete and we continuously update it with new available records. Depending on resources, updates are planned until the end of December 2020.

Methods

We used a content analysis13,14,15 strategy of existing information sources to develop a hierarchical coding scheme specific to NPIs implemented to mitigate the burden of COVID-19. First, based on a literature review on community mitigation strategies and expert knowledge, eight themes (thereafter called level 1 (L1) in the coding scheme) were identified and labelled: (i) Case identification, contact tracing and related measures, (ii) Environmental measures, (iii) Healthcare and public health capacity, (iv) Resource allocation, (v) Risk communication, (vi) Social distancing, (vii) Travel restriction, and (viii) Returning to normal life. A definition for each theme is provided in the Online-only Table 1. At the start of our project, there were no previously published studies on NPIs against COVID-19 to be used as a reference for developing the labelling and coding scheme. Therefore, a list of NPIs that have been already implemented by different governments at this time (mid-March 2020) was compiled, that served as a preliminary template to generate a priori categories within a hierarchical coding scheme. Strategies that could provide assistance to the population (e.g., related to financial support or food supply) or that may encourage compliance with the measures (e.g. resource allocations, risk communication) were also included. Listed interventions were then assigned to one of the eight themes defined above. The specific details and descriptions of each NPI were coded into a priori categories (thereafter called level 2 (L2) in the coding scheme), and into subsequent a priori subcategories and codes whenever needed (thereafter called level 3 (L3) and level 4 (L4) in the coding scheme, respectively). Discrepancies in code assignments were discussed within the coding team and were resolved by consensus. The objective of this hierarchical coding scheme for NPIs was to standardize the data collection and obtain a structured dataset that uses a consistent taxonomy, and therefore, promotes common understanding.

On 19 March 2020, we set up a platform for students, researchers, and volunteers to collect data on the NPIs implemented by the governments for preventing and limiting the spread of COVID-19, including the time schedules for the implementation. Data collectors received clear instructions on the objective of the project and indications on how to proceed for data collection. Data collectors were asked to use the template of a priori themes, categories, subcategories, and codes or to refer to the data curators if a measure could not be coded using this a priori coding system. Therefore, throughout the data collection process, new categories, subcategories, and codes emerged, derived directly from the text data sources. The emergent (inductive) categories and subcategories were openly coded by the data collectors or by the data curators. In a second step, inductive categories and subcategories were compared together and in relation to the entire dataset to detect co-occurrences (codes that partially or completely overlap) and redundancies. Codes with the same meaning were aggregated16. The categories and subcategories were tightened up to the point that maximized mutual exclusivity and exhaustiveness15. This resulted in a Master List of Codes (a list of all the codes that were developed and used in the study), including the curated a priori and inductive coding categories. The Master List replaced the a priori template for categorisation of the measures during data collection. It was shared with the data collectors via a Google spreadsheet and updated daily.

Different public sources were used to populate, update and curate the dataset, including official government sources, peer-reviewed and non-peer-reviewed scientific papers, webpages of public health institutions (World Health Organization, Centers for Disease Control and Prevention, and European Centre for Disease Prevention and Control), press releases, newspaper articles, and government communication through social media. We collected data on the following: (i) country, (ii) state/region (when measures were implemented at subnational-level), (iii) date of implementation of the measure, (iv) implemented measure coded following the four-level classification scheme described above (theme, category, subcategory and code), and (v) source. For each country, data were preferentially collected in the language of the country by native data collectors (i.e. Austria, Belgium, Bosnia and Herzegovina, Brazil, Canada, Croatia, Czech Republic, Ecuador, El Salvador, France, Germany, Ghana, Honduras, Hong Kong, India, Italy, Kazakhstan, Kosovo, Kuwait, Mauritius, Mexico, Montenegro, North Macedonia, New Zealand, Poland, Portugal, Ireland, Romania, Senegal, Serbia, Spain, Syria, Taiwan, and United Kingdom). If this was not possible, Google Translate was used to translate documents17. All records were hand-coded.

Data Records

A static copy of the dataset has been archived in figshare18, including all NPIs recorded as of time of submission (17 July 2020), spanning the period 31 December 2019 to 15 July 2020. A dynamic version of the dataset, which is planned to be continually updated, can be accessed via GitHub: https://github.com/amel-github/covid19-interventionmeasures or from Google Drive: https://drive.google.com/open?id=1041U8iWPDSGI6KHIn9Dg7THkXIo3-gui, in CSV format. Each of the rows represents a single individual NPI and is identified by a unique ID. The Master List of Codes is also available (an additional Master List file displays the hierarchical relationship between each pair of parent/child codes, i.e. L1-L2, L2-L3, and L3-L4, and the number of times each pair occurs in the dataset). We also provide a Glossary of Codes, which gives the definition of each theme, category, subcategory, and code. An online interactive tool, which enables to visualise the dataset hierarchical structure and codes, completes the description of the dataset. It is accessible at: http://covid19-interventions.com/CCCSLgraph/. We have also established a GitHub repository available at: https://github.com/amel-github/CCCSL-Codes and provide codes19 for importing, exploring and visualising the data into R20. Furthermore, for purposes of transparency of data collection and to motivate collaborative validation process as well as a large use and development of the dataset, an open library is available, that contains all sources used to collect the data: https://www.zotero.org/groups/2488884/cccsl_covid_measure_project (>3,100 data sources are included as of date of submission). In order to leverage on the potential of crowdsourcing for populating and curating the CCCSL dataset, we have launched a webpage dedicated to this project at: http://covid19-interventions.com/ where contributors can fill up a Google Form at: https://bit.ly/2KsYOTn, if they wish to correct entries, add a measure, and/or provide a feedback.

The dataset contains the following fields:

ID – Unique identifier for each individually implemented measure. ID is also used in the Google Form to report erroneous entries.

Country – The country where the measure was implemented.

ISO3 – Three-letter country code as published by the International Organization for Standardization.

State – Subnational geographic area. State where the measure was implemented; the country name otherwise. Used for Germany, India, and USA.

Region – Subnational geographic area (e.g. region, department, municipality, city) where the NPI has been locally implemented (i.e. the measure was not implemented nationwide as of the mentioned date). The country or the state name otherwise (i.e. measure implemented nationwide).

Date – Date of implementation of the NPI. Date of announcement was used when the date of implementation of the NPI could not be found and this was specified in the field Comment.

L1_Measure – Theme (L1 of the classification scheme). Eight themes were defined (see Online-only Table 1).

L2_Measure – Category (L2 of the classification scheme). Online-only Table 1 provides the list of the categories for each theme.

L3_Measure – Subcategory (L3 of the classification scheme). Provides detailed information on the corresponding category (L2).

L4_Measure – Code (L4 of the classification scheme). Corresponds to the finest level of description of the measure.

Status – Indicates whether the measure is a prolongation of a previously implemented measure (“Extended”) or not (“”).

Comment – Provides the description of the measure as found in the text data source, translated into English. This field allows to judge the quality of the label for the different levels of the coding scheme and enables to re-assign the measure to the correct theme/category/subcategory/code in case of error or misinterpretation by the data collector21. When available, duration of the restriction, as officially announced, is mentioned in this field.

Source – Provides the reference for each entry, i.e. URL. Enables to trace back potential changes in the meaning of the label during the translation21. Enables to access the description of the measure in the source language and/or to access to the information as it was dispatched originally.

As of date of submission, the CCCSL dataset included information for 6,068 government interventions, from 56 countries, including 33 European countries, 12 Asian countries, five South American countries, two North American countries, one Oceanian country, three African countries, and the Diamond Princess cruise ship. Regarding the USA, data are available at the state level for 24 states. Figure 1, Table 1, and Online-only Table 2 summarize the dataset. A description of the measures grouped by theme (L1) for each country can be computed from the published codes19 (https://github.com/amel-github/CCCSL-Codes).

Fig. 1
figure1

Geographical coverage of the CCCSL and total number of recorded NPIs that were implemented in each country to control the spread of COVID-19. As of date of submission, the dataset includes 56 countries and dates of NPI implementation range from 31/12/2019 to 15/07/2020.

Table 1 Summary of the government interventions recorded in the CCCSL at level 1 (themes) of the coding scheme.

Technical Validation

After the initial data entry, the dataset was checked manually by the data curators. For each measure, concordance between L1, L2, L3, and L4 was checked. Moreover, the unique combinations of L1, L2, L3, and L4 were extracted and controlled for consistency. Typographical and coding errors were minimized through a manual process. We initiated a collaborative curation platform relying on internal and external collaborators who exchanged through Slack, GitHub, Skype, and via emails. This extended effort enabled us to correct typographical and coding errors, to remove line breaks, and to homogenize the dataset for universal use in different programming languages.

Beyond manual validations, we performed a technical validation step to detect possible duplicates. Using the dplyr package for the R Programming Language22, we identified any duplicate entries in the vector composed of country, region, date, and the codes from L1 to L4. Those entries were flagged as possible duplicates and reviewed by hand by two curators, ensuring that the dataset does not contain duplicated entries. An R script to reproduce this step is provided at: https://github.com/amel-github/CCCSL-Codes.

While an important effort has been made for standardizing the records, the four-level-a priori coding scheme originally proposed showed limitations. First, the existing classifications of NPIs are discordant23,24,25. We proposed an original classification scheme that best fitted our (emergency) needs and the specificity of the COVID-19 pandemic, but this scheme may be subjected to revisions in the future. Secondly, some NPIs have been uniquely implemented (e.g. the deportation of Chinese workers by the Kazakh government), which complicated the coding and categorisation process.

Access to the information from government or other official sources may be compromised if not performed timely. Indeed, several governments or national health agencies regularly update their webpage to provide the latest information to the public. Therefore, if sources are not consulted timely, previous content (i.e. previous restrictions and measures) might not be visible straight away and data will have to be retrieved indirectly or from archived websites, which eventually slows down the data collection process and may lead to missing data. Furthermore, while native speakers were recruited whenever possible for data collection, transliteration or translation errors may have occurred when extracting data from Google Translate translations.

Lastly, when using the data for epidemiologic or economic modelling, the absence of an “End date” data element might be a limitation. However, this data cannot be captured for each kind of NPI, e.g. “Increase of healthcare workforce” or “Work safety protocol”. We propose an alternative approach that leverages on the theme “Returning to normal life” and record individually all (i) variations in, (ii) conditions of, and (iii) adaptive measures to the gradual lifting of the restrictions (e.g. re-opening of shops > 400 m², re-opening of classes with examination, weddings allowed if the number of attendees is < 100, etc.). By providing data on each step of the phase-out process, the coding scheme allows therefore to retrieve even more specifically (but indirectly) the “End date” for each NPI (to the best of our knowledge, only the CoronaNet dataset provides a “End date” data element, although as of date of writing, for 30% of the interventions only26).

We plan to maintain the quality level of the dataset with regular updates on the countries currently described. Furthermore, we plan to increase the geographic coverage of the dataset, prioritizing large countries (e.g. China, US states not already covered, and Australia), those with a high number of reported cases (e.g. Vietnam, Iran, Turkey, Russia, Israel, Peru, Chile, Pakistan, Philippines, Saudi Arabia), and those where the epidemic is rising and which may suffer from a data gap (i.e. African and South American countries). The same technical procedures and the classification scheme described above will be applied to any new information to be included in the dataset. Future versions will be subjected to extensive data validation processes. We plan to stabilise the hierarchical coding scheme for NPIs implemented to contain COVID-19 within six months, including measures related to the lifting of the restrictions and adaptive measures that accompany them.

Usage Notes

The aim of this work is not only to improve the current knowledge on country-based interventions implemented to mitigate the burden of COVID-19, but also to characterise the political, public health, and economic strategies of the governments worldwide. Combined with publicly available data on the number of confirmed cases, recovered cases, and deaths, the CCCSL dataset makes it possible to assess the effectiveness of the control policies on the COVID-19 epidemic, e.g. the epidemic growth rate or the daily reproduction numbers12. The standardized coding facilitates an inter-country comparison of government responses. The dataset can further benefit the risk assessment of lifting some restrictions and the development of exit strategies. It can also become an essential data source in the aftermath of the first wave of COVID-19, to guide government control policies anticipating a potential second wave of cases. We envision the CCCSL dataset to become a timely valuable and long-lasting data source for assessing the impact of the NPIs on global public health indicators, the economy, and human rights, among others. We provide below two examples of data usages that give an insight into the responsiveness and aggressiveness of the governments in their management of the COVID-19 crisis.

Mapping the timeline of government interventions during the epidemic

We propose to visualise the time-series of the dates of implementation of the NPIs recorded in the CCCSL at the level 2 of the hierarchical coding scheme (categories) in the 56 countries using a heat map (Fig. 2). In order to highlight country-based differences in the timeline of implementation, we used the epidemic age instead of calendar time. For a given day, t, in a certain country, the epidemic age is defined as the time difference, t-t0, measured in days, where t0 is the first day when the number of confirmed cases was greater or equal to 10. The time-series data of the number of COVID-19 cases was retrieved from the COVID-19 Data Repository by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE) at: https://github.com/CSSEGISandData/COVID-19:https://github.com/CSSEGISandData/COVID-19.

Fig. 2
figure2

Heat map of the dates of implementation of the NPIs recorded in the CCCSL at the level 2 of the hierarchical coding scheme (categories) in 56 countries. Time is in epidemic age with t0 = day when 10 cases were reported (http://covid19-interventions.com/CountryMeasuresHeatmap.svg).

Country-cluster analysis of the government control strategies

In order to partition the countries based on the aggressiveness (number of NPIs) and responsiveness (timeline) of their control strategy, we applied a k-means clustering. We focused on mandatory government interventions (i.e. the theme “Risk communication” was not included) recorded in the CCCSL at the level 2 of the hierarchical coding scheme (categories) that appeared in at least 15 countries, leading to a total number of 40 categories. The clustering algorithm uses the date of implementation of the measures in each country to build a feature vector based on the epidemic age (see above). We considered “anticipatory measures” as those implemented before day when 10 cases were reported; “early measures” as those implemented at the beginning of the epidemic, i.e. between the day when 10 cases were reported and the day when 200 cases were reported; and “late measures” as those implemented at a later stage of the epidemic, i.e. after the day when 200 cases were reported. The algorithm takes also into account the number of measures implemented at these different stages of the epidemic. The time-series data of the number of COVID-19 cases was retrieved from the COVID-19 Data Repository by the JHU CSSE (https://github.com/CSSEGISandData/COVID-19: https://github.com/CSSEGISandData/COVID-19). The optimal number of clusters, k, was determined using the elbow method27. Briefly, this method consists in running k-means clustering on the dataset for a range of values of k (set here from 1 to 15), and for each value of k calculates the sum of squared errors (SSE). We then plotted the SSE for each value of k and identified the best value of k where the line chart looks like an arm (“elbow”). As of date of publication (static version 2020-07-12, 56 countries) the best value of k was eight, explaining 82.8% of the variance (Fig. 3). An interactive version of Fig. 3 is available online at: http://covid19-interventions.com/CountryClusters.html.

Fig. 3
figure3

Country-cluster analysis based on the number of mandatory government interventions and respective dates of implementation, as calculated using the epidemic age (t0 = day when 10 cases were reported) (http://covid19-interventions.com/CountryClusters.html).

Contextualising the project

During the COVID-19 crisis, other projects have concomitantly tracked data on government policies (interchangeably named NPIs18 or government(s’) responses26,28,29 or government measures30 or PHSM31 or policy actions26). We report here on five of them26,28,29,30,31 in order to contextualise our work. The comparison indicates similarities and differences among the NPI trackers and highlights how the CCCSL contributes to the global effort against COVID-19. Supplementary Information 1 outlines the main characteristics of the six datasets (including the CCCSL18).

The core value-added of the CCCSL dataset is the remarkable granularity of the data on NPIs (e.g. seven categories of travel restriction are reported, further divided into more than 50 subcategories) and the use of self-explanatory codes, which, completed with the Glossary of Codes, makes the dataset readily intelligible. As of date of submission, the dataset displays eight themes, 63 categories, >500 subcategories, and >2,000 codes.

With regard to the geographic unit, two datasets record data at the country level28,29 whereas four record data at a finer administrative scale18,26,30,31. One dataset uses a binary code (1/0) to assess the presence/absence of the NPIs28, another one uses an Likert-like scale to further differentiate the level of implementation29, whereas the others use a coding system based on words or short phrases that assign a summative attribute to the data18,26,30,31. Moreover, the aggregation scheme and, sometimes, the semantic of the NPIs diverge widely between the datasets. For example, the CoronaNet dataset26 groups school closure together with lockdown measures whereas the CCCSL18 and the ACAPS30 datasets classify school closure in the theme “Social distancing”. Regarding the restriction on individual movement, this measure is labelled “Partial lockdown” in the ACAPS dataset30, “Household confinement” in the HIT-COVID dataset31, “Lockdown applies to all people” in the CoronaNet dataset26, and “Movements for non-essential activities forbidden” in the CCCSL dataset18. Overall, these projects are independent of each other and the specific research question should indicate which one(s) to use. Harmonizing and integrating the different datasets could help accelerate epidemiological understanding on COVID-19 and the development of relevant preparedness plans for pandemic. The World Health Organization is currently making an important effort in this regard1.

Code availability

A live version of this project is accessible on GitHub at: https://github.com/amel-github/covid19-interventionmeasures. The codes used to describe the CCCSL dataset and the codes used to explore the CCCSL dataset are written in R language19. They are available at: https://github.com/amel-github/CCCSL-Codes. Please refer to the README file in the code release for further instructions.

References

  1. 1.

    World Health Organization. Tracking Public Health and Social Measures A Global Dataset. https://www.who.int/emergencies/diseases/novel-coronavirus-2019/phsm (2020).

  2. 2.

    Anderson, R. M., Heesterbeek, H., Klinkenberg, D. & Hollingsworth, T. D. How will country-based mitigation measures influence the course of the COVID-19 epidemic? Lancet 395, 931–934 (2020).

    CAS  Article  Google Scholar 

  3. 3.

    Ugarov, A. Inclusive costs of NPI measures for COVID-19 pandemic: three approaches. Preprint at https://doi.org/10.1101/2020.03.26.20044552 (2020).

  4. 4.

    Studdert, D. M. & Hall, M. A. Disease control, civil liberties, and mass testing — Calibrating restrictions during the Covid-19 pandemic. N. Engl. J. Med. 383, 102–104 (2020).

    CAS  Article  Google Scholar 

  5. 5.

    Pan, A. et al. Association of public health interventions with the epidemiology of the COVID-19 outbreak in Wuhan, China. JAMA 323, 1915–1923 (2020).

    CAS  Article  Google Scholar 

  6. 6.

    Chen, K.-T. et al. SARS in Taiwan: an overview and lessons learned. Int. J. Infect. Dis. 9, 77–85 (2005).

    Article  Google Scholar 

  7. 7.

    Hung, L. S. The SARS epidemic in Hong Kong: what lessons have we learned? J. R. Soc. Med. 96, 374–378 (2003).

    Article  Google Scholar 

  8. 8.

    Cowling, B. J. et al. Impact assessment of non-pharmaceutical interventions against coronavirus disease 2019 and influenza in Hong Kong: an observational study. Lancet Public Health 5, E279–E288 (2020).

    Article  Google Scholar 

  9. 9.

    Wang, C. J., Ng, C. Y. & Brook, R. H. Response to COVID-19 in Taiwan: Big data analytics, new technology, and proactive testing. JAMA 323, 1341–1342 (2020).

    CAS  Article  Google Scholar 

  10. 10.

    Rocklov, J. COVID-19 health care demand and mortality in Sweden in response to non-pharmaceutical (NPIs) mitigation and suppression scenarios. Preprint at https://doi.org/10.1101/2020.03.20.20039594 (2020).

  11. 11.

    Molloy, J. C. The open knowledge foundation: open data means better science. PLoS Biol. 9, e1001195 (2011).

    CAS  Article  Google Scholar 

  12. 12.

    Haug, N. et al. Ranking the effectiveness of worldwide COVID-19 government interventions. Preprint at https://doi.org/10.1101/2020.07.06.20147199 (2020).

  13. 13.

    Vaismoradi, M., Turunen, H. & Bondas, T. Content analysis and thematic analysis: implications for conducting a qualitative descriptive study. Nurs. Health Sci. 15, 398–405 (2013).

    Article  Google Scholar 

  14. 14.

    Erlingsson, C. & Brysiewicz, P. A hands-on guide to doing content analysis. Afr. J. Emerg. Med. 7, 93–99 (2017).

    Article  Google Scholar 

  15. 15.

    Weber, R. Basic Content Analysis 2 edn (SAGE Publications, Inc, 1990).

  16. 16.

    Gläser, J. & Laudel, G. Life with and without coding: Two methods for early-stage data analysis in qualitative research aiming at causal explanations. Forum Qual. Soc. Res. 14, Art. 5 (2013).

  17. 17.

    Windsor, L. C., Cupit, J. G. & Windsor, A. J. Automated content analysis across six languages. PLoS One 14, e0224425 (2019).

    CAS  Article  Google Scholar 

  18. 18.

    Desvars-Larrive, A. et al. A structured open dataset of government interventions in response to COVID-19. figshare https://doi.org/10.6084/m9.figshare.c.4962266 (2020).

  19. 19.

    Desvars-Larrive, A., Dervic, E., Haug, N. & Garcia, D. A structured open dataset of government interventions in response to COVID-19–Codes for exploration and visualisation. Zenodo, https://doi.org/10.5281/zenodo.3949808 (2020).

  20. 20.

    R Core Team. R: A language and environment for statistical computing, https://www.R-project.org/ (R Foundation for Statistical Computing, Vienna, Austria, 2020).

  21. 21.

    Vaismoradi, M., Jones, J., Turunen, H. & Snelgrove, S. Theme development in qualitative content analysis and thematic analysis. J. Nurs. Educ. Pract. 6, 100–110 (2016).

    Google Scholar 

  22. 22.

    Wickham, H., François, R., Henry, L. & Müller, K. dplyr: a grammar of data manipulation, https://CRAN.R-project.org/package=dplyr (2020).

  23. 23.

    Centers for Disease Control and Prevention. Interim Pre-Pandemic Planning Guidance: Community Strategy for Pandemic Influenza Mitigation in the United States: Early, Targeted, Layered Use of Nonpharmaceutical Interventions (Stephen B. Thacker CDC Library, 2007).

  24. 24.

    European Centre for Disease Prevention and Control. Technical Report. Guide to Revision of National Pandemic Influenza Preparedness Plans - Lessons Learned From the 2009 A(H1N1) Pandemic (ECDC, Stockholm, 2017).

  25. 25.

    World Health Organization. Non-Pharmaceutical Public Health Measures for Mitigating the Risk and Impact of Epidemic and Pandemic Influenza (World Health Organization, 2019).

  26. 26.

    Cheng, C., Barceló, J., Hartnett, A., Kubinec, R. & Messerschmidt, L. COVID-19 Government Response Event Dataset (CoronaNet v1.0). Nat. Hum. Behav. 4, 756–768 (2020).

    Article  Google Scholar 

  27. 27.

    Yuan, C. & Yang, H. Research on K-value selection method of K-means clustering algorithm. J. 2, 226–235 (2019).

    Google Scholar 

  28. 28.

    Porcher, S. Governments’ Responses to COVID-19. OPENICPSR https://doi.org/10.3886/E119061V4 (2020).

  29. 29.

    Hale, T., Webster, S., Petherick, A., Phillips, T. & Kira, B. Oxford COVID-19 Government Response Tracker, Blavatnik School of Government. https://www.bsg.ox.ac.uk/research/research-projects/coronavirus-government-response-tracker (2020).

  30. 30.

    ACAPS. #COVID19 Government Measures Dataset. https://www.acaps.org/covid19-government-measures-dataset (2020).

  31. 31.

    Zheng, Q. et al. HIT-COVID, a global database tracking public health interventions to COVID-19. Sci. Data https://doi.org/10.1038/s41597-020-00610-2 (2020).

  32. 32.

    European Centre for Disease Prevention and Control. Technical Report. Considerations Relating to Social Distancing Measures in Response to COVID-19 – Second Update (ECDC, Stockholm, 2020).

  33. 33.

    World Health Assembly. International Health Regulations (2005). (World Health Organization, 2016).

  34. 34.

    Kinlaw, K. & Levine, R. J. Ethical guidelines in Pandemic Influenza—Recommendations of the Ethics Subcommittee of the Advisory Committee to the Director, Centers for Disease Control and Prevention. (Centers for Disease Control and Prevention, 2007).

Download references

Acknowledgements

D.G. and A.D.N. acknowledge funding from the Vienna Science and Technology Fund - WWTF (VRG16-005). S.T. acknowledges funding from the Austrian Research Promotion Agency FFG (project number 882184) and the Vienna Science and Technology Fund - WWTF (COV20-017). The authors acknowledge William Schueller for his help in the recruitment of the team of data collectors and Petar Sekulic for his advice in the analysis of the dataset. We warmly thank Caspar Matzhold and Michaela Kaleta for checking and testing our code components.

Author information

Affiliations

Authors

Contributions

A.D.L. managed and coordinated the production of the dataset and prepared it for publication, including: developing the coding scheme, collecting and curating the data, managing the team of data collectors, creating the library of sources, writing the data descriptor, and creating the tables. E.D. performed exploratory data analyses and provided the major contribution to the production of the plots and figures. The following authors: N.H., T.N., E.C., A.D.N., J.L., D.S.G., A.R., J.S. and D.G. produced a substantial work to generate the dataset and prepare it for publication, including: developing the coding scheme, collecting the data, formatting the data for presentation of the published work, visualising the data, and creating the library of sources. D.G. created the webpage dedicated to this project. The following list of authors (listed alphabetically): A.C., A.T., A.D., A.P., A.J., D.C., D.L., D.Bu., D.Be., E.F.T., F.Á., H.T., J.K., J.R., J.G.M., J.S., L.H., L.S., L.G., L.G., M.R.F., M.B., R.V., S.H., S.H., V.A., V.R., V.D.P.S., X.C., X.M.P.O. and Z.G. includes the many individuals who collected and curated data through the dedicated platform and provided comments to facilitate the use of the dataset across different programming platforms. A.D.L. and D.G. jointly supervised this work. S.T. mentored the core team. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Amélie Desvars-Larrive.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Online-only Tables

Online-only Table 1 Definition of the eight themes (L1) used to classify the NPIs and associated categories (L2).
Online-only Table 2 Top 20 most frequent categories (L2) of NPIs implemented to control the spread of COVID-19. As of date of submission, the dataset includes 56 countries and dates of NPI implementation range from 31/12/2019 to 15/07/2020.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Desvars-Larrive, A., Dervic, E., Haug, N. et al. A structured open dataset of government interventions in response to COVID-19. Sci Data 7, 285 (2020). https://doi.org/10.1038/s41597-020-00609-9

Download citation

Further reading

Search

Sign up for the Nature Briefing newsletter for a daily update on COVID-19 science.
Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing