A structured open dataset of government interventions in response to COVID-19

Desvars-Larrive, Amélie; Dervic, Elma; Haug, Nina; Niederkrotenthaler, Thomas; Chen, Jiaying; Di Natale, Anna; Lasser, Jana; Gliga, Diana S.; Roux, Alexandra; Sorger, Johannes; Chakraborty, Abhijit; Ten, Alexandr; Dervic, Alija; Pacheco, Andrea; Jurczak, Ania; Cserjan, David; Lederhilger, Diana; Bulska, Dominika; Berishaj, Dorontinë; Tames, Erwin Flores; Álvarez, Francisco S.; Takriti, Huda; Korbel, Jan; Reddish, Jenny; Grzymała-Moszczyńska, Joanna; Stangl, Johannes; Hadziavdic, Lamija; Stoeger, Laura; Gooriah, Leana; Geyrhofer, Lukas; Ferreira, Marcia R.; Bartoszek, Marta; Vierlinger, Rainer; Holder, Samantha; Haberfellner, Simon; Ahne, Verena; Reisch, Viktoria; Servedio, Vito D. P.; Chen, Xiao; Pocasangre-Orellana, Xochilt María; Garncarek, Zuzanna; Garcia, David; Thurner, Stefan

doi:10.1038/s41597-020-00609-9

Download PDF

Data Descriptor
Open access
Published: 27 August 2020

A structured open dataset of government interventions in response to COVID-19

Amélie Desvars-Larrive ORCID: orcid.org/0000-0001-7671-696X^1,2,
Elma Dervic ORCID: orcid.org/0000-0001-7168-3310^2,3,
Nina Haug ORCID: orcid.org/0000-0002-5130-9193^2,3,
Thomas Niederkrotenthaler^2,4,
Jiaying Chen ORCID: orcid.org/0000-0003-4297-1545^2,3,
Anna Di Natale^2,3,
Jana Lasser^2,3,
Diana S. Gliga^na1,
Alexandra Roux^5,6,
Johannes Sorger²,
Abhijit Chakraborty^2,7,
Alexandr Ten⁸,
Alija Dervic ORCID: orcid.org/0000-0003-2149-193X⁹,
Andrea Pacheco¹⁰,
Ania Jurczak¹¹,
David Cserjan²,
Diana Lederhilger^na1,
Dominika Bulska¹²,
Dorontinë Berishaj^na1,
Erwin Flores Tames ORCID: orcid.org/0000-0003-0941-5512²,
Francisco S. Álvarez ORCID: orcid.org/0000-0002-4018-775X¹³,
Huda Takriti²,
Jan Korbel ORCID: orcid.org/0000-0002-5371-5320^2,3,
Jenny Reddish^2,14,
Joanna Grzymała-Moszczyńska¹¹,
Johannes Stangl^na1,
Lamija Hadziavdic^na1,
Laura Stoeger²,
Leana Gooriah ORCID: orcid.org/0000-0003-1064-972X¹⁰,
Lukas Geyrhofer ORCID: orcid.org/0000-0002-8043-2975²,
Marcia R. Ferreira²,
Marta Bartoszek¹¹,
Rainer Vierlinger^na1,
Samantha Holder,
Simon Haberfellner,
Verena Ahne²,
Viktoria Reisch^na1,
Vito D. P. Servedio²,
Xiao Chen^na1,
Xochilt María Pocasangre-Orellana ORCID: orcid.org/0000-0002-1022-2219¹³,
Zuzanna Garncarek¹¹,
David Garcia ORCID: orcid.org/0000-0002-2820-9151^2,3 &
…
Stefan Thurner^2,3,15

Scientific Data volume 7, Article number: 285 (2020) Cite this article

34k Accesses
115 Citations
225 Altmetric
Metrics details

Subjects

Abstract

In response to the COVID-19 pandemic, governments have implemented a wide range of non-pharmaceutical interventions (NPIs). Monitoring and documenting government strategies during the COVID-19 crisis is crucial to understand the progression of the epidemic. Following a content analysis strategy of existing public information sources, we developed a specific hierarchical coding scheme for NPIs. We generated a comprehensive structured dataset of government interventions and their respective timelines of implementation. To improve transparency and motivate collaborative validation process, information sources are shared via an open library. We also provide codes that enable users to visualise the dataset. Standardization and structure of the dataset facilitate inter-country comparison and the assessment of the impacts of different NPI categories on the epidemic parameters, population health indicators, the economy, and human rights, among others. This dataset provides an in-depth insight of the government strategies and can be a valuable tool for developing relevant preparedness plans for pandemic. We intend to further develop and update this dataset until the end of December 2020.

Measurement(s)	time at medical intervention • medical intervention
Technology Type(s)	digital curation • content analysis strategy of existing information sources
Factor Type(s)	non-pharmaceutical intervention • date
Sample Characteristic - Location	global

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.12668792

A dataset of non-pharmaceutical interventions on SARS-CoV-2 in Europe

Article Open access 01 April 2022

George Altman, Janvi Ahuja, … Jan Markus Brauner

Non-pharmaceutical interventions to combat COVID-19 in the Americas described through daily sub-national data

Article Open access 21 October 2023

Michael Touchton, Felicia Marie Knaul, … Valentina Vargas Enciso

A global panel database of pandemic policies (Oxford COVID-19 Government Response Tracker)

Article 08 March 2021

Thomas Hale, Noam Angrist, … Helen Tatlow

Background & Summary

Non-pharmaceutical interventions (NPIs), also known as public health and social measures (PHSM)¹, aim to prevent the introduction of infectious diseases (preparedness and readiness measures), control their spread and reduce their burden on the health system (control measures). The general concept of containing the initial (exponential) spread of a disease is called “flattening the (epi-)curve”². By reducing the growth rate of an epidemic, NPIs reduce the stress on the healthcare system and help gaining time to develop and produce vaccines and specific medications, which is of utmost importance in the case of emerging infectious diseases.

During the COVID-19 pandemic, governments have enforced a broad spectrum of interventions, under rapidly changing, unprecedented circumstances. Government responses to COVID-19 included the laissez-faire strategy, which implies doing little to nothing, the herd immunity strategy, which implies a few measures only or measures relying on voluntary compliance, and more aggressive approaches based on the implementation of a wide range of stringent NPIs, sometimes even limiting civil rights and liberty^3,4. Government control policies have shown divergences in particular in the timeline of implementation and in the prioritization of the NPIs. In China for example, quarantine, social distancing, cordon sanitaire, and isolation of cases have been associated with improvements in the key epidemiological markers, including the number of infections and COVID-19-related deaths⁵. In Hong Kong and Taiwan, which experienced severe acute respiratory syndrome (SARS) epidemics in 2002–2003^6,7, early government actions, strict social distancing measures, contact tracing, extensive and proactive testing, and high compliance of the population, have, to date, successfully mitigated the COVID-19 epidemic^8,9. Following a herd immunity approach, similar to the one initially adopted by the UK government, the Swedish government did not introduce strict bans but formulated non-binding recommendations only (https://www.folkhalsomyndigheten.se/nyheter-och-press/). Predictive models, however, suggest that such a strategy might ultimately overwhelm the healthcare system¹⁰.

Poor control policies have potentially dramatic repercussions on public health. Although the need for data on country-based responses to COVID-19 was urgent and is still crucial, there is a limited opportunity to capture this information. Started in mid-March 2020, our project aims to generate a comprehensive structured dataset on government responses to COVID-19, including the respective time schedules of their implementation.

During the COVID-19 crisis, several data collection efforts related to NPIs have emerged (https://lukaslehner.github.io/covid19policytrackers/). Some of them focus on a specific type of interventions, e.g. the closure of educational institutions (https://en.unesco.org/covid19/educationresponse), travel restrictions (https://www.iata.org/en/programs/safety/health/diseases/), trade-related measures (https://www.wto.org/english/tratop_e/covid19_e/trade_related_goods_measure_e.htm), or measures to ensure continuity of supply of personal protective equipment and critical medical products (http://www.wcoomd.org/en/topics/facilitation/activities-and-programmes/natural-disaster/list-of-countries-coronavirus.aspx), whereas others encompass a larger range of NPIs. In this paper, we also show how we distinguish our work from other concomitant initiatives.

In the context of the current COVID-19 health crisis, open knowledge¹¹ and data sharing are crucial to understand and help to mitigate the pandemic. In this article, we document and share the methodologies, tools and approaches used to produce the Complexity Science Hub COVID-19 Control Strategies List (CCCSL) dataset following the principles of open science. We provide a detailed description of the dataset and present examples of how it can provide insights into the global government response to COVID-19.

The dataset is readily usable for modelling and machine learning analyses and exhibits a great analytical flexibility¹². In particular, researchers have leveraged on the hierarchical structure and the granularity of the data to disentangle the individual impacts of the NPIs on the reduction of the effective reproduction number through a top-down approach (from theme to code). Results show that social distancing measures, travel restrictions, but also active risk communication, play a major role in containing the epidemic. The study further distinguishes the impact of different levels of implementation of some specific measures, e.g. those related to face covering¹².

Considering the imperative necessity for data on government interventions, we released version 1 of the dataset on 2 April 2020. Version 2, displaying a consolidated coding scheme, is available since 7 May 2020. We also provide user-friendly documentation and materials (codes, visualisation interface, and library of sources) along with the dataset, which allow a maximum understanding of the data and promote its use among non-experts. The dataset is not complete and we continuously update it with new available records. Depending on resources, updates are planned until the end of December 2020.

Methods

We used a content analysis^13,14,15 strategy of existing information sources to develop a hierarchical coding scheme specific to NPIs implemented to mitigate the burden of COVID-19. First, based on a literature review on community mitigation strategies and expert knowledge, eight themes (thereafter called level 1 (L1) in the coding scheme) were identified and labelled: (i) Case identification, contact tracing and related measures, (ii) Environmental measures, (iii) Healthcare and public health capacity, (iv) Resource allocation, (v) Risk communication, (vi) Social distancing, (vii) Travel restriction, and (viii) Returning to normal life. A definition for each theme is provided in the Online-only Table 1. At the start of our project, there were no previously published studies on NPIs against COVID-19 to be used as a reference for developing the labelling and coding scheme. Therefore, a list of NPIs that have been already implemented by different governments at this time (mid-March 2020) was compiled, that served as a preliminary template to generate a priori categories within a hierarchical coding scheme. Strategies that could provide assistance to the population (e.g., related to financial support or food supply) or that may encourage compliance with the measures (e.g. resource allocations, risk communication) were also included. Listed interventions were then assigned to one of the eight themes defined above. The specific details and descriptions of each NPI were coded into a priori categories (thereafter called level 2 (L2) in the coding scheme), and into subsequent a priori subcategories and codes whenever needed (thereafter called level 3 (L3) and level 4 (L4) in the coding scheme, respectively). Discrepancies in code assignments were discussed within the coding team and were resolved by consensus. The objective of this hierarchical coding scheme for NPIs was to standardize the data collection and obtain a structured dataset that uses a consistent taxonomy, and therefore, promotes common understanding.

On 19 March 2020, we set up a platform for students, researchers, and volunteers to collect data on the NPIs implemented by the governments for preventing and limiting the spread of COVID-19, including the time schedules for the implementation. Data collectors received clear instructions on the objective of the project and indications on how to proceed for data collection. Data collectors were asked to use the template of a priori themes, categories, subcategories, and codes or to refer to the data curators if a measure could not be coded using this a priori coding system. Therefore, throughout the data collection process, new categories, subcategories, and codes emerged, derived directly from the text data sources. The emergent (inductive) categories and subcategories were openly coded by the data collectors or by the data curators. In a second step, inductive categories and subcategories were compared together and in relation to the entire dataset to detect co-occurrences (codes that partially or completely overlap) and redundancies. Codes with the same meaning were aggregated¹⁶. The categories and subcategories were tightened up to the point that maximized mutual exclusivity and exhaustiveness¹⁵. This resulted in a Master List of Codes (a list of all the codes that were developed and used in the study), including the curated a priori and inductive coding categories. The Master List replaced the a priori template for categorisation of the measures during data collection. It was shared with the data collectors via a Google spreadsheet and updated daily.

Different public sources were used to populate, update and curate the dataset, including official government sources, peer-reviewed and non-peer-reviewed scientific papers, webpages of public health institutions (World Health Organization, Centers for Disease Control and Prevention, and European Centre for Disease Prevention and Control), press releases, newspaper articles, and government communication through social media. We collected data on the following: (i) country, (ii) state/region (when measures were implemented at subnational-level), (iii) date of implementation of the measure, (iv) implemented measure coded following the four-level classification scheme described above (theme, category, subcategory and code), and (v) source. For each country, data were preferentially collected in the language of the country by native data collectors (i.e. Austria, Belgium, Bosnia and Herzegovina, Brazil, Canada, Croatia, Czech Republic, Ecuador, El Salvador, France, Germany, Ghana, Honduras, Hong Kong, India, Italy, Kazakhstan, Kosovo, Kuwait, Mauritius, Mexico, Montenegro, North Macedonia, New Zealand, Poland, Portugal, Ireland, Romania, Senegal, Serbia, Spain, Syria, Taiwan, and United Kingdom). If this was not possible, Google Translate was used to translate documents¹⁷. All records were hand-coded.

Data Records

A static copy of the dataset has been archived in figshare¹⁸, including all NPIs recorded as of time of submission (17 July 2020), spanning the period 31 December 2019 to 15 July 2020. A dynamic version of the dataset, which is planned to be continually updated, can be accessed via GitHub: https://github.com/amel-github/covid19-interventionmeasures or from Google Drive: https://drive.google.com/open?id=1041U8iWPDSGI6KHIn9Dg7THkXIo3-gui, in CSV format. Each of the rows represents a single individual NPI and is identified by a unique ID. The Master List of Codes is also available (an additional Master List file displays the hierarchical relationship between each pair of parent/child codes, i.e. L1-L2, L2-L3, and L3-L4, and the number of times each pair occurs in the dataset). We also provide a Glossary of Codes, which gives the definition of each theme, category, subcategory, and code. An online interactive tool, which enables to visualise the dataset hierarchical structure and codes, completes the description of the dataset. It is accessible at: http://covid19-interventions.com/CCCSLgraph/. We have also established a GitHub repository available at: https://github.com/amel-github/CCCSL-Codes and provide codes¹⁹ for importing, exploring and visualising the data into R²⁰. Furthermore, for purposes of transparency of data collection and to motivate collaborative validation process as well as a large use and development of the dataset, an open library is available, that contains all sources used to collect the data: https://www.zotero.org/groups/2488884/cccsl_covid_measure_project (>3,100 data sources are included as of date of submission). In order to leverage on the potential of crowdsourcing for populating and curating the CCCSL dataset, we have launched a webpage dedicated to this project at: http://covid19-interventions.com/ where contributors can fill up a Google Form at: https://bit.ly/2KsYOTn, if they wish to correct entries, add a measure, and/or provide a feedback.

The dataset contains the following fields:

ID – Unique identifier for each individually implemented measure. ID is also used in the Google Form to report erroneous entries.

Country – The country where the measure was implemented.

ISO3 – Three-letter country code as published by the International Organization for Standardization.

State – Subnational geographic area. State where the measure was implemented; the country name otherwise. Used for Germany, India, and USA.

Region – Subnational geographic area (e.g. region, department, municipality, city) where the NPI has been locally implemented (i.e. the measure was not implemented nationwide as of the mentioned date). The country or the state name otherwise (i.e. measure implemented nationwide).

Date – Date of implementation of the NPI. Date of announcement was used when the date of implementation of the NPI could not be found and this was specified in the field Comment.

L1_Measure – Theme (L1 of the classification scheme). Eight themes were defined (see Online-only Table 1).

L2_Measure – Category (L2 of the classification scheme). Online-only Table 1 provides the list of the categories for each theme.

L3_Measure – Subcategory (L3 of the classification scheme). Provides detailed information on the corresponding category (L2).

L4_Measure – Code (L4 of the classification scheme). Corresponds to the finest level of description of the measure.

Status – Indicates whether the measure is a prolongation of a previously implemented measure (“Extended”) or not (“”).

Comment – Provides the description of the measure as found in the text data source, translated into English. This field allows to judge the quality of the label for the different levels of the coding scheme and enables to re-assign the measure to the correct theme/category/subcategory/code in case of error or misinterpretation by the data collector²¹. When available, duration of the restriction, as officially announced, is mentioned in this field.

Source – Provides the reference for each entry, i.e. URL. Enables to trace back potential changes in the meaning of the label during the translation²¹. Enables to access the description of the measure in the source language and/or to access to the information as it was dispatched originally.

As of date of submission, the CCCSL dataset included information for 6,068 government interventions, from 56 countries, including 33 European countries, 12 Asian countries, five South American countries, two North American countries, one Oceanian country, three African countries, and the Diamond Princess cruise ship. Regarding the USA, data are available at the state level for 24 states. Figure 1, Table 1, and Online-only Table 2 summarize the dataset. A description of the measures grouped by theme (L1) for each country can be computed from the published codes¹⁹ (https://github.com/amel-github/CCCSL-Codes).

Table 1 Summary of the government interventions recorded in the CCCSL at level 1 (themes) of the coding scheme.

Full size table

Technical Validation

After the initial data entry, the dataset was checked manually by the data curators. For each measure, concordance between L1, L2, L3, and L4 was checked. Moreover, the unique combinations of L1, L2, L3, and L4 were extracted and controlled for consistency. Typographical and coding errors were minimized through a manual process. We initiated a collaborative curation platform relying on internal and external collaborators who exchanged through Slack, GitHub, Skype, and via emails. This extended effort enabled us to correct typographical and coding errors, to remove line breaks, and to homogenize the dataset for universal use in different programming languages.

Beyond manual validations, we performed a technical validation step to detect possible duplicates. Using the dplyr package for the R Programming Language²², we identified any duplicate entries in the vector composed of country, region, date, and the codes from L1 to L4. Those entries were flagged as possible duplicates and reviewed by hand by two curators, ensuring that the dataset does not contain duplicated entries. An R script to reproduce this step is provided at: https://github.com/amel-github/CCCSL-Codes.

While an important effort has been made for standardizing the records, the four-level-a priori coding scheme originally proposed showed limitations. First, the existing classifications of NPIs are discordant^23,24,25. We proposed an original classification scheme that best fitted our (emergency) needs and the specificity of the COVID-19 pandemic, but this scheme may be subjected to revisions in the future. Secondly, some NPIs have been uniquely implemented (e.g. the deportation of Chinese workers by the Kazakh government), which complicated the coding and categorisation process.

Access to the information from government or other official sources may be compromised if not performed timely. Indeed, several governments or national health agencies regularly update their webpage to provide the latest information to the public. Therefore, if sources are not consulted timely, previous content (i.e. previous restrictions and measures) might not be visible straight away and data will have to be retrieved indirectly or from archived websites, which eventually slows down the data collection process and may lead to missing data. Furthermore, while native speakers were recruited whenever possible for data collection, transliteration or translation errors may have occurred when extracting data from Google Translate translations.

Lastly, when using the data for epidemiologic or economic modelling, the absence of an “End date” data element might be a limitation. However, this data cannot be captured for each kind of NPI, e.g. “Increase of healthcare workforce” or “Work safety protocol”. We propose an alternative approach that leverages on the theme “Returning to normal life” and record individually all (i) variations in, (ii) conditions of, and (iii) adaptive measures to the gradual lifting of the restrictions (e.g. re-opening of shops > 400 m², re-opening of classes with examination, weddings allowed if the number of attendees is < 100, etc.). By providing data on each step of the phase-out process, the coding scheme allows therefore to retrieve even more specifically (but indirectly) the “End date” for each NPI (to the best of our knowledge, only the CoronaNet dataset provides a “End date” data element, although as of date of writing, for 30% of the interventions only²⁶).

We plan to maintain the quality level of the dataset with regular updates on the countries currently described. Furthermore, we plan to increase the geographic coverage of the dataset, prioritizing large countries (e.g. China, US states not already covered, and Australia), those with a high number of reported cases (e.g. Vietnam, Iran, Turkey, Russia, Israel, Peru, Chile, Pakistan, Philippines, Saudi Arabia), and those where the epidemic is rising and which may suffer from a data gap (i.e. African and South American countries). The same technical procedures and the classification scheme described above will be applied to any new information to be included in the dataset. Future versions will be subjected to extensive data validation processes. We plan to stabilise the hierarchical coding scheme for NPIs implemented to contain COVID-19 within six months, including measures related to the lifting of the restrictions and adaptive measures that accompany them.

Usage Notes

The aim of this work is not only to improve the current knowledge on country-based interventions implemented to mitigate the burden of COVID-19, but also to characterise the political, public health, and economic strategies of the governments worldwide. Combined with publicly available data on the number of confirmed cases, recovered cases, and deaths, the CCCSL dataset makes it possible to assess the effectiveness of the control policies on the COVID-19 epidemic, e.g. the epidemic growth rate or the daily reproduction numbers¹². The standardized coding facilitates an inter-country comparison of government responses. The dataset can further benefit the risk assessment of lifting some restrictions and the development of exit strategies. It can also become an essential data source in the aftermath of the first wave of COVID-19, to guide government control policies anticipating a potential second wave of cases. We envision the CCCSL dataset to become a timely valuable and long-lasting data source for assessing the impact of the NPIs on global public health indicators, the economy, and human rights, among others. We provide below two examples of data usages that give an insight into the responsiveness and aggressiveness of the governments in their management of the COVID-19 crisis.

Mapping the timeline of government interventions during the epidemic

We propose to visualise the time-series of the dates of implementation of the NPIs recorded in the CCCSL at the level 2 of the hierarchical coding scheme (categories) in the 56 countries using a heat map (Fig. 2). In order to highlight country-based differences in the timeline of implementation, we used the epidemic age instead of calendar time. For a given day, t, in a certain country, the epidemic age is defined as the time difference, t-t₀, measured in days, where t₀ is the first day when the number of confirmed cases was greater or equal to 10. The time-series data of the number of COVID-19 cases was retrieved from the COVID-19 Data Repository by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE) at: https://github.com/CSSEGISandData/COVID-19.

Country-cluster analysis of the government control strategies

In order to partition the countries based on the aggressiveness (number of NPIs) and responsiveness (timeline) of their control strategy, we applied a k-means clustering. We focused on mandatory government interventions (i.e. the theme “Risk communication” was not included) recorded in the CCCSL at the level 2 of the hierarchical coding scheme (categories) that appeared in at least 15 countries, leading to a total number of 40 categories. The clustering algorithm uses the date of implementation of the measures in each country to build a feature vector based on the epidemic age (see above). We considered “anticipatory measures” as those implemented before day when 10 cases were reported; “early measures” as those implemented at the beginning of the epidemic, i.e. between the day when 10 cases were reported and the day when 200 cases were reported; and “late measures” as those implemented at a later stage of the epidemic, i.e. after the day when 200 cases were reported. The algorithm takes also into account the number of measures implemented at these different stages of the epidemic. The time-series data of the number of COVID-19 cases was retrieved from the COVID-19 Data Repository by the JHU CSSE (https://github.com/CSSEGISandData/COVID-19). The optimal number of clusters, k, was determined using the elbow method²⁷. Briefly, this method consists in running k-means clustering on the dataset for a range of values of k (set here from 1 to 15), and for each value of k calculates the sum of squared errors (SSE). We then plotted the SSE for each value of k and identified the best value of k where the line chart looks like an arm (“elbow”). As of date of publication (static version 2020-07-12, 56 countries) the best value of k was eight, explaining 82.8% of the variance (Fig. 3). An interactive version of Fig. 3 is available online at: http://covid19-interventions.com/CountryClusters.html.

Contextualising the project

During the COVID-19 crisis, other projects have concomitantly tracked data on government policies (interchangeably named NPIs¹⁸ or government(s’) responses^26,28,29 or government measures³⁰ or PHSM³¹ or policy actions²⁶). We report here on five of them^{26,28,29,30,31} in order to contextualise our work. The comparison indicates similarities and differences among the NPI trackers and highlights how the CCCSL contributes to the global effort against COVID-19. Supplementary Information 1 outlines the main characteristics of the six datasets (including the CCCSL¹⁸).

The core value-added of the CCCSL dataset is the remarkable granularity of the data on NPIs (e.g. seven categories of travel restriction are reported, further divided into more than 50 subcategories) and the use of self-explanatory codes, which, completed with the Glossary of Codes, makes the dataset readily intelligible. As of date of submission, the dataset displays eight themes, 63 categories, >500 subcategories, and >2,000 codes.

With regard to the geographic unit, two datasets record data at the country level^28,29 whereas four record data at a finer administrative scale^18,26,30,31. One dataset uses a binary code (1/0) to assess the presence/absence of the NPIs²⁸, another one uses an Likert-like scale to further differentiate the level of implementation²⁹, whereas the others use a coding system based on words or short phrases that assign a summative attribute to the data^18,26,30,31. Moreover, the aggregation scheme and, sometimes, the semantic of the NPIs diverge widely between the datasets. For example, the CoronaNet dataset²⁶ groups school closure together with lockdown measures whereas the CCCSL¹⁸ and the ACAPS³⁰ datasets classify school closure in the theme “Social distancing”. Regarding the restriction on individual movement, this measure is labelled “Partial lockdown” in the ACAPS dataset³⁰, “Household confinement” in the HIT-COVID dataset³¹, “Lockdown applies to all people” in the CoronaNet dataset²⁶, and “Movements for non-essential activities forbidden” in the CCCSL dataset¹⁸. Overall, these projects are independent of each other and the specific research question should indicate which one(s) to use. Harmonizing and integrating the different datasets could help accelerate epidemiological understanding on COVID-19 and the development of relevant preparedness plans for pandemic. The World Health Organization is currently making an important effort in this regard¹.

Code availability

A live version of this project is accessible on GitHub at: https://github.com/amel-github/covid19-interventionmeasures. The codes used to describe the CCCSL dataset and the codes used to explore the CCCSL dataset are written in R language¹⁹. They are available at: https://github.com/amel-github/CCCSL-Codes. Please refer to the README file in the code release for further instructions.

References

World Health Organization. Tracking Public Health and Social Measures A Global Dataset. https://www.who.int/emergencies/diseases/novel-coronavirus-2019/phsm (2020).
Anderson, R. M., Heesterbeek, H., Klinkenberg, D. & Hollingsworth, T. D. How will country-based mitigation measures influence the course of the COVID-19 epidemic? Lancet 395, 931–934 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ugarov, A. Inclusive costs of NPI measures for COVID-19 pandemic: three approaches. Preprint at https://doi.org/10.1101/2020.03.26.20044552 (2020).
Studdert, D. M. & Hall, M. A. Disease control, civil liberties, and mass testing — Calibrating restrictions during the Covid-19 pandemic. N. Engl. J. Med. 383, 102–104 (2020).
Article CAS PubMed Google Scholar
Pan, A. et al. Association of public health interventions with the epidemiology of the COVID-19 outbreak in Wuhan, China. JAMA 323, 1915–1923 (2020).
Article CAS PubMed Google Scholar
Chen, K.-T. et al. SARS in Taiwan: an overview and lessons learned. Int. J. Infect. Dis. 9, 77–85 (2005).
Article PubMed PubMed Central Google Scholar
Hung, L. S. The SARS epidemic in Hong Kong: what lessons have we learned? J. R. Soc. Med. 96, 374–378 (2003).
Article PubMed PubMed Central Google Scholar
Cowling, B. J. et al. Impact assessment of non-pharmaceutical interventions against coronavirus disease 2019 and influenza in Hong Kong: an observational study. Lancet Public Health 5, E279–E288 (2020).
Article PubMed PubMed Central Google Scholar
Wang, C. J., Ng, C. Y. & Brook, R. H. Response to COVID-19 in Taiwan: Big data analytics, new technology, and proactive testing. JAMA 323, 1341–1342 (2020).
Article CAS PubMed Google Scholar
Rocklov, J. COVID-19 health care demand and mortality in Sweden in response to non-pharmaceutical (NPIs) mitigation and suppression scenarios. Preprint at https://doi.org/10.1101/2020.03.20.20039594 (2020).
Molloy, J. C. The open knowledge foundation: open data means better science. PLoS Biol. 9, e1001195 (2011).
Article CAS PubMed PubMed Central Google Scholar
Haug, N. et al. Ranking the effectiveness of worldwide COVID-19 government interventions. Preprint at https://doi.org/10.1101/2020.07.06.20147199 (2020).
Vaismoradi, M., Turunen, H. & Bondas, T. Content analysis and thematic analysis: implications for conducting a qualitative descriptive study. Nurs. Health Sci. 15, 398–405 (2013).
Article PubMed Google Scholar
Erlingsson, C. & Brysiewicz, P. A hands-on guide to doing content analysis. Afr. J. Emerg. Med. 7, 93–99 (2017).
Article PubMed PubMed Central Google Scholar
Weber, R. Basic Content Analysis 2 edn (SAGE Publications, Inc, 1990).
Gläser, J. & Laudel, G. Life with and without coding: Two methods for early-stage data analysis in qualitative research aiming at causal explanations. Forum Qual. Soc. Res. 14, Art. 5 (2013).
Windsor, L. C., Cupit, J. G. & Windsor, A. J. Automated content analysis across six languages. PLoS One 14, e0224425 (2019).
Article CAS PubMed PubMed Central Google Scholar
Desvars-Larrive, A. et al. A structured open dataset of government interventions in response to COVID-19. figshare https://doi.org/10.6084/m9.figshare.c.4962266 (2020).
Desvars-Larrive, A., Dervic, E., Haug, N. & Garcia, D. A structured open dataset of government interventions in response to COVID-19–Codes for exploration and visualisation. Zenodo, https://doi.org/10.5281/zenodo.3949808 (2020).
R Core Team. R: A language and environment for statistical computing, https://www.R-project.org/ (R Foundation for Statistical Computing, Vienna, Austria, 2020).
Vaismoradi, M., Jones, J., Turunen, H. & Snelgrove, S. Theme development in qualitative content analysis and thematic analysis. J. Nurs. Educ. Pract. 6, 100–110 (2016).
Google Scholar
Wickham, H., François, R., Henry, L. & Müller, K. dplyr: a grammar of data manipulation, https://CRAN.R-project.org/package=dplyr (2020).
Centers for Disease Control and Prevention. Interim Pre-Pandemic Planning Guidance: Community Strategy for Pandemic Influenza Mitigation in the United States: Early, Targeted, Layered Use of Nonpharmaceutical Interventions (Stephen B. Thacker CDC Library, 2007).
European Centre for Disease Prevention and Control. Technical Report. Guide to Revision of National Pandemic Influenza Preparedness Plans - Lessons Learned From the 2009 A(H1N1) Pandemic (ECDC, Stockholm, 2017).
World Health Organization. Non-Pharmaceutical Public Health Measures for Mitigating the Risk and Impact of Epidemic and Pandemic Influenza (World Health Organization, 2019).
Cheng, C., Barceló, J., Hartnett, A., Kubinec, R. & Messerschmidt, L. COVID-19 Government Response Event Dataset (CoronaNet v1.0). Nat. Hum. Behav. 4, 756–768 (2020).
Article PubMed Google Scholar
Yuan, C. & Yang, H. Research on K-value selection method of K-means clustering algorithm. J. 2, 226–235 (2019).
Google Scholar
Porcher, S. Governments’ Responses to COVID-19. OPENICPSR https://doi.org/10.3886/E119061V4 (2020).
Hale, T., Webster, S., Petherick, A., Phillips, T. & Kira, B. Oxford COVID-19 Government Response Tracker, Blavatnik School of Government. https://www.bsg.ox.ac.uk/research/research-projects/coronavirus-government-response-tracker (2020).
ACAPS. #COVID19 Government Measures Dataset. https://www.acaps.org/covid19-government-measures-dataset (2020).
Zheng, Q. et al. HIT-COVID, a global database tracking public health interventions to COVID-19. Sci. Data https://doi.org/10.1038/s41597-020-00610-2 (2020).
European Centre for Disease Prevention and Control. Technical Report. Considerations Relating to Social Distancing Measures in Response to COVID-19 – Second Update (ECDC, Stockholm, 2020).
World Health Assembly. International Health Regulations (2005). (World Health Organization, 2016).
Kinlaw, K. & Levine, R. J. Ethical guidelines in Pandemic Influenza—Recommendations of the Ethics Subcommittee of the Advisory Committee to the Director, Centers for Disease Control and Prevention. (Centers for Disease Control and Prevention, 2007).

Download references

Acknowledgements

D.G. and A.D.N. acknowledge funding from the Vienna Science and Technology Fund - WWTF (VRG16-005). S.T. acknowledges funding from the Austrian Research Promotion Agency FFG (project number 882184) and the Vienna Science and Technology Fund - WWTF (COV20-017). The authors acknowledge William Schueller for his help in the recruitment of the team of data collectors and Petar Sekulic for his advice in the analysis of the dataset. We warmly thank Caspar Matzhold and Michaela Kaleta for checking and testing our code components.

Author information

Unaffiliated

Authors and Affiliations

Unit of Veterinary Public Health and Epidemiology, University of Veterinary Medicine Vienna, Veterinaerplatz 1, 1210, Vienna, Austria
Amélie Desvars-Larrive
Complexity Science Hub Vienna, Josefstaedter Strasse 39, 1080, Vienna, Austria
Amélie Desvars-Larrive, Elma Dervic, Nina Haug, Thomas Niederkrotenthaler, Jiaying Chen, Anna Di Natale, Jana Lasser, Johannes Sorger, Abhijit Chakraborty, David Cserjan, Erwin Flores Tames, Huda Takriti, Jan Korbel, Jenny Reddish, Laura Stoeger, Lukas Geyrhofer, Marcia R. Ferreira, Verena Ahne, Vito D. P. Servedio, David Garcia & Stefan Thurner
Section for Science of Complex Systems, Medical University of Vienna, Spitalgasse 23, 1090, Vienna, Austria
Elma Dervic, Nina Haug, Jiaying Chen, Anna Di Natale, Jana Lasser, Jan Korbel, David Garcia & Stefan Thurner
Unit Suicide Research & Mental Health Promotion, Center for Public Health, Department of Social and Preventive Medicine, Medical University of Vienna, Kinderspitalgasse 15, 1090, Vienna, Austria
Thomas Niederkrotenthaler
CERMES3, Ecole des Hautes Etudes en Sciences Sociales, 7 rue Guy Moquet, 94801, Villejuif, France
Alexandra Roux
Gender, Sexuality, Health, CESP, INSERM, Paris-Saclay University, Paris-Sud University, UVSQ, Hôpital Paul Brousse, 16 avenue Paul Vaillant Couturier, 94807, Villejuif, France
Alexandra Roux
Advanced Systems Analysis, International Institute for Applied Systems Analysis (IIASA), Schlossplatz 1, A-2361, Laxenburg, Austria
Abhijit Chakraborty
Flowers project-team, National Research Institute for Digital Sciences (INRIA), 200 avenue de la Vieille Tour, 33405, Talence, France
Alexandr Ten
Institute of Electrodynamics, Microwave and Circuit Engineering, Vienna University of Technology, Gusshausstrasse 25, 1040, Vienna, Austria
Alija Dervic
German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Deutscher Platz 5E, 04103, Leipzig, Germany
Andrea Pacheco & Leana Gooriah
Institute of Psychology, Jagiellonian University, ul. Gołębia 24, 31-007, Kraków, Poland
Ania Jurczak, Joanna Grzymała-Moszczyńska, Marta Bartoszek & Zuzanna Garncarek
Institute for Social Studies, University of Warsaw, ul. Krakowskie Przedmieście 26/28, 00-924, Warsaw, Poland
Dominika Bulska
Fundación Naturaleza El Salvador, Research Department, Colonia Escalón, San Salvador, CP 1101, El Salvador
Francisco S. Álvarez & Xochilt María Pocasangre-Orellana
Seshat: The Global History Databank http://seshatdatabank.info/
Jenny Reddish
Santa Fe Institute, Santa Fe, NM 87501, USA
Stefan Thurner

Authors

Amélie Desvars-Larrive
View author publications
You can also search for this author in PubMed Google Scholar
Elma Dervic
View author publications
You can also search for this author in PubMed Google Scholar
Nina Haug
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Niederkrotenthaler
View author publications
You can also search for this author in PubMed Google Scholar
Jiaying Chen
View author publications
You can also search for this author in PubMed Google Scholar
Anna Di Natale
View author publications
You can also search for this author in PubMed Google Scholar
Jana Lasser
View author publications
You can also search for this author in PubMed Google Scholar
Diana S. Gliga
View author publications
You can also search for this author in PubMed Google Scholar
Alexandra Roux
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Sorger
View author publications
You can also search for this author in PubMed Google Scholar
Abhijit Chakraborty
View author publications
You can also search for this author in PubMed Google Scholar
Alexandr Ten
View author publications
You can also search for this author in PubMed Google Scholar
Alija Dervic
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Pacheco
View author publications
You can also search for this author in PubMed Google Scholar
Ania Jurczak
View author publications
You can also search for this author in PubMed Google Scholar
David Cserjan
View author publications
You can also search for this author in PubMed Google Scholar
Diana Lederhilger
View author publications
You can also search for this author in PubMed Google Scholar
Dominika Bulska
View author publications
You can also search for this author in PubMed Google Scholar
Dorontinë Berishaj
View author publications
You can also search for this author in PubMed Google Scholar
Erwin Flores Tames
View author publications
You can also search for this author in PubMed Google Scholar
Francisco S. Álvarez
View author publications
You can also search for this author in PubMed Google Scholar
Huda Takriti
View author publications
You can also search for this author in PubMed Google Scholar
Jan Korbel
View author publications
You can also search for this author in PubMed Google Scholar
Jenny Reddish
View author publications
You can also search for this author in PubMed Google Scholar
Joanna Grzymała-Moszczyńska
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Stangl
View author publications
You can also search for this author in PubMed Google Scholar
Lamija Hadziavdic
View author publications
You can also search for this author in PubMed Google Scholar
Laura Stoeger
View author publications
You can also search for this author in PubMed Google Scholar
Leana Gooriah
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Geyrhofer
View author publications
You can also search for this author in PubMed Google Scholar
Marcia R. Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
Marta Bartoszek
View author publications
You can also search for this author in PubMed Google Scholar
Rainer Vierlinger
View author publications
You can also search for this author in PubMed Google Scholar
Samantha Holder
View author publications
You can also search for this author in PubMed Google Scholar
Simon Haberfellner
View author publications
You can also search for this author in PubMed Google Scholar
Verena Ahne
View author publications
You can also search for this author in PubMed Google Scholar
Viktoria Reisch
View author publications
You can also search for this author in PubMed Google Scholar
Vito D. P. Servedio
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xochilt María Pocasangre-Orellana
View author publications
You can also search for this author in PubMed Google Scholar
Zuzanna Garncarek
View author publications
You can also search for this author in PubMed Google Scholar
David Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Thurner
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.D.L. managed and coordinated the production of the dataset and prepared it for publication, including: developing the coding scheme, collecting and curating the data, managing the team of data collectors, creating the library of sources, writing the data descriptor, and creating the tables. E.D. performed exploratory data analyses and provided the major contribution to the production of the plots and figures. The following authors: N.H., T.N., E.C., A.D.N., J.L., D.S.G., A.R., J.S. and D.G. produced a substantial work to generate the dataset and prepare it for publication, including: developing the coding scheme, collecting the data, formatting the data for presentation of the published work, visualising the data, and creating the library of sources. D.G. created the webpage dedicated to this project. The following list of authors (listed alphabetically): A.C., A.T., A.D., A.P., A.J., D.C., D.L., D.Bu., D.Be., E.F.T., F.Á., H.T., J.K., J.R., J.G.M., J.S., L.H., L.S., L.G., L.G., M.R.F., M.B., R.V., S.H., S.H., V.A., V.R., V.D.P.S., X.C., X.M.P.O. and Z.G. includes the many individuals who collected and curated data through the dedicated platform and provided comments to facilitate the use of the dataset across different programming platforms. A.D.L. and D.G. jointly supervised this work. S.T. mentored the core team. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Amélie Desvars-Larrive.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information 1

Online-only Tables

Online-only Table 1 Definition of the eight themes (L1) used to classify the NPIs and associated categories (L2).

Full size table

Online-only Table 2 Top 20 most frequent categories (L2) of NPIs implemented to control the spread of COVID-19. As of date of submission, the dataset includes 56 countries and dates of NPI implementation range from 31/12/2019 to 15/07/2020.

Full size table

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and permissions

About this article

Cite this article

Desvars-Larrive, A., Dervic, E., Haug, N. et al. A structured open dataset of government interventions in response to COVID-19. Sci Data 7, 285 (2020). https://doi.org/10.1038/s41597-020-00609-9

Download citation

Received: 05 May 2020
Accepted: 27 July 2020
Published: 27 August 2020
DOI: https://doi.org/10.1038/s41597-020-00609-9

This article is cited by

Mechanical ventilation as a major driver of COVID-19 hospitalization costs: a costing study in a German setting
- Leslie R. Zwerwer
- Jan Kloka
- Benjamin Friedrichson
Health Economics Review (2024)
Harmonizing government responses to the COVID-19 pandemic
- Cindy Cheng
- Luca Messerschmidt
- Joan Barceló
Scientific Data (2024)
Impact of COVID-19 restrictions on diabetes mellitus management in Qatari primary care settings
- Ahmed Sameer Al Nuaimi
- Muhammad Tanveer Alam
- Mohamed Ahmed Syed
Discover Health Systems (2024)
Digital Resilience in Dealing with Misinformation on Social Media during COVID-19
- Stefka Schmid
- Katrin Hartwig
- Christian Reuter
Information Systems Frontiers (2024)
Measuring changes in adult health and well-being during the COVID-19 pandemic and their relationship with adverse childhood experiences and current social assets: a cross-sectional survey
- Mark A. Bellis
- Karen Hughes
- Helen Lowey
BMC Public Health (2023)