COVID-19 is the clinical manifestation of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection1. The COVID-19 pandemic has been ongoing for more than two years, with the first case of COVID-19 in Africa reported in Egypt in mid-February 2020 (ref. 2). The first SARS-CoV-2 genome sequenced in Africa was reported in March 2020 (ref. 3). Genomic sequencing and surveillance have played a crucial role in monitoring and mitigating the COVID-19 pandemic. There have been approximately 12 million cases and more than 256,000 deaths reported to date in Africa4, and African countries have contributed substantial amounts of genomic sequencing data to global agencies. For example, two of the five variants of concern (Beta and Omicron) were first identified in Africa through genomic surveillance systems and real-time sequencing and data release5,6.

During the first year of the pandemic, SARS-CoV-2 genomes from Africa were produced mainly by a small number of countries, although genomes were available from 38 of the 54 African countries7. Subsequently, the Africa Centres for Disease Control and Prevention (Africa CDC) and the World Health Organization Regional Office for Africa (WHO AFRO) invested in capacity building and provided resources to equip more African countries to produce genomes locally8. For example, the African Union Commission and Africa CDC launched the Africa Pathogen Genomics Initiative (Africa PGI) with an initial investment of US$100 million. Currently, more than 100,000 genomes, originating from 51 African countries and 4 independent overseas territories, are publicly available from the Global Initiative on Sharing All Influenza Data (GISAID)9.

Dashboards for live COVID-19 information

Online dashboards presenting global and regional COVID-19 data, including case numbers, reported deaths and vaccination rates, have proliferated since the onset of the pandemic10,11,12. These dashboards have a vital role in guiding the public health response and decision-making by policymakers, public health officials and scientists13. Data visualization in dashboards also keeps the public abreast of the state of the pandemic. Examples of genomic dashboards include the Wellcome Sanger Institute’s COVID-19 Genomic Surveillance dashboard (https://go.nature.com/3U9wS8R) and the COVID-19 Genomics UK Consortium dashboard (https://go.nature.com/3Fw32r2). These dashboards include the number of genomes sequenced and the proportion of variants identified in the sequenced genomes, as well as information on the mutations in the lineages of interest. Although these dashboards display important genomic information about England, there was initially no genomics dashboard for the African continent. We therefore set out to devise a dashboard that provides real-time analytical tools for visualizing the state of the pandemic on the African continent from a genomics perspective.

Data inputs for the SARS-CoV-2 Africa dashboard

The SARS-CoV-2 Africa dashboard is an open-source, web-based graphical user interface that presents the data produced by genomic surveillance of COVID-19 on the continent and provides details of the variants that are currently circulating. The dashboard is supported by the main web browsers, including Google Chrome, Mozilla Firefox, Microsoft Edge and Safari. The dashboard collates all sequencing data available in GISAID whose metadata link them to a specific country in Africa, and uses these metadata to display temporal and spatial trends in SARS-CoV-2 evolution in Africa. Genomics data are incorporated into the dashboard using an application programming interface (API) via an agreement with GISAID. The web application processes these data and applies a data quality assessment that can eliminate poor-quality records, for example, sequences that are assigned to a variant but were submitted before that variant was identified (Fig. 1a).
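The quality check described above can be illustrated with a minimal Python sketch. It is not the dashboard's actual code: the record fields (`id`, `country`, `variant`, `date`), the function name `drop_implausible_records` and the cutoff date are all assumptions made for illustration, and the text does not specify which date field the dashboard compares.

```python
from datetime import date
from typing import Dict, Iterable, List


def drop_implausible_records(records: Iterable[dict],
                             earliest_dates: Dict[str, date]) -> List[dict]:
    """Keep records dated on or after the earliest plausible date of their variant."""
    kept = []
    for record in records:
        cutoff = earliest_dates.get(record["variant"])
        # Variants without a known cutoff are kept unchanged.
        if cutoff is None or record["date"] >= cutoff:
            kept.append(record)
    return kept


# Placeholder cutoff, chosen only to make the example concrete (assumption).
earliest = {"Omicron": date(2021, 11, 1)}

# Invented toy records, not real GISAID entries.
records = [
    {"id": "A", "country": "South Africa", "variant": "Omicron", "date": date(2021, 11, 20)},
    {"id": "B", "country": "Botswana", "variant": "Omicron", "date": date(2021, 3, 2)},
    {"id": "C", "country": "Namibia", "variant": "Delta", "date": date(2021, 8, 15)},
]

clean = drop_implausible_records(records, earliest)
print([r["id"] for r in clean])  # record B predates the Omicron cutoff and is dropped
```

Passing the cutoff dates in as a parameter keeps the rule separate from the reference dates, which would need to be maintained as new variants are designated.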

Fig. 1: Incorporation of SARS-CoV-2 data into the SARS-CoV-2 Africa dashboard.

a, An overview of the main features of the interface. b, General filters that allow users to select the data of interest. c, Figure controls that allow users to enable and disable legend elements and labels, select a part of the figure, zoom in and out, and download the plot. d, A tabulated description and mutation map that is provided for variants of interest and/or importance at the time of access. e, A timeline player that displays the mapped progression of the pandemic over time, based on the filter selection (b).

Data processing in the SARS-CoV-2 Africa dashboard

SARS-CoV-2 genomes are accessioned on GISAID with contextual metadata (such as patient details, collection and sampling strategies, and sequencing and assembly methods) that are curated by GISAID before release. GISAID data can be freely accessed and downloaded by users after registration. The data acquisition and processing pipelines use Python 3.6 and the web interface is implemented using Streamlit (https://go.nature.com/3DqDE3o), with charts created using Plotly14. The code can be installed locally for customization in a Conda environment15. Code and dependencies can be installed by cloning the GitHub repository, available at: https://go.nature.com/3WjtMRw.

Performance of the SARS-CoV-2 Africa dashboard

To evaluate dashboard performance, we ran an experiment using ApacheBench version 2.3 (https://go.nature.com/3WjtZEi) at four levels of concurrency (10, 100, 500 and 1,000 simultaneous requests). For each level of concurrency, we issued 5,000 requests; the results showed that the dashboard performed well under simultaneous access. For example, at the highest level (1,000 simultaneous requests), only 0.08% of all requests were not answered. Across the concurrency levels tested, requests were completed in an average of 1.862 to 25.841 ms (Supplementary Table 1).

The dashboard provides interactive visualizations of the temporal and spatial distributions of SARS-CoV-2 variants and their prevalence across different African regions and countries. Several filters are provided to customize the visualizations according to user needs. The main features of the interface are four modules (Fig. 1b–e): general filters allow users to select the data of interest, figure controls allow users to customize the display and take a snapshot of a desired plot, a tabulated weekly summary of variant details is provided, and drop-down mutation maps are available for variants of interest. We also included a timeline player that displays the progression of the pandemic over time, based on user-defined filter selections.
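The concurrency experiment in this section was run with ApacheBench. As a rough illustration of what such a test measures, the following Python sketch issues a fixed number of requests at a given concurrency and reports the share that failed and the mean response time. It is an approximation under stated assumptions, not a reproduction of the authors' setup: the names `load_test`, `http_get_ok` and `stub_request` are hypothetical, and the example runs against a stand-in request so that it needs no network access.

```python
import time
import urllib.request
from concurrent.futures import ThreadPoolExecutor


def http_get_ok(url, timeout=5.0):
    """Return True if a GET request to `url` is answered with status 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            response.read()
            return response.status == 200
    except Exception:
        return False


def timed(request_fn):
    """Run one request and return (succeeded, elapsed milliseconds)."""
    start = time.perf_counter()
    ok = request_fn()
    return ok, (time.perf_counter() - start) * 1000.0


def load_test(request_fn, total_requests, concurrency):
    """Issue `total_requests` calls with at most `concurrency` in flight at once."""
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        results = list(pool.map(lambda _: timed(request_fn), range(total_requests)))
    failed = sum(1 for ok, _ in results if not ok)
    return {
        "requests": total_requests,
        "failed": failed,
        "failed_pct": 100.0 * failed / total_requests,
        "mean_ms": sum(ms for _, ms in results) / total_requests,
    }


def stub_request():
    """Stand-in for a real request (assumption): waits about 1 ms and succeeds."""
    time.sleep(0.001)
    return True


summary = load_test(stub_request, total_requests=200, concurrency=10)
print(summary["failed"], summary["failed_pct"])
```

To test a real deployment, `stub_request` could be replaced with `lambda: http_get_ok(url)` for a URL of the user's choosing. For scale, a failure rate of 0.08% over 5,000 requests corresponds to 4 unanswered requests.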

Case study for Omicron spread

A hypothetical example of the application of this dashboard is shown in Fig. 2. In this scenario, the Minister of Health in Namibia wants to understand the spread of the Omicron lineage in neighbouring countries after reports of Omicron in South Africa and Botswana, to better anticipate how Namibia may be affected. The user would apply filters to display the number of Omicron lineage genomes in each neighbouring country. In Fig. 2, the left-hand panel of the dashboard contains filters that allow data for specific countries, specific regions or all countries to be shown. In this example, Namibia and its neighbouring countries (South Africa, Botswana and Angola) have been selected. On the interactive map titled ‘Genomes per country’, the metric ‘Genomes by variant’ has been selected, with Omicron as the variant. As seen in Fig. 2, when the cursor hovers over a country on the map, the name of the country, the number of genomes produced by that country and the date are displayed.
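The filter selection in this scenario amounts to restricting the records to the chosen countries and variant and then counting genomes per country. A minimal Python sketch of that logic follows; the record layout, the function name `genomes_per_country` and the toy data are assumptions for illustration and do not reflect the dashboard's internal code or real genome counts.

```python
def genomes_per_country(records, countries, variant):
    """Count genomes of `variant` for each selected country (zero if none)."""
    counts = {country: 0 for country in countries}
    for record in records:
        if record["country"] in counts and record["variant"] == variant:
            counts[record["country"]] += 1
    return counts


# Invented toy records, not real GISAID entries.
toy_records = [
    {"country": "South Africa", "variant": "Omicron"},
    {"country": "South Africa", "variant": "Omicron"},
    {"country": "South Africa", "variant": "Delta"},
    {"country": "Botswana", "variant": "Omicron"},
    {"country": "Namibia", "variant": "Delta"},
    {"country": "Kenya", "variant": "Omicron"},
]

selected = ["Namibia", "South Africa", "Botswana", "Angola"]
omicron_counts = genomes_per_country(toy_records, selected, "Omicron")
print(omicron_counts)
```

Countries that are selected but have no matching genomes are kept with a count of zero, so that they still appear on a map rather than silently disappearing.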

Fig. 2: A case study of Omicron spread.

The case study scenario screenshot shows how to investigate Omicron spread in Namibia and neighbouring countries using the filters and charts provided by the SARS-CoV-2 Africa dashboard interface. Logo courtesy of the GISAID Initiative.

When studying the figures on the dashboard with these filters applied, one can see that the genomes deposited in GISAID at the end of October 2021 are dominated by the Delta lineage, with a few remaining Beta genomes. Within the first two weeks of November 2021, the Omicron BA.1 lineage rapidly increased to comprise 19% of all genomes. The timeline animation for Omicron lineages on the map shows the early detection of the lineage in South Africa, with swift progression from a low (light pink) to a high (dark purple) number of cumulative genomes. From these visualizations, the minister would be aware of the rapid spread of Omicron and its growth advantage over Delta, and would be able to see that, at that time, Omicron had the potential to become the dominant variant in southern Africa. The minister would then have the information needed to consult local researchers, public health officials and clinicians on local and regional public health responses to mitigate the effects of Omicron on the population.
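Variant proportions of the kind read off the dashboard in this case study can be derived by grouping genomes by week and dividing each variant's count by the weekly total. The Python sketch below shows one way to do this; the grouping by ISO week, the function name `weekly_variant_share` and the toy records are assumptions, and the numbers are invented rather than taken from GISAID.

```python
from collections import Counter, defaultdict
from datetime import date


def weekly_variant_share(records):
    """Map (ISO year, ISO week) to {variant: percentage of that week's genomes}."""
    weekly = defaultdict(Counter)
    for record in records:
        iso = record["date"].isocalendar()
        weekly[(iso[0], iso[1])][record["variant"]] += 1
    shares = {}
    for week, counts in sorted(weekly.items()):
        total = sum(counts.values())
        shares[week] = {v: 100.0 * n / total for v, n in counts.items()}
    return shares


# Invented toy records: one week in late October 2021, one in early November 2021.
toy_records = (
    [{"variant": "Delta", "date": date(2021, 10, 26)} for _ in range(3)]
    + [{"variant": "Beta", "date": date(2021, 10, 26)}]
    + [{"variant": "Delta", "date": date(2021, 11, 9)} for _ in range(4)]
    + [{"variant": "Omicron", "date": date(2021, 11, 9)}]
)

shares = weekly_variant_share(toy_records)
for week, share in shares.items():
    print(week, share)
```

In this toy data, Omicron is absent in the first week and accounts for one of five genomes in the second, which is the kind of shift the timeline player makes visible.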

Outlook

Numerous dashboards for global and regional COVID-19 data, such as case numbers, reported deaths and vaccination rates, have proliferated since the onset of the COVID-19 pandemic. These dashboards have been vital in guiding the public health response and decision-making by policymakers, public health officials and scientists13. Data visualizations produced by these dashboards have also been useful for keeping the public informed. Genomic surveillance of SARS-CoV-2 has been crucial in monitoring the progression of the pandemic, particularly in the low-vaccination landscape of Africa, where globally important variants have emerged and are likely to continue to appear.

Africa has generated a wealth of genomic surveillance data, with more than 129,000 SARS-CoV-2 genomes currently available on GISAID. The SARS-CoV-2 Africa dashboard is the first detailed online, real-time and interactive tool produced for the Global South. It provides simple and clear graphics that are easy to interpret, and it equips developers to analyse and visualize the data themselves by allowing manual input of data via custom CSV files, formatted according to the provided template (Supplementary Information). Our dashboard makes genomic data, which are often complex and intimidating, accessible to all users, and can be used to inform policy and guide the public health response in Africa and for Africa. All datasets used in our dashboard are in publicly accessible repositories. Genomic data are available from the GISAID database (https://www.gisaid.org/). The SARS-CoV-2 Africa dashboard is freely available at https://climade.health/dashboard/covid-africa/. Source code is available at https://github.com/CERI-KRISP/SARS-Cov-2-Africa-dashboard. Supplementary methods are available at https://doi.org/10.25413/sun.19722025.