Introduction

The global report on coronavirus disease 2019 (COVID-19) revealed 770,563,467 confirmed cases (including 20,917,453 active cases) and 6,957,216 deaths (i.e. 0.9% case fatality rate [CFR]) in 224 affected countries as of September 7, 20231. In Africa, 54 countries have been affected by the pandemic, with 12,837,874 confirmed cases and 258,830 deaths (i.e. 2.1% CFR) across the continent. In Cameroon, there were 125,165 confirmed cases (including 34 active cases) and 1,974 deaths (1.6% CFR) across all health districts following the COVID-19 situation report of August 27, 20232,3.

COVID-19 pandemic has been characterised by several epidemiological waves and the emergence of new severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants from the ancestral strain from Wuhan, China4. Variants are classified according to their level of significance as variants of high consequence (VHC), variants of concern (VOC), variants of interest (VOI) or variants under monitoring (VUM)5,6. While VHCs have not yet been reported, several VOCs have been identified as the driving force of viral circulation and dispersal, with higher burdens reported in Northern Europe, Central America, and sub-Saharan Africa7,8.

Regarding the dynamics and spread of VOCs over time, the Alpha (B1.1.7), Beta (B1.351), and Gamma (P1) variants were among the first emerging viral clades, with a foremost circulation of Alpha; Delta variant (B1.617) then emerged predominantly (faster, fitter and more transmissible compared to previous variants), and finally the emergence of Omicron variant (B1.1.529) showed the highest viral fitness over previously known VOCs9. With these rapid changes in SARS-CoV-2 patterns, it is of paramount importance to establish a strategy for variant surveillance in order timely mitigate their potential impacts10. Of the 10,293,748 whole-genome sequences available by April 22, 2022, in the Global Initiative on Sharing All Influenza Data (GISAID), Omicron variant represents approximately half, followed proportionally by the Alpha, eta, Delta, and Gamma variants11.

In Cameroon, the first COVID-19 case was detected on March 6, 202012, and the country has experienced five different waves with varying outbreak magnitudes, durations, number of confirmed cases and hospitalisations, number of severe or critical cases, number of deaths, and CFR12. The hypothetical variability in the clinical features and epidemiological trends warrants investigating on possible implications of SARS-CoV-2 variants on the dynamics of the pandemic13. Such genomic investigation would contribute in designing context-specific public health measures as part of the pandemic response strategies. Of note, SARS-CoV-2 genomic surveillance can shed light on the origins of viral lineages (imported or emerging locally), the transmission dynamics and phylogeography of these viruses, and their potential clinical relevance (disease severity) and public health implications (transmissibility) at the national level13. In this frame, we sought to ascertain the introduction and dynamics of SARS-CoV-2 lineages and their effects on transmission and disease severity following the various epidemiological waves in the Cameroonian context.

Results

Based on whole-genome sequences of SARS-CoV-2 from Cameroon deposited in GISAID between August 2021 and March 2022, a total of 760 individual samples from Cameroonian residents were enrolled in the present study.

The mean age of the study population was 36 (min–max: 2–86) years and 45.0% were within the age range 26–45. Regarding gender distribution, 50.9% were male and 49.1% female.

Distribution of the study population with whole-genome sequences by region of residence

Samples were from 9/10 regions of Cameroon (Table 1) and classified into three categories according to sampling/proportions:

  • three regions with high proportions of samples (≥ 10%): Centre, Littoral, and Southwest;

  • two regions with moderate proportions of samples (between 5 and 10%): East and West;

  • four regions with low proportions of samples (< 5%): North, South, Adamawa, and Far-North.

Table 1 Distribution of the study samples according to region of origin.

This geographical distribution indicates that only 30% of the regions in Cameroon had an acceptable coverage for SARS-CoV-2 genomic surveillance nationwide.

Diversity of SARS-CoV-2 lineage from whole-genome sequencing

Phylogenetic analysis of the 760 whole-genome sequences revealed that the greater proportion of SARS-CoV-2 variants circulating in Cameroon belonged to the viral sub-lineages of the ancestral strain from Wuhan (74%), 15% Delta, 6% Omicron, 3% Alpha and 2% Beta variant (Fig. 1). The observed distribution reflects the high number of samples processed for genomic surveillance at the early phase of the pandemic (see Supplementary materials, SDC1 and SDC2).

Figure 1
figure 1

Phylogenetic tree of SARS-COV-2 lineages when using whole-genome sequences. Figure is composed of 760 individual genome sequences. The collection dates range from 2020-03-06 to 2022-02-02; Data were collected in one country and territory; all sequences in this dataset were rooted with the hCoV-19/Wuhan/WIV04/2019 (WIV04), the official reference sequence employed by GISAID (EPI_ISL_402124), https://gisaid.org/WIV04.

Dynamics of SARS-CoV-2 lineages over time

Throughout the study reporting period, the patterns of SARS-CoV-2 evolved over time. From March 2020 to November 2020, the introduction of the cases of SARS-CoV-2 occurred, solely with viruses of the lineage of origin. In December 2020, first cases of Alpha and Beta variants were identified and remained in circulation till May 2021 (for Alpha) and June 2021 (for Beta). First cases of the Delta variant were identified in March 2021, with the number of cases increasing substantially until October 2021, followed by a slight upsurge between November 2021 and January 2022. Finally, first cases of Omicron emerged by September 2021 overtaking the Delta variant to reach 100% circulation in February 2022 (Fig. 2).

Figure 2
figure 2

SARS-COV-2 lineage dynamics per month in Cameroon. X-axis represents month-year of sampling; Y-axis represents the percentage of SARS-CoV-2 strains; colours correspond to each viral strain.

Variations in major SARS-CoV-2 lineages according to epidemiological waves

Figure 3 provides the trends in duration, number of confirmed cases, number of deaths, and the CFR from one wave to another.

  1. i.

    During the first wave, the epidemic was driven by SARS-CoV-2 lineages of origin/non-VOC; the outbreak duration was moderate (16 weeks); the number of confirmed cases was moderate (16,948); the number of hospitalised cases was high (1847); the number of deaths was moderate (386); and the CFR was high (2.3%).

  2. ii.

    During the second wave, the epidemic was driven by the co-introduction of Alpha and Beta alongside SARS-CoV-2 lineages of origin/non-VOC; the outbreak duration was long (21 weeks); the number of confirmed cases was high (52,271); the number of hospitalised cases was high (4675); the number of deaths was high (835); and the CRF was moderate (1.6%).

  3. iii.

    During the third wave, the epidemic was driven by Delta alongside SARS-CoV-2 lineages of origin/non-VOC; the outbreak duration was moderate (11 weeks); the number of confirmed cases was high (21,753); the number of hospitalised cases was high (2230); the number of deaths was moderate (426); and the CFR was high (2.0%).

  4. iv.

    During the fourth wave, the epidemic was mainly driven by Omicron; the outbreak duration was short (8 weeks), the number of confirmed cases was moderate (10,803), the number of hospitalised cases was low (809), the number of deaths was low (79); and the CFR was low (0.73%).

Figure 3
figure 3

SARS-CoV-2 lineage dynamics per wave in Cameroon. The x-axis shows the epidemiological week and year (i.e. S10_2020 means week10_year2020); colours correspond to specific clinical conditions (confirmed cases, hospitalised, deaths, oxygen therapy). The following data are provided by wave: the order of wave; the duration (in weeks); the period-year of the wave; date start–date end; number confirmed cases per wave; number of hospitalised cases per wave; number of deaths per wave; case fatality rate per wave; viral strains isolated per wave.

Correlation between the wave duration and number of cases according to variant dynamics

Figure 4 presents the trend in the duration of each wave (Fig. 4a) and the number of cases per wave (Fig. 4b), alongside the detection of major circulating VOC for each wave.

Figure 4
figure 4

Correlation analysis between the wave length and the number of confirmed cases following variant dynamics. (a) Duration of outbreak by wave; (b) Number of confirmed cases by wave, WT wild type.

From wave 1 to wave 4, there was an overall declining trend in the wave duration (mean duration: 14 weeks) as well as the number of confirmed cases per wave (mean value: 25,444). These trends showed a significant positive correlation between the wave duration and the number of confirmed cases (z score = − 2.50672; p = 0.01208), indicating that the dynamics of SARS-CoV-2 variants were not the primary drivers of the number of cases observed per wave. Hence, viral transmission was mainly driven by outbreak duration.

Correlation between the number of hospitalisations and CFR according to variant dynamics

From wave 1 to wave 4, there was an overall declining trend in the number of hospitalisations (mean: 2390 cases) and the CFR (mean value: 1.66) per wave (Fig. 5a and b, respectively). Despite the significant correlation (z-score = 2.50672; p = 0.01208), there was a discrepancy between the low number of cases and the high CFRs in Wave 1 and Wave 3. This suggests that the original viral lineage and the Delta variant contributed to the severity of COVID-19 in the Cameroonian context.

Figure 5
figure 5

Correlation analysis between hospitalisation and CFR following variant dynamics. (a) Number of hospitalisations by wave; (b) Case fatality rate by wave, WT wild type.

Discussion

The present reveals the power of genomic surveillance for SARS-COV-2 in understanding the the dynamics in the epidemiology of COVID-19 at national level13,16. In settings with limited access to sequencing17,18, collaborative efforts enabled sampling and sequencing of SARS-CoV-2 through the genomic surveillance platform in place19,20,21, with international partnerships13,16,22. Thus, this initiative could be viable for any genomic surveillance of any pathogen of pandemic or epidemic potential (Ebola, Zika, Mpox viruses, cholera, antimicrobial resistant strains, etc.) as supported by Africa CDC3 and other agencies18,23.

In this study, the mean age of the population was 36 (26–45) years. This represents the most active population often involved in travels, social or occupational activities that increase exposure to SARS-CoV-2, as previously reported in similar settings24,25,26. However, this observation is different in the Western world most likely due to greater adherence to barrier measures27. The sex ratio showed a similar distribution in SARS-CoV-2 cases, indicating similar risk of infection/exposure at population-level28.

Sampling was from 9/10 of the national regions, suggesting wide near-national coverage (only North-West region excluded). However, only 30% of regions achieved a desirable sampling for genomic surveillance (at least 10% of sequence data). This is in line with coverage of molecular testing mostly found in major townships25,29. Thus, genomic data from difficult-to-reach settings are less covered30.

According to available sequence data, the viral sub-lineages of origin and non-VOCs represent the majority (74%) as compared to VOCs, reflecting efforts in genomic surveillance during the early phase of the pandemic even at the global level31,32,33. Interestingly, transmission dynamics confirms the first cases as viral sub-lineages of origin; followed between December 2020 and April 2021 by the co-introduction of alpha and beta as first VOCs, favoured by population migrations for end-of-year holidays from the Western world where these variants were already prevalent31,34. First cases of delta were identified in March 2021 with a peak in October 2021. The increase in cases with delta would be favoured by the variant affinity for the ACE2 receptor due to mutations35, leading to enhanced viral attachment, high viral load and prolonged duration of infection36. First cases of omicron were found in September 2021 and predominate as from December 2021 to reach 100% by end of February 2022. Regarding the evolutionary trends of variants per wave, the first wave (driven by lineages of origin) has a medium length in duration, a moderate number of cases and deaths, but a high CFR which could be attributed to limited experience/logistics of the health system to respond to an unknown disease37; not neglecting the effect of stigma and fear on late clinic attendance during that early phase38,39. The second wave was driven by the co-introduction of alpha and beta variants alongside other sub-lineages and non-VOCs, and was characterised by the longest outbreak duration of about five months, a high number of cases and hospitalisations, but a moderate CRF. The decreased CFR might be due to timely clinic attendance and gradual experience in case management39,40. Moreover, as compared to the high circulation of sub-lineages of origin and non-VOCs, alpha and beta variants had limited effects on transmission and disease severity, which prone their disappearance41,42. In contrast, the third wave (driven by the Delta variant) had a moderate duration of about three months but an increased/high number of confirmed cases (over 20,000), hospitalisations (over 1000) and high CFR (2%). Thus, in a relatively short timeframe, delta variant considerably increased both the transmission rate and the disease severity41,42. The fourth wave (driven solely by omicron) had a very short duration of about two months and a reduced/moderate number of cases (10,803) and low number of hospitalisations/deaths and CFR (0.73%), which underscores the contribution of omicron on viral transmission but without severity43,44. The overall decline in outbreak duration and number of cases across waves highlights the fact that the extent of viral transmission/spread was mainly driven by the duration of the outbreak. However, the low number of cases and high CFRs during wave 1 and wave 3 signifies that the original viral lineage and delta variant contributed to COVID-19 severity in the Cameroonian context, alongside other multifaceted determinants (naïve populations at the early stage of the pandemic, etc.)25. Thus, among VOCs, delta was a substantial driver of COVID-19 severity and death, likely attributed to loss in antibody affinity due to viral antigenic mutations in the receptor-binding domain45.

The main limitation of our study is the lack of proportionally representative samples across all regions and across the four waves. This suggests a reduced generalizability of the findings, especially in terms of overall prevailing viral lineages. Nonetheless, these findings, generated with high-quality and full-length sequences validated by a public repository (GISAID) and interpreted using a robust phylogenetic pipeline, provide evidence with major public health implications to prepare for future pandemics15.

In a nutshell, our genomic surveillance with full-length sequences reveals four VOCs (alpha, beta, delta, and omicron) in Cameroon across four different epidemiological waves. SARS-CoV-2 infection in Cameroon has been driven by the viral lineages of origin in wave 1, the co-introduction of alpha and beta variants in wave 2, delta variant in wave 3 and omicron variant in wave 4, with an overall declining trend in the wave duration, confirmed cases, hospitalisations and CFR over time. While viral transmission was not dependent on viral clades, SARS-CoV-2 viral sub-lineage of origin (at the early phase) and Delta variant appeared to be the drivers of COVID-19 severity in Cameroon.

Methods

A laboratory-based survey was conducted within the framework of the national Public Health Emergencies Operations Centre (PHEOC) for COVID-19 in Cameroon, from March 1, 2020 to March 30, 2022, through an assessment of the evolutionary patterns of SARS-CoV-2 lineages across the four COVID-19 waves in the country.

Specimen collection and referral for SARS-CoV-2 genomic surveillance

The identification, packaging, storage, and transportation of positive COVID-19 nasopharyngeal samples from the testing sites to the reference laboratories were performed by staff trained in field epidemiology. For every sample positive on antigen rapid diagnostic test (RDT), a swab was collected on viral transport medium (VTM) and transported using a triple packaging with a cold chain (refrigerated cooler) from the testing site to the PCR reference laboratory for molecular testing. For collection sites far from a PCR reference laboratory, samples were stored at − 20 °C and transported within 2–7 days to the nearest PCR reference laboratory.

Nucleic acid extraction, amplification and detection of SARS-CoV-2

At the PCR reference laboratory, viral RNA was extracted from 140 µL nasopharyngeal swab using the QIAamp Viral RNA Mini Kit (Qiagen Inc, Valencia, CA, USA) as per manufacturer’s instructions. Amplification was performed using the DaAn gene detection kit for 2019-nCoV (https://en.daangene.com/uploads/file/detection-kit-for-2019-novel-coronavirus-2019-ncov-rna-pcr-fluorescence-probing.pdf). The protocol used probes targeting the open reading frame (ORF1ab) gene and the nucleocapsid (N) protein gene, with a lower limit of detection of 500 copies/mL and an amplification reaction of 45 cycles. Briefly, 03 µL of enzyme (solution B) and 05 µL of SARS-CoV-2 RNA were added into 17 µL of master-mix (solution A). The total (master mix and biological sample) was then placed into a thermocycler for reverse transcription (at 50 °C, 15 min); Taq pol activation (95 °C, 15 min); and finally amplification during 45 cycles (94 °C, 15 s and 55 °C, 45 s). RT-PCR results were interpreted as the presence of viral RNA for cycle threshold (CT) value ≤ 37 (i.e. PCR-positive) as per national guidelines.

The eligibility criteria for sequencing were as follows: a PCR-positive sample, a cycle threshold (CT) value < 30 for manual RT-PCR or equivalent, and a minimum volume of 200 μL of swab and/or 30 μL of viral RNA. Moreover, wherever necessary, eligible samples were stored at − 20 °C for a maximum of 30 days and shipped under a stable cold chain to the sequencing reference laboratory, along with a standard electronic metadata file.

Whole-genome sequencing of SARS-CoV-2

Sequencing was performed using the Illumina protocol for whole genome. Briefly, libraries were generated using the amplicons generated; indexed paired-end libraries were prepared using the Nextera DNA Flex Library Prep Kits (Illumina) as per the manufacturer’s instructions. Each tagged amplicon was and barcoded with a unique barcode using the Nextera CD Indexes. Libraries were purified and normalized to 4 nM prior to pooling, and the pool was denatured using 0.2 N sodium acetate and then diluted to a final concentration of 8 pM. The library was spiked with 1% PhiX Control v3, and the libraries were sequenced using a 500-cycle v2 Reagent Kit as per the manufacturer’s instructions (Illumina, San Diego, CA, USA). Fastq files produced from Illumina MiSeq were assembled using Genome Detective (https://www.genomedetective.com/) and the coronavirus typing tool, and were visualized for quality using FastQC. Following cleaning of sequences, short reads are sorted and placed into groups and metagenomic de novo assembly was performed. Each group of sequence was then identified; Blastx and Blastn are used to search for candidate reference sequences against the NCBI RefSeq virus database. The results for all detected contigs are combined by the Advanced Genome Aligner and scored using by Genome Detective at the amino acid and nucleotide level. The five best scoring references for each config are then used for the alignment. Data on full-length sequencing were consecutively entered into the GISAID platform, under the following sequence accession numbers (from “hCoV-19/Cameroon/Yaounde-20V-3870/2020” to “hCoV-19/Cameroon/ECO284/2021”). These data were downloaded, and the molecular phylogeny of the SARS-CoV-2 sequences was performed using Nexstrain (see Supplementary Digital contents—SDC1)15. The fasta sequences a GenBank repository under the following accession number GenBank OQ520884-OQ521579. The phylogenetic analysis, the Nextstrain pipeline (https://github.com/nextstrain/ncov) was used to generate the build. Sequencing data from the rest of the world were included for phylogenetic context based on genomic proximity and even sampling over time, however for the Cameroon build, we zoomed in on the country-specific sequences. To prepare data for the Cameroon-specific Nextstrain analyses we included the sequence and metadata files. The customized workflow is available at the following link (https://nextstrain.org/groups/cameroon-genomics/Cameroon-ncov-build?c=clade_membership&f_country=Cameroon). To visualize the results, we employed Auspice to work with the output .json file generated in the Nextstrain environment.

A graphical representation of the dynamics of different SARS-CoV-2 variants identified during the outbreak was created using Excel version 2004. The wave durations according to detected VoCs were graphically displayed, along with the related epidemiological information. This provided a global picture of the evolution of SARS-CoV-2 variants during the first to the fourth waves of the pandemic in Cameroon for possible predictions toward the control of future outbreaks. Based on our local experience, the interpretation of severity by wave was performed, as shown in Table 2.

Table 2 Grading assessment of epidemiological and clinical conditions per wave.

A wave was considered to be of short, medium, or long duration if it lasted for < 10, 10–20, or > 20 weeks, respectively. The number of confirmed cases was defined as low, moderate, or high if it was < 10,000, 10,000–20,000, > 20,000, respectively. The number of hospitalised cases was classified as low, moderate, or high if < 500, 500–1000, or > 1000, respectively. The number of deaths was interpreted as low, moderate, or high if < 100, 100–500, or > 500, respectively, and the CFR was categorised as low, moderate, or high if < 1, 1–2, or > 2, respectively.

The Mann–Whitney U test was used to calculate the correlation between the duration of each outbreak and the number of confirmed cases and between hospitalised cases and CFR, with a p value < 0.05 and a Z-score ≥ 2 considered statistically significant.

Ethical considerations

The present study was performed in accordance with the Declaration of Helsinki. Briefly, ethical clearance was obtained from the National Ethics Committee for research on human health (reference N°2022/01/1430/CE/CNERSH/SP/SP; N°2020/05/1224/CE/CNERSH/SP/SP), and administrative authorisation was provided by the Ministry of Public Health (N°368/NS/MINSANTE/SG/CCOUSP/CSO). Ethical clearance was also obtained from the Centres for Disease Control and Prevention. Each participant provided their informed consent and filled the case reporting form. Confidentiality was ensured by using fully anonymised data from a secured public database repository (GISAID) that was populated with metadata provided by the study team.