The unprecedented scale of the Ebola outbreak in West Africa has, as of 29 April 2015, resulted in more than 10,884 deaths among 26,277 cases. Prior to the ongoing outbreak, Ebola virus disease (EVD) caused relatively small outbreaks (maximum outbreak size 425 in Gulu, Uganda) in isolated populations in central Africa. Here, we have compiled a comprehensive database of estimates of epidemiological parameters based on data from past outbreaks, including the incubation period distribution, case fatality rate, basic reproduction number (R0), effective reproduction number (Rt) and delay distributions. We have compared these to parameter estimates from the ongoing outbreak in West Africa. The ongoing outbreak, because of its size, provides a unique opportunity to better understand transmission patterns of EVD. We have not performed a meta-analysis of the data, but rather summarize the estimates by virus from comprehensive investigations of EVD and Marburg outbreaks over the past 40 years. These estimates can be used to parameterize transmission models to improve understanding of initial spread of EVD outbreaks and to inform surveillance and control guidelines.
Machine-accessible metadata file describing the reported data (ISA-tab format)
Background & Summary
Ebola virus disease (EVD), formerly known as Ebola hemorrhagic fever, is caused by a zoonotic virus first discovered in 1976 in remote villages of Democratic Republic of Congo (DRC, formerly Zaire) and Sudan1,
The primary reservoir of the Ebola virus is believed to be fruit bats12,13. However, non-human primates, including chimpanzees, gorillas, and cynomolgus monkeys, and forest antelopes have been reported as possible vectors in transmission to humans14, and EVD has caused devastating mortality in non-human-primate populations15. Once infected, the symptoms of human EVD are non-specific and typically include fever, headache, joint or muscle pain, sore throat, vomiting, and/or diarrhea15,
While it has been difficult to trace the source of human outbreaks, it is believed that EVD outbreaks usually start from a zoonotic source with subsequent human-to-human transmission22,23. Transmission between humans occurs through exposure to infectious bodily fluids, typically from close contact with infectious individuals when caring for EVD patients (e.g., sharing of contaminated needles, family home care, insufficient protective measures among health care workers in health care settings6,24,25) or with fatal EVD patients in preparation for burial19,20. Control measures for EVD are well documented and include identification, isolation and care of suspected patients, strict infection prevention and control among those caring for patients and safe burials26,27.
At the start of an infectious disease outbreak, it is critical to understand the transmission dynamics of the pathogen and to determine those at highest risk for infection or severe outcomes in the population(s) affected28,29. This information is needed to develop interventions to reduce the spread of disease and to reduce morbidity and mortality in the affected populations. Real-time analysis of any ongoing outbreak by analyzing detailed information collected on the confirmed, probable and suspected cases and deaths provides an opportunity to determine the stages of disease and areas where control measures can be applied. For example, knowledge of the incubation period distribution of the pathogen will inform the duration of time required to follow up the contacts of cases to evaluate whether or not they become secondary cases. Additionally, information on the timing of symptom onset, isolation, hospitalization and outcome (either death or recovery) are important to understand EVD progression. Mathematical models which make use of available data early in an outbreak to estimate the outbreak’s potential impact are increasingly used by public health policy makers to inform decision making around emerging and re-emerging pathogens28,
The purpose of this review was to collect all published epidemiological parameter estimates (reprinted in detailed tables containing estimates, and corresponding confidence intervals) estimated from past EVD outbreaks. Our aim was not to perform a meta-analysis, but rather to compile and document the available parameter estimates based on data from EVD outbreaks over the past 40 years. In order to estimate any of the parameters referenced in our manuscript, we would need detailed case data of each of the cohorts studied in the original papers, which we do not have. We also reprint parameter estimates from past Marburg outbreaks and the ongoing outbreak in West Africa for comparison. This information is valuable for public health organizations that need to quickly evaluate the early behavior of a new outbreak and estimate the potential impact, in terms of morbidity, mortality and geographic spread. We highlight how the parameter estimates we have examined improve our understanding of EVD epidemiology. Our results help to put the ongoing EVD outbreak in West Africa into context and to assess the likely effects of ongoing and novel interventions.
All searches using the following search terms (Ebola, Marburg, EHF, EVD, MHF, EBOV, Ebola Zaire, Ebola Sudan, Ebola Reston, Ebola Bundibugyo, outbreak, model, parameterization, incubation period, case fatality rate, case fatality rate (CFR), risk factors, basic reproduction number, R0, effective reproduction number, serial interval, delay distributions, generation time) were carried out on 1 August 2014, 15 September 2014 and again in February 2015 using the following databases: ScienceDirect, ResearchGate, Google, GoogleScholar, BioOne, Web of Science and PubMed. Our searches aimed to find primary reports describing and analyzing data collected from investigations of EVD and Marburg outbreaks since the virus was identified in 1976. The criteria for inclusion were: sample size of EVD cases described in the study ≥5, studies of human outbreaks, studies which evaluated potential risk factors had to report prevalence proportion ratios, odds ratios or relative risks. Reviews, commentaries, case reports on individual cases, and policy pieces were excluded. Additionally, literature evaluating non-human outbreaks or the potential for international (human) spread of EVD outside of an outbreak zone was excluded.
Using these search terms, a total of 49 papers were determined eligible for inclusion. In addition, for context we included additional published information on EVD including the final outbreak sizes as reported by the World Health Organization (WHO) Disease Outbreak News following declaration that each outbreak was over.
From the relevant EVD and Marburg literature, we extracted the following details for all parameter estimates (as provided): point estimates, confidence intervals, ranges, sample size used to estimate the parameter (total numbers of cases encompassing confirmed, suspected, and retrospectively diagnosed cases, depending on the study), EVD virus, and inferential methods. We then compiled the parameter estimate database into tables. Table 1 and Data Citation 1: http://dx.doi.org/10.6084/m9.figshare.1381874 list the human outbreaks of Ebola Zaire, Ebola Sudan and Ebola Bundibugyo that have occurred in Africa from 1976 to present. We have not provided detailed information on the outbreaks as these have been previously described9. Table 2 (available online only) summarizes the literature we used in this review.
Our manuscript and tables include estimates, confidence intervals and ranges obtained from the referenced publications (Table 2 (available online only) and Data Citation 2: Figshare http://dx.doi.org/10.6084/m9.figshare.1381876).
Definition of key parameters recorded
The incubation period is the interval between exposure to a pathogen and initial occurrence of symptoms and signs28,29. The incubation period distribution is usually characterized using the mean or the median incubation period.
The CFR is the proportion of cases (infected symptomatic individuals) within a designated population who die as a result of their infection. For past EVD and Marburg outbreaks, we report on the CFR estimated after the outbreak was declared over (estimated at least 42 days after the last case experienced symptom onset) by taking the number of deaths among cases divided by the total number of cases recorded during the outbreak. However, during outbreaks, the CFR is often estimated before all cases have been identified and before some cases have either recovered or died.
Risk factors for infection include demographic factors, medical conditions and behavioral exposures or practices that are associated with an individual’s risk of becoming infected with Ebola.
The basic reproduction number (R0) is used to measure the transmission potential of a disease. It is the average number of secondary infections produced by an infected case in a susceptible population31. If R0 >1, then once established the outbreak will continue, whereas if R0<1, then the outbreak will die out.
The effective reproduction number (Rt) is similar to R0 but relates to a particular calendar time t (after the start of the outbreak). Like R0, if Rt>1, then the outbreak will continue, whereas if Rt<1, then the outbreak will die out. Rt can be reduced through the use of successful control measures (e.g., by limiting contacts between susceptible and infectious individuals). Rt can also be reduced due to the depletion of susceptible individuals whether through extensive transmission or through the immunization of susceptible individuals32.
The serial interval is the interval between symptom onset in an index case and symptom onset in a secondary case infected by that index case33.
The generation time is the interval between infection of an index case and infection of a secondary case infected by that index case. The serial interval is more frequently estimated than the generation time and is often assumed to be the same duration as the generation time34.
Symptom onset to hospitalization (also referred to as onset to clinical assessment): The interval between symptom onset and hospitalization.
Hospital admission to day of first blood sample: The interval between admission to hospital or medical facility for treatment of EVD and when a biological sample is collected for diagnosis.
Symptom onset to recovery/discharge: The interval between symptom onset and recovery or hospital discharge.
Symptom onset to death: The interval between symptom onset and death.
Duration of admission (survivors)—hospitalization to discharge: The interval between admission to a hospital or medical facility for treatment of EVD and discharge from the facility.
Duration of admission (fatal cases)—hospitalization to death: The interval between admission to a hospital or medical facility for treatment of EVD and death.
The data from this analysis are summarized in two types of data format. Four data tables detail the methods and parameter estimates from each study included in our review. Our data tables:
Table S1: Human Outbreaks of Ebola Zaire, Ebola Sudan and Ebola Bundibugyo from 1976 presents compiled data on the year and location of the each human outbreak, the Ebola Virus causing the outbreak and number of cases reported (Data Citation 1: http://dx.doi.org/10.6084/m9.figshare.1381874).
Table S2: Parameter Estimates by Outbreak presents a comprehensive list of parameter estimates, including incubation period distribution, reproduction number, serial interval distribution, generation time distribution, delay distributions, and CFR by Ebola virus and study (Data Citation 2: Figshare http://dx.doi.org/10.6084/m9.figshare.1381876).
Table S3: Parameter estimates for the ongoing EVD outbreak in West Africa presents published estimates of delay distributions and CFR for the ongoing outbreak in West Africa (Data Citation 3: Figshare http://dx.doi.org/10.6084/m9.figshare.1381877).
Using these four tables, we then summarized the parameter database in six tables and two figures presented in this article. The parameters estimated for Ebola Zaire, Ebola Sudan and Ebola Bundibugyo outbreaks, including the incubation period distribution, serial interval distribution, R0, delay distributions and CFR, are shown in Tables 3 (available online only), 4 (available online only), 5, respectively. Parameter estimates for the ongoing outbreak in West Africa are summarized in Table 6 (available online only) and for Marburg outbreaks are presented in a single table (Table 7). Risk factors for Ebola and Marburg infection are summarized in Table 8 (available online only). Estimates of the incubation period distribution and CFR are presented in Figs 1 and 2, respectively.
Incubation period distribution
The incubation period distribution of EVD has been estimated for past EVD outbreaks (Fig. 1 and Tables 3 (available online only), 4 (available online only), 5; minimum sample size n=5, maximum sample size n=1,798). The mean (or median) incubation period (Fig. 1), for the different Ebola viruses ranged from 3.35 to 12.7 days (range 1–21 days), excluding an extreme outlier35. Central estimates for the incubation period distribution were between 5.3–12.7 days (range 1–21 days) for Ebola Zaire
The mean incubation period for the ongoing Ebola outbreak in West Africa has been estimated to be between 9–12 days (Table 6 (available online only))16,17,41,48. The range of incubation periods observed in past EVD outbreaks supports the policy of contact tracing for 21 days following contact with an EVD patient. An outbreak is officially declared over after no new cases are identified 42 days (2 times the 21-day maximum incubation period) after the last EVD case is found.
Case fatality rate (CFR)
In Fig. 2 and Tables 3 (available online only), 4 (available online only), 5, we reprint the estimated CFR for each Ebola outbreak (by virus) and for Marburg virus. The Ebola Zaire virus is the most lethal with an overall estimated CFR ranging from 69 to 88%2,5,25,38,43,49,50 (Table 3 (available online only)). The CFR of outbreaks due to Ebola Sudan virus ranged from 53 to 69%1,24,51,
R0 and Rt
Looking at past outbreaks, estimates of R0 for Ebola Zaire ranged from 1.4-4.735,42,44,58,
In the ongoing outbreak in West Africa, estimates of R0 and Rt have been estimated for all countries combined, as well as separately for Guinea, Liberia, Nigeria and Sierra Leone16,30,36,41,48,54,61,
Several groups have also estimated R0 for specific geographic areas within the region (full details in Table 6 (available online only) and Data Citation 2: Figshare http://dx.doi.org/10.6084/m9.figshare.1381876). For example, Faye et al. estimated R0 for cases occurring in Conakry, Guinea at the start of the outbreak (n=193)41 as 1.7 (95% CI 1.2, 2.3); whereas Lewnard et al. 64 estimated R0 for EVD cases in Montserrado, Liberia as of October 2014 (R0=2.5 (2.4, 2.6)).
Serial interval distribution
The serial interval, defined as the time interval between symptom onset in an index case and symptom onset in a secondary case infected by that index case, has been infrequently estimated due to the paucity of data on epidemiologically linked pairs of index and secondary cases. For Ebola Zaire (Table 3 (available online only)), the mean serial interval was estimated to be 10–16.1 days5,49,60,70. In the ongoing outbreak in West Africa, the mean serial interval has been estimated to be approximately 14–15 days16,17,30,41 (Table 6 (available online only)).
Generation time distribution
Closely related to the serial interval, the generation time is defined as the time interval between infection of an index case and infection of a secondary case infected by that index case. As such, the generation time distribution nearly always needs to be inferred indirectly from serial interval observations and knowledge of the incubation period distribution. We found one such estimate of the mean generation time for Marburg of 9 days (95% CI 8.2, 10.0)55.
For Ebola Zaire, including the ongoing outbreak, the mean time from symptom onset to hospitalization (Table 3 (available online only) and Table 6 (available online only)), ranged from 3.2 to 5.3 days5,16,17,20,38,41,48, whereas the mean time from symptom onset to death ranged from 6 to 10.1 days5,17,25,37,
Risk for developing EVD
Risk factors for human-to-human transmission of EVD or Marburg were evaluated from comparison of the exposures, behaviors and practices in cases compared to controls (including unaffected controls, defined to be suspected cases but negative serologic test results) and were described using a prevalence proportion ratio, an odds ratio or a relative risk (and the corresponding confidence interval). Significant risk factors associated with developing EVD are reported in Table 8 (available online only) and Data Citation 4: Figshare http://dx.doi.org/10.6084/m9.figshare.1381875 and include direct physical contact (sharing a bed, touching a cadaver or funeral preparations for an EVD patient, nursing care and contact with bodily fluids) and non-physical contact (sharing a meal, contact with a hospital where EVD patients were treated)24,39,45,47,56,71.
The data presented in this review summarize estimates of the epidemiological parameters of EVD and Marburg. These results can facilitate parameterization and sensitivity analysis of transmission models examining surveillance, control and treatment strategies. The results can also inform epidemiological studies investigating human-to-human transmission during Ebola and Marburg outbreaks, deepening our understanding of the transmission process.
The number of parameters estimated for each outbreak has generally increased with time (Table 2 (available online only)). While the incubation period distribution was consistently assessed, R0 has increasingly been estimated, most notably with the ongoing outbreak in West Africa (Table 6 (available online only) and Data Citation 2: Figshare http://dx.doi.org/10.6084/m9.figshare.1381876). Fig. 1 shows the central estimates and ranges for the different studies that estimated the incubation period distribution for EVD outbreaks. While there are small differences in the central estimates of the incubation period distribution of the four Ebola viruses, the ranges around the mean or median are consistent, with a maximum of ≤21 days. Current EVD guidance states that EVD has an incubation period of 2–21 days, which is the basis for the recommended duration of contact tracing of 21 days26,27. This is supported by the findings in our review.
Figure 2 shows CFR for different Ebola Zaire, Ebola Sudan, Ebola Bundibugyo and Marburg outbreaks. While the CFR for Ebola and Marburg are high (compared to other infectious diseases), outbreaks caused by Ebola Zaire and Ebola Sudan have experienced the highest CFR amongst these three Ebola viruses causing outbreaks in humans1,2,5,19,24,25,38,43,46,47,50,
Figure 2 also illustrates that recent CFR estimates for Ebola Zaire remain comparable to those observed in the 1970s. While there are ongoing efforts to develop medical treatments for EVD, treatment remains mainly supportive. The massive scale of the ongoing outbreak has highlighted the urgent need to develop new treatments and to fast track the use of experimental medical interventions77.
The transmission potential, as measured by R0, is fairly consistent among the three Ebola viruses, ranging from approximately >1 to 4 (also mentioned in ref. 78). Previously, EVD typically affected villages in remote areas of central Africa35,38,42,44,58,59, and while devastating in these areas, the populations that are at risk are generally limited in number. The ongoing EVD outbreak had circulated for at least three months prior to discovery22,79 which allowed spread of the virus to go unchecked while it infected people in an area of Guinea that shares borders with Sierra Leone and Liberia. Recent experience in Nigeria, has shown that an Ebola virus with R0>1, even in a population of over 20 million, can be controlled with vigorous application of control methods49,54.
Differences in estimates of R0 and Rt are likely, at least in part, to be the result of the quality of data available and the inferential method. The focus on R0 estimation together with serial interval estimates may reflect a shift from data collection purely for surveillance to recognition of the epidemiological value of such data.
The specific factors that result in an EVD outbreak have been under investigation since the emergence of this virus and include examination of human and susceptible non-human populations living in close proximity with each other in remote areas of central Africa. Recent investigations into the first cases of the ongoing outbreak found that the outbreak may have begun in Meliandou, Guinea in a village where the inhabitants frequently came in contact with fruit bats in a hollowed out tree80. Although the current focus is on limiting human-to-human transmission and treating the infected, the challenging underlying factors that led to this large outbreak in West Africa will require long-term investments to improve both health care and surveillance for infectious diseases.
Our dataset is the most complete collection of published epidemiological parameter estimates from EVD outbreaks available at the time of writing and provides an evidence-based foundation for both retrospective analyses and responses to future outbreaks.
How to cite this article: Van Kerkhove, M. D. et al. A review of epidemiological parameters from Ebola outbreaks to inform early public health decision-making. Sci. Data 2:150019 doi: 10.1038/sdata.2015.19 (2015).
Van Kerkhove, M., Bento, A. I., Mills, H. L., Ferguson, N. M., & Donnelly, C. A. Figshare http://dx.doi.org/10.6084/m9.figshare.1381875 (2015)
The authors would like to acknowledge the Medical Research Council, the Bill and Melinda Gates Foundation, the Wellcome Trust, the Health Protection Research Units of the National Institute for Health Research, and the European Union Seventh Framework Programme [FP7/2007–2013] under agreement Grant Agreement nu278433-PREDEMICS for funding.