The global distribution and burden of dengue

Journal name: Nature
Volume: 496
Pages: 504–507
DOI: doi:10.1038/nature12060

Dengue is a systemic viral infection transmitted between humans by Aedes mosquitoes1. For some patients, dengue is a life-threatening illness2. There are currently no licensed vaccines or specific therapeutics, and substantial vector control efforts have not stopped its rapid emergence and global spread3. The contemporary worldwide distribution of the risk of dengue virus infection4 and its public health burden are poorly known2, 5. Here we undertake an exhaustive assembly of known records of dengue occurrence worldwide, and use a formal modelling framework to map the global distribution of dengue risk. We then pair the resulting risk map with detailed longitudinal information from dengue cohort studies and population surfaces to infer the public health burden of dengue in 2010. We predict dengue to be ubiquitous throughout the tropics, with local spatial variations in risk influenced strongly by rainfall, temperature and the degree of urbanization. Using cartographic approaches, we estimate there to be 390 million (95% credible interval 284–528) dengue infections per year, of which 96 million (67–136) manifest apparently (any level of disease severity). This infection total is more than three times the dengue burden estimate of the World Health Organization2. Stratification of our estimates by country allows comparison with national dengue reporting, after taking into account the probability of an apparent infection being formally reported. The most notable differences are discussed. These new risk maps and infection estimates provide novel insights into the global, regional and national public health burden imposed by dengue. We anticipate that they will provide a starting point for a wider discussion about the global impact of this disease and will help to guide improvements in disease control strategies using vaccine, drug and vector control methods, and in their economic evaluation.

Main

Dengue is an acute systemic viral disease that has established itself globally in both endemic and epidemic transmission cycles. Dengue virus infection in humans is often inapparent1, 6 but can lead to a wide range of clinical manifestations, from mild fever to potentially fatal dengue shock syndrome2. The lifelong immunity developed after infection with one of the four virus types is type-specific1, and progression to more serious disease is frequently, but not exclusively, associated with secondary infection by heterologous types2, 5. No effective antiviral agents yet exist to treat dengue infection and treatment therefore remains supportive2. Furthermore, no licensed vaccine against dengue infection is available, and the most advanced dengue vaccine candidate did not meet expectations in a recent large trial7, 8. Current efforts to curb dengue transmission focus on the vector, using combinations of chemical and biological targeting of Aedes mosquitoes and management of breeding sites2. These control efforts have failed to stem the increasing incidence of dengue fever epidemics and expansion of the geographical range of endemic transmission9. Although the historical expansion of this disease is well documented, the potentially large burden of ill-health attributable to dengue across much of the tropical and subtropical world remains poorly enumerated.

Knowledge of the geographical distribution and burden of dengue is essential for understanding its contribution to global morbidity and mortality burdens, in determining how to allocate optimally the limited resources available for dengue control, and in evaluating the impact of such activities internationally. Additionally, estimates of both apparent and inapparent infection distributions form a key requirement for assessing clinical surveillance and for scoping reliably future vaccine demand and delivery strategies. Previous maps of dengue risk have used various approaches combining historical occurrence records and expert opinion to demarcate areas at endemic risk10, 11, 12. More sophisticated risk-mapping techniques have also been implemented13, 14, but the empirical evidence base has since been improved, alongside advances in disease modelling approaches. Furthermore, no studies have used a continuous global risk map as the foundation for dengue burden estimation.

The first global estimates of total dengue virus infections were based on an assumed constant annual infection rate among a crude approximation of the population at risk (10% in 1 billion (ref. 5) or 4% in 2 billion (ref. 15)), yielding figures of 80–100 million infections per year worldwide in 1988 (refs 5, 15). As more information was collated on the ratio of dengue haemorrhagic fever to dengue fever cases, and the ratio of deaths to dengue haemorrhagic fever cases, the global figure was revised to 50–100 million infections16, 17, although larger estimates of 100–200 million have also been made10 (Fig. 1). These estimates were intended solely as approximations but, in the absence of better evidence, the resulting figure of 50–100 million infections per year is widely cited and currently used by the World Health Organization (WHO). As the methods used were informal, these estimates were presented without confidence intervals, and no attempt was made to assess geographical or temporal variation in incidence or the inapparent infection reservoir.
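The arithmetic behind these early figures is easy to reproduce. The sketch below is only an illustration: the function name is ours, and the inputs are the infection rates and populations at risk quoted above.

```python
def constant_rate_estimate(population_at_risk, annual_infection_rate):
    """Infections per year under a single, constant annual infection rate."""
    return population_at_risk * annual_infection_rate

# Inputs as quoted in the text (refs 5 and 15).
estimate_ref5 = constant_rate_estimate(1_000_000_000, 0.10)
estimate_ref15 = constant_rate_estimate(2_000_000_000, 0.04)

print(f"ref. 5:  {estimate_ref5 / 1e6:.0f} million infections per year")
print(f"ref. 15: {estimate_ref15 / 1e6:.0f} million infections per year")
```

Because a single rate is applied to a single population total, an estimate of this kind carries no information about geographical variation or uncertainty.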

Figure 1: Global estimates of total dengue infections.

Comparison of previous estimates of total global dengue infections in individuals of all ages, 1985–2010. Black triangle, ref. 5; dark blue triangle, ref. 15; green triangle, ref. 17; orange triangle, ref. 16; light blue triangle, ref. 30; pink triangle, ref. 10; red triangle, apparent infections from this study. Estimates are aligned to the year of estimate and, if not stated, aligned to the publication date. Red shading marks the credible interval of our current estimate, for comparison. Error bars from ref. 10 and ref. 16 replicated the confidence intervals provided in these publications.

Here we present the outcome of a new project to derive an evidence-based map of dengue risk and estimates of apparent and inapparent infections worldwide on the basis of the global population in 2010. We compiled a database of 8,309 geo-located records of dengue occurrence from a systematic search, drawn from 2,838 published literature sources as well as newer online resources18 (see Supplementary Information, section A; the full bibliography4 and occurrence data are available from the authors on request). Using these occurrence records we: chose a set of gridded environmental and socioeconomic covariates known, or proposed, to affect dengue transmission (see Supplementary Information, section B); incorporated recent work assessing the strength of evidence on national and subnational-level dengue present/absent status4 (Fig. 2a); and built a boosted regression tree (BRT) statistical model of dengue risk that addressed the limitations of previous risk maps (see Supplementary Information, section C) to define the probability of occurrence of dengue infection (dengue risk) within each 5 km × 5 km pixel globally (Fig. 2b). The model was run 336 times to reflect parameter uncertainty and an ensemble mean map was created (see Supplementary Information, section C). We then combined this ensemble map with detailed longitudinal information on dengue infection incidence from cohort studies and built a non-parametric Bayesian hierarchical model to describe the relationship between dengue risk and incidence (see Supplementary Information, section D). Finally, we used the estimated relationship to predict the number of apparent and inapparent dengue infections in 2010 (see Supplementary Information, section E). Our definition of an apparent infection is consistent with that used by the cohort studies: an infection with sufficient severity to modify a person’s regular schedule, such as attending school. This definition encompasses any level of severity of the disease.

Figure 2: Global evidence consensus, risk and burden of dengue in 2010.

a, National and subnational evidence consensus on complete absence (green) through to complete presence (red) of dengue4. b, Probability of dengue occurrence at 5 km × 5 km spatial resolution of the mean predicted map (area under the receiver operating characteristic curve of 0.81 (±0.02 s.d., n = 336)) from 336 boosted regression tree models. Areas with a high probability of dengue occurrence are shown in red and areas with a low probability in green. c, Cartogram of the annual number of infections for all ages as a proportion of national or subnational (China) geographical area.

We predict that dengue transmission is ubiquitous throughout the tropics, with the highest risk zones in the Americas and Asia (Fig. 2b). Validation statistics indicated high predictive performance of the BRT ensemble mean map, with an area under the receiver operating characteristic curve (AUC) of 0.81 (±0.02 s.d., n = 336) (see Supplementary Information, section C). Predicted risk in Africa, although more unevenly distributed than in other tropical endemic regions, is much more widespread than suggested previously. Africa has the poorest record of occurrence data and, as such, more information from this continent would help to better define the spatial distribution of dengue within it and to improve its derivative burden estimates. Among the variables considered, high levels of precipitation and temperature suitability for dengue transmission were most strongly associated with elevated dengue risk, although low precipitation was not found to limit transmission strongly (see Supplementary Information, section C). Proximity to low-income urban and peri-urban centres was also linked to greater risk, particularly in highly connected areas, indicating that human movement between population centres is an important facilitator of dengue spread. These associations have previously been cited9, but have not been demonstrated at the global scale; they highlight the importance of including socioeconomic covariates when assessing dengue risk.
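For readers unfamiliar with the AUC statistic quoted above, the following sketch computes it from its pairwise definition: the probability that a randomly chosen presence record receives a higher predicted score than a randomly chosen absence record. It is a generic illustration with invented toy data, not the validation procedure of Supplementary Information, section C; the function and variable names are ours.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form).

    labels: 1 for presence, 0 for (pseudo-)absence.
    scores: predicted probability of occurrence for each record.
    A tie between a presence score and an absence score counts as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one presence and one absence")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Toy data, invented for illustration only.
toy_labels = [1, 1, 0, 0]
toy_scores = [0.9, 0.4, 0.6, 0.1]
toy_auc = auc(toy_labels, toy_scores)
print(toy_auc)
```

A value of 0.5 corresponds to a model that ranks presences and absences no better than chance, and 1.0 to perfect ranking.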

We estimate that there were 96 million apparent dengue infections globally in 2010 (Table 1). Asia bore 70% (67 (47–94) million infections) of this burden, and is characterized by large swathes of densely populated regions coinciding with very high suitability for disease transmission. India19, 20 alone contributed 34% (33 (24–44) million infections) of the global total. The disproportionate infection burden borne by Asian countries is emphasized in the cartogram shown in Fig. 2c. The Americas contributed 14% (13 (9–18) million infections) of apparent infections worldwide, of which over half occurred in Brazil and Mexico. Our results indicate that Africa’s dengue burden is nearly equivalent to that of the Americas (16 (11–22) million infections, or 16% of the global total), representing a significantly larger burden than previously estimated. This disparity supports the notion of a largely hidden African dengue burden, being masked by symptomatically similar illnesses, under-reporting and highly variable treatment-seeking behaviour6, 9, 20. The countries of Oceania contributed less than 0.2% of global apparent infections.

Table 1: Estimated burden of dengue in 2010, by continent

We estimate that an additional 294 (217–392) million inapparent infections occurred worldwide in 2010. These mild or asymptomatic infections are not detected by the public health surveillance system and have no immediate implications for clinical management6. However, the presence of this huge potential reservoir of infection has profound implications for: (1) correctly enumerating economic impact (for example, how many vaccinations are needed to avert an apparent infection) and triangulating with independent assessments of disability adjusted life years (DALYs)21; (2) elucidating the population dynamics of dengue viruses22; and (3) making hypotheses about population effects of future vaccine programmes23 (volume, targeting efficacy, impacts in combination with vector control), which will need to be administered to maximize cross-protection and minimize post-vaccination susceptibility.
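The headline totals can be cross-checked from the figures quoted in the text. The sketch below does only that; the variable names are ours. Because the inputs are already rounded to whole millions, recomputed shares can differ from the published percentages by about one point.

```python
# Infections in 2010, in millions, as quoted in the text.
apparent_global = 96
apparent_by_region = {"Asia": 67, "Africa": 16, "Americas": 13}
inapparent_global = 294

total_infections = apparent_global + inapparent_global
inapparent_per_apparent = inapparent_global / apparent_global
shares = {
    region: 100 * millions / apparent_global
    for region, millions in apparent_by_region.items()
}

print(f"total infections: {total_infections} million")
print(f"inapparent infections per apparent infection: {inapparent_per_apparent:.1f}")
for region, share in shares.items():
    print(f"{region}: {share:.1f}% of apparent infections")
```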

The absolute uncertainties in the national burden estimates are inevitably a function of population size, with the greatest uncertainties in India, Indonesia, Brazil and China (see full rankings in Supplementary Table 4). In addition, comparing the ratio of the mean to the width of the confidence interval24 revealed the greatest contributors to relative uncertainty (see full rankings in Supplementary Table 4). These were countries with sparse occurrence points and low evidence consensus on dengue presence, such as Afghanistan or Rwanda (see Fig. 2a), or those with ubiquitous high risk, such as Singapore or Djibouti, for which our burden prediction confidence interval is at its widest (see Supplementary Information, section D, Fig. 2). Therefore, increasing evidence consensus and occurrence data availability in low consensus countries and assembling new cohort studies, particularly in areas of high transmission, will reduce uncertainty in future burden estimates. Our approach, uniquely, provides new evidence to help maximize the value and cost-effectiveness of surveillance efforts, by indicating where limited resources can be targeted to have their maximum possible impact in improving our knowledge of the global burden and distribution of dengue.
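As a minimal sketch of the relative-uncertainty measure described above, the code below computes the ratio of the mean to the interval width. The national rankings are in Supplementary Table 4 and are not reproduced here; instead the measure is applied to the regional and Indian estimates quoted in the text (in millions). The function name is ours.

```python
def mean_to_width_ratio(mean, lower, upper):
    """Ratio of a point estimate to the width of its interval.

    Smaller values indicate greater relative uncertainty.
    """
    width = upper - lower
    if width <= 0:
        raise ValueError("upper bound must exceed lower bound")
    return mean / width

# (mean, lower, upper) apparent infections in millions, as quoted in the text.
estimates = {
    "Asia": (67, 47, 94),
    "Americas": (13, 9, 18),
    "Africa": (16, 11, 22),
    "India": (33, 24, 44),
}
ratios = {name: mean_to_width_ratio(*values) for name, values in estimates.items()}

for name, ratio in sorted(ratios.items(), key=lambda item: item[1]):
    print(f"{name}: {ratio:.2f}")
```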

Our estimates of total infection burden (apparent and inapparent) are more than three times higher than the WHO predicted figure (Supplementary Information, section E). Our definition of an apparent infection is broad, encompassing any disruption to the daily routine of the infected individual, and consequently is an inclusive measurement of the total population affected adversely by the disease. Within this broad class, the severity of symptoms will affect treatment-seeking behaviours and the probability of a correct diagnosis in response to a given infection. Our definition is therefore more comprehensive than those of traditional surveillance systems which, even in the most efficient system, report a much narrower range of dengue infections. By reviewing our database of longitudinal cohort studies, in which total infections in the community were documented exhaustively, we find that the biggest source of disparity between actual and reported infection numbers is the low proportion of individuals with apparent infections seeking care from formal health facilities (see Supplementary Information, section E, Fig. 5 for full analysis). Additional biases are introduced by misdiagnosis and the systematic failure of health management information systems to capture and report presenting dengue cases. By extracting the average magnitude of each of these sequential disparities from published cohort and clinical studies, we can recreate a hypothetical reporting chain with idealized reporting and arrive at estimates that are broadly comparable to those that countries reported to the WHO. This is clearest in regions with more reliable reporting, such as the Americas. Systemic under-reporting and low hospitalization rates have important implications, for example, in the evaluation of vaccine efficacy based on reduced hospitalized caseloads. Inferences about these biases may be made from the comparison of estimated versus reported infection burdens in 2010, highlighting areas where particularly poor reporting might be strengthened (see Supplementary Information, section E).
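The idea of a sequential reporting chain can be sketched as a product of stage-wise probabilities. The stage names below follow the sources of disparity listed above, but every probability is a placeholder chosen for illustration; none is taken from the paper or its Supplementary Information, and the function name is ours.

```python
def expected_reported_cases(apparent_infections, stage_probabilities):
    """Apply a chain of sequential retention probabilities to apparent infections."""
    remaining = float(apparent_infections)
    for stage, probability in stage_probabilities:
        if not 0.0 <= probability <= 1.0:
            raise ValueError(f"{stage}: probability must lie in [0, 1]")
        remaining *= probability
    return remaining

# PLACEHOLDER values for illustration only, not estimates from the study.
hypothetical_chain = [
    ("seeks care at a formal health facility", 0.5),
    ("correctly diagnosed as dengue", 0.5),
    ("captured and reported by the information system", 0.5),
]
reported = expected_reported_cases(1_000_000, hypothetical_chain)
print(f"{reported:.0f} of 1,000,000 apparent infections would be reported")
```

Because the stages multiply, a modest loss at each step compounds into a large gap between apparent infections and officially reported cases.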

We have strived to be exhaustive in the assembly of contemporary data on dengue occurrence and clinical incidence and have applied new modelling approaches to maximize the predictive power of these data. It remains the case, however, that the empirical evidence base for global dengue risk is more limited than that available, for example, for Plasmodium falciparum25 and Plasmodium vivax26 malaria. Records of disease occurrence carry less information than those of prevalence and, as databases of the latter become more widespread, future approaches should focus on assessing relationships between seroprevalence and clinical incidence as a means of assessing risk27. Additional cartographic refinements are also required to help differentiate endemic- from epidemic-prone areas, to determine the geographic diversity of dengue virus types and to predict the distributions of future risk under scenarios of socioeconomic and environmental change.

The global burden of dengue is formidable and represents a growing challenge to public health officials and policymakers. Success in tackling this growing global threat is, in part, contingent on strengthening the evidence base on which control planning decisions and their impact are evaluated. It is hoped that this evaluation of contemporary dengue risk distribution and burden will help to advance that goal.

Methods

Assembly of the occurrence database and its quality control

Occurrence data comprised point or polygon locations of confirmed dengue infection presence derived from both peer-reviewed literature and HealthMap alerts18, 31 (see Supplementary Information, section A). An occurrence was defined as one or more laboratory or clinically confirmed infection(s) of dengue occurring at a unique location (a 5 km × 5 km pixel) within one calendar year. All occurrence data underwent manual review and automatic quality control to ensure information fidelity and precise geo-positioning. In total, 9,648 and 1,622 occurrence locations were obtained from literature searches and HealthMap, respectively. After the quality control procedures, our final data set contained 8,309 occurrence locations (5,216 point locations and 3,093 small polygon centroids) spanning a period from 1960 to 2012. We assumed that any record of dengue occurrence, regardless of its age, represented an environment permissive for the disease, as dengue has expanded from a focal disease in Asia to a cosmopolitan disease of the tropics.
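The occurrence definition (one or more confirmed infections in a unique pixel within one calendar year) amounts to collapsing case records to unique pixel-year pairs. The sketch below illustrates this under simplifying assumptions: coordinates are planar and in kilometres, whereas a real workflow would index a global grid, and all names and records are invented.

```python
import math

PIXEL_KM = 5.0  # pixel edge length quoted in the text

def pixel_of(x_km, y_km, pixel_km=PIXEL_KM):
    """Index of the square pixel containing a planar location."""
    return (math.floor(x_km / pixel_km), math.floor(y_km / pixel_km))

def unique_occurrences(records):
    """Collapse (x_km, y_km, year) case records to unique (pixel, year) pairs."""
    return sorted({(pixel_of(x, y), year) for x, y, year in records})

# Invented records for illustration.
toy_records = [
    (1.0, 1.0, 2005),  # pixel (0, 0)
    (4.9, 0.2, 2005),  # same pixel and year: same occurrence
    (1.0, 1.0, 2006),  # same pixel, later year: new occurrence
    (5.1, 1.0, 2005),  # neighbouring pixel (1, 0): new occurrence
]
occurrences = unique_occurrences(toy_records)
print(len(occurrences), "occurrences from", len(toy_records), "records")
```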

Explanatory covariates

We assembled gridded global data for a suite of eight explanatory covariates. The covariates were chosen based on factors known or hypothesized to contribute to suitability for dengue transmission (see Supplementary Information, section B). These covariates included: (1) annual maximum and minimum precipitation variables from a Fourier-processed32 synoptic annual series interpolated from global meteorological stations33; (2) a biological model combining the effects of temperature on the extrinsic incubation period of dengue virus and lifespan of the Aedes aegypti vector to quantify the dengue-specific temperature suitability for transmission28, 34, 35; (3) Fourier-processed annual average normalized difference vegetation index36; (4) categorical demarcations of urban and peri-urban areas37; (5) an urban accessibility metric defining the travel time to the nearest city of 50,000 people or more by land- or water-based travel38; and (6) an indicator of relative poverty derived from the finest geographic scale data available for economic productivity and adjusted for purchasing power parity39. No covariate grids were shown to be adversely affected by multicollinearity (see Supplementary Information, section B), and all grids were standardized to ensure identical spatial resolution, extent and boundaries. For point records, covariate values corresponded to the pixel value containing the location of the point. For polygon occurrence records, covariate values were averaged across the whole polygon.
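The two extraction rules for point and polygon records can be sketched as follows. This is a toy illustration: the grid values are invented, the function names are ours, and the polygon is assumed to have been rasterized into a list of pixel indices beforehand.

```python
def point_covariate(grid, row, col):
    """Covariate value for a point record: the value of the pixel containing it."""
    return grid[row][col]

def polygon_covariate(grid, cells):
    """Covariate value for a polygon record: mean over the pixels it covers.

    cells is a list of (row, col) indices; rasterizing the polygon into
    cells is assumed to have been done elsewhere.
    """
    if not cells:
        raise ValueError("polygon covers no pixels")
    return sum(grid[r][c] for r, c in cells) / len(cells)

# Toy 3 x 3 covariate grid with invented values.
toy_grid = [
    [10.0, 20.0, 30.0],
    [40.0, 50.0, 60.0],
    [70.0, 80.0, 90.0],
]
point_example = point_covariate(toy_grid, 1, 2)
polygon_example = polygon_covariate(toy_grid, [(0, 0), (0, 1), (1, 0), (1, 1)])
print(point_example, polygon_example)
```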

Predicting the probability of occurrence (risk) of dengue transmission

We used a boosted regression tree (BRT) approach to establish a multivariate empirical relationship between the probability of occurrence of a dengue virus infection and the environmental conditions sampled at each site from the covariate suite. The BRT method has been shown to fit complicated response functions efficiently, while guarding against overfitting, and is therefore widely used for vector and disease distribution mapping40, 41. The BRT approach combines regression trees42 with gradient boosting43, whereby an initial regression tree is fitted and iteratively improved upon in a forward stage-wise manner (boosting) by minimizing the variation in the response not explained by the model at each iteration (see Supplementary Information, section C).
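To make the forward stage-wise idea concrete, the sketch below boosts single-split regression trees (stumps) on a one-dimensional toy problem with squared-error loss. It is a teaching illustration under our own assumptions, not the models fitted in this study: the loss, tree depth, learning rate, number of trees and all names are ours, and the settings actually used are described in Supplementary Information, section C.

```python
def fit_stump(xs, residuals):
    """Best single-split regression tree (stump) under squared error."""
    best = None
    thresholds = sorted(set(xs))
    for lo, hi in zip(thresholds, thresholds[1:]):
        t = (lo + hi) / 2.0
        left = [r for x, r in zip(xs, residuals) if x <= t]
        right = [r for x, r in zip(xs, residuals) if x > t]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((r - left_mean) ** 2 for r in left)
               + sum((r - right_mean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, t, left_mean, right_mean)
    if best is None:
        raise ValueError("need at least two distinct x values")
    return best[1:]

def stump_predict(stump, x):
    t, left_mean, right_mean = stump
    return left_mean if x <= t else right_mean

def fit_boosted_stumps(xs, ys, n_trees=50, learning_rate=0.1):
    """Start from the mean, then repeatedly fit a stump to the current residuals."""
    base = sum(ys) / len(ys)
    predictions = [base] * len(ys)
    stumps = []
    for _ in range(n_trees):
        residuals = [y - p for y, p in zip(ys, predictions)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        predictions = [p + learning_rate * stump_predict(stump, x)
                       for x, p in zip(xs, predictions)]
    return base, learning_rate, stumps

def boosted_predict(model, x):
    base, learning_rate, stumps = model
    return base + learning_rate * sum(stump_predict(s, x) for s in stumps)

def mean_squared_error(model, xs, ys):
    return sum((y - boosted_predict(model, x)) ** 2 for x, y in zip(xs, ys)) / len(ys)

# Toy step-shaped response, invented for illustration.
toy_x = [float(i) for i in range(10)]
toy_y = [0.0 if x < 5 else 1.0 for x in toy_x]
constant_model = fit_boosted_stumps(toy_x, toy_y, n_trees=0)
boosted_model = fit_boosted_stumps(toy_x, toy_y, n_trees=50)
print(mean_squared_error(constant_model, toy_x, toy_y),
      mean_squared_error(boosted_model, toy_x, toy_y))
```

The small learning rate means each tree corrects only a fraction of the remaining error, which is one of the ways boosting limits overfitting.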

Like other niche mapping approaches, the BRT models require not only presence data but also absence data defining areas of disease absence and potentially unsuitable environmental conditions at unsampled locations. Because data on absence of disease are not definitive, pseudo-absence data were used to estimate areas of disease absence instead. No consensus approach has been developed to optimize the generation of pseudo-absence data and we therefore created an evidence-based probabilistic framework for generating pseudo-absences, incorporating the main biasing factors in pseudo-absence generation, namely: (1) geographical extent; (2) number; (3) contamination bias; and (4) sampling bias. To represent areas of absence, n_a pseudo-absence points29, 44, 45 were randomly generated based on dengue presence or absence certainty measures at a national or subnational level4. Pseudo-absence locations were restricted to a maximum distance μ from any recorded presence site46, 47. Additionally, to compensate for ‘contamination’ of true but unobserved presences within the generated pseudo-absences48, n_p pseudo-presence points were generated using the same procedure used to generate the pseudo-absences. Variation in the parameter set π = {μ, n_a, n_p} resulted in independent samples of the possible states of the real distribution, with all parameter combinations representing a null distribution of possible states. Therefore, rather than using an individual parameter combination from π, we created an ensemble49 of 336 BRT models spanning reasonable ranges in π and evaluated the central tendency as the mean across all 336 BRT models (see Supplementary Information, section C). The final ensemble BRT model was used to predict a global map of the probability of occurrence of dengue virus infection at a 5 km × 5 km resolution.
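Two ingredients of this procedure lend themselves to a short sketch: drawing pseudo-absences within a maximum distance μ of any presence site, and averaging predictions across a grid of parameter combinations. The code below is a simplified illustration under our own assumptions. It uses planar coordinates, samples uniformly rather than weighting by evidence consensus, and uses a small hypothetical parameter grid; the ranges that yield the 336 models of this study are given in Supplementary Information, section C, and are not reproduced here. All names are ours.

```python
import itertools
import math
import random

def sample_pseudo_absences(presences, n_points, max_distance, bounds, rng):
    """Rejection-sample points lying within max_distance of any presence site."""
    (x_min, x_max), (y_min, y_max) = bounds
    points = []
    attempts = 0
    while len(points) < n_points:
        attempts += 1
        if attempts > 10_000 * max(n_points, 1):
            raise RuntimeError("too few candidate locations within max_distance")
        x = rng.uniform(x_min, x_max)
        y = rng.uniform(y_min, y_max)
        if any(math.hypot(x - px, y - py) <= max_distance for px, py in presences):
            points.append((x, y))
    return points

def ensemble_mean(prediction_maps):
    """Pixel-wise mean across equally sized prediction maps (flat lists)."""
    n_maps = len(prediction_maps)
    return [sum(values) / n_maps for values in zip(*prediction_maps)]

# Hypothetical ranges for (mu, n_a, n_p); placeholders, not the study's values.
mu_values = [10.0, 20.0]
n_a_values = [25, 50]
n_p_values = [0, 5]
parameter_sets = list(itertools.product(mu_values, n_a_values, n_p_values))

# Invented presence sites and study window, in arbitrary planar units.
toy_presences = [(0.0, 0.0), (30.0, 0.0)]
toy_bounds = ((-20.0, 50.0), (-20.0, 20.0))
toy_absences = sample_pseudo_absences(
    toy_presences, 25, 10.0, toy_bounds, random.Random(0)
)

# Two invented two-pixel prediction maps standing in for fitted models.
toy_mean_map = ensemble_mean([[0.2, 0.8], [0.4, 0.6]])
print(len(parameter_sets), len(toy_absences), toy_mean_map)
```

In a full workflow, one model would be fitted per parameter combination and the resulting maps passed to the averaging step.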

Estimation of dengue burden and populations at risk

Formal literature searches were conducted for serological dengue virus incidence surveys. Inclusion criteria were restricted to longitudinal surveys of seroconversion to dengue-virus-specific antibodies carried out in parallel with active symptom surveillance in a defined cohort. The surveys were abstracted, standardized and geopositioned (see Supplementary Information, section D). In total, 54 dengue incidence surveys were collected. Of these, 39 contained information about the ratio of inapparent to apparent infections.

The empirical relationship between incidence and the probability of occurrence was represented using a Bayesian hierarchical model. We defined a negative binomial likelihood function50 with constant dispersion and a rate characterized by a highly flexible data-driven Gaussian process prior51. The Gaussian process prior was parameterized with a quadratic mean function and a squared exponential covariance function51. Uninformative hyperpriors were assigned hierarchically to the prior parameters and the full posterior distribution determined by Markov Chain Monte Carlo (MCMC) sampling52. The entire model was fitted separately for apparent and inapparent infection incidences, with missing inapparent to apparent ratio values imputed in the MCMC. Using human population gridded data for the year 2010 (ref. 53), estimates of apparent and inapparent dengue infections were calculated nationally, regionally and globally. These estimates were then compared to national clinical cases reported to the WHO and differences between our cartographic estimates of infections and the WHO surveillance estimates were reconciled in a comparative analysis addressing key factors in traditional surveillance under-reporting (see Supplementary Information, section E).
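The building blocks named above can be written down compactly. The sketch below gives a negative binomial log-likelihood in a common mean and dispersion parameterization, a quadratic mean function and a squared exponential covariance function. The parameterization, argument names and numerical values are our assumptions for illustration; the exact specification is in Supplementary Information, section D, and the MCMC fitting step is not reproduced.

```python
import math

def neg_binomial_log_pmf(k, mean, dispersion):
    """log P(K = k) for a negative binomial with given mean and dispersion r.

    Assumed parameterization: variance = mean + mean**2 / r, with mean > 0.
    """
    r = dispersion
    p = r / (r + mean)  # success probability implied by mean and r
    return (math.lgamma(k + r) - math.lgamma(r) - math.lgamma(k + 1)
            + r * math.log(p) + k * math.log(1.0 - p))

def quadratic_mean(x, a, b, c):
    """Quadratic mean function of the Gaussian process prior."""
    return a + b * x + c * x * x

def squared_exponential(x1, x2, amplitude, length_scale):
    """Squared exponential covariance between two inputs."""
    return amplitude ** 2 * math.exp(-((x1 - x2) ** 2) / (2.0 * length_scale ** 2))

def covariance_matrix(xs, amplitude, length_scale):
    return [[squared_exponential(a, b, amplitude, length_scale) for b in xs]
            for a in xs]

# Invented values: risk (probability of occurrence) at three locations.
toy_risk = [0.1, 0.5, 0.9]
toy_cov = covariance_matrix(toy_risk, amplitude=2.0, length_scale=0.5)
toy_log_lik = neg_binomial_log_pmf(3, mean=5.0, dispersion=2.0)
print(toy_cov[0][0], toy_log_lik)
```

Nearby risk values receive a high prior covariance, so the fitted relationship between risk and incidence varies smoothly while remaining free to depart from the quadratic trend where the cohort data demand it.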

Change history

Corrected online 24 April 2013
Minor changes were made to the text about disease severity, and an additional citation to ref. 6 was added.

References

  1. Simmons, C. P., Farrar, J. J., van Vinh Chau, N. & Wills, B. Dengue. N. Engl. J. Med. 366, 1423–1432 (2012)
  2. World Health Organization. Dengue: Guidelines for Diagnosis, Treatment, Prevention and Control. WHO/HTM/NTD/DEN/2009.1 (World Health Organization, 2009)
  3. Tatem, A. J., Hay, S. I. & Rogers, D. J. Global traffic and disease vector dispersal. Proc. Natl Acad. Sci. USA 103, 6242–6247 (2006)
  4. Brady, O. J. et al. Refining the global spatial limits of dengue virus transmission by evidence-based consensus. PLoS Negl. Trop. Dis. 6, e1760 (2012)
  5. Halstead, S. B. Pathogenesis of dengue: challenges to molecular biology. Science 239, 476–481 (1988)
  6. Endy, T. P. et al. Determinants of inapparent and symptomatic dengue infection in a prospective study of primary school children in Kamphaeng Phet, Thailand. PLoS Negl. Trop. Dis. 5, e975 (2011)
  7. Sabchareon, A. et al. Protective efficacy of the recombinant, live-attenuated, CYD tetravalent dengue vaccine in Thai schoolchildren: a randomised, controlled phase 2b trial. Lancet 380, 1559–1567 (2012)
  8. Halstead, S. B. Dengue vaccine development: a 75% solution? Lancet 380, 1535–1536 (2012)
  9. Gubler, D. J. Dengue and dengue hemorrhagic fever. Clin. Microbiol. Rev. 11, 480–496 (1998)
  10. Beatty, M. E., Letson, G. W. & Margolis, H. S. Estimating the global burden of dengue. Am. J. Trop. Med. Hyg. 81 (Suppl. 1). 231 (2009)
  11. Van Kleef, E., Bambrick, H. & Hales, S. The geographic distribution of dengue fever and the potential influence of global climate change. TropIKA. net http://journal.tropika.net/scielo.php?script=sci_arttext&pid=S2078-86062010005000001&lng=en&nrm=iso (2009)
  12. World Health Organization. International Travel and Health: Situation as on 1 January 2012 (World Health Organization, 2012)
  13. Hales, S., de Wet, N., Maindonald, J. & Woodward, A. Potential effect of population and climate changes on global distribution of dengue fever: an empirical model. Lancet 360, 830–834 (2002)
  14. Rogers, D. J., Wilson, A. J., Hay, S. I. & Graham, A. J. The global distribution of yellow fever and dengue. Adv. Parasitol. 62, 181–220 (2006)
  15. Monath, T. P. Yellow fever and dengue-the interactions of virus, vector and host in the re-emergence of epidemic disease. Semin. Virol. 5, 133–145 (1994)
  16. Rigau-Pérez, J. G. et al. Dengue and dengue haemorrhagic fever. Lancet 352, 971–977 (1998)
  17. Rodhain, F. La situation de la dengue dans le monde. Bull. Soc. Pathol. Exot. 89, 87–90 (1996)
  18. Freifeld, C. C., Mandl, K. D., Reis, B. Y. & Brownstein, J. S. HealthMap: global infectious disease monitoring through automated classification and visualization of Internet media reports. J. Am. Med. Inform. Assoc. 15, 150–157 (2008)
  19. Chakravarti, A., Arora, R. & Luxemburger, C. Fifty years of dengue in India. Trans. R. Soc. Trop. Med. Hyg. 106, 273–282 (2012)
  20. Kakkar, M. Dengue fever is massively under-reported in India, hampering our response. Br. Med. J. 345, e8574 (2012)
  21. Murray, C. J. L. et al. Disability-adjusted life years (DALYs) for 291 diseases and injuries in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380, 2197–2223 (2012)
  22. Cummings, D. A. et al. The impact of the demographic transition on dengue in Thailand: insights from a statistical analysis and mathematical modeling. PLoS Med. 6, e1000139 (2009)
  23. Johansson, M. A., Hombach, J. & Cummings, D. A. Models of the impact of dengue vaccines: a review of current research and potential approaches. Vaccine 29, 5860–5868 (2011)
  24. Hay, S. I. et al. Estimating the global clinical burden of Plasmodium falciparum malaria in 2007. PLoS Med. 7, e1000290 (2010)
  25. Gething, P. W. et al. A new world malaria map: Plasmodium falciparum endemicity in 2010. Malar. J. 10, 378 (2011)
  26. Gething, P. W. et al. A long neglected world malaria map: Plasmodium vivax endemicity in 2010. PLoS Negl. Trop. Dis. 6, e1814 (2012)
  27. Anders, K. L. & Hay, S. I. Lessons from malaria control to help meet the rising challenge of dengue. Lancet Infect. Dis. 12, 977–984 (2012)
  28. Gething, P. W. et al. Modelling the global constraints of temperature on transmission of Plasmodium falciparum and P. vivax. Parasites Vectors 4, 92 (2011)
  29. Chefaoui, R. M. & Lobo, J. M. Assessing the effects of pseudo-absences on predictive distribution model performance. Ecol. Modell. 210, 478–486 (2008)
  30. TDR/World Health Organization. Report of the Scientific Working Group on Dengue, 2006. TDR/SWG/08 (TDR/World Health Organization, 2006)
  31. Brownstein, J. S., Freifeld, C. C., Reis, B. Y. & Mandl, K. D. Surveillance sans frontières: internet-based emerging infectious disease intelligence and the HealthMap project. PLoS Med. 5, e151 (2008)
  32. Scharlemann, J. P. W. et al. Global data for ecology and epidemiology: a novel algorithm for temporal Fourier processing MODIS data. PLoS ONE 3, e1408 (2008)
  33. Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G. & Jarvis, A. Very high resolution interpolated climate surfaces for global land areas. Int. J. Climatol. 25, 1965–1978 (2005)
  34. Focks, D. A., Haile, D. G., Daniels, E. & Mount, G. A. Dynamic life table model for Aedes aegypti (Diptera: Culicidae): analysis of the literature and model development. J. Med. Entomol. 30, 1003–1017 (1993)
  35. Focks, D. A., Haile, D. G., Daniels, E. & Mount, G. A. Dynamic life table model for Aedes aegypti (Diptera: Culicidae): simulation and validation. J. Med. Entomol. 30, 1018–1028 (1993)
  36. Hay, S. I., Tatem, A. J., Graham, A. J., Goetz, S. J. & Rogers, D. J. Global environmental data for mapping infectious disease distribution. Adv. Parasitol. 62, 37–77 (2006)
  37. Hay, S. I. et al. A world malaria map: Plasmodium falciparum endemicity in 2007. PLoS Med. 6, e48 (2009)
  38. Nelson, A. Estimated travel time to the nearest city of 50,000 or more people in year 2000. http://bioval.jrc.ec.europa.eu/products/gam (accessed 1 January 2012) (Global Environment Monitoring Unit – Joint Research Centre of the European Commission, 2008)
  39. Nordhaus, W. D. Geography and macroeconomics: new data and new findings. Proc. Natl Acad. Sci. USA 103, 3510–3517 (2006)
  40. Elith, J. et al. Novel methods improve prediction of species’ distributions from occurrence data. Ecography 29, 129–151 (2006)
  41. Stevens, K. B. & Pfeiffer, D. U. Spatial modelling of disease using data- and knowledge-driven approaches. Spat. Spatiotemporal Epidemiol. 2, 125–133 (2011)
  42. Breiman, L. Classification and Regression Trees (Chapman & Hall/CRC, 1984)
  43. Friedman, J. H. Greedy function approximation: a gradient boosting machine. Ann. Stat. 29, 1189–1232 (2001)
  44. Stokland, J. N., Halvorsen, R. & Stoa, B. Species distribution modelling. Effect of design and sample size of pseudo-absence observations. Ecol. Modell. 222, 1800–1809 (2011)
  45. Lobo, J. M. & Tognelli, M. F. Exploring the effects of quantity and location of pseudo-absences and sampling biases on the performance of distribution models with limited point occurrence data. J. Nat. Conserv. 19, 1–7 (2011)
  46. VanDerWal, J., Shoo, L. P., Graham, C. & Williams, S. E. Selecting pseudo-absence data for presence-only distribution modeling: how far should you stray from what you know? Ecol. Modell. 220, 589–594 (2009)
  47. Barbet-Massin, M., Jiguet, F., Albert, C. H. & Thuiller, W. Selecting pseudo-absences for species distribution models: how, where and how many? Methods Ecol. Evol. 3, 327–338 (2012)
  48. Ward, G., Hastie, T., Barry, S., Elith, J. & Leathwick, J. R. Presence-only data and the EM algorithm. Biometrics 65, 554–563 (2009)
  49. Araújo, M. B. & New, M. Ensemble forecasting of species distributions. Trends Ecol. Evol. 22, 42–47 (2007)
  50. Hilbe, J. M. Negative Binomial Regression 2nd edn, 251 (Cambridge Univ. Press, 2011)
  51. Banerjee, S., Carlin, B. P. & Gelfand, A. E. Hierarchical Modeling and Analysis for Spatial Data. Monographs on Statistics and Applied Probability 101 (Chapman & Hall/CRC, 2004)
  52. Patil, A., Huard, D. & Fonnesbeck, C. J. PyMC: Bayesian stochastic modelling in Python. J. Stat. Softw. 35, e1000301 (2010)
  53. Balk, D. L. et al. Determining global population distribution: methods, applications and data. Adv. Parasitol. 62, 119–156 (2006)

Acknowledgements

S.I.H. is funded by a Senior Research Fellowship from the Wellcome Trust (095066) which also supports S.B. and P.W.G. C.P.S. is also funded by a Senior Research Fellowship from the Wellcome Trust (084368). O.J.B. is funded by a BBSRC Industrial CASE studentship. J.P.M., A.W.F., T.J., G.R.W.W., C.P.S., T.W.S. and S.I.H. received funding from, and with S.B., P.W.G., O.J.B. and J.J.F. acknowledge the contribution of, the International Research Consortium on Dengue Risk Assessment Management and Surveillance (IDAMS, 21803, http://www.idams.eu). This work was funded in part by EU grant 2011-261504 EDENEXT and the paper is catalogued by the EDENEXT Steering Committee as EDENEXT. S.I.H. and T.W.S. also acknowledge funding support from the RAPIDD program of the Science & Technology Directorate, Department of Homeland Security, and the Fogarty International Center, National Institutes of Health.

Author information

Affiliations

  1. Spatial Ecology and Epidemiology Group, Tinbergen Building, Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK

    • Samir Bhatt,
    • Peter W. Gething,
    • Oliver J. Brady,
    • Jane P. Messina,
    • Andrew W. Farlow,
    • Catherine L. Moyes,
    • John M. Drake,
    • Monica F. Myers,
    • G. R. William Wint &
    • Simon I. Hay
  2. Oxitec Limited, Milton Park, Abingdon OX14 4RX, UK

    • Oliver J. Brady
  3. Odum School of Ecology, University of Georgia, Athens, Georgia 30602, USA

    • John M. Drake
  4. Department of Pediatrics, Harvard Medical School and Children’s Hospital Informatics Program, Boston Children’s Hospital, Boston, Massachusetts 02115, USA

    • John S. Brownstein
  5. Department of Community and Family Medicine, Geisel School of Medicine, Dartmouth College, Hanover, New Hampshire 03755, USA

    • Anne G. Hoen
  6. INDEPTH Network Secretariat, East Legon, PO Box KD 213, Accra, Ghana

    • Osman Sankoh
  7. School of Public Health, University of the Witwatersrand, Braamfontein 2000, Johannesburg, South Africa

    • Osman Sankoh
  8. Institute of Public Health, University of Heidelberg, 69120 Heidelberg, Germany

    • Osman Sankoh
  9. Fogarty International Center, National Institutes of Health, Bethesda, Maryland 20892, USA

    • Dylan B. George,
    • Thomas W. Scott &
    • Simon I. Hay
  10. Section Clinical Tropical Medicine, Department of Infectious Diseases, Heidelberg University Hospital, INF 324, D 69120 Heidelberg, Germany

    • Thomas Jaenisch
  11. Environmental Research Group Oxford (ERGO), Tinbergen Building, Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK

    • G. R. William Wint
  12. Oxford University Clinical Research Unit, Hospital for Tropical Diseases, Ho Chi Minh City, Vietnam

    • Cameron P. Simmons &
    • Jeremy J. Farrar
  13. Centre for Tropical Medicine, University of Oxford, Churchill Hospital, Oxford OX3 7LJ, UK

    • Cameron P. Simmons &
    • Jeremy J. Farrar
  14. Department of Entomology, University of California Davis, Davis, California 95616, USA

    • Thomas W. Scott
  15. Department of Medicine, National University of Singapore, 119228 Singapore

    • Jeremy J. Farrar

Contributions

S.I.H. and J.J.F. conceived the research. S.B. and S.I.H. drafted the manuscript. S.B. drafted the Supplementary Information with significant support on sections A (O.J.B., C.L.M.), B (J.P.M., G.R.W.W.), C (P.W.G.), D (O.J.B., T.W.S.), and O.J.B. wrote section E. J.S.B. and A.G.H. provided HealthMap occurrence data and advice on its provenance. O.J.B. reviewed all the occurrence data. S.B. did the modelling and analysis with advice from J.M.D., P.W.G. and S.I.H. J.P.M. created all maps. All authors discussed the results and contributed to the revision of the final manuscript.

Competing financial interests

The authors declare no competing financial interests.

Supplementary information

PDF files

  1. Supplementary Information (13.1 MB)

    This file contains Supplementary Information Sections A-F – see contents for details.

Additional data