Trade-offs between individual and ensemble forecasts of an emerging infectious disease

Oidtman, Rachel J.; Omodei, Elisa; Kraemer, Moritz U. G.; Castañeda-Orjuela, Carlos A.; Cruz-Rivera, Erica; Misnaza-Castrillón, Sandra; Cifuentes, Myriam Patricia; Rincon, Luz Emilse; Cañon, Viviana; Alarcon, Pedro de; España, Guido; Huber, John H.; Hill, Sarah C.; Barker, Christopher M.; Johansson, Michael A.; Manore, Carrie A.; Reiner, Jr., Robert C.; Rodriguez-Barraquer, Isabel; Siraj, Amir S.; Frias-Martinez, Enrique; García-Herranz, Manuel; Perkins, T. Alex

doi:10.1038/s41467-021-25695-0

Download PDF

Article
Open access
Published: 10 September 2021

Trade-offs between individual and ensemble forecasts of an emerging infectious disease

Nature Communications volume 12, Article number: 5379 (2021) Cite this article

4068 Accesses
10 Citations
24 Altmetric
Metrics details

Subjects

Abstract

Probabilistic forecasts play an indispensable role in answering questions about the spread of newly emerged pathogens. However, uncertainties about the epidemiology of emerging pathogens can make it difficult to choose among alternative model structures and assumptions. To assess the potential for uncertainties about emerging pathogens to affect forecasts of their spread, we evaluated the performance 16 forecasting models in the context of the 2015-2016 Zika epidemic in Colombia. Each model featured a different combination of assumptions about human mobility, spatiotemporal variation in transmission potential, and the number of virus introductions. We found that which model assumptions had the most ensemble weight changed through time. We additionally identified a trade-off whereby some individual models outperformed ensemble models early in the epidemic, but on average the ensembles outperformed all individual models. Our results suggest that multiple models spanning uncertainty across alternative assumptions are necessary to obtain robust forecasts for emerging infectious diseases.

Forecasting COVID-19 activity in Australia to support pandemic response: May to October 2020

Article Open access 30 May 2023

The United States COVID-19 Forecast Hub dataset

Article Open access 01 August 2022

A reproducible ensemble machine learning approach to forecast dengue outbreaks

Article Open access 15 February 2024

Introduction

Pathogen emergence, or the phenomenon of a novel or established pathogens invading a new host population, has been occurring more frequently in recent decades¹. In the last 40 years, more than 150 pathogens of humans have been identified as emerging or re-emerging^2,3. In these situations, host populations are largely susceptible, which can result in dynamics ranging from self-limiting outbreaks, as with Lassa virus⁴, to sustained pandemics, as with HIV⁵, depending on the pathogen’s traits and the context in which it emerges. When emergence does occur, mathematical models can be helpful for anticipating the future course of the pathogen’s spread^6,7,8.

A necessary part of using models to forecast emerging pathogens is making decisions about how to handle the many uncertainties associated with this unfamiliar microbes⁸. Given the biological and ecological diversity of emerging pathogens, there is often considerable uncertainty about various aspects of their natural histories, such as their potential for superspreading⁹, the role of human mobility in their spatial spread^10,11, drivers of spatiotemporal variation in their transmission^6,12, and even their modes of transmission¹³. In the case of MERS-CoV, for example, it took years to determine that the primary transmission route was spillover from camels rather than sustained human-to-human transmission¹⁴. A lack of definitive understanding about such basic aspects of natural history represents a major challenge for forecasting emerging pathogens.

Inevitably, different forecasters make diverse choices about how to address unknown aspects of an emerging pathogen’s natural history, as they do for numerous model features. This diversity of approaches has itself been viewed as part of the solution to the problem of model uncertainty, based on the idea that the biases of different models might counteract one another to produce a reliable forecast when viewed from the perspective of an ensemble of models¹⁵. This idea has support in multi-model efforts to forecast seasonal transmission of endemic pathogens, such as influenza and dengue viruses^{16,17,18,19,20}, with ensemble forecasts routinely outperforming individual models. These successes with endemic pathogens have motivated multi-model approaches in response to several emerging pathogens, including forecasting challenges for chikungunya²¹ and Ebola²², vaccine trial site selection for Zika²³, and a multi-model decision-making framework for COVID-19^15,24.

Although there has been increased attention to multi-model forecasting of emerging pathogens in the last few years, these initiatives have involved significant effort to coordinate forecasts among multiple modeling groups^25,26. Coordination across multiple groups has clear potential to add value beyond what any single modeling group can offer alone. At the same time, using multiple models to hedge against uncertainties about a pathogen’s natural history could potentially improve forecasts from a single modeling group, too^16,18. This could, in turn, improve ensemble forecasts based on contributions from multiple modeling groups. An ensemble-based approach by one modeling group that contributes to forecasts of seasonal influenza in the United States demonstrates the success that a single modeling group can achieve with an ensemble-based approach²⁷ and that such an ensemble can contribute value to an ensemble of forecasts from multiple modeling groups¹⁸. Similar approaches have not been widely adopted for forecasting emerging pathogens by a single modeling group (although see ref. ²⁸), despite the heightened uncertainty inherent to emerging pathogens.

Here, we evaluate the potential for an ensemble of models that span uncertainties in pathogen natural history but share a common core structure, to accurately forecast the dynamics of an emerging pathogen. We do so in the context of the 2015–2016 Zika epidemic in Colombia, which was well-characterized epidemiologically (Fig. 1)²⁹ and involved potentially consequential uncertainties about (i) the role of human mobility in facilitating spread across the country³⁰, (ii) the relationship between environmental conditions and transmission of this mosquito-borne virus^6,12, and (iii) the number of times the virus was introduced into the country³¹. In this retrospective analysis, we used data assimilation to update 16 distinct models throughout the epidemic period and assessed the forecast performance of all models relative to an equally weighted ensemble model. This allowed us to quantify the contribution of variants of each of the three aforementioned uncertainties to model performance during different phases of the epidemic. In doing so, we sought to not only assess the performance of the ensemble model relative to individual models but also to learn about features of individual models that may be associated with improved forecast accuracy over the course of an epidemic.

**Fig. 1: Temporal and spatial variation of Zika incidence, temperature, and mosquito occurrence probability in Colombia.**

Results

General forecast performance

Before any data assimilation had occurred, our 16 models (see Table 1) initially forecasted very low incidence across most departments over the 60-week period of our analysis (Fig. 2 top row, Supplementary Fig. 12). Even so, short-term forecasts over a 4-week horizon were consistent with the still-low observed incidence at that time (Fig. 1 purple, Supplementary Fig. 18). By the time twelve weeks of data had been assimilated into the models, forecasts over the 60-week period of our analysis were considerably higher than the initial forecasts and better aligned with the observed trajectory of the epidemic (Fig. 2 second row, Supplementary Fig. 13). Over those first 12 weeks, model parameters changed modestly (Supplementary Fig. 6) and correlations among parameters began to emerge (Supplementary Figs. 7–10). We observed a more substantial change in the proportion of individual stochastic realizations (where the nth stochastic realization is the nth “particle” generated from some set of parameters ${\vec{\theta }}_{t,n}$ at time t) resulting in an epidemic, with those particles resulting in no epidemic being filtered out almost entirely by week 12 (Supplementary Fig. 1). Because each particle retained its stochastic realization of past incidence across successive data assimilation periods, stochastic realizations of past incidence were inherited by particles much like parameter values. By week 24, many of the models correctly recognized that they were at or near the epidemic’s peak and forecasted a downward trajectory for the remainder of the 60-week period of our analysis (Fig. 2 third row, Supplementary Fig. 27). The particle filtering algorithm replaced nearly half of the original particles by that point (Supplementary Fig. 2), with the new particles consisting of stochastic realizations of past incidence selected through data assimilation and updated every four weeks with forward simulations based on either original or new parameter combinations. As the end of the 60-week period of our analysis was approached, parameter correlations continued to strengthen (Supplementary Figs. 7–10), our estimate of the reporting probability increased (Supplementary Fig. 6), and only around 20% of the original particles remained (Supplementary Fig. 1).

Table 1 Different model assumptions regarding the role of human mobility in facilitating pathogen spread across the country, the relationship between environmental conditions and transmission of ZIKV, and the number of times the virus was introduced into Colombia.

Full size table

Fig. 2: Observed incidence (navy points) with the median forecast for 16 models (black lines) with the equally weighted ensemble model (green band) for Antioquia, Norte de Santander, Cauca, and Amazonas at five points throughout the epidemic.

Model-specific forecast performance

To quantify the forecast performance of individual models over time, we used logarithmic scoring (hereafter, log scoring) to compare forecasts of cumulative incidence 4 weeks into the future to observed values at departmental and national levels. We assessed log scores once the first case was reported nationally for spatially coupled models (i.e., models with explicit human mobility), and once the first case was reported in each department for nonspatial models (i.e., models with no explicit human mobility). Log scores were generally high for spatially coupled models early in the epidemic, given that observed cases and forecasts were both low at that time (Supplementary Fig. 18a–c). By week 12, as cases were reported in more departments, the accuracy of forecasts from nonspatial models improved (Supplementary Fig. 18d onward). Forecast performance around the peak of the epidemic differed considerably across models and departments, with forecasts from non-spatial models being somewhat lower than observed incidence and forecasts from spatially coupled models being somewhat higher (Supplementary Fig. 14, Supplementary Fig. 18f–j). Around the peak of the epidemic, forecasts from spatially coupled models generally had higher log scores in departments with lower incidence (e.g., Nariño). Later in the epidemic (weeks 40–56), some models continued to forecast higher incidence than observed in some departments, despite having passed the peak incidence of reported cases (Supplementary Fig. 16). In particular, models that used the dynamic instead of the static formulation of the reproduction number (i.e., the temporal relationship between R and environmental drivers is dynamic instead of static) were more susceptible to this behavior (note lower log scores in “Rt” versus “R” models in Supplementary Fig. 18k–o), given that their forecasts were sensitive to seasonal changes in temperature and mosquito occurrence.

Next, we used these log scores in an expectation–maximization (EM) optimization algorithm³² to identify an optimal weighting of retrospective model-specific forecasts into an ensemble forecast (Supplementary Figs. 25–29) in each forecasting period (Supplementary Fig. 17). To learn how model assumptions affected the inclusion of different models into the optimally weighted ensemble for each forecasting period, we summed and then normalized models’ ensemble weights across each class of assumption (Fig. 3). Over the course of the epidemic, changes in weighting for the assumptions about human mobility and spatiotemporal variation in transmission, but not about the number of virus introductions into the country, closely followed patterns in the trajectory of the national epidemic. Spatially coupled models had most or all of the weight in the early and late stages of the epidemic, while non-spatial models had most of the weight around the peak of the epidemic (Fig. 3b). Although the non-spatial models somewhat under-predicted incidence in the middle stages of the epidemic, this was often to a lesser extent than the spatially coupled models’ over-predictions of incidence (Supplementary Fig. 3). As a result, the EM algorithm achieved a balance between the over- and under-predictions of these different models.

**Fig. 3: Ensemble weight partitioned across assumptions about the role of human mobility in driving transmission, drivers of spatiotemporal variation in R, and the number of ZIKV introductions.**

The maximum ensemble weight in any forecasting period was 0.802, held by one model with a static R, two ZIKV introductions into the country, and CDR-informed human mobility 12 weeks after the first reported Zika case (Supplementary Fig. 17). Combined, the two models with static R and CDR-informed human mobility data had the most instances of a nonzero ensemble weight (Supplementary Fig. 17), occurring in 13 of 15 assimilation periods, with an average weight of 0.18. Around the peak of the epidemic, nonspatial models had the highest ensemble weight, reflecting the accuracy of short-term forecasts in some departments (e.g., Magdalena and Vaupés) and their overall accuracy in nationally aggregated forecasts (Supplementary Fig. 11). Near the end of the epidemic, the ensemble weight for models with a static R (Fig. 3c) increased as their forecasts more closely matched the downturn of incidence later in the epidemic relative to models with dynamic R (Supplementary Fig. 20). This was likely the result of mosquito occurrence probability and temperature becoming more favorable for transmission in many departments later in the epidemic (Supplementary Figs. 21 and 22), causing the dynamic R models to forecast a late resurgence in Zika incidence.

Target-oriented forecast performance

Short-term changes in incidence are an important target of infectious disease forecasting, but there are other targets of potentially greater significance to public health decision-making. To explore these, we evaluated the ability of the 16 models—and an evenly weighted ensemble—to forecast three targets at the department level: peak incidence, week of peak incidence, and onset week, which we defined as the week by which ten cases were first reported. We evaluated models based on log scores of these targets. Summing log scores across departments to allow for comparisons across different forecasting periods (Fig. 4), we found that, on average, the ensemble model outperformed every individual model for all three forecasting targets (indicated by the ensemble model’s location on the y-axis). Early in the epidemic, spatially coupled models with a static R performed only slightly better (up to 1%) than the equally weighted ensemble (Fig. 4). For the remainder of the epidemic, the equally weighted ensemble model outperformed all individual models (Fig. 4). Such small changes in forecast performance when averaging over space shows that differences in forecast performance across space dominate relative to those across time.

**Fig. 4: Model-specific forecast scores are relative to the equally weighted ensemble model for each assimilation period and forecasting target.**

By summing log scores across forecasting periods to allow for comparisons across departments (Fig. 5), we found that some individual models outperformed the ensemble model in forecasting the peak incidence and the week of peak incidence. In departments on the Caribbean Coast that experienced intermediate epidemic sizes (e.g., Antioquia, Sucre, and Atlántico), spatially coupled models with a static R outperformed the ensemble model at forecasting the peak week by about 10% (Fig. 5a). At those same locations, the equally weighted ensemble performed better than or similar to those same models at forecasting peak incidence and onset week (Fig. 5b, c). Over forecasting periods and departments, the nonspatial models consistently had lower average forecast scores than the spatially coupled models (indicated by their location on the y-axis in Figs. 4 and 5). This trend appeared because initial forecasts from nonspatial models were not updated until the first case appeared in each department, while initial forecasts from spatially coupled models were updated when the first case appeared in the country.

**Fig. 5: Model-specific forecast scores are relative to equally weighted ensemble model for each department and forecasting target.**

Discussion

We assessed the potential for a suite of individual models that span a range of uncertainties, and ensembles of these models, to accurately forecast the dynamics of an emerging pathogen. Results from the general forecast performance analysis demonstrated that once we began assimilating data into models, forecasts rapidly became more accurate. Models were initialized with a wide range of parameter values³³, with many initial parameter combinations producing unrealistic forecast trajectories, but after only four assimilation periods (12 weeks), nearly 100% of those parameters that produced zero infections were dropped. Similar to other retrospective forecast analyses^16,34, as more data were assimilated into the models over time, the model fits and forecasts generally became more closely aligned with temporal trends in the data. This was because the particle filter allowed model parameters to continually adapt to noisy data³⁵. There were still some exceptions where the particle filter could not fully compensate for shortcomings of the transmission model, such as the drastic underestimates of incidence in departments with sub-optimal conditions for transmission (e.g., static R model in Risaralda in Supplementary Fig. 20). At the same time, the broader suite of models buffered against shortcomings of any single transmission model.

In the model-specific forecast performance analysis, we identified clear temporal trends related to when models with a static R versus a dynamic R should be included in an optimally weighted ensemble. In contrast, there were no clear temporal trends in weighting regarding the assumption about the number of times the virus was introduced into the country, potentially reflecting that, even with multiple introductions, the most transmission may have been linked to a single introduction³¹. Models with a dynamic R had higher weights in the ensemble at the peak of the epidemic, while models with a static R had higher weights at the beginning and end of the epidemic. This was likely due to temporal shifts in temperature and mosquito occurrence probabilities dominating forecasts of transmission potential for the models with a dynamic R. For example, in the latter parts of the epidemic when reported cases were declining, mosquito conditions and the temperature became more suitable for transmission in many departments. This caused models with a dynamic R to forecast a resurgence in ZIKV transmission in those departments, while models with a static R forecasted a downturn in incidence that was more similar to the observed dynamics. This finding that susceptible depletion may have been more influential than temporal variation in environmental conditions for the Zika epidemic is consistent with recent findings for SARS-CoV-2³⁶.

Through the model-specific forecast performance analysis, we also found that spatially coupled models had higher ensemble weights in the early and late stages of the epidemic, while non-spatial models had higher weights around the peak of the epidemic. The importance of including spatially coupled models in the optimally weighted ensemble early in the epidemic supports the general notion that human mobility may be particularly predictive of pathogen spread early in an epidemic^7,30,37,38. In part, temporal shifts in weighting around the peak of the epidemic were due to more accurate nationally aggregated forecasts from the non-spatial models. This result was consistent with a previous modeling analysis of the invasion of the chikungunya virus in Colombia, which showed that models fitted independently to subnational time series recreated national-level patterns well when aggregated³⁹. A shift in ensemble weights toward non-spatial models around the peak of the epidemic was also due to less accurate department-level forecasts from the spatially coupled models. At that point in the epidemic, the prevalence was at its highest, which means that we would expect local epidemics to be more endogenously driven and less sensitive to pathogen introductions across departments.

In the target-oriented forecast performance analysis, we found that the equally weighted ensemble generally outperformed individual models, with a few key exceptions. In the months leading up to the peak of the epidemic, spatially coupled models with a static R had slightly, but consistently, higher forecast scores with respect to peak week and onset week. Like the model-specific analysis results, this result illustrates the importance of human mobility in facilitating the spread of an emerging pathogen across a landscape³⁰. Individual models outperforming the equally weighted ensemble model in the early phase of the epidemic is not wholly surprising given that non-spatial models were represented equally in that ensemble throughout the epidemic. Nonspatial models may be realistic when locations have self-sustaining epidemics, but they are not appropriate for capturing early phase growth and its dependence on importations⁴⁰. Another instance when individual models had higher forecast scores than the equally weighted ensemble was with respect to peak week for spatially coupled models with a static R in departments along the Caribbean Coast. Compared to dynamic R models, the static R models more accurately forecasted peak week in these departments (e.g., Magdalena, Cesar, and Sucre), as they did not forecast a late-stage resurgence in transmission. The equal weighting of the dynamic R models in the ensemble, therefore, led to overall lower peak week forecast scores for the ensemble relative to static R models. Still, our results indicating that an equally weighted ensemble mostly outperformed individual models adds to the growing literature highlighting the importance of ensemble methods in epidemiological forecasting^{16,17,27,41,42}.

We considered both equally and optimally weighted ensembles and found that the equally weighted ensemble had a lower root mean square error than the optimally weighted ensemble (RMSE = 0.640 and 0.705, respectively)—therefore providing slightly more accurate forecasts of the observed data (Supplementary Fig. 23). With the optimally weighted ensemble, which we updated at each data assimilation period, we found that model weights changed over the course of the epidemic Supplementary Fig. 17). Although this is intuitive given the changing nature of an emerging epidemic through time⁸, it may be problematic in practice. It is almost as if the ensemble weights require their own forecast. On the one hand, promising new advances in ensemble modeling^27,41—such as adaptive stacking for seasonal influenza forecasting⁴³—are being used to address this issue of identifying optimal, adaptive weights without training to historical data. On the other hand, is an emerging pathogen context, establishing optimal model weights by way of model fitting and forecast generation is often reliant on available incidence data (rather than historical data) that is highly variable⁴⁴, given the delayed nature of data reporting⁴⁵. In this context, our results demonstrate that it is preferable to use an equally weighted ensemble to buffer against uncertainty in optimal ensemble weights. As is also being demonstrated in forecasts of COVID-19, equally weighted ensembles can provide accurate forecasts^26,44,46 and maybe a better reflection of the considerable structural uncertainty inherent to models of emerging pathogen transmission²⁴.

A few limitations of our study should be noted. First, while an equally weighted ensemble approach allowed us to consider contributions of several alternative model assumptions, there was high uncertainty associated with these forecasts (sometimes spanning orders of magnitude, see Supplementary Fig. 24). Potential end-users of these types of forecasts could consider high levels of uncertainty to be problematic for decision-making⁴⁷, though if the uncertainty does not affect the choice of a control measure, then the uncertainty may not be as relevant⁴⁸. In the future, ensemble approaches aimed at increasing precision and reducing uncertainty^27,49 could be used in conjunction with equally weighted ensembles. Second, we considered alternative models across only three assumptions. With ZIKV transmission, there are additional structural uncertainties that could be considered, such as the role of sexual transmission⁵⁰. In real-time applications of our or other Zika forecasting models, it could be worthwhile to explore these types of ZIKV-specific structural uncertainties. Relatedly, the static and dynamic R had minor differences in their formulations, such that the static R also included a socioeconomic index. In future work, it could be interesting to explore if the inclusion of this time-independent variable affected the dynamic R. Third, in this analysis, we did not explicitly consider delays in reporting that likely would have occurred had these forecasts been generated in real time⁵¹. In that context, temporally aggregating data to a wider interval (e.g., at 2-week intervals rather than 1-week intervals) could potentially help mitigate the effects of reporting delays to some extent. Fourth, we assumed that the reporting probability was constant through time. Although this is a standard assumption⁵² given the lack of data to inform a time-varying relationship for this mechanistic element⁵³, it would be interesting to include and test a reporting dynamics model (e.g., the reporting probability scales with incidence⁵⁴) as an additional component included in our ensemble framework. Fifth, we conducted this analysis at the departmental level instead of this municipality level, which could obfuscate meaningful differences across regions of a single department²⁹. In future work, it would be useful to test and assess our forecasting algorithm and outputs at different spatial scales³⁹.

As the world is reminded of on a daily basis with COVID-19, pathogen emergence is an ongoing phenomenon that will continue to pose threats in the future⁵⁵. A better understanding of an emerging pathogen’s natural history could help to reduce pathogen-specific structural uncertainties, but these insights may not always occur in time to inform model development for real-time forecasting⁸. Our results highlight important trade-offs between individual and ensemble models in this context. Specifically, we demonstrated that an equally weighted ensemble forecast was almost always more accurate than individual models. Instances in which individual models were better than the ensemble, or greatly improved the ensemble, also provided insight. For example, incorporating human mobility into models improved forecasts in the early and late phases of an epidemic, which underscores the importance of making aggregated mobility data available early in an epidemic⁵⁶. The range of outcomes resulting from alternative modeling assumptions in model-specific forecasts demonstrates why it will continue to be important to address structural uncertainties in forecasting models in the future.

Methods

Data

We used passive mandatory surveillance data for reported cases of Zika, from the National Surveillance System (Sivigila) at the first administrative level (31 mainland departments) in Colombia. To span the beginning, peak, and tail of the epidemic in Colombia, we focused on the 60-week period between August 9, 2015 and October 1, 2016. We used the version of these data collated by Siraj et al.²⁹, as well as modeled values of weekly average temperature and estimates of the department-level population from that data set. For some models, we worked with monthly estimates of mosquito occurrence probability (i.e., dynamic R models) obtained from Bogoch et al.⁵⁷, and for others, we worked with time-averaged estimates (i.e., static R models) from Kraemer et al.⁵⁸.

For models that relied on cell phone data to describe human mobility, we used anonymized and aggregated call detail records (CDRs). Every time a user receives or makes a call, a CDR including the time, date, ID, and the tower (BTS) providing the service is generated. The positions of the BTSs are georeferenced and so the aggregated mobility between towers can be tracked in time. We used this information to derive daily mobility matrices at the municipality level in Colombia from February 2015 to August 2015. Mobility matrices captured the number of individuals that moved in each given day from one municipality to another (i.e., that appeared in BTSs of different municipalities). The change for each day was captured by comparing the last known municipality to the current one. No individual information or records were available.

As these data did not align with the time frame of the epidemic, and to calculate a mobility matrix at a department level, we computed a representative mobility matrix by summing all available CDRs within the municipalities of each department and normalizing them to sum to one relative to the sum of CDRs originating from that department. In five departments (Amazonas, Cudinamarca, Guainía, Vaupés, and Vichada), the proportion of CDRs linking callers within the same department was below 60%. Given that this implied an unrealistically low proportion of time spent within an individual’s department of residence, we interpreted those values as idiosyncrasies of the data and not representative of human mobility⁵⁹. Thus, for those five departments, we replaced the proportion of within-department CDRs with the mean proportion of within-department CDRs from all other departments. We then re-normalized the number of CDRs originating from each department in our mobility matrix to sum to one.

Summary of models

To produce weekly forecasts of ZIKV transmission across Colombia, we sought to use a computationally efficient model with the flexibility to include relevant epidemiological and ecological mechanisms. We used a previously described semi-mechanistic, discrete-time, and stochastic model⁶⁰ that had been adapted and used to model mosquito-borne pathogen transmission^61,62. Using this model, we were able to account for the extended generation interval of ZIKV using overlapping pathogen generations across up to five weeks of the generation interval distribution of ZIKV⁶². Furthermore, we could specify this model to be either spatially connected or nonspatial—a key assumption that we considered in our analysis.

We considered a suite of 16 models that spanned all combinations of four assumptions about human mobility across Colombia’s 31 mainland departments, two assumptions about the relationship between environmental conditions and the reproduction number (R), and two assumptions about how many times the Zika virus was introduced to Colombia (Table 1). Twelve of 16 models allowed for spatial connectivity across departments⁶⁰, while four models were nonspatial. There were up to two steps in the transmission process: transmission across departments (for spatially connected models) and local transmission within departments.

Across departments, we simulated the movement of individuals using a spatial connectivity matrix (H), the dth column of which corresponds to the proportion of time spent by residents of department d in all departments $\vec{d}$. Using this matrix, we redistributed infections in department d in week t (I_d,t) across $\vec{d}$ as a multinomial random variable

$${I}_{\vec{d},\, t}^{\prime}\ \sim \ {{{{{{{{{\rm{multinomial}}}}}}}}}}\,({I}_{d,\, t},{{{{{{{{{{\bf{H}}}}}}}}}}}_{\vec{d},\, d}),$$

(1)

where the first and second arguments represent the number of trials and the probabilities of the outcomes, respectively. By taking this Lagrangian approach to modeling human mobility, transmission across departments in our model can occur either by infected visitors transmitting to local susceptibles or susceptible visitors becoming infected by local infecteds, but not between infected visitors and susceptible visitors in a transient location. The relative occurrence of these events depends on the prevalence of infection, susceptibility, local transmission potential, and mobility patterns of a given pair of departments.

Within each department, we defined a variable representing the effective number of infections that could have generated new infections in week t (${I}_{d,\,t}^{{\prime\prime} }$) as

$${I}_{d,\,t}^{{\prime\prime} }=\mathop{\sum }\limits_{j=1}^{5}{\omega }_{j}^{GI}{I}_{d,t-j}^{\prime},$$

(2)

where ${\omega }_{j}^{GI}$ is the probability that the generation interval is j weeks⁶³. The relationship between ${I}_{d,\,t}^{{\prime\prime} }$ and the expected number of new local infections in week t + 1 (I_d,t+1) follows

$${I}_{d,t+1}={\beta }_{d,t}\frac{{I}_{d,\,t}^{{\prime\prime} }}{{N}_{d}}{S}_{d,t},$$

(3)

where β_d,t is the transmission coefficient, N_d is the total population, and S_d,t is the total susceptible population prior to local transmission in week t. We accounted for the role of stochasticity in transmission by using the stochastic analog of Eq. (3), such that

$${I}_{d,t+1} \sim \ {{{{{{{{{\rm{negative}}}}}}}}}}\ {{{{{{{{{\rm{binomial}}}}}}}}}}\left({\beta }_{d,t}\frac{{I}_{d,\,t}^{{\prime\prime} }}{{N}_{d}}{S}_{d,\,t},{I}_{d,\,t}^{{\prime\prime} }\right)$$

(4)

where the first and second arguments are the mean and dispersion parameters, respectively⁶⁰.

To allow for comparison of the model’s simulations of infections (I_d,t) with empirical data on reported cases (y_d,t), we applied a reporting probability (ρ) to simulated infections to obtain simulated cases (C_d,t), such that C_d,t ~ binomial(I_d,t, ρ). Using this, we defined the contribution to the overall log-likelihood of the model and its parameters from a given department d and week t as

$${\ell}{{{{{{{{{{\boldsymbol{\ell }}}}}}}}}}}_{d,t}({\vec{\theta }}_{t})={{{{{{{{\mathrm{ln}}}}}}}}}\,\left({{{{{{{{{\rm{negative}}}}}}}}}}\ {{{{{{{{{\rm{binomial}}}}}}}}}}({y}_{d,t}+1\ | \ \phi ,{C}_{d,t}+1)\right),$$

(5)

where ϕ is a dispersion parameter that we estimated to account for variability in case reporting beyond that captured by ρ. Shifting y_d,t and C_d,t by one in Eq. (5) was intended to safeguard against ℓ_d,t being undefined in situations where C_d,t = 0.

Assumptions about human mobility

We allowed for spatial coupling across departments in 12 of 16 models. In these models, we informed H in three alternative ways: (i) with mobility data extracted from mobile phone CDRs, (ii) with a gravity model, or (iii) with a radiation model (Fig. 1d–f). For the gravity model, we used parameters previously fitted to CDRs from Spain and validated in West Africa¹¹. For the radiation model, we calculated human mobility fluxes according to the standard formulation of this model⁶⁴, which depends only on the geographic distribution of the population. In four of 16 models, we assumed that departments were spatially uncoupled (Table 1), such that each department was modeled individually with its own set of parameters. In those models, each department’s epidemic was seeded independently with its own set of imported infections. Further details about the spatially uncoupled models can be found in the Supplementary Text.

Assumptions about environmental drivers of transmission

We parameterized the transmission coefficient, β_d,t, based on a description of the reproduction number, R_d,t, appropriate to environmental drivers for department d and time t. We considered two alternative formulations of R_d,t that was informed by data that were available prior to the first reported case of Zika in Colombia. Specifically, both of these alternative formulations used different outputs from previous modeling efforts^6,12, and because of this, they contain slightly different components. Both formulations were defined such that

$${\beta }_{d,t}=k{R}_{d,t}$$

(6)

where k is a scalar that we estimated over the course of the epidemic to account for the unknown magnitude of ZIKV transmission in Colombia. In addition to the summary below, further details about these formulations of R_d,t are provided in the Supplementary Methods.

The formulation of β_d,t that we refer to as “dynamic” is defined at each time t in response to average temperature at that time (T_d,t) and mosquito occurrence probability at that time (OP_d,t). This relationship can be expressed generically as

$${\beta }_{d,t}=k{\tilde{R}}_{d,t}({T}_{d,t},O{P}_{d,t}| c,\psi ,\alpha ,v),$$

(7)

where c, ψ, α, and v are parameters governing the relationship among T_d,t, OP_d,t, and ${\tilde{R}}_{d,t}$. We informed the component of ${\tilde{R}}_{d,t}$ related to mosquito density with monthly estimates of OP_d,t, which derive from geostatistical modeling of Aedes aegypti occurrence records globally⁵⁷. Other components of ${\tilde{R}}_{d,t}$, which include several temperature-dependent transmission parameters, were informed by laboratory estimates¹². Given that this formulation of ${\tilde{R}}_{d,t}$ was not validated against field data prior to the Zika epidemic in Colombia, we estimated values of c, ψ, α, and v over the course of the epidemic.

The formulation of β_d,t that we refer to as “static” is defined as a time-averaged value that is constant across all times t. Temporal variation in T_d,t is still accounted for, but its time-varying effect on R_d,t is averaged out over all times $\vec{t}$ to result in a temporally constant R_d. Mosquito occurrence probability is also incorporated through a temporally constant value (OP_d)⁵⁸. The relationship among these variables can be expressed generically as

$${\beta }_{d,\, t}=k{\bar{R}}_{d}({T}_{d,\, \vec{t}},O{P}_{d},PP{P}_{d}),$$

(8)

where PPP_d is purchasing power parity in department d (a feature not included in the dynamic model)⁶⁵. This input is an economic index that was intended to serve as a proxy for spatial variation in conditions that could affect exposure to mosquito biting, such as housing quality or air conditioning use⁶. Given that this formulation of ${\bar{R}}_{d}$ was informed by data from outbreaks of Zika and chikungunya prior to the Zika epidemic in Colombia, we did not estimate its underlying parameters over the course of the epidemic in Colombia.

Assumptions about introduction events

Although many ZIKV infections were likely imported into Colombia throughout the epidemic, we assumed that ZIKV introductions into either one or two departments drove the establishment of ZIKV in Colombia³¹. Under the two different scenarios, there was either one introduction event into one department or there were two independent introduction events into two randomly drawn departments. For each parameter set, the initial number of imported infections was seeded randomly between one and five in a single week, the timing of which was estimated as a parameter. Following the initial introduction(s), we assumed that ZIKV transmission was driven by a combination of movement of infected people among departments and local transmission within departments, as specified by each model.

Data assimilation and forecasting

For each particle, we produced a single forecast to “initialize” the model prior to the first reported case in Colombia. Beginning with the time of the first reported case in Colombia, we then assimilated new data, updated parameter estimates, and generated forecasts every four weeks, consistent with the four-week frequency used by Johansson et al. in an evaluation of dengue forecasts¹⁶. We specified 20,000 initial parameter sets (${\vec{\theta }}_{1,n}$), indexed by n, by drawing independent samples from prior distributions of each parameter⁶⁶ (see Supplementary Methods). Each parameter set was used to generate a corresponding particle: a stochastic realization of the state variables (I_d,1,n and C_d,1,n). At each assimilation period, we normalized log-likelihoods summed across departments over the preceding four weeks to generate particle weights,

$$\omega (t,n)=\frac{{\sum }_{d}\mathop{\sum }\nolimits_{\tau = t-3}^{t}{{{{{{{{{{\boldsymbol{\ell }}}}}}}}}}}_{\ell_{d,\tau} }({\vec{\theta }}_{t,n})}{{\sum }_{n}{\sum }_{d}\mathop{\sum }\nolimits_{\tau = t-3}^{t}{{{{{{{{{{\boldsymbol{\ell }}}}}}}}}}}_{\ell_{d,\tau} }({\vec{\theta }}_{t,n})}.$$

(9)

Proportional to these particle weights (ω(t, n)), we sampled 18,000 sets of corresponding parameters (${\vec{{{{{{{{{{\boldsymbol{\theta }}}}}}}}}}}}_{t}^{{{{{{{{{{\rm{resampled}}}}}}}}}}}$) and state variables ($\{{{{{{{{{{{\boldsymbol{I}}}}}}}}}}}_{d,t}^{{{{{{{{{{\rm{resampled}}}}}}}}}}},{{{{{{{{{{\boldsymbol{C}}}}}}}}}}}_{d,t}^{{{{{{{{{{\rm{resampled}}}}}}}}}}}\}$) from time t with replacement and used them at the next data assimilation step four weeks later, where boldface indicates a set of parameters or state variables, respectively, overall n. In doing so, information including the initial prior assumptions (${\vec{\theta }}_{1,n}$) and the likelihoods at each four-week period was assimilated into the model sequentially over time. Given that particle filtering algorithms are susceptible to particle drift—or the convergence of particles onto very few states through iterative resampling³³—we also generated 2,000 new parameter sets at each data assimilation step. To do so, we drew random samples of model parameters from a multivariate normal distribution with parameter means and covariances fitted to the resampled 18,000 parameter sets (${\vec {{{{{{{{{{\boldsymbol{\theta }}}}}}}}}}}}_{t}^{{{{{{{{{{\rm{resampled}}}}}}}}}}}$). Whereas the 18,000 resampled parameter sets already included simulated values of state variables I_d,t,n and C_d,t,n through time t, the 2000 new parameter sets did not and so we informed initial conditions of ${{{{{{{{{{\boldsymbol{I}}}}}}}}}}}_{d,t}^{{{{{{{{{{\rm{new}}}}}}}}}}}$ with draws from ${{{{{{{{{{\boldsymbol{I}}}}}}}}}}}_{d,\iota :t}^{{{{{{{{{{\rm{resampled}}}}}}}}}}}$ for those parameter sets at the time they were created. Together, the 18,000 resampled parameter sets (${\vec {{{{{{{{{{\boldsymbol{\theta }}}}}}}}}}}}_{t}^{{{{{{{{{{\rm{resampled}}}}}}}}}}}$) and the 2000 new parameter sets (${\vec {{{{{{{{{{\boldsymbol{\theta }}}}}}}}}}}}_{t}^{{{{{{{{{{\rm{new}}}}}}}}}}}$) constituted the set of parameter sets used as input for the next data assimilation step (${\vec {{{{{{{{{{\boldsymbol{\theta }}}}}}}}}}}}_{t+4}=\{{\vec {{{{{{{{{{\boldsymbol{\theta }}}}}}}}}}}}_{t}^{{{{{{{{{{\rm{resampled}}}}}}}}}}},{\vec {{{{{{{{{{\boldsymbol{\theta }}}}}}}}}}}}_{t}^{{{{{{{{{{\rm{new}}}}}}}}}}}\}$). We also used this new set of parameters as the basis for forecasts made at time t, which simply involved simulating forward a single realization of the model for each parameter set.

Evaluating forecast performance

At each of the 15-time points at which we performed data assimilation through the 60-week forecasting period, we created an ensemble forecast that evenly weighted contributions from each of the 16 models⁴⁶. To populate this ensemble, we specified 20,000 total samples, with 1250 samples from each model. We assessed the model-specific performance of individual and ensemble forecasts using log scores, which are forecast scoring rules that assess both the precision and accuracy of forecasts⁶⁷. For a specific forecasting target, z, and model, m, the log score is defined as ${{{{{{{{{\rm{log}}}}}}}}}}{f}_{m}({z}^{* }| {{{{{{{{{\bf{x}}}}}}}}}})$, where f_m(z∣x) is the predicted density conditioned on the data, x, and z^* is the empirical value of the target Z¹⁶.

We computed log scores for departmental and national incidence over each four-week assimilation period. Following¹⁷, we used an EM algorithm to generate ensemble weights for each model in each assimilation period. For each model, we computed 32 log scores (i.e., one for each department and one nationally). To compute the ensemble weight for a given model feature, such as the static R assumption, we summed the weights of all models with that feature.

We assessed target-oriented forecast performance using log scores for three forecasting targets: timing of peak week (within two weeks), incidence at peak week, and onset week, which we defined as the week by which ten cumulative cases occurred. These choices were motivated by forecasting assessments for influenza and dengue^16,17,18,68 and deemed applicable to public health objectives for forecasting an emerging pathogen such as Zika. For peak week and onset week, we used modified log scores¹⁸, such that predictions within two weeks of the correct week were considered accurate. We evaluated a total of 7680 log scores, reflecting three targets for each of 16 models in each of 31 departments plus at the national level and at each of 15-time points at which data assimilation occurred.

As log scores only yield a relative measure of model performance, we used forecasting scores¹⁸ as a way to retrospectively compare forecast performance for different forecasting targets. Whereas log scores are preferable for comparing performance across models on the same data, forecasting scores are preferable for comparing forecast performance across data composed of different locations and times. A forecasting score is defined simply as the exponential of the average log score, where the latter reflects an average over one or more indices, such as models, time points, targets, or locations.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The mobile phone data set used in this study is proprietary and subject to strict privacy regulations. Access to this data set was granted after reaching a non-disclosure agreement with the proprietor, who anonymized and aggregated the original data before giving access to the authors. Access to the dataset is controlled and restricted under strict security and privacy measures due to the company’s policy towards preserving customer’s data privacy (in accordance with existing data protection regulations) as well as protecting business secrecy. The data could be available on request after negotiation of a non-disclosure agreement. The response to any request shall be provided within the next 15 business days. The contact person is Pedro A. de Alarcón (pedroantoniode.alarconsanchez@telefonica.com). The epidemiological, meteorological, and demographic data are publicly available from Siraj et al.²⁹ (Dryad repository: https://doi.org/10.5061/dryad.83nj1) and additionally available on GitHub (https://github.com/roidtman/eid_ensemble_forecasting).

Code availability

The code used to fit models, produce forecasts, analyze forecast outputs, and produce figures are available on GitHub (https://github.com/roidtman/eid_ensemble_forecasting) and Zenodo (https://doi.org/10.5281/zenodo.5176776).

References

Jones, K. E. et al. Global trends in emerging infectious diseases. Nature 451, 990–993 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Smith, K. F. et al. Global rise in human infectious disease outbreaks. J. R. Soc. Interface 11, 20140950 (2014).
Article PubMed PubMed Central Google Scholar
Bedford, J. et al. A new twenty-first century science for effective epidemic response. Nature 575, 130–136 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Iacono, G. L. et al. Using modelling to disentangle the relative contributions of zoonotic and anthroponotic transmission: the case of lassa fever. PLOS Negl. Trop. Dis. 9, 1–13 (2015).
Article CAS Google Scholar
Quinn, T. C. Global burden of the HIV pandemic. Lancet 348, 99–106 (1996).
Article CAS PubMed Google Scholar
Perkins, T. A., Siraj, A. S., Ruktanonchai, C. W., Kraemer, M. U. G. & Tatem, A. J. Model-based projections of zika virus infections in childbearing women in the americas. Nat. Microbiol. 1, 16126 (2016).
Article CAS PubMed Google Scholar
Kraemer, M. U. G. et al. Spread of yellow fever virus outbreak in angola and the democratic republic of the congo 2015-2016: a modelling study. Lancet Infect. Dis. 17, 330–338 (2017).
Article PubMed PubMed Central Google Scholar
Metcalf, C. J. E. & Lessler, J. Opportunities and challenges in modeling emerging infectious diseases. Science 357, 149–152 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Lloyd-Smith, J. O., Schreiber, S. J., Kopp, P. E. & Getz, W. M. Superspreading and the effect of individual variation on disease emergence. Nature 438, 355–359 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Wesolowski, A. et al. Impact of human mobility on the emergence of dengue epidemics in Pakistan. Proc. Natl Acad. Sci. USA 112, 11887–11892 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Kraemer, M. U. G. et al. Utilizing general human movement models to predict the spread of emerging infectious diseases in resource poor settings. Sci. Rep. 9, 5151 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Mordecai, E. A. et al. Detecting the impact of temperature on transmission of zika, dengue, and chikungunya using mechanistic models. PLOS Negl. Trop. Dis. 11, 1–18 (2017).
Article Google Scholar
Nikolay, B. et al. Transmission of Nipah virus—14 years of investigations in Bangladesh. N. Engl. J. Med. 380, 1804–1814 (2019).
Article PubMed PubMed Central Google Scholar
Dudas, G., Carvalho, L. M., Rambaut, A. & Bedford, T. Mers-cov spillover at the camel-human interface. eLife 7, e31257 (2018).
Article PubMed PubMed Central Google Scholar
Shea, K. et al. Harnessing multiple models for outbreak management. Science 368, 577–579 (2020).
Article ADS CAS PubMed Google Scholar
Johansson, M. A. et al. An open challenge to advance probabilistic forecasting for dengue epidemics. Proc. Natl Acad. Sci. USA 116, 24268–24274 (2019).
Article CAS PubMed PubMed Central Google Scholar
Reich, N. G. et al. Accuracy of real-time multi-model ensemble forecasts for seasonal influenza in the U.S. PLOS Comput. Biol. 15, 1–19 (2019).
Article ADS CAS Google Scholar
Reich, N. G. et al. A collaborative multiyear, multimodel assessment of seasonal influenza forecasting in the United States. Proc. Natl Acad. Sci. USA 116, 3146–3154 (2019).
Article CAS PubMed PubMed Central Google Scholar
McGowan, C. J. et al. Collaborative efforts to forecast seasonal influenza in the United States, 2015–2016. Sci. Rep. 9, 683 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Johnson, L. R. et al. Phenomenological forecasting of disease incidence using heteroskedastic Gaussian processes: a dengue case study. Ann. Appl. Stat. 12, 27–66 (2018).
Article MathSciNet MATH Google Scholar
Del Valle, S. Y. et al. Summary results of the 2014-2015 DARPA Chikungunya challenge. BMC Infect. Dis. 18, 245 (2018).
Article PubMed PubMed Central Google Scholar
Viboud, C. et al. The rapid Ebola forecasting challenge: synthesis and lessons learnt. Epidemics 22, 13–21 (2018).
Article PubMed Google Scholar
ZIKAVAT Collaboration, et al. Preliminary results of models to predict areas in the Americas with increased likelihood of Zika virus transmission in 2017. Preprint at bioRxiv https://doi.org/10.1101/187591 (2017).
Shea, K. et al. Covid-19 reopening strategies at the county level in the face of uncertainty: multiple models for outbreak decision support. Preprint at medRxiv https://doi.org/10.1101/2020.11.03.20225409 (2020).
George, D. B. et al. Technology to advance infectious disease forecasting for outbreak management. Nat. Commun. 10, 3932 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Ray, E. L. et al. Ensemble forecasts of coronavirus disease 2019 (Covid-19) in the U.S. Preprint at medRxiv https://doi.org/10.1101/2020.08.19.20177493 (2020).
Brooks, L. C., Farrow, D. C., Hyun, S., Tibshirani, R. J. & Rosenfeld, R. Nonmechanistic forecasts of seasonal influenza with iterative one-week-ahead distributions. PLOS Comput. Biol. 14, 1–29 (2018).
Article CAS Google Scholar
Chowell, G. et al. Real-time forecasting of epidemic trajectories using computational dynamic ensembles. Epidemics 30, 100379 (2020).
Article Google Scholar
Siraj, A. S. et al. Spatiotemporal incidence of zika and associated environmental drivers for the 2015-2016 epidemic in Colombia. Sci. Data 5, 180073 (2018).
Article PubMed PubMed Central Google Scholar
Kraemer, M. U. G. et al. The effect of human mobility and control measures on the covid-19 epidemic in china. Science 368, 493–497 (2020).
Article ADS CAS PubMed Google Scholar
Black, A. et al. Genomic epidemiology supports multiple introductions and cryptic transmission of zika virus in Colombia. BMC Infect. Dis. 19, 963 (2019).
Article PubMed PubMed Central Google Scholar
Rosenfeld, R. The EM Algorithm. http://www.cs.cmu.edu/afs/cs.cmu.edu/academic/class/11761-s97/WWW/tex/EM.ps (1997).
Dietze, M. C. Ecological Forecasting. (Princeton University Press, 2017).
DeFelice, N. B., Little, E., Campbell, S. R. & Shaman, J. Ensemble forecast of human West Nile virus cases and mosquito infection rates. Nat. Commun. 8, 14592 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Yang, W., Karspeck, A. & Shaman, J. Comparison of filtering methods for the modeling and retrospective forecasting of influenza epidemics. PLOS Comput. Biol. 10, 1–15 (2014).
Article CAS Google Scholar
Baker, R. E., Yang, W., Vecchi, G. A., Metcalf, C. J. E. & Grenfell, B. T. Susceptible supply limits the role of climate in the early Sars-Cov-2 pandemic. Science 369, 315–319 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Cauchemez, S. et al. Local and regional spread of chikungunya fever in the Americas. Eur. Surveill. 19, 20854–20854 (2014).
Article CAS Google Scholar
Johansson, M. A., Powers, A. M., Pesik, N., Cohen, N. J. & Staples, J. E. Nowcasting the spread of chikungunya virus in the Americas. PLoS ONE 9, 1–8 (2014).
Article Google Scholar
Moore, S. M. et al. Local and regional dynamics of chikungunya virus transmission in Colombia: the role of mismatched spatial heterogeneity. BMC Med. 16, 152 (2018).
Article PubMed PubMed Central Google Scholar
Lai, S. et al. Seasonal and interannual risks of dengue introduction from south-east Asia into china, 2005–2015. PLOS Negl. Trop. Dis. 12, 1–16 (2018).
Article Google Scholar
Lindström, T., Tildesley, M. & Webb, C. A Bayesian ensemble approach for epidemiological projections. PLOS Comput. Biol. 11, 1–30 (2015).
Article CAS Google Scholar
Yamana, T. K., Kandula, S. & Shaman, J. Superensemble forecasts of dengue outbreaks. J. R. Soc. Interface 13, 20160410 (2016).
Article PubMed PubMed Central Google Scholar
McAndrew, T. & Reich, N. G. Adaptively stacking ensembles for influenza forecasting with incomplete data. https://forecasters.org/blog/2021/04/09/challenges-in-training-ensembles-to-forecast-covid-19-cases-and-deaths-in-the-united-states/ (2020).
Ray, E. L. et al. Challenges in training ensembles to forecast covid-19 cases and deaths in the united states. Int. Inst. Forecasters (2021).
Perkins, T. A. et al. Estimating unobserved Sars-Cov-2 infections in the united states. Proc. Natl Acad. Sci. USA 117, 22597–22602 (2020).
Article CAS PubMed PubMed Central Google Scholar
McAndrew, T., Wattanachit, N., Gibson, G. C. & Reich, N. G. Aggregating predictions from experts: a review of statistical methods, experiments, and applications. WIREs Comput. Stat. 13, e1514 (2021).
Bodner, K., Fortin, M.-J. & Molnár, P. K. Making predictive modelling art: accurate, reliable, and transparent. Ecosphere 11, e03160 (2020).
Article Google Scholar
Li, S.-L. et al. Essential information: Uncertainty and optimal control of Ebola outbreaks. Proc. Natl Acad. Sci. USA 114, 5659–5664 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pei, S. & Shaman, J. Counteracting structural errors in ensemble forecast of influenza outbreaks. Nat. Commun. 8, 925 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Allard, A., Althouse, B. M., Hébert-Dufresne, L. & Scarpino, S. V. The risk of sustained sexual transmission of zika is underestimated. PLOS Pathog. 13, 1–12 (2017).
Article CAS Google Scholar
Marinović, A. B., Swaan, C., van Steenbergen, J. & Kretzschmar, M. Quantifying reporting timeliness to improve outbreak control. Emerg. Infect. Dis. J. 21, 209 (2015).
Article Google Scholar
Keeling, M. J. & Rohani, P. Modeling Infectious Diseases in Humans and Animals. (Princeton University Press, 2008).
Figueiredo, L. T., Cavalcante, S. M. & Simões, M. C. Dengue serologic survey of schoolchildren in rio de janeiro, brazil, in 1986 and 1987. Bull. Pan Am. Health Organ. 24, 217–225 (1990).
CAS PubMed Google Scholar
Lim, J. T., Han, Y., Lee Dickens, B. S., Ng, L. C. & Cook, A. R. Time varying methods to infer extremes in dengue transmission dynamics. PLOS Comput. Biol. 16, 1–19 (2020).
Article Google Scholar
Sun, H. et al. Prevalent Eurasian avian-like H1N1 swine influenza virus with 2009 pandemic viral genes facilitating human infection. Proc. Natl Acad. Sci. USA 117, 17204–17210 (2020).
Buckee, C. O. et al. Aggregated mobility data could help fight covid-19. Science 368, 145–146 (2020).
Article ADS PubMed Google Scholar
Bogoch, I. I. et al. Potential for zika virus introduction and transmission in resource-limited countries in Africa and the Asia-pacific region: a modelling study. Lancet Infect. Dis. 16, 1237–1245 (2016).
Article PubMed PubMed Central Google Scholar
Kraemer, M. U. G. et al. The global distribution of the arbovirus vectors Aedes aegypti and Ae. albopictus. eLife 4, e08347 (2015).
Article PubMed PubMed Central Google Scholar
Wesolowski, A., Eagle, N., Noor, A. M., Snow, R. W. & Buckee, C. O. Heterogeneous mobile phone ownership and usage patterns in Kenya. PLoS ONE 7, 1–6 (2012).
Article Google Scholar
Xia, Y., Bjørnstad, O. N. & Grenfell, B. T. Measles metapopulation dynamics: a gravity model for epidemiological coupling and dynamics. Am. Nat. 164, 267–281 (2004).
Article PubMed Google Scholar
Oidtman, R. J. et al. Inter-annual variation in seasonal dengue epidemics driven by multiple interacting factors in Guangzhou, China. Nat. Commun. 10, 1148, https://doi.org/10.1038/s41467-019-09035-x (2019).
Perkins, T. A., Metcalf, C. J. E., Grenfell, B. T. & Tatem, A. J. Estimating drivers of autochthonous transmission of chikungunya virus in its invasion of the Americas. PLOS Curr. Outbreaks (2017).
Siraj, A. S. et al. Temperature modulates dengue virus epidemic growth rates through its effects on reproduction numbers and generation intervals. PLOS Negl. Trop. Dis. 11, 1–19 (2017).
Article Google Scholar
Simini, F., González, M. C., Maritan, A. & Barabási, A.-L. A universal model for mobility and migration patterns. Nature 484, 96–100 (2012).
Article ADS CAS PubMed Google Scholar
Nordhaus, W. D. Geography and macroeconomics: new data and new findings. Proc. Natl Acad. Sci. USA 103, 3510–3517 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Arulampalam, M. S., Maskell, S., Gordon, N. & Clapp, T. A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. Signal Process. 50, 174–188 (2002).
Article ADS Google Scholar
Gneiting, T., Balabdaoui, F. & Raftery, A. Probabilistic forecasts, calibration and sharpness. J. R. Stat. Soc. 69, 243–268 (2007).
Article MathSciNet MATH Google Scholar
Pei, S., Kandula, S., Yang, W. & Shaman, J. Forecasting the spatial transmission of influenza in the United States. Proc. Natl Acad. Sci. USA 115, 2752–2757 (2018).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to thank Clara Palau Montava for help with managing the early stages of this project and Chris Fabian, Evan Wheeler, and Vedran Sekara for comments, suggestions, and support throughout the duration of this project. The authors would like to thank Sebastian Baña for technical support in running models on the Databricks computing platform. The authors would additionally like to thank the UNICEF-Colombia Representative, Aida Oliver Arostegui, INS Director, Martha Lucia Ospina Martinez, and the past and present Ministers of the Colombia Ministry of Health, Juan Pablo Uribe Restrepo and Fernado Ruiz Gomez. Lastly, the authors would like to thank two anonymous reviewers for their constructive comments and useful suggestions.

R.J.O. acknowledges support from an Eck Institute for Global Health Fellowship, GLOBES grant, Arthur J. Schmitt Fellowship, and the UNICEF Office of Innovation. M.U.G.K. is supported by The Branco Weiss Fellowship—Society in Science, administered by the ETH Zurich and acknowledges funding from the Oxford Martin School and the European Union Horizon 2020 project MOOD (#874850). J.H.H. acknowledges funding from a National Science Foundation Graduate Research Fellowship and a Richard and Peggy Notebaert Premier Fellowship. S.C.H. is supported by the Wellcome Trust (220414/Z/20/Z). This research was funded in whole, or in part, by the Wellcome Trust [Grant no. 220414/Z/20/Z]. For the purpose of open access, the author has applied a CC BY public copyright license to any Author Accepted paper version arising from this submission. CMB, MAJ, CAM, RCR Jr., IR-B, ASS, and TAP were supported by a RAPID grant from the National Science Foundation (DEB 1641130).

Author information

Authors and Affiliations

Department of Biological Sciences and Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, USA
Rachel J. Oidtman, Guido España, John H. Huber, Amir S. Siraj & T. Alex Perkins
UNICEF, New York, NY, USA
Rachel J. Oidtman, Elisa Omodei & Manuel García-Herranz
Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
Rachel J. Oidtman
Department of Zoology, University of Oxford, Oxford, UK
Moritz U. G. Kraemer & Sarah C. Hill
Boston Children’s Hospital, Boston, MA, USA
Moritz U. G. Kraemer
Harvard Medical School, Boston, MA, USA
Moritz U. G. Kraemer
Instituto Nacional de Salud, Bogotá, Colombia
Carlos A. Castañeda-Orjuela, Erica Cruz-Rivera & Sandra Misnaza-Castrillón
Ministerio de Salud y Protección Social, Bogotá, Colombia
Myriam Patricia Cifuentes & Luz Emilse Rincon
UNICEF, Bogotá, Colombia
Viviana Cañon
LUCA Telefonica Data Unit, Madrid, Spain
Pedro de Alarcon
Department of Pathobiology and Population Sciences, The Royal Veterinary College, London, UK
Sarah C. Hill
Department of Pathology, Microbiology, and Immunology, School of Veterinary Medicince, University of California, Davis, CA, USA
Christopher M. Barker
Division of Vector-Borne Diseases, Centers for Disease Control and Prevention, San Juan, Puerto Rico
Michael A. Johansson
Information Systems and Modeling (A-1), Los Alamos National Laboratory, Los Alamos, NM, USA
Carrie A. Manore
Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA, USA
Robert C. Reiner, Jr.
Department of Medicine, University of California, San Francisco, CA, USA
Isabel Rodriguez-Barraquer
Telefonica Research, Madrid, Spain
Enrique Frias-Martinez

Authors

Rachel J. Oidtman
View author publications
You can also search for this author in PubMed Google Scholar
Elisa Omodei
View author publications
You can also search for this author in PubMed Google Scholar
Moritz U. G. Kraemer
View author publications
You can also search for this author in PubMed Google Scholar
Carlos A. Castañeda-Orjuela
View author publications
You can also search for this author in PubMed Google Scholar
Erica Cruz-Rivera
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Misnaza-Castrillón
View author publications
You can also search for this author in PubMed Google Scholar
Myriam Patricia Cifuentes
View author publications
You can also search for this author in PubMed Google Scholar
Luz Emilse Rincon
View author publications
You can also search for this author in PubMed Google Scholar
Viviana Cañon
View author publications
You can also search for this author in PubMed Google Scholar
Pedro de Alarcon
View author publications
You can also search for this author in PubMed Google Scholar
Guido España
View author publications
You can also search for this author in PubMed Google Scholar
John H. Huber
View author publications
You can also search for this author in PubMed Google Scholar
Sarah C. Hill
View author publications
You can also search for this author in PubMed Google Scholar
Christopher M. Barker
View author publications
You can also search for this author in PubMed Google Scholar
Michael A. Johansson
View author publications
You can also search for this author in PubMed Google Scholar
Carrie A. Manore
View author publications
You can also search for this author in PubMed Google Scholar
Robert C. Reiner, Jr.
View author publications
You can also search for this author in PubMed Google Scholar
Isabel Rodriguez-Barraquer
View author publications
You can also search for this author in PubMed Google Scholar
Amir S. Siraj
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Frias-Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Manuel García-Herranz
View author publications
You can also search for this author in PubMed Google Scholar
T. Alex Perkins
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.J.O., E.O., M.U.G.K., C.M.B., M.A.J., C.A.M., R.C.R., I.R.-B., M.G.-H., and T.A.P. conceptualized the study; R.J.O., E.O., M.U.G.K., C.A.C.-O., E.C.-R., A.M.-C., P.C., L.E.R., V.C., P.A., G.E., J.H.H., S.C.H., A.S.S., E.F.-M., and M.G.-H. provided and/or processed data; R.J.O., E.O., M.U.G.K., C.A.-O., E.C.-R., S.M.-C., P.C., L.E.R., V.C., P.A., E.F.-M., M.G.-H., and T.A.P. participated in biweekly meetings to provide feedback on research; R.J.O., E.O., M.U.G.K., M.G.-H., and T.A.P. developed the model and wrote the first draft of the paper; R.J.O., E.O., M.U.G.K., J.H.H., and S.C.H. analyzed the data; E.O., M.G.-H., and T.A.P. supervised the research; all authors reviewed the paper.

Corresponding authors

Correspondence to Rachel J. Oidtman, Manuel García-Herranz or T. Alex Perkins.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Oidtman, R.J., Omodei, E., Kraemer, M.U.G. et al. Trade-offs between individual and ensemble forecasts of an emerging infectious disease. Nat Commun 12, 5379 (2021). https://doi.org/10.1038/s41467-021-25695-0

Download citation

Received: 03 March 2021
Accepted: 23 August 2021
Published: 10 September 2021
DOI: https://doi.org/10.1038/s41467-021-25695-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.