Reconstructing a spatially heterogeneous epidemic: Characterising the geographic spread of 2009 A/H1N1pdm infection in England

Birrell, Paul J.; Zhang, Xu-Sheng; Pebody, Richard G.; Gay, Nigel J.; De Angelis, Daniela

doi:10.1038/srep29004

Download PDF

Article
Open access
Published: 11 July 2016

Reconstructing a spatially heterogeneous epidemic: Characterising the geographic spread of 2009 A/H1N1pdm infection in England

Paul J. Birrell¹^na1,
Xu-Sheng Zhang²^na1,
Richard G. Pebody²^na1,
Nigel J. Gay³^na1 &
…
Daniela De Angelis^1,2^na1

Scientific Reports volume 6, Article number: 29004 (2016) Cite this article

2362 Accesses
8 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Understanding how the geographic distribution of and movements within a population influence the spatial spread of infections is crucial for the design of interventions to curb transmission. Existing knowledge is typically based on results from simulation studies whereas analyses of real data remain sparse. The main difficulty in quantifying the spatial pattern of disease spread is the paucity of available data together with the challenge of incorporating optimally the limited information into models of disease transmission. To address this challenge the role of routine migration on the spatial pattern of infection during the epidemic of 2009 pandemic influenza in England is investigated here through two modelling approaches: parallel-region models, where epidemics in different regions are assumed to occur in isolation with shared characteristics; and meta-region models where inter-region transmission is expressed as a function of the commuter flux between regions. Results highlight that the significantly less computationally demanding parallel-region approach is sufficiently flexible to capture the underlying dynamics. This suggests that inter-region movement is either inaccurately characterized by the available commuting data or insignificant once its initial impact on transmission has subsided.

Modelling the propagation of infectious disease via transportation networks

Article Open access 29 November 2022

Modelling and predicting the spatio-temporal spread of COVID-19, associated deaths and impact of key risk factors in England

Article Open access 08 March 2021

A novel geo-hierarchical population mobility model for spatial spreading of resurgent epidemics

Article Open access 12 July 2021

Introduction

Transmission and spread of infectious diseases depend, in part, on the frequency with which infected people come into contact with susceptible individuals. Understanding the spatial heterogeneity of transmission and spread from one location to another is crucial for policymakers to allocate healthcare resources and to design effective control strategies. This has been illustrated for influenza by simulation studies using spatial models of transmission, at global^1,2,3, continental⁴ and national levels^5,6,7,8, providing useful information on the role of spatial factors and control measures on the spread of infection. Estimation of such roles from data, as opposed to exploring them through simulation, is much more complex and is typically constrained by a paucity of data to identify the spatial dynamics of infection. Recently, finely resolved spatial and temporal influenza data has been used to estimate the spread of infection during the autumn 2009 wave of A/H1N1pdm influenza in the US, finding that it was dominated by short-range transmission events⁹. This type of study is, however, rare and hard evidence of how heterogeneity in demographic processes can influence transmission remains limited^10,11.

The global 2009 A/H1N1pdm outbreak gave rise to an epidemic in England characterised by two distinct waves of infection, occurring, atypically, in summer and in late autumn of 2009, outside of the traditional flu season. During this outbreak, sero-epidemiological data showed significant heterogeneity in the timing of the pandemic across the various government office regions (GORs)^12,13. This information, alongside a number of complementary data streams, was used to disentangle the complicated processes of transmission dynamics and disease reporting for London¹⁴. London was treated as a closed system fed by an initial number of infectious individuals, leading to two distinct epidemic waves with the peak times of infection mainly driven by the influence of school holidays on contact patterns. Using related data, a SEIR epidemic system was developed to estimate transmission in the whole of England¹⁵. The sampling of both the serological and, in particular, the virological data used was very uneven across England, being concentrated in regions of particularly high disease transmission. To provide a meaningful local description of the epidemic using data of this type, it is important to aggregate data at a spatial resolution that gives sufficiently large within-region sample sizes while still making assumptions of homogeneous mixing within spatial units justifiable.

Here we extend previous work¹⁴ by developing multi-region modelling approaches to investigate spatial transmission and the possible role of inter-region movements in the spread of infection in England. We consider two types of model: a parallel-region (PR) model, where epidemics in different regions are assumed to occur in isolation, but are described by models with some common parameters; and a meta-region (MR) model, where the epidemic acts on a single population, stratified by age and region, with the populations from each stratum interacting through commuter flux. We use these approaches to explore the spread of the first two waves of 2009 pandemic influenza across England, estimating their dynamic characteristics based on a range of epidemic surveillance data including general practitioner (GP) consultations, seropositivity, virological positivity and case confirmations (see Fig. 1 for the London data).

Results

We have divided England into four regions: London, West Midlands, the North and the South (see Materials and Methods: Data). These four regions are assumed either to be non-interacting, spatially disjoint populations (PR model) or to interact with each other via the movements of commuters within a single population subdivided into strata defined by the regions (MR model). Within each population, the model is as described in detail in Birrell et al.¹⁴. Briefly, the model includes a transmission component that feeds newly infected individuals into a disease and reporting component describing the progress of infected individuals to symptomatic illness and the mechanisms through which this illness is reported to the healthcare system. Table 1 itemises the model parameters to be estimated, specifying their spatial heterogeneity under both approaches. In expanding the model of Birrell et al.¹⁴ to the MR model, there are a number of modelling choices to be made: the handling of density dependent effects on transmission; the distribution of the seeding of infectious individuals; and the assumption of fixed versus random commuting. These issues are discussed in depth in the Methods: Modelling Approaches section and references therein. The MR model results presented here assume a model variant that has density dependence according to the size of the regional population; an assumption of random commuting (where every member of the population is assumed equally likely to commute on any given day); and an empirical-based seeding for the number of infections prior to the start date of May 1st 2009, the so-called ‘extended empirical’ seeding described in Supplementary Information (SI) Section 1.4.3.

Table 1 Model parameters classified in the parallel-region (PR) and meta-region (MR) models as either being ‘spatial’, where region-specific, or ‘global’.

Full size table

Reconstructing the epidemic

The two models are sufficiently flexible to reproduce the two epidemic waves of 2009 pandemic influenza (Fig. 2, SI Figs S2–S5). The estimated epidemic in the North is consistent across models. London and the West Midlands are characterized by bigger first waves of infection (and subsequently smaller second waves) under the PR model, with the opposite being estimated for the South. This is apparent from the height of the peaks in Fig. 2 and the attack rates in Table 2. Peak timings in both waves of infection are the same under both modelling approaches and coincide with the start of a school holiday. The exception to this is the second wave in the West Midlands, the region with the lowest estimated attack rate. Here, a sufficient supply of susceptible individuals remains in the population to allow transmission to increase once more (albeit briefly) when the schools re-opened after the short holiday. For comparison with other studies, SI Table S4 breaks these down into age-stratified results summarised at both regional and national levels.

Table 2 Posterior median and 95% CrI for cumulative incidence of infection, number of cases (thousands) and attack rates, by region and by pandemic wave (May-August or September-December).

Full size table

Estimated epidemic characteristics

Table 3 presents estimates of some key transmission parameters under each model. Estimates for the reproductive number (R₀) are centred on 1.8, consistent across modelling approaches and, in the PR model, across regions. Similarly, the estimates for the other transmission parameters are robust to the model specification (note the overlapping nature of the credible intervals (CrIs) in Table 3). In particular, estimates for m₁ indicate that the POLYMOD-estimated contact rates involving at least one adult had to be down-weighted by a factor of between 0.57 and 0.62. Estimates for m₃ indicate instead that the summer school holiday period led to a rather dramatic decline in effective contact rates among 5–14 year-olds, with the resulting rate being less than 1% of that during school terms. By comparing m₅ with m₃, it is seen that both models identify a much weaker effect for the other, shorter, school holidays, their shorter duration causing milder disruption to usual contact patterns.

Table 3 Posterior median and 95% CrI for key parameters by model.

Full size table

Model performance

The overall fit of the PR model is superior to that of the MR models considered. The PR model has a greater flexibility due to the greater number of free model parameters to be estimated: R₀ and the initial level of infectiousness, I₀, are each described by four region-specific parameters, quantities represented by just one global parameter in the MR model. However, even taking this into account, there is enough evidence (see the bottom two rows of SI Table S3) in favour of the PR model to suggest a significant improvement in the fit of the model. This compounds the practical benefit of the PR model being faster to implement; it is much more suited to parallel computation and only requires the calculation of the spectral radius of (7 × 7) next generation matrices as opposed to (28 × 28) matrices for the MR model.

Sensitivity to specification of the meta-region model

So far results from a ‘best’ variant of the MR model have been presented. We have also investigated a number of alternative parameterisations for this approach and the set of models considered is discussed further in Methods: Modelling Approaches. Density dependence, not a major consideration in the PR model, is best accounted for in the MR model by replacing N_ra with N_r = Σ_aN_ra. in Equation (3) and setting α = 1. This represents density dependent effects that are determined by the size of the regional population and not the population sizes of the individual stratum. Additionally, the model performs better when given the ‘extended empirical’ seeding (see SI Section 1.4.3) as opposed to any of those based purely on disease-free equilibria as done elsewhere¹⁴. The differences observed in the fit of the model when comparing the assumption of a fixed group of commuters versus random commuting are highly sensitive to the precise parameterisation. In the preferred model presented here, there is little difference between the two hypotheses, suggesting that the crude random commuting assumption is adequate enough. With no consistent difference in the model fit, random commuting requires fewer evaluations of SI Equations S1 and S4 and is, therefore, computationally more efficient to implement.

Discussion

We have conducted a coherent, unified, Bayesian statistical analysis of multiple streams of epidemic surveillance data from the 2009 A/H1N1pdm outbreak in England, producing age and region stratified epidemic reconstructions (with associated uncertainty) and robust estimates for some key parameters of the transmission process. In particular, we have assessed the strengths and weaknesses of two different approaches in the presence of strong regional heterogeneity in the spread of influenza infection. Both the PR and MR models fit adequately well to the various data sources, with highly comparable estimates for both model parameters and epidemic characteristics that are consistent with existing literature. Results highlight that the PR approach is parsimonious yet sufficiently flexible to capture the underlying dynamics. This may imply that the impacts of inter-regional movement are either inaccurately characterized by the available commuting data or not significant beyond a transient initial forcing.

Spatial heterogeneity in transmission arising from the interaction between regional populations, is incorporated in the MR model through commuting flows. Therefore, the MR model has the capacity to predict the spatial spread of influenza infection early in an epidemic, as infection is transmitted according to these flows. The PR model is ‘non-parametric’, in the sense that the parameters representing the epidemic growth and initial seeding of infectiousness in each region are estimated without being subject to any parametric assumption. The timing of the epidemic waves in each region is highly dependent on these parameters and estimation of the respective epidemic curves requires some epidemic activity in all regions. Early in a pandemic, therefore, the MR approach is more useful in a predictive modelling setting. However, as discussed in the Results section, the MR approach involves an additional computational burden that limits its use as a tool for timely epidemic tracking as data accumulate over time.

The non-parametric nature of the spatial variation in transmission of the PR model confers on it greater flexibility, lending it an advantage when it comes to epidemic reconstruction, observed here in a significant improvement in model fit. An additional advantage is that this modelling approach does not rely on the validity of the commuter data to describe the spread of infection, nor does it rely on the assumptions that individuals maintain routine commuting behaviour regardless of infection status.

Despite the spatial variation in epidemic growth rates, the PR model provides estimates for R₀ that are consistent across regions (see Table 3, caption). Therefore, the spatial heterogeneity in infection is being accounted for through the initial seeding of infectiousness. It has been seen elsewhere that long-range interactions have a declining role in the spread of a pandemic once infection is widespread in each region^3,8,10. This is exacerbated for A/H1N1pdm influenza as school-age children, the demographic group most affected, do not contribute to commuter flows. Therefore, an improved fit of the MR model would most effectively be achieved through more flexible estimation of the initial seeding of infectiousness.

One variant of the MR model investigated here involved the stratification of the population within each region into commuters and non-commuters¹⁶. This has the effect of assuming each region contains a fixed sub-population of individuals who commute daily. This yields no consistent improvement in model performance, whilst increasing even further the computational cost. Factoring in the ‘random’ movements of casual and occasional travellers, which has been quoted to potentially increase the rate of transmission between regions by 25%⁸, would involve further computational burden and is particularly difficult to implement in an inferential setting without appropriate auxiliary information (e.g. if the census data contained information on the purpose of travel). The MR model could be made more realistic and detailed by assuming that a proportion of those with symptomatic illness may not travel³, or that asymptomatic illness is less infectious¹⁷. However, consideration of such factors would only lessen the contribution of long-range transmission, leaving conclusions unchanged.

There are a number of studies that provide estimates for incidence and attack rates in England during the two waves of 2009 A/H1N1pdm. However, estimates stratified by age and region are not publicly available. Our attack rates estimates, when aggregated to a national level, are highly consistent with those published elsewhere, based largely on the serological data used here^13,18. A further study¹⁹ provides comparable overall incidence, but with a more even distribution of infection over the two waves and increased levels of infection in the older age groups. In this work the lower cumulative incidence in the first wave may be attributable to the parameter that measures the decrease in the rate of effective contact among 5–14 year-olds suggesting a drop of over 99%. Averaged across all age-groups, this represents a drop in R₀ of between 43% (in London) and 50% (in the South). To compare, He et al.²⁰ record a 28% fall in transmissibility during a school holiday period. These represent drops from a baseline R₀, estimated to be in the region of 1.8, a value corroborated in literature²¹.

The modelling approaches presented here have great potential for use in a future pandemic and will form a key component of the pandemic response protocol of the responsible public health body in England, Public Health England. Here, GP consultations have been used to inform the model, but in practice any time series of count data related to infection incidence could be used to inform the pattern of infection over time. Such data could alternatively come from hospital admittances, absenteeism²², antiviral prescriptions etc. Serological data underpin the scale of infection and in their absence the full scale of the epidemic cannot be accurately estimated until the epidemic has been fully observed¹⁴. If the count data are not pathogen-specific and hence contaminated, then some virological data are required to identify the signal due to the pandemic. All pandemic data sources discussed here do not need to cover the whole population. Data can be included provided that there is information on the covered fraction of the population and that any bias in this coverage is well understood.

Since 2009 in the UK there has been an investment in improving the quality of the surveillance data available in the event of a pandemic. Such improvements can only enhance the utility of the evidence synthesis model presented here. The prompt availability of hospitalisation and intensive care unit admission data could remove (at least in the early stages of a pandemic) the dependence on noisy GP data that are influenced by fluctuating healthcare-seeking behaviours of the public. These noisy GP data require attendant virological swabbing data, the positivity of which wanes over time from symptom onset. This sensitivity is crudely accounted for here by omitting any swabs taken more than five days since onset. Methods for incorporating the uncertainty in the swab results into this modelling framework would be valuable. The serological data come from the analysis of blood sera samples taken from patients admitted to hospital for a variety of non-respiratory reasons. It is unclear if this convenience sampling approach could lead to bias. Furthermore, the relationship between the recorded titre values and the presence of an immunological response is imprecise and uncertain¹³. Joint modelling of serological microarray data with syndromic surveillance data to reconstruct an epidemic has incorporated this imprecision²³, but in this exercise the data do not have sufficient resolution to clearly partition long-standing immunity from recent infection and from susceptibility.

To summarise, using a Bayesian statistical framework, the PR model is found to be sufficiently flexible to provide a good fit to data and is quick to implement as it includes lower dimension contact matrices and, particularly, as model code can be easily parallelised. Reassuringly, it also provided concurring estimates for the basic reproductive number (R₀) across the regions, in agreement with the MR approach. However, the PR model can provide little insight on inter-region transmission and the determinants of spatial heterogeneity in the spread of infection because of its simple structure. In a situation where school-age children are the main agents of transmission and baseline transmissibility is not high, spatial models that concentrate on local transmission, like the PR model, provide a powerful and timely tool for use by public health services, helping to inform effective control and containment measures.

Methods

Data

The epidemic dynamics are reconstructed on the basis of a suite of epidemic surveillance data available during the 2009 pandemic. This includes counts of non-disease specific illness in the form of GP consultations for influenza-like illness (ILI) and seroepidemiological and virological swabbing data. A full description of these data sources has been published in the SI of Birrell et al.¹⁴, so we only summarise them briefly here.

The ILI consultation data come from sentinel GP surveillance, providing daily counts of consultations in participating practices, stratified by age and region, as well as daily denominators giving the fraction of the population (typically >50%) covered by the scheme²⁴. Virological swabbing provide a (short) time series of case confirmation data derived as a result of contact tracing carried out on some of the early identified cases²⁵ and a longer companion dataset to the GP surveillance data on the proportion of GP ILI consultations testing swab-positive for the pandemic pathogen^26,27. Additionally, infrequent batches of serological data provide information on the proportion of the wider population carrying protective antibodies, assumed to be indicative of the level of cumulative infection up until two weeks (the length of time allowed for antibodies to establish within host) prior to the time of sample¹². With the exception of the case confirmation data, all datasets run from 1st May to 31st December, 2009, giving 245 days of consecutive data (see Fig. 1 for a presentation of this data from London).

To ensure large enough sample sizes, we divide England into four regions: two smaller regions that exhibited a significant first wave of infection, London and the West Midlands; and two regions that cover a larger area, labelled North (combining the North-West, North-East, Yorkshire and Humberside and the East Midlands GORs²⁸) and South (combing the East of England, South-East and South-West GORs). Commuting data have been extracted from the UK 2001 census²⁹. The census provides an estimated number of people of all ages >15 years moving between each of our regions on the date of the survey (including those that do not move). The age-specific commuter matrices are shown in SI Table S1. The population size and structure over seven age groups (<1, 1–4, 5–14, 15–24, 25–44, 45–64, >64 years) in each of the four regions have been extracted from mid-year population estimates released by the Office of National Statistics³⁰.

Modelling Approaches

To model the spatial spread of infection, the population is divided into strata defined by region and age pairs, (r, a), r = 1, …, R; a = 1, …, A. Each stratum is assigned an index j = a + A(r − 1), j = 1, …, RA. The infection status of the population within stratum j at discrete-times t_n = nδt is described by a deterministic SEEIIR system of difference equations:

for n = 1 … T and suitably small δt (here taken to be 0.5 days). Parameters σ and γ are related to the mean duration of latent and infectious infection, d_L and d_I respectively via σ = 2/d_L, γ = 2/d_I and the force of infection, λ_j(t_n), is expressed through the Reed-Frost formulation

where the (j, i)^th entry of the time-varying (RA × RA) matrix β(t_n) gives the infection pressure exerted on a susceptible individual within stratum j by a single infectious individual in stratum i. The structure of the matrix β(t_n) depends on assumptions governing spatial heterogeneity in interpersonal contact rates and the transmissibility of infection across different strata. Two approaches are formulated to handle the spatial heterogeneity: parallel-region and meta-region modelling (see below and SI Sections 1.3-4 for greater detail). Both formulations can be parameterised in a similar fashion, with slight differences in the spatial variation of some parameters as illustrated in Table 1.

Parallel-Region (PR) Modelling

The PR model assumes that infectious individuals exert negligible infectious pressure on individuals in any of the other regions and the transmission dynamics in each region are considered independent of the dynamics occurring elsewhere. This results in parallel, single-region, epidemics linked through the borrowing of strength between some parameters (e.g. the background model, see Table 1) or the sharing of some common parameters (e.g. the proportion of symptomatic infections, Table 1). Within each of the four English regions the epidemic dynamics are governed by Equations (1) and (2) with R = 1 and A = 7. The strata are then simply defined by age groups and the, now regionally-dependent, (A × A) infection rate matrix is given by:

where M(t_n) = {M_a,b(t_n)} is a matrix of relative infective contact rates between individual of age groups a and b derived from POLYMOD data³¹ and the contact parameters, m_k, k = 1, …, 5 (as described elsewhere¹⁴). This approach allows estimation of region-specific reproduction numbers, R_0,r (via the epidemic growth rates ψ_r, see SI Section 1.3.1 and Equation S6). The denote the dominant eigenvalues of the next generation matrices which has (a, b)^th entry given by N_r,a × M_a,b(0) × d_I, where N_r,a is the resident population size of people in age group a in region r.

Compared to the London study¹⁴, some minor amendments have been made to the model, including the addition of a day-of-the-week effect on the reporting of ILI (see SI Section 1.2.2) and the expansion of the background model, made feasible due to the integration of a spatial dimension in the modelling (see SI Section 1.2.1). Parameters that represent biological characteristics of the virus (mean infectious period, proportion symptomatic) are assumed to be consistent across all regions. Additionally, the contact parameters exhibit no regional variation. As well as the exponential growth rates, the initial levels of infectiousness (I_0,r) are allowed to vary among regions, as they are a function of the regional population as well as of the virus and can account for the different timing of the pandemic activity in each region. Spatial heterogeneities of model parameters are given in Table 1.

Meta-Region (MR) Modelling

In the meta-region modelling approach the four regions are connected by commuter flows into one system. The number of strata are defined by setting R = 4 and A = 7 and the resulting (28 × 28) contact matrix is denoted by Π. The (j, i)^th entry of Π, where j = a + A(r − 1), as above and i = b + A(s − 1) represents the generic region/age strata (s, b), is as follows:

and the infection rate matrix has entries

Matrices C(a) in Equation (3) have entries C_rs(a) representing the proportion of age group a resident in region r that commute into region s on any given day (see SI Table S1). The are the population sizes of stratum (r, a) at night (i.e. the size of the resident population) and are the day population sizes (see SI Section 1.4.2), the adjusted population sizes after commuter movements have occurred; and ξ is the proportion of total time that a commuter actually spends in the commuting region. We set ξ = 5/14 on the basis of a daily average of five working days per week, being away from home for a half day when working. The exponent α takes values in [0, 1] with a value of 0 indicating frequency-driven transmission and 1, density-driven transmission. Finally, in Equation 4, is again the dominant eigenvalue of a next generation matrix Π* which has entries . As all strata interact and the meta-population cannot be broken down into isolated regions, there is only a single growth rate and hence a single value for the epidemic’s reproductive number, R₀.

A structural comparison of the MR modelling to the PR modelling is illustrated in Fig. 3. There are a number of modelling considerations relevant to MR modelling that are not applicable to the PR approach and these are discussed below.

Density Dependence

To test different variants of density dependence, the exponent α in Equation (3) is given three different values: 0 (the frequency dependent formulation), 0.5 and 1.0 (density dependent formulation). Also we consider replacing the population size (N_r,a) of each stratum by the total population size (N_r) of the containing region r to investigate the precise form of any density dependence (see SI Section 1.4.2).

Effect of seed construction

The near-block diagonal structure of the contact matrix (see Fig. 3(C,D)) results in convergence to a disease-free equilibrium being very slow (if it occurs at all). Any simulated epidemic from the meta-regional model is, therefore, qualitatively sensitive to the choice of the initial seeding of infection⁷. To identify an appropriate approach for generating such epidemic seeds, we consider three specifications labelled ‘nextgen’, ‘empirical’ and ‘extended empirical’, the details for which are given in SI Section 1.4.3.

Random commuting vs. fixed commuting

So far, in the MR model all members of a given stratum are assumed to be equally likely to commute, with the total proportion of commuters remaining the same. However, it may be more realistic to assume that the commuters are a fixed group of people. To account for this, adult groups in each region are further sub-divided into commuters and non-commuters, giving 11 strata per region, 44 in total. Although the total attack rate is insensitive to this further stratification, the peak times across regions are affected (see SI Section 1.4.4). Results suggest that the introduction of a fixed commuting population, however, improves model fit to the 2009 pandemic data across England only marginally (SI Table S3; cf.¹⁶).

Parametric Inference

Assuming that the epidemic data are imperfectly observed, a Bayesian approach is used to estimate the unknown parameters. The posterior distributions of these parameters and various quantities of interest are derived through the combination of prior information and the likelihood function. The log-likelihood function includes information from four data components: the number of GP consultations, virological positivity, number of lab-confirmed cases and seropositivity. Full details of the likelihood function are given in SI Section 1.5.1.

Priors and Implementation

The Bayesian framework for statistical inference involves the specification of prior probability distributions for all model parameters. We have assumed a level of prior knowledge of the pandemic that was representative of the state of knowledge in 2009. Therefore, prior specifications are largely unchanged from those used in Birrell et al.¹⁴. In the PR model, where a single parameter is specified for each region, they are assumed a priori to be identically distributed according to the prior specified for London¹⁴. For the new parameters, such as day of week effect on reporting of GP consultations and the additional parameters of the background ILI consultation process, non-informative normal prior distributions are assumed (see SI Section 1.5.2 for technical details).

The Bayesian model is implemented using Markov Chain Monte Carlo³², using bespoke C++ code. The PR model, parallelised on an eight core machine takes c.15 hours to implement, as opposed to 60 hours for the MR model (with 28 strata). When assuming fixed commuting groups (i.e. 44 strata), this run-time doubles to approximately 120 hours. The code and input files used to generate the outputs in this paper, together with some dummy data can be found at www.mrc-bsu.cam.ac.uk/software/miscellaneous-software/. Requests for access to the data should be directed to: Richard.Pebody@phe.gov.uk.

Additional Information

How to cite this article: Birrell, P. J. et al. Reconstructing a spatially heterogeneous epidemic: Characterising the geographic spread of 2009 A/H1N1pdm infection in England. Sci. Rep. 6, 29004; doi: 10.1038/srep29004 (2016).

References

Colizza, V., Barrat, A., Barthélemy, M. & Vespignani, A. The role of the airline transportation network in the prediction and predictability of global epidemics. Proc. Nat. Acad. Sci. USA 103, 2015–2020, 10.1073/pnas.0510525103 (2006).
Article CAS ADS PubMed MATH Google Scholar
Cooper, B. S., Pitman, R. J., Edmunds, W. J. & Gay, N. J. Delaying the international speed of pandemic influenza. PLoS Medicine 3, 0845–0854 (2006).
Article Google Scholar
Balcan, D. et al. Seasonal transmission potential and activity peaks of the new influenza A(H1N1): a Monte Carlo likelihood analysis based on human mobility. BMC medicine 7, 45+, 10.1186/1741-7015-7-45 (2009).
Article PubMed PubMed Central Google Scholar
Merler, S. & Ajelli, M. The role of population heterogeneity and human mobility in the spread of pandemic influenza. Proc. R. Soc. B 277, 557–565, 10.1098/rspb.2009.1605 (2010).
Article PubMed Google Scholar
Ferguson, N. M. et al. Strategies for containing an emerging influenza pandemic in Southeast Asia. Nature 437, 209–214, 10.1038/nature04017 (2005).
Article CAS ADS PubMed Google Scholar
Ferguson, N. M. et al. Strategies for mitigating an influenza pandemic. Nature 442, 448–452 (2006).
CAS ADS Google Scholar
Germann, T. C., Kadau, K., Longini, I. M. & Macken, C. A. Mitigation strategies for pandemic influenza. Proc. Nat. Acad. Sci. USA 103, 5935–5940 (2006).
Article CAS ADS Google Scholar
Danon, L., House, T. & Keeling, M. J. The role of routine versus random movements on the spread of disease in Great Britain. Epidemics 1, 250–258, 10.1016/j.epidem.2009.11.002 (2009).
Article PubMed Google Scholar
Gog, J. R. et al. Spatial transmission of 2009 pandemic influenza in the US. PLoS Comput Biol 10, e1003635+, 10.1371/journal.pcbi.1003635 (2014).
Article CAS PubMed PubMed Central Google Scholar
Viboud, C. et al. Synchrony, waves and spatial hierarchies in the spread of influenza. Science 312, 447–451, 10.1126/science.1125237 (2006).
Article CAS ADS PubMed Google Scholar
Eggo, R. M., Cauchemez, S. & Ferguson, N. M. Spatial dynamics of the 1918 influenza pandemic in England, Wales and the United States. J. R. Soc. Interface 233–243, 10.1098/rsif.2010.0216 (2010).
Miller, E. et al. Incidence of 2009 pandemic influenza A H1N1 infection in England: a cross-sectional serological study. Lancet 375, 1100–1108 (2010).
Article Google Scholar
Hardelid, P. et al. Assessment of baseline age-specific antibody prevalence and incidence of infection to novel influenza AH1N1 2009. Health Technology Assessment 14, 115–192, 10.3310/hta14550-03 (2010).
Article CAS PubMed Google Scholar
Birrell, P. J. et al. Bayesian modelling to unmask and predict the influenza A/H1N1pdm dynamics in London. Proc. Nat. Acad. Sci. USA 108, 18238–18243, 10.1073/pnas.1103002108 (2011).
Article ADS PubMed Google Scholar
Baguelin, M., Van Hoek, A. J., Flasche, S., White, P. J. & Edmunds, W. J. Vaccination against pandemic influenza A/H1N1v in England: A real-time economic evaluation. Vaccine 28, 2370–2384 (2010).
Article Google Scholar
Keeling, M. J., Danon, L., Vernon, M. C. & House, T. A. Individual identity and movement networks for disease metapopulations. Proc. Nat. Acad. Sci. USA 107, 8866–8870, 10.1073/pnas.1000416107 (2010).
Article ADS PubMed Google Scholar
Longini, I. M., Halloran, E. E., Nizam, A. & Yang, Y. Containing pandemic influenza with antiviral agents. Am. J. Epidemiol 159, 623–633, 10.1093/aje/kwh092 (2004).
Article PubMed Google Scholar
Baguelin, M. et al. Age-specific incidence of A/H1N1 2009 influenza infection in England from sequential antibody prevalence data using likelihood-based estimation. PLoS one 6, e17074+, 10.1371/journal.pone.0017074 (2011).
Article CAS ADS PubMed PubMed Central Google Scholar
Dorigatti, I., Cauchemez, S. & Ferguson, N. M. Increased transmissibility explains the third wave of infection by the 2009 H1N1 pandemic virus in England. Proc. Nat. Acad. Sci. USA 110, 13422–13427, 10.1073/pnas.1303117110 (2013).
Article ADS PubMed Google Scholar
He, D., Dushoff, J., Eftimie, R. & Earn, D. J. Patterns of spread of influenza A in Canada. Proc. R. Soc. B 280, 10.1098/rspb.2013.1174 (2013).
Boëlle, P.-Y. Y., Ansart, S., Cori, A. & Valleron, A.-J. J. Transmission parameters of the A/H1N1 (2009) influenza virus pandemic: a review. Influenza Other Respir. Viruses 5, 306–316 (2011).
Article Google Scholar
Drumright, L. N. et al. Assessing the use of hospital staff influenza-like absence (ILA) for enhancing hospital preparedness and national surveillance. BMC Infect. Dis. 15, 110+, 10.1186/s12879-015-0789-z (2015).
Article PubMed PubMed Central Google Scholar
te Beest, D. E., Birrell, P. J., Wallinga, J., De Angelis, D. & van Boven, M. Joint modelling of serological and hospitalization data reveals that high levels of pre-existing immunity and school holidays shaped the influenza A pandemic of 2009 in the Netherlands. J. R. Soc. Interface 12, 20141244+, 10.1098/rsif.2014.1244 (2015).
Article PubMed PubMed Central Google Scholar
Harcourt, S. E. et al. Use of a large general practice syndromic surveillance system to monitor the progress of the influenza A(H1N1) pandemic 2009 in the UK. Epidemiol. Infect. 140, 100–105, 10.1017/S095026881100046X (2012).
Article CAS PubMed Google Scholar
Whelan, J., Greenland, K., Rondy, M., van der Hoek, W. & Robert-Du Ry van Beest Holle, M. Case registry systems for pandemic influenza A(H1N1)pdm09 in Europe: are there lessons for the future? Eurosurveillance 17 (2012).
McCartney, C. Regional microbiology network. British Journal of Infection Control 8, 28–29 (2008).
Article Google Scholar
Fleming, D. M. Weekly returns service of the Royal College of General Practitioners. Communicable disease and public health/PHLS 2, 96–100 (1999).
CAS Google Scholar
Office for National Statistics. 2001 census: Special workplace statistics (England, Wales and Northern Ireland) (2009). URL http://www.cids.census.ac.uk.
Office for National Statistics. Map of Government Office Regions in England (2015). URL http://www.ons.gov.uk/ons/rel/family-spending/family-spending/2013-edition/mdl-final-2013.pngAccessed 19/5/2016.
Office for National Statistics. Super output area mid-year population estimates for England and Wales (experimental) (2009). URL http://www.statistics.gov.uk/StatBase/Product.asp?vlnk=14357Accessed 31/7/2009.
Mossong, J. et al. Social contacts and mixing patterns relevant to the spread of infectious disease. PLoS Medicine 5, e74 (2008).
Article Google Scholar
Hastings, W. K. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57, 97–109 (1970).
Article MathSciNet Google Scholar

Download references

Acknowledgements

This work was supported by the National Institute for Health Research (HTA Project:11/46/03) the UK Medical Research Council (Unit Programme Number U105260566) and Public Health England. The authors thank the University of Nottingham, Egton Medical Information Systems (EMIS) and EMIS practices contributing to the QSurveillance database. We thank colleagues at PHE Respiratory Virus Reference Unit and the Specialist Microbiology Network for the provision of GP swab positivity data and for the use of their ‘whiteboard’ confirmed case data. We also extend thanks to patients of Royal College of General Practitioners Research and Surveillance Centre (RSC) practices who consented to having a flu swab taken and RSC practices for processing and sharing these data.

Author information

Birrell Paul J. and Zhang Xu-Sheng contributed equally to this work.

Authors and Affiliations

Medical Research Council Biostatistics Unit, Cambridge Insitute of Public Health, Forvie Site, Robinson Way, Cambridge Biomedical Campus, Cambridge, CB2 0SR, UK
Paul J. Birrell & Daniela De Angelis
Centre for Infectious Disease Surveillance and Control, Public Health England, 61 Colindale Avenue, London, NW9 5EQ, UK
Xu-Sheng Zhang, Richard G. Pebody & Daniela De Angelis
Fu Consulting, Hungerford, UK
Nigel J. Gay

Authors

Paul J. Birrell
View author publications
You can also search for this author in PubMed Google Scholar
Xu-Sheng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Richard G. Pebody
View author publications
You can also search for this author in PubMed Google Scholar
Nigel J. Gay
View author publications
You can also search for this author in PubMed Google Scholar
Daniela De Angelis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.-S.Z., P.J.B. and D.D.A. wrote the main manuscript, X.-S.Z. and P.J.B. prepared figures and P.J.B. carried out the modelling and statistical analyses. R.G.P. and D.D.A. supervised the work and N.J.G. conceptualised the research.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Birrell, P., Zhang, XS., Pebody, R. et al. Reconstructing a spatially heterogeneous epidemic: Characterising the geographic spread of 2009 A/H1N1pdm infection in England. Sci Rep 6, 29004 (2016). https://doi.org/10.1038/srep29004

Download citation

Received: 01 December 2015
Accepted: 09 June 2016
Published: 11 July 2016
DOI: https://doi.org/10.1038/srep29004

This article is cited by

Forecasting the 2017/2018 seasonal influenza epidemic in England using multiple dynamic transmission models: a case study
- Paul J. Birrell
- Xu-Sheng Zhang
- Daniela De Angelis
BMC Public Health (2020)
Detecting a Surprisingly Low Transmission Distance in the Early Phase of the 2009 Influenza Pandemic
- Valentina Marziano
- Andrea Pugliese
- Marco Ajelli
Scientific Reports (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Modelling the propagation of infectious disease via transportation networks

Modelling and predicting the spatio-temporal spread of COVID-19, associated deaths and impact of key risk factors in England

A novel geo-hierarchical population mobility model for spatial spreading of resurgent epidemics

Introduction

Results

Reconstructing the epidemic

Estimated epidemic characteristics

Model performance

Sensitivity to specification of the meta-region model

Discussion

Methods

Data

Modelling Approaches

Parallel-Region (PR) Modelling

Meta-Region (MR) Modelling

Density Dependence

Effect of seed construction

Random commuting vs. fixed commuting

Parametric Inference

Priors and Implementation

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Forecasting the 2017/2018 seasonal influenza epidemic in England using multiple dynamic transmission models: a case study

Detecting a Surprisingly Low Transmission Distance in the Early Phase of the 2009 Influenza Pandemic

Comments

Search

Quick links