Article | Open

# Big-data-driven modeling unveils country-wide drivers of endemic schistosomiasis

• Scientific Reports 7, Article number: 489 (2017)
• doi:10.1038/s41598-017-00493-1
Accepted:
Published online:

## Abstract

Schistosomiasis is a parasitic infection that is widespread in sub-Saharan Africa, where it represents a major health problem. We study the drivers of its geographical distribution in Senegal via a spatially explicit network model accounting for epidemiological dynamics driven by local socioeconomic and environmental conditions, and human mobility. The model is parameterized by tapping several available geodatabases and a large dataset of mobile phone traces. It reliably reproduces the observed spatial patterns of regional schistosomiasis prevalence throughout the country, provided that spatial heterogeneity and human mobility are suitably accounted for. Specifically, a fine-grained description of the socioeconomic and environmental heterogeneities involved in local disease transmission is crucial to capturing the spatial variability of disease prevalence, while the inclusion of human mobility significantly improves the explanatory power of the model. Concerning human movement, we find that moderate mobility may reduce disease prevalence, whereas either high or low mobility may result in increased prevalence of infection. The effects of control strategies based on exposure and contamination reduction via improved access to safe water or educational campaigns are also analyzed. To our knowledge, this represents the first application of an integrative schistosomiasis transmission model at a whole-country scale.

## Introduction

Schistosomiasis is a major parasitic infection that affects about 250 million individuals in many areas of the developing world and that puts at risk about 700 million people in regions where the disease is endemic1. It is a major cause of mortality, being directly responsible for the death of about 12,000 people yearly2 and a co-factor in at least 200,000 deaths annually3. Schistosomiasis is also an important determinant of morbidity, with 20 million people suffering severe consequences from the disease4 and an estimated burden of 4.5 million disability-adjusted life years5. These figures make schistosomiasis the second most common parasitic disease after malaria and likely the deadliest among neglected tropical diseases. Schistosomiasis is disproportionately concentrated in sub-Saharan Africa, which accounts for at least 90% of cases worldwide1.

The disease is caused by trematode parasites belonging to the genus Schistosoma6. Most human infections are caused by three species, namely S. haematobium, S. mansoni or S. japonicum. These parasites need as obligate intermediate hosts some species of freshwater snails belonging to the genus Bulinus (for S. haematobium), Biomphalaria (for S. mansoni) or Oncomelania (for S. japonicum). The geographical distribution of schistosomes is thus linked to the species-specific range of the snail host habitat. The infectious form of the parasite for humans is a freely swimming, short-lived larval stage, known as cercaria, that is shed by infected snails. Cercariae can infect humans exposed to infested water by penetrating their skin. Within the human body, they develop into sexually mature adult parasites that live for years, mating and producing hundreds to thousands of eggs daily. Eggs leave the human host through urine (S. haematobium) or feces (S. mansoni or S. japonicum). After reaching freshwater, they hatch into so-called miracidia, a second short-lived larval form of the parasite that can infect snails. Miracidia replicate asexually in snails, which then daily shed hundreds of cercariae into water, thus completing the parasite's life cycle. The population dynamics of human and snail hosts, as well as of different vital stages of the parasite, are thus essential components in the description of the transmission cycle of schistosomiasis (Fig. 1a).

Spatial coupling mechanisms are very important for the spread, persistence and infection intensity of schistosomiasis7, 8. Parasites may in fact be carried in advective flows along canals and streams as larvae, moved between aquatic and riparian habitats inside snail hosts, or transported by human hosts as adult worms. While larval transport and snail movement may represent significant propagation pathways for the disease only over short spatial scales (e.g. in the order of hundreds of meters9, 10) or long temporal windows (e.g. because of habitat expansion following water resources development11, 12), human mobility can play a significant role in disease propagation within and from endemic areas13,14,15. People can in fact be exposed to water infested with cercariae while visiting endemic regions and import the parasites back to their home communities; also, if infected, they can contribute to water contamination while traveling outside their home communities. Both mechanisms are expected to favor parasite dispersion and may even introduce schistosomes into villages that were previously disease-free.

Human mobility differs from the ecohydrological pathways of schistosomiasis propagation in that human movement can occur between otherwise environmentally unconnected areas, over larger spatial scales, and over shorter (and less predictable) temporal scales16. As a matter of fact, despite recent advances in the modeling of human mobility17,18,19, there still remain fundamental limits to our understanding of where, when, why and how people move20, 21. Standard mobility models have been found to perform poorly in the African context22. Therefore, proxies of human mobility that can be remotely acquired, properly anonymized and quantitatively elaborated represent an invaluable tool to inform epidemiological modeling. In this respect, the analysis of Call Detail Records (CDRs) from mobile phone users represents one of the most promising tools to infer human movement patterns23, 24 – also in an epidemiological context, as shown by the rapidly increasing number of studies that make use of CDRs to characterize human mobility25,26,27,28,29,30,31,32,33,34,35,36,37.

In this work we explore country-wide patterns of schistosomiasis transmission in Senegal, where the urogenital form of the infection is widespread38, 39. Schistosomiasis represents a major health problem in the country, being the third disease (after malaria and lymphatic filariasis) in terms of years lived with disability40. We apply a spatially explicit network model of schistosomiasis accounting for the dynamics of human hosts, intermediate snail hosts and larval parasite stages (Methods and Fig. S1). Because of the large spatial scale of interest, the exposure/contamination rates for the human host communities are assumed to be spatially heterogeneous to account for local differences in transmission risk. For the same reason, human mobility is here retained as the most important mechanism for the spatial spread of the disease; human movement patterns are extracted from a large dataset of anonymized CDRs (more than 15 billion records) made available by Sonatel, the largest Senegalese telecommunication provider (with a customer base of more than 9 million people). The model is parameterized with georeferenced data on population abundance, socioeconomic conditions and freshwater distribution, and calibrated against the most up-to-date regional estimates of urogenital schistosomiasis prevalence currently available at the Senegalese Ministry of Health (Figs 1b–e and S2). Specific aim of this work is to assess the impact of environmental conditions and human mobility on schistosomiasis transmission from local to regional scales. To that end, we compare the performances of four model set-ups characterized by different assumptions regarding the spatial grain of environmental heterogeneity and the inclusion of human mobility (Methods). We also illustrate how the analysis of local heterogeneities in disease transmission can help guide resource allocation in the fight against the disease, with the overarching goal of showcasing how mathematical modeling can be used for societal development, namely by assisting decision makers in the fight against a poverty-reinforcing infection like schistosomiasis.

## Results

### Drivers of endemic schistosomiasis in Senegal

Of the four tested model set-ups (Methods), the one accounting for both a fine-grained description of spatially heterogeneous transmission risk and human mobility (M4) performs best in reproducing regional schistosomiasis prevalence values (Fig. S3). The simulation results from this model are in good quantitative agreement with the available epidemiological data (Fig. 2a, coefficient of determination data vs. model R2 = 0.76). The average absolute data-model deviation is 6.0%, while the largest differences are found for the regions of Kaolack (6), Ziguinchor (11) and Kolda (9), where the model overestimates (Kaolack and Kolda) or underestimates (Ziguinchor) disease prevalence by more than 10%. Conversely, a model accounting for fine-grained environmental heterogeneity but neglecting human mobility (M3) shows worse explanatory power at the regional scale (R2 = 0.64), while models characterized by coarse-grained heterogeneity, either accounting for (M2) or neglecting (M1) human mobility, cannot even capture the spatial variability of regional prevalence (R2 < 0, see Supplementary Information, SI). The best-fit set-up including fine-grained spatial heterogeneity and human mobility (M4) is thus retained as reference model and used for further analyses. Although calibrated with regional prevalence data, the reference model can project infection patterns throughout the country at the spatial scale of third-level administrative units (so-called arrondissements, Fig. 2b).

### Spatial patterns of infection intensity

From an epidemiological standpoint, the reference model projects a country-wide schistosomiasis prevalence of about 21%, with a regional maximum in Tambacounda (59% of clinically infected people). Sensitivity analysis (Fig. S4) suggests that these figures are quite robust to (moderate) changes in model parameterization, with the baseline human exposure rate, the schistosome mortality rate and the threshold for clinical infection in humans being responsible for the largest variations in model predictions. The model also projects an Average Parasite Burden (APB, see Methods) of approximately 7.2 parasites per person, with arrondissement-level values ranging between 2.3 and 12.7 (Fig. S5a). The frequency distribution of APB is actually bimodal, with marked peaks around 3 and 10 parasites per person (Fig. S5b). The within-host parasite distributions in each of the 123 arrondissements can be well approximated by negative binomial distributions (Fig. S5c), with aggregation parameters ranging from ≈24.5 in arrondissements where APB is lowest to ≈20.5 where APB is highest, and dispersion indexes ranging between ≈1.1 and ≈1.5 for increasing values of APB (Fig. S5d).

### Role of human mobility

Contrasting the reference model against a simulation with the same parameter values but no mobility (SI) is useful to enucleate the impact of human mobility on the spatial patterns of schistosomiasis prevalence. Regional disease prevalence is found to be higher in all regions but Dakar if human mobility is completely switched off (Fig. 3a). Instead, the impact of human mobility at the arrondissement level is relatively more diversified in space: the effects of the mobility switch-off are less pronounced in the westernmost part of the country, where smaller increments (or even decrements in some arrondissements) of disease prevalence are predicted in the absence of mobility (inset of Fig. 3a). More in general, it is possible to contrast the reference model with simulations in which the human mobility rate is artificially manipulated, i.e. decreased or increased with respect to the country-wide estimate from CDR analysis (26% of mobile people in a one-year interval); this also requires a suitable redistribution of mobility fluxes (SI). According to model projections, country-scale APB shows an increasing trend with increasing mobility, while disease prevalence attains a well-defined minimum for intermediate levels of mobility – remarkably, close to the actual estimate of mobility obtained from CDRs (Fig. 3b). Therefore, changes in mobility can alter within-host parasite distributions at the community level in a way that nontrivially influences disease prevalence.

### Disease control

The reference model is finally used to evaluate the effects of so-called WAter, Sanitation and Hygiene (WASH) or Information, Education and Communication (IEC) strategies aimed to reduce the burden of schistosomiasis in Senegal through prevention of human exposure and contamination (SI). Concerning WASH (Fig. 4a,b), the model suggests that targeted interventions, prioritizing either high-risk communities (where schistosomiasis transmission is expected to be highest because of the synergistic effect of rural living conditions and abundance of freshwater environments) or high-prevalence communities, may be more effective than untargeted ones in reducing both the average and the maximum regional prevalence of infection. Specifically, risk-targeted actions are predicted to be the most effective option if the expected efficiency of the interventions is high; conversely, prevalence-targeted interventions may be more effective in reducing average disease prevalence for low expected efficiency. Different results are obtained with IEC campaigns (Fig. 4c,d). In this case, untargeted actions may represent the most effective option to reduce average prevalence, namely if the expected efficiency of the interventions is high and the plan involves at least ≈1 million people (≈5 millions for maximum regional prevalence). In case of smaller-scale plans or low expected efficiency, targeted actions are predicted again to be more efficient; in this case, the effects of prioritizing high-risk vs. high-prevalence communities depend on the planned extent of the intervention.

## Discussion

In this work we have proposed a big-data-driven modeling framework to study the country-scale dynamics of schistosomiasis transmission in Senegal. We have shown that a fine-grained description of the socioeconomic and environmental heterogeneities involved in local disease transmission is crucial to capturing the spatial variability of the regional prevalence patterns, and that the inclusion of human mobility estimates obtained from mobile phone traces significantly improves the explanatory power of the model. In this respect, the best-fit model including these two drivers of disease transmission is able to quite reliably reproduce large-scale patterns of schistosomiasis prevalence throughout the country. To our knowledge, this represents the first application of a country-scale schistosomiasis transmission model making an integrative use of socioeconomic, environmental and mobility proxies to infer human exposure/contamination patterns.

Human mobility can have remarkable implications in the definition of the country-scale prevalence patterns of an endemic disease like schistosomiasis. Specifically, our results show that the level of human mobility estimated from CDRs may be associated with lower values of schistosomiasis prevalence compared to those obtained in a no-mobility scenario at regional and country scales. This (possibly quite unexpected) finding can be explained by the fact that, under the current mobility scenario, the largest mobility fluxes are attracted by the most populated and urbanized regions (Dakar, Thiès and Diourbel), where schistosomiasis transmission is low. The higher disease prevalence predicted in the absence of mobility in prevalently rural areas is a clear indication of the importance of mobility directed to urban areas in preventing local disease transmission. Conversely, mobility may represent a risk factor for people living in prevalently urban areas, as found for the region of Dakar and several other areas in the western part of the country, where local infection prevalence is expected to be higher in the presence of mobility, clearly because of movement to/from rural areas where schistosomiasis thrives. Numerical simulations also show that, should propensity to moving increase with respect to what currently inferred from CDRs, country-wide average schistosomiasis prevalence could become higher as well. Country-wide APB as a function of mobility shows a different trend, namely a nearly monotonic increase for higher levels of mobility. This result seems to agree with other modeling studies that have reported a positive relationship between human mobility and transmission emergence, parasite burden and disease spread7. Per se, however, APB may represent a relatively poor epidemiological indicator for schistosomiasis dynamics. By accounting for the stratification of the infection41, instead, our framework shows that human mobility may play a more complex role in the definition of the spatial patterns of schistosomiasis prevalence than previously thought, especially at small spatial scales.

In this respect, one possible limitation of our model is the current choice of third-level administrative entities as computational units. Such choice is the result of trading off between data availability (the spatial resolution of country-wide data sets and sources is often relatively coarse) and computational cost (which clearly increases with the granularity of the model) on the one hand, and the need of preserving at least some of the heterogeneity that is inherent42, 43 to schistosomiasis transmission (as resulting, for instance, from regional differences in water availability and living conditions, or from human mobility) on the other. Because all variability within third-level administrative units is averaged out, our model misses some possibly important sources of heterogeneity (such as the demographic structure of the human population and small-scale changes in environmental conditions). For this reason, our description of schistosomiasis dynamics can be seen as a first-order approximation of the actual mechanisms responsible for the transmission of this disease. The current spatial granularity of the model may also result in an underestimation of the aggregation of community-level parasite distributions. The best-fit model suggests relatively low parasite aggregation throughout the country (arrondissement-level parasite distributions characterized by aggregation parameters >20 and dispersion indexes close to 1), which however is also a typical feature of models based on worm burden stratification44 (see also SI). We argue that high-resolution models targeted to specific regions of the country, where more accurate data – including snail distribution and infection status – may also be available37, could better elucidate the actual role of spatial heterogeneities and coupling mechanisms at fine spatiotemporal scales, and possibly help identify the focal hotspots of disease transmission.

High-resolution models, in turn, would call for a more-in-depth look at several sources of complexity that have been neglected at this stage. In particular, human exposure and contamination are directly related to water contact patterns, which are linked to demography and social structure. Including a simple, yet realistic, demographic model able to describe the age structure of the population at risk, as well as to track intra- and inter-annual changes in local population abundance, could greatly improve the reliability of epidemiological projections44. While census microdata could be used to describe long-term mobility trends45, CDRs can be exploited to derive (or validate) short-term movement patterns and/or time-varying mobility fluxes. A closer look at the connectivity matrices derived from CDRs shows in fact that human movement is highly heterogeneous, not only in space but also in time (Fig. S6). Overall mobility displays clear weekly patterns (especially in the most urbanized regions, such as Dakar), longer-term trends (possibly linked to seasonal economic activities, such as agriculture and fishing) and sudden pulses46. Religious gatherings attract pilgrims from all regions of Senegal and thus produce remarkable mobility fluxes, with the temporary displacement of hundreds of thousands of people, like in the case of the Grand Magal de Touba and Kazu Rajab (also held in Touba). Mobility pulses of this kind have been both anecdotally47 and quantitatively35 associated with cholera outbreaks, yet their possible role in the transmission of endemic diseases like schistosomiasis is still to be investigated.

From a biological perspective, a detailed description of parasite-host interactions and in-host parasite biology48, as well as of the ecology of the obligate intermediate snail host of schistosomes49, 50 has yet to be integrated in our modeling framework. As the presence of snail hosts is a major determinant of transmission risk, accounting for the spatiotemporal variability of the environmental drivers (most notably, water temperature and rainfall51) that influence their distribution and abundance could greatly improve the explanatory power of the model. Particular attention should be devoted to studying the possible interplay between the seasonality of environmental signals (which per se can induce complex dynamics52) and time-varying human mobility, which could introduce non-trivial effects on disease transmission53. Integrating the ecology of the intermediate host into the modeling framework described here is also deemed crucial to planning and optimizing non-conventional intervention strategies based on biological snail control. A promising avenue is represented by the restoration, possibly via village-based aquaculture54, of a native prawn species (Macrobrachium vollenhovenii) that has virtually disappeared from the Senegal river because of anthropogenic alterations, namely the construction of the Diama dam in the 1980's. M. vollenhovenii is a voracious snail predator, whose feeding activity can permanently interrupt disease transmission by suppressing the intermediate host population55, 56.

When building a fine-scale account of the ecological interactions that are relevant to schistosomiasis transmission, hydrological dispersal of the snail intermediate hosts, as well as of the larval stages of the parasite, has also to be included7, 16, 37. A detailed description of hydrological connectivity at a fine spatial scale may also allow studying the effects of agricultural development57, which requires the implementation of irrigation schemes and the construction of dam reservoirs. These interventions, in turn, can induce severe perturbations of the natural matrix that influences the population dynamics of snails and their natural enemies11, 58. As an example, the development of irrigation channels following the construction of the Diama dam resulted in increased transmission of S. haematobium and the introduction of S. mansoni in villages upriver of the dam, with a globally unprecedented velocity of transmission59. These observations highlight the importance of addressing the conflict between the need for water resources development and infectious disease management8, 11.

Current measures for fighting schistosomiasis are principally focused on chemotherapy3, 6. Senegal has implemented a national control program (Programme National de Lutte contre la Bilharziose, PNLB) since 199938. The PNLB is still ongoing, but the scarcity of large-scale data on treatment coverage and intervention effectiveness prevented us from including the effects of any mass drug administration in our country-wide model. Because chemotherapy does not confer permanent immunity, preventing infection by improving access to safe water and spreading awareness about disease transmission can represent a sustainable path towards reducing the burden of schistosomiasis. In this respect, our analysis of control strategies based on water contact conforms with a meta-analysis of observational studies60 that found that both safe water supplies and adequate sanitation are associated with significantly lower odds of schistosomiasis infection. However, safe water supplies may reduce contact with environmental water, but cannot completely avert it; similarly, sanitation can prevent snail infection, but the availability of adequate sanitation does not guarantee its use61. Our analysis also indicates that disease control efforts should be guided by a thorough understanding of the drivers that determine local heterogeneities in transmission risk, especially if resource availability is limited (a commonplace in the fight of neglected tropical diseases in developing countries) and/or a high efficiency of the interventions cannot be taken for granted. It is also to be remarked that no single action based on WASH or IEC alone would realistically be able to stop schistosomiasis transmission in Senegal. Sensitivity analysis seems to suggest that a combination of exposure prevention and chemotherapy could prove effective for large-scale disease control in humans. All these observations stress the importance of a comprehensive approach to schistosomiasis management62.

The results of this study encourage us to further elaborate on the current modeling framework to create operational tools aimed at supporting decision makers in the design of effective plans for disease control, as well as at informing citizens about the geography of transmission risk at different spatial scales. These decision-support systems should be able to accommodate real-time data assimilation (epidemiological reports, ecological surveys, demographic updates), in addition to reliable projections of relevant environmental drivers, like temperature and rainfall63, 64. Achieving greater detail in the description of epidemiological dynamics, human mobility, ecological interactions, water resources development and control plans is unlikely to be feasible at the country scale, but should be possible if looking at smaller spatial scales (e.g. specific regions of Senegal), at which the underlying modeling hypotheses can be substantiated by knowledge gathered in situ, possibly with the help of local institutions. The lessons learned from local experiences could then be scaled up to define country-scale strategies to help the fight against schistosomiasis transmission in Senegal, which could thus serve as an example for other countries (and/or other diseases) in sub-Saharan Africa.

## Methods

### A spatially explicit network model for schistosomiasis dynamics

The human population is subdivided into n communities (following e.g. administrative boundaries, health zones or geographical divides). Within each community i, the resident human population (of size K i ) is considered to be ‘stratified’41, 44, 48, i.e. divided into different infection classes characterized by increasing parasite burden p (from p = 0 to some maximum abundance p = P). Except for this, the population is assumed to be well-mixed within each community (no demographic/socioeconomic grouping). Let $H i p$ be the abundance of individuals in community i who host exactly p parasites. Furthermore, let S i and I i be the densities of susceptible and infected snails in community i, and let C i and M i be the concentrations of cercariae and miracidia in the freshwater resources accessible to community i. Following the transmission cycle of schistosomiasis (Fig. 1), disease transmission can be described by the following set of n(P + 5) differential equations:

$H ̇ i 0 = μ H ( K i − H i 0 ) − ℱ i H i 0 + γ 1 H i 1 H ̇ i p = ℱ i H i p − 1 − ( μ H + α H p + ℱ i + γ p ) H i p + γ p + 1 H i p + 1 (0 < p < P ) H ̇ i P = ℱ i H i P − 1 − ( μ H + α H P + γ P ) H i P S ̇ i = μ S ( N i − S i ) − b M i S i I ̇ i = b M i S i − ( μ S + α S ) I i C ̇ i = π C I i − μ C C i M ̇ i = G i − μ M M i .$

The first n(P + 1) equations of the model describe the dynamics of human hosts, the following 2n the dynamics of intermediate snail hosts, the last 2n the dynamics of the larval stages of the parasite. Model variables and parameters are summarized in Table 1, while a graphical representation of the transmission model is provided in Fig. S1.

As for the dynamics of human hosts, μ H is the baseline per capita mortality rate, while μ H K i represents the total birth rate, here assumed to be constant (i.e. leading to a constant community size K i in the absence of disease-induced mortality). Human hosts progress from one infection class to the following because of exposure to water infested with cercariae. Specifically, $ℱ i =a ∑ j = 1 n Q i j θ j C j$ is the force of infection for the inhabitants of community i: Q = [Q ij ] is a row-stochastic matrix (i.e. a matrix in which rows sum to one) that describes the probability that residents of community i travel to community j (possibly different from their home community as a result of human mobility), θ j is the human exposure rate, i.e. the rate at which human hosts either permanently or temporarily staying in community j are exposed to contaminated freshwater (exposure rate is assumed to be possibly community-dependent, so as to account for the geographical heterogeneity of living conditions), and a is the probability that a cercaria successfully develops into a reproductive adult parasite following contact with a human host. The term γp represents the parasite resolution rate, i.e. the transition rate from infection class p to infection class p − 1 because of the death of one parasite (γp =  P , with μ P being the per capita schistosome mortality rate). Disease-related mortality in humans is accounted for by the term $α H p$, which describes increasing mortality for increasing parasite burden ($α H p =p α H$, where α H is the additional mortality rate possibly experienced by an infected host because of the presence of each adult parasite). As for the dynamics of snail hosts, μ S is the baseline mortality rate, whereas μ S N i is the constant recruitment rate (local population size in the absence of the parasite is N i ). The parameter b represents the exposure rate of susceptible snails to miracidia in the freshwater environment. Exposure triggers a transition to the infected compartment (possible delays between exposure and onset of infectivity52 are neglected here for the sake of model minimality). Infective snails suffer from an extra-mortality rate α S . As for the dynamics of larval stages, cercariae are shed by infected snails at rate π C and die at rate μ C . Similarly, miracidia are shed by infected human hosts and die at rate μ M ; specifically, the total human contamination rate for community i is $G i = π M δ i ∑ j = 1 n Q j i W j /2$, with π M being the shedding rate of miracidia by infected humans, δ i the possibly site-specific probability of contaminating accessible freshwater, and Q ji the probability that inhabitants of community j come in contact with freshwater in community i. Shedding is assumed to be proportional to the total number $W j /2$ of adult parasite pairs undergoing sexual reproduction in the human hosts of community j, with $W j = ∑ p = 1 P p H j p$. Note that this may represent an overestimate, especially at low parasite counts48.

From the model, it is straightforward to evaluate disease prevalence (namely, by assuming that a minimum number T of parasites per host is required for the infection to be clinically apparent), the APB (a common measure of community-level infection intensity) and suitable indicators of parasite aggregation within each community, such as the dispersion index (defined as the ratio between the sample variance of the parasite distribution and the APB) and the aggregation parameter (obtained by fitting a negative binomial to the simulated parasite distribution). These quantities can be evaluated ex-post, i.e. as outputs of model simulations. Technical details are reported in SI.

### Model set-ups

We use four model set-ups (M1–M4) to investigate the role played by spatial heterogeneity and human mobility in schistosomiasis transmission. The different set-ups are characterized by either a coarse-grained (M1 and M2) or a fine-grained (M3 and M4) description of environmental heterogeneity, and by either neglecting (M1 and M3) or taking into account (M2 and M4) human mobility. As for spatial heterogeneity, communities are grouped into two clusters according to transmission risk (either low or high) in M1 and M2, while they are endowed with site-specific exposure/contamination rates (depending on environmental and socioeconomic factors) in M3 and M4. As for human movement, the mobility matrix is set to be the identity matrix in M1 and M3, while its entries are estimated from CDRs in M2 and M4; note that in M1 and M3 the system describing schistosomiasis transmission reduces to a set of spatially disconnected local models. Technical details on the evaluation of environmental heterogeneity and human mobility from georeferenced data are given in SI.

### Application of the model to Senegal

The model is run at the spatial scale of the arrondissements (third-level administrative units as of 2013). A high-resolution population density map (Fig. 1b) is used to obtain local values of K i (Fig. S2a). To simplify the structure of the model some equilibrium assumptions are made for the larval stages of the parasite. As a result, the synthetic exposure (β i , accounting also for snail abundance) and contamination (χ i ) rates are introduced. These parameters are assumed to increase with the product (Fig. S2b,c) between the fraction of people living in rural areas (ρ i ) and the availability of environmental freshwater (ω i , measured as the total length of the rivers encompassed in each spatial unit, Fig. 1c), i.e. $β i = β 0 (1+φ ρ i ω i )$ and $χ i = χ 0 (1+ξ ρ i ω i )$. Human mobility is estimated from the anonymized movement routes of about 9 million Sonatel mobile phone users (corresponding to more than 60% of the Senegalese population) collected for one year, from January 1 to December 31, 2013. The entries of the mobility matrix Q are assumed to be proportional to the number of phone calls made by users living in site i while being in site j (Fig. 1d). The home site of each user is identified as the place where the most calls are made during night hours (7 pm–7 am). The model is calibrated against regional estimates of urogenital schistosomiasis prevalence (Fig. S2d) upscaled from the health-district data available at the Senegalese Ministry of Health (Fig. 1e). The prevalence of schistosomiasis in the country is periodically evaluated during national surveys conducted within the PNLB. Model calibration is performed against the data that are currently in use at the Ministry of Health, and that refer to surveys conducted through standard diagnostic techniques (urine testing via reagent strips, followed by filtration and microscopic examination of samples positive for haematuria) in schools selected from all of the 14 regions of Senegal between 1996 and 2013. Performing model calibration at the regional (rather than a finer) spatial scale is deemed to decrease the effects of the uncertainties possibly associated with census and/or epidemiological data. Details and references for model parameterization and calibration are reported in SI, along with a description of some control strategies aimed at decreasing disease burden by preventing human exposure and contamination. Although used here to study schistosomiasis transmission in Senegal, our modeling framework can easily be applied to other geographical regions, provided that suitable data for model calibration are available.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## References

1. 1.

WHO. Schistosomiasis. Fact sheet n. 115. Tech. Rep., Available online at http://www.who.int/mediacentre/factsheets/fs115/en/, Date of access: 01/08/2016 (2016).

2. 2.

Lozano, R. et al. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380, 2095–2128 (2012).

3. 3.

Thétiot-Laurent, S. A., Boissier, J., Robert, A. & Meunier, B. Schistosomiasis chemotherapy. Angewandte Chemie 52, 7936–7956 (2013).

4. 4.

Kheir, M. M. et al. Mortality due to schistosomiasis mansoni: a field study in Sudan. The American Journal of Tropical Medicine and Hygiene 60, 307–310 (1999).

5. 5.

Fenwick, A. The global burden of neglected tropical diseases. Public Health 126, 233–236 (2012).

6. 6.

Colley, D. G., Bustinduy, A. L., Secor, W. E. & King, C. H. Human schistosomiasis. Lancet 383, 2253–2264 (2014).

7. 7.

Gurarie, D. & Seto, E. Y. W. Connectivity sustains disease transmission in environments with low potential for endemicity: modelling schistosomiasis with hydrologic and social connectivities. Journal of the Royal Society Interface 6, 495–508 (2009).

8. 8.

Perez-Saez, J. et al. A theoretical analysis of the geography of schistosomiasis in Burkina Faso highlights the roles of human mobility and water resources development in disease transmission. PLoS Neglected Tropical Diseases 9, e0004127 (2015).

9. 9.

Maszle, D. R., Whitehead, P. G., Johnson, R. C. & Spear, R. C. Hydrological studies of schistosomiasis transport in Sichuan Province, China. Science of the Total Environment 216, 193–203 (1998).

10. 10.

Lowe, D. et al. Transport of Schistosoma japonicum cercariae and the feasibility of niclosamide for cercariae control. Parasitology International 54, 83–89 (2005).

11. 11.

Steinmann, P., Keiser, J., Bos, R., Tanner, M. & Utzinger, J. Schistosomiasis and water resources development: systematic review, meta-analysis, and estimates of people at risk. Lancet Infectious Diseases 7, 411–426 (2006).

12. 12.

Clennon, J. A., King, C. H., Muchiri, E. M. & Kitron, U. Hydrological modelling of snail dispersal patterns in Msambweni, Kenya and potential resurgence of Schistosoma haematobium transmission. Parasitology 134, 683–693 (2007).

13. 13.

Bella, H., de C. Marshall, T. F., Omer, A. H. S. & Vaughan, J. P. Migrant workers and schistosomiasis in the Gezira, Sudan. Transactions of the Royal Society of Tropical Medicine and Hygiene 74, 36–39 (1980).

14. 14.

Cetron, M. S. et al. Schistosomiasis in Lake Malawi. Lancet 348, 1274–1278 (1996).

15. 15.

Kloos, H. et al. The role of population movement in the epidemiology and control of schistosomiasis in Brazil: a preliminary typology of population movement. Memórias do Instituto Oswaldo Cruz 105, 578–586 (2010).

16. 16.

Remais, J. Modelling environmentally-mediated infectious diseases of humans: transmission dynamics of schistosomiasis in China. In Michael, E. & Spear, R. (eds) Modelling Parasite Transmission and Control, 79–98 (Springer, 2010).

17. 17.

González, M. C., Hidalgo, C. A. & Barabási, A. L. Understanding individual human mobility patterns. Nature 453, 479–482 (2008).

18. 18.

Song, C., Koren, T., Wang, P. & Barabási, A. L. Modelling the scaling properties of human mobility. Nature Physics 6, 818–823 (2010).

19. 19.

Simini, F., González, M. C., Maritan, A. & Barabási, A. L. A universal model for mobility and migration patterns. Nature 484, 96–100 (2012).

20. 20.

Song, C., Qu, Z., Blumm, N. & Barabási, A. L. Limits of predictability in human mobility. Science 327, 1018–1021 (2010).

21. 21.

Lu, X., Wetter, E., Bharti, N., Tatem, A. J. & Bengtsson, L. Approaching the limit of predictability in human mobility. Scientific Reports 3, 2923 (2013).

22. 22.

Wesolowski, A. et al. Evaluating spatial interaction models for regional mobility in sub-Saharan Africa. PLoS Computational Biology 11, e1004267 (2015).

23. 23.

Palchykov, V., Mitrović, M., Jo, H. H., Saramäki, J. & Pan, R. K. Inferring human mobility using communication patterns. Scientific Reports 4, 6174 (2014).

24. 24.

Blondel, V. D., Decuyper, A. & Krings, G. A survey of results on mobile phone datasets analysis. EPJ Data Science 4, 10 (2015).

25. 25.

Tatem, A. J. et al. The use of mobile phone data for the estimation of the travel patterns and imported Plasmodium falciparum rates among Zanzibar residents. Malaria Journal 8, 287 (2009).

26. 26.

Le Menach, A. et al. Travel risk, malaria importation and malaria transmission in Zanzibar. Scientific Reports 1, 93 (2011).

27. 27.

Wesolowski, A. et al. Quantifying the impact of human mobility on malaria. Science 6104, 267–270 (2012).

28. 28.

Tatem, A. J. et al. Integrating rapid risk mapping and mobile phone call record data for strategic malaria elimination planning. Malaria Journal 13, 52 (2014).

29. 29.

Tizzoni, M. et al. On the use of human mobility proxies for modeling epidemics. PLoS Computational Biology 10, e1003716 (2014).

30. 30.

Wesolowski, A. et al. Quantifying travel behavior for infectious disease research: a comparison of data from surveys and mobile phones. Scientific Reports 4, 5678 (2014).

31. 31.

Bengtsson, L. et al. Using mobile phone data to predict the spatial spread of cholera. Scientific Reports 5, 8923 (2015).

32. 32.

Wesolowski, A. et al. Quantifying seasonal population fluxes driving rubella transmission dynamics using mobile phone data. Proceedings of the National Academy of Sciences USA 112, 11114–11119 (2015).

33. 33.

Wesolowski, A. et al. Impact of human mobility on the emergence of dengue epidemics in Pakistan. Proceedings of the National Academy of Sciences USA 112, 11887–11892 (2015).

34. 34.

Brdar, S., Gavrić, K., Ćulibrk, F. & Crnojević, V. Unveiling spatial epidemiology of HIV with mobile phone data. Scientific Reports 6, 19342 (2016).

35. 35.

Finger, F. et al. Mobile phone data highlights the role of mass gatherings in the spreading of cholera outbreaks. Proceedings of the National Academy of Sciences USA 113, 6421–6426 (2016).

36. 36.

Mao, L., Yin, L., Song, X. & Mei, S. Mapping intra-urban transmission risk of dengue fever with big hourly cellphone data. Acta Tropica 162, 188–195 (2016).

37. 37.

Ciddio, M. et al. The spatial spread of schistosomiasis: A multidimensional network model applied to Saint-Louis region, Senegal. Advances in Water Resources in press, doi:10.1016/j.advwatres.2016.10.012 (2017).

38. 38.

Ndir, O. Situation des schistosomoses au Sénégal. In Chippaux, J. P. (ed.) La Lutte contre les Schistosomoses en Afrique de lOuest, 225–236 (IRD Editions, 2000).

39. 39.

Schur, N. et al. Geostatistical model-based estimates of schistosomiasis prevalence among individuals aged ≤20 years in West Africa. PLoS Neglected Tropical Diseases 5, e1194 (2011).

40. 40.

WHO. Global health estimates, years 2000–2012. Tech. Rep., Available online at http://www.who.int/healthinfo/global_burden_disease, Date of access: 01/08/2016 (2016).

41. 41.

Gurarie, D., King, C. H. & Wang, X. A new approach to modelling schistosomiasis transmission based on stratified worm burden. Parasitology 137, 1951–1965 (2010).

42. 42.

Woolhouse, M. E. J., Etard, J. F., Dietz, K., Ndhlovu, P. D. & Chandiwana, S. K. Heterogeneities in schistosome transmission dynamics and control. Parasitology 117, 475–482 (1998).

43. 43.

Gurarie, D. & King, C. H. Heterogeneous model of schistosomiasis transmission and long-term control: the combined influence of spatial variation and age-dependent factors on optimal allocation of drug therapy. Parasitology 130, 49–65 (2005).

44. 44.

Gurarie, D. & King, C. H. Population biology of Schistosoma mating, aggregation, and transmission breakpoints: More reliable model analysis for the end-game in communities at risk. PLoS One 9, e115875 (2014).

45. 45.

Garcia, A. J., Pindolia, D. K., Lopiano, K. K. & Tatem, A. J. Modeling internal migration flows in sub-Saharan Africa using census microdata. Migration Studies 3, 89–110 (2015).

46. 46.

Martin-Gutierrez, S. et al. Agricultural activity shapes the communication and migration patterns in Senegal. Chaos 26, 065305 (2016).

47. 47.

Echenberg, M. Africa in the Time of Cholera (Cambridge University Press, New York, USA, 2011).

48. 48.

Gurarie, D., King, C. H., Yoon, N. & Li, E. Refined stratified-worm-burden models that incorporate specific biological features of human and snail hosts provide better estimates of Schistosoma diagnosis, transmission, and control. Parasites and Vectors 9, 428 (2016).

49. 49.

Perez-Saez, J. et al. Hydrology and density feedbacks control the ecology of intermediate hosts of schistosomiasis across habitats in seasonal climates. Proceedings of the National Academy of Sciences USA 113, 6427–6432 (2016).

50. 50.

Gurarie, D., King, C. H., Yoon, N., Alsallaq, R. & Wang, X. Seasonal dynamics of snail populations in coastal Kenya: Model calibration and snail control. Advances in Water Resources in press, doi:10.1016/j.advwatres.2016.11.008 (2017).

51. 51.

McCreesh, N. & Booth, M. Challenges in predicting the effects of climate change on Schistosoma mansoni and Schistosoma haematobium transmission potential. Trends in Parasitology 29, 548–555 (2013).

52. 52.

Ciddio, M., Mari, L., Gatto, M., Rinaldo, A. & Casagrandi, R. The temporal patterns of disease severity and prevalence in schistosomiasis. Chaos 25, 036405 (2015).

53. 53.

Mari, L., Casagrandi, R., Bertuzzo, E., Rinaldo, A. & Gatto, M. Floquet theory for seasonal environmental forcing of spatially-explicit waterborne epidemics. Theoretical Ecology 7, 351–365 (2014).

54. 54.

Alkalay, A. S. et al. The prawn Macrobrachium vollenhovenii in the Senegal river basin: towards sustainable restocking of all-male populations for biological control of schistosomiasis. PLoS Neglected Tropical Diseases 8, e3060 (2014).

55. 55.

Sokolow, S. H., Lafferty, K. D. & Kuris, A. M. Regulation of laboratory populations of snails (Biomphalaria and Bulinus spp.) by river prawns, Macrobrachium spp. (Decapoda, Palaemonidae): implications for control of schistosomiasis. Acta Tropica 132, 64–74 (2014).

56. 56.

Sokolow, S. H. et al. Reduced transmission of human schistosomiasis after restoration of a native river prawn that preys on the snail intermediate host. Proceedings of the National Academy of Sciences USA 112, 9650–9655 (2015).

57. 57.

Rohr, J. R. et al. Agrochemicals increase trematode infections in a declining amphibian species. Nature 455, 1235–1239 (2008).

58. 58.

Li, Y. S. et al. Large water management projects and schistosomiasis control, Dongting Lake Region, China. Emerging Infectious Diseases 13, 973–979 (2007).

59. 59.

Picquet, M. et al. The epidemiology of human schistosomiasis in the Senegal river basin. Transactions of the Royal Society of Tropical Medicine and Hygiene 90, 340–346 (1996).

60. 60.

Grimes, J. E. T. et al. The relationship between water, sanitation and schistosomiasis: a systematic review and meta-analysis. PLoS Neglected Tropical Diseases 8, e3296 (2014).

61. 61.

Grimes, J. E. T. et al. The roles of water, sanitation and hygiene in reducing schistosomiasis: a review. Parasites & Vectors 8, 156 (2015).

62. 62.

Rollinson, D. et al. Time to set the agenda for schistosomiasis elimination. Acta Tropica 128, 423–440 (2013).

63. 63.

Mari, L. et al. On the predictive ability of mechanistic models for the Haitian cholera epidemic. Journal of the Royal Society Interface 12, 20140840 (2015).

64. 64.

Pasetto, D., Finger, F., Rinaldo, A. & Bertuzzo, E. Real-time projections of cholera outbreaks through data assimilation and rainfall forecasting. Advances in Water Resources in press, doi:10.1016/j.advwatres.2016.10.004 (2017).

## Acknowledgements

The authors are grateful to A. Rinaldo, E. Bertuzzo and J. Perez-Saez (Ecole Polytechnique Fédérale de Lausanne), Nicolas de Cordes and Stephanie de Prevoisin (Orange), Dr. Marie Khémesse Ngom Ndiaye (Senegalese Ministry of Health) and the other winner teams of the D4D (Data for Development) Senegal Challenge organized by Orange and Sonatel in 2014 (Orange, Challenge 4 development, http://www.d4d.orange.com/; last date of access: 03/02/2017). LM, MG, MC and RC acknowledge support from the Bill & Melinda Gates Foundation (grant OPP1114791). LM and RC were also supported by Politecnico di Milano through the Polisocial Award programme (project MASTR-SLS, http://www.polisocial.polimi.it/en/home-en/; last date of access: 03/02/2017). SHS and GADL were supported by NSF grant #1414102 and NIH grant #R01TW010286. The authors wish also to thank two anonymous reviewers for their useful comments.

## Author information

### Affiliations

1. #### Politecnico di Milano, Dipartimento di Elettronica, Informazione e Bioingegneria, Milano, IT, 20133, Italy

• Lorenzo Mari
• , Marino Gatto
• , Manuela Ciddio
•  & Renato Casagrandi

3. #### Stanford University, Hopkins Marine Station, Pacific Grove, CA, 93950, USA

• Susanne H. Sokolow
•  & Giulio A. De Leo
4. #### University of California, Marine Science Institute, Santa Barbara, CA, 93106, USA

• Susanne H. Sokolow

### Contributions

L.M., M.G., R.C. designed research; L.M., M.C., R.C. performed research; L.M., M.G., M.C., E.D.D., S.H.S., G.A.D.L., R.C. analyzed data; L.M., M.G., R.C. wrote the paper.

### Competing Interests

The authors declare that they have no competing interests.

### Corresponding authors

Correspondence to Lorenzo Mari or Renato Casagrandi.