Exposure measurement error and the characterization of child exposure to fecal contamination in drinking water

Goddard, Frederick G. B.; Chang, Howard H.; Clasen, Thomas F.; Sarnat, Jeremy A.

doi:10.1038/s41545-020-0063-9


Article
Open access
Published: 06 April 2020


Frederick G. B. Goddard ORCID: orcid.org/0000-0001-7585-0120¹,
Howard H. Chang²,
Thomas F. Clasen¹ &
…
Jeremy A. Sarnat¹

npj Clean Water volume 3, Article number: 19 (2020)


Abstract

Characterizing fecal contamination exposure from drinking water can introduce exposure measurement errors, i.e., differences between the observed and true exposure. These errors can mask the true relationship between fecal contamination exposure and waterborne diseases. We present a framework to quantify the impact of measurement errors on exposure–outcome health effect estimates introduced by variability in measured drinking water fecal contamination levels and household versus community sampling strategies. We matched fecal indicator bacteria (FIB) data for >37,000 drinking water samples to children aged 0–72 months from 19 studies in low- and middle-income countries and took two complementary analytical approaches. We found that household-level exposure assessments may attenuate effect estimates of FIB concentrations in drinking water on diarrhea, and single water samples may attenuate health effect estimates of FIB concentrations on linear growth. To understand the health effects of fecal contamination exposure, measurement error frameworks can be used to estimate more biologically relevant exposures.

Introduction

Methods to characterize exposure to fecal contamination from contaminated drinking water in low- and middle-income settings have typically been limited to estimating fecal loading in the environment using indicators of fecal contamination¹. These are proxy measures, in the sense that rather than measuring the actual ingestion of enteric pathogens associated with fecal contamination, they infer exposure by measuring fecal indicator bacteria (FIB) concentrations in the environment and are heavily based on assumptions on the interactions of individuals with that environment. In addition to the well-documented shortcomings of using FIB as a proxy for enteric pathogens², the difference between the observed exposure, i.e., the exposure assigned from these proxy measures, and the true exposure represents a potential form of exposure measurement error. Other areas of environmental health have detailed how exposure measurement error may introduce bias and uncertainty in estimated exposure–outcome relationships, thereby obscuring true associations³. Correspondingly, fields such as air pollution epidemiology have seen increased emphasis on understanding errors associated with differences between individual or personal exposures and other proxy exposure measures used in health effects modeling⁴.

We recently compiled individual participant data (IPD) for a systematic review and meta-analysis. We matched household-level drinking water FIB concentrations as a proxy measure for individual-level enteric pathogen exposure to >43,000 diarrhea reports and >10,000 growth measures for children under the age of 5 years across 19 studies in low- and middle-income countries (Goddard et al., under review). This analysis was an update and expansion of previous meta-analyses that found mixed results on the relationship between FIB concentrations in drinking water and diarrhea and did not consider a more chronic health outcome such as child growth^5,6,7. Our findings suggest that FIB concentrations in household drinking water are associated with both reported diarrhea (odds ratio (OR) 1.09; 95% confidence interval (CI) 1.04, 1.13) and lower height-for-age Z (HAZ) scores (HAZ −0.04; 95% CI −0.06, −0.01). Notably, we also observed moderate heterogeneity among studies in the strengths of association for both the diarrhea (I² = 34%; 95% CI 0–62%) and linear growth analyses (I² = 19%; 95% CI 0–63%).

A primary limitation of this analysis was potential error in outcome measurement. In these studies, most of the data are from non-blinded intervention protocols where caregiver-reported diarrhea and linear growth measures are subject to participant and enumerator bias⁸. Another possible source of bias and uncertainty in our analysis is error in the assigned exposure. Prior research on the effects of exposure measurement error from proxy measures of exposure in air pollution epidemiology suggests that these proxy measures can introduce uncertainty and bias risk estimates toward no observed effect^9,10,11. Similar effects of exposure measurement error have also been shown in other areas, such as chemical exposures¹² and diastolic blood pressure measurements¹³. Based on these findings, it is possible that the results of our IPD analysis exhibit similar uncertainty and bias due to exposure measurement error from the use of FIB concentrations in household drinking water as a proxy for personal exposure to enteric pathogens.

Theoretically, there are a number of different potential sources of error in enteric pathogen exposure assessments. Some examples include (1) temporal or spatial variability in water quality, (2) assigning household- or community-level water quality to individuals, (3) the use of FIBs as proxies for enteric pathogens, and (4) processing errors (i.e., errors arising during sample collection, transport, or laboratory instrumentation). This is not an exhaustive list of sources of error, and each source could be further broken down into underlying sources of error. Here we introduce a conceptualized model of potential sources of measurement error using a formal measurement error framework¹⁴. We demonstrate how such a framework might be evaluated and its ability to quantify the relative contributions of measurement error, using empirical data of drinking water across several global low- and middle-income settings.

Although quite limited, prior research on the effects of exposure measurement error on waterborne disease epidemiology has found preliminary evidence of regression dilution bias between 14% and 57% from the use of FIB on the relationship between fecal contamination in recreational water and swimming-associated illness¹⁵. Another study found that spatiotemporal variability in rainfall data attenuated subsequent associations between heavy rainfall and diarrhea between 35% and 45%¹⁶. These studies focused on single components of error, and we did not find any formal discussions pertaining to multiple sources of measurement error for exposure to fecal contamination in the peer-reviewed literature. Here we introduce an exposure measurement error framework to conceptualize multiple possible components of error. We adapt a framework for fecal contamination exposure in drinking water based on an approach presented in Zeger et al. to distinguish sources of exposure measurement error¹⁴. While our analysis borrows from extensive work examining air pollution health effects and is limited to fecal contamination as a proxy for enteric pathogens, we intend that this paper serves as an initial step for discussing ways of incorporating estimates of exposure measurement error for enteric pathogen exposures.

Zeger et al. consider sources of error in the assignment of ambient air quality to a population from central monitoring sites, a more distal measure of exposure than attempting to quantify personal exposures. While this framework is contextual to time series studies of air pollution health effects, we see parallels in the proxy measures commonly used for fecal contamination exposure assessments to central-site air pollution exposure assessments, although we do not claim this framework includes all possible sources of error. In the current example involving fecal contamination exposure and response, as in the air pollution design settings that Zeger et al. used, technical and logistical constraints as well as limited resources lead to an inability to obtain measures of true personal exposure x for individual i at time t. Instead, exposure may be estimated by measuring household FIB concentrations z at time t. In Fig. 1, we summarize the differences between measured FIB concentrations z_t, the only component in this framework that is actively measured, and true personal exposure to fecal contamination x_it as a proxy for fecal contamination exposure.

**Fig. 1: Fecal contamination in drinking water exposure measurement error framework (adapted from Zeger et al.¹⁴).**

There is a difference between measured FIB concentrations in household drinking water z_t and true personal exposure to fecal contamination x_it—the exposure measurement error—which we split into three components of error in accordance with the Zeger et al. framework (Eq. 1).

$$x_{it} = z_t + \left( x_{it} - {\bar x_t} \right) + \left( {\bar x_t} - {z_t^ \ast } \right) + \left( {z_t^ \ast - z_t} \right)$$

(1)

where $\left( x_{it} - {\bar x_t} \right)$ describes error from the difference in aggregate fecal contamination exposure across a population ${\bar x_t}$ (i.e., members of a household) and personal exposure x_it; $\left( {{\bar x_t} - z_t^ \ast } \right)$ describes measurement error from assigning household water fecal contamination $z_t^ \ast $ as the exposure and not considering other exposures, such as exposure to fecal contamination experienced in the community $w_t^ \ast$ that may make up the aggregate exposure ${\bar x_t}$ across a population; and $\left( {z_t^ \ast - z_t} \right)$ describes measurement error from the difference in measured household water FIB concentrations z_t as an indicator of fecal contamination and the true levels of fecal contamination in household drinking water $z_t^ \ast$.
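As an illustration of how Eq. 1 partitions the gap between measured and true exposure, the following minimal Python sketch splits a hypothetical exposure into the three error components and checks that they sum back to x_it. It is not part of the original analysis (which was conducted in R); the function name `decompose_exposure` and all numeric values are assumptions chosen only for illustration.

```python
def decompose_exposure(x_it, x_bar_t, z_star_t, z_t):
    """Return the three error components of Eq. 1 (all on a log10 FIB scale)."""
    return {
        # Component 1: personal exposure vs. aggregate exposure
        "personal_vs_aggregate": x_it - x_bar_t,
        # Component 2: aggregate exposure vs. true household water contamination
        "aggregate_vs_household": x_bar_t - z_star_t,
        # Component 3: true household contamination vs. the measured sample
        "household_vs_measured": z_star_t - z_t,
    }


# Hypothetical values, for illustration only
z_t = 1.5
parts = decompose_exposure(x_it=2.4, x_bar_t=2.1, z_star_t=1.8, z_t=z_t)

# Eq. 1: the measured value plus the three components recovers x_it
reconstructed = z_t + sum(parts.values())
```

Because the intermediate terms cancel, the identity holds for any values; the framework's value lies in attaching an interpretation to each difference, not in the arithmetic itself.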

Our current analysis sought to address the second and third components of this framework with the goal of examining how they may affect exposure–outcome relationships for exposure to fecal contamination in drinking water. This analysis does not seek to validate any single exposure characterization method but rather to describe potential sources of error in current methods to help inform future method development. In conducting this analysis, we sought to assess the evidence for and magnitude of exposure measurement error from (1) assigning household-level FIB concentrations $z_t^ \ast$ as the exposure and not considering community-level FIB concentrations $w_t^ \ast$ (component 2) on health effect estimates of FIB concentrations in drinking water on child diarrhea and (2) using a single FIB measure z_t compared to repeated longitudinal measures $z_t^ \ast$ (component 3) on health effect estimates of FIB concentrations in drinking water on linear growth.

Results

The dataset we compiled included studies of varying sizes with FIB data for drinking water available from 98 to 2137 households per study (Table 1). Four studies included only cross-sectional water sample collection, but most had collected repeated water samples over time with samples typically being collected monthly, quarterly, or annually. To evaluate findings from our simulations, matched diarrhea data were available from all included studies and matched growth data were available from seven studies.

Table 1 Summary characteristics of the included studies.


Household versus community exposure

The simulations comparing household to community exposure included 37,119 observations (82% rural, 18% urban) from 16 of the included studies, with between 1 and 12 observations per child. Studies by Arnold et al.²⁶, Brown et al.³⁶, and Pickering et al.⁴⁶ were excluded from these simulations because the data were not compatible for estimating median community water FIB concentrations (i.e., specific sample collection dates were not available or only one sample was collected in every community). The simulations found that if children experienced exposure to fecal contamination in drinking water outside of their household, then using household FIB concentrations $z_t^ \ast$ exclusively as the error-prone exposure variable may lead to an attenuation in the observed FIB–diarrhea relationship even at low levels of community exposure (Fig. 2). If the aggregate exposure $\bar x_t$ was represented by 90% household and 10% community exposure, we found that the estimated odds of diarrhea for 1-log₁₀ higher FIB concentrations in drinking water, using household FIB concentrations in drinking water $z_t^ \ast$ as the error-prone exposure variable, were OR = 1.32 compared to the assigned odds of diarrhea OR = 1.50 (α = 0.69). This trend continued as the assumed aggregate exposure $\bar x_t$ consisted of increasing levels of community exposure $w_t^ \ast$. If the aggregate exposure was represented by 100% community exposure, we found that the estimated odds of diarrhea, using household water as the error-prone exposure variable, were OR = 1.06 (α = 0.15). Our findings were consistent between urban and rural areas. Sensitivity analyses for these findings are provided in Supplementary Tables S1 and S2.
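The attenuation factors quoted above are ratios of log odds, as defined in the Methods. A minimal Python sketch of that arithmetic, using the rounded odds ratios reported in this paragraph, is given below; the function name `attenuation_factor` is an assumption introduced for illustration. Values recomputed from rounded ORs (roughly 0.68 and 0.14) differ slightly from the reported α = 0.69 and α = 0.15, presumably because the reported factors were derived from unrounded coefficients.

```python
import math


def attenuation_factor(or_estimated, or_assigned):
    """Ratio of estimated to assigned log odds (alpha = beta1* / beta1)."""
    return math.log(or_estimated) / math.log(or_assigned)


# 90% household / 10% community scenario: OR = 1.32 against the assigned OR = 1.50
alpha_90_10 = attenuation_factor(1.32, 1.50)

# 100% community scenario: OR = 1.06 against the assigned OR = 1.50
alpha_0_100 = attenuation_factor(1.06, 1.50)
```

An attenuation factor of 1 would indicate no bias toward the null, and values closer to 0 indicate stronger attenuation.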

Fig. 2: Simulated odds of diarrhea for 1-log₁₀ higher FIB concentrations in drinking water when only household water FIB concentrations $z_t^\ast$ were used as the exposure variable for different household–community exposure scenarios and the corresponding attenuation factors.

For the evaluation with empirical diarrhea data, we used the same dataset as we did for the simulations, with the exception of using field-reported diarrhea instead of assigning diarrhea cases. In the combined analysis, we found that household water fecal contamination $z_t^ \ast$ assigned exclusively as the error-prone exposure variable attenuated the association between FIB concentrations in drinking water and diarrhea in comparison to a mixture of household and community water fecal contamination up to assigning 20% household and 80% community water FIB concentrations (Fig. 3). However, this attenuation was not as pronounced as suggested in the simulations, with the greatest attenuation factor found to be α = 0.79 (60% household, 40% community exposure). The stratified results suggested that in urban areas the attenuation from using household water fecal contamination $z_t^ \ast$ was limited to scenarios up to 70% household and 30% community water quality, and in rural areas, it extended to 10% household and 90% community water fecal contamination.

Fig. 3: Estimated odds of diarrhea for 1-log₁₀ higher FIB concentrations in drinking water for different household–community exposure assignments and the corresponding attenuation factors.

After stratifying by age, the odds of diarrhea for 1-log₁₀ higher FIB concentrations in drinking water for children aged 0–23 months were similar to the odds of diarrhea for children aged 24–72 months when household water fecal contamination $z_t^ \ast$ was assigned as the exposure variable. However, when replacing household water fecal contamination with community water fecal contamination $w_t^ \ast$ in 10% increments, there was a trend of higher odds of diarrhea for children aged 24–72 months but not for children aged 0–23 months for up to 20% household and 80% community water quality (Fig. 4).

Fig. 4: Estimated odds of diarrhea for 1-log₁₀ higher FIB concentrations in drinking water for different household–community exposure assignments, stratified by non-ambulatory and ambulatory pre-school-aged children, and the corresponding attenuation factors.

Single versus multiple samples

The simulations comparing the effects of defining exposure with a single water sample compared to multiple samples included 24,806 unique children from the 19 included studies that had ≥1 matched FIB concentration estimates in drinking water. These simulations indicated that using FIB concentrations from single water samples z_t compared to the median of multiple samples attenuated estimated differences in HAZ scores associated with fecal contamination in drinking water, with similar findings between wet and dry season months (Fig. 5). If the household water fecal contamination $z_t^ \ast$ was represented by the median of two samples, then randomly selecting one of the two samples almost halved the observed difference in HAZ scores associated with FIB concentrations in drinking water from the assigned HAZ = −0.20 to HAZ = −0.11 (α = 0.56). This finding was more pronounced when household water fecal contamination $z_t^ \ast$ was represented by three (HAZ = −0.10; α = 0.52) or four samples (HAZ = −0.09; α = 0.43). Sensitivity analyses for these findings are detailed in Supplementary Tables S3 and S4.
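The mechanism behind this attenuation can be sketched with a small synthetic simulation. The Python example below is not the authors' code (their analysis used R and empirical FIB data). It assumes normally distributed log₁₀ FIB values, an assigned HAZ difference of −0.20 per 1-log₁₀, and arbitrary variance parameters, all of which are illustrative assumptions. For each hypothetical child it draws several water samples, assigns HAZ from the median of those samples, and then regresses HAZ on one randomly chosen sample.

```python
import random
from statistics import fmean, median


def ols_slope(x, y):
    """Slope from a simple least-squares regression of y on x."""
    mean_x, mean_y = fmean(x), fmean(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def single_sample_alpha(n_samples, n_children=20000, beta=-0.20, seed=1):
    """Attenuation factor when one random sample replaces the median of n_samples."""
    rng = random.Random(seed)
    single, haz = [], []
    for _ in range(n_children):
        # Assumed long-term household level and sample-to-sample variability
        household_level = rng.gauss(1.0, 0.7)
        samples = [household_level + rng.gauss(0.0, 1.0) for _ in range(n_samples)]
        z_star = median(samples)            # exposure used to assign HAZ
        single.append(rng.choice(samples))  # error-prone single sample z_t
        haz.append(-1.5 + beta * z_star + rng.gauss(0.0, 0.5))
    # Ratio of the slope estimated from single samples to the assigned slope
    return ols_slope(single, haz) / beta


alphas = {k: single_sample_alpha(k) for k in (2, 3, 4)}
```

Under these assumptions the estimated slope is pulled toward zero, and more so as the median is taken over more samples, which mirrors the direction of the pattern reported above. The magnitudes depend entirely on the assumed variances and should not be compared with the reported values.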

**Fig. 5: Simulated difference in HAZ scores for 1-log₁₀ higher FIB concentrations in drinking water and the corresponding attenuation factors.**

For the evaluation of these simulations, we were limited by empirical linear growth data availability, with linear growth data available for 3311, 743, and 233 children with 2, 3, and ≥4 matched water samples, respectively. As a result, we were not able to stratify this analysis by season as we did with the simulations, and the baseline effect estimates and corresponding uncertainties around these estimates vary by group because they represent different samples (Fig. 6). The difference in HAZ scores associated with higher FIB concentrations in drinking water was consistently closer to zero (i.e., no effect) when using a single sample z_t compared to the median of multiple samples. Similar to findings from the simulations, using a single sample compared to the median of two samples approximately halved the estimated difference in HAZ scores associated with FIB concentrations in drinking water (α = 0.56), and this was more pronounced for the median of three or four samples (α = 0.54; α = 0.38).

**Fig. 6: Estimated difference in HAZ scores with 95% confidence intervals for 1-log₁₀ higher FIB concentrations in drinking water and the corresponding attenuation factors.**

Discussion

We adapted and introduced a framework to assess measurement error when characterizing child exposure to fecal contamination in drinking water, i.e., the difference between exposure assigned by proxy measures of exposure and the true exposure experienced by an individual. These frameworks can help prioritize current research gaps by identifying areas within fecal exposure assessments that are limited or missing and by quantifying components of error that are most critical to biases in waterborne disease epidemiology. Ideally, generating improved exposure data could lead to a better understanding of the true associations between fecal contamination along different pathways and child health. This analysis primarily serves as an initial effort to apply an exposure measurement error framework within the field of waterborne disease epidemiology. In so doing, we aspire to understand the presence and magnitude of several sources of measurement error. Our analyses showed how components of error may attenuate estimated exposure–outcome relationships using empirical data from an extensive dataset of studies collected in low- and middle-income settings. Our findings provide indication that the previously reported odds of diarrhea and reduction in HAZ scores associated with fecal contamination in drinking water (Goddard et al., under review) may be prone to regression dilution bias and thus may be underestimating true exposure–outcome relationships.

We introduced three different components of exposure measurement error. The first component may emerge from assigning household water fecal contamination data to individual household members who interact with their environment differently. Substantial heterogeneity of between-child interactions with their domestic environment has been shown in both urban and rural settings for different age groups in the 0–5-year age range^17,18. In addition, differential drinking water ingestion rates by age can lead to heterogeneity in the ingested doses of fecal contamination¹⁹, and infants may experience limited exposure to household water from ingestion before weaning²⁰. This may lead to differences in dose–response between members of the same household. To test how the first component of this exposure measurement error framework can be applied, small controlled panel studies are needed to generate estimates of personal exposure and compare those to household-level estimates⁹.

The second component of error may occur when exposure to fecal contamination in drinking water outside of the household is not incorporated into exposure assessments. A recent study characterizing fecal exposure in Accra, Ghana as part of the SaniPath research program reported widespread fecal contamination in both domestic and public domains²¹. Measurement error from assigning household water fecal contamination as the exposure does not only depend on the presence of fecal contamination in the public domain but also on the study population’s interaction with water in that domain. To our knowledge, no published studies have quantified child exposure to contaminated water in different microenvironments in the domestic and public domains, but time–activity analyses in air pollution studies have long been conducted for exposure assessments²² and have shown that children spend extensive amounts of time outside of their domestic environment²³.

Findings from our diarrhea simulations suggest that, if children are experiencing exposure to fecal contamination in drinking water outside of their households, then using household water FIB concentrations as a proxy for their overall exposure may result in attenuated health effect estimates for FIB concentrations in drinking water and diarrhea. While evaluating these results, we found that this attenuation may be more pronounced in children above the age of 2 years. This suggests that children under the age of 2 years may be experiencing most of their exposure within the confines of their homes, so household-level exposure assessments may be appropriate for this age group. However, for older ambulatory children exposure outside of the home might be more readily considered.

The third component of error may not only emerge from limited precision associated with methods to characterize FIB concentrations^24,25, i.e., from variability in water quality measurements due to sampling and laboratory processing methods, but can also stem from temporal differences in water quality. FIB levels in household water can vary on a weekly, daily, and even hourly basis²⁶. For an outcome such as diarrhea that is normally acute, the biologically relevant household water fecal contamination levels might be representative of the fecal contamination levels during the incubation period of enteric pathogens found in water, which depending on the pathogen can vary from a matter of hours to up to a month²⁷. If water samples are collected on the same day as diarrheal disease data, the measured FIB concentrations on that day may not be representative of the biologically relevant fecal contamination in the lead up to a diarrhea episode. These discrepancies could be due to environmental factors, such as short-term weather changes like extreme rainfall events²⁸, or human factors, such as water treatment behavior change in response to a diarrhea episode^29,30. For chronic outcomes such as child growth, the biologically relevant household water fecal contamination likely needs to consider longer-term fecal contamination exposure, which may not be adequately represented by single or a few repeat measurements of household water fecal contamination, due to short-term and seasonal variability in fecal contamination in drinking water³¹.

Our simulations suggest that long-term household water fecal contamination may not be adequately represented by a single sample and hence can result in attenuations of the health effects of FIB concentrations in drinking water on child linear growth. These results were consistent with our evaluations using empirical growth data. While our previously reported IPD analysis found a significant association between fecal contamination and linear growth, 70% of the sample population only had a single matched water quality measure available to characterize exposure. The findings from this analysis imply that the reported effect sizes may be attenuated, and fecal exposure assessments may consider characterizing fecal contamination using multiple longitudinal samples to estimate more biologically relevant exposure.

The results from our analyses need to be interpreted with caution. First, due to data availability we were limited to applying this framework to two sources of error. There are many more possible sources of error that we were not able to consider here, such as assigning household-level exposures to individuals and the use of FIB as proxies for enteric pathogens. Second, this analysis was limited to quantifying the effects of measurement error on the magnitudes of health effects and not on the precision of those effect estimates. Uncertainty in health effect estimates introduced by exposure measurement error may obscure associations where they exist, thus increasing the likelihood of false-negative findings. Third, the current framework is limited to drinking water, but there are a number of other important fecal–oral transmission pathways, such as hands, food, soil, fomites, and flies. Findings from our IPD analysis suggest that fecal contamination along select pathways is associated with child diarrhea and growth, so a similar framework could be applied to other pathways to test whether those findings may have suffered from regression dilution bias. Fourth, we did not have access to repeated water samples within the shorter timeframe of pathogen incubation periods for acute gastroenteritis, so were not able to quantify measurement error on health effects from FIB concentrations in drinking water on diarrhea from the use of single samples used to estimate household water fecal contamination.

Our results suggest that exposure measurement error can contribute to attenuated fecal exposure–outcome relationships for outcomes that are typically acute, such as diarrhea, as well as for more chronic outcomes such as linear growth. Fecal exposure assessments in drinking water may consider exposure outside of the household as well as attempting to characterize fecal contamination with repeat samples to account for variability in water quality. They may leverage measurement error frameworks to design exposure assessments that are more proximal to the true exposure experienced by individuals, which in turn may inform the design of more effective interventions to reduce waterborne disease burdens.

Methods

Data

We used data from 19 studies conducted in South America, Sub-Saharan Africa, and South and South-East Asia^{29,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49}. We requested permission from data owners for use of these data for this study. Eligible datasets included variables describing FIB concentrations in household drinking water, child age, and intervention status. We included children aged 0–72 months. Datasets also included unique anonymized identifiers for each community, household, and child. For the diarrhea analysis, we defined community water fecal contamination levels for a given household on a specific day as the median household water FIB concentrations of all other households in its community on the same day. We generated a variable for each city or collection of communities in rural areas describing whether water quality data were collected in a wet or dry season month by using the 30-year average monthly precipitation from the WorldClim dataset⁵⁰ and designating wet season months as those with average precipitation >60 mm and dry season months as those with average precipitation <60 mm⁵¹. For the diarrhea analysis, we used household water fecal contamination data in long format by matching single household water FIB concentration observations z_t to child survey data if they were collected on the same day or up to 7 days before the survey was conducted and then generated different scenarios for the aggregate fecal contamination $\bar x_t$ by incorporating the median community water fecal contamination. For the growth analysis, we used household water FIB concentrations data in wide format by matching all available water samples collected over the course of a child’s life up to the day anthropometric measurements were taken.
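The three data-preparation rules described above (leave-one-out community median, wet/dry season designation, and the 7-day matching window) can be sketched as follows. This is a hypothetical Python illustration, not the authors' R code; the function names, the dictionary layout, and the example values are assumptions. The text leaves months with exactly 60 mm of precipitation unassigned, and this sketch arbitrarily treats them as dry.

```python
from datetime import date
from statistics import median


def community_median_excluding(household_id, same_day_samples):
    """Median log10 FIB of all other households in the community on the same day.

    same_day_samples maps household id -> log10 FIB concentration.
    Returns None when no other household was sampled that day.
    """
    others = [v for h, v in same_day_samples.items() if h != household_id]
    return median(others) if others else None


def is_wet_season(mean_monthly_precip_mm, threshold_mm=60.0):
    """Wet season month if the 30-year average precipitation exceeds 60 mm."""
    # Assumption: exactly 60 mm is treated as dry (not specified in the text)
    return mean_monthly_precip_mm > threshold_mm


def matches_survey(sample_date, survey_date, window_days=7):
    """True if the water sample was taken on the survey day or up to 7 days before."""
    lag = (survey_date - sample_date).days
    return 0 <= lag <= window_days


# Hypothetical community with four households sampled on one day
samples = {"h1": 3.0, "h2": 1.0, "h3": 2.0, "h4": 4.0}
community_level_h1 = community_median_excluding("h1", samples)
```

Excluding the index household keeps the community-level value from being mechanically correlated with that household's own measurement.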

Analytical approach

We used a two-tiered analytical approach to examine evidence and magnitude of random exposure measurement error. First, in a simulated analysis we randomly assigned health outcomes (diarrhea cases and HAZ scores) to each observation with an estimated exposure and then regressed those outcomes on the error-prone exposure variables, represented by the measured proxies of exposure. Exposure was assigned based on household water FIB concentration measurements, so the simulations retained existing correlations between communities, households, and individuals. Evidence and magnitude of exposure measurement error was assessed by estimating the attenuation factor associated with the error-prone exposure variable⁵². Second, we evaluated findings from our simulations by using empirical health outcome data from the same datasets and regressing it on both the estimated exposure and error-prone exposure variables. All analyses were conducted in R version 3.6⁵³.

Household versus community exposure

We simulated the effect of exclusively assigning household water fecal contamination $z_t^ \ast$ for individual exposure, if estimated exposure is actually a combination of both household and community water fecal contamination $w_t^ \ast$, by:

1.
Randomly generating diarrhea cases for each included child with a combination of household- and community-level drinking water fecal contamination as the aggregate drinking water fecal contamination $\bar x_t$ experienced by a child, using the Bernoulli distribution, where the log odds of diarrhea η_ijkl for child i in household j in community k and study l determine the probability of diarrhea p_ijkl and the simulated case indicator d_ijkl:
$$\eta_{ijkl} = \mathrm{logit}\left( p_{ijkl} \right) = \beta_0 + \beta_1 \mathrm{FIB}_{ij} + \beta_2 \mathrm{Treat}_{ij} + \beta_3 \mathrm{Age}_{i} + \beta_4 \mathrm{Season}_{ijk} + \mu_{ijkl} + \mu_{ijk}$$
(2)
$$\mu_{ijkl} \sim N\left( 0, 0.6 \right); \quad \mu_{ijk} \sim N\left( 0, 0.3 \right)$$
$$p_{ijkl} = \frac{\mathrm{e}^{\eta_{ijkl}}}{1 + \mathrm{e}^{\eta_{ijkl}}}$$
$$d_{ijkl} \sim \mathrm{Bernoulli}\left( p_{ijkl} \right)$$
2.
Assuming that (1) community-level drinking water fecal contamination is represented by the median household water FIB concentrations in all other community households; (2) the baseline odds of diarrhea for this population, exp(β₀), are 0.15; (3) the odds ratio of diarrhea for 1-log₁₀ higher FIB concentrations in drinking water, exp(β₁), is 1.5; and (4) the odds ratios of diarrhea for children receiving an intervention, exp(β₂), for child age in years, exp(β₃), and for data collected in the wet compared to the dry season, exp(β₄), are 0.9, 0.8, and 1.2, respectively. Effect estimates were broadly based on model outputs from our IPD analyses, although we assumed higher odds of diarrhea for FIB concentrations in drinking water because we hypothesize that the effect estimate for the exposure–outcome relationship in our IPD analysis may have been suffering from regression dilution bias⁵⁴. The model accounted for clustering at the study-level μ_ijkl and community-level μ_ijk.
3.
Fitting multilevel generalized mixed effects models with the assigned diarrheal cases and replacing the combined household and community drinking water FIB concentrations $\bar x_t$ with household-level FIB concentrations $z_t^ \ast$ exclusively as the error-prone exposure variable.
4.
Calculating the attenuation associated with the estimated log odds of diarrhea (β₁*) from assigning household-level FIB concentrations $z_t^ \ast$ exclusively as the error-prone exposure variable, compared to the assigned log odds of diarrhea (β₁ = log(1.5)) if combined household and community drinking water FIB concentrations $\bar x_t$ represent the exposure: $a = \frac{{\beta _1^ \ast }}{{\beta _1}} = \frac{{\beta _1^ \ast }}{{\log \left( {1.5} \right)}}.$
5.
Repeating simulations for a range of exposure scenarios by adding community water fecal contamination $w_t^ \ast$ in 10% increments, starting with 100% household water fecal contamination and ending with 100% community water fecal contamination representing the estimated exposure.
6.
Stratifying the combined analysis: As reported previously, the estimated odds of diarrhea for a 1-log₁₀ increase in FIB concentrations in drinking water were higher in urban compared to rural settings (Goddard et al., under review), so we stratified the simulation by urban versus rural areas to assess whether exclusively assigning household drinking water fecal contamination $z_t^ \ast$ may introduce more error in one setting than in the other.
7.
Conducting sensitivity analyses: We assessed the effects of our assumptions on the simulation findings by repeating the simulations (a) with higher and lower assumed odds of diarrhea for higher FIB concentrations in drinking water and (b) using the highest and lowest community water FIB concentrations instead of the median.
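As a rough illustration of steps 1, 3, 4, and 5 above, the following Python sketch generates Bernoulli diarrhea outcomes from a blended household/community exposure, refits the model with household FIB alone, and reports the ratio of the estimated to the assigned log odds. It is a deliberately simplified stand-in rather than the analysis code used in this study: it omits the treatment, age, and season covariates and both random intercepts, fits an ordinary single-level logistic regression, and draws hypothetical log₁₀ FIB values from normal distributions. The linear blending controlled by `community_share`, the distribution parameters, and all function names are assumptions made for illustration.

```python
import math
import random
import statistics


def inv_logit(x):
    """Map log odds to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def fit_logistic(x, y, iters=15):
    """Fit logit P(y=1) = b0 + b1*x by Newton-Raphson; return (b0, b1)."""
    b0 = b1 = 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = inv_logit(b0 + b1 * xi)
            v = p * (1.0 - p)
            g0 += yi - p            # score for the intercept
            g1 += (yi - p) * xi     # score for the slope
            h00 += v                # information matrix entries
            h01 += v * xi
            h11 += v * xi * xi
        det = h00 * h11 - h01 * h01
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1


def simulate_attenuation(community_share, n_communities=400, n_households=20,
                         beta0=math.log(0.15), beta1=math.log(1.5), seed=1):
    """Return beta1_star / beta1 when household FIB alone is used as exposure."""
    rng = random.Random(seed)
    z_obs, cases = [], []
    for _ in range(n_communities):
        centre = rng.gauss(1.0, 0.7)  # hypothetical community mean log10 FIB
        z = [centre + rng.gauss(0.0, 0.7) for _ in range(n_households)]
        for j, z_j in enumerate(z):
            # Community exposure: median of all other households (assumption 1).
            w_j = statistics.median(z[:j] + z[j + 1:])
            # Assumed linear blend of household and community exposure.
            x_j = (1.0 - community_share) * z_j + community_share * w_j
            p_j = inv_logit(beta0 + beta1 * x_j)
            z_obs.append(z_j)
            cases.append(1 if rng.random() < p_j else 0)  # Bernoulli draw
    _, beta1_star = fit_logistic(z_obs, cases)
    return beta1_star / beta1


if __name__ == "__main__":
    for step in range(11):  # 0% to 100% community share in 10% increments
        share = step / 10
        ratio = simulate_attenuation(share)
        print(f"community share {share:.1f}: attenuation {ratio:.2f}")
```

Under these assumptions, a ratio near 1 would be expected when exposure is entirely household-level, and the ratio would be expected to fall as the community share grows. A closer reproduction of the procedure would replace `fit_logistic` with a multilevel model that includes the covariates and random intercepts of Eq. (2).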

To evaluate findings from the simulations, we applied empirical diarrhea data by:

1.
Beginning with household water fecal contamination $z_t^ \ast$ as the estimated exposure and fitting a multilevel generalized mixed effects model to estimate the odds of diarrhea for 1-log₁₀ higher FIB concentrations in household drinking water.
2.
Replacing household water fecal contamination $z_t^ \ast$ with community water fecal contamination $w_t^ \ast$ in 10% increments and fitting the same regression model with each new exposure assignment.
3.
Calculating the attenuation associated with the log odds of diarrhea (β₁*) from assigning household-level FIB concentrations exclusively as the error-prone exposure variable, compared to effect estimates that combine household/community water fecal contamination (β₁).
4.
Stratifying the analysis: In addition to stratifying by rural versus urban areas, we also stratified by children aged 0–23 and 24–72 months to consider how child mobility may modify the effect of assigning community water quality to exposure. We hypothesized that children aged 0–23 months are mostly non-ambulatory and spend the majority of their time within the confines of their home, and pre-school children aged 24–72 months are ambulatory and spend their time both in their home and within the confines of the community.
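The exposure assignments used in these steps can be sketched in a few lines of Python. The first function follows the stated assumption that community-level contamination is the median FIB concentration of all other households in the community. The second function is only one possible reading of "10% increments": it assumes a linear weighted average of household and community log₁₀ FIB values, which is not spelled out in the text. Function names and the example values are hypothetical.

```python
import statistics


def community_exposure(household_fib):
    """For each household, return the median log10 FIB of all other households."""
    result = []
    for j in range(len(household_fib)):
        others = household_fib[:j] + household_fib[j + 1:]
        result.append(statistics.median(others))
    return result


def blended_exposures(z, w, n_steps=10):
    """Return {community share: exposure} for shares 0.0, 0.1, ..., 1.0.

    Assumes a linear blend of household (z) and community (w) log10 FIB.
    """
    blends = {}
    for step in range(n_steps + 1):
        share = step / n_steps
        blends[share] = (1.0 - share) * z + share * w
    return blends


# Hypothetical log10 FIB concentrations for four households in one community.
fib = [0.0, 1.0, 2.0, 3.0]
community = community_exposure(fib)

# Exposure assignments for the first household, from 100% household
# to 100% community water fecal contamination.
first_household = blended_exposures(fib[0], community[0])
```

Each blended exposure would then be passed to the same regression model, so that the effect estimates can be compared across exposure assignments.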

Single versus multiple samples

We simulated the effect of assigning a single measure of FIB concentrations in drinking water z_t as the error-prone exposure variable by:

1.
Randomly generating expected HAZ scores with the estimated household water fecal contamination $z_t^ \ast$ represented by the median household water FIB concentrations from repeat samples, using the following model where the difference in HAZ scores HAZ_ijkl for child i in household j in community k and study l is given by:
$${\mathrm{HAZ}}_{ijkl} = \beta _0 + \beta _1{\mathrm{FIB}}_{ij} + \beta _2{\mathrm{Treat}}_{ij} + \beta _3{\mathrm{Age}}_i + \mu _{ijkl} + \epsilon _{ijkl}$$
(3)
$$\mu _{ijkl} \sim N\left( {0,0.5} \right);\ \epsilon _{ijkl} \sim N\left( {0,1} \right)$$
2.
Assuming that (1) the mean baseline HAZ score β₀ in this population is −1.6; (2) the difference in HAZ score for 1-log₁₀ higher median FIB concentrations β₁ is −0.2; (3) the differences in HAZ score for children receiving an intervention β₂ and for child age β₃ (years) are 0.1 and −0.05, respectively; and (4) HAZ scores follow a normal distribution. The model accounted for clustering at the study level (μ_ijkl).
3.
Fitting multilevel generalized mixed-effects models with the assigned HAZ scores and replacing the estimated household water fecal contamination $z_t^ \ast$ represented by the median household water FIB concentrations from repeat samples with a randomly chosen single measure of household water FIB concentrations z_t as the error-prone exposure variable.
4.
Calculating the attenuation associated with the estimated difference in HAZ score (β₁*) from randomly choosing a single measure of water quality z_t as the error-prone exposure variable, compared to the assigned difference in HAZ score (β₁ = −0.2), if the estimated exposure is represented by repeat samples of household water fecal contamination $z_t^ \ast$: $\alpha = \frac{{\beta _1^ \ast }}{{\beta _1}} = \frac{{\beta _1^ \ast }}{{ - 0.2}}.$
5.
Repeating the simulations for children with at least two, three, or four matched household water FIB concentration measures making up the median household water fecal contamination $z_t^ \ast$. We did not have sufficient data to conduct these simulations with more than four matched water samples.
6.
Stratifying the analysis: Previous research has found that fecal contamination in drinking water sources in low-income countries is higher in the wet season compared to the dry season⁵⁵, so we stratified these simulations by season to examine whether error introduced from variability in water quality is greater in one season compared to the other.
7.
Conducting sensitivity analyses: We assessed the effects of our assumptions on the simulation findings by repeating the simulations (a) with higher and lower assumed differences in HAZ scores for higher FIB concentrations in drinking water and (b) using the highest and lowest drinking water FIB concentrations from the repeat samples instead of the median.
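Steps 1, 3, 4, and 5 of this simulation can be sketched as follows. This is a minimal Python illustration under simplifying assumptions, not the analysis code used in this study: it drops the treatment and age covariates and the study-level random intercept, fits ordinary least squares instead of a multilevel model, and draws hypothetical repeat log₁₀ FIB measurements from normal distributions. The distribution parameters and function names are assumptions.

```python
import random
import statistics


def ols_slope(x, y):
    """Slope of the least-squares line of y on x."""
    mean_x = statistics.fmean(x)
    mean_y = statistics.fmean(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def simulate_haz_attenuation(n_samples, n_children=20000,
                             beta0=-1.6, beta1=-0.2, seed=1):
    """Return beta1_star / beta1 when one random sample replaces the median."""
    rng = random.Random(seed)
    single_fib, haz = [], []
    for _ in range(n_children):
        household_mean = rng.gauss(1.0, 0.7)  # hypothetical mean log10 FIB
        samples = [household_mean + rng.gauss(0.0, 0.7)
                   for _ in range(n_samples)]
        z_star = statistics.median(samples)   # exposure used to generate HAZ
        z_single = rng.choice(samples)        # error-prone single measure
        single_fib.append(z_single)
        haz.append(beta0 + beta1 * z_star + rng.gauss(0.0, 1.0))
    beta1_star = ols_slope(single_fib, haz)
    return beta1_star / beta1


if __name__ == "__main__":
    for k in (2, 3, 4):  # number of repeat samples making up the median
        ratio = simulate_haz_attenuation(k)
        print(f"{k} repeat samples: attenuation {ratio:.2f}")
```

A ratio below 1 indicates that the single-sample estimate is closer to zero than the assigned difference in HAZ score.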

To evaluate findings from the simulations, we applied empirical linear growth data for a subset of children in our dataset where HAZ scores were available by:

1.
Fitting multilevel generalized mixed-effects models with the median household water fecal contamination $z_t^ \ast$ from repeated measures of household FIB concentrations as the exposure variable. Repeating this for two, three, and four repeat measures.
2.
Fitting the same models after randomly selecting a single measure of household water FIB concentrations z_t as the error-prone exposure variable from the repeat measures.
3.
Calculating the attenuation associated with the estimated difference in HAZ score (β₁*) from randomly selecting a single measure of household water FIB concentrations z_t as the error-prone exposure variable, compared to the effect estimate (β₁) from the median of repeat samples of household water fecal contamination $z_t^ \ast$ as the exposure variable.
4.
Stratifying the analysis: We conducted the same stratification by season as we did for the simulations.
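To make the attenuation ratio concrete, consider a purely hypothetical case in which the single-sample model returns β₁* = −0.15 while the model based on the median of repeat samples returns β₁ = −0.2. These numbers are illustrative only and are not results from this study:

$$\alpha = \frac{{\beta _1^ \ast }}{{\beta _1}} = \frac{{ - 0.15}}{{ - 0.2}} = 0.75$$

In this example the single-sample estimate recovers 75% of the comparison estimate, i.e., it is attenuated toward the null by 25%. A ratio of 1 would indicate no attenuation.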

Data availability

The data that support the findings of this study are available from the corresponding authors of the included studies, but restrictions apply to the availability of these data. Most of the included data are not publicly available; however, data are available from the authors upon reasonable request. The corresponding author of this study does not own any of the included data.

Code availability

The code used for the analyses is available from the corresponding author upon request.

References

Sclar, G. D. et al. Assessing the impact of sanitation on indicators of fecal exposure along principal transmission pathways: a systematic review. Int. J. Hyg. Environ. Health 219, 709–723 (2016).
Leclerc, H., Mossel, D. A. A., Edberg, S. C. & Struijk, C. B. Advances in the bacteriology of the coliform group: their suitability as markers of microbial water safety. Annu. Rev. Microbiol. 55, 201–234 (2001).
Armstrong, B. G. Effect of measurement error on epidemiological studies of environmental and occupational exposures. Occup. Environ. Med. 55, 651–656 (1998).
Sarnat, J. A. et al. Panel discussion review: session 1—exposure assessment and related errors in air pollution epidemiologic studies. J. Expo. Sci. Environ. Epidemiol. 17, S75–S82 (2007).
Gundry, S., Wright, J. & Conroy, R. A systematic review of the health outcomes related to household water quality in developing countries. J. Water Health 2, 1–13 (2004).
Gruber, J. S., Ercumen, A. & Colford, J. M. Coliform bacteria as indicators of diarrheal risk in household drinking water: systematic review and meta-analysis. PLoS ONE 9, e107429 (2014).
Hodge, J. et al. Assessing the association between thermotolerant coliforms in drinking water and diarrhea: an analysis of individual-level data from multiple studies. Environ. Health Perspect. 124, 1560–1567 (2016).
Schmidt, W.-P. et al. Epidemiological methods in diarrhoea studies—an update. Int. J. Epidemiol. 40, 1678–1692 (2011).
Schwartz, J., Sarnat, J. A., Coull, B. A. & Wilson, W. E. Effects of exposure measurement error on particle matter epidemiology: a simulation using data from a panel study in Baltimore, MD. J. Expo. Sci. Environ. Epidemiol. 17, S2–S10 (2007).
Goldman, G. T. et al. Impact of exposure measurement error in air pollution epidemiology: effect of error type in time-series studies. Environ. Health 10, 61 (2011).
Pennington, A. F. et al. Measurement error in mobile source air pollution exposure estimates due to residential mobility during pregnancy. J. Expo. Sci. Environ. Epidemiol. 27, 513–520 (2017).
Perrier, F., Giorgis-Allemand, L., Slama, R. & Philippat, C. Within-subject pooling of biological samples to reduce exposure misclassification in biomarker-based studies. Epidemiology 27, 378–388 (2016).
MacMahon, S. et al. Blood pressure, stroke, and coronary heart disease. Part 1, Prolonged differences in blood pressure: prospective observational studies corrected for the regression dilution bias. Lancet 335, 765–774 (1990).
Zeger, S. L. et al. Exposure measurement error in time-series studies of air pollution: concepts and consequences. Environ. Health Perspect. 108, 419–426 (2000).
Fleisher, J. M. The effects of measurement error on previously reported mathematical relationships between indicator organism density and swimming-associated illness: a quantitative estimate of the resulting bias. Int. J. Epidemiol. 19, 1100–1106 (1990).
Levy, M. C. et al. Spatiotemporal error in rainfall data: consequences for epidemiologic analysis of waterborne diseases. Am. J. Epidemiol. 188, 950–959 (2019).
Teunis, P. F. M., Reese, H. E., Null, C., Yakubu, H. & Moe, C. L. Quantifying contact with the environment: behaviors of young children in Accra, Ghana. Am. J. Trop. Med. Hyg. 94, 920–931 (2016).
Kwong, L. H. et al. Age-related changes to environmental exposure: variation in the frequency that young children place hands and objects in their mouths. J. Expo. Sci. Environ. Epidemiol. https://doi.org/10.1038/s41370-019-0115-8 (2019).
US EPA. Exposure Factors Handbook: 2011 Edition (EPA, 2011).
VanDerslice, J., Popkin, B. & Briscoe, J. Drinking-water quality, sanitation, and breast-feeding: their interactive effects on infant health. Bull. World Health Organ. 72, 589–601 (1994).
Robb, K. et al. Assessment of fecal exposure pathways in low-income urban neighborhoods in Accra, Ghana: rationale, design, methods, and key findings of the SaniPath Study. Am. J. Trop. Med. Hyg. 97, 1020–1032 (2017).
Branco, P. T. B. S., Alvim-Ferraz, M. C. M., Martins, F. G. & Sousa, S. I. V. The microenvironmental modelling approach to assess children’s exposure to air pollution – a review. Environ. Res. 135, 317–332 (2014).
Devakumar, D. et al. Biomass fuel use and the exposure of children to particulate air pollution in southern Nepal. Environ. Int. 66, 79–87 (2014).
Harmel, R. D. et al. Uncertainty in monitoring E. coli concentrations in streams and stormwater runoff. J. Hydrol. 534, 524–533 (2016).
Gronewold, A. D. & Wolpert, R. L. Modeling the relationship between most probable number (MPN) and colony-forming unit (CFU) estimates of fecal coliform concentration. Water Res. 42, 3327–3334 (2008).
Levy, K., Hubbard, A. E., Nelson, K. L. & Eisenberg, J. N. S. Drivers of water quality variability in Northern Coastal Ecuador. Environ. Sci. Technol. 43, 1788–1797 (2009).
European Centre for Disease Prevention and Control. Systematic Review on the Incubation and Infectiousness/Shedding Period of Communicable Diseases in Children (European Centre for Disease Prevention and Control, 2016).
Guzman Herrador, B. R. et al. Analytical studies assessing the association between extreme precipitation or temperature and drinking water-related waterborne infections: a review. Environ. Health 14, 29 (2015).
Luby, S. P. et al. Microbiological contamination of drinking water associated with subsequent child diarrhea. Am. J. Trop. Med. Hyg. 93, 904–911 (2015).
Ercumen, A. et al. Potential sources of bias in the use of Escherichia coli to measure waterborne diarrhoea risk in low-income settings. Trop. Med. Int. Health 22, 2–11 (2016).
Wilkes, G. et al. Seasonal relationships among indicator bacteria, pathogenic bacteria, Cryptosporidium oocysts, Giardia cysts, and hydrological indices for surface waters within an agricultural landscape. Water Res. 43, 2209–2223 (2009).
Arnold, B. F. et al. Causal inference methods to study nonrandomized, preexisting development interventions. Proc. Natl Acad. Sci. USA 107, 22605–22610 (2010).
Benjamin-Chung, J. et al. A randomized controlled trial to measure spillover effects of a combined water, sanitation, and handwashing intervention in rural Bangladesh. Am. J. Epidemiol. 187, 1733–1744 (2018).
Boisson, S. et al. Field assessment of a novel household-based water filtration device: a randomised, placebo-controlled trial in the Democratic Republic of Congo. PLoS ONE 5, 1–10 (2010).
Boisson, S. et al. Effect of household-based drinking water chlorination on diarrhoea among children under five in Orissa, India: a double-blind randomised placebo-controlled trial. PLoS Med. 10, e1001497 (2013).
Brown, J., Sobsey, M. D. & Loomis, D. Local drinking water filters reduce diarrheal disease in Cambodia: a randomized, controlled trial of the ceramic water purifier. Am. J. Trop. Med. Hyg. 79, 394–400 (2008).
Clasen, T., Parra, G. G., Boisson, S. & Collin, S. Household-based ceramic water filters for the prevention of diarrhea: a randomized, controlled trial of a pilot program in Colombia. Am. J. Trop. Med. Hyg. 73, 790–795 (2005).
Clasen, T. et al. Effectiveness of a rural sanitation programme on diarrhoea, soil-transmitted helminth infection, and child malnutrition in Odisha, India: a cluster-randomised trial. Lancet Glob. Health 2, e645–e653 (2014).
Mattioli, M. C. et al. Enteric pathogens in stored drinking water and on caregiver’s hands in Tanzanian households with and without reported cases of child diarrhea. PLoS ONE 9, e84939 (2014).
Ercumen, A. et al. Effects of source- versus household contamination of tubewell water on child diarrhea in rural Bangladesh: a randomized controlled trial. PLoS ONE 10, e0121907 (2015).
Kirby, M. A. et al. Use, microbiological effectiveness and health impact of a household water filter intervention in rural Rwanda—a matched cohort study. Int. J. Hyg. Environ. Health 220, 1020–1029 (2017).
Kirby, M. A. et al. Effects of a large-scale distribution of water filters and natural draft rocket-style cookstoves on diarrhea and acute respiratory infection: a cluster-randomized controlled trial in Western Province, Rwanda. PLoS Med. 16, e1002812 (2019).
Patil, S. R. et al. The effect of India’s total sanitation campaign on defecation behaviors and child health in rural Madhya Pradesh: a cluster randomized controlled trial. PLoS Med. 11, e1001709 (2015).
Peletz, R. et al. Drinking water quality, feeding practices, and diarrhea among children under 2 years of HIV-positive mothers in peri-urban Zambia. Am. J. Trop. Med. Hyg. 85, 318–326 (2011).
Peletz, R. et al. Assessing water filtration and safe storage in households with young children of HIV-positive mothers: a randomized, controlled trial in Zambia. PLoS ONE 7, e46548 (2012).
Pickering, A. J. et al. Fecal indicator bacteria along multiple environmental transmission pathways (water, hands, food, soil, flies) and subsequent child diarrhea in rural Bangladesh. Environ. Sci. Technol. 52, 7928–7936 (2018).
Pickering, A. et al. Can individual and integrated water, sanitation, and handwashing interventions reduce fecal contamination in the household environment? Evidence from the WASH Benefits cluster-randomized trial in rural Kenya. Preprint at bioRxiv https://doi.org/10.1101/731992 (2019).
Reese, H. et al. Assessing longer-term effectiveness of a combined household-level piped water and sanitation intervention on child diarrhoea, acute respiratory infection, soil-transmitted helminth infection and nutritional status: a matched cohort study in rural Odisha. Int. J. Epidemiol. 0, 1–11 (2019).
Sinharoy, S. S. et al. Effect of community health clubs on child diarrhoea in western Rwanda: cluster-randomised controlled trial. Lancet Glob. Health 5, e699–e709 (2017).
Fick, S. E. & Hijmans, R. J. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302–4315 (2017).
Peel, M. C., Finlayson, B. L. & McMahon, T. A. Updated world map of the Köppen-Geiger climate classification. Hydrol. Earth Syst. Sci. 11, 1633–1644 (2007).
Fuller, W. A. Measurement Error Models (John Wiley & Sons, 1987).
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2019).
Hutcheon, J. A., Chiolero, A. & Hanley, J. A. Random measurement error and regression dilution bias. BMJ 340, c2289 (2010).
Kostyla, C., Bain, R., Cronk, R. & Bartram, J. Seasonal variation of fecal contamination in drinking water sources in developing countries: a systematic review. Sci. Total Environ. 514, 333–343 (2015).

Acknowledgements

We thank Amy Pickering, Ayse Ercumen, Joe Brown, Ben Arnold, Jade Benjamin-Chung, Sophie Boisson, Jenna Davis, Angela Harris, Wolf-Peter Schmidt, Steve Luby, Miles Kirby, Sumeet Patil, Rachel Peletz, Heather Reese, and Sheela Sinharoy for providing data for this study.

Author information

Authors and Affiliations

Gangarosa Department of Environmental Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA
Frederick G. B. Goddard, Thomas F. Clasen & Jeremy A. Sarnat
Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA, USA
Howard H. Chang

Contributions

All authors contributed to the conception of this study. J.A.S., F.G.B.G., and H.H.C. developed the analysis plan. F.G.B.G. conducted the systematic search and data compilation for the parent study that identified the data used in this study. F.G.B.G. conducted the data analysis and wrote the first draft of the manuscript. T.F.C. contributed original data to this study. All authors contributed to the final manuscript.

Corresponding author

Correspondence to Frederick G. B. Goddard.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Goddard, F.G.B., Chang, H.H., Clasen, T.F. et al. Exposure measurement error and the characterization of child exposure to fecal contamination in drinking water. npj Clean Water 3, 19 (2020). https://doi.org/10.1038/s41545-020-0063-9

Received: 27 September 2019
Accepted: 06 March 2020
Published: 06 April 2020
DOI: https://doi.org/10.1038/s41545-020-0063-9
