Introduction

In most low-income countries, there is insufficient access to safe drinking water, adequate sanitation and hygiene facilities (WASH henceforth) at home1. This inadequacy particularly affects women and girls2,3. In addition to this being a public health problem, the literature indicates that lack of access to, or inadequate provision of WASH facilities can increase vulnerability to violence against women, VAW henceforth4. However, there is no comprehensive study that quantifies the association between these key household resources (WASH facilities) and VAW. We attempt to fill this gap by analyzing data from a large survey of Indian households to understand the association between WASH and VAW. In doing so, we analyze both IPV (intimate partner violence) and NPV (non-partner violence) and explain the different channels through which a lack of WASH facilities at home can lead to such violence.

While the lack of WASH can affect everyone, women and girls are disproportionately affected. In low-income countries, women and girls are frequently required to walk long distances in search of a water supply for drinking, cooking, laundry, as well as wait until dark to look for a private place to defecate and satisfy their sanitation needs. Stepping out of the house particularly at night exposes them to the risks of NPV in the form of physical and sexual violence5,6. Post-pubescent girls and women have the added difficulty of menstruation, which for a number of days per month increases their daily needs for water and sanitation4. Household water insecurity may increase the vulnerability of women to IPV as a penalty for failure to fulfil household activities dependent on water such as cooking and cleaning7.

The theoretical relationship between economic resources and violence is complex. In particular, the mechanism through which NPV and IPV are affected are different. For both types of violence, the simplest framework is to assume a ‘taste for violence’ which is nonetheless sensitive to factors that make it more costly to inflict violence. By taste for violence, we mean that the potential perpetrator enjoys inflicting violence i.e. it directly enters his utility function. This is as opposed to instrumental violence where the perpetrator does not directly enjoy violence but uses it as a tool to extract other goods or acts from which he gains utility. For example, in Bloch and Rao8, the perpetrator inflicts violence to extract dowry payments. For men who inflict NPV, one may hypothesise that they either have a direct taste for violence or use violence to extract sexual favours. For NPV, the access to WASH facilities lowers the expected gain for the opportunistic perpetrator as fewer women need to leave the house and away from other people. Applying the model by Becker9 that criminals weigh cost and benefits of committing crime, one can predict that the lowered opportunity to commit violence would make it costly for the potential perpetrator, as fewer women need to leave the house for their WASH needs. In line with the Becker framework, this would lower violence. While not the primary aim, improved WASH facilities lead to situational crime prevention i.e. it changes the environment causing a lowered opportunity for crime to occur10,11 thereby minimising expected reward via decreasing opportunities for perpetrators to inflict violence without risk of discovery, which is again consistent with the Becker framework.

For IPV, the relationship between economic resources and violence can be driven by changes in a women’s bargaining power. Access to WASH facilities frees up a woman’s time to pursue economic activities outside the house i.e. it leaves her time to supply wage labour which may increase her bargaining power within the household and also increase household income. More directly, the presence of WASH facilities may lead to a reduction in cognitive load and time pressure in their day, which may in turn reduce the opportunity for intra-household tensions that have the potential to lead to violent outcomes. The direction of causality could go in either direction, in that:

  1. A)

    Better off household may experience fewer stressors than poorer households and experience reduced IPV, such households also have access to improved WASH;

  2. B)

    A household that has easy accesses to WASH may experience reduced economic stress and decreased IPV.

Unfortunately, we cannot observe the direction of causality but both (1) and (2) are consistent with the observed associations.

A few studies have examined the link between resource availability and VAW. Cools and Kotsadam12 show that resource inequality (based on wealth, education and employment) is associated with higher intimate partner violence in Sub-Saharan Africa. Guimbeau et al.13 find that proximity to resources such as mineral deposits is associated with reduced acceptance of physical violence by women in India. Coming to household resources, there is very little work analyzing the relationship between toilet access and VAW. Gonsalves et al.14 quantify the association between toilet construction and reduced sexual violence in an urban township in South Africa using a mathematical simulation approach. There are also some qualitative studies on the link between lack of toilet facilities in households and perception of violence faced by women in India15,16. A few papers provide econometric analyses of the association between toilet availability and NPV6,17,18 but there is no existing evidence for IPV. Chaplin’s19 survey of the literature finds that the linkage between VAW and sanitation is poorly researched and documented. When it comes to the role of other key resources such as water, the literature is even more scant. The only study for the association between water access and VAW is a study on Nepal relating to IPV7.

Our study makes three distinct contributions to the literature. This is the first comprehensive study to analyze the association between the key household resources (water and sanitation) and both NPV and IPV. Second, our study uses data from a nationally representative survey which has greater potential for generalizability than local data-sets analyzed in some of the existing studies. We use household data for India obtained from the latest (fourth) round of the National Family Health Survey (NFHS-4 henceforth) conducted in 2015–16. Third, this is the only study to employ logistic regression and the inverse-probability-weighted regression adjustment to control for selection bias as well as examine the sensitivity of the results to selection on unobservables.

The NFHS is a large-scale, multi-round survey conducted on a representative sample of households throughout India. It is a nationally important source of data on population, health and nutrition indicators for each state and union territory (UT) and has been widely used in studies related to both IPV and NPV6,17,18,20,21,22,23,24,25. NFHS surveys are performed under the supervision of the Ministry of Health and Family Welfare (MoHFW), Government of India and the data collection is coordinated by the International Institute for Population Sciences (IIPS), Mumbai. Using data from NFHS-426 provides us with a large sample to draw inferences from. The survey covers approximately 601,509 households from all 640 districts (across 29 states and 7 UTs) of India. Among those women who participated in the survey questions on IPV and NPV, we filter out the missing values to arrive at a final sample of 59,093 women for IPV and 76,580 for NPV. Second, NFHS-4 is distinct from the previous three rounds (conducted in 1992–93, 1998–99, 2005–06) as it provides district level estimates for the first time pertaining to a number of important socio-economic indicators.

We use logistic regression method to study the association between WASH and VAW, based on NFHS-4 data. However, we face the challenge of drawing causal inferences from a cross-sectional dataset. NFHS does not survey the same individuals across waves and therefore does not provide longitudinal information. While this is a limitation of the dataset, there is no other available dataset in India that has the kind of detailed information that NFHS provide. Moreover, being an observational dataset, the treatment variable representing individuals’ access to household WASH facilities is not random and may have possible relationships with both their observable and unobservable characteristics, causing selection bias. In the absence of appropriate instrumental variables, to mitigate a potential endogeneity problem, this study estimates treatment effects by applying a methodology that can control for the observed heterogeneity: inverse-probability-weighted regression adjustment (IPWRA henceforth) which can control for observed differences across the treatment and control groups. Individuals having access to household resources are considered to be the treatment group while non-users represent the counterfactual group or control group27. Robustness analysis is conducted using a standard matching method viz. Propensity Score Matching, PSM henceforth. Our review of the literature suggests that studies investigating the relationship between lack of sanitation resources in households and associated VAW have not used treatment effects estimation approaches. Thus, to the best of our knowledge, our study is the first to use such approaches (based on IPWRA) to mitigate selection bias in analyzing the link between lack of access to key household resources and VAW. However, due to the use of observational data, we are unable to fully control for the role of unobservables and hence we do not claim causal effects in our results.

The results of our logistic regression analysis show that improved access to drinking water has a statistically significant association with lower NPV experienced by women. The treatment effect obtained from the IPWRA method supports this result. We find using the IPWRA method that having access to drinking water reduces NPV by an average of 0.005. This reduction amounts to significant numbers of women being saved from NPV. For instance, the reduction in NPV amounts to a 10% decline or 371 women in our sample who would potentially experience lower NPV if they had access to drinking water. Projecting for the country, we are able to estimate that providing drinking water access can reduce NPV for 1.7 million women in India.

With respect to IPV, the logistic regression results show that improved access to toilets has a statistically significant association with reduction in IPV experienced by women. The IPWRA analysis produces similar results and suggests that access to toilets can reduce IPV for 1682 women in our sample and potentially 9.7 million in India. Thus, our results suggest that policy initiatives targeted at WASH and related behavioral change play a role in improving households’ welfare through the associated decrease in VAW. A challenge in this analysis is that IPRWA controls only for the role of observable factors. However, our results are robust to the presence of unobservable characteristics as found by sensitivity analysis conducted using the bounding approach28.

Methods

For our empirical analysis, we use data from NFHS-4, 2015–2016. The Household Questionnaire lists all members who are usual residents of the household as well as visitors who have stayed the night before the interview. Basic demographic information on age, sex, marital status and schooling, pertaining to each person is collected. Information is also collected on characteristics of the dwelling unit such as source of drinking water, time to get to water source and the type of toilet facilities available. The information on age and sex of household members based on the household questionnaire is then used to further identify women who are eligible for individual interviews using the separate women’s questionnaire. Information on various background characteristics of women such as demographics, socio economic status, empowerment indicators and husband/ partner’s background are then collated through the women’s questionnaire26.

One woman (between the ages 15–49) per household is randomly selected in compliance with WHO guidelines on the ethical collection of such data in order to assess exposure to violence. To ensure that the violence subsample is nationally representative, special weights are then used to account for the random selection of only one woman per household. For the measurement of NPV, married and unmarried women are asked about their experience of physical as well as sexual violence committed by anyone, other than a current or most recent husband, in the last year. Additionally, information from currently married women about the violence committed by the current husband and from formerly married women about their most recent husband is collected to determine exposure to emotional, physical and sexual IPV26.

Our sample for assessing the relationship between WASH facilities and NPV consists of 76,580 currently, formerly and never married women and for our parallel analysis pertaining to IPV, we have a dataset of 59,093 currently and formerly married women. In both samples, only those women who are usual residents of their households have been considered. For IPV only currently and formerly married women are considered while for NPV, along with ever married women, never married women are also included. The percentage shares of women who experienced IPV and NPV, out of the women surveyed in each of the 29 states and 7 UTs of India are highlighted in Table 1. We observe that IPV is more prevalent than NPV everywhere and is the most reported (as a share of the women surveyed) in Bihar, Tamil Nadu and Manipur. NPV is the highest (as a share of the women surveyed) in Tamil Nadu, Telangana and Puducherry.

Table 1 Composition of Violence Against Women in States/UTs (in percent).

We begin our analysis by estimating a logistic regression where the dependent variable is the experience of violence by the respondent and the independent variable is access to water or toilets along with a host of socio-economic and socio-demographic variables. Next, we consider treatment effects analysis where access to WASH resources in a household represents a treatment wherein individuals using the facilities form the treatment group while non-users, i.e., those without access constitute the untreated group (counterfactual or control group). However, such assignment of the treatment is non-random which can lead to a potential selection bias in estimation of the treatment effects29,30. This is because the reasons for having access to WASH facilities can be based on observable household features of women as well as other unobservable characteristics, thus making the choice of usage endogenous. In order to mitigate this problem, our study employs treatment effects estimation using the IPWRA method27. The premise behind this method is to imitate randomization regarding the assignment of the treatment as is done in randomized controlled trials (RCTs)31. Linnemayr and Alderman32 point out that the external validity of RCTs is limited and recommend the use of matching estimators such as propensity score matching method to overcome the problems associated with RCTs.

Our objective is to measure the treatment effect (i.e., presence of WASH resources) on VAW. This is captured by the average treatment effect on the treated (ATET) computed as follows33:

$$ATET=E\left[{Y}_{i1}|{D}_{i}=1\right]-E\left[{Y}_{i0}|{D}_{i}=1\right],$$

where E[.] is the expectation operator, Yi1 is the potential outcome for the units that receive treatment (D = 1), Yi0 is the potential outcome for the units that do not receive treatment. The problem is that we do not observe the outcome of the treated units had they not received the treatment, i.e., E[Yi0|Di = 1] but replacing these unobserved counterfactuals with the outcomes of the untreated i.e. E[Yi0|Di = 0] may result in biased estimates33. Wooldridge34 suggests IPWRA as the way out where two models are estimated: the first model to predict treatment status (which gives us propensity scores) and the second model to predict outcomes (which uses propensity scores to calculate weights for the regression adjustment model). This means that only one model must be correctly specified for the regression coefficients to provide consistent average treatment effects. Thus, this procedure has been referred to as “doubly robust” in the sense that if one model is mis-specified, the other should still hold34,35.

In the first step of the IPWRA method, we estimate the treatment model using logistic regression with treatment status as the dependent variable and suitable covariates as explanatory variables. The predicted probabilities are known as propensity scores which can be expressed as: p(x) = Prob(D = 1|x) where x is the set of relevant pretreatment covariates. The second step is to fit weighted regression models of the outcomes for each treatment level and obtain the treatment-specific predicted outcomes, once again using logistic regression models. Each ‘treated’ person receives a weight equal to the inverse of the propensity score, and each ‘untreated’ person receives a weight equal to the inverse of one minus the propensity score. Finally, we compute the potential means of the treatment-specific predicted outcomes to obtain the average treatment effect on the treated (ATET). The IPWRA estimator is expressed as33:

$$ATET \left(IPWRA\right)={n}_{T}^{-1}{\sum }_{i=1}^{n}{D}_{i}\left[{r}_{T}^{*}\left(x,{\delta }_{T}^{*}\right)-{r}_{UT}^{*}\left(x,{\delta }_{UT}^{*}\right)\right],$$

where \({n}_{T}\) is the number of treated units out of the entire sample size of n,\({r}_{T}^{*}\left(x,{\delta }_{T}^{*}\right)\) is the weighted regression model for treated (T) units with the inverse of \(\widehat{p}\left(x\right)\) as the weight and \({r}_{UT}^{*}\left(x,{\delta }_{UT}^{*}\right)\) is the weighted regression model for untreated (UT) units with \(1/(1-\widehat{p}\left(x\right))\) as the weight.

The use of the IPWRA method relies on two assumptions. The first is the conditional independence assumption (CIA) or unconfoundedness which means no unobservable variable affects both the likelihood of treatment as well as the outcome of interest after conditioning on covariates. We try to reduce this problem of selection on unobservables by following the Rosenbaum bounds approach28. The second assumption is the common support or overlap assumption which suggests that every observation comes with a positive probability of being both treated and controlled. We assess the overlap assumption by balancing on covariates. A covariate is said to be balanced when its distribution does not differ over treatment thresholds. We compute standardized differences which take into account both means and variances36,37. A perfectly balanced covariate has a standardized mean difference of zero and variance ratio of one38.

As a robustness check of our IPWRA results we also used the propensity score matching (PSM henceforth) method which depends on matching the individuals on their propensity scores and then comparing the outcomes to arrive at the ATET. Although we use PSM as a robustness check, we note that IPWRA has at least three advantages over PSM. The first one is the property of double robustness which makes it less prone to misspecification issues. The second advantage of IPWRA is the inclusion of controls for the observation’s baseline characteristics in the outcome model. Both IPWRA and PSM must satisfy the conditional independence assumption, which states that no unobservable variable affects both the likelihood of treatment and the outcome of interest after conditioning on covariates. Since IPWRA includes more covariates in the outcome model than PSM, which includes only the covariates in the treatment model, this assumption is more likely to hold with IPWRA than with PSM. The third improvement is that, unlike PSM, which compares each treatment observation to control observations that have a similar likelihood of being treated in a restrictive way, IPWRA implicitly compares every unit to every other unit while placing higher weights on observations that have a similar likelihood of being treated and lower weights on observations that are dissimilar27.

Now we discuss the choice of the variables in our analysis starting with the explanatory variables in the main logistic regression which include a treatment variable (drinking water or toilet facilities) along with the socio-economic and socio-demographic variables. Each of the treatment variables (drinking water and toilet facilities) is captured dichotomously where presence of the resource in the household is considered as the treatment or \({D}_{i}\)=1 and absence as \({D}_{i}\)=0. In line with Howard et al.39, we define drinking water variable as yes (or equals 1) if a household reports that it has water available on premises. If the household reports time taken for water collection (going and returning in minutes), we define it as no (or equals 0). Following Jadhav et al.6, we define toilet facility variable as yes (or equals 1) if a household reports that it has a facility available (flush, pit latrine), if no facility/bush/field, the variable is defined as no (or equals 0). Following the literature (see for instance Jadhav et al.6), the explanatory variables include place of residence (urban, rural), whether the dwelling has electricity (yes, no), education, ethnicity, religion and and region of residence (Northeast India, East India, North India, Central India, West India and lastly South India which is used as the reference category). A list of all potential covariates is provided in Table 2 even though the final choice of variables depended on criteria such as covariate balancing in the IPWRA (which we discuss below) and we included the same variables, along with the treatment variables, as the determinants of VAW in the logistic regression analysis. Table 3 shows a break-up of the sample across different individual and household characteristics.

Table 2 List of Variables and their Categories used in the Study.
Table 3 Description of the Sample.

For the dependent variables in the logistic regression and in the outcome models of the IPWRA, we consider IPV and NPV which are modeled dichotomously such that the presence of any type of IPV (physical, sexual or emotional) = 1, absence = 0 and any type of NPV (physical or sexual) = 1, absence = 0. Following the literature6,20,40, the common regressors for the outcome models which are expected to be risk factors for the experience of both IPV and NPV include the woman’s age (15–49 years), marital status, ethnicity (scheduled caste, scheduled tribe, other backward classes), education (0–20 years) and religion (Christian, Hindu, Muslim, Sikh, others). For IPV, in addition to the above, the following regressors are included in the outcome model as risk factors, viz. number of unions (once, more than once), employment status of the woman (working, not working), woman has control over how to spend her own money (yes, no), whether the woman is afraid of husband/partner i.e. psychological control (yes, no), woman accepts IPV (yes, no), marital control exercised by husband/partner (yes, no) and whether the woman’s father beat her mother, i.e. intergenerational IPV (yes, no), husband/partner’s employment status (working, not working), husband/partner’s education (0–20 years) and husband/partner drinks alcohol (yes, no). These variables are explained in details in Table 2. The treatment model follows the same specification as mentioned earlier.

Results

Logistic regression model results for the relationship between household resources and VAW

We first report the results of the logit regression analysis to estimate the relationship between VAW and WASH. Table 4 presents the logistic regression results for the cases of IPV. The results in Panel A show that, improved access to drinking water does not have a statistically significant association with IPV. Among the control variables, the woman’s characteristics (such as age, marital status, education, work status etc.), husband’s characteristics (such as education, alcohol), religion, ethnicity and locational dummies seem to be significant determinants of IPV. The results for toilet facilities (Panel B of Table 4) show that toilet access has a negative and statistically significant coefficient, even after controlling for a host of control variables. In other words, access to toilet is associated with a reduction in IPV experienced by women. The odds ratio suggests that provision of toilet facility is associated with lower odds of experiencing IPV by 0.894 times. With respect to NPV, the logistic regression results (shown in Table 5) suggest that in the case of drinking water (see Panel A), water access has a negative and statistically significant association with NPV. Therefore, access to water within the house appears to reduce the NPV experienced by women. The odds ratio implies that water access can reduce the odds of experiencing NPV by 0.925 times. We also observe that control variables such as the woman’s age, education, ethnicity, marital status and region are significant determinants of NPV. Finally, in the case of toilet facilities (see Panel B of Table 5), we find that though improved access to toilets does not have a statistically significant association with reduction in NPV at 5% level of significance but the relationship is significant at the 10% level. This result is similar to the finding of Srinivasan17. The odds ratio can be interpreted to mean that toilet access reduces the odds of a woman experiencing NPV by 0.908 times.

Table 4 Logistic regression Estimation for the Association between IPV and WASH.
Table 5 Logistic regression Estimation for the Association between NPV and WASH.

IPWRA: treatment and outcome model results

Next, we move to the IPWRA analysis starting with the treatment models that are necessary for estimating the propensity scores for each of the treatment variables. Table 6 shows the logistic regression results for the two treatment variables viz. drinking water and toilet facility pertaining to the IPV sample. The results show that women with more education and having electricity supply in their houses have greater access to both resources. With respect to region of residence, religion and ethnicity, there are significant differences across various regions, religions and ethnicities in terms of access to the resources. Further, women belonging to rural areas have lower access to all three resources. The results are similar for the NPV sample as illustrated in Table 7.

Table 6 Treatment Models Estimated from Logit Regression – IPV (N = 59,093).
Table 7 Treatment Models Estimated from Logit Regression – NPV (N = 76,580).

Tables 8, 9, 10 and 11 present the outcome model estimates for both categories of violence. While the results are mixed, our broad findings are that the following variables have significant association with the woman’s experience of violence: her age, ethnicity, education, marital and work status, husband’s education , intergenerational IPV (in the case of IPV), control over how to spend money, whether husband drinks alcohol, number of unions, religion and empowerment (measured by whether the woman is afraid of her husband, whether she justifies violence, whether marital control is exercised by husband).

Table 8 IPWRA outcome model logit regression for ‘untreated’ sample- IPV.
Table 9 IPWRA outcome model logit regression for ‘treated’ sample-IPV.
Table 10 IPWRA outcome model logistic regression for ‘untreated’ sample-NPV.
Table 11 IPWRA outcome model logistic regression for ‘treated’ sample-NPV.

Results of balance checks post treatment effects estimation are shown in Tables 12 and 13 respectively. They illustrate that although we find substantial differences on many unweighted covariates between treatment and control groups in the raw data, once we use matching and weighting techniques to balance the treatment and comparison groups, we obtain good balance on all covariates—all standardized differences are close to 0 and nearly all variance ratios are close to 1. Figure A1 in the Appendix shows that the propensity score is balanced across treatment and comparison groups as the range of common support shows that there is overlap of the distributions of propensity scores in the treatment and comparison groups. We find that the matched sample on the right-hand side in every case is in the form of one line, which is encouraging as this indicates that there are no large deviations. After matching/ weighting is applied, the common support is good, which leads us to infer that both groups are similar on average41.

Table 12 Balance Checks- IPV (N = 59,093).
Table 13 Balance checks NPV (N = 76,580).

IPWRA and PSM results for the relationship between household resources and IPV

Table 14 presents the treatment effects results for the association between access to WASH resources and VAW using IPWRA and PSM methods. We begin the discussion of our results with reference to the reduction in IPV achieved by each household resource starting with toilet availability. The estimates of the respective ATETs from the IPWRA analysis suggest that, having access to toilets is associated with reduced exposure to IPV by an average of 0.026 (and the ATET is statistically significant at the 5% level). Using the PSM method, the reduction turns out to be 0.044 (and statistically significant at the 1% level). To arrive at the estimated number of women who experience lower violence, we apply these percentage point reductions from IPWRA to the potential outcome means (POM) shown in Table 15 (the POMs are the mean outcomes of the untreated individuals). For instance, 22.9% of women without access to toilets experience IPV (as per Table 15) and when women have access to toilets this figure goes down by 0.026 to 20.3%. Based on the proportion of women who experience IPV, this is a reduction of 11%. For our sample of 59,093 women who answered the survey question on IPV out of whom 14,818 said they experienced IPV, this translates into 1682 women who could be saved from violence from their intimate partner if they are provided access to drinking water.

Table 14 Average Treatment Effects on the Treated (IPV N = 59,093 and NPV N = 76,580).
Table 15 Potential Outcome Means (IPV N = 59,093 and NPV N = 76,580).

Extrapolating for the country (with around 341 million women in the 15–59 age group in 2016, as per data from https://www.populationpyramid.net/india/2016/), we can project the benefits of providing drinking water access as resulting in reduced IPV for 9.7 million women in India. However, considering the standard error in our point estimates of ATET and POM, the reduction in IPV could range from 2 to 16 million women in the country. We add here the caveat that such extrapolations may not necessarily hold for the entire population, but we nevertheless present them to give an idea of the scale of the potential benefits at a national level. There are certain constraining factors that may limit the effectiveness of WASH facilities, such as high cost of operations and capital maintenance42,43.

Coming to availability of water, we observe from Table 14 that, based on the IPWRA analysis, the ATET is 0.006 but not statistically significant. According to this method, water access is not associated with a reduction in IPV. However, as per the PSM results, IPV reduces on an average by 0.035 and the ATET is statistically significant at 1%.

Therefore, availability of toilet facility in a woman’s house has a significant association with reduced violence exercised by their husbands. Some studies have argued that gender roles may not change when key household resources are accessible, e.g. Clancy et al.44 state, “Access to modern energy appears to enable women to fulfill their traditional roles (to their satisfaction and wellbeing) rather than bringing significant transformation in gender roles”. However, it has also been argued that if women spend their time savings from access to resources on increasing their income, they may increase their bargaining power within the family45.

IPWRA and PSM results for the relationship between household resources and NPV

Next, with respect to NPV, we find that according to the IPWRA estimates, access to drinking water and toilet facilities are associated with lower NPV by 0.005 and 0.004 respectively (though not statistically significant for toilet access). The corresponding figures from the PSM method are 0.005 and 0.010 respectively. It implies that the lesser the need to step out of the house to access WASH resources, the lower is the exposure to physical violence from non-partners. In Table 15, we observe from the POM estimates that the percentage of women without access to drinking water experiencing NPV is 5%. Applying the estimated ATETs from the IPWRA analysis, we see that access to drinking water can reduce NPV for 371 women in our sample and potentially 1.7 million women in India (which could vary between 0.4 million to 2.8 million women in view of the standard error in the point estimates of ATET and POM).

Finally, we evaluate the robustness of our results to the conditional independence assumption underlying our estimation methods. A concern with both IPWRA and PSM methods is that they do not control for the presence of unobserved covariates that can be correlated with both the treatment and the outcome variables. For example, communities which are more concerned about women’s safety may also have invested more in construction of indoor toilets. The presence of such unobserved factors can bias our estimates of the average treatment effects. If the unobservable characteristics affect the treatment (household resources) and outcome (VAW) variables simultaneously, a ‘hidden bias’ might arise, affecting the robustness of the IPWRA and PSM results. To find out how strongly hidden biases may influence our results, we employ sensitivity analysis following the boundness approach of Rosenbaum31. Let Γ be the ratio of the odds of receiving treatment for two matched individuals i and j with different unobserved characteristics. Following Rosenbaum28, we can write:

$$\frac{1}{\Gamma }\le \frac{{P}_{i}/\left(1-{P}_{i}\right)}{{P}_{j}/\left(1-{P}_{j}\right)}\le\Gamma$$

where, Pi and Pj are the true treatment probabilities that depends on both the observables and the unobservables. Then we can vary the values of Γ starting from 1 and test whether we have overestimated the true treatment effect i.e., whether the estimated treatment effect remains significant across values of Γ46. Since our outcome variable is binary, we compute the Mantel–Haenszel test statistic as suggested by Becker and Caliendo47 and search for evidence of overestimation of the treatment effects due to the presence of unobservables (see Tables 16 and 17). We find that the assumption of overestimation gets rejected even up to a Γ of 5 which means that, in order to invalidate our results, the unmeasured factor would have to increase the odds of receiving treatment by 5 times compared to an individual without these characteristics. Therefore, we conclude that our IPWRA and PSM results are robust to unobserved confounders.

Table 16 Mantel–Haenszel bounds sensitivity analysis for IPV (N = 59,093).
Table 17 Mantel–Haenszel bounds sensitivity analysis for NPV (N = 76,580).

In conclusion, our findings imply that policies and programs aimed at addressing VAW need to recognize the importance of providing key household resources to protect vulnerable women. While WASH facilities are usually provided as part of anti-poverty programs, these resources can have the added benefit of bringing down violence faced by the women in the target households, thereby potentially causing another type of welfare enhancement by improving the well-being of the beneficiaries.

Thus, the findings from our analysis seem to suggest that the Indian government’s recent schemes of building more toilets (Swachh Bharat or Clean India Mission) may produce the additional benefit of reduced violence experienced by vulnerable women. The Indian government has also embarked on a scheme of providing piped water at every rural home within 2024. Our results indicate that such interventions to bring water access to rural households will also contribute to the reduction of violence faced by rural women. Thus, we advocate moving beyond a silo approach in public service delivery and developing citizen centric programmes (instead of isolated interventions), by analysing additional factors such as attitudinal change, cost of provision and feasibility of schemes. There is a clear need for designing more multi-sectoral programming and cross-ministerial coordination. One increasingly popular mechanism for developing cross-sectoral linkages is a ‘one-stop-shop’ or ‘single-window- service’ model, where target beneficiaries of one government service or program receive information, assistance with applications, assessments for and/or direct referrals to other government services or programs48. Broadly speaking, there is a need for development policies to be gender sensitive rather than gender blind49.