Introduction

Chronic lymphocytic leukemia (CLL) is a hematologic neoplasm of memory B cells characterized by the clonal proliferation and accumulation of mature and typically CD5-positive B cells within the blood, bone marrow, lymph nodes, and/or spleen [1]. It is the most common type of leukemia in developed countries, with a current age-adjusted incidence of ~5 per 100 000 population [2]. CLL is more common in males than in females, in the elderly, with a mean age of diagnosis around 73 years, and, unlike other types of leukemia, CLL appears to be unique to persons of predominately European descent [3].

While enormous progress has been made in the management of CLL in the past decade to improve clinical outcomes, it is still an incurable cancer whose etiology is poorly understood. Family history of non-Hodgkin lymphoma or lymphoproliferative disorder, hepatitis C virus seropositivity, respiratory tract infections, history of atopic health disorders, reduced sun exposure, living on a farm or exposure to pesticides, and occupation as hairdresser are factors linked to CLL development [4, 5]. The elevated risk observed for occupational exposures of farming and hairdressers suggest that carcinogenic agents are likely involved in the pathogenesis of this blood disease. Thus, carcinogens contained both in drinking water and in recreational pools around the world such as some disinfection by-products (DBPs) and nitrate, are potential candidates.

DBPs constitute a complex mixture of chemicals formed as unintended by-products of the disinfectants applied to both drinking and recreational water to inactivate microbial pathogens [6]. Chlorine is a cost-effective disinfectant widely used worldwide and trihalomethanes are the most prevalent DBPs formed after chlorination. However, although disinfection is essential to avoid waterborne disease, it raises a public health issue: the potential carcinogenicity of DBPs [7]. Moreover, DBP concentrations in swimming pool water often exceed those found in drinking water. The primary routes of DBP uptake during swimming are dermal absorption and inhalation, that result in higher and more persistent levels of DBPs in the bloodstream compared to oral exposure, owing to their direct access to the bloodstream [8].

Nitrate occurrence in the water cycle is steadily increasing on a global scale, primarily due to the widespread use of nitrogen fertilizers and the intensification of agricultural practices [9]. Human exposure to nitrate mainly occurs through ingestion of food and drinking water [10]. When ingested, nitrate is reduced to nitrate, which can interact with endogenous molecules to form N-nitroso compounds such as nitrosamines and nitrosamides. Ingested nitrate leads to endogenous nitrosation to form nitrogenous compounds such as nitrosamines and nitrosamides, which are classified as probably carcinogenic to humans [11, 12]. Within this context and to provide more insight about the health impact of prevalent water contaminants, we explored the link between long-term exposures to THMs and nitrate in drinking water, as well as, the recreational use of swimming pools and CLL development.

Methods

Study design and population

CLL cases were recruited within the multicentric multicase-control (MCC) study in Spain (http://www.mccspain.org) in collaboration with the International Cancer Genome Consortium on Chronic Lymphocytic Leukemia Project (ICGC-CLL, www.cllgenome.es and www.icgc.org). Briefly, cases and controls were recruited between January 2010 and January 2013. CLL cases were identified in 11 hospitals from different Spanish regions, together with a set of frequency-matched controls by sex, age, and region. Controls were selected from the general population, identified from the lists of randomly selected family practitioners in primary health centers located within the catchment area of the hospitals recruiting CLL [13]. Response rates were 87% for cases and 53% for controls. Inclusion criteria required participants to be 20–85 years old, be able to understand and answer the questionnaire, and have lived for at least 6 months in the study area.

Data collection

At recruitment, participants answered a structured computer-assisted questionnaire administered by trained personnel in a face-to-face interview to gather information on anthropometrics (self-reported), socio-demographics, lifestyle factors, and personal and family medical history. Cases and controls provided full address, year start and stop living in all the residences where they lived for at least 12 months since age 18 until the time of the interview, and the type of water consumed in each residence (municipal, bottled, well, other). Amount (glasses/day) of water ingested on average lifetime at home, work and other places was ascertained. Information related to swimming pool attendance included ever attendance (defined as more than 10 times in the lifetime), year (or age) when swimming pool attendance started and ended, average attendance frequency in summer and the rest of the year separately, average time in the water, and type of swimming pool (indoor, outdoor, both). Dietary habits the year before the interview were collected through a self-administered semiquantitative food frequency questionnaire, including a total of 140 food items, previously validated in Spain [14]. Questionnaires used are available online (http://mccspain.org). A final section evaluating the reliability of the interview was completed by the interviewer.

Outcome definition

CLL cases were diagnosed according to the criteria of the International Workshop on Chronic Lymphocytic Leukemia [15]. CLL and small lymphocytic lymphoma are considered the same underlying disease. All diagnoses were morphologically and immunologically confirmed using flow cytometry immunophenotype and complete blood count. In the vast majority of CLL cases, we could differentiate between rai stage 0 or I–IV tumors. Given the generally indolent course of CLL, with most of the patients having a slow progression disease and with no treatment need, CLL cases were invited to participate independently of their date of diagnosis. Patients who received a CLL diagnosis within three years prior to the start of the study (January 2010) were eligible for inclusion. Incident cases were defined as patients diagnosed at during the study recruitment period (January 2010 – January 2013), while prevalent cases were defined as participants diagnosed ≤ 3 years prior to the start of the study (January 2010).

THM and nitrate levels in municipal drinking water

We designed a structured questionnaire aimed at water utilities, local and/or health authorities to collect drinking water source (surface/ground water proportion) and treatment in the study areas back to 1940, as well as available THMs and nitrate measurements. We targeted data collection among study municipalities that contributed up to 80% of person-years. In addition, centralized routine monitoring data on THMs (chloroform, bromodichloromethane, dibromochloromethane and bromoform) and nitrate was provided by the SINAC (Spanish acronym of the National Information System on Water for Consumption) for the years 2004–2010. This data corresponded to samples collected at drinking water treatment plants and the distribution network, and included information at the water zone level fed by water supply operators from public or private companies or municipalities, and public or private laboratories. The water zone, that mostly corresponds to municipality, was defined as a geographical area supplied by water with a homogeneous source and treatment, and whose quality in the water distributed in the networks can be considered homogeneous. We linked each postal code from the residence to the corresponding water zone. The distribution of the sampling points and the sampling frequency varied greatly depending on the population served, extension of the water zone and the year, and could be more than once a day (e.g., Madrid), up to once every three months, or once a year in less populated areas. Measurements below the analytical limit of quantification (QL) were substituted with half the QL (QL/2) [16]. If the QL was missing, we imputed half of the most frequently reported QL.

THM and nitrate levels in non-municipal drinking water

We measured nitrate in the 9 most-consumed bottled water brands in Spain using UV spectrophotometry, with 0.5/0.1 mg/L detection/quantification limit. Nitrate concentrations were in the range of 2.3–15.6 mg/l [17]. THMs were previously measured in 15 popular bottled water brands in Spain, through purge-and-trap and gas chromatography. Mean concentrations for chloroform and brominated THMs were ≤0.1 μg/L [18]; and limits of detection were 0.015 (chloroform), 0.004 (bromodichloromethane), 0.005 (dibromochloromethane) and 0.011 μg/L (bromoform). We used THM data from 56 measurements in different Spanish areas that were supplied by chlorinated ground water (e.g. wells). Average concentrations were 0.3, 0.3, 0.8, and 1.8 μg/L for chloroform, bromodichloromethane, dibromochloromethane, and bromoform, respectively. Nitrate data in private wells was not available.

Estimation of long-term levels in municipal drinking water

We calculated the annual average level of nitrate and THMs at the water zone level. Years without measurements were assigned the average of all available measurements in the water zone if the water source and treatment did not change over the years. In the case of changes in the water source and/or treatment, procedures to back extrapolate were applied.

For THMs, since their concentrations in surface water are generally higher than in ground sources [19], we used surface water percentage as a weight to back extrapolate individual THM concentrations when water source changed through linear interpolation, assuming that concentrations increased proportionately to the percentage of surface water. Likewise, water zones with changes in treatment over the years and THM measurements were used to estimate the change percentage of THM concentrations after introducing such treatments. These percentages were applied as a weight to back-extrapolate THM concentrations in areas with changes in these specific treatments when measurements were unavailable. Before chlorination started, THMs concentrations were assumed to be zero. Total THMs (TTHM) levels were calculated by adding up chloroform, bromodichloromethane, dibromochloromethane, and bromoform concentrations.

For years without nitrate measurements in water zones where water source changed over the years, the ground water percentage was used as a weight to back-extrapolate concentrations using linear interpolation, assuming that nitrate levels increased with ground water proportion [9]. In municipalities without any nitrate measurement (covering ~0.5% of the total person-years), we imputed the levels of neighboring municipalities supplied with similar ground water proportion ± 10%.

Individual indices of THM and nitrate exposure in the study population

Average THMs and nitrate levels in residential tap water

We used municipality and year to link levels in drinking water with residential histories of study participants from age 18 years to 2 years before the interview. We estimated the average concentration of nitrate (mg/L) and THMs (µg/L) for this period, henceforth referred to as “lifetime” or “long-term exposure”. In general, participants were assigned to different water zones based on their residence patterns. On average, individuals lived in three different residences during the exposure window period, with the residence at the time of the interview being the longest, spanning approximately 30 years.

Average ingested nitrate

To calculate waterborne ingested nitrate (mg/day), we assigned levels in drinking water by year according to the water type consumed at home, including municipal (tap), bottled, and private well/other water. Nitrate levels in municipal water were assigned for tap water consumption. Nitrate levels in the sampled bottled waters (range 2.3–15.6 mg/l) [17] were averaged using the sales frequency of each brand as a weight, leading to 6.1 mg/L of nitrate, that was assigned to study participants consuming bottled water. Since nitrate levels in well water were not available, waterborne ingested nitrate was considered missing for years when well water consumption was reported (~2%). The annual nitrate estimates were averaged from age 18 to 2 years before the interview and multiplied by the average daily water intake at the residence. Total amount of ingested water was ascertained as the number of water glasses per day consumed on average by the participant at home (L/day, assuming 200 mL/glass). Water intakes = 0 and above the 99th percentile (4 L/day), considered implausible, were treated as missing values in the analyses.

Covariables

We collected information on self-reported age, education, weight and height 1 year before the interview to compute body mass index (BMI, kg/m2), family history of hematologic malignancy, smoking and physical activity. Smokers were defined as those smoking at least one cigarette/day for ≥6 months. Former smokers were defined as those who quit smoking ≥1 year before the interview. Physical activity was ascertained through open questions on any type of physical activity practiced in life, years, and frequency (hours/ week), to calculate metabolic equivalents (METs) from age 16 to 2 years before the interview.

Statistical Analysis

The initial sample consisted of 1,842 participants, comprising 246 CLL cases (92 incident, 154 prevalent) and 1,596 controls. Number of controls was greater than the number of cases because controls were selected to also be matched to other cancer sites. We excluded 1 subject with unreliable interview as qualified by the trained interviewers, and, in order to have a similar geographical distribution of cases and controls, only municipalities with at least one case and one control were included (n = 229 excluded). For the residential tap water analyses, participants with THMs or nitrate estimates covering less than 70% of the years between age 18 to 2 years before the interview were also excluded (n = 238), resulting in a sample of 1374 (144 cases, 1230 controls). For the nitrate waterborne ingestion analyses the sample was made up of 1245 (131 cases, 1114 controls) due to missing data on water consumption in 129 participants. Finally, out of 1612 there were 372 participants without complete data on swimming pool attendance and the sample for these analyses it was made up of 1240 (157 cases, 1803 controls) (Fig. 1).

Fig. 1: Flow chart showing exclusions of study participants from the Multicase-Control Study in Spain (MCC-Spain).
figure 1

Cases were defined as patients with CLL recruited within three years from diagnosis to interview (diagnosed ≤ 3 y prior to recruitment). * From 18 years of age to 2 years before the interview.

As exposure to nitrate in drinking water is exclusively through the oral route, waterborne nitrate ingestion was evaluated as nitrate exposure. However, since inhalation and dermal absorption are suggested as relevant routes for THMs (volatile and permeable to the skin), residential tap water THM levels were used as main exposure. As a supplementary analysis, waterborne THM ingestion was additionally explored. Average lifetime residential tap water of total THMs, brominated THMs, chloroform (μg/L), and waterborne nitrate ingestion (mg/day), were treated as continuous and as tertiles defined according to the distribution among controls.

Our study also encompassed an evaluation of swimming pool attendance. Firstly, we conducted a comparative analysis between individuals classified as ‘users’ (defined as those who have attended the pool ≥10 times in their lifetime) and ‘non-users’ (defined as those who have attended the pool <10 times in their lifetime). Subsequently, we performed a secondary analysis by categorizing participants based on the average annual frequency of pool attendance. Given that over half of the participants fell into the non-user category, the reference category was designated as the group of ‘non-users, while users were further divided based on their attendance frequency, both below and above the median of the yearly average times of attendance. To achieve a finer-grained measurement, we further evaluated each successive 25th percentile increment of yearly pool attendance frequency, distinguishing between summer and rest of the year.

We estimated odds ratios (OR) and 95% confidence intervals (CI) of CLL using mixed models with recruitment area (Barcelona, Asturias, Cantabria) as random effect. To test for linear trends (P-trend) across increasing categories of exposure, the median concentration within each category was treated as a continuous variable in the model. Smoothed spline with three degrees of freedom from general additive models (GAM) were used to visually display the exposure–response relationships on continuous variables.

The selection of adjustment variables was made based on prior knowledge about CLL risk or preventive factors and variables potentially correlated with exposures of interest. Thus, all models were adjusted for recruitment area (introduced as random effect in the mixed model), age (<55, 55–64, 65–69, 70–74, ≥75 y) sex, and education (primary school or lower, secondary school, university). Further adjustment included family history of hematologic malignancy (yes/no), BMI (kg/m2), smoking (never, former, current), physical activity (inactive, low, moderate, very active), ever worked in farming or agriculture (yes/no) and alcohol consumption (never, former, current moderate consumption [(≤20 g/day men; ≤10 g/day women)], current high consumption [>20 g/day men; >10 g/day women]). An additional model was reported with mutual adjustments between nitrate waterborne ingestion, residential tap water chloroform and brominated THMs levels. Concerning the analysis of swimming pool attendance, the second model was not adjusted for physical activity, while the subsequent third model did include adjustments for physical activity. We used stochastic regression (which adds a random error term that appropriately reproduces the correlation between X and Y) to impute missing values in BMI (4%), physical activity (<1%) and alcohol consumption (18%). Treating the missing of the alcohol variable as a separate category had no impact on any estimate.

We explored the stratification by sex and by rai stage (0 and I–IV) and some sensitivity analyzes were further conducted: (i) excluding participants with family history of hematologic malignancy, to completely avoid any genetic influence and (ii) only including incident cases (i.e., CLL diagnosed after recruitment), to minimize the possibility of reverse causation. For this supplementary analysis, exposures were exclusively investigated in their continuous form, as this approach was necessitated by a considerable reduction in the sample size. All p values presented are two-tailed; <0.05 was considered statistically significant. Analyses were performed using STATA version 16.0 (Stata Corp, College Station, TX).

Results

The recruitment area contributing with the largest population was Barcelona. The average age was 64 (11) years old for controls and 66 (10) for CLL cases. On average, cases had more often family history of hematologic malignancy, were less often current smokers, less physically active, worked more often in farming or agriculture, and consumed less alcohol (Table 1). The excluded participants exhibited no discernible differences in comparison to the selected sample (Supplementary Table 1).

Table 1 Characteristics of the study population.

Mean (SD) values for average lifetime residential tap water concentrations of brominated THMs (μg/L) and chloroform (μg/L), and waterborne ingested nitrate (mg/day) were 48.1 (35.6), 18.5 (6.7) and 13.7 (9.6), respectively, in controls; and 72.9 (40.7), 17.9 (5.4) and 14.1 (8.8) in CLL cases. While 58% of the cases exhibited a history of pool attendance over their lifespan, 39% of the control group manifested such attendance. Among users, average annual frequencies were slightly higher in controls than in cases (Table 1). Pool attendance during summer was 31% indoors, 67% outdoors and 2% mixed, while during the rest of the year, 98.5% was indoors, and 1.5% was outdoors (data not in tables).

Trihalomethanes in residential tap water and nitrate waterborne ingestion

The ORs of CLL were 1.24 (95%CI: 1.16, 1.34) and 1.22 (1.14, 1.31) for each 10 μg /L increase of, respectively, total and brominated THMs in residential tap water. Upon performing exposure analysis through tertile categorization, the third tertile showed an OR exceeding 2 for both total and brominated THMs. However, statistical significance was not attained. In contrast, an inverse association was identified between chloroform levels present in residential tap water and CLL, OR of 0.54 (0.34, 0.87) for 10 μg/L chloroform increase (Table 2, Fig. 2). These associations remained consistent and unchanged in both the stratified analyses and the sensitivity analyses (Table 3). However, the ingestion of waterborne THMs (total, brominated and chloroform) was not found to be associated with CLL (Supplementary Table 2).

Table 2 Association of trihalomethanes (THMs) in residential tap water and nitrate waterborne ingestion with chronic lymphocytic leukemia (CLL).
Fig. 2: Exposure–response relationship of trihalomethanes in residential tap water, nitrate waterborne ingestion, and lifetime total swimming pool hours with chronic lymphocytic leukemia (CLL); expressed as odds ratios (ORs) and 95% confidence intervals (CIs).
figure 2

Smoothed spline with three degrees of freedom from general additive models adjusted for region, age, sex, and educational level, family history of hematologic malignancy, body mass index, smoking, physical activity, ever worked in farming or agriculture, alcohol consumption. The dashed lines represent the 95% confidence intervals (CIs).

Table 3 Stratified analyses by chronic lymphocytic leukemia (CLL) rai stage and sex, and sensitivity analyses.

Waterborne ingested nitrate was consistently not associated with CLL. The OR associated for each 5 mg/day increase was 0.91 (0.80, 1.04), and when comparing tertile 3 vs. 1, the OR was 1.04 (0.57, 1.89; P-trend 0.64) (Table 2 and Fig. 2). Null associations for nitrate exposure remained in stratified and sensitivity analyses (Table 3).

Swimming pool attendance

A positive association was observed between ever swimming pool attendance and CLL. The OR of swimming pool users (vs. non-users) was 2.38 (1.61, 3.52). Upon performing annual frequency of attending pools analysis through categorization, the second and third categories showed an ORs of 2.36 (1.49, 3.72) and 2.40 (1.51, 3.83), respectively (P-trend 0.001). For each 25-fold increment in the annual frequency of attending pools, the odds of CLL increased by 5%, OR = 1.05 (0.99, 1.12) (Table 4). This association was sustained both in attendance during the summer and winter months and remained similar in stratified and sensitivity analyses (Table 3).

Table 4 Association of average frequency of pool attendance with chronic lymphocytic leukemia (CLL).

Discussion

Despite the extensive use of chlorination as a cost-effective water disinfection method that has been applied for decades, there is surprising little evidence about its potential risks on human health. To our knowledge, this is the first study to evaluate the relationship between exposure to water contaminants, specifically THMs and nitrate in drinking water and swimming pool attendance, with CLL risk.

Elevated long-term total and brominated THM levels in residential tap water exhibited a positive association with an increased likelihood of developing CLL. Conversely, chloroform demonstrated an inverse association with CLL and no significant association was observed between nitrate and CLL. The absence of an association between THMs waterborne ingestion and CLL, in contrast to the observed association with residential THM levels, which account for different exposure routes, may suggest that inhalation and/or dermal exposures may play a more significant role in elevating the risk of CLL than THM ingestion. Swimming pool attendance was also associated with increased risk of CLL.

Chloroform, bromodichloromethane, dichloroacetic acid, and trichloroacetic acid have been classified as possible human carcinogens (Group 2B) by the International Agency for Research on Cancer [7, 20]. However, evidence of DBP carcinogenicity in animals differs by chemical, animal species, organ, and administration route. Since some DBPs (e.g., THMs) are volatile and skin permeable, exposure occurs through dermal absorption and inhalation when showering, bathing, hand dishwashing, cooking or attending swimming pools [21], in addition to ingestion. Long-term exposure to THMs, as a marker of DBP exposure, has been consistently associated with increased bladder cancer risk in epidemiological studies [22]. DPB exposure has also been linked to colorectal cancer, miscarriage, and birth defects, although evidence is less consistent [23, 24]. Exposure to swimming pool waters has also been related to eye and skin irritation, asthma and respiratory disorders [25, 26].

Drinking water chlorination form chloroform but also brominated THMs in the presence of bromide. In this study, mean levels of brominated THMs in drinking water were twice the levels of chloroform. Our data suggests that exposure to brominated THMs, but not chloroform, increases the risk of CLL, which could be expected given their different mechanisms of action and toxicological potential [26, 27]. Brominated THMs are more cytotoxic than chloroform in addition to being also genotoxic and mutagenic when tested under a wide range of experimental conditions (including human cells) [28, 29]. On the other hand, some animal data have observed that chloroform may have an inhibitory effect on certain tumors [30, 31]. In our sample, chloroform levels showed limited variability in the range where was most of the population. Associations should be cautiously interpreted and remain to be confirmed.

Among the 700 DBP identified, THMs and haloacetic acids are the most prevalent chlorination by-products, and have consequently been used in epidemiological studies as surrogates of total DBPs. However, there are other DBPs more toxic [32] such as haloacetonitriles, haloacetamides, or halonitromethanes [28, 33, 34]. Thus, a possible synergism or additive effect among the different DBPs in drinking water can be suspected and future studies should evaluate more completely the potential effects of combined exposures.

Our results on swimming pool attendance suggest that DBP exposure over the years in swimmers –which occurs through inhalation and dermal absorption, leading to a higher dose and longer duration in the bloodstream relative to oral exposure [8, 35, 36]—might be linked to CLL risk. Swimming pool water frequently exceeds DBP content relative drinking water due to the constant input of organic matter from swimmers and application of disinfectant [37]. The body fluids of the swimmers (such as urine, sweat, hair, skin cells) and personal care products (such as sunscreen and body lotions) together with other contaminants from leaves, dust or rainwater in the case of outdoor pools, react with the disinfectants used to prevent microbial proliferation –mostly chlorine, and to a lesser extend bromine or ozone– and forms unwanted DBPs. Hundreds of DBPs have also been identified in pools, including brominated THMs [38]. The DBP formation in swimming pools depends on several factors such as number of people in the pool, water temperature, and the amount of total organic carbon [39]. DBPs are reported to be significantly higher in the outdoor and recreational pools than in the indoors [40]. Outdoor and recreational pools are generally smaller in size and depth, with a greater influx of users, many of whom are children, which translates into a higher organic load and, therefore, a higher concentration of DBPs [41]. We, however, did not observed a greater risk of CLL associated with attendance to pools during summer (31% reported to be indoors, 67% outdoors and 2% mixed) than during the rest of the year (98.5% was indoors, and 1.5% was outdoors).

In a previous experimental study, in which we collected blood, urine, and exhaled air samples from adult volunteers before and after they swam for 40 min, we observed genotoxicity of swimming pool water and in swimmers. This genotoxicity was derived mainly from brominated THM but not chloroform [42]. The importance of brominated DBPs in the mutagenicity of recreational waters has been highlighted by other studies [43, 44]. Likewise, the presence of nitrogen-containing organic matter from bathers leads to the formation of nitrogenated by-products such as chloramines (mochloramine,dichloramine, trichloramine), haloacetonitriles and carcinogenic nitrosamines, also important drivers of toxicity [45, 46].

The main strengths of our study include the relatively large sample size that allowed us to analyze exposures using an updated and rigorous classification of CLL, and the long-term exposure approach, from 18 years of age to 2 years before the study interview. We were able to collect detailed individual information and to statistically adjust for a number of potential confounding factors. Residual confounding cannot be ruled out. Finally, the multicentric nature of the study allowed a wide geographic variability of THM and nitrate levels in residential tap water.

The limited historical measurements (particularly before 1980) and assumptions to model historical concentrations could reduce the accuracy of exposure estimates. However, to minimize exposure measurement error, we only included subjects with known exposures for at least 70% of the exposure window. Inability to account for exposures outside home and use of domestic filtration systems may have introduced nondifferential misclassification in the waterborne ingested estimates. However, the reported amount of water consumed at work (mean ± SD: 0.2 ± 0.3 L/day) and other places (0.01 ± 0.05 L/day) was smaller than that consumed at home (1.2 ± 0.7 L/day), and minor bias is expected. As for the use of domestic filters, a reduction of THMs levels has been reported [47]. There were no statistics on the use of domestic filters in Spain for the study exposure window, but expert knowledge suggests that the use of household water filters was probably uncommon. Overall, the expected effect on the associations from exposure measurement error is attenuation towards the null [48], as has been shown for other residence-based exposures [49].

In order to minimize potential recall bias in assessing exposure to swimming pools, we focused solely on individual average frequency of pool attendance, distinguishing between summer and the rest of the year. Attempting to obtain more precise information, such as the average hours spent in the pool during in life, would likely be prone to reporting errors. However, despite our efforts, we acknowledge that some degree of recall bias and exposure misclassification may be present. Notwithstanding, it is plausible that any such biases would be nondifferential, thereby tending to bias the association towards a null.

Personal information was collected retrospectively after diagnosis, and differential recall between cases and controls may not be totally ruled out. However, the questionnaire was administered by trained personnel in a face-to-face interview, and there is not an obvious link between the water questions and CLL that could motivate different responses between cases and controls. Moreover, the interviewers rated the quality of the interview and unreliable or inconsistent interviews were excluded from our analyses.

Selection bias arising from control sampling might be of concern since response rates were moderate, especially among controls, partly explained by the population-based source as opposed to hospital-based cases. However, there were no differences in the main characteristics between cases and controls and the probability of participation can be assumed to be independent from the exposure, thus, nondifferential bias is expected, if any. It was not possible to perform analyses by recruitment area due to the limited within area variability, but we conducted mixed models with area as random effect, which adjusts for environmental factors with potential different geographic distribution. Nevertheless, since correlation of these factors with our main exposures is unlikely, the expected impact on the results is minimal. Finally, the inclusion of prevalent cases might be another cause for attention since patients who survived might have a different etiology than those who died rapidly after diagnosis or might have modified their lifestyle, presumably towards healthier-like habits. However, sensitivity analysis suggested that the use of prevalent cases might not have introduced selective survival bias or reverse causality.

Our findings suggest that long-term exposure to brominated THMs in drinking water, as well as swimming pool attendance may have a role in the CLL etiology. Our unprecedented findings are highly relevant since no previous study has explored these associations. However, this is a first piece of evidence that needs to be confirmed in other studies and the biologic basis for the associations observed remains to be elucidated.