Main

Childhood leukaemia is the most common cancer among children in the United States, representing almost one-third of all cancer cases occurring in children under the age of 15 years (Ries et al, 1999). Although the causes of childhood leukaemia are still largely unknown, two major infectious hypotheses have been proposed, the ‘delayed infection’ (Greaves, 2006) and the ‘population mixing’ (Kinlen, 1995) hypotheses.

Studies have examined the association between childhood leukaemia and different proxy measures of child's risk of infection, including the daycare attendance (Ma et al, 2005) and birth order (Greaves, 2001). Seven childhood leukaemia studies have examined paternal occupational social contact as another proxy measure of child's risk of infection and the results have been inconsistent (Roman et al, 1994; Kinlen, 1997; Fear et al, 1999, 2005; Kinlen and Bramald, 2001; Kinlen et al, 2002; Pearce et al, 2004). The current analysis examines the association between childhood acute lymphoblastic leukaemia (ALL) and both paternal and maternal occupational social contact, incorporating the duration and the contact level of each job to determine whether adding the duration as part of the social contact measure will influence any association between parental social contact in the work place and ALL risk.

Materials and methods

Study population

The Northern California Childhood Leukaemia Study (NCCLS) is an ongoing case–control study that began in 1995. The study has been recruiting subjects from nine hospitals in the 35 counties located in the Northern California region, starting with 17 counties in the San Francisco Bay Area (years 1995–1999) and subsequently (since 1999) expanded to include 18 additional counties in the California Central Valley. For this analysis, NCCLS subjects included were recruited from 1995 to 2002 with leukaemia under the age of 15 and ascertained usually within 72 h of diagnosis. For each case, one or two controls matched on age, sex, Hispanic ethnicity, and maternal race were randomly selected based on birth certificates obtained through the California Office of Vital Records. The eligibility criteria for all subjects were (1) being a resident of the study area, (2) aged below 15 years at case diagnosis (reference date for the matched controls), (3) having at least one English- or Spanish-speaking parent or guardian, and (4) without previous cancer. Of the eligible cases, 84% consented to participate in the study and 84% of the eligible controls contacted agreed to participate in the study. Overall, 58% of the controls participated (the number of the enrolled controls divided by the total number of birth control searches excluding the known and presumed ineligibles). A detailed description of control selection has been published previously (Chang et al, 2006). The participating controls of the NCCLS are representative of the sampled population in parental age, parental education, and mother's reproductive history, which indicates a reduced potential for selection bias (Ma et al, 2004).

The study was approved by the University of California Committee for the Protection of Human Subjects, the California Health and Human Services Agency Committee for the Protection of Human Subjects, and the Institutional Review Boards of all the participating hospitals. A written informed consent was obtained from the parents of all participating subjects.

Data collection and management

Data were collected from interviewed parents on employment history (and duration) of each job held after the child's birth up to the age of 3 years for those cases of children diagnosed from age 3 to 14.9 or up to the date of leukaemia diagnosis for those diagnosed under the age of 3 years. The diagnosed dates for the cases were used as the reference dates for exposure assessment for the matched control subjects. Each job was assigned a standard job title according to the 1990 US Census Occupational Classification (US Department of Health and Human Services, 1998). The level of occupational social contact was then assigned by the following two methods: first, the US job titles equivalent to the job titles of the UK 1990 Standard Occupational Classification were classified into three social contact groups (low, medium, and high) based on a previously published classification (Fear et al, 2005) (Supplementary Table 1). If occupations with different social contact levels were reported by the same person, the contact level was based on the job with the highest level. Those parents who did not work during the period of interest were assigned a low-level social contact. Like previous studies, this method does not include information on duration of employment. Second, an index was created by summarising each parent's employment history during the period of interest in the following equation: occupational social contact months=(months of employment with low social contact × 1)+(months of employment with medium social contact × 1.5)+(months of employment with high social contact × 2). In spite of the difference in social contacts between the three job contact categories, the factors for creating social contact months were chosen conservatively to give higher weightage to jobs with higher social contact levels. Sensitivity analyses were performed with different ranges of multiplication factors (see Supplementary Table 5).

A detailed method for classifying the rural/urban status has been published elsewhere (Reynolds et al, 2005). Briefly, each participant was assigned to a US Census block group according to birth residence and then assigned to one of the five urbanisation categories based on US Census 1990 and 2000 block group data for children born before 1 January 1995 and on or after 1 January 1995, respectively. For the purpose of this analysis, a dichotomous rural/urban status variable was created, which combined ‘urban’, ‘suburban’, ‘city’, and ‘small town’ into a more general urban group, while ‘rural’ included unpopulated areas and census-defined places with less than 50 000 people, outside of an urbanised area, with population density in the lowest quartile

Statistical analysis

The present study included only the ALL cases diagnosed between ages 0 and 14.9. Analyses were performed first with all ALL cases and then separately with common ALL (c-ALL, which is defined as ALL with the expression of CD10 and CD19 surface antigens, and diagnosed in children aged 2–5.9 years; the 165 c-ALL cases in this study represent 92% of ALL cases at ages 2–5.9). Conditional logistic regression models were used to estimate the odds ratios.

For analyses with categorical predictor variables, analyses were first performed separately with paternal and maternal occupational social contact data using ‘low’ contact group as the reference with two indicator variables included in the statistical model for ‘medium’ and ‘high’ contact groups. Subsequent analyses were performed with a combined parental contact variable. The low exposure group in the combined parental analysis consisted of children whose both parents had a low contact level, the high exposure group consisted of children with either parent having a high contact job, and the remaining children were categorised as the medium group. For analyses with continuous predictor variables, the index of occupational social contact months was divided into tertiles with the lowest tertile as the reference group for comparison. Additional analyses were performed using contact months as a continuous variable estimating the odds ratio associated with one unit increment of social contact month.

Analyses examining the influence of rural/urban status on the association between parental contact and childhood ALL were performed using conditional logistic regression, first with paternal contact level for comparison with previous studies. Additional analyses were performed with combined parental occupational social contact. Subjects were categorised according to rural/urban status and the level of parental occupational social contact with those living in urban areas and having the lowest level of parental occupational social contact as the reference group. Product terms of rural/urban status and parental contact were included in the conditional logistic regression model. Log-likelihood ratio tests were used to compare the goodness-of-fit of the model with the product terms to the model without product terms.

All analyses were adjusted for household income, birth order, total child-hours spent in day care, and number of other children in the household before the index child attended the first grade. Since adjusting for parental educational level did not change risk estimates by more than 10%, these two variables were not included in the final statistical models. Although different associations were previously reported between daycare attendance and childhood leukaemia for Hispanic and non-Hispanic white children (Ma et al, 2005), statistical testing for interaction between Hispanic status and parental contact months indicated that the association with childhood ALL (P=0.81) or c-ALL (P=0.47) did not differ by Hispanic status; subjects of all race/ethnicity were therefore combined for these analyses.

Because of concern that positive results (Kinlen, 1997; Kinlen and Bramald, 2001; Kinlen et al, 2002) may reflect coding differences of contact level by job titles from those with null results (Fear et al, 1999, 2005), we repeated the same analyses using Kinlen's coding (Kinlen and Bramald, 2001) (Supplementary Table 2).

Results

The demographic characteristics of the study subjects are shown in Table 1. Controls had statistically significant (P<0.05) higher levels of household income and parental education.

Table 1 Characteristics of 294 ALL cases and 376 controls, the Northern California Childhood Leukaemia Study, 1995–2002

Neither the paternal nor the maternal social contact through occupation was associated with either ALL or c-ALL for the analyses with the categorical exposure variable (low, medium, and high occupational social contact) or for the continuous variable (social contact month) (Table 2). Similarly, no significantly elevated risks were observed with either ALL or c-ALL with combined parental occupational social contact.

Table 2 Parental occupational social contact and risk of childhood ALL, the Northern California Childhood Leukaemia Study, 1995–2002

Table 3 presents the association between the paternal occupational social contact and ALL or c-ALL by rural/urban status. Analyses with categorical (low, medium, or high) paternal occupational social contact variable did not show a significant influence of rural/urban status on its association with childhood ALL or c-ALL (P for interaction=0.41 and 0.42, respectively). However, the analyses with social contact months showed a significant positive interaction between social contact months and living in rural areas for ALL (P for interaction=0.04 for tertiles and 0.09 for increment of one occupational social contact month) and for c-ALL (P for interaction=0.007 for tertiles and 0.02 for increment of one occupational social contact month). The same analysis with maternal data did not show any significant interaction between maternal occupational social contact and rural/urban status on the risk of ALL or c-ALL (data not shown).

Table 3 Paternal occupational social contact level and risk of childhood ALL by rural/urban status, the Northern California Childhood Leukaemia Study, 1995–2002

An analysis examining the association between the combined parental occupational social contact and either ALL or c-ALL by rural/urban status showed similar results as those of paternal occupational social contact (Table 4). Analyses with categorical (low, medium, or high) parental occupational social contact variable did not show a significant influence of rural/urban status on its association with childhood ALL or c-ALL (P for interaction=0.55 and 0.86, respectively). However, the analyses with contact months showed a significant positive interaction between contact months and living in rural areas for ALL (P for interaction=0.02 for tertiles and 0.05 for increment of one occupational social contact month) and for c-ALL (P for interaction=0.02 for tertiles and 0.02 for increment of one occupational social contact month).

Table 4 Parental (paternal+maternal) occupational social contact level and risk of childhood ALL by rural/urban status, the Northern California Childhood Leukaemia Study, 1995–2002

Tables 5 and 6 repeat the same analyses presented in Tables 2 and 4, respectively, but stratified by two age groups, ages 0–4.9 and 5–14.9. Though the point estimates were quite unstable due to smaller number of subjects after stratifying by age, the results in the two age groups were similar enough to justify combining the subjects from the two age groups for analysis.

Table 5 Parental occupational social contact and risk of childhood ALL, the Northern California Childhood Leukaemia Study, 1995–2002, ages 0–4 and 5–14
Table 6 Parental (paternal+maternal) occupational social contact level and risk of childhood ALL by rural/urban status, the Northern California Childhood Leukaemia Study, 1995–2002, ages 0–4 and ages 5–14

The same analyses were repeated using coding by Kinlen and Bramald (2001) and the results were similar (Supplementary Tables 3 and 4). Sensitivity analyses were performed to test for the interaction between occupational social contact months and rural/urban status using three other combinations of multiplication factors for creating the index of occupational social contact months (1) 1.0 for both the medium and high social contact occupations; (2) 1.25 and 1.5 for medium and high social contact occupations, respectively; and (3) 2.0 and 3.0 for medium and high social contact occupations, respectively. Regardless of the multiplication factors used, the interaction between occupational social contact months and rural/urban status remained mostly statistically significant (P<0.05), especially for c-ALL (Supplementary Table 5). Interestingly, even when no weighting was assigned (multiplication factor 1.0 for both medium and high social contact occupation), there was still a significant interaction between rural/urban status and the duration of parental occupation on the risk of childhood leukaemia, which further support the importance of including duration as part of the parental occupational social contact measure.

Discussion

Seven published studies have examined parental social contact at work and the risk of childhood leukaemia, and the results have been inconsistent (Roman et al, 1994; Kinlen, 1997; Fear et al, 1999, 2005; Kinlen and Bramald, 2001; Kinlen et al, 2002; Pearce et al, 2004). The inconsistent results could be attributed to several sources. Most of those studies did not include information on maternal occupational contact level, since maternal occupation was not routinely recorded on available birth or death certificates (Kinlen, 1997; Fear et al, 1999, 2005; Kinlen and Bramald, 2001; Kinlen et al, 2002; Pearce et al, 2004). In addition, those studies only had information on father's employment at the following one or two points in time: at birth (Roman et al, 1994; Kinlen, 1997; Kinlen and Bramald, 2001; Kinlen et al, 2002; Pearce et al, 2004; Fear et al, 2005), at the child's death (Fear et al, 1999), or at diagnosis (Roman et al, 1994; Fear et al, 2005); none included information on job duration. This may be important in that the longer the duration the higher the probability of contact with carriers of infectious agents.

Our results suggest that duration of parental occupation may be important when examining the association between parental social contact in the workplace and childhood leukaemia. No statistically significant association between parental contact and either ALL or c-ALL was observed when job duration was not considered, which is consistent with three studies (Roman et al, 1994; Fear et al, 1999, 2005). In contrast, when duration of the occupation was accounted for, the results suggested that those children who live in a rural area have an increased risk of ALL or c-ALL associated with parents, specifically fathers, having occupations of higher social contact and longer duration; these results are consistent with four of the seven previously published studies (Kinlen, 1997; Kinlen and Bramald, 2001; Kinlen et al, 2002; Pearce et al, 2004) and support Kinlen's population mixing hypothesis (Kinlen, 1995). However, our own results (like those of Roman et al, 1994) need to be interpreted with caution because of the small number of subjects in rural areas, and the significant results were mainly attributed to father's exposure.

A major difference between our and Kinlen's studies conducted in Sweden and Scotland is that only 15% (39 cases and 45 controls) of our subjects lived in rural areas according to our definition, whereas Kinlen's studies had much larger numbers in rural areas (316 cases and 924 controls for the Scottish study; 196 cases aged 0–4 years and 784 controls for the Swedish study) (Kinlen and Bramald, 2001; Kinlen et al, 2002). In addition, the most rural counties in the Swedish study have a very low population density of 4 km−2 (Kinlen et al, 2002), whereas in our study, even the county with the lowest density (Glenn County) had a population density of 7.3 km−2 according to the 1990 US Census (US Census Bureau, 1990). Geographical isolation is also relevant to the population mixing hypothesis, and the rural counties in our study may be less geographically isolated than the rural areas in Sweden or Scotland, so that any association with parental occupational contact level would be correspondingly reduced.

Several issues are relevant in interpreting our results they are as follows: (1) a positive association between occupational social contact and childhood ALL or c-ALL in rural areas was observed with paternal but not with maternal exposure. The job contact levels were developed for paternal occupations and may have captured paternal exposures more accurately; however, it is unlikely that fathers and mothers with the same job titles (e.g., teachers and physicians) experience different levels of social contact. Another possible reason is that mothers, in general, have lower level of occupational social contact months (median=36.5 social contact months for control fathers vs median=27 for control mothers; 7.7% of control fathers vs 25.5% of mothers did not work during the period of interest). Finally, one can not rule out chance in the significant interaction between paternal contact months and rural/urban status due to the small number of rural subjects; however, the strength of interaction generally seems stronger for c-ALL, a subtype claimed as more associated with an infectious aetiology (Greaves, 2006), and thus decrease the probability that our results may be spurious. (2) All the fathers may not have lived with their child after birth, such details being available only for subjects recruited after 1999, among whom, 9% (34/378) of the fathers did not live in the same household as the child. The proportions of fathers who did not live in the same household as the child were similar between cases and controls (9.5% for cases and 8.7% for controls, P for χ2=0.80). Even so, they could still have had contact with their child, and thus, may have been an important source of exposure for childhood infections. Sensitivity analysis with subjects recruited after 1999, excluding those fathers who did not live in the same household as the child produced similar results. (3) Misclassification could also be a problem when assigning the level of social contact based on job titles, since not everyone with the same job title necessarily has the same level of social contact.

This study has several strengths, they are as follows: (1) it is one of the only two studies to investigate both paternal and maternal occupational contact level, which may involve less misclassification than using only paternal data; (2) it is the first study to take account of duration of occupation. Studies using parental occupation at a single point can misclassify the contact level, for example, a low contact job held for 12 months will be classified as a lower group than a high contact job for 1 month if based on birth certificate information, whereas by our index of social contact months, the reverse is the case. The concept is that with increasing duration of occupation, exposure to infection is more likely even if the job itself has a lower level of social contact. The index can also incorporate multiple jobs with different contact levels to produce a summary measure, thereby reducing the probability of misclassification. The duration information can accommodate breaks in employment, for example, taking time off work after the birth of their child; and (3) the cases appear to be representative of children in the 35-county study area. A previous comparison of the study with the California Cancer Registry indicated that 88% of childhood leukaemias were included in this study. NCCLS controls selected from birth certificates also appear to be representative of the population of the study area (Ma et al, 2004).

In conclusion, our results indicated that including information of the duration of parental occupation may be important when studying the association between parental occupational social contact and childhood leukaemia.